BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003078
(850 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 999 bits (2584), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 532/875 (60%), Positives = 640/875 (73%), Gaps = 44/875 (5%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDN-VESDSFPNGKHAKEEALNDDIKS 59
MGE+AVC+++ + + + +S TE K+D + L DN E++SFPN K AKE N+DIKS
Sbjct: 1 MGEAAVCVDITTENDNN---TSGTELKRDHQFLIDNDTETESFPNKKQAKES--NEDIKS 55
Query: 60 EVSNPVVSPKEFTSS------LQDITSQETKLVTESCNQV------LSTTSTGNLSSEET 107
EVSNP++SPKE ++ DITSQ T+ + + NQ+ +++T +GN +
Sbjct: 56 EVSNPIISPKENNNNNASSSSWHDITSQPTEELA-TANQLGGVGGEVTSTISGN-----S 109
Query: 108 LSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVA 167
+ +E++ A D +S VS SHVVLEIPKH +S++GIRKITFKFSKRKEDY
Sbjct: 110 CPSSEHSSENNNASICNGDCDS--VSTSHVVLEIPKH-ASTTGIRKITFKFSKRKEDYDT 166
Query: 168 PLAYE----EGRNYTLYD---DLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLS 220
L E R ++ D ++ +G D CA NME+KMSKKV+PN +P+NVKKLLS
Sbjct: 167 RLNQELSPSRSREFSWVDSGTEMPETG--DRYFCAPNMELKMSKKVLPNTFPSNVKKLLS 224
Query: 221 TGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNN 280
TGILDGA VKYIS RE L GI++GGGYLCGCP CNFS+V++A+EFE HAGAKTRHPNN
Sbjct: 225 TGILDGARVKYISPQRE--LYGIIDGGGYLCGCPSCNFSRVLTAYEFELHAGAKTRHPNN 282
Query: 281 HIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDE 340
HIYLENGKPI SIIQELK APLG ++EV+K AGSS NE F+VWKAS H G++ DE
Sbjct: 283 HIYLENGKPICSIIQELKAAPLGAVDEVIKDAAGSSINEEFFQVWKASLHQCNGIIGADE 342
Query: 341 KHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRP--F 398
K LP PHS+ S SS LEES P SFV + + EE KR +RP
Sbjct: 343 KCYSMLPYSPHSLGSYSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRPSSL 402
Query: 399 MHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNK 458
H K+T EGGT++RDNDLHRLLF+PNGLPDG L Y +KGQ++ G KQGNGIVC CC++
Sbjct: 403 SHPKKTNEGGTRRRDNDLHRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCDR 462
Query: 459 EISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGE 518
EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA SLA GQ TTG SDDMC CGDG
Sbjct: 463 EISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGG 522
Query: 519 NLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-GHSSSMSRSV--DLKGGLEAPGAEV 575
+L+ C CP AFH CL +P W CPNC + GH + SRS+ L ++ P EV
Sbjct: 523 DLIFCESCPRAFHLVCLGLKYVPSDVWHCPNCNKFGHGGNFSRSIVIRLTRVVKTPEYEV 582
Query: 576 GGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNR 635
GGCV CR+HDFS TF+DRTVI CDQCE+EFHVGCLR NGLCDLKEIPKD WFC +DCNR
Sbjct: 583 GGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKEIPKDNWFCSNDCNR 642
Query: 636 IHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL 695
I+ ALQ+FVS+ Q IP+ L+ I KH EKG+ DG ND QW++L ++E + SL
Sbjct: 643 IYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWRILMGKSRYQE-DLSL 701
Query: 696 LSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRI 755
LS+A AIFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY V+L VK+VVVSAGLLRI
Sbjct: 702 LSAAAAIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCVLLLVKNVVVSAGLLRI 761
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
FGR+VAELPLVAT RE+QGKG FQALFSCIERLLCSLNV LVLPAAE+AESIWT++FGF
Sbjct: 762 FGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEEAESIWTRRFGF 821
Query: 816 RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
RKM+ E+L +Y R+ QLTIFKGTSMLEK+V + E
Sbjct: 822 RKMTEEQLSQYTRELQLTIFKGTSMLEKEVPLMNE 856
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 988 bits (2555), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/871 (59%), Positives = 628/871 (72%), Gaps = 51/871 (5%)
Query: 1 MGESAVCL------EVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALN 54
MGE CL +V DG+ + E SSR E K+D E + N+E+++ P K KE ALN
Sbjct: 1 MGEGEGCLPVVKDIKVLKDGKTESENSSRMELKRDHECIAGNIETEASPRKKPVKE-ALN 59
Query: 55 DDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD 110
++ SEVSNP++SPK SS+Q ITSQ +L S NQ +++TS+GN S E+LSD
Sbjct: 60 EEGCSEVSNPILSPKYNASSVQTITSQVAELA--STNQAVLGEITSTSSGN-SVPESLSD 116
Query: 111 GDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLA 170
E SR ++ G VS + VVLEIPKHVSS+ GIRKITFKFSK KE Y
Sbjct: 117 ---------EEHSRNGSSDG-VSTTQVVLEIPKHVSST-GIRKITFKFSKSKEAY----- 160
Query: 171 YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVK 230
R + + + A NME+KMSKKVVP YPTNVKKLLSTGILDGA VK
Sbjct: 161 ---NRTNMRVNTCWNLETRNLHFRAPNMELKMSKKVVPKSYPTNVKKLLSTGILDGALVK 217
Query: 231 YISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
YISTSRE++L G++ GYLCGC CNF+KV++A+EFEQHAG +TRHPNNHIYLENGKPI
Sbjct: 218 YISTSREKELQGVIRESGYLCGCSACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPI 277
Query: 291 YSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLP 350
YSIIQ+LKTAPL L+EV+K +AGSS N F+ WKAS H G+ E DE ++ +L + P
Sbjct: 278 YSIIQQLKTAPLSDLDEVIKNIAGSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHP 337
Query: 351 HSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPF-------MHQKR 403
SI+S A+E+S + + +++ MKEM +ERK K+P + K+
Sbjct: 338 QSIVSFPVQAVEDSFTGSRLPL-----KQKELMKEMTQERKHAAKKPSSYIYGSGLQHKK 392
Query: 404 TAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPS 463
++EG KKRDNDLHRLLF+PNGLPDG L Y VKGQR+ G KQGNGIVC C+ E+SPS
Sbjct: 393 SSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPS 452
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFEAHAG AARRQPYRHIYTSNG+TLHDIAISLA GQ TTG SDDMC +CGDG +L+LC
Sbjct: 453 QFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILC 512
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGH--SSSMSRSV--DLKGGLEAPGAEVGGCV 579
+GCP AFH ACL+ +PE WRCP C + ++R + L ++AP +E+GGCV
Sbjct: 513 DGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKVARPIRIQLTRAVKAPESEIGGCV 572
Query: 580 ICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 639
+CR+HDFS + FDDRTV+ CDQCEKEFHVGCLR +GLCDLKE+PKDKWFCCDDC+R+H A
Sbjct: 573 VCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDDCSRVHVA 632
Query: 640 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 699
LQ+ S + IPAS S INRK++EKG L DG +D+QW +L C++E LLS
Sbjct: 633 LQNLASRGPEMIPASVSSMINRKNLEKG-LIDGAADDIQWCILSGKSCYKE-HLPLLSRT 690
Query: 700 TAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGRE 759
TAIFRECFDPI+A GRDLIPVMVYGRNISGQEFGGMY V+L KS VVSAGL+R+FG+E
Sbjct: 691 TAIFRECFDPIVASSGRDLIPVMVYGRNISGQEFGGMYCVVLLAKSTVVSAGLIRVFGQE 750
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
VAELP+VAT +E+QGKG F+ALFSCIE LL SL V+ LVLPAAE+AE+IWT K GF+KMS
Sbjct: 751 VAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNKLGFQKMS 810
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
ER+LKY R+ QLTIFKGTSMLEK+V C+ E
Sbjct: 811 EERMLKYTRELQLTIFKGTSMLEKEVPCIVE 841
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 986 bits (2549), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/899 (57%), Positives = 635/899 (70%), Gaps = 71/899 (7%)
Query: 1 MGESAVCL------EVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALN 54
MGE CL +V DG+ + E SSR E K+D E + N+E+++ P K KE ALN
Sbjct: 20 MGEGEGCLPVVKDIKVLKDGKTESENSSRMELKRDHECIAGNIETEASPRKKPVKE-ALN 78
Query: 55 DDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD 110
++ SEVSNP++SPK SS+Q ITSQ +L S NQ +++TS+GN S E+LSD
Sbjct: 79 EEGCSEVSNPILSPKYNASSVQTITSQVAELA--STNQAVLGEITSTSSGN-SVPESLSD 135
Query: 111 GDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLA 170
E SR ++ G VS + VVLEIPKHV SS+GIRKITFKFSK KE Y + L+
Sbjct: 136 ---------EEHSRNGSSDG-VSTTQVVLEIPKHV-SSTGIRKITFKFSKSKEAYNSKLS 184
Query: 171 YEE-------GRNYTLYDDLGSSGANDGV---------------------LCARNMEIKM 202
E G +++ G G N A NME+KM
Sbjct: 185 SEPLHVLGRVGNSHSYIGYPGDPGRNIASPDTGTNMRVNTCWNLETRNLHFRAPNMELKM 244
Query: 203 SKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVV 262
SKKVVP YPTNVKKLLSTGILDGA VKYISTSRE++L G++ GYLCGC CNF+KV+
Sbjct: 245 SKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKELQGVIRESGYLCGCSACNFTKVL 304
Query: 263 SAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF 322
+A+EFEQHAG +TRHPNNHIYLENGKPIYSIIQ+LKTAPL L+EV+K +AGSS N F
Sbjct: 305 TAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAGSSVNMECF 364
Query: 323 RVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTH 382
+ WKAS H G+ E DE ++ +L + P SI+S A+E+S + + +++
Sbjct: 365 KAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGSRLPL-----KQKEL 419
Query: 383 MKEMLEERKRGVKRPF-------MHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
MKEM +ERK K+P + K+++EG KKRDNDLHRLLF+PNGLPDG L Y
Sbjct: 420 MKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYY 479
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
VKGQR+ G KQGNGIVC C+ E+SPSQFEAHAG AARRQPYRHIYTSNG+TLHDIAIS
Sbjct: 480 VKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAIS 539
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH- 554
LA GQ TTG SDDMC +CGDG +L+LC+GCP AFH ACL+ +PE WRCP C +
Sbjct: 540 LANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFC 599
Query: 555 -SSSMSRS--VDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
++R + L ++AP +E+GGCV+CR+HDFS + FDDRTV+ CDQCEKEFHVGCL
Sbjct: 600 PDRKVARPIRIQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCL 659
Query: 612 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
R +GLCDLKE+PKDKWFCCDDC+R+H ALQ+ S + IPAS S INRK++EKG L D
Sbjct: 660 RDSGLCDLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKG-LID 718
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ 731
G +D+QW +L C++E LLS TAIFRECFDPI+A GRDLIPVMVYGRNISGQ
Sbjct: 719 GAADDIQWCILSGKSCYKE-HLPLLSRTTAIFRECFDPIVASSGRDLIPVMVYGRNISGQ 777
Query: 732 EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCS 791
EFGGMY V+L KS VVSAGL+R+FG+EVAELP+VAT +E+QGKG F+ALFSCIE LL S
Sbjct: 778 EFGGMYCVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSS 837
Query: 792 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
L V+ LVLPAAE+AE+IWT K GF+KMS ER+LKY R+ QLTIFKGTSMLEK+V C+ E
Sbjct: 838 LGVKTLVLPAAEEAEAIWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVPCIVE 896
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 985 bits (2546), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 521/845 (61%), Positives = 610/845 (72%), Gaps = 45/845 (5%)
Query: 33 LTDNVESDSFPNGKHAK-EEALNDDIKSEVSNPV---VSPKEFTSSLQDITSQETKLVTE 88
+ +N + FPN K +K EEA NDD KSEVSNPV VSPK SS DI+ +
Sbjct: 31 VNENETGELFPNKKQSKQEEASNDDTKSEVSNPVRTLVSPKGNGSSSHDISEESPTNACP 90
Query: 89 SCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSS 148
S + L+ + G SS +E + + S ++ VS S VVL+IP+H +S+
Sbjct: 91 SSEETLTVSQEGGGSS----------SEDNTSHQSLRNDTCDSVSMSPVVLKIPEH-AST 139
Query: 149 SGIRKITFKFSKRKEDYVA------PL--AYEEG----RNYTLYDDLGSSGANDGVLCA- 195
+G+RKITFKFSKRKEDY PL ++G RN Y S N
Sbjct: 140 TGVRKITFKFSKRKEDYDTKTSSPHPLHGGIDQGLLYHRNGDYYPRNHSVWVNSCTEMPQ 199
Query: 196 ---RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
R +E+ MSKKVVPN YPTNVKKLL+TGILD A VKYI S ER+LDGI++GGGYLCG
Sbjct: 200 TRERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERELDGIIDGGGYLCG 259
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C C+FSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL +++ V+K V
Sbjct: 260 CSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDV 319
Query: 313 AGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSF 372
AGSS NE FRVWKAS + LV D+K +LP LPHS +S +S AL+ES P S SF
Sbjct: 320 AGSSINEEFFRVWKASLNQSNALVGADKKSYSELPCLPHSHVSYASQALKESFCPISSSF 379
Query: 373 VQDTG--REQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLP 423
+ + +QT+M E K+ KRP + QK+TAE G +KRDNDLHRLLF+P
Sbjct: 380 LYNNNFVSQQTNM-ETSGVNKQTSKRPSFYVPGSATKQKKTAESGVRKRDNDLHRLLFMP 438
Query: 424 NGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
NGLPDG L Y VKGQ++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYT
Sbjct: 439 NGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGMSARRQPYRHIYT 498
Query: 484 SNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
SN +TLHDIAISLA GQ TTG DDMC CGDG +L+ C CP AFHAACLD PE
Sbjct: 499 SNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEG 558
Query: 544 GWRCPNCRQ-GHSSSMSRSV--DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCD 600
W CPNC + GH + +R + L ++ P +VGGC +CR+HDFS TFDDRTVI CD
Sbjct: 559 AWHCPNCNKLGHGGNFARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVILCD 618
Query: 601 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 660
QCEKEFHVGCLR++GLCDLKEIPKD WFCC DCN I+ AL++ VS QTIP S L+TIN
Sbjct: 619 QCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPVSLLNTIN 678
Query: 661 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIP 720
RKH+EKG+L D DVQWQ+L + ++ SLLS A AIFRECFDPI+A+ GRDLIP
Sbjct: 679 RKHVEKGLLVDEAAYDVQWQILM-GKSRNREDLSLLSGAAAIFRECFDPIVAKTGRDLIP 737
Query: 721 VMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
VMVYGRNISGQEFGGMY V+LTV+ VVVSAGLLRIFGREVAELPLVAT RE+QGKG FQA
Sbjct: 738 VMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGKGYFQA 797
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 840
LFSCIERLLCSLNVE LVLPAAE+AESIWT++FGFRKMS +LLKY R+FQLTIFKGTSM
Sbjct: 798 LFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFGFRKMSEGQLLKYTREFQLTIFKGTSM 857
Query: 841 LEKKV 845
LEK+V
Sbjct: 858 LEKEV 862
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 957 bits (2473), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/887 (57%), Positives = 616/887 (69%), Gaps = 80/887 (9%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE VC+ +G+ + SRTE K+D + + E PN K AKE + ND+++SE
Sbjct: 1 MGEEVVCVHALEEGKQENNEESRTELKRDYDQCVADTEPHLSPNKKQAKEVS-NDEVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD--GDER 114
VSNP VS E + QDI+SQ T+ +E+ N L++T N SS+ETLSD G++
Sbjct: 60 VSNPNVSAAELAQTFQDISSQPTE--SENVNHAECGELTSTCLENSSSDETLSDEAGEQN 117
Query: 115 TESSYAETSRTDNNSGDVS-KSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVA------ 167
++ TS++D ++ + S VV+EIPKH +SSSGIRKITFKFSK+KEDY
Sbjct: 118 NNNNNNNTSQSDKDTSSAAMTSRVVMEIPKH-ASSSGIRKITFKFSKKKEDYDYQPPPPM 176
Query: 168 --PLAYEEG---------RNYTLYDD-----------LGSSGANDGVLCARNMEIKMSKK 205
P Y +G Y DD +G D L RNME+KMSKK
Sbjct: 177 HHPALYNDGNYIGFHGDDEEYLARDDCSGGSLESPCGMGYVRDGDLDLYTRNMELKMSKK 236
Query: 206 VVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAH 265
VVPN YPTNVKKLLSTGILDGA VKYI + +L GI++GGGYLCGC +CN+S+V+SA+
Sbjct: 237 VVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQGIIDGGGYLCGCSMCNYSRVLSAY 296
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
EFEQHAGAKTRHPNNHI+LENG+PIYSIIQE+KTAPL +L+EV+K VAGSS NE SF+ W
Sbjct: 297 EFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIKNVAGSSVNEESFQAW 356
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKE 385
K S G V+ + ++ KL +PH+ +I P+S + ++G
Sbjct: 357 KESLLQSNGKVQAHKSYSTKLVGMPHT-----------NIRPSS--YTSNSG-------- 395
Query: 386 MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGC 445
+ QKR+A+G TK+RDNDLHRLLF+PNGLPDG L Y VKGQ+L G
Sbjct: 396 -------------VLQKRSADGCTKRRDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGY 442
Query: 446 KQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG 505
KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA+SLA GQ TTG
Sbjct: 443 KQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIALSLANGQNLTTG 502
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC----RQGHSSSMSRS 561
SDDMC VCGDG +L+LCNGCP AFHAACL +P+SGW+C NC G SS+ R
Sbjct: 503 DSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPDSGWQCLNCIDNAGNGRESSIVRP 562
Query: 562 VDLKGGL--EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ ++ + P E+GGCV+CR HDFS A FD+RTVI CDQCEKE+HVGCLR GLC+L
Sbjct: 563 IMIRLTRVDKTPEVEMGGCVVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCEL 622
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 679
+E+PKDKWFCCDDCNRI+AALQ+ VS A+ IPAS I RKH +KG+ G MND+QW
Sbjct: 623 EELPKDKWFCCDDCNRIYAALQNSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMNDIQW 682
Query: 680 QMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSV 739
++L + E LLS A AIFRECFDPI+A GRDLIPVMVYGRNISGQEFGGMY +
Sbjct: 683 RILSGKSRYPE-HLPLLSRAAAIFRECFDPIVAISGRDLIPVMVYGRNISGQEFGGMYCI 741
Query: 740 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 799
+L V VVVSAGLLRIFGR VAELPLVAT R +QGKG FQ LFSCIERLL SLNVE LVL
Sbjct: 742 VLIVNYVVVSAGLLRIFGRNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVL 801
Query: 800 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
PAA AESIWTKK GFRKMS ++L K+ R+ QLT+F TSMLEK VQ
Sbjct: 802 PAAGDAESIWTKKLGFRKMSEDQLSKHLREVQLTLFNKTSMLEKTVQ 848
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 956 bits (2470), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/884 (57%), Positives = 620/884 (70%), Gaps = 84/884 (9%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE A L + + E +S TE K+D + L ++ E +S N K AKE + N+D++SE
Sbjct: 1 MGEEAAGLAAVTNETLGKENASSTELKRDHQCLDEDTEPESLHNKKQAKEVS-NEDVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSDGDERTE 116
VSNPVVSPKE + DITSQ ++ E+ QV L++ +GN SSE+ +S G R +
Sbjct: 60 VSNPVVSPKE--NHFHDITSQPEEV--ENTTQVERGELTSACSGNSSSED-ISSGGVRCQ 114
Query: 117 SSYAETSRTDNNSGDVSK-SHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGR 175
+ + S+ D + DV++ S VV+EIPKH +SS+GIRKITFKFSK+K + A ++ ++
Sbjct: 115 N---DMSQNDVDMCDVNEVSRVVIEIPKH-ASSTGIRKITFKFSKKKGNNGASVSADKVH 170
Query: 176 NY-----------TLYDDL----------GSSGANDGVLCARNMEIKMSKKVVPNEYPTN 214
+Y +L DD GS+ ++ L ME+KMSKKV+PN YP+N
Sbjct: 171 SYGNSDRDGKPEPSLLDDACTETSAHSCEGSAESSRYSLGPNKMELKMSKKVLPNNYPSN 230
Query: 215 VKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAK 274
VKKLLSTGILDGA VKY+ST+ E +L GI+NGGGY+CGC CNF+ ++SA+EFEQHAG K
Sbjct: 231 VKKLLSTGILDGARVKYVSTTSEMKLQGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFK 290
Query: 275 TRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKG 334
TRHPNNHIYLENG+PIYS+IQE+K+APL IL+EV+ +VAGSS N SF WKAS H
Sbjct: 291 TRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSSVNMNSFEAWKASFHQDSA 350
Query: 335 LVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGV 394
+ E H++KLP L H + E +P + V
Sbjct: 351 NIVV-ENHDVKLPKLSHPV---------ERPNPNFSNAV--------------------- 379
Query: 395 KRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCD 454
+ K+TAE GTK+RDNDLHRLLF+PNGLPDG L Y VKGQR+ G KQGNGI+C
Sbjct: 380 ----LQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCS 435
Query: 455 CCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC 514
CN+EISPSQFEAHAGMAARRQPYRHIYT+NG+TLHDIAISLA GQ+ TTG SDDMC C
Sbjct: 436 HCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAAC 495
Query: 515 GDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ----------GHSSSMSRSV-- 562
G+G +L+ C+ CP A+H CL +PE W CPNCR G S S S+ +
Sbjct: 496 GNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDKVGSNSKAISGGSLSFSKPIVF 555
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
L ++AP E+GGCV+CR HDFSAA FDDRTV+ CDQCE+EFHVGCLR +GLCDLKE+
Sbjct: 556 RLTRVVKAPEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKEL 615
Query: 623 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQML 682
PKDKWFCCD+C+ IH ALQ+ V N AQ IP S I RKH+ KG+L D +NDV+WQ+L
Sbjct: 616 PKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQIL 675
Query: 683 KKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILT 742
F E + LS ATAIFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY V+L
Sbjct: 676 SGKSRFPE-DLPFLSRATAIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCVVLI 734
Query: 743 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 802
V+S+VVSAGLLRIFGREVAELP+VAT RE+QGKG FQ LFSCIERLL SLNV+NLVLPAA
Sbjct: 735 VRSIVVSAGLLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAA 794
Query: 803 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
E AESIWTKK GFRKMS E+L+KY R+ QLTIF GTSMLEK V+
Sbjct: 795 EDAESIWTKKLGFRKMSEEQLIKYMREVQLTIFNGTSMLEKVVE 838
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 952 bits (2462), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 512/890 (57%), Positives = 617/890 (69%), Gaps = 83/890 (9%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE A C+ +G+ + SRTE K+D + + E + PN K AKE + ND+++SE
Sbjct: 1 MGEEADCVHALEEGKKENNEESRTELKRDYDQCVADTEPNVSPNKKQAKEVS-NDEVRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQV----LSTTSTGNLSSEETLSD-----G 111
VSNP VS E + QDI+SQ T+ +E+ N L++T N SS+ETLSD
Sbjct: 60 VSNPNVSAAEHALTFQDISSQPTE--SENVNHAECGELTSTCLENSSSDETLSDEAGEHN 117
Query: 112 DERTESSYAETSRTDNNSGDVS-KSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVA--- 167
+ ++ TS++D ++G + S VV+EIPKHVSSS GIRKITFKFSK+KEDY
Sbjct: 118 NNNNNNNNNNTSQSDKDTGSAAMTSCVVMEIPKHVSSS-GIRKITFKFSKKKEDYDYQPP 176
Query: 168 -----PLAYEEGRNYTLYDD--------------------LGSSGANDGVLCARNMEIKM 202
P Y +G + + D +G D L RNME+KM
Sbjct: 177 PAVHHPALYNDGNHIGFHGDDEEYLARDDCSGGSLESPCGMGYVHDGDLDLYTRNMELKM 236
Query: 203 SKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVV 262
SKKVVPN YPTNVKKLLSTGILDGA VKYI + +L GI++GGGYLCGC +CN+S+V+
Sbjct: 237 SKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQGIIDGGGYLCGCSMCNYSRVL 296
Query: 263 SAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF 322
SA+EFEQHAGAKTRHPNNHI+LENG+PIYSIIQE+KTAPL IL+EV+K VAGSS NE SF
Sbjct: 297 SAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIKNVAGSSVNEESF 356
Query: 323 RVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTH 382
+ WK S G V+ + ++ KL +PH+ +I P+S + +TG
Sbjct: 357 QAWKESLLQSNGKVQAHKSYSTKLVGMPHT-----------NIRPSS--YTSNTG----- 398
Query: 383 MKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLR 442
+ QKR+A+G TK+RDNDLHRLLF+PNGLPDG L Y VKGQ+L
Sbjct: 399 ----------------VLQKRSADGCTKRRDNDLHRLLFMPNGLPDGAELAYYVKGQKLL 442
Query: 443 FGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR 502
G KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIYTSNG+TLHDIA+SLA GQ
Sbjct: 443 GGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYTSNGLTLHDIALSLANGQNL 502
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCR----QGHSSSM 558
TTG SDDMC VCGDG +L+LCNGCP AFHAACL +P+SGW+C NCR G SS+
Sbjct: 503 TTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPDSGWQCLNCRDNAGNGRESSI 562
Query: 559 SRSVDLKGGL--EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
R + ++ + P E+GGCV+CR HDFS A FD+RTVI CDQCEKE+HVGCLR GL
Sbjct: 563 VRPIMIRLTRVDKTPEFEMGGCVVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGL 622
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND 676
C+L+E+PKDKWFCCDDCNRI+ ALQ+ V+ A+ IPAS I RKH +KG+ G MND
Sbjct: 623 CELEELPKDKWFCCDDCNRIYVALQNSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMND 682
Query: 677 VQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGM 736
+QW++L + E LLS A AIFRECFDPI+A GRDLIPVMVYGRNISGQEFGGM
Sbjct: 683 IQWRILSGKSRYPE-HLPLLSRAAAIFRECFDPIVAISGRDLIPVMVYGRNISGQEFGGM 741
Query: 737 YSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVEN 796
Y ++L V SVVVSAGLLRIFGR VAELPLVAT R +QGKG FQ LFSCIERLL SLNVE
Sbjct: 742 YCIVLIVNSVVVSAGLLRIFGRNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEK 801
Query: 797 LVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
LVLPAA AESIWTKK GFRKMS ++L K+ R+ QLT+F TSMLEK VQ
Sbjct: 802 LVLPAAGDAESIWTKKLGFRKMSEDQLSKHLREVQLTLFNKTSMLEKTVQ 851
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 934 bits (2413), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/904 (56%), Positives = 609/904 (67%), Gaps = 68/904 (7%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE AVC++ DG+ + SR E K+D + E++ PN K AKE + ND+++SE
Sbjct: 1 MGEEAVCVQELVDGKTENTDESRLELKRDYNQCVADTEANVPPNKKQAKEVS-NDELRSE 59
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTG--NLSSEETLSD-GDERTES 117
V+NP VS E + DI+SQ T+ S + TSTG N SS +T+SD R
Sbjct: 60 VTNPNVSATEHAQTFHDISSQPTESENVSHAECGELTSTGLENSSSHDTVSDEAGVRNND 119
Query: 118 SYAETSRTDNNSGDVSK---SHVVLEIPKHVSSSSGIRKITFKFSKRKEDY---VAPLAY 171
S + N+ G S S VV+EIPKH +SS+GIRKITFKFSKRKEDY P Y
Sbjct: 120 SDNINNLCQNDKGTSSNDAVSRVVMEIPKH-ASSTGIRKITFKFSKRKEDYDDYQTPTGY 178
Query: 172 EEGRN------------------------YTLYDDLGSSG------------ANDGVLCA 195
+G Y DD ++G D L +
Sbjct: 179 TDGSGSDYGFGYGNGSGYGYGYGYHGDDEYLANDDYNNNGLVESSYGRGYVPYEDSELYS 238
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL 255
NME+KMSKKVVPN +P NVKKLLSTGILDGA VKYI + +LDGI+ GGYLCGC +
Sbjct: 239 GNMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVELDGIIGDGGYLCGCSM 298
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C++S+V+SA+EFEQHAGAKTRHPNNHI+LENGKPIYSII E+KTA +EV+K VAGS
Sbjct: 299 CSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKNVAGS 358
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
S NEGSF+VWK S V +K++ K +PH+ S S + S V++
Sbjct: 359 SINEGSFQVWKESLLQSNKKVPTQKKYSTKSTGIPHTYNSQSIESASSFSSLR----VRN 414
Query: 376 TGREQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+Q ++ + +E KR VK+P + QKR+A+G TKKRDNDLHRLLF+PNGLPD
Sbjct: 415 HFEQQMYVNQTADEWKRVVKKPSTYTYYSGIPQKRSADGCTKKRDNDLHRLLFMPNGLPD 474
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G L Y VKGQ+L G KQGNGIVC CC+ EISPSQFEAHAGMAARRQPYRHIY SNG+T
Sbjct: 475 GAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIYASNGLT 534
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LHDIA+SLA GQ TTG SDDMC VCGDG +L+LCNGCP AFHAACL +PESGW C
Sbjct: 535 LHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLHSVPESGWHCL 594
Query: 549 NC------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQC 602
NC +G M R + + P EVGGCV+CR++DFS FDDRTVI CDQC
Sbjct: 595 NCEDNTGDERGARPIMIRLTRVD---KEPEYEVGGCVVCRANDFSVDKFDDRTVIICDQC 651
Query: 603 EKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRK 662
EKE+HVGCLR GLC+L+E+PKDKWFCCDDCNRI+ ALQ+ VS A TIP+S I RK
Sbjct: 652 EKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSVSAGADTIPSSLSELIIRK 711
Query: 663 HIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVM 722
H ++G+ G MND+QW++L + E LLS A AIFRECFDPI+A GRDLIPVM
Sbjct: 712 HEDRGLCTYGDMNDIQWRILSGKSRYAE-HLPLLSRAAAIFRECFDPIVAISGRDLIPVM 770
Query: 723 VYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
VYGRNISGQEFGGMY ++L V S+VVSAGLLRIFGR +AELPLVAT RE+QGKG FQALF
Sbjct: 771 VYGRNISGQEFGGMYCIVLIVNSIVVSAGLLRIFGRNIAELPLVATSREHQGKGYFQALF 830
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
SCIERLL SLNVE LVLPAA AESIWTKK GF KMS ++L K+ ++ QLT+F TS+LE
Sbjct: 831 SCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFHKMSEDQLTKHLKEVQLTLFNKTSVLE 890
Query: 843 KKVQ 846
K VQ
Sbjct: 891 KMVQ 894
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 887 bits (2292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/891 (54%), Positives = 598/891 (67%), Gaps = 92/891 (10%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVESDSFPNGKHAKEEALNDDIKSE 60
MGE +CLE+P + E SS + K RE L ++ + D FPN K AKE A NDDI SE
Sbjct: 1 MGEGTICLEMPME-----ENSSMGQFK--RERLDESNKGDHFPNKKQAKE-ASNDDITSE 52
Query: 61 VSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYA 120
+SNPV SP E TS +D++SQ K C S + SEET+SD ++S A
Sbjct: 53 ISNPVASPVESTSLFRDVSSQPVKSGLGEC-------SGSDFGSEETVSD-----DASVA 100
Query: 121 ETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLY 180
+S+T+ +S DV S VLEIPKH+SS+ GI KITFK SK K+++
Sbjct: 101 GSSQTEQSS-DVLPSRFVLEIPKHLSST-GITKITFKLSKPKKEF--------------- 143
Query: 181 DDL---GSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
DDL + GV +KM KK+V YP+NVKKLL TGIL+GA VKYIST
Sbjct: 144 DDLPVIKDHTCDVGV-------VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPV 196
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+SA+EFE HAGAKTRHPNNHI+LENG+ +Y+I+QEL
Sbjct: 197 RELQGIIHSGGYLCGCTTCSFSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQEL 256
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKA-----------------SHHLRKGLVEHDE 340
KTAP +LEEV++ VAGS+ NE + WK S ++ ++ D
Sbjct: 257 KTAPRDVLEEVIRNVAGSALNEEGLQAWKGDGQQVSNRVIACLTGTISRNIPLSVIYLDP 316
Query: 341 --KHNMKLPSLPHSIISCSSLA------LEESISPTSCSFVQDTGREQTHMKEMLEERKR 392
++ L S S SCS L L+ES S T CS E+T+ K+ L+E KR
Sbjct: 317 VVTYHFVLLSETDSCESCSYLGSGTGPGLDESQSLTPCSVENHYFPEKTYAKDTLDEPKR 376
Query: 393 GVKRPFMH------QKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCK 446
K+ H K+ +EG +KRDNDLHRLLF+PNGLPDG L Y VK Q+L G K
Sbjct: 377 IAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYK 436
Query: 447 QGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG 506
QG+GIVC CC++EISPSQFEAHAGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG
Sbjct: 437 QGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGD 496
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLK 565
SDDMC +CGDG +LLLC GCP AFH ACL +PE W C +C G SS + + D
Sbjct: 497 SDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSVPEGTWYCSSCNDGPISSKKATATDPS 556
Query: 566 GG-----------LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
G ++AP +E+GGCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+N
Sbjct: 557 GNARPIVIRLSRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLREN 616
Query: 615 GLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTM 674
GLCDLKEIP++KWFCC DC+RIH A+Q+ VS QTIP L I RK EKGI D
Sbjct: 617 GLCDLKEIPQEKWFCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNG- 675
Query: 675 NDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFG 734
+ V+W++L + E LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFG
Sbjct: 676 DIVEWRILSGKSRYPE-HLPLLSRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFG 734
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
GMY ++L V S+VVSA LLRIFG++VAELP+VAT REYQG+G FQ L++C+E LL SLNV
Sbjct: 735 GMYCLVLIVNSLVVSAALLRIFGQQVAELPIVATSREYQGRGYFQGLYACVENLLSSLNV 794
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
ENLVLPAAE+AESIWTKKFGF KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 795 ENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 845
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 885 bits (2286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/869 (55%), Positives = 589/869 (67%), Gaps = 78/869 (8%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELLTDNVE----SDSFPNGKHAKEEALNDD 56
MGE +CLE+P + + K+DR L D+ + D FP+ K AKE A NDD
Sbjct: 1 MGEGTICLEMPKE--------ENGQLKRDR--LDDDTDEGNKGDHFPSKKQAKE-ASNDD 49
Query: 57 IKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTE 116
I SE+SNPV SP E TS +D++SQ K C S + SEET+SD +
Sbjct: 50 ITSEISNPVASPVESTSLFRDVSSQPVKSGLVEC-------SGSDFGSEETVSD-----D 97
Query: 117 SSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYV-APLAYEEGR 175
+S +S+T+ +S DV S VLEIPKH+SS+ GI KITFK SK K+++ PL
Sbjct: 98 ASVVGSSQTEQSS-DVLPSRFVLEIPKHLSST-GITKITFKLSKPKKEFDDLPL------ 149
Query: 176 NYTLYDDLGSSGANDGVLCARNMEIKM-SKKVVPNEYPTNVKKLLSTGILDGACVKYIST 234
+ D +G +KM KK+V YP+NVKKLL TGIL+GA VKYIST
Sbjct: 150 ---IKDHTWDAGV-----------VKMPKKKIVSLSYPSNVKKLLETGILEGARVKYIST 195
Query: 235 SRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSII 294
RQL GI++ GGYLCGC CNFSKV+SA+EFEQHAGAKTRHPNNHI+LEN + +Y+I+
Sbjct: 196 PPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIV 255
Query: 295 QELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSII 354
QELKTAP +LEEV++ VAGS+ NE R WKAS + + + + HS +
Sbjct: 256 QELKTAPRVVLEEVIRNVAGSALNEEGLRAWKASFQQSNSMSDRNYITD-------HSTV 308
Query: 355 SCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAEGG 408
S L+ES S T CS E+T+ K+ L+E KR K+ H K+ +EG
Sbjct: 309 SYLGPGLDESQSLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGS 368
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
+KRDNDLHRLLF+PNGLPDG L Y VK Q+L G KQG+GIVC CC++EISPSQFEAH
Sbjct: 369 NRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAH 428
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
AGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC GCP
Sbjct: 429 AGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQ 488
Query: 529 AFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPGAEVG 576
AFH ACL +PE W C +C G SS + + D G ++AP +++G
Sbjct: 489 AFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIG 548
Query: 577 GCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 636
GCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+NG CDLKEIP++KWFCC +C+RI
Sbjct: 549 GCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRI 608
Query: 637 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 696
H A+Q+ VS QT+P L I RK EKGI D + V+W++L + E LL
Sbjct: 609 HTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTD-IGDTVEWRILSGKSRYPE-HLPLL 666
Query: 697 SSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIF 756
S A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA LLRIF
Sbjct: 667 SRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIF 726
Query: 757 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 816
G+EVAELP+VAT REYQG+G FQ L++C+E LL SLNVENLVLPAAE+AESIWTKKFGF
Sbjct: 727 GQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFT 786
Query: 817 KMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 787 KMSDQQLQEYQKEVQLTIFKGTSMLEKKV 815
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 837 bits (2161), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/875 (52%), Positives = 571/875 (65%), Gaps = 64/875 (7%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELL-TDNV-ESDSFPNGKHAKEEALNDDIK 58
MGE+ VC ++ + K+DR DN+ E + + + K +E NDD+K
Sbjct: 1 MGEATVCF-------AKENSETKKDLKRDRLCFEQDNLDEEELYSSNKRQTKEPSNDDMK 53
Query: 59 SEVSNPVVSPK-EFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTES 117
SE+SNPV SP + SS +DITS K + S ++V S + S ET++D
Sbjct: 54 SEISNPVPSPVVDNASSFRDITSNPAK--SSSGDRVGSCSG-----SYETITD------E 100
Query: 118 SYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNY 177
++E + NS D S EIPKH+S++ GI KITFK SKR ED+ +E
Sbjct: 101 KHSEYCSSLANS-DAVPSSFEREIPKHLSTT-GITKITFKLSKRNEDFCDLPMIQEHTWE 158
Query: 178 TLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
++ SS + +KM KK+ + +NVKKLL TGILDGA VKY+STS
Sbjct: 159 GYPSNVASS----------TLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLSTSAA 208
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+ A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL
Sbjct: 209 RELQGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQEL 268
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISC- 356
+ AP +LEEV++KVAGS+ +E F+ WK S K + E D H M HS S
Sbjct: 269 RIAPPDVLEEVIRKVAGSALSEEGFQAWKGSFQQDKNMTEDDSNHIMD-----HSFQSLV 323
Query: 357 ----SSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAE 406
S +L+ES S T C + RE+ K+ K K+ H K+ +
Sbjct: 324 SYPGSGWSLDESQSSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHKKVSG 383
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFE 466
GG KRDNDLHRLLFLPNGLPDG L Y VK Q+L G KQG+GIVC CC+ +ISPSQFE
Sbjct: 384 GGKWKRDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFE 443
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLA-MGQRRTTGGSDDMCHVCGDGENLLLCNG 525
AHAGMA RRQPYR I+ S+G++LHDIA+SLA G TTG SDDMC +CG+G +LLLC G
Sbjct: 444 AHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAG 503
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS----------VDLKGGLEAPGAEV 575
CP AFH ACL +PE W C +C G +S + + L ++AP +E+
Sbjct: 504 CPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASDPNLKPIVIRLTRVVKAPESEI 563
Query: 576 GGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNR 635
GGCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+N LCDLK IP+DKWFCC DC+R
Sbjct: 564 GGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCCSDCSR 623
Query: 636 IHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL 695
IH LQ S QTIP L TI+RK+ EKGI D N V+W+ML + E L
Sbjct: 624 IHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNG-NTVEWRMLSGKSRYPE-HLPL 681
Query: 696 LSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRI 755
LS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA LLRI
Sbjct: 682 LSRAATIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRI 741
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
FG++VAELP+VAT REYQG+G FQ LF+C+E LL SLNVENL+LPAAE+AESIWT KFGF
Sbjct: 742 FGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTNKFGF 801
Query: 816 RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
KM+ RL +YQR+ QLTIFKGTSMLEKKV E
Sbjct: 802 TKMTEHRLQRYQREVQLTIFKGTSMLEKKVPSFSE 836
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 826 bits (2134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/858 (53%), Positives = 564/858 (65%), Gaps = 99/858 (11%)
Query: 30 RELLTDNVE----SDSFPNGKHAKEEALNDDIKSEVSNPVVSPKEFTSSLQDITSQETKL 85
R+ L D+ + D FP+ K AKE A NDDI SE+SNPV SP E TS +D++SQ K
Sbjct: 11 RDRLDDDTDEGNKGDHFPSKKQAKE-ASNDDITSEISNPVASPVESTSLFRDVSSQPVKS 69
Query: 86 VTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHV 145
C S + SEET+SD ++S +S+T+ +S DV S VLEIPKH+
Sbjct: 70 GLVEC-------SGSDFGSEETVSD-----DASVVGSSQTEQSS-DVLPSRFVLEIPKHL 116
Query: 146 SSSSGIRKITFKFSKRKEDYV-APLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKM-S 203
SS+ GI KITFK SK K+++ PL + D +G +KM
Sbjct: 117 SST-GITKITFKLSKPKKEFDDLPL---------IKDHTWDAGV-----------VKMPK 155
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK+V YP+NVKKLL TGIL+GA VKYIST RQL GI++ GGYLCGC CNFSKV+S
Sbjct: 156 KKIVSLSYPSNVKKLLETGILEGARVKYISTPPVRQLLGIIHSGGYLCGCTTCNFSKVLS 215
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
A+EFEQHAGAKTRHPNNHI+LEN + +Y+I+QELKTAP +LEEV++ VAGS+ NE R
Sbjct: 216 AYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSALNEEGLR 275
Query: 324 VWKA-----SHHLRKGL-----------VEHDEKHNMKLPSLPHSIISCSSL--ALEESI 365
WK S+ + L V ++ L S S SCS L L+ES
Sbjct: 276 AWKGDGQQVSNRVIACLTGIILRIIPLSVNALLSYHFVLLSETDSCESCSYLGPGLDESQ 335
Query: 366 SPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAEGGTKKRDNDLHRL 419
S T CS E+T+ K+ L+E KR K+ H K+ +EG +KRDNDLHRL
Sbjct: 336 SLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRL 395
Query: 420 LFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYR 479
LF+PNGLPDG L Y VK Q ISPSQFEAHAGMAARRQPYR
Sbjct: 396 LFMPNGLPDGTELAYYVKTQ--------------------ISPSQFEAHAGMAARRQPYR 435
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
HI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC GCP AFH ACL
Sbjct: 436 HIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQS 495
Query: 540 IPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPGAEVGGCVICRSHDFS 587
+PE W C +C G SS + + D G ++AP +++GGCV CRSHDFS
Sbjct: 496 MPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIGGCVFCRSHDFS 555
Query: 588 AATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNR 647
FDDRTVI CDQCEKE+HVGCLR+NG CDLKEIP++KWFCC +C+RIH A+Q+ VS
Sbjct: 556 IGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRIHTAVQNSVSCG 615
Query: 648 AQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECF 707
QT+P L I RK EKGI D + V+W++L + E LLS A IFRECF
Sbjct: 616 PQTLPTPLLDMICRKDREKGIFTD-IGDTVEWRILSGKSRYPE-HLPLLSRAAVIFRECF 673
Query: 708 DPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 767
DPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA LLRIFG+EVAELP+VA
Sbjct: 674 DPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQEVAELPIVA 733
Query: 768 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ 827
T REYQG+G FQ L++C+E LL SLNVENLVLPAAE+AESIWTKKFGF KMS ++L +YQ
Sbjct: 734 TSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQ 793
Query: 828 RDFQLTIFKGTSMLEKKV 845
++ QLTIFKGTSMLEKKV
Sbjct: 794 KEVQLTIFKGTSMLEKKV 811
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 812 bits (2097), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/873 (52%), Positives = 562/873 (64%), Gaps = 62/873 (7%)
Query: 1 MGESAVCLEVPADGEMDVECSSRTESKQDRELL-TDNV-ESDSFPNGKHAKEEALNDDIK 58
MGE+ VC ++ + K+DR DN+ E + + + K +E NDD+K
Sbjct: 1 MGEATVCF-------AKENSETKKDLKRDRLCFEQDNLDEEELYSSNKRQTKEPSNDDMK 53
Query: 59 SEVSNPVVSPK-EFTSSLQDITSQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTES 117
SE+SNPV SP + SS +DITS K + S ++V S + S ET++D
Sbjct: 54 SEISNPVPSPVVDNASSFRDITSNPAK--SSSGDRVGSCSG-----SYETITD------E 100
Query: 118 SYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNY 177
++E + NS D S EIPKH+S++ GI KITFK SKR ED+ +E
Sbjct: 101 KHSEYCSSLANS-DAVPSSFEREIPKHLSTT-GITKITFKLSKRNEDFCDLPMIQEHTWE 158
Query: 178 TLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE 237
++ SS + +KM KK+ + +NVKKLL TGILDGA VKY+STS
Sbjct: 159 GYPSNVASS----------TLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLSTSAA 208
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R+L GI++ GGYLCGC C+FSKV+ A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL
Sbjct: 209 RELQGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQEL 268
Query: 298 KTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISC- 356
+ AP +LEEV++KVAGS+ +E F+ WK S K + E D H M HS S
Sbjct: 269 RIAPPDVLEEVIRKVAGSALSEEGFQAWKGSFQQDKNMTEDDSNHIMD-----HSFQSLV 323
Query: 357 ----SSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRTAE 406
S +L+ES S T C + RE+ K+ K K+ H K+ +
Sbjct: 324 SYPGSGWSLDESQSSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHKKVSG 383
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFE 466
GG KRDNDLHRLLFLPNGLPDG L Y VK Q+L G KQG+GIVC CC+ +ISPSQFE
Sbjct: 384 GGKWKRDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFE 443
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLA-MGQRRTTGGSDDMCHVCGDGENLLLCNG 525
AHAGMA RRQPYR I+ S+G++LHDIA+SLA G TTG SDDMC +CG+G +LLLC G
Sbjct: 444 AHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAG 503
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQG-------HSSSMSRSVDLKGGLEAPGAEVGGC 578
CP AFH ACL +PE W C +C G +S + +L + +
Sbjct: 504 CPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCKIATASWLYTYFNLNANILVLHSAYSLS 563
Query: 579 VIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 637
I RSHDFS FDDRTVI CDQCEKE+HVGCLR+N LCDLK IP+DKWFCC DC+RIH
Sbjct: 564 PISDRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCCSDCSRIH 623
Query: 638 AALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLS 697
LQ S QTIP L TI+RK+ EKGI D N V+W+ML + E LLS
Sbjct: 624 RVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNG-NTVEWRMLSGKSRYPE-HLPLLS 681
Query: 698 SATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFG 757
A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA LLRIFG
Sbjct: 682 RAATIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFG 741
Query: 758 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 817
++VAELP+VAT REYQG+G FQ LF+C+E LL SLNVENL+LPAAE+AESIWT KFGF K
Sbjct: 742 QKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTNKFGFTK 801
Query: 818 MSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
M+ RL +YQR+ QLTIFKGTSMLEKKV E
Sbjct: 802 MTEHRLQRYQREVQLTIFKGTSMLEKKVPSFSE 834
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 811 bits (2095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/845 (52%), Positives = 554/845 (65%), Gaps = 56/845 (6%)
Query: 27 KQDRELL-TDNVESDSFPNGKHAKEEALNDDIKSEVSNPVVSPK-EFTSSLQDITSQETK 84
K+DR + DN+E + + + K +E NDDI+SE+SNPV SP + SS +DITS K
Sbjct: 10 KRDRLCIEQDNLEEELYSSNKRQAKEPSNDDIRSEISNPVASPVVDNASSFRDITSNPAK 69
Query: 85 LVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKH 144
+ S ++V S + + +S+E S+ Y + + S D S V EIPKH
Sbjct: 70 --SSSGDRVGSCSGSYEATSDEKRSE--------YCSSLADSSQSSDAVPSSFVREIPKH 119
Query: 145 VSSSSGIRKITFKFSKRKEDYV-APLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMS 203
+S++ GI KITFK SKR ED P+ E S +N + + + +KM
Sbjct: 120 LSTT-GITKITFKLSKRNEDVCDLPMIQEHTWE--------GSPSN---VASSTLGVKML 167
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
K+ +P+NVKKLL+TGILDGA VKY+S S R+L GI++ GGYLCGC +C+FSKV+
Sbjct: 168 DKIDSTNFPSNVKKLLATGILDGARVKYLSISPARELQGIIHSGGYLCGCTVCDFSKVLG 227
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
A+EFE+HAG KT+HPNNHIYLENG+P+Y++IQEL+ AP +LEEV++KVAGS+ +E F+
Sbjct: 228 AYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVAPPDVLEEVIRKVAGSALSEEGFQ 287
Query: 324 VWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHM 383
WK S + D H M S +++ES S T + R++
Sbjct: 288 AWKESFQ------QDDSNHIMDYSFQSLVSYPGSGWSIDESQSSTPYFPENNYFRKKIST 341
Query: 384 KEMLEERKRGVKRPFMH------QKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
K+ E K K+ H K+ A GG KRDNDLHRLLFLPNGLPDG L Y VK
Sbjct: 342 KDTRHEHKPKAKKVTSHMFGMGCHKKAAGGGKWKRDNDLHRLLFLPNGLPDGTELAYFVK 401
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
Q+L G KQG+GIVC CC+ EISPSQFEAHAGMA RRQPYRHI+ S+G++LHDIA+SLA
Sbjct: 402 SQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGMAGRRQPYRHIHISSGLSLHDIAMSLA 461
Query: 498 -MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
G TTG SDDMC +CGDG +LLLC GCP AFH ACL +PE W C +C G +S
Sbjct: 462 DGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTS 521
Query: 557 SMSRSV------DLKGGLE----APGAEVGGCVIC------RSHDFSAATFDDRTVIYCD 600
+ + + G + + V C RS DFS FDDRTVI CD
Sbjct: 522 CKTATATDPNLKSIVGSIAIFSLSAHIRVLHSAYCFSPISDRSLDFSIGKFDDRTVILCD 581
Query: 601 QCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTIN 660
QCEKE+HVGCLR+N LCDLK IP+DKWFCC DC+RIH ALQ S QTIP L TI+
Sbjct: 582 QCEKEYHVGCLRENDLCDLKGIPQDKWFCCSDCSRIHTALQSSASCGPQTIPTVLLDTIS 641
Query: 661 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIP 720
RK+ EKGI D N V+W+ML + E LLS A IFRECFDPI+A+ GRDLIP
Sbjct: 642 RKYREKGICIDNGDN-VEWRMLSGKSRYAE-HLPLLSRAATIFRECFDPIVAKSGRDLIP 699
Query: 721 VMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
VMVYGRNISGQEFGGMY ++L V S+VVSA LLRIFG++VAELP+VAT REYQG+G FQ
Sbjct: 700 VMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQG 759
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 840
LF+C+E LL SLNVENL+LPAAE+AESIWTKKFGF KM+ +L KYQR+ QLTIFKGTSM
Sbjct: 760 LFACVENLLSSLNVENLLLPAAEEAESIWTKKFGFTKMTEHQLQKYQREVQLTIFKGTSM 819
Query: 841 LEKKV 845
LEKKV
Sbjct: 820 LEKKV 824
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/601 (62%), Positives = 441/601 (73%), Gaps = 48/601 (7%)
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++++SA+EFEQHAG KTRHPNNHIYLENG+PIYS+IQE+K+APL IL+EV+ +VAGSS
Sbjct: 213 ITEILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSSV 272
Query: 318 NEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTG 377
N SF WKAS H + E H++KLP L H + E +P + V
Sbjct: 273 NMNSFEAWKASFHQDSANIVV-ENHDVKLPKLSHPV---------ERPNPNFSNAV---- 318
Query: 378 REQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
+ K+TAE GTK+RDNDLHRLLF+PNGLPDG L Y VK
Sbjct: 319 ---------------------LQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVK 357
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
GQR+ G KQGNGI+C CN+EISPSQFEAHAGMAARRQPYRHIYT+NG+TLHDIAISLA
Sbjct: 358 GQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLA 417
Query: 498 MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ----- 552
GQ+ TTG SDDMC CG+G +L+ C+ CP A+H CL +PE W CPNCR
Sbjct: 418 SGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDKVGSN 477
Query: 553 -----GHSSSMSRSV--DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
G S S S+ + L ++AP E+GGCV+CR HDFSAA FDDRTV+ CDQCE+E
Sbjct: 478 SKAISGGSLSFSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQCERE 537
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
FHVGCLR +GLCDLKE+PKDKWFCCD+C+ IH ALQ+ V N AQ IP S I RKH+
Sbjct: 538 FHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIRKHVG 597
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYG 725
KG+L D +NDV+WQ+L F E + LS ATAIFRECFDPI+A+ GRDLIPVMVYG
Sbjct: 598 KGLLVDEALNDVRWQILSGKSRFPE-DLPFLSRATAIFRECFDPIVAKSGRDLIPVMVYG 656
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
RNISGQEFGGMY V+L V+S+VVSAGLLRIFGREVAELP+VAT RE+QGKG FQ LFSCI
Sbjct: 657 RNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREHQGKGYFQVLFSCI 716
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
ERLL SLNV+NLVLPAAE AESIWTKK GFRKMS E+L+KY R+ QLTIF GTSMLEK V
Sbjct: 717 ERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREVQLTIFNGTSMLEKVV 776
Query: 846 Q 846
+
Sbjct: 777 E 777
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 6/84 (7%)
Query: 218 LLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTR 276
+ G+ DGA + Y +R L G G G LC CN + +S +FE HAG A R
Sbjct: 341 FMPNGLPDGAELAYF-VKGQRILGGFKQGNGILCSH--CN--REISPSQFEAHAGMAARR 395
Query: 277 HPNNHIYLENGKPIYSIIQELKTA 300
P HIY NG ++ I L +
Sbjct: 396 QPYRHIYTTNGLTLHDIAISLASG 419
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/850 (50%), Positives = 518/850 (60%), Gaps = 120/850 (14%)
Query: 33 LTDNVESDSFPNGKHAK-EEALNDDIKSEVSNPV---VSPKEFTSSLQDITSQETKLVTE 88
+ +N + FPN K AK EEA NDD KSEVSNPV VSPK SS DI+ +
Sbjct: 34 VNENETGELFPNKKQAKQEEASNDDTKSEVSNPVITLVSPKGNGSSSHDISEESPTNACP 93
Query: 89 SCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSS 148
S + L+ + G G +E + + S ++ VS S VVLEIP+H +S+
Sbjct: 94 SSEETLTVSQEG----------GGSSSEDNTSNQSPRNDTCDSVSMSPVVLEIPEH-AST 142
Query: 149 SGIRKITFKFSKRKEDY---VAPLAYEEG--------RNYTLYDDLGSSGANDGVLCA-- 195
+G+RKITFKFSKRKEDY ++P G RN Y S N
Sbjct: 143 TGVRKITFKFSKRKEDYDTKISPHPLHGGIDQGLLYHRNGDYYPRNHSVWVNSCTEMPQT 202
Query: 196 --RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
R ME+ MSKKVVPN YPTNVKKLL+TGILD A VKYI S ER+LDGI++GGGYLCGC
Sbjct: 203 RERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERELDGIIDGGGYLCGC 262
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CNFSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL +++ V+K VA
Sbjct: 263 SSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 322
Query: 314 GSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFV 373
GSS NE FRVWKAS + LV D+K + +LP LPHS +S +S AL+ES P S SF+
Sbjct: 323 GSSINEEFFRVWKASLNQSNALVGADKKCHSELPCLPHSHVSYASQALKESFCPISSSFL 382
Query: 374 QDTG--REQTHMKEMLEERKRGVKRPFMH-------QKRTAEGGTKKRDNDLHRLLFLPN 424
+ +Q +M E K+ KRP ++ QK+TAE G +KRDNDLHRLLF+PN
Sbjct: 383 YNNNFVSQQMYM-ETSGVNKQTSKRPSLYFPGSATKQKKTAESGVRKRDNDLHRLLFMPN 441
Query: 425 GLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS 484
GLPDG L Y VKGQ++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYTS
Sbjct: 442 GLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTS 501
Query: 485 NGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG 544
NG+TLHDIAISLA GQ TTG DDMC CGDG +L+ +H LL
Sbjct: 502 NGLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLM--------WHVWIYRILL----- 548
Query: 545 WRCPNCRQGHSSSMSRS--VDLKGGLEAPGAEVGGCVICRSH------DFSAATFDD-RT 595
+ Q + +R + L + P VG CV+CR + DF+ A
Sbjct: 549 -KVLGIVQIDGGNFARPTVIRLTRVGKIPEYNVGDCVVCRLNLLKFLIDFTLANSRKCLN 607
Query: 596 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS 655
V+ +KEFHVGCLR++GLCDL+EIP+D WFCC DCN I+ AL++ VS Q IPAS
Sbjct: 608 VMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFCCQDCNNIYVALRNSVSTGVQKIPASL 667
Query: 656 LSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECG 715
L+ INRKH+EKG+L D DVQWQ+L + ++ SLLS A AIFR ++ C
Sbjct: 668 LNIINRKHVEKGLLVDEAAYDVQWQILM-GKSRNREDLSLLSGAAAIFR------VSSCT 720
Query: 716 RDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
L+ + L S+
Sbjct: 721 LHLMKCWFF----------------LAFNSL----------------------------- 735
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 835
F LFSCIERLLCSLNVE LVLPAAE +IWT++FGFRKMS +LLKY R+FQLTIF
Sbjct: 736 --FNPLFSCIERLLCSLNVEQLVLPAAE---TIWTRRFGFRKMSEGQLLKYTREFQLTIF 790
Query: 836 KGTSMLEKKV 845
KGTSMLEK+V
Sbjct: 791 KGTSMLEKEV 800
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/513 (61%), Positives = 375/513 (73%), Gaps = 20/513 (3%)
Query: 351 HSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMH------QKRT 404
HS +S L+ES S T CS E+T+ K+ L+E KR K+ H K+
Sbjct: 10 HSTVSYLGPGLDESQSLTPCSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKV 69
Query: 405 AEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+EG +KRDNDLHRLLF+PNGLPDG L Y VK Q+L G KQG+GIVC CC++EISPSQ
Sbjct: 70 SEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQ 129
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FEAHAGMAARRQPYRHI+ S+G++LHDIA+SLA G TTG SDDMC +CGDG +LLLC
Sbjct: 130 FEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCA 189
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGG-----------LEAPG 572
GCP AFH ACL +PE W C +C G SS + + D G ++AP
Sbjct: 190 GCPQAFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARPIVIRLSRVVKAPE 249
Query: 573 AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 632
+++GGCV CRSHDFS FDDRTVI CDQCEKE+HVGCLR+NG CDLKEIP++KWFCC +
Sbjct: 250 SDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSN 309
Query: 633 CNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE 692
C+RIH A+Q+ VS QT+P L I RK EKGI D + V+W++L + E
Sbjct: 310 CSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTD-IGDTVEWRILSGKSRYPE-H 367
Query: 693 KSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGL 752
LLS A IFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGGMY ++L V S+VVSA L
Sbjct: 368 LPLLSRAAVIFRECFDPIVAKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAAL 427
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
LRIFG+EVAELP+VAT REYQG+G FQ L++C+E LL SLNVENLVLPAAE+AESIWTKK
Sbjct: 428 LRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKK 487
Query: 813 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
FGF KMS ++L +YQ++ QLTIFKGTSMLEKKV
Sbjct: 488 FGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 520
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 218 LLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTR 276
+ G+ DG + Y + ++ L G G G +C C S+ +S +FE HAG A R
Sbjct: 86 FMPNGLPDGTELAYYVKT-QKLLQGYKQGSGIVCSC----CSREISPSQFEAHAGMAARR 140
Query: 277 HPNNHIYLENGKPIYSIIQEL 297
P HI++ +G ++ I L
Sbjct: 141 QPYRHIFISSGLSLHDIAMSL 161
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 301/718 (41%), Positives = 422/718 (58%), Gaps = 52/718 (7%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 1712 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 1771
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 1772 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 1831
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GS F V ++
Sbjct: 1832 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGS------FPVKRS-- 1883
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ SP + + T +K +
Sbjct: 1884 ------LPADEA--AKMDPLGNSCIKRNN-------SPATS--IHRTSERARLLKPIPVT 1926
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
+ G + + + G K+D LHRL+F GLPDG + Y G++L G K+G
Sbjct: 1927 KSSG-SALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 1985
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 1986 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 2045
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 2046 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 2105
Query: 562 VDLKG------------GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
+ G + P AEV CV+CR +DFS + F RT+I CDQCEKEFH+G
Sbjct: 2106 GRVSGVDPIEQITKRCIRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIG 2165
Query: 610 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
CLR + + DLKE+P KWFCC +C RIH+ALQ + +P S L+ I KH KG+
Sbjct: 2166 CLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLE 2225
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNI 728
N V+W++L + + + LLS A AIF + FDPII + GRDLIP MVYGRN+
Sbjct: 2226 SIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNV 2283
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+G FQ LFSCIE+L
Sbjct: 2284 RGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKL 2343
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV 845
L LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+ F+GT MLEK V
Sbjct: 2344 LAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGV 2401
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/718 (42%), Positives = 423/718 (58%), Gaps = 52/718 (7%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 180 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 239
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 240 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 299
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GS F V ++
Sbjct: 300 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGS------FPVKRS-- 351
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ + SI TS E+ + + +
Sbjct: 352 ------LPADEA--AKMDPLGNSCIKRNN-SPATSIHRTS---------ERARLLKPIPV 393
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
K + + + G K+D LHRL+F GLPDG + Y G++L G K+G
Sbjct: 394 TKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 453
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 454 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 513
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 514 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 573
Query: 562 VDLKG------------GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
+ G + P AEV CV+CR +DFS + F RT+I CDQCEKEFH+G
Sbjct: 574 GRVSGVDPIEQITKRCIRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIG 633
Query: 610 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
CLR + + DLKE+P KWFCC +C RIH+ALQ + +P S L+ I KH KG+
Sbjct: 634 CLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLE 693
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNI 728
N V+W++L + + + LLS A AIF + FDPII + GRDLIP MVYGRN+
Sbjct: 694 SIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNV 751
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+G FQ LFSCIE+L
Sbjct: 752 RGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKL 811
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV 845
L LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+ F+GT MLEK V
Sbjct: 812 LAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGV 869
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 281/686 (40%), Positives = 406/686 (59%), Gaps = 68/686 (9%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNF 258
KKV ++P +K LL TGIL+G V+YI S+ E L G+++G G +C C C
Sbjct: 297 KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGGVISGSGIICFCNNCKG 356
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HAG+ + P +IYLE G + I+ + EE ++ G S
Sbjct: 357 KEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSFDQTEEFIQSAIGRSL- 415
Query: 319 EGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGR 378
+ + + + E D M ++ CS + +S P + + +
Sbjct: 416 ---VKRTAICLNCKGRIPESDTGIAM--------LLCCSCM---DSKKPQAIDLLSLS-- 459
Query: 379 EQTHMKEMLEERKRGVKRPFMHQKRT------------AEGGTKKRDNDLHRLLFLPNGL 426
+MKE + +P + K + G ++D LH+L+F + L
Sbjct: 460 -HYYMKEFWADHLIITPKPNVLSKSSDTITKSVSTRGKIHGRITRKDLRLHKLVFEEDIL 518
Query: 427 PDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
PDG + Y +GQ+L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG
Sbjct: 519 PDGTEVAYYARGQKLLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNG 578
Query: 487 MTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
++LH+++ISL+ G++ + +DD+C +C DG +LL C+GCP +FH C+ IP W
Sbjct: 579 VSLHELSISLSKGRKFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLQCIPTGIWY 638
Query: 547 CPNCRQ--------GHSSS----------------MSRSVDLKGGLEAPGAEVGGCVICR 582
C C+ H+++ +R + + +E EVGGC +CR
Sbjct: 639 CKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEV---EVGGCALCR 695
Query: 583 SHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD 642
HDFS + F RTVI CDQCEKEFHVGCL++N + DLKE+P+ KWFCC +CNRIH+AL+
Sbjct: 696 CHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEK 755
Query: 643 FVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQ--WQMLKKAQCFEEKEKSLLSSAT 700
V + +P S L ++ +K ++G ++NDV+ W++L ++ +SLLS A
Sbjct: 756 LVVLGGEKLPESILVSVQKKIEDQG---SASINDVEIRWRVLNWKMLSSDETRSLLSKAV 812
Query: 701 AIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGRE 759
+IF +CFDPI+ + GRD IP M+YGRNI GQEFGG+Y +LTV VVS G+ RIFG E
Sbjct: 813 SIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAE 872
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
VAELPLVAT +QG+G FQ+L++CIER L LNV+NLVLPAA++AES+W KFGF K+
Sbjct: 873 VAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESLWINKFGFSKLP 932
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
E +++++R +Q+ IF+GTSML+K+V
Sbjct: 933 PEEVMEFKRHYQMMIFQGTSMLQKEV 958
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 23/152 (15%)
Query: 360 ALEESISPTSCSFVQ--DTGREQTHMKEMLEERKRGVKRPF------MHQKRTAEGGTKK 411
AL++++ PTS + +TG + E + + P + + + + KK
Sbjct: 244 ALKQNVEPTSLEHLSKCNTGVAMQVITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKK 303
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF-------GCKQGNGIVCDCCN---KEI- 460
L LL G+ +G R+ YI +G +++ G G+GI+C C N KE+
Sbjct: 304 FPAKLKDLL--DTGILEGLRVRYI-RGSKIKALGETGLGGVISGSGIICFCNNCKGKEVV 360
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
SP+ FE HAG + +R P +IY G TL DI
Sbjct: 361 SPTLFELHAGSSNKRPP-EYIYLETGNTLRDI 391
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 296/691 (42%), Positives = 394/691 (57%), Gaps = 43/691 (6%)
Query: 184 GSSGANDGVLCARN-MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYIS-TSRERQLD 241
G SG N + RN ME+KMSKK+V N P VKKL TG LDG V Y+ + L
Sbjct: 110 GGSGINGALGAPRNKMELKMSKKIVVNRKPMTVKKLFDTGFLDGVSVVYMGGIKKASGLR 169
Query: 242 GIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP 301
G++ GG LC C LCN +V+ +FE HA + R +I LENGK + +++ + A
Sbjct: 170 GVIRDGGILCSCCLCNGRRVIPPSQFEIHACKQYRRAAQYICLENGKSLLDLLRACRGAT 229
Query: 302 LGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLAL 361
L LE V+ S E F + VE S S +++
Sbjct: 230 LHTLEVTVQNFVCSPHEERYFTCKRCKGCFPSSFVERVGPICRSCVESRKSEESSNNVVG 289
Query: 362 EESISPTSCSFVQDTGREQTHMKEMLEE-RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLL 420
+ SP + + + ++ RK+ K F+ D LH+L+
Sbjct: 290 KRVRSPRPVVLSNPSSTSELSVSSQVKRHRKKRTKLVFISISSVL-------DQRLHKLV 342
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRH 480
F NGLPDG + Y +GQ+L G K G+GIVC CCN EISPSQFE HAG A+R++PY +
Sbjct: 343 FEENGLPDGTEVAYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAY 402
Query: 481 IYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI 540
IYTSNG++LH++AISL+ ++ + +DD+C VC DG NLLLC+GCP AFH C I
Sbjct: 403 IYTSNGVSLHELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSI 462
Query: 541 PESGWRCPNCRQGHSSS------------------------MSRSVDLKGGLEAPGAEVG 576
P W C C+ +R + + +EA ++
Sbjct: 463 PRGDWYCQFCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEA---DLS 519
Query: 577 GCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 636
C +CR DFS + F RT+I CDQCEKE+HVGCLR + + LKE+P+ W CC+DC RI
Sbjct: 520 SCALCRGVDFSRSGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRI 579
Query: 637 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 696
H+ L++ + A+ +P S L I +K EKG+ + DV+W++L + + + LL
Sbjct: 580 HSTLENLLVKGAERLPESLLGVIKKKQEEKGL---EPIIDVRWRLLN-GKIASPETRPLL 635
Query: 697 SSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRI 755
A +IF ECF+PI+ A GRDLIP MVYGRN+ GQEFGGMY +L V S VVSAG+LRI
Sbjct: 636 LEAVSIFHECFNPIVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRI 695
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
FG +VAELPLVAT GKG FQ LFSCIERLL LNV+NLVLPAAE+AESIWT KFGF
Sbjct: 696 FGSDVAELPLVATSNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGF 755
Query: 816 RKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
KM+ + L Y+++ Q+ FKGT+ML K V
Sbjct: 756 SKMNPDELTNYRKNCHQMVSFKGTNMLHKMV 786
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 281/679 (41%), Positives = 406/679 (59%), Gaps = 54/679 (7%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNF 258
KKV ++P +K LL TGIL+G V+YI S+ E L G+++G G +C C C
Sbjct: 297 KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGGVISGSGIICFCNNCKG 356
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HAG+ + P +IYLE G + I+ + EE ++ G S
Sbjct: 357 KEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSFDQTEEFIQSAIGRSL- 415
Query: 319 EGSFRVWKASHHLRKGLVEHDEKHNMKL-----PSLPHSIISCSSLALEESISPTSCSFV 373
+ + + + E D M L S + S S + S +P S
Sbjct: 416 ---VKRTAICLNCKGRIPESDTGIAMLLCCSCMDSRKPQVSSSPSPSPSPSPTPIVFS-- 470
Query: 374 QDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLT 433
+++T +L + + + + + G ++D LH+L+F + LPDG +
Sbjct: 471 ----KDRTPKPNVLSKSSDTITKSVSTRGKI-HGRITRKDLRLHKLVFEEDILPDGTEVA 525
Query: 434 YIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA 493
Y +GQ+L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+++
Sbjct: 526 YYARGQKLLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELS 585
Query: 494 ISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ- 552
ISL+ G++ + +DD+C +C DG +LL C+GCP +FH C+ IP W C C+
Sbjct: 586 ISLSKGRKFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNL 645
Query: 553 -------GHSSS----------------MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAA 589
H+++ +R + + +E EVGGC +CR HDFS +
Sbjct: 646 FQKEKFVEHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEV---EVGGCALCRCHDFSKS 702
Query: 590 TFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQ 649
F RTVI CDQCEKEFHVGCL++N + DLKE+P+ KWFCC +CNRIH+AL+ V +
Sbjct: 703 GFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGE 762
Query: 650 TIPASSLSTINRKHIEKGILFDGTMNDVQ--WQMLKKAQCFEEKEKSLLSSATAIFRECF 707
+P S L ++ +K ++G ++NDV+ W++L ++ +SLLS A +IF +CF
Sbjct: 763 KLPESILVSVQKKIEDQG---SASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCF 819
Query: 708 DPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 766
DPI+ + GRD IP M+YGRNI GQEFGG+Y +LTV VVS G+ RIFG EVAELPLV
Sbjct: 820 DPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLV 879
Query: 767 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 826
AT +QG+G FQ+L++CIER L LNV+NLVLPAA++AES+W KFGF K+ E ++++
Sbjct: 880 ATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADEAESLWINKFGFSKLPPEEVMEF 939
Query: 827 QRDFQLTIFKGTSMLEKKV 845
+R +Q+ IF+GTSML+K+V
Sbjct: 940 KRHYQMMIFQGTSMLQKEV 958
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 23/152 (15%)
Query: 360 ALEESISPTSCSFVQ--DTGREQTHMKEMLEERKRGVKRPF------MHQKRTAEGGTKK 411
AL++++ PTS + +TG + E + + P + + + + KK
Sbjct: 244 ALKQNVEPTSLEHLSKCNTGVAMQVITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKK 303
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF-------GCKQGNGIVCDCCN---KEI- 460
L LL G+ +G R+ YI +G +++ G G+GI+C C N KE+
Sbjct: 304 FPAKLKDLL--DTGILEGLRVRYI-RGSKIKALGETGLGGVISGSGIICFCNNCKGKEVV 360
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
SP+ FE HAG + +R P +IY G TL DI
Sbjct: 361 SPTLFELHAGSSNKRPP-EYIYLETGNTLRDI 391
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 324/843 (38%), Positives = 447/843 (53%), Gaps = 124/843 (14%)
Query: 86 VTESCNQVLSTTSTGNLSS------EETLSDGDERTESSYAETSRTDNNSGDVSKSHVVL 139
VTE+ V+ T + L S EE GD +T+S + D + +
Sbjct: 243 VTEAMMNVVKTDNEEQLDSLMEPTMEELPMSGDCKTKSEAIK--------ADCAYGSALA 294
Query: 140 EIPKHV-------SSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGV 192
+P+ V SS +R+ T K K+ ++ E+ T +D S +
Sbjct: 295 TVPESVENSKVSTSSEKPLRRFTRSCLKTKQQAMSASPAED----TKAEDALESDEASAI 350
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGG 247
+E+KMSKKV + PT +K LL+TG+L+G V+Y+ ++ E+ L G++ G
Sbjct: 351 GTTSKLEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQGVIQGS 410
Query: 248 GYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEE 307
G LC C C +KVV+ ++FE HAG+ + P +IYL+NGK + ++ K AP LE
Sbjct: 411 GILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEA 470
Query: 308 VVKKVAG--------------SSFNEGSF----------RVWKASHHL----------RK 333
++ G +S E SF K S R
Sbjct: 471 AIRNATGAGDARKSTFCLNCKASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRD 530
Query: 334 GLVEH---------DEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
G +E D+ + + LP L L+ SP+S S ++ T R +
Sbjct: 531 GQLEFIFLLNYYWADDLYKLGLPDL---------RGLQ--WSPSSNSVLKSTERMSSGTC 579
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
+ R ++ K EG LP DG L Y V+G++L G
Sbjct: 580 PPSKVHGRLTRKDLRMHKLVFEGDV------------LP----DGTALAYYVRGKKLLEG 623
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT 504
K+G I C CC E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+++I L+M +R ++
Sbjct: 624 YKKGGAIFCYCCQSEVSPSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSS 683
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVD 563
+DD+C +C DG +LL C+ CP AFH C+ IP W C C RS +
Sbjct: 684 DENDDLCSICADGGDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKFDRSAN 743
Query: 564 LKGGLEAPG-------------------AEVGGCVICRSHDFSAATFDDRTVIYCDQCEK 604
G AEVG CV+CRSHDFS + F +TVI CDQCEK
Sbjct: 744 AIAAGRVAGIDALEQITKCSIRIVDTLHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEK 803
Query: 605 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 664
E+HV CL ++ + DLKE+PKDKWFCC +CN IH ALQ VS+ Q++P S + IN K I
Sbjct: 804 EYHVKCLEEHNMDDLKELPKDKWFCCKECNSIHYALQKLVSDGEQSLPDSLMGIINEK-I 862
Query: 665 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII--AECGRDLIPVM 722
+ L D ++NDV+W++L EE + LS A +IF + FDPI + DLIP M
Sbjct: 863 KAKNLEDNSINDVKWRLLSGKNSTEET-RVWLSGAVSIFHDSFDPIADSSTSRLDLIPTM 921
Query: 723 VYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
VYGRN Q+FGGM IL V S+VVSAG++RIFG+EVAELPLVAT + QGKG FQ+LF
Sbjct: 922 VYGRNFKDQDFGGMLCAILMVNSLVVSAGVIRIFGKEVAELPLVATSLDCQGKGYFQSLF 981
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
IE LL SL V+ LVLPAAE+AESIWTKKFGF+ ++ E L Y+ ++QL IF+GT+ML+
Sbjct: 982 YSIENLLKSLGVKYLVLPAAEEAESIWTKKFGFQHITPEELKHYKDNYQLMIFQGTAMLQ 1041
Query: 843 KKV 845
K+V
Sbjct: 1042 KQV 1044
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 514 bits (1323), Expect = e-142, Method: Compositional matrix adjust.
Identities = 283/677 (41%), Positives = 394/677 (58%), Gaps = 50/677 (7%)
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL 255
+N+E+KMSKK+ + P VK+L TG+L+G V Y+ + L G + G LC C
Sbjct: 186 KNLELKMSKKIALDNIPMTVKELFETGLLEGVPVVYMGGKKAFCLRGTIKDVGILCYCSF 245
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C +V+ +FE HA + R +I ENGK + ++ + +PL LE ++
Sbjct: 246 CKGCRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISG 305
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
E +F + LV K+ L S + + E + SP + ++
Sbjct: 306 LPKEKTFTCKRCKGTYPTILVG-------KVGPLCSSCVE----SKESNGSPACETNIKS 354
Query: 376 TGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
+ + + L GV + + +D LH+L+F GLPDG + Y
Sbjct: 355 RSSKPATVSKSLNSALEGV-----SSENKCQWKITTKDQRLHKLVFEDGGLPDGTEVAYY 409
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
+GQ+L G K+G GI+C CCN E+SPS FEAHAG A R++PY +IYTSNG++LH++AIS
Sbjct: 410 ARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVSLHELAIS 469
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--- 552
L+ G++ + +DD+C VC DG +L+LC+GCP AFH C IP W C C+
Sbjct: 470 LSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQFCQNMFQ 529
Query: 553 --------------GHSSSM-------SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATF 591
G S + R + + +E AE+ GCV+CR +DFS + F
Sbjct: 530 REKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIE---AELTGCVLCRGYDFSRSGF 586
Query: 592 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
RT+I CDQC KEFHVGCLR + + +LKE+PK KWFCC DC RIH+AL+ ++ A+ I
Sbjct: 587 GPRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLAREAEII 646
Query: 652 PASSLSTINRKHIEKGILFDGTMN--DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 709
P L + +K+ EKG+ T+N DV+W++L E K LLS A AIF+ECFDP
Sbjct: 647 PNKLLEVVMKKNEEKGL---ETVNNIDVRWKLLTGKSASPE-TKLLLSQALAIFQECFDP 702
Query: 710 IIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 769
I+ GRDLIP+MVYG+N GQ++GGMY +L V S VVSA ++RIFG+EVAELPLVAT
Sbjct: 703 IVDTTGRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLVATS 762
Query: 770 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 829
GKG FQ LFS IE+LL L V ++VLPAAE+AESIWT KFGF+K+ ++L KY++
Sbjct: 763 NGNHGKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSKYRKS 822
Query: 830 -FQLTIFKGTSMLEKKV 845
Q+ FKGTSML+K V
Sbjct: 823 CCQILTFKGTSMLQKAV 839
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 317/866 (36%), Positives = 462/866 (53%), Gaps = 72/866 (8%)
Query: 27 KQDRELLTDNVESDSFPNGKHAKEEALNDDIKSEVSNPVVSPKEFTSSL---QDITSQET 83
++D L D + P +E NDD+K + + S +I + E
Sbjct: 190 EEDSRKLEDVTIKEEIPKRNEPEEVLGNDDLKRYADGNDQCEERISGSSPNSMNIDNFEN 249
Query: 84 KLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPK 143
+ S N++ T+ L E+ SD + E + +S NS + + +
Sbjct: 250 QNGEHSKNEMEKVTAMNELL--ESKSDMNNVNEEGTSMSSVILMNSEGGAIDSLPINDST 307
Query: 144 HVSSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGA--NDGVLCARNMEIK 201
V +R+ T K K + A ++ + + DD GS A N G + +K
Sbjct: 308 KVEKEKPMRRFTRSLLKPKMEIGQEYAVKD--SSSAADDAGSPSAASNSGTM------LK 359
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLC 256
+ K ++PT +K LL +GIL+G VKY+ S+ R L G+++G LC C C
Sbjct: 360 VWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSC 419
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG-S 315
++VV+ FE HAG+ + P +IYLENG + ++ K A L L+E + G S
Sbjct: 420 RGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCS 479
Query: 316 SFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEESISPTSCSFVQ 374
S +F + R L E +M L CS + L++S + +
Sbjct: 480 SLKNSTFCL-----KCRGKLAEASTGRSMTL---------CSQCMVLKDSQASIPATTDT 525
Query: 375 DTGREQTHM---KEMLEERKRGVKRPF------MHQKRTAEGGTKKRDNDLHRLLFLPNG 425
D G ++ + + +L + V + + ++G +D +H+L+F +
Sbjct: 526 DKGYAESDVCAYRIVLTPKSHPVSKSSDSVLKCSTSRSKSQGRLTVKDLRMHKLVFEEDV 585
Query: 426 LPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
LPDG + Y +GQ+L G K+G GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSN
Sbjct: 586 LPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSN 645
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
G++LH++AISL+ ++ +T +DD+C +C DG +LL C+ CP A+H CL IP W
Sbjct: 646 GVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLCCDVCPRAYHKDCLALPEIPTGRW 705
Query: 546 RCPNCRQG--------HSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRSH 584
C C H+++ + + G ++ A+ GGCV CR H
Sbjct: 706 YCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVKTMDADFGGCVFCRGH 765
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 644
DF F RTV+ CDQCEKEFHVGCL+ + + DLKE+PK WFCC DC RIH+AL+ V
Sbjct: 766 DFDK-IFGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCCSDCCRIHSALEKLV 824
Query: 645 SNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR 704
+ + SSL+ IN+K EK D + DV+W++L +LLS A AI
Sbjct: 825 LRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAGDTAALLSEALAILH 884
Query: 705 ECFDPII-----AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGRE 759
E F+PI+ ++ RDLI MV+G N+ GQEFGGMY +L + VVS ++R FG E
Sbjct: 885 EQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGMYCAVLMINQAVVSCAIIRFFGLE 944
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
+AELPLVAT + QGKG FQALF+CIE+LL LN++NLVLPAAE+AESIW KFGFRK++
Sbjct: 945 LAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAEEAESIWINKFGFRKLT 1004
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
E LK+++D+Q+ +F+GTSML K V
Sbjct: 1005 HEEFLKFRKDYQMMVFQGTSMLHKPV 1030
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 278/666 (41%), Positives = 385/666 (57%), Gaps = 74/666 (11%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKV 261
VP +P +K+LL TGIL+ V+YI SR R L G++ G G LC C C +KV
Sbjct: 204 VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGESGLRGVIKGSGILCSCNSCKGTKV 263
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
V+ + FE HAG+ + P +IYLENG + ++ K A L L+E ++ G S
Sbjct: 264 VTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAWKNAALDSLDEAIRVAIGCSM---- 319
Query: 322 FRVWKASHHLR-KGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQ 380
+ K++ L KG + N K+ + H ++ +S S S + GR
Sbjct: 320 --IKKSTFCLNCKGRISEAGIGNSKVLCMLHELL----------VSKCSSSGSKSYGR-- 365
Query: 381 THMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQR 440
K+D LH+L+F NGLP+G + Y V+GQ+
Sbjct: 366 ----------------------------VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQ 397
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+ +ISL+ G+
Sbjct: 398 LLVGYKRGSGIFCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGR 457
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-------- 552
+ +DD+C +C DG NLL C+GCP FH C+ IP+ W C C
Sbjct: 458 EISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFV 517
Query: 553 GHSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
H+++ + + G + E+GGC +CR H+FS + F RTV+ C
Sbjct: 518 EHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLC 577
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
DQCEKEFHVGCLR++ + DLKE+PK KWFCC DC RI+++LQ V + + +P + L+TI
Sbjct: 578 DQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTI 637
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLI 719
K+ G D++W+++ + + SLLS A +IF E FDPI GRDL+
Sbjct: 638 KEKYGRNGSACSKD-PDIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADAAGRDLL 696
Query: 720 PVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 779
P MV+G++ +FGGMY ILT+ S VVSA RIFG+EVAELPLVAT + QG+G FQ
Sbjct: 697 PDMVHGKSTREWDFGGMYCAILTISSQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQ 756
Query: 780 ALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTS 839
LFSC+E LL L V +LVLPAAE AESIWT KFGF K+++E+ ++RD+Q+ F+GT
Sbjct: 757 TLFSCLEGLLGVLEVRSLVLPAAEGAESIWTNKFGFNKVTQEQRNNFRRDYQMVTFQGTL 816
Query: 840 MLEKKV 845
ML+K V
Sbjct: 817 MLQKLV 822
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 280/676 (41%), Positives = 392/676 (57%), Gaps = 56/676 (8%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRER--QLDGIVNGGGYLCGCPLCNFSKV 261
K +V ++ P VK+L TG+LDG V Y+ ++ +L G + GG LC C LCN +V
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+ +FE HA + +I LENGK + +++ + APL LE ++ S E
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 322 FRV------WKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQD 375
F + +S+ R GL+ +SC E S +
Sbjct: 207 FTCKNCRGCFPSSNVERVGLL----------------CLSCVESRKSEK------SSIHA 244
Query: 376 TGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYI 435
G+ K ++ + + + + ++ KR LH+L+F +GLP+G + Y
Sbjct: 245 VGKRIRSSKLSVKLKTAPITSKCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYY 304
Query: 436 VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS 495
+GQ+L G K +GIVC CCN EISPSQFE HAG A+RR+PY IYTSNG++LH++AI
Sbjct: 305 ARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYTSNGVSLHELAIF 364
Query: 496 LAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG-- 553
L+ + TT +D +C VC DG NLLLC+GCP AFH C IP W C C+
Sbjct: 365 LSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFL 424
Query: 554 ------HSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRSHDFSAATFDDR 594
H++ + ++G ++ GAE+GGCV+CRS DFS + F R
Sbjct: 425 RERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPR 484
Query: 595 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 654
T+I CDQCEKE+HVGCLR + + LKE+P+ WFCC+DC RIH+ L++ + A+ +P S
Sbjct: 485 TIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLENLLIRVAERLPES 544
Query: 655 SLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AE 713
L I +K + + + + DV+W++L + + + LL A ++F ECFDPI+
Sbjct: 545 LLDVIKKKQVGRCLEPLNEI-DVRWKLL-NGKIASPETRPLLLEAVSMFHECFDPIVDPA 602
Query: 714 CGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 773
GRDLIP MVYGRN+ Q+FGGMY +L V S VVSAG++RIFGR++AELPLVAT + +
Sbjct: 603 AGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAELPLVATRYKNR 662
Query: 774 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQL 832
GKG FQ LF+CIERLL LNV+NLVLPAAE+A SIWT+KFGF KM +L Y+ + Q+
Sbjct: 663 GKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKPNQLTNYRMNCHQI 722
Query: 833 TIFKGTSMLEKKV-QC 847
FKGT+ML K V QC
Sbjct: 723 MAFKGTNMLHKTVPQC 738
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 279/676 (41%), Positives = 391/676 (57%), Gaps = 52/676 (7%)
Query: 200 IKMSKKV-VPNEYPTNVKKLLSTGILDGACVKYISTSRER--QLDGIVNGGGYLCGCPLC 256
+K +KK+ V ++ P VK+L TG+LDG V Y+ ++ +L G + GG LC C LC
Sbjct: 90 LKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCRLC 149
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
N +V+ +FE HA + +I LENGK + +++ + APL LE ++ S
Sbjct: 150 NGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQNFINSP 209
Query: 317 FNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDT 376
E F R L ++ P H ++ + + +++
Sbjct: 210 PEEKYF----TCKSCRGPLGQYYS------PIHVHVVLLNLNSVSLLKL--------RNS 251
Query: 377 GREQTHMKEMLEERKRGV--KRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTY 434
GR++ L + + V + + ++ KR LH+L+F +GLP+G + Y
Sbjct: 252 GRQEQSWSSKLSVKLKTVPITSKCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAY 311
Query: 435 IVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
+GQ+L G K GIVC CCN E+SPSQFE HAG A+RR+PY +IYTSNG++LH++AI
Sbjct: 312 YARGQKLLEGIKTRCGIVCRCCNTEVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAI 371
Query: 495 SLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
L+ + TT +D +C VC DG NLLLC+GCP AFH C IP W C C+ H
Sbjct: 372 FLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQICQ--H 429
Query: 555 SSSMSRSVDLKGGLEAP-----------------------GAEVGGCVICRSHDFSAATF 591
+ R V A GAE+GGCV+CRS DFS + F
Sbjct: 430 TFLRERPVLYNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGF 489
Query: 592 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
RT+I CDQCEKE+HVGCLR + LKE+P+ WFCC+DC IH+ L++ + A+ +
Sbjct: 490 GPRTIIICDQCEKEYHVGCLRDHKKAYLKELPEGDWFCCNDCTIIHSTLENLLIRVAERL 549
Query: 652 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 711
P + L I +K +E+ + + DV+W++L + + + LL A ++F ECFDPI+
Sbjct: 550 PEALLDVIKKKQVERCLEPLNEI-DVRWKLL-NGKIASPETRPLLLEAVSMFHECFDPIV 607
Query: 712 -AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 770
GRDLIP MVYGRN+ Q+FGGMY +L V S VVSAG++RIFGR++AELPLVAT
Sbjct: 608 DPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAELPLVATRY 667
Query: 771 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD- 829
+ +GKG FQ LF+CIERLL LNV+NLVLPAAE+AESIWT+KFGF KM ++L Y+ +
Sbjct: 668 KNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAESIWTEKFGFSKMKLDQLTNYRMNC 727
Query: 830 FQLTIFKGTSMLEKKV 845
Q+ FKGT+ML K V
Sbjct: 728 HQIMAFKGTNMLHKTV 743
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 288/718 (40%), Positives = 401/718 (55%), Gaps = 101/718 (14%)
Query: 152 RKITFKFSKRKEDYVAPLA--YEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN 209
++ T K KED V L Y + + D ++GA + + + +KMSKK+ N
Sbjct: 1777 KRFTRSALKSKEDTVESLESDYNFCNSVAIGVDEKTNGAVRSLTSPKKLGLKMSKKIALN 1836
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQ 269
+ P ++ LL TG+L+G V Y + +L G + G G LC C LC S+VV +FE
Sbjct: 1837 KVPLTIRDLLETGMLEGYPVTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFEL 1896
Query: 270 HAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASH 329
HA RH +IYL+NGK ++ ++ K APL LE ++ GSF V ++
Sbjct: 1897 HACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAI------GSFPVKRS-- 1948
Query: 330 HLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEE 389
+ DE K+ L +S I ++ SP +
Sbjct: 1949 ------LPADEA--AKMDPLGNSCIKRNN-------SPATS------------------- 1974
Query: 390 RKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGN 449
+H RT+E + RD LHRL+F GLPDG + Y G++L G K+G
Sbjct: 1975 ---------IH--RTSE---RARDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGF 2020
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDD 509
GI C CC+ E+S SQFEAHAG A+R++PY +IYTSNG++LH++AISL+ G++ + +DD
Sbjct: 2021 GIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDD 2080
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--------GHSSSMSRS 561
+C +CGDG NLLLC+GCP AFH C IP+ W C C+ H+++ +
Sbjct: 2081 LCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAA 2140
Query: 562 VDLKG------------GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
+ G + P AEV CV+CR +DFS + F RT+I CDQ
Sbjct: 2141 GRVSGVDPIEQITKRCIRIVNPEAEVSACVLCRGYDFSKSGFGPRTIILCDQ-------- 2192
Query: 610 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
E+P KWFCC +C RIH+ALQ + +P S L+ I KH KG+
Sbjct: 2193 -----------ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLE 2241
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNI 728
N V+W++L + + + LLS A AIF + FDPII + GRDLIP MVYGRN+
Sbjct: 2242 SIADYN-VRWRLL-SGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNV 2299
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
GQ+F G+Y ++TV S VVSAG+LR+FG+EVAELPLVAT + QG+G FQ LFSCIE+L
Sbjct: 2300 RGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKL 2359
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV 845
L LNV + VLPAAE+AE IWTKKFGF+K++ ++L +Y++ F Q+ F+GT MLEK V
Sbjct: 2360 LAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGV 2417
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 234/340 (68%), Positives = 267/340 (78%), Gaps = 4/340 (1%)
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
Q++ G KQGNGIVC CC EISPSQFE+HAGM+ARRQPYRHIYTSNG+TLHDIAISLA
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-GHSSS 557
GQ TTG DDMC CGDG +L+ C CP AFHAACLD PE W CPNC + GH +
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNKLGHGGN 136
Query: 558 MSRSV--DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+R + L ++ P +VGGC +CR+HDFS TFDDRTVI CDQCEKEFHVGCLR++G
Sbjct: 137 FARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESG 196
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN 675
LCDLKEIPKD WFCC DCN I+ AL++ VS QTIPAS L+ INRKH+EKG+L D
Sbjct: 197 LCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLNIINRKHVEKGLLVDEAAY 256
Query: 676 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQEFGG 735
DVQWQ+L + ++ SLLS A AIFRECFDPI+A+ GRDLIPVMVYGRNISGQEFGG
Sbjct: 257 DVQWQIL-MGKSRNREDLSLLSGAAAIFRECFDPIVAKTGRDLIPVMVYGRNISGQEFGG 315
Query: 736 MYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
MY V+LTV+ VVVSAGLLRIFGREVAELPLVAT RE+QGK
Sbjct: 316 MYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGK 355
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 284/702 (40%), Positives = 395/702 (56%), Gaps = 59/702 (8%)
Query: 181 DDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR---- 236
DD GSS V KM ++PT +K LL +GIL+G VKY+ +
Sbjct: 264 DDTGSSS----VAAITTTPTKMFAIDGLKKFPTKLKDLLDSGILEGQKVKYLRGPKVRGP 319
Query: 237 -ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ 295
E+ L G+V G LC C C +VV+ FE HAG+ + P +I+LENG + ++
Sbjct: 320 GEKGLHGVVKESGILCFCDDCKGKEVVTPTIFELHAGSANKRPPEYIFLENGNTLRDVMN 379
Query: 296 ELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIIS 355
K + L IL+E ++ G F K S+ + ++ S +
Sbjct: 380 ACKNSSLDILDEAIRLSIG-------FTPSKKSNFCLSC------RGSITGAGTRKSKVL 426
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDND 415
CS LE S + DT +E+T + E + + + ++G K+D
Sbjct: 427 CSQ-CLELKDSQAILAPETDT-KERTPRPSPVPESSSALLKS-SPSRSNSQGRLTKKDIR 483
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARR 475
+H+L+F LPDG + Y +G++L G K+G GI C CCN E+SPSQFEAHAG A+RR
Sbjct: 484 MHKLVFEEEVLPDGTEVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRR 543
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL 535
+PY HIYTSNG++LH++AISL+ +R +T +DD+C +C DG LL C+ CP AFH CL
Sbjct: 544 KPYLHIYTSNGVSLHELAISLSKCRRHSTKENDDLCQICRDGGKLLCCDVCPRAFHQECL 603
Query: 536 DPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPG-------------------AE 574
IP+ W C C + R+ + G AE
Sbjct: 604 SLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQITRRCIRIVKTFEAE 663
Query: 575 VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK---EIPKDKWFCCD 631
VGGCV CR HDF TF RTVI CDQCEKEFHVGCL+++ + DLK E+P KWFCC
Sbjct: 664 VGGCVFCRGHDFE-RTFGPRTVIICDQCEKEFHVGCLKEHQMQDLKAICELPTGKWFCCT 722
Query: 632 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 691
C RIH+ALQ V + +P SSL+ I +KH E+ G +D++W++L K +
Sbjct: 723 GCERIHSALQKLVIRGEEKLPDSSLNFIKKKH-EESASESGGGDDIRWRLLSKKTDPSDV 781
Query: 692 EKSLLSSATAIFRECFDPIIAECGR------DLIPVMVYGRNISGQEFGGMYSVILTVKS 745
+SLLS A AIF E F PI + + D IP MV G ++ GQ+ GGMY +L V
Sbjct: 782 TESLLSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQDLGGMYCAVLLVNH 841
Query: 746 VVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKA 805
VVSA ++RIFG+E+AELP+VAT + QG+G FQ LF+CIE+LL LNV+NLVLPAAE+
Sbjct: 842 EVVSAAVMRIFGQELAELPIVATSSKSQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEV 901
Query: 806 ESIWTKKFGFRKMSRE--RLLKYQRDFQLTIFKGTSMLEKKV 845
ESIWT KFGF ++++ RL++Y++ +Q+ F+G+ ML+K V
Sbjct: 902 ESIWTNKFGFSTITQDEVRLMEYRKSYQIMEFQGSLMLQKPV 943
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 294/766 (38%), Positives = 403/766 (52%), Gaps = 96/766 (12%)
Query: 166 VAPLAYEEGRNYTLYDDLG--SSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGI 223
+ PLA + + D G S N +E+KMSKK+ N+ P V++L TG+
Sbjct: 172 ITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGL 231
Query: 224 LDGACVKYISTSR--ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNH 281
L+G V Y+ + + L G + G LC C CN +V+ +FE HA + + +
Sbjct: 232 LEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQY 291
Query: 282 IYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF--RVWKASHHLRKGLV--- 336
I LENGK + +++ K + LE V+ + SS E F R K G V
Sbjct: 292 ICLENGKSLLDLLKACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSSVGQVGPL 350
Query: 337 ------EHDEKHNMKLPSLPHSII-----------------------------------S 355
K + LP+ P S I S
Sbjct: 351 CPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKS 410
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTA---------- 405
SS S SP S R + K L+ RK+ +K M + +
Sbjct: 411 KSSEYTSISRSPRSAPM-----RIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAK 465
Query: 406 -EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+ +D LH+L+F +GLPDG + Y +GQ+L G K+G+GI+C CCN +SPSQ
Sbjct: 466 NQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQ 525
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAG ++R++PY +IYTSNG++LH++AISL+ G++ + +DD+C +C DG NLLLC+
Sbjct: 526 FEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCD 585
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPGAE-------- 574
GCP AFH C P W C C+ + +V+ G +
Sbjct: 586 GCPRAFHKECASLSSTPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKR 645
Query: 575 -----------VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ GCV+CR DFS + F RT+I CDQCEKEFHVGCL+ + + LKE+P
Sbjct: 646 CIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELP 705
Query: 624 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTMNDVQWQM 681
+ KWFC C RIH+ALQ + + +P S L +NRK E I D DV W++
Sbjct: 706 RGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVD---VDVSWRL 762
Query: 682 LKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVI 740
+ + + + LLS A AIF + FDPI+ GRDLIP MVYGR++ GQEFGGMY I
Sbjct: 763 IS-GKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAI 821
Query: 741 LTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLP 800
L V S VVSA +LR+FG+++AELPLVAT GKG FQ LFSCIERLL L V+ LVLP
Sbjct: 822 LIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLP 881
Query: 801 AAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
AAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSML+K V
Sbjct: 882 AAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTV 927
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 274/710 (38%), Positives = 405/710 (57%), Gaps = 93/710 (13%)
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLCGCPLCNFSKVVSA 264
++P+ +K LL+TGIL+G V Y+ ++ E+ L G++ G LC C +CN +VV+
Sbjct: 312 KFPSKLKDLLATGILEGLPVMYMKGAKVLFAGEKGLQGVIQDSGVLCFCKICNGVEVVTP 371
Query: 265 HEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTA------PLGILEEVVKKVAGSSFN 318
FE HAG+ + P +IY+ +G ++++ A PL ++E V+K+ G
Sbjct: 372 TVFELHAGSANKRPPEYIYIHDGN-CGKTLRDVMNACCCCDFPLESMDEAVQKLLGD--- 427
Query: 319 EGSFRVWKASHHLR-KGLVEHDEKHNMKL---------------------PSLPHSIISC 356
F + K+S L +G + K L P P ++
Sbjct: 428 ---FTMKKSSICLNCRGACKGVSKLVCDLCLASPPQTAMASRKVISQPVQPRSPEPVVIQ 484
Query: 357 SSL-------ALEESISPTSC-SFVQ----DTG-REQTHMKEMLEERKRGVKRPFMHQKR 403
SL +L+ + P S + VQ DTG + ++ M RG
Sbjct: 485 KSLDNEVQPNSLDNEVPPNSLDNEVQPNSLDTGVQPKSFSNGMKHSASRG---------- 534
Query: 404 TAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPS 463
++G ++D LH+L+F + LPDG L Y GQ+L G K+G GI C CCN+++S S
Sbjct: 535 KSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGCGIFCTCCNEQVSAS 594
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR-TTGGSDDMCHVCGDGENLLL 522
QFEAHAG A+RR+PY HIYTSNG++LH+++ISL+ RR + +DD+C +C DG +LL
Sbjct: 595 QFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLC 654
Query: 523 CNGCPLAFHAACLDPLLIPESGWRCPNC-----RQGHSSSMSRSVDLKGGLEAPG----- 572
C+GCP AFH C+ IP W C C + H ++ G + P
Sbjct: 655 CDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELM 714
Query: 573 ------------AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
+ GGC +C +FS +F RTVI CDQCEKE+HVGCL+++ + +L+
Sbjct: 715 NKRCIRVVKTVEVDHGGCALCSRPNFSK-SFGPRTVIICDQCEKEYHVGCLKEHNMENLE 773
Query: 621 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 680
++P+ WFC +C+ IH AL D V+++ + +P LS I +KH EK + G DV+W+
Sbjct: 774 KLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEKSLEI-GAGLDVKWR 832
Query: 681 MLKKAQCFEEKE----KSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGG 735
++ + + + LLS A AIF E FDPI+ + GRD IP M++GRNI GQ+F G
Sbjct: 833 VMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTMLFGRNIRGQDFSG 892
Query: 736 MYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVE 795
+Y +LTV +VSAG+ R+FG E+AELPLVAT ++QG+G FQ LFSCIE LL SLNV+
Sbjct: 893 IYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVK 952
Query: 796 NLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
NLVLPAA++AESIWT KFGF K+ ++ + KY++ +++ IF+GTS+L+K V
Sbjct: 953 NLVLPAADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPV 1002
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 276/744 (37%), Positives = 415/744 (55%), Gaps = 91/744 (12%)
Query: 183 LGSSGANDGV----------LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYI 232
+G +G +DGV + +K S + ++P ++ LL+TGIL+G V Y+
Sbjct: 244 VGVAGISDGVKRETEAGASLVMTTPSSVKFSNRGKLKKFPAKLRDLLATGILEGLPVMYM 303
Query: 233 STSR-----ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+ E+ L G++ G LC C +C +VV+ FE HAG+ + P +IY+ +G
Sbjct: 304 KGVKVLFDGEKGLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDG 363
Query: 288 KPIYSIIQELKTA-----PLGILEEVVKKVAGSSFNEGSFRVWKASHHLR-----KGL-- 335
++ + PL ++E V+K+ G F + K+S L KG+
Sbjct: 364 NSGKTLRDVMNACCCCDFPLESMDEAVQKLLGD------FTMKKSSICLNCRGACKGVSR 417
Query: 336 --------------VEHDEKHNMKL-PSLPHSIISCSSLALEESISPTSC-SFVQ----D 375
V ++ + + P P ++ SL + + P S + VQ D
Sbjct: 418 LVCDSCLVSPAQTAVASNKGISQPVQPRSPEPVVIQKSL--DNEVQPNSLHNEVQPNKLD 475
Query: 376 TGREQTHMKEMLEE-------RKRGVKRPFMHQ--KRTAEGGTKKRDNDLHRLLFLPNGL 426
TG + + +E + + H + ++G ++D LH+L+F + L
Sbjct: 476 TGMQPNSLDNGMEPDSLNNSMKPKSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVL 535
Query: 427 PDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
PDG L Y GQ+L G K+G GI C CCN+++S SQFEAHAG A+RR+PY HIYTSNG
Sbjct: 536 PDGTELAYYAHGQKLLVGYKKGYGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNG 595
Query: 487 MTLHDIAISLAMGQRR-TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
++LH+++ISL+ RR + +DD+C +C DG +LL C+GCP AFH C+ IP W
Sbjct: 596 ISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGTW 655
Query: 546 RCPNC-----RQGHSSSMSRSVDLKGGLEAPG-----------------AEVGGCVICRS 583
C C + H ++ G + P + GGC +C
Sbjct: 656 YCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSR 715
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
+FS +F +TVI CDQCEKE+HVGCL+ + + +L+E+P WFC +C++IH AL D
Sbjct: 716 PNFSK-SFGPQTVIICDQCEKEYHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDL 774
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK-KAQCFEEKEKSLLSSATAI 702
V+++ + +P L+ I +KH EK + G DV+W+++ K + + LLS A AI
Sbjct: 775 VASKEKDVPDPLLNLIKKKHEEKSLDI-GAGLDVKWRVINWKLDSDSVETRKLLSKAVAI 833
Query: 703 FRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
F E FDPI+ + GRD IP M++GRNI GQ+F G+Y +LTV +VSAG+ R+FG E+A
Sbjct: 834 FHERFDPIVDSTSGRDFIPAMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIA 893
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
ELPLVAT ++QG+G FQ LFSCIE LL SLNV+NLVLPAA++AESIWT KFGF K+ ++
Sbjct: 894 ELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQD 953
Query: 822 RLLKYQRDFQLTIFKGTSMLEKKV 845
+ KY++ +++ IF+GTS+L+K V
Sbjct: 954 EINKYKKFYRMMIFQGTSVLQKPV 977
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 277/689 (40%), Positives = 385/689 (55%), Gaps = 67/689 (9%)
Query: 197 NMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSR-----ERQLDGIVNGGGYLC 251
N KM ++PT +K LL +GIL+G VKY+ ++ E+ L G+V G LC
Sbjct: 300 NTPTKMFSIDGSKKFPTKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILC 359
Query: 252 GCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKK 311
C C +VV+ F HAG+ + P +I LENG + ++ K + L L+E ++
Sbjct: 360 FCDDCKGKEVVTPAIFVLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRL 419
Query: 312 VAGSSFNEGSFRVWKASHHL-----RKGLVEHDEKHNMK---LPSLPHSIISCSSLALEE 363
G S ++ S W + RK V + +K S P + A
Sbjct: 420 STGFSPSKKSNFCWNCRGSITGAGSRKSKVLCSQCFGLKDFQAGSAPKTAKK-ERTAKPH 478
Query: 364 SISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLP 423
S+ +SC+ ++ + ++G K+D H+L+F
Sbjct: 479 SVPESSCNLLKSS-----------------------LSGSKSQGRVTKKDIRTHKLVFEE 515
Query: 424 NGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
LPDG + Y +G++L G K+G GI C CCN E+SPSQFEAHAG A+RR+PY +IYT
Sbjct: 516 EVLPDGTEVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIYT 575
Query: 484 SNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
SNG++LH++AISL+ G+R + +DD+C +C DG LL C+ CP AFH CL IP
Sbjct: 576 SNGVSLHELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPRG 635
Query: 544 GWRCPNCRQGHSSS--MSRSVDLKGGLEAPG-------------------AEVGGCVICR 582
W C C + R+ + G AEVGGCV CR
Sbjct: 636 KWYCKYCLNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVKTFEAEVGGCVFCR 695
Query: 583 SHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD 642
HDF TF RTVI CDQCEKEFHVGCL+++ + DLKE+PK KWFCC C RIH+ALQ
Sbjct: 696 GHDFE-RTFGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCCTGCERIHSALQK 754
Query: 643 FVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAI 702
V + +P SSL+ I +KH E+ G +DV+W++L K + ++LLS A AI
Sbjct: 755 LVIRGEEKLPDSSLNFI-KKH-EESASESGCSDDVRWRLLSKKTDSSDVTEALLSDAVAI 812
Query: 703 FRECFDPIIAECGR------DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIF 756
F ECFDPI + + D IP MV G N+ GQ+ GGMY +L V VVVS ++RIF
Sbjct: 813 FHECFDPITVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLVNHVVVSVAVVRIF 872
Query: 757 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 816
G+E+AELP+VAT +QG+G FQ LF+CIE+LL LNV+NLVLPAAE+ SIW KFGF
Sbjct: 873 GQELAELPIVATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVGSIWKNKFGFG 932
Query: 817 KMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
++++ L++Y+R +Q+ +F+G ML+K V
Sbjct: 933 AITQDELMEYRRRYQIMVFQGALMLQKPV 961
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 295/775 (38%), Positives = 404/775 (52%), Gaps = 105/775 (13%)
Query: 166 VAPLAYEEGRNYTLYDDLG--SSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGI 223
+ PLA + + D G S N +E+KMSKK+ N+ P V++L TG+
Sbjct: 172 ITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGL 231
Query: 224 LDGACVKYISTSR--ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNH 281
L+G V Y+ + + L G + G LC C CN +V+ +FE HA + + +
Sbjct: 232 LEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQY 291
Query: 282 IYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSF--RVWKASHHLRKGLV--- 336
I LENGK + +++ K + LE V+ + SS E F R K G V
Sbjct: 292 ICLENGKSLLDLLKACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSSVGQVGPL 350
Query: 337 ------EHDEKHNMKLPSLPHSII-----------------------------------S 355
K + LP+ P S I S
Sbjct: 351 CPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKS 410
Query: 356 CSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTA---------- 405
SS S SP S R + K L+ RK+ +K M + +
Sbjct: 411 KSSEYTSISRSPRSAPM-----RIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAK 465
Query: 406 -EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQ 464
+ +D LH+L+F +GLPDG + Y +GQ+L G K+G+GI+C CCN +SPSQ
Sbjct: 466 NQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQ 525
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAG ++R++PY +IYTSNG++LH++AISL+ G++ + +DD+C +C DG NLLLC+
Sbjct: 526 FEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCD 585
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPGAE-------- 574
GCP AFH C IP W C C+ + +V+ G +
Sbjct: 586 GCPRAFHKECASLSSIPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKR 645
Query: 575 -----------VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ GCV+CR DFS + F RT+I CDQCEKEFHVGCL+ + + LKE+P
Sbjct: 646 CIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELP 705
Query: 624 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTMNDVQWQM 681
+ KWFC C RIH+ALQ + + +P S L +NRK E I D DV W++
Sbjct: 706 RGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVD---VDVSWRL 762
Query: 682 LKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVI 740
+ + + + LLS A AIF + FDPI+ GRDLIP MVYGR++ GQEFGGMY I
Sbjct: 763 I-SGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAI 821
Query: 741 LTVKSVVVSAGLLRIFGR---------EVAELPLVATCREYQGKGCFQALFSCIERLLCS 791
L V S VVSA +LR+FG+ ++AELPLVAT GKG FQ LFSCIERLL
Sbjct: 822 LIVNSFVVSAAMLRVFGQYCRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAF 881
Query: 792 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
L V+ LVLPAAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSML+K V
Sbjct: 882 LKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTV 936
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 464 bits (1193), Expect = e-127, Method: Compositional matrix adjust.
Identities = 271/682 (39%), Positives = 385/682 (56%), Gaps = 67/682 (9%)
Query: 200 IKM-SKKVVPNE---YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYL 250
+KM S +VP+ +PT +K+LL TGIL+ V+YI R ++ L G++ G G L
Sbjct: 197 VKMGSSALVPSTLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGIL 256
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
C C C + VV+ + FE HAG+ + P +IYLENG + S++ A L L+E ++
Sbjct: 257 CYCDTCKGTNVVTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIR 316
Query: 311 KVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEES-ISPT 368
GSS + +F + K ++ S++ C S + L+ES SP
Sbjct: 317 VAIGSSIKKSTFCF--------------NCKGSISEVGTSDSLVLCESCVGLKESHASPA 362
Query: 369 SCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+ D + T + + + R K+D LH+L F N LP+
Sbjct: 363 QPTGTSDRSTKTTTVSKCSSSGSKNYGR------------VTKKDVGLHKLAFGENDLPE 410
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G ++Y V+G+RL G K+G I+C CCN E+SPSQFEAH+G A+RR+PY HIYTSNG++
Sbjct: 411 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 470
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LH++++SL G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C
Sbjct: 471 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 530
Query: 549 NC-----------RQGHSSSMSRSVDLKGGLEA--------------PGAEVGGCVICRS 583
C R ++ + R GG++ E GGC +CR
Sbjct: 531 FCLNTLQKGKFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRR 586
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
H+FS + F TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+
Sbjct: 587 HEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKI 646
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 703
V + + +P L I +++ KG + G D++W++L + + SLLS A ++F
Sbjct: 647 VVRQEEELPDDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRRASATEAGSLLSQALSLF 705
Query: 704 RECFDPIIAECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAEL 763
E F+PI GRDL+ MV+ + EFGGMY ILTV VVSA R+ G+EVAEL
Sbjct: 706 HEQFNPIADAEGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAEL 765
Query: 764 PLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERL 823
PLVAT + QG+G FQAL++CIERLLC L V +LVLPAAE AES+W KF F KM +E L
Sbjct: 766 PLVATRSDCQGQGYFQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEEL 825
Query: 824 LKYQRDFQLTIFKGTSMLEKKV 845
RDFQ+ F+GTSML+K V
Sbjct: 826 NHLCRDFQMMTFQGTSMLQKPV 847
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 267/651 (41%), Positives = 366/651 (56%), Gaps = 70/651 (10%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKV 261
VP +P +K+LL TGIL+ V+YI SR R L G++ G G LC C C +KV
Sbjct: 204 VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGESGLRGVIKGSGILCSCNSCKGTKV 263
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
V+ + FE HAG+ + P +IYLENG + ++ K A L L+E ++ G S
Sbjct: 264 VTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAWKNAALDSLDEAIRVAIGCSM---- 319
Query: 322 FRVWKASHHLR-KGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQ 380
+ K++ L KG + N K ++ S L L+ES + S Q TG
Sbjct: 320 --IKKSTFCLNCKGRISEAGIGNSK-------VLCLSCLQLKESQASPS----QVTGSSD 366
Query: 381 THM---KEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK 437
+H+ K R + G K+D LH+L+F NGLP+G + Y V+
Sbjct: 367 SHLRSPKPSTISRSAESVSKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVR 426
Query: 438 GQ---RLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
GQ +L G K+G+GI C CCN E+SPSQFEAHAG A+RR+PY HIYTSNG++LH+ +I
Sbjct: 427 GQVVTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSI 486
Query: 495 SLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ-- 552
SL+ G+ + +DD+C +C DG NLL C+GCP FH C+ IP+ W C C
Sbjct: 487 SLSRGREISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCNNML 546
Query: 553 ------GHSSSMSRSVDLKGG-------------LEAPGAEVGGCVICRSHDFSAATFDD 593
H+++ + + G + E+GGC +CR H+FS + F
Sbjct: 547 QKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGP 606
Query: 594 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 653
RTV+ CDQCEKEFHVGCLR++ + DLKE+PK KWFCC DC RI+++LQ V + + +P
Sbjct: 607 RTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPH 666
Query: 654 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE 713
+ L+TI K+ G D++W+++ + + SLLS A +IF E FDPI
Sbjct: 667 NVLTTIKEKYGRNGSACSKDP-DIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA 725
Query: 714 CGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 773
GRDL+P MV+G S VVSA RIFG+EVAELPLVAT + Q
Sbjct: 726 AGRDLLPDMVHG-------------------SQVVSAAAFRIFGKEVAELPLVATRSDCQ 766
Query: 774 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLL 824
G+G FQ LFSC+E LL L V +LVLPAAE AESIWT KFGF K+++E+ +
Sbjct: 767 GQGYFQTLFSCLEGLLGVLEVRSLVLPAAEGAESIWTNKFGFNKVTQEQYI 817
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 273/702 (38%), Positives = 378/702 (53%), Gaps = 75/702 (10%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFS 259
MSKK+ P VK+L TG+L+G V Y+ + + L G + G LC C CN
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKK-------- 311
+V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 312 ------------VAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSL 359
+A F S + RK + +E+ ++ PS S+ S
Sbjct: 121 RTFTCKRCKEQVLALEYFKPASLSTSSQDNTPRKKKRKPEEQDSITKPSKSASVYLSSRK 180
Query: 360 ALEESISPTSCSFVQDT----GREQTH-------MKEMLEERKRGVKRPFMHQKRTAEGG 408
+ ISP F G +K + RK PF G
Sbjct: 181 RKYKKISPRLVCFFYPIDILFGLVMLSPFPFLWLVKIFVFIRKYAYLSPF-----CPFSG 235
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
+ +D LHRL+F GLPDG L Y +GQ + + NK S SQ +
Sbjct: 236 YQSQDQRLHRLVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQ-KLL 294
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
G Y +IYTSNG++LH++AISL+ ++ ++ +DD+C +C DG NLLLC+GCP
Sbjct: 295 GG-------YAYIYTSNGVSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPR 347
Query: 529 AFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG---------------- 572
AFH C +P W C C+ ++ + V+ A G
Sbjct: 348 AFHKGCASIPTVPSGDWYCQYCQ--NTFEREKLVEHNANASAAGRDSGIDSIEQITKRCF 405
Query: 573 -------AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
AE+ GC +CR +DF + F RT+I CDQCEKEFHVGCLR + + +LKE+PK
Sbjct: 406 RIVKNIEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKG 465
Query: 626 KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKA 685
WFCC DC+RIH+ LQ + A+ +P S L+ I +KH E+G+ + DV+W +L
Sbjct: 466 NWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEERGLNISNNI-DVRWTLLS-G 523
Query: 686 QCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVK 744
+ + K LLS A +IF+ECFDPI+ + GRDLIP+MVYG+N GQ++GGMY +LT+
Sbjct: 524 KIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTIN 583
Query: 745 SVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 804
S +VSAG+LR+FG EVAELPLVAT GKG FQ LFSCIE+LL LNV+NLVLPAAE+
Sbjct: 584 SSIVSAGILRVFGEEVAELPLVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEE 643
Query: 805 AESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
AESIWT+KFGF+K+ E+L KY++ Q+ F+GTSML+K V
Sbjct: 644 AESIWTEKFGFQKIKPEQLNKYRKSCCQMVRFEGTSMLQKAV 685
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 260/684 (38%), Positives = 376/684 (54%), Gaps = 59/684 (8%)
Query: 198 MEIKMSKKVV-PNEYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPL 255
ME+KMSKKV +++P N + LLSTG+L+G V YI S++ L G++ G C C
Sbjct: 507 MEMKMSKKVAFLSKHPGNTRDLLSTGLLEGMPVMYIIPNSKKPVLKGVIAGCNIRCFCVK 566
Query: 256 CNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGS 315
C+ SK ++ + FE HAG+ +HP +IYL NG + +++ +++PL L++ ++
Sbjct: 567 CDGSKTITTYFFELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDP 626
Query: 316 SFNEGSFRVWKASHHLRKGLVE----HDEKHNMKLPSLPHSIISCSSLALEESISPTSCS 371
+ L E HD P P S ++CS
Sbjct: 627 MLIRTRMNCLNCNGELPSQTEEQFLCHDCCPESNQPQDPTSPLACSKS------------ 674
Query: 372 FVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGER 431
+ KE L +R K + G +D LH+L+F L DG
Sbjct: 675 ----SSSLTPSSKESLLKRMSA-------SKGASTGKVTTKDTGLHKLVF--KVLLDGTE 721
Query: 432 LTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHD 491
+ Y V GQ+ G + I C+ CNK +SPS FEAHAG +RR+PY +I+TSNG++LH+
Sbjct: 722 VNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHE 781
Query: 492 IAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC- 550
+++S++ + + +DD+C CG G ++ C CP +FH AC+ ++P W C NC
Sbjct: 782 LSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCT 841
Query: 551 -------------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATF 591
RQ S+ + + + ++GGC +C+ DF+ A F
Sbjct: 842 ILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCALCKKKDFNNAVF 901
Query: 592 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
D+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCCD C+ I ++L +S A +
Sbjct: 902 DERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISEGAHPL 961
Query: 652 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 711
S + I +KH KG++ D +++WQ++ E+ SLLSSA + + FDPII
Sbjct: 962 SESDVDIIRKKHESKGLVMDAN-TEIRWQLVAGRSATEDG-NSLLSSAVPVIHQSFDPII 1019
Query: 712 -AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 766
A GRDLIP MV+GR + GQ++ GMY +LTV S VVSA LLR+ G +VAELPLV
Sbjct: 1020 EAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLV 1079
Query: 767 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 826
AT + QG G FQ LFSCIERLL SL V++ +LPAA +AE+IW KKFGF K+ ++++ Y
Sbjct: 1080 ATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAY 1139
Query: 827 QRDFQLTIFKGTSMLEKKVQCLPE 850
LT+F GT L K + LPE
Sbjct: 1140 LNGGHLTVFHGTLNLYKAIP-LPE 1162
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 243/516 (47%), Positives = 327/516 (63%), Gaps = 33/516 (6%)
Query: 359 LALEESISPTSCSFVQD-TGREQTHMKEM--LEERKRGVKRPFMHQKRTAEGGTKKRDND 415
LALE + +F QD T R++ E L + V + ++ RD
Sbjct: 211 LALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVASVHLSPRKRKYKKISPRDQR 270
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEI-SPSQFEAHAGMAAR 474
LHRL+F GLPDG L Y +GQ+L G K+G GI+C CCN E+ SPS FEAHAG A R
Sbjct: 271 LHRLVFEEGGLPDGTELAYYARGQKLLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATR 330
Query: 475 RQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC 534
++PY IYTSNG++LHD+AISL+ ++ ++ +DD+C +C DG +LLLC+GCP AFH C
Sbjct: 331 KKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCIICADGGDLLLCDGCPRAFHKGC 390
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG---------------------- 572
+P W C +C+ ++ + V+ A G
Sbjct: 391 ASLSTVPSGDWYCQHCQ--NTFQREKFVEHNANAFAAGRVSEIDSIEQITKRCFRIVKNV 448
Query: 573 -AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 631
AE+ GC +CR +DF + F RT+I CDQCEKEFHVGCLR + + +LKE+PK WFCC
Sbjct: 449 EAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELPKGNWFCCM 508
Query: 632 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK 691
DC+RIH+ LQ + A+ +P S L+ I +KH EKG+ ++ DV+W +L + +
Sbjct: 509 DCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISNSI-DVRWTLLS-GKIASPE 566
Query: 692 EKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 750
K LLS A +IF+ECFDPI+ + GRDLIP+MVYG+N GQ++GGMY +L V S +VSA
Sbjct: 567 NKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLIVNSCIVSA 626
Query: 751 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 810
G+LR+FG EVAELPLVAT GKG FQ LFSCIE+LL LNV+NLVLPAAE+AESIW
Sbjct: 627 GILRVFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWI 686
Query: 811 KKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
+KFGF+K+ E+L KY+++ Q+ F+GTSML+K V
Sbjct: 687 EKFGFQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAV 722
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 2/111 (1%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFS 259
MSKK+ + P VK+L TG+L+G V Y+ + + L G + G LC C CN
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
+V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQ 111
Score = 39.3 bits (90), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 44/95 (46%), Gaps = 5/95 (5%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK+ P + + G+ DG + Y + ++ L G G G LC C CN +VVS
Sbjct: 262 KKISPRDQRLHRLVFEEGGLPDGTELAYYARG-QKLLGGYKRGFGILCHC--CN-CEVVS 317
Query: 264 AHEFEQHAGAKTR-HPNNHIYLENGKPIYSIIQEL 297
FE HAG TR P IY NG ++ + L
Sbjct: 318 PSTFEAHAGWATRKKPYACIYTSNGVSLHDLAISL 352
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 253/663 (38%), Positives = 379/663 (57%), Gaps = 52/663 (7%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P N+++LL+TG+L+G V YI S++ L G++ G C C CN +K +SA+ FE
Sbjct: 209 KHPGNIRELLNTGMLEGMPVMYIIPHSKKAVLKGVITGCNIRCFCLSCNGAKAISAYYFE 268
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
QHAG+ +HP ++IYL NG + +++ +PL LEE ++ SS +
Sbjct: 269 QHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIR----SSIDP--------- 315
Query: 329 HHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFV-QDTGREQTHMKEML 387
+++ + + + LPS +++ L ++ P + S+ + KE L
Sbjct: 316 -VVKRSRINCLNCNELVLPSSHENVLCQVCLESKQPQDPLTASYTCNGSSSLSRSSKEAL 374
Query: 388 EERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQ 447
++ +K + G +DN LH+L+F N L DG + Y V GQR G +
Sbjct: 375 ------LRNISSGKKGGSAGKVTNKDNRLHKLVF--NVLLDGTEVAYYVDGQRKVDGYIK 426
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS 507
+ I C+ CN+ +SPS FEAHAG +RR+PY +I+TSNG++LH++A+ ++ + +
Sbjct: 427 DHRIYCNHCNRVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELAMKISKDMELSERET 486
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--GHSSSMSRSVDLK 565
DD+C CG G ++ C CP +FH AC+ +P + W C +CR +++++ + K
Sbjct: 487 DDLCRECGQGGDIFPCKICPRSFHPACVGLSKVP-AEWYCDSCRNLVQKEKALAKNKNAK 545
Query: 566 GGLEAPGAE------------------VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFH 607
G + +GGC +C+ DF+ A FD+RTVI CDQCEKE+H
Sbjct: 546 AAGRQAGVDSIEQIMKRAIRIVPISDDLGGCALCKQKDFNNAVFDERTVILCDQCEKEYH 605
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
VGCL+ +LKE+P+++WFCC C+ ++L +S+ AQ + L I +KH +G
Sbjct: 606 VGCLQSQWQVELKELPEEEWFCCSSCSETRSSLDKIISDGAQLLADPDLEIIKKKHETRG 665
Query: 668 ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR 726
+ D T D++WQ+L + E+ LLS+A IF + FDPI A GRDLIP MV GR
Sbjct: 666 LCMD-TSKDLKWQLLSGKRATEDGS-ILLSAAVPIFHQSFDPIREALTGRDLIPEMVNGR 723
Query: 727 N----ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
+ GQ++ GMY +LTV S VVSA LLR+ G +VAELPLVAT ++ QG G FQALF
Sbjct: 724 GPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVMGGDVAELPLVATSQDVQGLGYFQALF 783
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
SCIER+L SL +++ VLPAA +AE IW KFGF ++S E L Y LTIF GTS +
Sbjct: 784 SCIERVLVSLKIKHFVLPAAHEAEGIWMNKFGFSRISPEELEAYLNGAHLTIFHGTSYMY 843
Query: 843 KKV 845
K V
Sbjct: 844 KAV 846
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 252/672 (37%), Positives = 376/672 (55%), Gaps = 67/672 (9%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P+N+++LL+TG+L+G V+YI +S++ L G++ G C C CN SK V ++ FE
Sbjct: 382 KHPSNIRELLNTGLLEGMPVRYIIPSSKKAVLKGVITGCNIRCFCLSCNGSKDVCSYFFE 441
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
QHAG+ +HP +HIYL NG + +++ +++PL LE+ ++ +
Sbjct: 442 QHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCLNCN 501
Query: 329 HHLRKGLVE----------HDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGR 378
HL E + K + PS ++ S SSL P+S F+
Sbjct: 502 EHLSSSQTEIFGSFLCQRCLEPKQHQDPPSPSYACKSNSSLI------PSSKDFL----- 550
Query: 379 EQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKG 438
+K+ ++ K + G +D LH+L+F L DG + Y V G
Sbjct: 551 ---------------LKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDG 593
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
QR G + I C+ CN+ +SPS FEAHAG RR+PY +I+TSNG++LH++++ ++
Sbjct: 594 QRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISK 653
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------- 550
+ +DD+C CG G ++ C CP +FH AC+ +P S W C NC
Sbjct: 654 DMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEK 712
Query: 551 ------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIY 598
RQ S+ + + + ++GGC +C+ DF+ + FD+RTVI
Sbjct: 713 ALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDDLGGCALCKQKDFNNSVFDERTVIL 772
Query: 599 CDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLST 658
CDQCEKE+HVGCLR DLKE+P+ +WFCC+ C+ I ++L +S+ A + S +
Sbjct: 773 CDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDI 832
Query: 659 INRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRD 717
I +KH KG+ D T D++W++L + E+ + LLS+A I + FDPII + GRD
Sbjct: 833 IRKKHEMKGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPIIHQSFDPIIEVQSGRD 890
Query: 718 LIPVMVYGRN----ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 773
LIP MV GR + GQ++ GMY +LT+ + VVSA LLR+ G EVAELPLVAT ++ Q
Sbjct: 891 LIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQ 950
Query: 774 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 833
G G FQALFSCIER+L SL +++ +LPAA++AE IW KFGF K+ +E+ Y LT
Sbjct: 951 GLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLNGAHLT 1010
Query: 834 IFKGTSMLEKKV 845
IF GTS L K +
Sbjct: 1011 IFHGTSNLYKAI 1022
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 253/674 (37%), Positives = 375/674 (55%), Gaps = 69/674 (10%)
Query: 210 EYPTNVKKLLSTGILDGACVKYISTSRERQ---LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
++P+N+++LL+TG+L+G V+YI S + Q L G++ G C C CN SK V ++
Sbjct: 371 KHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCSYF 430
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWK 326
FEQHAG+ +HP +HIYL NG + +++ +++PL LE+ ++
Sbjct: 431 FEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCLN 490
Query: 327 ASHHLRKGLVE----------HDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDT 376
+ HL E + K + PS ++ S SSL P+S F+
Sbjct: 491 CNEHLSSSQTEIFGSFLCQRCLEPKQHQDPPSPSYACKSNSSLI------PSSKDFL--- 541
Query: 377 GREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIV 436
+K+ ++ K + G +D LH+L+F L DG + Y V
Sbjct: 542 -----------------LKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYV 582
Query: 437 KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISL 496
GQR G + I C+ CN+ +SPS FEAHAG RR+PY +I+TSNG++LH++++ +
Sbjct: 583 DGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKI 642
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC------ 550
+ + +DD+C CG G ++ C CP +FH AC+ +P S W C NC
Sbjct: 643 SKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQK 701
Query: 551 --------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTV 596
RQ S+ + + + ++GGC +C+ DF+ + FD+RTV
Sbjct: 702 EKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISDDLGGCALCKQKDFNNSVFDERTV 761
Query: 597 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 656
I CDQCEKE+HVGCLR DLKE+P+ +WFCC+ C+ I ++L +S+ A + S +
Sbjct: 762 ILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDI 821
Query: 657 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECG 715
I +KH KG+ D T D++W++L + E+ + LLS+A I + FDPII + G
Sbjct: 822 DIIRKKHEMKGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPIIHQSFDPIIEVQSG 879
Query: 716 RDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCRE 771
RDLIP MV GR + GQ++ GMY +LT+ + VVSA LLR+ G EVAELPLVAT ++
Sbjct: 880 RDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKD 939
Query: 772 YQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQ 831
QG G FQALFSCIER+L SL +++ +LPAA++AE IW KFGF K+ +E+ Y
Sbjct: 940 LQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLNGAH 999
Query: 832 LTIFKGTSMLEKKV 845
LTIF GTS L K +
Sbjct: 1000 LTIFHGTSNLYKAI 1013
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 264/700 (37%), Positives = 380/700 (54%), Gaps = 71/700 (10%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNF 258
KK + N +P +K + GIL+G V Y+ ++ R+ L G++ G G LC C C
Sbjct: 371 KKFLRN-FPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIG 429
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS FE HA + + P +I LE+G + ++ K PL LEE ++ V G
Sbjct: 430 IQVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILK 489
Query: 319 EGSFRVW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPT 368
+ S + S + K +E E PS + ++ SS S+ P
Sbjct: 490 KSSLCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPK 546
Query: 369 SC------SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRD 413
S S Q REQ K +L E K + H K T ++D
Sbjct: 547 SILRRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKD 600
Query: 414 NDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAA 473
LH+L+F + LPDG + Y V G+++ G K+G GI C CCNK +SPS FEAHAG A+
Sbjct: 601 LRLHKLVFEDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCAS 660
Query: 474 RRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAA 533
RR+P++HIYT+NG++LH+++++L+M QR + +DD+C +C DG L+ C+ CP ++H
Sbjct: 661 RRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKV 720
Query: 534 CLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG--------AEVGG-------- 577
C +P W C C + + VD A G AE+
Sbjct: 721 CASLPSLPSERWSCKYCV--NMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSS 778
Query: 578 --------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
CV+CR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC
Sbjct: 779 FVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFC 838
Query: 630 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQC 687
C I+ L + + + + + L+ + +K E+ T D++W++L
Sbjct: 839 SLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLT 898
Query: 688 FEEKEKSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKS 745
+ K LL+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V
Sbjct: 899 SSDDTKILLAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDE 957
Query: 746 VVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKA 805
V+VS G+ R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A
Sbjct: 958 VIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEA 1017
Query: 806 ESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+SIWT KFGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 1018 KSIWTDKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 1057
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 228/464 (49%), Positives = 297/464 (64%), Gaps = 35/464 (7%)
Query: 413 DNDLHRLLFLPNGLPDGERLTYIVKGQ------RLRFGCKQGNGIVCDCCNKEISPSQFE 466
DN LH+L+F NGLPDG L Y GQ +L G K+G+GIVC CCN EISPSQFE
Sbjct: 261 DNRLHKLVFEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFE 320
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGC 526
HAG A+R++PY +IYTSNG++LH+++ISL+ ++ + +DD+C VC DG NLLLC+GC
Sbjct: 321 VHAGWASRKKPYAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGC 380
Query: 527 PLAFHAACLDPLLIPESGWRCPNCRQGHSSS--MSRSVDLKGGLEAPG------------ 572
P AFH C IP W C C+ ++ +V+ G
Sbjct: 381 PRAFHKECASLSSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCI 440
Query: 573 -------AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
AE+ C +CR DFS + F RT+I CDQCEKE+HVGCLR + + LKE+PK
Sbjct: 441 RIVKDIDAELSACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKG 500
Query: 626 KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND--VQWQMLK 683
W CC+DC RIH+ L++ + A+ +P S L+ I +K EKG+ +ND V+W++L
Sbjct: 501 NWLCCNDCTRIHSTLENVLVRGAERLPKSLLAVIKKKQEEKGL---DPINDINVRWRLLS 557
Query: 684 KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILT 742
+ E + LL A +IF ECFDPI+ A GRDLI MVYG+++ GQEFGGMY +L
Sbjct: 558 GKKASPE-TRPLLLEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLI 616
Query: 743 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 802
V S VVSAG+LRIFG ++AELPLVAT GKG FQALFSCIERLL + V+NLVLPAA
Sbjct: 617 VNSSVVSAGMLRIFGTDIAELPLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAA 676
Query: 803 EKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV 845
E+A+SIWT KFGF K+ + L Y+R+ Q F+GT+ML K V
Sbjct: 677 EEAQSIWTDKFGFSKIKPDELANYRRNCNQFVTFQGTNMLHKMV 720
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C LC +V+S +FE HA + R +I ENGK + +++ +
Sbjct: 35 LRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRG 94
Query: 300 APLGILEEVVKKVAGSSFNEGSF 322
APL LE ++ + S E F
Sbjct: 95 APLHDLEATIQNIVCSPPEEKYF 117
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 16/107 (14%)
Query: 199 EIKMSKKVVPNEYPT----NVKKLL--STGILDGACVKY-----ISTSRERQLDGIVNGG 247
E K++KK+VP +PT + KL+ G+ DG+ + Y + + R++ L+G G
Sbjct: 244 EWKITKKLVPYSFPTCGDNRLHKLVFEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGS 303
Query: 248 GYLCGCPLCNFSKVVSAHEFEQHAGAKTR-HPNNHIYLENGKPIYSI 293
G +C C CN +S +FE HAG +R P +IY NG ++ +
Sbjct: 304 GIVCRC--CNTE--ISPSQFEVHAGWASRKKPYAYIYTSNGVSLHEL 346
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 217/466 (46%), Positives = 298/466 (63%), Gaps = 25/466 (5%)
Query: 402 KRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEIS 461
K T+ G ++D+ LH+L+FL LP+G + Y V G+RL G + GI C CCN +S
Sbjct: 274 KSTSAGRLTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYIKEPGIHCHCCNTVVS 333
Query: 462 PSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLL 521
PSQFE HAG AARR+PY +IY SNG++LH++++SL+ G++ + SDD+C +C DG LL
Sbjct: 334 PSQFEGHAGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQSDDLCSICSDGGELL 393
Query: 522 LCNGCPLAFHAACLDPLLIPESGWRCPNC--RQGHSSSMSRS--------VDLKGGLE-- 569
LC+ CP AFH C+D +P+ W C C RQ SS++ + +D +E
Sbjct: 394 LCDTCPRAFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQI 453
Query: 570 ---------APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
P GGC +C+ HDF F RTV+ CDQC +E+HVGCL+++ + DL
Sbjct: 454 FTRSIRIATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLT 513
Query: 621 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 680
+P+ W+C DC RI ++D +S A+ +PA I +K +KG+ DG + DV+W+
Sbjct: 514 ALPEGAWYCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDL-DVRWR 572
Query: 681 MLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSV 739
+L+ E K +LS A AIF E FDPII GRDLIP MVYGR++ Q++ GMY
Sbjct: 573 VLRDKS--SEDSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGRSVRDQDYTGMYCA 630
Query: 740 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 799
+LTV + VVSAGL RI GRE AELPLVAT R+ QG G FQALF CIERLL SL V+ VL
Sbjct: 631 VLTVGNTVVSAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVL 690
Query: 800 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
PAA++A SIWT++FGF K+SR+ LL++ + + T+F+GTS L K +
Sbjct: 691 PAADEAVSIWTQRFGFSKISRDELLEHLKGARTTVFQGTSTLHKLI 736
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 7/80 (8%)
Query: 216 KKLLSTGIL-DGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-A 273
K + +GIL +G V Y + R LDG + G C C CN VVS +FE HAG A
Sbjct: 290 KLVFLSGILPEGTDVGYYVGGK-RLLDGYIKEPGIHCHC--CN--TVVSPSQFEGHAGRA 344
Query: 274 KTRHPNNHIYLENGKPIYSI 293
R P ++IY+ NG ++ +
Sbjct: 345 ARRKPYHNIYMSNGVSLHEL 364
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 214/459 (46%), Positives = 293/459 (63%), Gaps = 25/459 (5%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F GLP+G L Y +GQ+L G K G GI C CC E+SPS FEAHAG
Sbjct: 518 RKDQGLHKLVFERGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKSEVSPSLFEAHAG 577
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AF
Sbjct: 578 WASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSANDNNDLCVICADGGNLLLCDSCPRAF 637
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLK----GGLEAPG-------------- 572
H C+ IP W C C +S ++ ++ G LE
Sbjct: 638 HIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQSAGRCIRVVK 697
Query: 573 ---AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
AE GCV+C DF + F RT+I CDQCEKE+H+GCL + DLKE+PK WFC
Sbjct: 698 NMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFC 757
Query: 630 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFE 689
DC RI++ LQ + A+T+ SSL I RK E+ ++ + D++W+++ +
Sbjct: 758 SMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQ-ERTDVYSISDLDIRWRLI-SGKVTS 815
Query: 690 EKEKSLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVV 748
+ + LLS A AIF +CFDPI+ GR+LIP MVYG+ + GQ++GG+ +LTV + VV
Sbjct: 816 PESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRMVYGKTMQGQDYGGICCAVLTVNATVV 875
Query: 749 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 808
SAGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +
Sbjct: 876 SAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPL 935
Query: 809 WTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKVQ 846
W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 936 WMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVH 974
Score = 76.3 bits (186), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 59/108 (54%), Gaps = 2/108 (1%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V++L TGILDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRRRPETVRELFETGILDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVV 309
+S +FE HA + R + +I ENGK + ++ + PL LE +
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATI 360
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 250/666 (37%), Positives = 353/666 (53%), Gaps = 56/666 (8%)
Query: 209 NEYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
++P NVK+LL TG+L+G V YI S++ + G++ G C C CN S+ +S + F
Sbjct: 545 TKHPGNVKELLQTGLLEGMPVMYIIPNSKKAVVKGVITGCNIRCFCIKCNGSRALSTYFF 604
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKA 327
E HAG+ +HP HIYL NG + +++ + L LEE +
Sbjct: 605 ELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCLNC 664
Query: 328 SHHLRKGLVEHDEKH---NMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
HL EH H + K P P S +
Sbjct: 665 GGHLPSSETEHFLCHCCLDSKQPQDPPS----------------------PSYSCSKSDS 702
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
M K + +K + G +D LH+L+F L DG + Y V GQR G
Sbjct: 703 SMTPSFKSSSVKISSIKKAGSSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDG 760
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT 504
+ I C+ C++ +SPS FEAHAG +RR+PY +I+TSNG++LH++++ ++ +
Sbjct: 761 YIKDQRIYCNHCSRVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELSMKISKDMELSE 820
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------------- 550
+DD+C CG G ++ C CP +FH AC+ P S W C NC
Sbjct: 821 RETDDLCRECGLGGDIFPCKMCPRSFHPACVRLSEFP-SEWFCDNCSNLVQKEKALAANK 879
Query: 551 ------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEK 604
RQ S+ + + + ++GGC +C+ DF+ A FD+RTVI CDQCEK
Sbjct: 880 NAKAAGRQAGVDSIEQIMKRAIRIVPICDDLGGCALCKKKDFNNAVFDERTVILCDQCEK 939
Query: 605 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 664
E+HVGCLR DLKE+P +WFCC C+ I + L +S+ AQ + S L I +KH
Sbjct: 940 EYHVGCLRTQWQVDLKELPDGEWFCCSSCSEIRSCLDKMISDGAQPLSGSDLEIIRKKHE 999
Query: 665 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 723
+G+ D + D++WQ+L E+ LLSSA I + FDPII A GRDLIP MV
Sbjct: 1000 SRGLSMDTDI-DIRWQLLAGRSATEDGSL-LLSSAVPIIHQSFDPIIEANTGRDLIPEMV 1057
Query: 724 YGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQ 779
GR + GQ++ GMY ++T+ S VVSA LLRI G +VAELPLVAT + QG G FQ
Sbjct: 1058 NGRRPKEGMPGQDYSGMYCAVITLGSTVVSAALLRIMGGDVAELPLVATSMDLQGLGYFQ 1117
Query: 780 ALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTS 839
LFSC+ER+L SL +++ +LPAA++AE+IW KKFGF ++ +E+L Y LT+F GTS
Sbjct: 1118 VLFSCMERMLISLKIKHFMLPAAQEAEAIWMKKFGFSRIPQEQLEAYLNGAHLTVFHGTS 1177
Query: 840 MLEKKV 845
L K V
Sbjct: 1178 NLYKAV 1183
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 424 bits (1090), Expect = e-115, Method: Compositional matrix adjust.
Identities = 268/705 (38%), Positives = 372/705 (52%), Gaps = 110/705 (15%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLC 256
ME+KMSKK+ P N+K LL+TG+L+G VKYI +R L G++ G LC C C
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAVLRGVIKRVGILCSCSSC 203
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
VVS + FE HAG+ +HP+++I+LENG ++ I++ A L +L+ ++ G +
Sbjct: 204 KGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGPA 263
Query: 317 FNEGSFR--VWKASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEE-SISPTSCSF 372
+ +FR K+S LR G K + S S S +S + +PTS +
Sbjct: 264 PKKRTFRCQTCKSSFATLRTG------KFALLCDSCLESKGSQNSTRTSKIGRNPTSSA- 316
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
+ ++ + RG K F K T+ G ++D LH+L F+ LP+G +
Sbjct: 317 --RRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKGLHKLAFMSGVLPEGTDV 374
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
Y V G+RL G + GI C CCN +SPSQFEAHAG AARR+PY +IY SNG++LH++
Sbjct: 375 GYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHEL 434
Query: 493 AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-- 550
++SL+ G+ + SDD+C +C DG LLLC+ CP AFH C+ IP W C C
Sbjct: 435 SVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCEN 494
Query: 551 RQGHSSSM----------------------SRSVDLKGGLEAPGAEVGGCVIC------- 581
RQ SS+ +RS+ + P GGC +C
Sbjct: 495 RQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIA---TTPVTGFGGCALCSMSGFMD 551
Query: 582 ------RS--------------HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
RS HDFS F RTV+ CDQ
Sbjct: 552 KQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------------------- 592
Query: 622 IPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQM 681
+P+ W+C DC RI L+D +S A+ I + + I RK+ +K + DG + DV+W++
Sbjct: 593 LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDL-DVRWRV 651
Query: 682 LKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVYGRNISGQEFGGMYSVI 740
LK + K +LS A AIF E FDPII GRDLIP MVYG
Sbjct: 652 LKDKSSADSKL--VLSKAVAIFHESFDPIIQIATGRDLIPAMVYG--------------- 694
Query: 741 LTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLP 800
+ VVSAGL R+ G E+AELPLVAT R+ QG G FQALF CIERLL SL V++ VLP
Sbjct: 695 ----NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLP 750
Query: 801 AAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
AA++AESIWT++FGF K++++ L +Y + + T+F+GTS L K V
Sbjct: 751 AADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLV 795
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 421 bits (1082), Expect = e-114, Method: Compositional matrix adjust.
Identities = 257/673 (38%), Positives = 364/673 (54%), Gaps = 94/673 (13%)
Query: 211 YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAH 265
+PT +K+LL TGIL+ V+YI R ++ L G++ G G LC C C + VV+ +
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
FE HAG+ + P +IYLENG + S++ A L L+E ++ GSS + +F
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTFCF- 330
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEESISPTSCSFVQDTGREQTHMK 384
+ K ++ S++ C S + L+ES S Q TG+++ +
Sbjct: 331 -------------NCKGSISEVGTSDSLVLCESCVGLKES----HASPAQPTGQQKQRLC 373
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQ----- 439
HQ+ GG D LH+L F N LP+G ++Y V+G+
Sbjct: 374 PSA-----------AHQEVRIMGG----DVGLHKLAFGENDLPEGSEVSYYVRGEVGTMR 418
Query: 440 --RLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLA 497
RL G K+G I+CDCCN E+SPSQFEAH+G A+RR+PY HIYTSNG++LH++++SL
Sbjct: 419 SKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLL 478
Query: 498 MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC------- 550
G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C C
Sbjct: 479 RGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKG 538
Query: 551 ----RQGHSSSMSRSVDLKGGLE--------------APGAEVGGCVICRSHDFSAATFD 592
R ++ + R GG++ + E GGC +CR H+FS + F
Sbjct: 539 KFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKSQTDEAGGCALCRRHEFSTSGFG 594
Query: 593 DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 652
TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+ V R + +P
Sbjct: 595 PHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKGKWFCCRDCKDINSSLRKIVVRREEELP 654
Query: 653 ASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA 712
L I +++ KG + G D++W++L + SLLS A ++F E F+PI
Sbjct: 655 DDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRXASATEAGSLLSQALSLFHEQFNPIAD 713
Query: 713 ECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 772
GRDL+ MV+ + EFGGMY ILTV VVSA R+ G+EVAELPLVAT +
Sbjct: 714 AEGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDC 773
Query: 773 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 832
QG+ V +LVLPAAE AES+W KF F KM +E L RDFQ+
Sbjct: 774 QGQ------------------VNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQM 815
Query: 833 TIFKGTSMLEKKV 845
F+GTSML+K V
Sbjct: 816 MTFQGTSMLQKPV 828
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/458 (46%), Positives = 290/458 (63%), Gaps = 25/458 (5%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F GLP+G L Y +GQ+L G K G GI C CC E+SPS FEAHAG
Sbjct: 518 RKDQGLHKLVFDRGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKCEVSPSLFEAHAG 577
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AF
Sbjct: 578 WASRRKPYFYIYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAF 637
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLK----GGLEAPG-------------- 572
H C+ IP W C C +S ++ ++ G LE
Sbjct: 638 HIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVK 697
Query: 573 ---AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
AE GCV+C DF + F RT+I CDQCEKE+H+GCL + DLKE+PK WFC
Sbjct: 698 NMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFC 757
Query: 630 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFE 689
DC RI++ LQ + A+ + SSL I K E+ ++ + D++W+++ +
Sbjct: 758 SMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQ-ERNDVYSISDLDIRWRLI-SGKVTS 815
Query: 690 EKEKSLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVV 748
+ + LLS A AIF +CFDPI+ G +LIP MVYG+ + GQ++GG+ +LTV + VV
Sbjct: 816 PESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVV 875
Query: 749 SAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESI 808
SAGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +
Sbjct: 876 SAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPL 935
Query: 809 WTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 936 WMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPV 973
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 2/121 (1%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V+ L TG+LDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+S +FE HA + R + +I ENGK + ++ + PL LE + + E
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 322 F 322
F
Sbjct: 373 F 373
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 407 bits (1047), Expect = e-110, Method: Compositional matrix adjust.
Identities = 241/632 (38%), Positives = 345/632 (54%), Gaps = 65/632 (10%)
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW- 325
FE HA + + P +I LE+G + ++ K PL LEE ++ V G + S +
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSSLCLSC 61
Query: 326 ---------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSC------ 370
S + K +E E PS + ++ SS S+ P S
Sbjct: 62 QGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPKSILRRSKS 118
Query: 371 SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLF 421
S Q REQ K +L E K + H K T ++D LH+L+F
Sbjct: 119 SPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKDLRLHKLVF 172
Query: 422 LPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHI 481
+ LPDG + Y V G+++ G K+G GI C CCNK +SPS FEAHAG A+RR+P++HI
Sbjct: 173 EDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHI 232
Query: 482 YTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
YT+NG++LH+++++L+M QR + +DD+C +C DG L+ C+ CP ++H C +P
Sbjct: 233 YTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLP 292
Query: 542 ESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG--------AEVGG---------------- 577
W C C + + VD A G AE+
Sbjct: 293 SERWSCKYCV--NMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSV 350
Query: 578 CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 637
CV+CR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC C I+
Sbjct: 351 CVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEIN 410
Query: 638 AALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEKEKSL 695
L + + + + + L+ + +K E+ T D++W++L + K L
Sbjct: 411 TTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKIL 470
Query: 696 LSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLL 753
L+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V V+VS G+
Sbjct: 471 LAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIF 529
Query: 754 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 813
R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT KF
Sbjct: 530 RVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKF 589
Query: 814 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
GF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 590 GFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 621
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 225 DGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNNHIY 283
DG V Y + E+ L G G G C C CN KVVS FE HAG A R P HIY
Sbjct: 179 DGTEVGYF-VAGEKMLVGYKKGFGIHCSC--CN--KVVSPSTFEAHAGCASRRKPFQHIY 233
Query: 284 LENGKPIYSI 293
NG ++ +
Sbjct: 234 TTNGVSLHEL 243
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 244/695 (35%), Positives = 363/695 (52%), Gaps = 96/695 (13%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNF 258
KK + N +P +K++ + GIL+G V Y+ ++ R+ L G++ G G LC C C
Sbjct: 360 KKFLRN-FPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKG 418
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+VVS +E HA + + P +I LE+G + ++ K P LEE ++ V G +
Sbjct: 419 IQVVSTAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVGPNLK 478
Query: 319 EGSFRVW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPT 368
+ S + S + K +E E PS + ++ SS S+ P
Sbjct: 479 KSSLCLNCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSS---RPSVDPK 535
Query: 369 SCSFVQDTGREQTHMKEMLEER--KRGV-------KRPFMHQKRTAEGGTKKRDNDLHRL 419
S + Q++ +E + + GV + K ++G ++D LH+L
Sbjct: 536 SILSRSKSSPRQSNRQEQPTRKSTEPGVVPGTILSESKSSSIKSNSQGKLTRKDVRLHKL 595
Query: 420 LFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYR 479
+F + LPDG + Y V G E+SPS FEAHAG A+RR+P++
Sbjct: 596 VFEDDILPDGTEVGYFVAG--------------------EVSPSSFEAHAGCASRRKPFQ 635
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
HIYT+NG++LH+++++L+M QR + +DD+C +C DG C
Sbjct: 636 HIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG---------------VCASLSS 680
Query: 540 IPESGWRCPNC-------------RQGHSSSMSRSVDLKG-------------GLEAPGA 573
+P W C C ++ + VD G E P
Sbjct: 681 LPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPSV 740
Query: 574 EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDC 633
CV+CR H F F+ RTVI CDQCEKEFHVGCL+++ + DLKE+P++KWFC DC
Sbjct: 741 ----CVLCRGHSFCRLGFNSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDC 796
Query: 634 NRIHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEK 691
+I+ L + + + + + L+ I K E+ D T D++W++L ++
Sbjct: 797 EKINTTLGNLIIRGEEKLTNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGKLTSSDE 856
Query: 692 EKSLLSSATAIFRECFDPIIAECGR-DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 750
K LL+ A +I E FDPI R DLIP MVYGR GQ+F GMY +L V V+VS
Sbjct: 857 TKILLAKAVSILHERFDPISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSV 916
Query: 751 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 810
G+ R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT
Sbjct: 917 GIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWT 976
Query: 811 KKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
KFGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 977 DKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 1011
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 260/705 (36%), Positives = 364/705 (51%), Gaps = 124/705 (17%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLC 256
ME+KMSKK+ P N+K LL+TG+L+G VKYI +R L G++ G LC C C
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAVLRGVIKRVGILCSCSSC 202
Query: 257 NFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSS 316
VVS + FE HAG+ +HP+++I+LENG ++ I++ A L +L+ ++ G +
Sbjct: 203 KGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGPA 262
Query: 317 FNEGSFR--VWKASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEE-SISPTSCSF 372
+ +FR K+S LR G K + S S S +S + +PTS +
Sbjct: 263 PKKRTFRCQTCKSSFATLRTG------KFALLCDSCLESKGSQNSTRTSKIGRNPTSSA- 315
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
+ ++ + RG K F K T+ G ++D LH+L F+ LP+G +
Sbjct: 316 --RRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKGLHKLAFMSGVLPEGTDV 373
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
Y V G+ +SPSQFEAHAG AARR+PY +IY SNG++LH++
Sbjct: 374 GYYVGGK--------------------VSPSQFEAHAGRAARRKPYHNIYMSNGVSLHEL 413
Query: 493 AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-- 550
++SL+ G+ + SDD+C +C DG LLLC+ CP AFH C+ IP W C C
Sbjct: 414 SVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCEN 473
Query: 551 RQGHSSSMSRS--------VDLKGGLE-----------APGAEVGGCVIC---------- 581
RQ SS++ + +D +E P GGC +C
Sbjct: 474 RQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCSMSGFMDKQS 533
Query: 582 ---RS--------------HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
RS HDFS F RTV+ CDQ +P+
Sbjct: 534 VLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA-------------------LPE 574
Query: 625 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKK 684
W+C DC RI L+D +S A+ I + + I RK+ +K + DG + DV+W++LK
Sbjct: 575 GAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDL-DVRWRVLKD 633
Query: 685 AQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVYGRNISGQEFGGMYSVILTV 743
+ K +LS A AIF E FDPII GRDLIP MVYG
Sbjct: 634 KSSADSKL--VLSKAVAIFHESFDPIIQIATGRDLIPAMVYG------------------ 673
Query: 744 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 803
+ VVSAGL R+ G E+AELPLVAT R+ QG G FQALF CIERLL SL V++ VLPAA+
Sbjct: 674 -NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAAD 732
Query: 804 KAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
+AESIWT++FGF K++++ L +Y + + T+F+GTS L K V L
Sbjct: 733 EAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLVPKL 777
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 208/461 (45%), Positives = 281/461 (60%), Gaps = 48/461 (10%)
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGM 471
RD +H+++F+ L +G + Y V G+ +SPSQFEAHAG
Sbjct: 443 RDKGMHKVVFMSGVLQEGTDVGYYVGGK--------------------VSPSQFEAHAGR 482
Query: 472 AARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFH 531
AARR+PY +IY SNG++LH+++ISL GQ+ + SDD+C +C DG LLLC+ CP AFH
Sbjct: 483 AARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTCPRAFH 542
Query: 532 AACLDPLLIPESGWRCPNC--RQGHSSSMSRS--------VDLKGGLE-----------A 570
C+ P+ W C C RQ S ++ + V+ LE
Sbjct: 543 RECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATT 602
Query: 571 PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCC 630
P GGC +C+ HDFS F RTV+ CDQC +E+HVGCL+++ + DL +P+ W+C
Sbjct: 603 PETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCS 662
Query: 631 DDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEE 690
DC RI+ LQD +++ + +P L I +K KG D + DV+W++LK +
Sbjct: 663 TDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADL-DVRWRVLKDKS--SD 719
Query: 691 KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVS 749
K +LS A AIF E FDPII GRDLIP MVYGR+ Q++ GMY +LTV + VVS
Sbjct: 720 DSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNTVVS 779
Query: 750 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 809
AGL RI G E+AELPLVAT R+ QG G FQALFSCIERLL SL V++ VLPAAE+AESIW
Sbjct: 780 AGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEEAESIW 839
Query: 810 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK---KVQC 847
T++FGF K+S++ L +Y + + T+F+GTS L K KV+C
Sbjct: 840 TERFGFTKISQDELREYLKGGRTTVFQGTSNLHKLVAKVEC 880
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 41/64 (64%)
Query: 260 KVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNE 319
+VVS + FE HAG+ +HP+++I+LENG ++ +++ L +LE V+K G + +
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 320 GSFR 323
+FR
Sbjct: 406 RTFR 409
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 241/686 (35%), Positives = 358/686 (52%), Gaps = 58/686 (8%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGCPLCNFSKV 261
K ++ P V+ L TG+LDG V Y+ T + + L GI+ GG LC C C+++ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 262 VSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGS 321
+S +FE HA + R + +I ENGK + ++ + PL LE + + E
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 322 FRVWK------ASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISP----TSCS 371
F + SH R E + + SL +S + AL +++ ++ +
Sbjct: 373 FTCKRCKEGFCVSHARRPS--ESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSAST 430
Query: 372 FVQDTGREQTHMKEMLEERKRGVKRPFMHQK--RTAEGGTKKRDNDLHRLLFL------- 422
V + ++ K+ML +H AE + L L+ L
Sbjct: 431 NVSSQKKCRSKFKKMLVF--------LLHNDVLMLAEPTLMIKSLILLHLVSLYVLKVDP 482
Query: 423 ----PNGLPDGERLTYIV-------KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGM 471
P G + L +V +G L + + I D E + E +
Sbjct: 483 TFCDPQGFEKDQGLHKLVFDRGGLPEGTELGYYARGQTYITVDRNCSEATKWALEYIVTV 542
Query: 472 AARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFH 531
A+ Y IYTSNG++LH+ A + + G++ + ++D+C +C DG NLLLC+ CP AFH
Sbjct: 543 ASASYFY--IYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFH 600
Query: 532 AACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLK----GGLEA--PGAEVGGCVICRSHD 585
C+ IP W C C +S ++ ++ G LE P ++ G I +
Sbjct: 601 IECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKN 660
Query: 586 FSAAT----FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 641
A T F RT+I CDQCEKE+H+GCL + DLKE+PK WFC DC RI++ LQ
Sbjct: 661 MEAETNGSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQ 720
Query: 642 DFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATA 701
+ A+ + SSL I K E+ ++ + D++W+++ + + + LLS A A
Sbjct: 721 KLLLGGAEKLSDSSLGIIQTKQ-ERNDVYSISDLDIRWRLI-SGKVTSPESRMLLSQALA 778
Query: 702 IFRECFDPIIAEC-GRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 760
IF +CFDPI+ G +LIP MVYG+ + GQ++GG+ +LTV + VVSAGLLR+FGREV
Sbjct: 779 IFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREV 838
Query: 761 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 820
AELPLVAT + KG FQ LFSCIE+LL SLNVE++V+PAAE+AE +W KFGFRK++
Sbjct: 839 AELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAP 898
Query: 821 ERLLKYQRD-FQLTIFKGTSMLEKKV 845
E+L KY + +Q+ FKG SML+K V
Sbjct: 899 EQLSKYIKICYQMVRFKGASMLQKPV 924
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 368 bits (945), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 243/695 (34%), Positives = 347/695 (49%), Gaps = 125/695 (17%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
KK + N +P +K + GIL+G V Y+ + KVVS
Sbjct: 371 KKFLRN-FPAKLKDIFDCGILEGLIVYYVRGA------------------------KVVS 405
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFR 323
FE HA + + P +I LE+G + ++ K PL LEE ++ V G + S
Sbjct: 406 PAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSSLC 465
Query: 324 VW----------KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSC--- 370
+ S + K +E E PS + ++ SS S+ P S
Sbjct: 466 LSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSS---RPSVDPKSILRR 522
Query: 371 ---SFVQDTGREQTHMKE---------MLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHR 418
S Q REQ K +L E K + H K T ++D LH+
Sbjct: 523 SKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLT------RKDLRLHK 576
Query: 419 LLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPY 478
L+F + LPDG + Y V G E+SPS FEAHAG A+RR+P+
Sbjct: 577 LVFEDDILPDGTEVGYFVAG--------------------EVSPSTFEAHAGCASRRKPF 616
Query: 479 RHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL 538
+HIYT+NG++LH+++++L+M QR + +DD+C +C DG +C P
Sbjct: 617 QHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG----VCASLPS---------- 662
Query: 539 LIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG--------AEVGG------------- 577
+P W C C + + VD A G AE+
Sbjct: 663 -LPSERWSCKYCV--NMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTEL 719
Query: 578 ---CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 634
CV+CR H F F+ RTVI CDQCEKEFHVGCL++ + DLKE+P++KWFC C
Sbjct: 720 PSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCE 779
Query: 635 RIHAALQDFVSNRAQTIPASSLSTINRKHI--EKGILFDGTMNDVQWQMLKKAQCFEEKE 692
I+ L + + + + + L+ + +K E+ T D++W++L +
Sbjct: 780 EINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDT 839
Query: 693 KSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 750
K LL+ A +I E FDPI +E G DLIP MVYGR Q+F GMY +L V V+VS
Sbjct: 840 KILLAKALSILHERFDPI-SESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSV 898
Query: 751 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 810
G+ R+FG E+AELPLVAT ++ QG+G FQ LF+CIERLL LNV+++VLPAA++A+SIWT
Sbjct: 899 GIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWT 958
Query: 811 KKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
KFGF KM+ E + +Y++D+ + IF GTSML K V
Sbjct: 959 DKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 993
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 173/412 (41%), Positives = 253/412 (61%), Gaps = 26/412 (6%)
Query: 444 GCKQGNGIVCDC--CN-KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
G G I C C CN ++SPS FEAHAG RR+PY +I+TSNG++LH++++ ++
Sbjct: 331 GVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDM 390
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQ--GHSSSM 558
+ + +DD+C CG G ++ C CP +FH AC+ +P S W C NC ++
Sbjct: 391 QLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKAL 449
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRS-------------HDFSAATFDDRTVIYCDQCEKE 605
+ + + K G + ++ R+ DF+ + FD+RTVI CDQCEKE
Sbjct: 450 AENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQKDFNNSVFDERTVILCDQCEKE 509
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
+HVGCL+ DLKE+P+ +WFCC+ C+ I ++L +S+ A + S + I +KH
Sbjct: 510 YHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEM 569
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY 724
KG+ D T D++W++L + E+ + LLS+A I + FDPII + GRDLIP MV
Sbjct: 570 KGLSMD-TNTDLRWRLLAGRKASEDGDL-LLSAAVPIIHQSFDPIIEVQSGRDLIPEMVN 627
Query: 725 GRN----ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
GR + GQ++ GMY +LT+ + VVSA LLR+ G EVAELPLVAT ++ QG G FQA
Sbjct: 628 GRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQA 687
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQL 832
LFSCIER+L SL +++ +LPAA++AE IW KFGF K+ +E+LL+ D +
Sbjct: 688 LFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQLLRKNVDLNV 739
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 315 bits (806), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 165/389 (42%), Positives = 234/389 (60%), Gaps = 28/389 (7%)
Query: 487 MTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
++LH++++S++ + + +DD+C CG G ++ C CP +FH AC+ ++P W
Sbjct: 2 VSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWF 61
Query: 547 CPNC--------------------RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDF 586
C NC RQ S+ + + + ++GGC +C+ DF
Sbjct: 62 CDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICDDLGGCALCKKKDF 121
Query: 587 SAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSN 646
+ A FD+RTVI CDQCEKE+HVGCLR DLKE+P+ +WFCCD C+ I ++L +S
Sbjct: 122 NNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISG 181
Query: 647 RAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFREC 706
A + S + I +KH KG++ D +++WQ++ E+ SLLSSA + +
Sbjct: 182 GAHPLSESDVDIIRKKHESKGLVMDAN-TEIRWQLVAGRSATEDG-NSLLSSAVPVIHQS 239
Query: 707 FDPII-AECGRDLIPVMVYGR----NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
FDPII A GRDLIP MV+GR + GQ++ GMY +LTV S VVSA LLR+ G +VA
Sbjct: 240 FDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVA 299
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
ELPLVAT + QG G FQ LFSCIERLL SL V++ +LPAA +AE+IW KKFGF K+ ++
Sbjct: 300 ELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQD 359
Query: 822 RLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
++ Y LT+F GT L K + LPE
Sbjct: 360 QMEAYLNGGHLTVFHGTLNLYKAIP-LPE 387
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 308 bits (789), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 189/540 (35%), Positives = 287/540 (53%), Gaps = 67/540 (12%)
Query: 200 IKM-SKKVVPNE---YPTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYL 250
+KM S +VP+ +PT +K+LL TGIL+ V+YI R ++ L G++ G G L
Sbjct: 259 VKMGSSALVPSTLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGIL 318
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
C C C + VV+ + FE HAG+ + P +IYLENG + S++ A L L+E ++
Sbjct: 319 CYCDTCKGTNVVTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIR 378
Query: 311 KVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSS-LALEES-ISPT 368
GSS + +F + K ++ S++ C S + L+ES SP
Sbjct: 379 VAIGSSIKKSTFCF--------------NCKGSISEVGTSDSLVLCESCVGLKESHASPA 424
Query: 369 SCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPD 428
+ D + T + + + R K+D LH+L F N LP+
Sbjct: 425 QPTGTSDRSTKTTTVSKCSSSGSKNYGR------------VTKKDVGLHKLAFGENDLPE 472
Query: 429 GERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT 488
G ++Y V+G+RL G K+G I+C CCN E+SPSQFEAH+G A+RR+PY HIYTSNG++
Sbjct: 473 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 532
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
LH++++SL G+ + +D++C +C DG LL C+GCP FH C+ IP+ W C
Sbjct: 533 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 592
Query: 549 NC-----------RQGHSSSMSRSVDLKGGLEA--------------PGAEVGGCVICRS 583
C R ++ + R GG++ E GGC +CR
Sbjct: 593 FCLNTLQKGKFVERNANAVAAGR----MGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRR 648
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
H+FS + F TV+ CDQCEKEFHVGCL+ + + DLK +PK KWFCC DC I+++L+
Sbjct: 649 HEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKI 708
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 703
V + + +P L I +++ KG + G D++W++L + + SLLS A ++F
Sbjct: 709 VVRQEEELPDDVLRIIKKRYGRKGSVCSGN-PDIKWRLLHGRRASATEAGSLLSQALSLF 767
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 303 bits (777), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 225/709 (31%), Positives = 343/709 (48%), Gaps = 93/709 (13%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRER----------------------------QLDGI 243
P N K+L++TG+L+G V+ +R L GI
Sbjct: 514 PRNAKELMATGLLEGRYVRCSCRGEQRINSEDCLLDEDEEELRVLSSSDFPDIPLPLTGI 573
Query: 244 VNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP-- 301
+ G +C C +C ++VVS FE H+G+ + HP+++IYLENGK + I+ + +
Sbjct: 574 LQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQESADC 633
Query: 302 ----LGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCS 357
LG L+ + ++ G EG+ + G E + + K P CS
Sbjct: 634 GGDILGALKHAIGEIQGIPKKEGACG--------KCGKREGGDFVSCKEPK-------CS 678
Query: 358 SLALEESI---SPTSCS-FVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGT---K 410
++ E + SP F + Q M + + + KR R E
Sbjct: 679 AVYHAECVGLPSPHRVDWFCAKCEKAQVKMPKTVLKMKRPPAVTDREDTRLKEKELTVRS 738
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
RD LH+ LFLP GL DG L Y + Q + G KQG GI C CCN+EIS S FE HAG
Sbjct: 739 ARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEISCSAFEQHAG 798
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLA----MGQR-RTTGGSDDMCHVCGDGENLLLCNG 525
+RR PY I ++G +L D+ LA +G R + D+ G E +L +
Sbjct: 799 CESRRNPYGSILLADGRSLKDMCKELAYQSKLGDRAHQVARTGDVKSSSGSEEQGVLASS 858
Query: 526 --CPLAFHAACLDPLLIPESGWRCPNCR------QGHSSSMSRSVDLKGGL--------- 568
C + E+ P C+ +G + + VD +
Sbjct: 859 QRCESTWCINFGTRFSCQEADSGHPLCKICQKNVEGAHKTSKKRVDATANIPATDDTGRN 918
Query: 569 ----EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+AP + GC IC+ FD T++ CDQC +E+HVGCLR++G+ D E+P+
Sbjct: 919 VRLFQAPDSS-SGCAICKKWTLKKCGFD-MTMLVCDQCGREYHVGCLRESGILD--ELPE 974
Query: 625 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGI--LFDGTMNDVQWQ 680
+W+C +C I L V+N + + + ++ + +R+H ++GI + + + WQ
Sbjct: 975 AEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESRQH-QQGIVEMAESSSPVFGWQ 1033
Query: 681 MLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSV 739
+L A ++L + A IF EC DPI A G+++IP+MVY R +F G+Y V
Sbjct: 1034 ILHGAGENPVNGRTL-AQAVEIFTECSDPIKDAPSGQNMIPIMVYSRRFKDYDFDGIYCV 1092
Query: 740 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 799
+LT+ VVS LL+IFGREVAE+PL+AT ++Q +G +AL + IERLL LNVE LVL
Sbjct: 1093 VLTLNEKVVSTALLQIFGREVAEVPLIATSVDHQDQGFCKALMTTIERLLGVLNVERLVL 1152
Query: 800 PAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
PA++ AE +W +FGF +M +L + L +F GT+ML K + +
Sbjct: 1153 PASKNAEFVWVNRFGFSRMEDAQLKHIRSMMGLLVFTGTTMLVKHIDAV 1201
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 298 bits (763), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 180/459 (39%), Positives = 251/459 (54%), Gaps = 48/459 (10%)
Query: 412 RDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG---CKQGNGIVCDCC------------ 456
RD LH+L+F N L DG + Y V ++ G KQ +GI+CDCC
Sbjct: 86 RDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQ-SGILCDCCKEVILEFFFCQI 144
Query: 457 ---NKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHV 513
++++SPS+FEAHAG A+RR+PY HI T++G++LH +AI+ R + SD+ C
Sbjct: 145 KKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAIN----HRISISNSDEHCSK 200
Query: 514 CGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG--------HSSSM------- 558
C NLL C+GC AFH C+ P+ W C CR H ++
Sbjct: 201 CKQRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKII 260
Query: 559 ----SRSVDLKGGLEAPGAEV--GGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
S + L EV C +C F+ F TV+ CDQCEK++HVGCL+
Sbjct: 261 ESDPSEQIAKICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLK 320
Query: 613 KNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
+ + +LK++PK WFC DC IH L++F++ + S LS I K +KG+ +
Sbjct: 321 DHNMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEF 380
Query: 673 TMNDVQWQMLKKAQCFEEK-EKSLLSSATAIFRECFDPIIAECGR-DLIPVMVYGRNISG 730
+ D++W++ + + SLLS IF E FD I+ + DLIP MV GR I
Sbjct: 381 GL-DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKD 439
Query: 731 QE-FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
+ FGGMY +L V VVVSAG+ R+FG+EVAEL L+AT EYQ +G F+ L SCIE +L
Sbjct: 440 KYYFGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIENVL 499
Query: 790 CSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 828
L VE LVLPAA +AES+W KFGF + ++ +Y R
Sbjct: 500 KELKVERLVLPAAHEAESMWIDKFGFTEPNQGLGRRYYR 538
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 144/263 (54%), Positives = 189/263 (71%), Gaps = 3/263 (1%)
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
HDFS + F DRTVI CDQCE+E+HVGCL+++ + DLKE+P+ +WFC DC RIH+ALQ
Sbjct: 2 HDFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTL 61
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 703
+ AQ +P + I +K KG G D++WQ+L + + + + LLS A AIF
Sbjct: 62 LLRGAQPLPLLDVDVIRKKCDIKGFNI-GANTDIRWQLLS-GKTADAESRLLLSKAVAIF 119
Query: 704 RECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAE 762
E FDPI+ A GRDLIP MVYGR + Q++GG+Y +LTV S VVSAG+LR+ G E+AE
Sbjct: 120 HESFDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAE 179
Query: 763 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRER 822
LPLVAT RE+QG+G FQ+LFSCIERLL +L V++ VLPAA++AESIWTKKFGF K++ +
Sbjct: 180 LPLVATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDE 239
Query: 823 LLKYQRDFQLTIFKGTSMLEKKV 845
L KY + T+F+GTS L K V
Sbjct: 240 LHKYLNGARTTVFQGTSTLHKPV 262
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 293 bits (749), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 241/755 (31%), Positives = 356/755 (47%), Gaps = 84/755 (11%)
Query: 128 NSGDVSKSHV---VLEIPKHVSSSSGIRK---ITFKFSKRKEDYVAP-LAYEEGRNYTLY 180
NSG +S S V ++ V + G RK T + + + AP L EEG +
Sbjct: 195 NSGGLSDSVVSNGLVHDTDQVVKNGGSRKRKSKTIHYMNSRVPHAAPALESEEGSDGQKS 254
Query: 181 DDLGSSGANDGVLCARNMEIKMSKKVVPN-----EYPTNVKKLLSTGILDGACVKYISTS 235
+ R IK+ K +VP + P N K+L++T +++G V+ +
Sbjct: 255 PEERGDAIARPSQAPRAAAIKL-KSMVPVGDKLLKAPRNAKELMATRLMEGHHVR--CSC 311
Query: 236 RERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSII- 294
R QL G++ G C C C S +VS FE H+G+ + HP+++IYLENGK + I+
Sbjct: 312 RGIQLTGMLKDMGVQCDCRNCRGSVIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILS 371
Query: 295 --QELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHS 352
QE IL + + E S + +G + + + + + HS
Sbjct: 372 AGQEAADCGDNILRALKMAIGDVQGVEKSKSKCAKCGNPEEGDLIYCK--GARCSVVAHS 429
Query: 353 IISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPF----------MHQK 402
C +A + C + T + +K VKRP + +K
Sbjct: 430 --GCVEIA-NPHLGDWFCGKCEKTKKPHASVK---------VKRPISSGAEKEDSRVREK 477
Query: 403 RTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKE--- 459
RD LH+ LFLP GL +G L Y K Q G K+G GI C CCNKE
Sbjct: 478 DATVSARLSRDAHLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASS 537
Query: 460 -ISPSQFEAHAGMAARRQPYRHIYT-SNGMTLHDIAISLA----MGQRRT---TGGSDDM 510
IS +FE HAG ARR PY +I +G +L D+ L +G+++ +
Sbjct: 538 DISCFEFEQHAGCEARRNPYGNILVLVDGRSLKDVCKDLTHKNKLGEQQNCEPLARDVNC 597
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLI-PESGWRCPNCR-----------------Q 552
C+ C L CNGC A+ C + +S W C CR Q
Sbjct: 598 CYECSSSGELKTCNGCEEAWCDNCTKGEEVDSDSKWYCRMCRNDTLKVAQNGQKVSGKHQ 657
Query: 553 GHSSSMSRSVDLKG----GLEAPGAEVGGCVICRSHDFSAATF-DDRTVIYCDQCEKEFH 607
SSS++ +D +G LE EVGGC IC+ + S F D T++ CDQC +E+H
Sbjct: 658 EESSSITE-IDERGRCIRHLEGH-REVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYH 715
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
V CL+ +G+ +L E+P+ +WFC C I L V+ +++ S +S + +K
Sbjct: 716 VSCLKDSGMDNLNELPEGEWFCQKGCKVIDEILTQLVAIGPESLSHSIISELPENRQQKS 775
Query: 668 ILFDGTMN---DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 723
+ + + +WQ+L + L+ A IF EC DPI A+ G++LIP+MV
Sbjct: 776 GVIEKAESISPSFEWQILC-GKGSSPANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMV 834
Query: 724 YGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
R +F G++ V+L + VVSA LL+IFGRE AE+PLVAT +QG+G +AL +
Sbjct: 835 QSRRTKDYDFEGVFCVVLKLNGKVVSAALLQIFGREFAEVPLVATSLPHQGQGFCKALMT 894
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 818
IERLL L+VE LVLP A+ ES+W KFGF ++
Sbjct: 895 TIERLLGVLSVERLVLPTAKDTESLWVNKFGFSRV 929
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/289 (50%), Positives = 193/289 (66%), Gaps = 6/289 (2%)
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+RS+ + LE GGC +C+ HDFS F RTV+ CDQC +E+HVGCL+++ +
Sbjct: 53 FTRSIRIATTLETG---FGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMA 109
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
DL +P+ W+C DC RI+ LQD +++ + + A L I +K KG D DV
Sbjct: 110 DLTALPEGAWYCSTDCVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDV 169
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGM 736
+W++LK + K +LS A AIF E FDPII GRDLIP MVYGR+ Q++ GM
Sbjct: 170 RWRVLKDKS--SDDSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGM 227
Query: 737 YSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVEN 796
Y +LTV ++VVSAGL RI G E+AELPLVAT R+ QG G FQALFSCIERLL SL V++
Sbjct: 228 YCTVLTVNNIVVSAGLFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKH 287
Query: 797 LVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
VLPAAE+AESIWT++FGF K+S++ L +Y + + T+F+GTS L K V
Sbjct: 288 FVLPAAEEAESIWTERFGFAKISQDELREYLKGGRTTVFQGTSNLHKLV 336
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 225/723 (31%), Positives = 329/723 (45%), Gaps = 83/723 (11%)
Query: 153 KITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPN--- 209
K + R V L EEG + + G +G R K+ K +VP
Sbjct: 328 KTIHYMNSRVPHSVPTLESEEGSDGQESHEEGDTGPVRPSQAPRAAATKL-KSMVPIGDK 386
Query: 210 --EYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
+ P N K+L++T +++G V+ + R QL G++ G C C C S +VS F
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVR--CSCRGIQLTGMLKDMGVRCDCRNCKSSVIVSISAF 444
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAG-SSFNEG 320
E H+G+ + HP+++IYLENGK + I+ + A L L+ + + G +
Sbjct: 445 EAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKWKVT 504
Query: 321 SFRVWKASHH-------LRKGLVEHDEKHNMKLPSLPHSIIS-CSSLALEESISPTSCSF 372
+ W + R ++ H + P L C + + S
Sbjct: 505 CAKCWNSDEGDLIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATVKVKRSI 564
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERL 432
T ++ ++E K E RD LH+ LFLP GL DG L
Sbjct: 565 SSGTEKDDGRVRE----------------KDATESTRLNRDAHLHKALFLPGGLEDGTEL 608
Query: 433 TYIVKGQRLRFGCKQGNG---IVCDCCNKEISPSQFEAHAGMAARRQPYRHI-YTSNGMT 488
Y K Q G K+G +V EI+ +FE HAG ARR PY +I ++G +
Sbjct: 609 GYYTKSQLKLKGVKRGEAFKKVVV-----EINCYKFEQHAGCEARRNPYGNILLVADGRS 663
Query: 489 LHDIAISLA----MGQ---RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
L D+ LA +G+ R G + C+ CG L C+GC + +C L
Sbjct: 664 LKDVCKELAHKNKLGEKEKRVARAGKVNSCYECGTRGELKNCHGCVETWCNSCTKGLETD 723
Query: 542 ESG-WRCPNCRQG--------------HSSSMSRSVDLK------GGLEAPGAEVGGCVI 580
G W C CRQ H MS + LE EVGGC I
Sbjct: 724 SDGKWYCRMCRQDTLNVAQIEQKRSNKHIEGMSNIAETDERDRCVRHLEGH-REVGGCAI 782
Query: 581 CRSHDFSAATF-DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 639
C+ + S F D T++ CDQC +E+HV CL+ +G+ DL E+P+ +WFC DC I
Sbjct: 783 CKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPEGEWFCQKDCKVIDEI 842
Query: 640 LQDFVSNRAQTIPASSLSTINRKHIEKGILFD---GTMNDVQWQMLKKAQCFEEKEKSLL 696
L V+N + + S +S + ++ D + WQ+L + L
Sbjct: 843 LTQLVANGPELLTDSIISELLESRQQQTGAKDKAESSCPSFAWQIL-CGKSGNTANTQTL 901
Query: 697 SSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRI 755
+ A IF EC DPI A+ G++LIP+MV R +F G++ ++L + VVSA LL+I
Sbjct: 902 AEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVFCIVLKLNEKVVSAALLQI 961
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
FG E+AE+PLVAT +QG+G +AL + IERLL L+VE LVLP A+ ESIW KFGF
Sbjct: 962 FGGEIAEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGF 1021
Query: 816 RKM 818
++
Sbjct: 1022 SRV 1024
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/271 (50%), Positives = 179/271 (66%), Gaps = 7/271 (2%)
Query: 580 ICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAA 639
+CR DF+ A FD+RTVI CDQCEKE+HVGCL+ +LKE+P+ +WFCC C+ ++
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 640 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 699
L +S+ AQ + L I +KH+ +G+ D T D++WQ+L + E+ LLS+A
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMD-TSKDLKWQLLSGKRTTEDG-SILLSAA 129
Query: 700 TAIFRECFDPII-AECGRDLIPVMVYGRN----ISGQEFGGMYSVILTVKSVVVSAGLLR 754
IF + FDPI A GRDLIP MV GR + GQ++ GMY +LTV S VVSA LLR
Sbjct: 130 VPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLR 189
Query: 755 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 814
+ G +VAELPLVAT ++ QG G FQALFSCIER+L SL +++ VLPAA +AE IW KKFG
Sbjct: 190 VMGGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFG 249
Query: 815 FRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
F + + E L Y LTIF GTS L K V
Sbjct: 250 FSRTTPEELEAYLNGAHLTIFHGTSYLYKAV 280
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 226/755 (29%), Positives = 332/755 (43%), Gaps = 133/755 (17%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHA 271
P N K+L++T +++G V+ + R QL G++ G C C C S +VS FE H+
Sbjct: 567 PRNAKELMATRLMEGHFVR--CSCRGIQLTGMLKDMGVQCNCRNCKGSVIVSISAFEAHS 624
Query: 272 GAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAG----------- 314
G+ + HP+++IYLENGK + I+ + A L L+ + + G
Sbjct: 625 GSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKRKVTCAKC 684
Query: 315 SSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIIS-CSSLALEESISPTSCSFV 373
S EG K + R +V H + P L C + + S
Sbjct: 685 ESSQEGDLIYCKGA---RCSVVAHSRCVGIANPQLGDWFCGKCEKTKKHHAAAKVKRSI- 740
Query: 374 QDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTK-KRDNDLHRLLFLPNGLPDGERL 432
G + K L +G P + + A + RD LH+ LFLP GL DG L
Sbjct: 741 -SGGADPEDGKVRLLRDLQGYASPSASKDKAATASVRLNRDAHLHKALFLPGGLVDGTEL 799
Query: 433 TYIVKGQRLRFGCKQGNGIVCDCCNKE------------ISPSQ---------------- 464
Y K Q G K+G GI C CCN+E + PSQ
Sbjct: 800 GYYTKSQLKLKGVKRGEGICCSCCNEETRHVQGGSRGGLMFPSQQDGTYYVFSQVKEIHV 859
Query: 465 ------------------------FEAHAGMAARRQP-YRHIY-----TSN--------- 485
FE HAG ARR P Y H+ TSN
Sbjct: 860 TTFNEEMKFLSNDLDDIQQISCYEFEQHAGCEARRNPGYYHVVAKYAETSNEYDSYGNIL 919
Query: 486 ----GMTLHDIAISLAMG------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL 535
G +L D+ LA ++R G + + + E + N +A + +
Sbjct: 920 LVPDGRSLKDVCKDLAHKNKLDEKEKRAARGGESRFSL--ELEQDSVFNNSTVAI-VSLI 976
Query: 536 DPLLIPESGWRCPNCRQ--------GHSSS------MSRSVDLKGG------LEAPGAEV 575
E W C CRQ G S MS + L+ P EV
Sbjct: 977 GIEKDSEGRWYCRMCRQDSLKVAQNGQKGSEKIMEGMSNIAETNEQGRCIRHLDGP-REV 1035
Query: 576 GGCVICRSHDFSAATF-DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 634
GGC IC+ + S F D T++ CDQC +E+HV CL+ +G+ DL E+P +WFC DC
Sbjct: 1036 GGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDCK 1095
Query: 635 RIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGI-LFDGTMNDVQWQMLKKAQCFEEK 691
I L V+N + + S++S + +R+ + + + +WQ+L +
Sbjct: 1096 VIDEILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILC-GKGSSPA 1154
Query: 692 EKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSA 750
+ L+ A IF +C DPI + G++LIP+MV R +F G++ V+L + VVSA
Sbjct: 1155 DVQTLAEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVVSA 1214
Query: 751 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 810
LL+IFGRE+AE+PL+AT + G+ +AL + IERLL L+VE LVLP A+ ES+W
Sbjct: 1215 ALLQIFGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESVWI 1274
Query: 811 KKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
KFGF ++ ++L +LT F GTSM+ K +
Sbjct: 1275 NKFGFSRVQEDQLKSICTTIRLTTFTGTSMVVKAI 1309
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/276 (50%), Positives = 178/276 (64%), Gaps = 7/276 (2%)
Query: 575 VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 634
V G + R DF+ A FD+RTVI CDQCEKE+HVGCL+ +LKE+P+ +WFCC C+
Sbjct: 53 VMGLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCS 112
Query: 635 RIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKS 694
++L +S+ AQ + L I +KH +G+ D T D++WQ+L + EE
Sbjct: 113 ETRSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMD-TSKDLKWQLLSGKRATEEG-SI 170
Query: 695 LLSSATAIFRECFDPII-AECGRDLIPVMVYGRN----ISGQEFGGMYSVILTVKSVVVS 749
LLS+A IF + FDPI A GRDLIP MV GR + GQ++ GMY +LTV S VVS
Sbjct: 171 LLSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVS 230
Query: 750 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 809
A L+R+ G +VAELPLVAT ++ QG G FQALFSCIER+L SL +++ VLPAA +AE IW
Sbjct: 231 AALMRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIW 290
Query: 810 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
KKFGF ++ E L LTIF GTS L K V
Sbjct: 291 MKKFGFSRIPPEELEACLNGAHLTIFHGTSYLYKAV 326
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 271/951 (28%), Positives = 400/951 (42%), Gaps = 180/951 (18%)
Query: 31 ELLTDNVESDSFPNGKHAKEEAL--NDDIKSEVSNPVVSPKEFTSS---------LQDIT 79
E L+ NVE+ PNG +K+ A+ +D+ + + P P E + S +++
Sbjct: 332 EELSPNVEAH-VPNGNVSKKMAVPESDEAQGRNAQPAKDPGEISDSGLAQDTDRSVKNGR 390
Query: 80 SQETKLVTESCNQVLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVL 139
S++ K T + L SEE SDG E E GD VL
Sbjct: 391 SRKRKSKTIHYMNSRVPHAVPALESEEG-SDGPESPEE------------GDD-----VL 432
Query: 140 EIPKHVSSSSGIRKI-TFKFSKRKEDYVAPLAYEEGRNYTLYDDLG----SSGANDGVLC 194
P S + + I K K ++V +A G Y LG SS G++
Sbjct: 433 VRPSQASRVAATKSIWALKHCKHIIEHV--VAQPPGVVYRFLSTLGQHVISSSCLVGLII 490
Query: 195 ARNMEIKMS------KKVVPN-----EYPTNVKKLLSTGILDGACVKYISTSRERQLDGI 243
+ I++ K +VP + P N K+L++T +++G V+ + R QL G+
Sbjct: 491 STLATIRLKGFALRLKSMVPIGDKLLKAPRNAKELMATKLMEGHFVR--CSCRGMQLTGM 548
Query: 244 VNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP-- 301
+ G C C C S +VS FE H+G+ + HP+++IYLENGK + ++ + A
Sbjct: 549 LKDMGVQCNCRNCKGSMIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADC 608
Query: 302 ----LGILEEVVKKVAG-----------SSFNEGSFRVWKASHHLRKGLVEHDEKHNMKL 346
L L+ + + G EG K + R +V H
Sbjct: 609 GDNILRALKMAIGDIQGVEKSKVTCAECGGSEEGDLIYCKGA---RCSVVSHSRCVGSAN 665
Query: 347 PSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAE 406
P L C + T + K + G + + K T
Sbjct: 666 PQLGDWF----------------CGKCEKTKKRHAAAK-VKRSISAGTEDSEVRDKATTA 708
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFE 466
RD L + LFLP GL DG L Y K Q G K+G GI C CCNKEIS +FE
Sbjct: 709 SARLNRDAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYEFE 768
Query: 467 AHAGMAARRQPYRHI-YTSNGMTLHDIAISLA----MG--QRRTTGGSDDMCHVCGDGEN 519
HAG ARR PY +I ++G +L D++ LA +G ++R + C+ C +
Sbjct: 769 QHAGCEARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVCCYECSNSGE 828
Query: 520 LLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQ--------GHSSS------MSR--SV 562
L C+ C A+ C + G W C CRQ GH + MS
Sbjct: 829 LKRCHSCEEAWCDKCTKGMETDSEGRWYCRMCRQDSLKVAQNGHKGTDKIIEGMSNIAET 888
Query: 563 DLKG----GLEAPGAEVGGCVICRSHDFSAATF-DDRTVIYCDQCE--KEFHVG---CLR 612
D KG LE P EVGGC IC+ + S F D T++ CDQ + G +
Sbjct: 889 DEKGRCVRHLEGP-REVGGCAICKKWNLSKTGFVDGMTILVCDQVRSLNQMLPGTRITWK 947
Query: 613 KNGLCD-----------LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI-- 659
NG D +E+P+ +WFC DC I L V+N + + S +S +
Sbjct: 948 VNGFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANGPELLSNSIISELLE 1007
Query: 660 -NRKHIEKGILFDGTMNDVQWQML-------KKAQCFEEKEKSLLS-------------- 697
++ + + + WQ+L Q E S
Sbjct: 1008 SRQQQSSVKVKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFTSIDDINLPYLWLVVG 1067
Query: 698 ----------SATAIF---REC-----FDPII-AECGRDL----IPVMVYGRNISGQEFG 734
S+T +F + C FDP +EC D+ + ++V R +F
Sbjct: 1068 NYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNVALVVCSRRAKDHDFE 1127
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G++ V+L + VVSA LL+IFGRE+AE+PL+AT +QG+G +AL + IERLL L+V
Sbjct: 1128 GVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCKALMTTIERLLGVLSV 1187
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E LVLP A+ ESIW KFGF ++ ++L + +L F GT ML K +
Sbjct: 1188 ERLVLPTAKNTESIWINKFGFSRVPEDQLKRICTTIRLMTFTGTRMLGKAI 1238
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 259 bits (661), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 151/361 (41%), Positives = 197/361 (54%), Gaps = 63/361 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC--RQGHSSSMSRS--- 561
SDD+C +C DG LLLC+ CP AFH C+ IP W C C RQ SS++ +
Sbjct: 241 SDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESSLAYNHNA 300
Query: 562 -----VDLKGGLE-----------APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
+D +E P GGC +CR HDFS F RTV+ CDQ
Sbjct: 301 IAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQ---- 356
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
+P+ W+C DC RI L+D +S A+ I + + I RK+ +
Sbjct: 357 ---------------ALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQ 401
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY 724
K + DG + DV+W++LK K +LS A AIF E FDPII GRDLIP MVY
Sbjct: 402 KALNKDGDL-DVRWRVLKDKS--SADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVY 458
Query: 725 GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSC 784
G + VVSAGL R+ G E+AELPLVAT R+ QG G FQALF C
Sbjct: 459 G-------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGC 499
Query: 785 IERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK 844
IERLL SL V++ VLPAA++AESIWT++FGF K++++ L +Y + + T+F+GTS L K
Sbjct: 500 IERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKL 559
Query: 845 V 845
V
Sbjct: 560 V 560
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 49/84 (58%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C C VVS + FE HAG+ +HP+++I+LENG ++ I++
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 300 APLGILEEVVKKVAGSSFNEGSFR 323
A L +L+ ++ G + + +FR
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFR 158
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 163/437 (37%), Positives = 223/437 (51%), Gaps = 76/437 (17%)
Query: 411 KRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG 470
++D LH+L+F N L DG + Y +G+ +SPS+FEAHAG
Sbjct: 185 RKDRGLHKLVFQENMLEDGAAVGYFDRGK--------------------VSPSKFEAHAG 224
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
A+RR+PY +I T++G++LH+ LA +R + SD+ C C NLL C+ C +F
Sbjct: 225 RASRRKPYSYIRTADGVSLHE----LANNRRISMSDSDERCSHCEQVGNLLWCDRCQRSF 280
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMS---RSVDLKGGLEAPG------AEV------ 575
H C+ P+ C CR + + D+ G A G EV
Sbjct: 281 HLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITEVCTLSEK 340
Query: 576 ------GGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
G C +C DF+ RTV+ C QCEKEFHV CL+ + + +L E+PKDKWFC
Sbjct: 341 QKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVELPKDKWFC 400
Query: 630 CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK-KAQCF 688
DC+ IH LQ ++ R E G+ D++W++L K
Sbjct: 401 GIDCDDIHMKLQKLMA---------------RGEAELGL-------DIKWRLLNTKLNNP 438
Query: 689 EEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSV 746
+ L+S A AIF E F I + DLI M+YG I GQ F GMY +L K V
Sbjct: 439 KHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVLYFKKV 498
Query: 747 VVSAGLLRIFGREVAELPLVATCREYQGK------GCFQALFSCIERLLCSLNVENLVLP 800
+ AG+ R+ G+EVAELPLVAT +YQ + G F++LFSCIE +L L V+ LVLP
Sbjct: 499 IACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGYFRSLFSCIENMLRHLKVKTLVLP 558
Query: 801 AAEKAESIWTKKFGFRK 817
AA +AES+W KFGF K
Sbjct: 559 AAHEAESMWIDKFGFTK 575
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 236 bits (601), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 202/691 (29%), Positives = 316/691 (45%), Gaps = 112/691 (16%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 114 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 171
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 172 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 231
Query: 333 K-----------------GLVEHDEKHNMK---LPSLPHSIISCSSLALEE--------- 363
K GL + ++++ S+ +SI + + A +
Sbjct: 232 KDYLDAVASDGCLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNWSASKRRSGRRFRQG 291
Query: 364 ---------SISPTSCSFVQDT------GREQTHMKEMLEERKRGVKRPFMH---QKRTA 405
S SP F T G E+TH + L + +RP + ++
Sbjct: 292 DTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETH-RLSLSSPVKITQRPLRNCSIDSKSK 350
Query: 406 EGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQ 464
E +K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS
Sbjct: 351 E--SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSH 408
Query: 465 FEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCN 524
FE HAGM RRQPY +IYT G++LH +A+ L ++ L N
Sbjct: 409 FEEHAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPN 447
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH 584
G A ++ D + SG C + S++ V LK L+ E C C
Sbjct: 448 GFDNASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYG 502
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHA 638
+ + T+I+C+QCE+ H+ C +K L LKE + CC +C + A
Sbjct: 503 HTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRA 562
Query: 639 ALQDFVSNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQMLKKAQCFEEKEKSL 695
L++ +EK GI F + +++ W++L + K
Sbjct: 563 RLEE--------------------GLEKCVGITFLRRIRSNICWRLLSGMDASRDV-KLY 601
Query: 696 LSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLR 754
+ IF++ F E D+I MV G+N + +F GMY +LT + VVSA +L+
Sbjct: 602 MPQVIDIFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILK 660
Query: 755 IFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFG 814
+ ++AEL L+AT E + KG F L IE L + NV L P + IW++K G
Sbjct: 661 VRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLG 720
Query: 815 FRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
F +S E L +FK +++K +
Sbjct: 721 FTILSAEEKESMLESHPLVMFKNLVLVQKSL 751
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 232 bits (592), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 165/515 (32%), Positives = 233/515 (45%), Gaps = 145/515 (28%)
Query: 478 YRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENLLLCNGCPLAF 530
Y IY ++G +LHD A SL + Q G D C CGD +L LC GCP A+
Sbjct: 961 YTSIYLADGRSLHDAAQSLVVEQNVKQEGVTPAKIEHLDQCVECGDSGDLQLCTGCPNAY 1020
Query: 531 HAACLDP-----------LLIPE---------------SGWRCPNCRQ---GHSSSMSRS 561
H + P LL E + CP+C++ G + RS
Sbjct: 1021 HQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCPDCQEQRFGGTKDRRRS 1080
Query: 562 VDL---KGG-----------------LEAPGAEV-GGCVICRSHDFSAATFDDRTVIYCD 600
+ KG L+ P A V GGCV C+S DF+ F +T + CD
Sbjct: 1081 MTKRRSKGAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCVFCKSGDFAKTGFGPKTTLLCD 1140
Query: 601 QCEKEFHVGCLRKNGL--------------------------------CD---------- 618
QCE+E+HVGCL+K+GL CD
Sbjct: 1141 QCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCVATVRLRDCDRGCESQGIVE 1200
Query: 619 ------------LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI---NRKH 663
+E+P+ +WFC DC IH+ L VSN + + S +S + N+
Sbjct: 1201 MRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLRTNQAR 1260
Query: 664 IEKG-ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR------------------ 704
+E+ + + + +WQ+L + + L+ A IF
Sbjct: 1261 LERSEDATESSCSGFEWQLL-HGRGGDPSNGKALAEAVQIFSVRNLSDPGFPVRTVWDSH 1319
Query: 705 ----------ECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLL 753
ECFDPI G DLIP+MVY R++ Q+FGG+Y V+L + VVS L+
Sbjct: 1320 PCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGGIYCVVLKYDNRVVSTALI 1379
Query: 754 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 813
R+FGR++AELPL+AT +QG+G +AL IERLL L VE L LPAAE AE IW KF
Sbjct: 1380 RVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNKF 1439
Query: 814 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
GFR+M+ ++ ++ D + +F G+ MLEK++ L
Sbjct: 1440 GFRRMAEGQVKQFHSDLNMMVFTGSFMLEKEIPPL 1474
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 124/269 (46%), Gaps = 22/269 (8%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVS 263
K+V+ E P + K LL +G+LDG V+Y+ L GI+ GG LC C C +VV+
Sbjct: 582 KEVLLKEAPASAKLLLQSGLLDGHHVRYMGRGGHIMLTGIIQEGGVLCDCSSCKGVQVVN 641
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ------ELKTAPLGILEEVVKKVAG--- 314
FE+HAG+ RHP++ I+LENGK + I++ + K + +L+ + +V G
Sbjct: 642 VSAFEKHAGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNIMDVLKSAIGEVGGVKV 701
Query: 315 --SSFNEGSFRVWKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSF 372
SS + A L + D K + + P + + +
Sbjct: 702 QISSLEHPLIAIQPALKKLPQPKSLLDTKPRVPVDVKPRTPQGDTKPKMPSDTKARFTPE 761
Query: 373 VQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN---DLHRLLFLPNGLPDG 429
V+ G + L+ R +K TA R++ +LH+ LFLP GL D
Sbjct: 762 VKARGSDARASMPRLDRTPR--------EKETASPPVLSRESSGANLHKALFLPGGLEDD 813
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNK 458
+ Y VKGQ+ G K+G GI+C CC +
Sbjct: 814 TEVGYYVKGQKSLAGVKKGAGILCSCCQQ 842
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 208 bits (530), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 146/474 (30%), Positives = 227/474 (47%), Gaps = 90/474 (18%)
Query: 403 RTAEGGTKKRDNDLHRLLFLPN---GLPDGERLTYIV-KGQRLRFGCKQ-------GNGI 451
R G ++R+++ H+ LFLP+ GL DGE ++YI +G+ L G + +GI
Sbjct: 431 RVVSGARRERNSNKHKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGI 490
Query: 452 VCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMC 511
+C CCN IS SQFEAHAG +RR PY +I+T+ G++L +A + + + +
Sbjct: 491 LCACCNGVISCSQFEAHAGRGSRRAPYDNIFTAAGVSLRKLACLMPASEAESP-----IS 545
Query: 512 HVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAP 571
H LC +A A L+P L+ SG EA
Sbjct: 546 H-----RPAALCA---VADRRA-LEPELVTVSG------------------------EA- 571
Query: 572 GAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK----- 626
A GGCV+C+ DF F +RT+I CDQCE+E+H+GCL ++G L E+P+ K
Sbjct: 572 -ALHGGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLAEHGRAHLTELPEGKASLYD 630
Query: 627 ----------WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND 676
W C +C I +++ VS ++P + G +
Sbjct: 631 ILLTLHLNGEWHCSPECKGIATRMRERVS----SVP---------------VPLQG---E 668
Query: 677 VQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGG 735
WQ+L+ +L +A I E FDPI+ G DL+ MVY + + ++ G
Sbjct: 669 YSWQVLRGKDGTHATTWAL-KAAQEILTESFDPILDLVTGADLMMAMVYAQELGDWDYTG 727
Query: 736 MYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVE 795
MY+ +L + V + + R+FGR++AE+PLVAT + +G + L + E SL V+
Sbjct: 728 MYTAVLRRRGKAVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLMAAFEDYFRSLGVQ 787
Query: 796 NLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLP 849
+L LPAA+ W FGF ++ E + ++ IF GT +L+K + P
Sbjct: 788 SLCLPAAQSTVETWIHGFGFAAITPEEQAATCSELRVLIFPGTELLQKPLNPTP 841
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 63/133 (47%), Gaps = 20/133 (15%)
Query: 436 VKGQRLRFGCKQG----NG-------IVCDC--CNKEISP----SQFEAHAGMAARRQPY 478
++GQ + F + G NG I C C C + +P S+FE HAG RR P
Sbjct: 239 LRGQPVFFQSRHGDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEHAGSRERR-PG 297
Query: 479 RHIYTSN-GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
IY +N ++L + +L + R+ C +C DG +LL C+GCP A HA C
Sbjct: 298 ESIYLTNLSISLKEF-CALVNDEGRSADRHGSACGLCMDGGDLLCCDGCPTAVHAYCAGL 356
Query: 538 LLIPESGWRCPNC 550
+PE W C C
Sbjct: 357 EEVPEGDWFCDAC 369
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 38/76 (50%), Gaps = 3/76 (3%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDG-IVNGGGYLCGCPLCNFSKV--VSAHEFEQH 270
++++L G L G V + S + L+G I G C C C K VS EFE+H
Sbjct: 229 TLREVLKGGALRGQPVFFQSRHGDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEH 288
Query: 271 AGAKTRHPNNHIYLEN 286
AG++ R P IYL N
Sbjct: 289 AGSRERRPGESIYLTN 304
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 149/468 (31%), Positives = 214/468 (45%), Gaps = 74/468 (15%)
Query: 397 PFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVK--GQRLRFGCKQGNGIVCD 454
P+ H+ +T + LL N L +LTY + G RL+ G +GI C
Sbjct: 696 PYQHRAKT-----------VLSLLVDKNILLPRVKLTYKQRSDGPRLKEGAVTKDGIKCR 744
Query: 455 CCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------------ 502
CCN+ + FE HAG + R P HI+ +G L + L MG+ +
Sbjct: 745 CCNELFTLESFEVHAGCSTR-LPAAHIFLKDGRPLSQCLVEL-MGENKPKESLHVRLKTN 802
Query: 503 -TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
+ SD +C +C +G +LLC+ CP +FH AC+ PE W CP+CR
Sbjct: 803 YSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWYCPSCR---------- 852
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAAT--FDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
C IC S D+ T F ++T++YCDQCE+E+HVGC+R G L
Sbjct: 853 ----------------CSICDSSDYDPDTNKFTEKTIMYCDQCEREYHVGCMRNKGD-QL 895
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS-TINRKHIEKGILFDGTMNDVQ 678
P+ WFC C+ I LQ + T P LS TI R E N++
Sbjct: 896 TCCPEGCWFCSRGCSEIFQHLQGLIGKSIPT-PVEGLSCTILRFDRENASQHGDFYNEII 954
Query: 679 WQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISGQEFGG 735
+ K L A + ECF II RDL +V+ R + F G
Sbjct: 955 AEQYGK-----------LCIALDVLHECFVTIIEPSTRRDLSEDIVFNRESGLRRLNFRG 1003
Query: 736 MYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVE 795
Y++IL ++S G R+ G++ AELPL+ T +Y+ +G + L + +E+LL L VE
Sbjct: 1004 FYTLILQKDGELISVGTFRVCGKKFAELPLIGTRVQYRRQGMCRLLMNELEKLLSGLGVE 1063
Query: 796 NLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
LVLPA + WT FGFR MS + L+ F+GT++ +K
Sbjct: 1064 RLVLPAIPQLLETWTGSFGFRAMSFSDRFELAESSILS-FQGTTICQK 1110
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 238 RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQEL 297
R +G V G C C CN ++ + FE HAG TR P HI+L++G+P+ + EL
Sbjct: 730 RLKEGAVTKDGIKCRC--CN--ELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVEL 785
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 208/446 (46%), Gaps = 57/446 (12%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+KKRD LH L+F GLPD LTY +K G+ L G KQG GIVC+CCN+E+SPS+FE
Sbjct: 167 SKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEK 226
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY++IYTS G+TLHD+A+ L + G S+ D N L +GC
Sbjct: 227 HAGMGKRRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPN-LTSSGC- 284
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS-VDLKGGLEAPGAEVGGCVICRSHDF 586
G S+S V LK L+ + C C
Sbjct: 285 -------------------------GKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHT 319
Query: 587 SAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
D +++C+QCE+ HV C +K L LK+ + CC +C + L
Sbjct: 320 ELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRL 379
Query: 641 QDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSAT 700
+ + E+ +++ W++L A + K LS
Sbjct: 380 EGM------------------EKCEEIAFLGRIRSNICWRLLSSADASRDV-KLYLSQVI 420
Query: 701 AIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGRE 759
IF++ F + + I MVYG+N G+ +F GMY V+LT + VVSA +L++
Sbjct: 421 DIFKDAF--LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVEH 478
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
AEL L+AT E + KG F+ L IE L + NV L+ P + IW+ K GF +S
Sbjct: 479 AAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLGFTILS 538
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
+ L +FK +++K +
Sbjct: 539 ADEKKSMLESHPLVMFKNLVLVQKPL 564
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 222 GILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNN 280
G+ D + Y + E L G G G +C C CN + VS EFE+HAG K R P
Sbjct: 183 GLPDNTLLTYKLKNGEALLQGYKQGAGIVCNC--CN--QEVSPSEFEKHAGMGKRRQPYQ 238
Query: 281 HIYLENGKPIYSIIQEL 297
+IY G ++ + +L
Sbjct: 239 NIYTSQGLTLHDVALQL 255
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 208/446 (46%), Gaps = 57/446 (12%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+KKRD LH L+F GLPD LTY +K G+ L G KQG GIVC+CCN+E+SPS+FE
Sbjct: 136 SKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEK 195
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY++IYTS G+TLHD+A+ L + G S+ D N L +GC
Sbjct: 196 HAGMGKRRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPN-LTSSGC- 253
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS-VDLKGGLEAPGAEVGGCVICRSHDF 586
G S+S V LK L+ + C C
Sbjct: 254 -------------------------GKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHT 288
Query: 587 SAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
D +++C+QCE+ HV C +K L LK+ + CC +C + L
Sbjct: 289 ELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRL 348
Query: 641 QDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSAT 700
+ + E+ +++ W++L A + K LS
Sbjct: 349 EGM------------------EKCEEIAFLGRIRSNICWRLLSSADASRDV-KLYLSQVI 389
Query: 701 AIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGRE 759
IF++ F + + I MVYG+N G+ +F GMY V+LT + VVSA +L++
Sbjct: 390 DIFKDAF--LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVEH 447
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
AEL L+AT E + KG F+ L IE L + NV L+ P + IW+ K GF +S
Sbjct: 448 AAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLGFTILS 507
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
+ L +FK +++K +
Sbjct: 508 ADEKKSMLESHPLVMFKNLVLVQKPL 533
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 5/77 (6%)
Query: 222 GILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAG-AKTRHPNN 280
G+ D + Y + E L G G G +C C CN + VS EFE+HAG K R P
Sbjct: 152 GLPDNTLLTYKLKNGEALLQGYKQGAGIVCNC--CN--QEVSPSEFEKHAGMGKRRQPYQ 207
Query: 281 HIYLENGKPIYSIIQEL 297
+IY G ++ + +L
Sbjct: 208 NIYTSQGLTLHDVALQL 224
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 205/450 (45%), Gaps = 67/450 (14%)
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPS 463
G K RDN LH +LF GLPD LTY +K G+ L+ G K G GI+C+CC+ E+ +PS
Sbjct: 480 GQPKVRDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPS 539
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFE H GM RRQPYR IYTS+G+TLH++A+ L G + G G+
Sbjct: 540 QFEKHVGMGRRRQPYRSIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKE---- 595
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS 583
++ P+++P LK L+ V C +CR
Sbjct: 596 -------YSTTSRPIIVP----------------------LKRTLQERVLTVESCYMCRK 626
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD-------LKEIPKDKWFCCDDCNRI 636
+++C+QCE+ HV C NGL L E + + CC+ C +
Sbjct: 627 PHTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLL 685
Query: 637 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 696
A+L + + R E ++ WQ+L + + ++
Sbjct: 686 RASLHEGLKKR-----------------EDIAFLRRIRYNICWQLLNGTNMRSDVQHQVI 728
Query: 697 SSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRI 755
IF++ F A D I MV ++ +G+ +F G+Y +LT + VVSA +L++
Sbjct: 729 E----IFKDAFAE-TAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKV 783
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
EVAEL L+AT E + KG F L S IE L + NV L P + IW++K G+
Sbjct: 784 RTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGY 843
Query: 816 RKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+S E+ L +F S+++K +
Sbjct: 844 TILSDEQKHSMLMAHPLVMFANLSLVQKSL 873
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/205 (35%), Positives = 108/205 (52%), Gaps = 18/205 (8%)
Query: 164 DYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSK-KVVPNEYPTNVKKLLSTG 222
D+ P A E+ RN L SS +L +++++++ K P +V+ LLSTG
Sbjct: 200 DHAMPDA-EQTRNPVLEQPWDSS-----MLITAPVKVRVTETKYRPESILKDVRGLLSTG 253
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+L+G V Y ER G +NG GY CGC C + +++A EFEQH+G + + NNHI
Sbjct: 254 LLEGFRVTYKKNEVERI--GRINGQGYSCGCSECGYRNIMNACEFEQHSGESSNNQNNHI 311
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKH 342
+L++G +Y +IQ LK L +L +V+ KV N + WKAS L K +
Sbjct: 312 FLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFD----- 366
Query: 343 NMKLPSLPHSIISC--SSLALEESI 365
PS P S S S++AL +S+
Sbjct: 367 --DAPSDPCSTQSSQESNIALTDSL 389
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 205/450 (45%), Gaps = 67/450 (14%)
Query: 407 GGTKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPS 463
G K RDN LH +LF GLPD LTY +K G+ L+ G K G GI+C+CC+ E+ +PS
Sbjct: 579 GQPKVRDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPS 638
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLC 523
QFE H GM RRQPYR IYTS+G+TLH++A+ L G + G G+
Sbjct: 639 QFEKHVGMGRRRQPYRSIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKE---- 694
Query: 524 NGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS 583
++ P+++P LK L+ V C +CR
Sbjct: 695 -------YSTTSRPIIVP----------------------LKRTLQERVLTVESCYMCRK 725
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD-------LKEIPKDKWFCCDDCNRI 636
+++C+QCE+ HV C NGL L E + + CC+ C +
Sbjct: 726 PHTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLL 784
Query: 637 HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 696
A+L + + R E ++ WQ+L + + ++
Sbjct: 785 RASLHEGLKKR-----------------EDIAFLRRIRYNICWQLLNGTNMRSDVQHQVI 827
Query: 697 SSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRI 755
IF++ F A D I MV ++ +G+ +F G+Y +LT + VVSA +L++
Sbjct: 828 E----IFKDAFAE-TAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKV 882
Query: 756 FGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
EVAEL L+AT E + KG F L S IE L + NV L P + IW++K G+
Sbjct: 883 RTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGY 942
Query: 816 RKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+S E+ L +F S+++K +
Sbjct: 943 TILSDEQKHSMLMAHPLVMFANLSLVQKSL 972
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 213 TNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL---CNF-SKVVSAHEFE 268
T VK LL +L G V Y +L+G V GGY C CP C++ KV+SA +FE
Sbjct: 23 TCVKDLLP--LLQGVPVTYRFEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFE 80
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
+HAG +++ N HI+L NG+ +Y + +L+ P E + AG
Sbjct: 81 KHAGVTSKNQNGHIFLRNGRSLYELFHKLREVPAEKFPEAFRMAAG 126
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 97/205 (47%), Gaps = 41/205 (20%)
Query: 164 DYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSK-KVVPNEYPTNVKKLLSTG 222
D+ P A E+ RN L SS +L +++++++ K P +V+ LLSTG
Sbjct: 322 DHAMPDA-EQTRNPVLEQPWDSS-----MLITAPVKVRVTETKYRPESILKDVRGLLSTG 375
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+L+G V Y +++++A EFEQH+G + + NNHI
Sbjct: 376 LLEGFRVTY-------------------------KKNEIMNACEFEQHSGESSNNQNNHI 410
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRKGLVEHDEKH 342
+L++G +Y +IQ LK L +L +V+ KV N + WKAS L K +
Sbjct: 411 FLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFD----- 465
Query: 343 NMKLPSLPHSIISC--SSLALEESI 365
PS P S S S++AL +S+
Sbjct: 466 --DAPSDPCSTQSSQESNIALTDSL 488
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 141/453 (31%), Positives = 217/453 (47%), Gaps = 59/453 (13%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEI--SPSQF 465
+K RDN LH+L+F G+P+ LTY +K G+ L+ G KQG I+CDCC++E+ +PS F
Sbjct: 488 SKVRDNSLHQLVFKEGGVPELTILTYKLKHGEVLKQGYKQGTCILCDCCSEEVQFTPSHF 547
Query: 466 EAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNG 525
E HAGM RRQPYR+IYT G+TLH++A+ L G + G+ GD L
Sbjct: 548 EEHAGMGKRRQPYRNIYTPEGLTLHELALKLQGG--LNSNGNSSANFPGGDEPPNLSSGS 605
Query: 526 CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHD 585
+ P ++P LK L+ + C +C
Sbjct: 606 --SRESSTTYRPSIVP----------------------LKRTLQQIADKTESCRLCGDAC 641
Query: 586 FSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAA 639
+ T + +++C+QCE+ HV C +K L L E + +FCC C + A+
Sbjct: 642 TTIGTISEDMIVFCNQCERPCHVKCYNNGLQKQKGPLNVLAEYMQFHFFCCQKCQLLRAS 701
Query: 640 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 699
L + ++ R + I +K + V WQ+L +K +
Sbjct: 702 LHEVLNKREK---------IRQKR-----------SYVFWQILNGMNPGINVQK-YIHQV 740
Query: 700 TAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKS-VVVSAGLLRIFG 757
IF+ F P A +I MV +++ G+ +F GMY +LT S +VVSA +L++
Sbjct: 741 IEIFKVAF-PKTAASDFGVIQDMVNAKDVGGEKDFRGMYCAVLTTSSKLVVSAAVLKVRT 799
Query: 758 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 817
EVAEL +VATC +++ KG F L IE L ++NV L + ESIW+KK GF
Sbjct: 800 EEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMNVRLLTALVDPEMESIWSKKLGFTI 859
Query: 818 MSRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
+S E L +F+ ++++K + P+
Sbjct: 860 LSGEEKETLLEAHPLVMFEDLTLMQKSLASKPD 892
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 200 IKMSK-KVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
++M++ K P +V+ LLSTG+L+G V Y G + C
Sbjct: 258 VRMAETKYRPETLLKDVRGLLSTGLLEGFSVTY-------------KKNGKMNAC----- 299
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
EFEQHAG + + N+HI+L++G +Y +IQ LK L +L +++++ G N
Sbjct: 300 -------EFEQHAGQSSNNQNDHIFLDSGISLYKLIQALKYKKLHLLADLIEEQTGLPPN 352
Query: 319 EGSFRVWKASHHLRKGLVE 337
+ WKAS ++ +E
Sbjct: 353 LIEYGKWKASFEVQNDDLE 371
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 240 LDGIVNGGGYLCGCP-LCNF---SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQ 295
L+G V GGY+C C C++ K +SA +FE+HAGA +++ N HI+L NG +Y++
Sbjct: 26 LEGFVAEGGYVCACTSPCDYRRGGKALSALQFEKHAGAVSKNQNGHIFLRNGTSLYALFH 85
Query: 296 ELKTAPLGILEEVVKKVAG 314
L+ P E + AG
Sbjct: 86 ALREVPAQRFAEDFRMAAG 104
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 142/487 (29%), Positives = 219/487 (44%), Gaps = 79/487 (16%)
Query: 408 GTKKRDNDLHRLLFLPNGLPDGERLTY--IVKGQ-RLRFGCKQGNGIV-CDCCNKEISPS 463
GT+ R L +L L +TY + GQ ++G G GI+ C CCNK + +
Sbjct: 313 GTQHRSRTLLSVLIDSGILAPRGNVTYWAVRDGQPACKYGFITGEGIIRCKCCNKTFTVA 372
Query: 464 QFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISL-------AMGQRRTTGGS----- 507
+ EAHA G RR+ + ++ +G +L + L A R GS
Sbjct: 373 ELEAHATGGIGTDDRREAWARVFVEDGRSLSLCLMELMRRDDVGAAAANRNRNGSVMRVK 432
Query: 508 --------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
D +C VC D LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 433 EKCSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR-------- 484
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDF---SAATFDDRTVIYCDQCEKEFHVGCLRKNG- 615
C +C D +A F D+T+IYC+QCE+E+HVGC+R+ G
Sbjct: 485 ------------------CGVCGGSDLDDDTAEGFTDKTIIYCEQCEREYHVGCMRRGGS 526
Query: 616 -------LC---DLKEIPKDKW---FCCDDCNRIHAALQDFV-SNRAQTIPASSLSTINR 661
C E P+++W C +C + LQ V S+RA++IP S +
Sbjct: 527 EEESAAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQALVASSRARSIPHYSRGAYHS 586
Query: 662 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIP 720
+ +WQ ++ + + L +A + ECFD ++ DL
Sbjct: 587 APCGRRRYMSTVTRITRWQHEEE----DAADHGQLCAALDVLHECFDDMVEPRTQTDLAA 642
Query: 721 VMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 778
+V+ + + F G Y V L +++ G LR+FG +VAELPLV T ++ +G
Sbjct: 643 DIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRVFGNQVAELPLVGTRFAHRRQGMC 702
Query: 779 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 838
+ L + +E++L + V LVLPA + +WT GF M+R +++ + + FKGT
Sbjct: 703 RLLVTELEKMLRQVGVRRLVLPAVPELMPMWTASLGFHAMTRSDVMEMAVEHAILSFKGT 762
Query: 839 SMLEKKV 845
+M +K +
Sbjct: 763 TMCQKTL 769
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 192 bits (487), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 213/440 (48%), Gaps = 59/440 (13%)
Query: 426 LPDGERLTYIVKGQR--LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
L + ++++Y+ K R L G + GIVC CC K S S FEAH+G + RR I+
Sbjct: 2 LHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRR-ACASIFN 60
Query: 484 SNGMTLHDIAISL------AMGQRRTTGGS-----DDMCHVCGDGENLLLCNGCPLAFHA 532
+ G +L D+ + + + + DD C VCGDG L+ C+ CP +H
Sbjct: 61 NKGESLLDLQVQAWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHL 120
Query: 533 ACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSA--AT 590
+CL +PE W CP+CR C IC +++A ++
Sbjct: 121 SCLLLKELPEGEWFCPSCR--------------------------CAICGGSEYNADGSS 154
Query: 591 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT 650
F++ TV+ CDQCE+E+HV CL G+ + P D WFC D C++I L+ V + T
Sbjct: 155 FNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFEGLRKLV-GISNT 213
Query: 651 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPI 710
I LS + E + +MN + Q E + K L+ A + +ECF P+
Sbjct: 214 I-GEGLSWTLLRSGEDDLPSASSMN--------REQMAEHRSK--LAVALGVMQECFLPM 262
Query: 711 IAECGR-DLIPVMVYGRN---ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 766
+ + DL+ ++Y R ++ F G Y+V+L V+S +RI G +AE+PL+
Sbjct: 263 VDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLI 322
Query: 767 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 826
T ++ +G + L IE LL L + + VLPA + W FGF++M+ + L+
Sbjct: 323 GTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLEL 382
Query: 827 QRDFQLTIFKGTSMLEKKVQ 846
+ + F G ++L+K +Q
Sbjct: 383 VK-LSVVSFPGVTLLQKPLQ 401
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 162/555 (29%), Positives = 230/555 (41%), Gaps = 141/555 (25%)
Query: 202 MSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPL------ 255
M K+V+ E P + K LL +G+LDG V+Y+ L GI+ GG LC C
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLGRGGHIMLTGIIQEGGVLCDCSSCKGVQV 636
Query: 256 ---------------------------------CNFS------------KVVSAHEFEQH 270
C F KVV+ FE+H
Sbjct: 637 TCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFEKH 696
Query: 271 AGAKTRHPNNHIYLENGKPIYSIIQ------ELKTAPLGILEEVVKKVAGS-----SFNE 319
AG+ RHP++ I+LENGK + I++ + K + +L+ + +V G S +
Sbjct: 697 AGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNVMDVLKSAIGEVGGEKVQIISLDH 756
Query: 320 GSFRVWKASHHLRKGLVEHDEKHNMKL---PSLPHSIISCSSLALEESISP--TSCSFVQ 374
+ A L + + D K + + P +P + + S P T
Sbjct: 757 PLIAIQPAEKKLPQPRLVLDTKPRVPVDLKPRMPQVDMKPRVMLDTRSRMPLDTKAKSTS 816
Query: 375 DTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN---DLHRLLFLPNGLPDGER 431
D ++ + R + +K A R++ +LH+ LFLP GL D
Sbjct: 817 DVKARGGDVRATIPRLDRTTR-----EKEAASPPVPSRESSGANLHKALFLPGGLEDDIE 871
Query: 432 LTYIVKGQRLRFGCKQGNGIVCDCCN------------------------KEISPSQFEA 467
+ Y VKGQ+ G K+G GI+C CC + IS S FE
Sbjct: 872 VGYYVKGQKFLAGLKRGAGILCSCCQQVARNGMSSILMDSVVCGLTYEGERMISCSLFEQ 931
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENL 520
HAG +RR PY IY ++G +LHD A SL + Q G+ D C CGD +L
Sbjct: 932 HAGWGSRRNPYTSIYLADGRSLHDAAQSLVVEQTVKQEGNTPAKIEHLDQCVECGDRGDL 991
Query: 521 LLCNGCPLAFHAACLDPLLIPESG-WRCPNC---RQGHSSSMSRSV---DLKGG------ 567
LC CP A+H CL + SG + CP+C R G + RS+ KG
Sbjct: 992 QLCTRCPNAYHQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRRSMVNRRSKGAAKTLLS 1051
Query: 568 -----------LEAPGAEV-GGCVICRSHDFSAATFDDRTVIYCDQ----------CEKE 605
L+ P A V GGCV C+S DF+ F +T + CDQ CE+E
Sbjct: 1052 KDRVTGRCTRLLQVPEAVVLGGCVFCKSGDFAKTGFGPKTTLLCDQVSVGDMKVKGCERE 1111
Query: 606 FHVGCLRKNGLCDLK 620
+HVGCL+K+GL DLK
Sbjct: 1112 YHVGCLKKHGLEDLK 1126
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/70 (50%), Positives = 47/70 (67%)
Query: 779 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 838
+AL IERLL L VE L LPAAE AE IW +FGFR+M+ E++ ++ D + +F G+
Sbjct: 1480 KALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMVFTGS 1539
Query: 839 SMLEKKVQCL 848
SMLEK + L
Sbjct: 1540 SMLEKDIPPL 1549
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 619 LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI---NRKHIEKG-ILFDGTM 674
+E+P+ +WFC DC IH+ L VSN + + S +S + N+ +E + +
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 675 NDVQWQML 682
+ +WQ+L
Sbjct: 1262 SGFEWQLL 1269
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/175 (56%), Positives = 125/175 (71%), Gaps = 4/175 (2%)
Query: 676 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFG 734
DV+W++L + + K LL A ++F ECFDPI+ GRDLIP MV GRN+ Q+FG
Sbjct: 16 DVRWRLLN-GRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIPAMVNGRNLRTQDFG 74
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
GMY +L V S V SA +LRIFG ++AELPL+AT + +GKG FQ LFSCIERLL L+V
Sbjct: 75 GMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQTLFSCIERLLSFLSV 134
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSMLEKKV-QC 847
+NLVLPAAE+AESIW KFGF KM ++L Y++++ Q+ FKGT ML K V QC
Sbjct: 135 KNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTVMLHKTVPQC 189
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 189 bits (479), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 130/440 (29%), Positives = 208/440 (47%), Gaps = 59/440 (13%)
Query: 426 LPDGERLTYIVKGQR--LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
L + ++++Y+ K R L G + GIVC CC K S S FEAH+G + RR I+
Sbjct: 27 LHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRA-CASIFN 85
Query: 484 SNGMTLHDIAISL------AMGQRRTTGGS-----DDMCHVCGDGENLLLCNGCPLAFHA 532
+ G +L D+ + + + + DD C VCGDG L+ C+ CP +H
Sbjct: 86 NKGESLLDLQVQAWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHL 145
Query: 533 ACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSA--AT 590
+CL +PE W CP+CR C IC +++A ++
Sbjct: 146 SCLLLKELPEGEWFCPSCR--------------------------CAICGGSEYNADGSS 179
Query: 591 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT 650
F++ TV+ CDQCE+E+HV CL G+ + P D WFC D C++I L+ V
Sbjct: 180 FNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFQGLRKLVGISNNI 239
Query: 651 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPI 710
LS + E + +MN Q + +S L+ A + +ECF P+
Sbjct: 240 --GEGLSWTLLRSGEDDLPSANSMNREQ----------MAEHRSKLAVALGVMQECFLPM 287
Query: 711 I-AECGRDLIPVMVYGRN---ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLV 766
+ DL+ ++Y R ++ F G Y+V+L V+S +RI G +AE+PL+
Sbjct: 288 VDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLI 347
Query: 767 ATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKY 826
T ++ +G + L IE LL L + + VLPA + W FGF++M+ + L+
Sbjct: 348 GTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLEL 407
Query: 827 QRDFQLTIFKGTSMLEKKVQ 846
+ + F G ++L+K +Q
Sbjct: 408 VK-LSVVSFPGVTLLQKPLQ 426
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 134/439 (30%), Positives = 199/439 (45%), Gaps = 68/439 (15%)
Query: 429 GERLTYI-VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGM 487
G+++TY G ++ G + +GI C CCN+ + FE HAG ++ P H++ +GM
Sbjct: 585 GDKVTYKQSDGPGIKEGSIRRDGIKCMCCNEIFTVENFEVHAG-SSTPLPSAHMFLKDGM 643
Query: 488 TLHDIAISLAMGQR---------RTTGG-----SDDMCHVCGDGENLLLCNGCPLAFHAA 533
+L + G + R G SD +C VC DG +LLLC+ CP ++H
Sbjct: 644 SLSQCLVEFMGGNKPRDPHPLHARLKGKNSDLESDSICSVCHDGGDLLLCDNCPSSYHHD 703
Query: 534 CLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT--F 591
C+ IPE W CP+CR C IC D+ T F
Sbjct: 704 CVGLEAIPEGNWYCPSCR--------------------------CSICNLSDYDPDTSQF 737
Query: 592 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT- 650
++T++YCDQCE+E+HVGC R N L P+ WFC C+ + LQ+ + T
Sbjct: 738 TEKTIVYCDQCEREYHVGCTR-NSDNQLICRPEGCWFCSRGCSNVFQHLQELIGKSVPTP 796
Query: 651 ---IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECF 707
+ + L + + G D M D L A I ECF
Sbjct: 797 IEGVSWTILKFCSGNGSDHGDYDDEIMAD---------------HYGKLCVAVGILHECF 841
Query: 708 DPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELP 764
II D+ +V+ R + F G Y+++L +S G RI G++ AELP
Sbjct: 842 VTIIEPRTQSDISEDIVFNRESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFAELP 901
Query: 765 LVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLL 824
L+ T Y+ +G + L + +E+LL L VE L+LPA + WT FGF MS L
Sbjct: 902 LIGTSSPYRRQGMCRLLINELEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNSDRL 961
Query: 825 KYQRDFQLTIFKGTSMLEK 843
+ + L+ F+GT+M +K
Sbjct: 962 ELAGNSILS-FQGTTMCQK 979
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 186 bits (471), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 197/415 (47%), Gaps = 59/415 (14%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC+K ++ S+FE HAG + RQP+++IY +G++L + I Q
Sbjct: 691 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLDSGVSLLECQIDAWNRQESIERIGF 749
Query: 503 ---TTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
T G +DD C +CGDG +L+ C+GCP FH +CLD +++P W CPNC
Sbjct: 750 HSVNTDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCPNCT----- 804
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVIC--RSHDFSAATFDDRT-VIYCDQCEKEFHVGCLRK 613
C C S DF + + ++ C C K++H CL+
Sbjct: 805 ---------------------CKFCGIASEDFVQEDGTNVSELLTCSLCAKKYHKSCLQD 843
Query: 614 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
+ FC C + LQ ++ + + S S ++R I+ + G
Sbjct: 844 VDAPCIDFNNSTPCFCGKTCRELFEQLQKYLGIKHELESGFSWSLVHRMDIDLDMSLQG- 902
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISG 730
+ ++ +C S L+ A ++ ECF PI+ G ++I ++Y G N +
Sbjct: 903 -------LPQRVEC-----NSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNR 950
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
+ G Y+ IL ++SA +R G ++AE+P + T Y+ +G + LFS IE LC
Sbjct: 951 LNYSGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRRLFSAIESALC 1010
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
SL V+ L++PA + WT FGF +S + L + + + +F G ML+K++
Sbjct: 1011 SLKVQKLIIPAISELTHTWTGVFGFTTLS-DSLKQELKSMNMLVFPGIDMLQKQL 1064
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 38/60 (63%), Gaps = 5/60 (8%)
Query: 229 VKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
V+Y++ R + L+G V G CGC C SK+++ +FE HAG+K R P +IYL++G
Sbjct: 672 VRYMNRRRTKVMLEGWVTRDGIHCGC--C--SKILTVSKFEIHAGSKLRQPFQNIYLDSG 727
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 186 bits (471), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 136/497 (27%), Positives = 219/497 (44%), Gaps = 73/497 (14%)
Query: 396 RPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQR--LRFGCKQGNGIVC 453
R + + ++R L +L + + +++TY R + G G GI C
Sbjct: 372 RVVLRPTNSTASACQRRSRTLLSVLIDKDIVVPRDKVTYRAARDRPAAKDGFITGEGIRC 431
Query: 454 DCCNKEISPSQFEAHAGM---AARRQPYRHIYTSNGMTL---------HDIAISLA---- 497
CCNK ++ ++F AHA + RR+ + ++ +G +L D+A+ A
Sbjct: 432 TCCNKTLTVAEFAAHATARRGSDRREAWARVFLKDGRSLSQCLVELMRRDVAVVAARNGD 491
Query: 498 --MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+ ++ + D +C +C DG +LLLC+ CP AFH AC+ PE W CP+CR
Sbjct: 492 VRVKEKCSDPEGDSVCSICNDGGDLLLCDNCPSAFHHACVGLQATPEGDWFCPSCR---- 547
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT-----FDDRTVIYCDQCEKEFHVGC 610
C +C DF A F D+T+IYCDQCE+E+HVGC
Sbjct: 548 ----------------------CGVCGGSDFDATAAGGGGFTDKTIIYCDQCEREYHVGC 585
Query: 611 LRKNG----------LCDLKEIPKD-KWFCCDDCNRIHAALQDFVS-NRAQTIPASSLST 658
+R+ G C E ++ W C +C + LQ + R ++IP +
Sbjct: 586 VRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEVFRHLQGLAAVARERSIPIPTTVP 645
Query: 659 INRKHIEKGILFDGTMNDVQWQMLKKAQC------FEEKEKSLLSSATAIFRECFDPII- 711
+ + IL + + + C + E L SA + ECF +I
Sbjct: 646 TTVEGVSLSILRRRRRRPIS-MVATGSGCQEEEEEEDAAEHGQLCSALDVLHECFVTLIE 704
Query: 712 AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATC 769
DL +V+ R + F G Y V L +++ G LR+ G EVAELPLV T
Sbjct: 705 PRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGELITVGTLRVLGTEVAELPLVGTR 764
Query: 770 REYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD 829
++ +G L + +E++L + V LVLPA + +WT GF M+R +++ +
Sbjct: 765 FAHRRQGMCHLLVTELEKVLRQVGVRRLVLPAVPELLPMWTASLGFHPMTRSDVMEIAAE 824
Query: 830 FQLTIFKGTSMLEKKVQ 846
+ F+GT+M K +
Sbjct: 825 HAILSFQGTTMCHKSLH 841
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/420 (31%), Positives = 192/420 (45%), Gaps = 71/420 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLAMGQRRTTG 505
+GI+C CC+K ++ S+F+ HAG R P +I+ +G TL + A S RR+
Sbjct: 728 DGIICTCCDKVLTLSEFKFHAGFTVNR-PCLNIFMESGEPFTLCLLQAWSAEYKARRSQN 786
Query: 506 GS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ DD C +CG+G L+ C+ CP FH ACL IP+ W C NC
Sbjct: 787 QAVHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----- 841
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
C IC + T D + C QCE ++H CL
Sbjct: 842 ---------------------CRICGNLVIDKDTSDAHDSLQCSQCEHKYHEKCLEDR-- 878
Query: 617 CDLKEIP-KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN 675
D +E+ D WFC C +++ LQ V Q S + + + I D ++
Sbjct: 879 -DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISWTLL------RCIHDDQKVH 931
Query: 676 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECF----DPIIAECGRDLIPVMVY--GRNIS 729
QW LK A C + L+ A I ECF DP G +IP ++Y G +
Sbjct: 932 SAQWFALK-AVC-----NTKLAVALTIMEECFVSMFDP---RTGIHMIPQVLYNWGSEFA 982
Query: 730 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
F G Y+++L K V++S +R+ G VAE+PL+ATC +Y+ +G + L S IE++L
Sbjct: 983 RLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVSAIEQML 1042
Query: 790 CSLNVENLVLPAAEKAESIWTKKFGF---RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
S VE LV+ A WTK FGF + R+RL K L +F GT +L K +
Sbjct: 1043 ISFKVEKLVVSAIPDLVETWTKGFGFITVDDIERQRLNK----INLMVFPGTVLLVKSLH 1098
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 182 bits (461), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 139/448 (31%), Positives = 210/448 (46%), Gaps = 59/448 (13%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS FE
Sbjct: 387 SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEE 446
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G++LH +A+ L ++ L NG
Sbjct: 447 HAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFD 485
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFS 587
A ++ D + SG C + S++ V LK L+ E C C +
Sbjct: 486 NASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHTT 540
Query: 588 AATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 641
+ T+I+C+QCE+ H+ C +K L LKE + CC +C + A L+
Sbjct: 541 IGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE 600
Query: 642 DFVSNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQMLKKAQCFEEKEKSLLSS 698
+ +EK GI F + +++ W++L + K +
Sbjct: 601 E--------------------GLEKCVGITFLRRIRSNICWRLLSGMDASRDV-KLYMPQ 639
Query: 699 ATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFG 757
IF++ F E D+I MV G+N + +F GMY +LT + VVSA +L++
Sbjct: 640 VIDIFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRI 698
Query: 758 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 817
++AEL L+AT E + KG F L IE L + NV L P + IW++K GF
Sbjct: 699 EQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTI 758
Query: 818 MSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+S E L +FK +++K +
Sbjct: 759 LSAEEKESMLESHPLVMFKNLVLVQKSL 786
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 72/125 (57%), Gaps = 3/125 (2%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 114 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 171
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 172 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 231
Query: 333 KGLVE 337
K ++
Sbjct: 232 KDYLD 236
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 191/415 (46%), Gaps = 59/415 (14%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAIS----------LAM 498
+GI C CC+K ++ S+FE HAG + RQP+++IY +G +L D I L
Sbjct: 17 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLESGASLLDCQIEAWNRQEPVKRLGF 75
Query: 499 GQRRTTGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
G +DD C +CGDG +L+ C+GCP FH +CLD ++P W CPNC
Sbjct: 76 QAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPNC------ 129
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTV---IYCDQCEKEFHVGCLRK 613
C C DD TV + C C K++H C+++
Sbjct: 130 --------------------SCKFCGVASDKNFQRDDTTVSKLLTCSLCVKKYHKSCMQE 169
Query: 614 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
+ FC C + LQ ++ + + S S I+R + G
Sbjct: 170 INTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIHRTDADSDTSLQG- 228
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISG 730
+ ++ +C S L+ + ++ ECF PI+ G +LI ++Y G N +
Sbjct: 229 -------LPQRVEC-----NSKLAVSLSVMDECFLPIVDRRSGINLIQNVLYNCGSNFNR 276
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
FGG Y++IL ++SA +R G +AE+P + T Y+ +G + LF IE LC
Sbjct: 277 LNFGGFYALILERGDEIISAASIRFHGTRLAEMPFIGTRHMYRRQGMCRRLFYAIESTLC 336
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
SL VE L++PA + WT+ FGF + E L + + + +F G ML+K++
Sbjct: 337 SLKVEKLIIPAISELMHTWTEVFGFTTLD-ESLKQELKSMNMLVFPGIDMLQKQL 390
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
L+G V G CGC SK+++ +FE HAG+K R P +IYLE+G +
Sbjct: 10 LEGWVTRDGIHCGCC----SKILTVSKFEIHAGSKLRQPFQNIYLESGASL 56
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 197/429 (45%), Gaps = 54/429 (12%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 621 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 679
Query: 490 HDIAISLAMGQRRT----------TGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
I Q GG +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 680 LQCQIDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 739
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVI 597
++P WRC NC G+ + +E +C H
Sbjct: 740 QMLPPGEWRCMNCTCKFC-----------GIASGTSEKDDASVCVLH------------- 775
Query: 598 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS 657
C+ CEK++H C ++ FC +C + L+ ++ + + S S
Sbjct: 776 ICNLCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEHLKKYLGTKHELESGFSWS 835
Query: 658 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGR 716
I+R + G + ++ +C S L+ + ECF P+I G
Sbjct: 836 LIHRTDDDSEAACRG--------ISQRVEC-----NSKLAITLTVMDECFLPVIDRRSGI 882
Query: 717 DLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQG 774
+LI ++Y G N S + G Y+ IL +++A +R G ++AE+P + T Y+
Sbjct: 883 NLIRNVLYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHIYRR 942
Query: 775 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 834
+G + LFS IE LCSL VE LV+PA + + WT FGF + + L + + + +
Sbjct: 943 QGMCRRLFSAIESTLCSLKVEKLVIPAIAEVTNTWTTVFGFTHLDKS-LRQEMKSLNMMV 1001
Query: 835 FKGTSMLEK 843
F G ML+K
Sbjct: 1002 FPGIDMLQK 1010
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 9/81 (11%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
P K+ + ++D V+ + R+ L+G + G CGC C SK+++ +
Sbjct: 600 PYTGKRTVLAWLIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGC--C--SKILTVSK 655
Query: 267 FEQHAGAKTRHPNNHIYLENG 287
FE HAG+K P +IYLE+G
Sbjct: 656 FELHAGSKLPQPYQNIYLESG 676
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 179 bits (454), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 136/446 (30%), Positives = 204/446 (45%), Gaps = 54/446 (12%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F GL D LTY +K G+ L+ G K G GI+C+CC++E +PS FE
Sbjct: 102 SKTRDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNCCSQEFAPSHFEE 161
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G TLH +A+ L ++ L NG
Sbjct: 162 HAGMGRRRQPYHNIYTPEGSTLHKLALQL---------------------QDHLNSNGFD 200
Query: 528 LAFHAACLD-PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDF 586
A ++ D P L SG C + S++ V LK L+ E C C
Sbjct: 201 NASVSSFSDYPNLTSASG-----CGRQPSTTSGPIVPLKRTLQGRVVETESCYFCGYGHT 255
Query: 587 SAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
+ D +I+C+QCE+ HV C +K L LKE ++ CC +C + L
Sbjct: 256 TIGNIDPDMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQECQLLRDRL 315
Query: 641 QDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSAT 700
++ + + E+ +++ W++L + K +
Sbjct: 316 EEGL-----------------EKCEEIAFLRRIRSNICWRLLSGMDASRDV-KLFMPQVI 357
Query: 701 AIFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGRE 759
IF++ F E D+ MV +N + +F GMY +LT + VVSA +L++ +
Sbjct: 358 DIFKDAFVESTDE-HSDIFSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRMEQ 416
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS 819
+AEL L+AT RE + KG F L IE L + NV L P + IW++K GF +S
Sbjct: 417 IAELVLIATRRECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILS 476
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
E L +FK +++K +
Sbjct: 477 AEEKESVLESHPLVMFKNLVLVQKSL 502
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 179 bits (453), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 124/423 (29%), Positives = 194/423 (45%), Gaps = 65/423 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC + S S+FEAHAG + R +I+ +G +L + + + R TG
Sbjct: 593 DGIKCSCCQEVFSLSRFEAHAGSSYHRSA-ANIFLEDGRSLLECQMQII---RDITGKGF 648
Query: 507 ----------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+D +C VC G +L+LC+ CP +FH +CL + W CP+C
Sbjct: 649 TKESFSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSC 708
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
C IC + F + D V C QCE+++HVGC
Sbjct: 709 --------------------------CCGICGENKFDGGSEQDNVVFSCYQCERQYHVGC 742
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE-KGIL 669
LRK G L P WFC C +I LQ + ++ + +L+ K I KG+
Sbjct: 743 LRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLG-KSFPVGVDNLTWTLLKPIRSKGLE 801
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GR 726
D L + E S L+ A + ECF+P+ RD++ +++ G
Sbjct: 802 ID----------LPDIEALTEV-YSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGS 850
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
+++ F G Y+V+L ++S +R++G +VAE+PL+ T +Y+ G L + +E
Sbjct: 851 DLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHILMNELE 910
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
+ L L VE LVLPA + WT FGF KM+ L++ D+ F+ T M +K +
Sbjct: 911 KKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFL-DYSFLDFQDTVMCQKLLM 969
Query: 847 CLP 849
+P
Sbjct: 970 KIP 972
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 126/228 (55%), Gaps = 24/228 (10%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
ETLS GD + E++ + R NN + S + + P H+ + + F R Y
Sbjct: 317 ETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENM-----SFECRNPPY 371
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L + + K +KK N +P+NVK LLSTGI D
Sbjct: 372 ASPRV-------------------DTLLVPKIKDTKTAKKGSTNTFPSNVKSLLSTGIFD 412
Query: 226 GACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLE 285
G VKY S SRER L G++ G GYLCGC C +KV++A+EFEQHA KT+HPNNHIY E
Sbjct: 413 GVTVKYYSWSRERNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFE 472
Query: 286 NGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
NGK IY ++QELK P L + ++ V GS N +F WKAS+H+ +
Sbjct: 473 NGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHVAR 520
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 78/136 (57%), Positives = 104/136 (76%)
Query: 195 ARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCP 254
+++ E +MSKKV PN +P+NVK LLSTG+LDG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 407 SKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCE 466
Query: 255 LCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
CN SK ++A+EFE+HAG KT+HPNNHIY ENGK IY+++QELK P +L + ++ V G
Sbjct: 467 NCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTG 526
Query: 315 SSFNEGSFRVWKASHH 330
S N+ +FR+WKAS+
Sbjct: 527 SPINQKNFRIWKASYQ 542
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG + Y+ + G +G G +C C CN K ++ +FE HAG +
Sbjct: 430 LLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAG-CKTK 488
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 489 HPNNHIYFENGKTIYAVVQEL 509
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/136 (57%), Positives = 104/136 (76%)
Query: 195 ARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCP 254
+++ E +MSKKV PN +P+NVK LLSTG+LDG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 428 SKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCE 487
Query: 255 LCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAG 314
CN SK ++A+EFE+HAG KT+HPNNHIY ENGK IY+++QELK P +L + ++ V G
Sbjct: 488 NCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTG 547
Query: 315 SSFNEGSFRVWKASHH 330
S N+ +FR+WKAS+
Sbjct: 548 SPINQKNFRIWKASYQ 563
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG + Y+ + G +G G +C C CN K ++ +FE HAG +
Sbjct: 451 LLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAG-CKTK 509
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 510 HPNNHIYFENGKTIYAVVQEL 530
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 78/138 (56%), Positives = 102/138 (73%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
+ +N + + SKKV PN +P+NVK LLSTG+LDG VKYIS SRE+ L G++ G GYLCG
Sbjct: 336 VTPKNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSREKNLKGLIKGAGYLCG 395
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C CNF+K ++A+EFE+HA KT+HPNNHIY ENGK IY ++QELK P +L E ++ V
Sbjct: 396 CQECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTV 455
Query: 313 AGSSFNEGSFRVWKASHH 330
GS N+ +FR WKAS+
Sbjct: 456 TGSPINQKNFRSWKASYQ 473
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 131/420 (31%), Positives = 189/420 (45%), Gaps = 71/420 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLAMGQRRTTG 505
+GI+C CC K ++ S+F+ HAG R P +I+ +G TL + A S R++
Sbjct: 728 DGIICICCGKVLTLSEFKFHAGFTLNR-PCLNIFMESGEPFTLCLLQAWSTEYKARKSQN 786
Query: 506 GS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ DD C +CG+G L+ C+ CP FH ACL IP+ W C NC
Sbjct: 787 QAVHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----- 841
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
C IC + T D + C QCE ++H CL
Sbjct: 842 ---------------------CRICGNLVIDKDTLDAHDSLQCSQCEHKYHEKCLEDR-- 878
Query: 617 CDLKE-IPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN 675
D +E D WFC C +++ LQ V Q S + + + I D ++
Sbjct: 879 -DKQEGAILDTWFCGQSCQEVYSGLQSQVGLVNQVADGISWTLL------RCIHDDQKVH 931
Query: 676 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECF----DPIIAECGRDLIPVMVY--GRNIS 729
QW LK A C + L+ A I ECF DP G LIP ++Y G +
Sbjct: 932 SAQWFALK-AVC-----NTKLAVALTIMEECFVSMFDP---RTGIHLIPQVLYNWGSEFA 982
Query: 730 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
F G Y+++L V++S +R+ G VAE+PL+ATC +Y+ +G + L + IE++L
Sbjct: 983 RLNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVTAIEQVL 1042
Query: 790 CSLNVENLVLPAAEKAESIWTKKFGF---RKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
S VE LV+ A WTK FGF + R+RL K L +F GT +L K +
Sbjct: 1043 ISFKVEKLVISAIPDLVETWTKGFGFIPVDDIERQRLNK----INLMVFPGTVLLVKSLH 1098
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 130/227 (57%), Gaps = 25/227 (11%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
+TLS GD + ES+ R N+ + S + + P H+ + ++F+F R Y
Sbjct: 301 QTLSFGDYQKESTIGSAVRVINSYENFSHDPAIAKDPPHMEAE---ENMSFEF--RNPPY 355
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L ++ + K +KK N +P+NVK LLSTG+ D
Sbjct: 356 ASPRV-------------------DTLLVPKSKDTKTAKKGSTNTFPSNVKSLLSTGMFD 396
Query: 226 GACVKYISTSRE-RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYL 284
G VKY S SRE R L GI+ G GYLCGC CNF++V++A+EFEQHA KT+HPNNHIY
Sbjct: 397 GVTVKYYSWSREVRNLKGIIKGTGYLCGCGNCNFNRVLNAYEFEQHANCKTKHPNNHIYF 456
Query: 285 ENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHL 331
ENGK IY ++QELK P L + ++ V GS N +F WKAS+H+
Sbjct: 457 ENGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHV 503
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 11/141 (7%)
Query: 402 KRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRF--GCKQGNGIVCDC---- 455
K +G T +++ LL G+ DG + Y + +R G +G G +C C
Sbjct: 372 KTAKKGSTNTFPSNVKSLL--STGMFDGVTVKYYSWSREVRNLKGIIKGTGYLCGCGNCN 429
Query: 456 CNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCG 515
N+ ++ +FE HA + P HIY NG T++ + L + D + +V G
Sbjct: 430 FNRVLNAYEFEQHANCKTK-HPNNHIYFENGKTIYGVVQELKNTPQEKL--FDAIQNVTG 486
Query: 516 DGENLLLCNGCPLAFHAACLD 536
N N ++H A L+
Sbjct: 487 SDINHKNFNTWKASYHVASLE 507
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 81/130 (62%), Positives = 102/130 (78%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E K+SKKV PN +P+NV+ LLSTG+LDG VKYI+ SRE +L GI+ G GYLCGC CNF
Sbjct: 450 EQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSRE-ELRGIIKGSGYLCGCQSCNF 508
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QELK+ P L +V++ + GS N
Sbjct: 509 SKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPIN 568
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 569 QKSFRLWKES 578
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
L G+ DG + YI + G +G+G +C C CN K I+ +FE HAG +
Sbjct: 469 LLSTGMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAG-CKTKH 527
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 528 PNNHIYFENGKTIYGIVQEL 547
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 101/130 (77%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CNF
Sbjct: 269 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRE-ELHGIIKGSGYLCGCQSCNF 327
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V GS N
Sbjct: 328 NKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTVTGSPIN 387
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 388 QKSFRIWKES 397
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C NK ++ +FE HAG +
Sbjct: 288 LISTGMLDGVPVKYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-H 346
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 347 PNNHIYFENGKTIYQIVQEL 366
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 103/137 (75%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N E+K SKKV N +P+NVK LLSTG+LDG VKY+S SRE+ L+GI+ G GYLCGC
Sbjct: 435 ASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKTLEGIIKGTGYLCGC 494
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
C +K ++A+EFE+HA KT+HPNNHI+ ENGK IY+++QELK P G+L ++ V
Sbjct: 495 KECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNAIQTVT 554
Query: 314 GSSFNEGSFRVWKASHH 330
GS N+ +FR+WKAS+
Sbjct: 555 GSHINQKNFRIWKASYQ 571
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 116/417 (27%), Positives = 189/417 (45%), Gaps = 68/417 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC+K ++ S+FE HAG + RQP+++I+ ++G++L I A +++ G
Sbjct: 615 DGIHCGCCSKILAVSKFEIHAG-SKLRQPFQNIFLNSGVSLLQCQID-AWDKQKGAGNIG 672
Query: 507 -----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+DD C +CGDG +L+ C+GCP FH CLD + P W CPNC
Sbjct: 673 FCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRMFPLGDWHCPNCT---- 728
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIY----CDQCEKEFHVGCL 611
C C+ A +D T C CEK++H C+
Sbjct: 729 ----------------------CKFCK------AVIEDVTQTVGANTCKMCEKKYHKSCM 760
Query: 612 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
K + FC C + ++ +V + + S S ++R+ +
Sbjct: 761 PKANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHRECTNSDLSLS 820
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNI 728
G + V+ S L+ A + ECF PII G +++ ++Y G N
Sbjct: 821 GHPHIVE-------------NNSKLALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNF 867
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
+ FGG Y+ +L +V++ +R G +AE+P + T Y+ +G + LFS +E
Sbjct: 868 NRLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESA 927
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L L V+ L++PA +W KFGFR++ + L K R L F G +L+K++
Sbjct: 928 LQHLKVKLLIIPATADFSHVWISKFGFRQVE-DSLKKEMRSMNLLTFPGIDVLQKEL 983
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 206 VVPNEYPTNVKKLLSTGILD-GACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVS 263
V +E T + L+ +G L V Y++ R R L+G + G CGC C SK+++
Sbjct: 572 VPSSEKRTVLAWLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGC--C--SKILA 627
Query: 264 AHEFEQHAGAKTRHPNNHIYLENG 287
+FE HAG+K R P +I+L +G
Sbjct: 628 VSKFEIHAGSKLRQPFQNIFLNSG 651
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 191/423 (45%), Gaps = 57/423 (13%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG---MTLHDIAI 494
G +++G G+GI+C+CC+ +S S+F++HAG R +++ +G M A
Sbjct: 864 GSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRA-CSNLFLDSGRPFMLCQLQAW 922
Query: 495 SLAMGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
S R++ + DD C +CGDG L+ C+ CP FH +CL +PE W
Sbjct: 923 STEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNW 982
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
C NC C IC + C QCE++
Sbjct: 983 YCLNCT--------------------------CRICGDLVNFEEISSSSDALKCFQCEQK 1016
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
+H CL++ + + WFC C +I+AALQ + Q S + + H +
Sbjct: 1017 YHGQCLKQRDIN--SGVESHIWFCSGSCQKIYAALQSQLGLTNQFANGFSWTLLRCIHYD 1074
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY 724
+ IL ++ A+C S L A I ECF ++ G D+IP +VY
Sbjct: 1075 QKILSTA-------RLAMMAEC-----NSRLVVALTIMEECFLSMVDPRTGIDMIPHLVY 1122
Query: 725 GRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
S +F G Y+VIL V++ +R+ G E+AE+PL+ATC +Y+ +G + L
Sbjct: 1123 SWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLL 1182
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
+ IE +L S V+ LV+ A WT+ FGF + E + F L +F GT +L+
Sbjct: 1183 NAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHK-FNLMVFPGTVLLK 1241
Query: 843 KKV 845
K +
Sbjct: 1242 KAL 1244
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 101/130 (77%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CNF
Sbjct: 339 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRE-ELHGIIKGSGYLCGCQSCNF 397
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V GS N
Sbjct: 398 NKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTVTGSPIN 457
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 458 QKSFRIWKES 467
Score = 42.7 bits (99), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C NK ++ +FE HAG +
Sbjct: 358 LISTGMLDGVPVKYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-H 416
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 417 PNNHIYFENGKTIYQIVQEL 436
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 191/423 (45%), Gaps = 57/423 (13%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG---MTLHDIAI 494
G +++G G+GI+C+CC+ +S S+F++HAG R +++ +G M A
Sbjct: 864 GSVVKYGRITGDGIICNCCSDILSISEFKSHAGFKFNRA-CSNLFLDSGRPFMLCQLQAW 922
Query: 495 SLAMGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
S R++ + DD C +CGDG L+ C+ CP FH +CL +PE W
Sbjct: 923 STEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNW 982
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
C NC C IC + C QCE++
Sbjct: 983 YCLNCT--------------------------CRICGDLVNFEEISSSSDALKCFQCEQK 1016
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
+H CL++ + + WFC C +I+AALQ + Q S + + H +
Sbjct: 1017 YHGQCLKQRDI--DSGVESHIWFCSGSCQKIYAALQSQLGLTNQFANGFSWTLLRCIHYD 1074
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY 724
+ IL ++ A+C S L A I ECF ++ G D+IP +VY
Sbjct: 1075 QKILSTA-------RLAMMAEC-----NSRLVVALTIMEECFLSMVDPRTGIDMIPHLVY 1122
Query: 725 GRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
S +F G Y+VIL V++ +R+ G E+AE+PL+ATC +Y+ +G + L
Sbjct: 1123 SWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLL 1182
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
+ IE +L S V+ LV+ A WT+ FGF + E + F L +F GT +L+
Sbjct: 1183 NAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHK-FNLMVFPGTVLLK 1241
Query: 843 KKV 845
K +
Sbjct: 1242 KAL 1244
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 175 bits (444), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 120/433 (27%), Positives = 193/433 (44%), Gaps = 62/433 (14%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 620 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 678
Query: 490 HDIAISLAMGQRRT------------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
I Q +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 679 LQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 738
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC----RSHDFSAATFDD 593
++P W CPNC C C + D A+ +
Sbjct: 739 QMLPPGEWHCPNC--------------------------TCKFCGIASETSDKDDASVN- 771
Query: 594 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 653
+ C CEK++H C ++ FC +C + L+ ++ + +
Sbjct: 772 -VLRTCILCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEYLKKYLGTKHELEAG 830
Query: 654 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-A 712
S I+R + G + ++ +C S L+ A + ECF P+I
Sbjct: 831 FSWCLIHRSDEDSEAACRG--------LTQRVEC-----NSKLAIALTVMDECFLPVIDR 877
Query: 713 ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCR 770
G +LI ++Y G N S + G Y+ IL +++A +R G ++AE+P + T
Sbjct: 878 RSGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKIAEMPFIGTRH 937
Query: 771 EYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF 830
Y+ +G + LFS IE LCSL VE LV+PA + WT FGF + E L + +
Sbjct: 938 IYRRQGMCRRLFSAIELALCSLKVEKLVIPAVAELTHTWTTVFGFTYLD-ESLRQEMKSL 996
Query: 831 QLTIFKGTSMLEK 843
+ +F G ML+K
Sbjct: 997 NMMVFPGIDMLQK 1009
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
L+G + G CGC C SK+++ +FE HAG+K P +IYLE+G
Sbjct: 632 LEGWITRDGIHCGC--C--SKILTVSKFELHAGSKLPQPYQNIYLESG 675
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 108/296 (36%), Positives = 164/296 (55%), Gaps = 29/296 (9%)
Query: 209 NEYPTNVKKLLSTGILDGACVKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVSAHEF 267
N+ PTN+++LL+TG+L+G VKYI +R L G++ G LC C C KVVS + F
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIMRKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYYF 316
Query: 268 EQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW-- 325
E HAG+ +HP+++I+LENG ++ +++ A L +LE ++K G + E FR
Sbjct: 317 EVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKSC 376
Query: 326 KASHH-LRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
K+S LR G + C S LE + + S + GR QT
Sbjct: 377 KSSFSTLRSGKFA----------------LFCDS-CLESKGAKNNISSSK-VGRSQTSSA 418
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFG 444
++ + G K ++ G ++D +H+++F+ LP+G + Y V G+RL G
Sbjct: 419 KVYKSASPGAKS-------SSVGRLTRKDKGMHKVVFMSGILPEGTDVGYYVGGKRLLDG 471
Query: 445 CKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
+ GI C CC+ +SPSQFE HAG AARR+PY +IY SNG++LH+++ISL GQ
Sbjct: 472 YIKELGIYCHCCSTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQ 527
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 9/90 (10%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIV-KGQRLRF-GCKQGNGIVCDCCN----KEISP 462
T K +L LL G+ +G+ + YI+ KG+R G + GI+C C + K +SP
Sbjct: 256 TNKIPTNLRELLA--TGMLEGQPVKYIMRKGKRAVLRGVIKRIGILCSCSSCKGRKVVSP 313
Query: 463 SQFEAHAGMAARRQPYRHIYTSNGMTLHDI 492
FE HAG + ++ P +I+ NG LHD+
Sbjct: 314 YYFEVHAG-STKKHPSDYIFLENGNNLHDV 342
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 77/137 (56%), Positives = 101/137 (73%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+N E K +KK N +P+NVK LLSTGI DG VKY+S SRE+ L GI+ G GYLC C
Sbjct: 426 IVKNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSC 485
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN SK ++A+EFE+HAGAKT+HPNNHIY ENGK IY+++QELK P +L + ++ V
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 314 GSSFNEGSFRVWKASHH 330
GS+ N+ +FR+WKAS+
Sbjct: 546 GSTINQKNFRIWKASYQ 562
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG ++ Y+ + G +G G +C C CN K ++ +FE HAG A +
Sbjct: 450 LLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTK 508
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 509 HPNNHIYFENGKTIYAVVQEL 529
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 123/423 (29%), Positives = 192/423 (45%), Gaps = 68/423 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GI C CC + S S+FEAHAG + R +I+ +G +L + + + R TG
Sbjct: 848 DGIKCSCCQEVFSLSRFEAHAGSSYHRSA-ANIFLEDGRSLLECQMQII---RDITGKGF 903
Query: 507 ----------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+D +C VC G +L+LC+ CP +FH +CL +PE W CP+C
Sbjct: 904 TKESFSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSC 963
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
C IC + F + D V C QCE++ C
Sbjct: 964 --------------------------CCGICGENKFDGGSEQDNVVFSCYQCERQC---C 994
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE-KGIL 669
LRK G L P WFC C +I LQ + ++ + +L+ K I KG+
Sbjct: 995 LRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLG-KSFPVGVDNLTWTLLKPIRSKGLE 1053
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GR 726
D L + E S L+ A + ECF+P+ RD++ +++ G
Sbjct: 1054 ID----------LPDIEALTEV-YSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGS 1102
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
+++ F G Y+V+L ++S +R++G +VAE+PL+ T +Y+ G L + +E
Sbjct: 1103 DLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHILMNELE 1162
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
+ L L VE LVLPA + WT FGF KM+ L++ D+ F+ T M +K +
Sbjct: 1163 KKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFL-DYSFLDFQDTVMCQKLLM 1221
Query: 847 CLP 849
+P
Sbjct: 1222 KIP 1224
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 185/427 (43%), Gaps = 74/427 (17%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDIAISLAM 498
++ G + GI+C CCN +S + F+ HAG R P +++ +G TL + A
Sbjct: 787 IKDGLIKKEGIMCKCCNMVLSVTNFKNHAGFKQSR-PCLNVFMKSGKPFTLCQLQAWSAE 845
Query: 499 GQRRTT-----------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRC 547
+ R + +DD C +CGDG L+ C+ CP FH ACL +PE W C
Sbjct: 846 YKTRKSRTIKVVRTADDDENDDSCGLCGDGGELICCDNCPSTFHQACLSTEELPEGSWYC 905
Query: 548 PNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFH 607
PNC C IC + C QCE ++H
Sbjct: 906 PNCT--------------------------CWICGELVNDKEDINSSNAFKCSQCEHKYH 939
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
C + + K D WFC C ++ LQ S + IN HI G
Sbjct: 940 DSCWKNKTIG--KGGASDTWFCGGSCQAVYFGLQ------------SRVGIIN--HIADG 983
Query: 668 ILF--------DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDL 718
+ + D ++ Q ++ KA+C S L+ A I ECF ++ G D+
Sbjct: 984 VCWTLLKCIHEDQKVHSAQ-RLALKAEC-----NSKLAVALTIMEECFQSMVDPRTGIDM 1037
Query: 719 IPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
IP ++Y + F G Y+V+L V++S +RI G VAE+PL+ATC Y+ +G
Sbjct: 1038 IPHVLYNWRSEFARLNFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLIATCSNYRRQG 1097
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L + IE +L S VE LV+ A WT+ FGF MS + + L +F
Sbjct: 1098 MCRRLMTAIEEMLISFKVEKLVVSAIPDLVETWTEGFGFTPMSNDEKQSLNK-INLMVFP 1156
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 1157 GTILLKK 1163
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 187/420 (44%), Gaps = 57/420 (13%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDIAISLAM 498
++ G +GI+C CCN +S ++F++HAG R P +++ +G TL + A
Sbjct: 141 IKDGVVTKDGIMCKCCNMVLSVTKFKSHAGFKLNR-PCSNLFMESGKPFTLCQLQAWSAE 199
Query: 499 GQRRTTGG----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+ R +G +DD C +CGDG L+ C+ CP FH ACL +PE W CP
Sbjct: 200 YKSRKSGTQVVRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPEGSWYCP 259
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
NC C IC C QCE ++H
Sbjct: 260 NCT--------------------------CWICGDLVNDKEASSSVGAYKCLQCEHKYHG 293
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
C + G + + D WFC C +++ L V + + H ++ +
Sbjct: 294 AC--QQGKQTHEGLVSDAWFCSGSCQEVYSGLHSRVGINNPIADGFCWTLLRCIHEDQKV 351
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--G 725
L ++ KA+C S L+ A I ECF ++ G D+IP +Y G
Sbjct: 352 L-------SAQRLALKAEC-----NSKLAVALTIMEECFQSMVDPRTGIDMIPHALYNWG 399
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
+ + F G Y+V+L V+VSA +R+ G VAE+PL+ATC Y+ +G + L + I
Sbjct: 400 SDFARLNFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSNYRRQGMCRHLMTAI 459
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E +L S VE LV+ A WTK FGF +S++ + +F GT +L+K++
Sbjct: 460 EEMLISYKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSLNK-INFMVFPGTILLKKQL 518
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 12/105 (11%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQL-------DGIVN 245
L RN+ K+ K V ++ + + + ++D + + R L DG+V
Sbjct: 89 LLPRNLG-KLGKHYVGGKWSRMGSRTVLSWLIDAGVLSVKDVVQYRNLKDDFVIKDGVVT 147
Query: 246 GGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPI 290
G +C C CN V+S +F+ HAG K P +++++E+GKP
Sbjct: 148 KDGIMCKC--CNM--VLSVTKFKSHAGFKLNRPCSNLFMESGKPF 188
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 191/415 (46%), Gaps = 64/415 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 961 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 1019
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 1020 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 1074
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
C IC + C QCE ++H+ CL++
Sbjct: 1075 ---------------------CRICGDLVKDREASSSFLALKCSQCEHKYHMPCLKEK-- 1111
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
C +KE+ D FC ++C I++ LQ FV++ A + L I+ D
Sbjct: 1112 C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIHD---------DQK 1161
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISG 730
++ Q ++ KA+C S L+ A I ECF ++ G D+IP ++Y R + +
Sbjct: 1162 VHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFAR 1215
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG + L + IE++L
Sbjct: 1216 LNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLK 1275
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
S+ VE +V+ A WT FGF+ + + ++ L +F GT +L+K +
Sbjct: 1276 SVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPGTILLKKSL 1329
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 126/229 (55%), Gaps = 25/229 (10%)
Query: 106 ETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHVSSSSGIRKITFKFSKRKEDY 165
ETLS GD + E++ + R NN + S + + P H+ + + F R Y
Sbjct: 317 ETLSFGDCQKETAMGSSVRVSNNYENFSHDPAITKDPLHIEAEENM-----SFECRNPPY 371
Query: 166 VAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILD 225
+P D +L + + K +KK N +P+NVK LLSTGI D
Sbjct: 372 ASPRV-------------------DTLLVPKIKDTKTAKKGSTNTFPSNVKSLLSTGIFD 412
Query: 226 GACVKYISTSRE-RQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYL 284
G VKY S SRE R L G++ G GYLCGC C +KV++A+EFEQHA KT+HPNNHIY
Sbjct: 413 GVTVKYYSWSREQRNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYF 472
Query: 285 ENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLRK 333
ENGK IY ++QELK P L + ++ V GS N +F WKAS+H+ +
Sbjct: 473 ENGKTIYGVVQELKNTPQEKLFDAIQNVTGSDINHKNFNTWKASYHVAR 521
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 52/122 (42%), Gaps = 9/122 (7%)
Query: 421 FLPNGLPDGERLTYIV--KGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAAR 474
L G+ DG + Y + QR G +G G +C C NK ++ +FE HA +
Sbjct: 405 LLSTGIFDGVTVKYYSWSREQRNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTK 464
Query: 475 RQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC 534
P HIY NG T++ + L + D + +V G N N ++H A
Sbjct: 465 -HPNNHIYFENGKTIYGVVQELKNTPQEKL--FDAIQNVTGSDINHKNFNTWKASYHVAR 521
Query: 535 LD 536
L+
Sbjct: 522 LE 523
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 126/420 (30%), Positives = 197/420 (46%), Gaps = 59/420 (14%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCNK +S S+F+ HAG + P +++ +G A S
Sbjct: 648 VKTGLVTKDGVVCTCCNKTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 706
Query: 498 MGQRRT---------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR +DD C VCGDG L+ C+ CP FH ACL ++PE W C
Sbjct: 707 YKARRNGWRLEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCS 766
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
+C S L + AE RS DF C QC ++H
Sbjct: 767 SCTCWICSE----------LVSDNAE-------RSQDFK-----------CSQCAHKYHG 798
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
CL+ G+ +++ + +FC +C +++ L V S S I K
Sbjct: 799 TCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWS------ILKCF 850
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--G 725
DG ++ + ++ KA+C S L+ A +I E F ++ G D+IP ++Y G
Sbjct: 851 QEDGMVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWG 904
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
+ +F G Y+V++ V++S +R+ G +AE+PLVATC +Y+ +G + L + I
Sbjct: 905 STFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILVAAI 964
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E +L SL VE LV+ A WT+ FGF+ M E +R L +F GT++L+K +
Sbjct: 965 EEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLMVFPGTTLLKKTL 1023
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 191/415 (46%), Gaps = 64/415 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 897 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 955
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 956 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 1010
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
C IC + C QCE ++H+ CL++
Sbjct: 1011 ---------------------CRICGDLVKDREASSSFLALKCSQCEHKYHMPCLKEK-- 1047
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
C +KE+ D FC ++C I++ LQ FV++ A + L I+ D
Sbjct: 1048 C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIHD---------DQK 1097
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISG 730
++ Q ++ KA+C S L+ A I ECF ++ G D+IP ++Y R + +
Sbjct: 1098 VHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFAR 1151
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG + L + IE++L
Sbjct: 1152 LNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLK 1211
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
S+ VE +V+ A WT FGF+ + + ++ L +F GT +L+K +
Sbjct: 1212 SVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPGTILLKKSL 1265
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 77/142 (54%), Positives = 104/142 (73%), Gaps = 9/142 (6%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRE---------RQLDGIVNGGGY 249
E KMSKK PN +P+NV+ L+STG+LDG VKY+S SRE ++L GI+ G GY
Sbjct: 450 EFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHGIIKGSGY 509
Query: 250 LCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVV 309
LCGC CNF+KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + +
Sbjct: 510 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFBAI 569
Query: 310 KKVAGSSFNEGSFRVWKASHHL 331
+ V GS N+ SFR+WK + ++
Sbjct: 570 QTVTGSPINQKSFRIWKVNEYI 591
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 434 YIVKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
YI ++ G +G+G +C C NK ++ +FE HAG + P HIY NG T+
Sbjct: 492 YICAHKQELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTK-HPNNHIYFENGKTI 550
Query: 490 HDIAISL 496
+ I L
Sbjct: 551 YQIVQEL 557
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 191/415 (46%), Gaps = 64/415 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGG-- 506
+GIVC CC + S F+ HAG R P R+++ +G + + + + G
Sbjct: 800 DGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKVRKGGI 858
Query: 507 ----------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG L+ C+ CP FH ACL +PE W CPNC
Sbjct: 859 KNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT----- 913
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
C IC + C QCE ++H+ CL++
Sbjct: 914 ---------------------CRICGDLVKDREASSSFLALKCSQCEHKYHMPCLKEK-- 950
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQD---FVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
C +KE+ D FC ++C I++ LQ FV++ A + L I+ D
Sbjct: 951 C-VKEVGGDARFCGENCQEIYSGLQGLLGFVNHIADGFTWTLLRCIHD---------DQK 1000
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISG 730
++ Q ++ KA+C S L+ A I ECF ++ G D+IP ++Y R + +
Sbjct: 1001 VHSSQ-KLALKAEC-----NSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFAR 1054
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
F G Y+V+L +VS +R+ G VAE+PL+AT +++ KG + L + IE++L
Sbjct: 1055 LNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLK 1114
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
S+ VE +V+ A WT FGF+ + + ++ L +F GT +L+K +
Sbjct: 1115 SVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-INLMVFPGTILLKKSL 1168
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 126/420 (30%), Positives = 197/420 (46%), Gaps = 59/420 (14%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCNK +S S+F+ HAG + P +++ +G A S
Sbjct: 604 VKTGLVTKDGVVCTCCNKTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 662
Query: 498 MGQRRT---------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR +DD C VCGDG L+ C+ CP FH ACL ++PE W C
Sbjct: 663 YKARRNGWRLEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCS 722
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
+C S L + AE RS DF C QC ++H
Sbjct: 723 SCTCWICSE----------LVSDNAE-------RSQDFK-----------CSQCAHKYHG 754
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
CL+ G+ +++ + +FC +C +++ L V S S I K
Sbjct: 755 TCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWS------ILKCF 806
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--G 725
DG ++ + ++ KA+C S L+ A +I E F ++ G D+IP ++Y G
Sbjct: 807 QEDGMVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWG 860
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
+ +F G Y+V++ V++S +R+ G +AE+PLVATC +Y+ +G + L + I
Sbjct: 861 STFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILVAAI 920
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E +L SL VE LV+ A WT+ FGF+ M E +R L +F GT++L+K +
Sbjct: 921 EEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLMVFPGTTLLKKTL 979
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 75/130 (57%), Positives = 102/130 (78%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K ++K PN +P+NV+ L+STG+LDG VKY+S SRE +L GI+ G GYLCGC CN+
Sbjct: 247 ELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYVSLSRE-ELRGIIKGSGYLCGCQSCNY 305
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L +V++ V G+ N
Sbjct: 306 SKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPIN 365
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 366 QKSFRIWKES 375
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
+ G+ DG + Y+ + G +G+G +C C CN K ++ +FE HAG +
Sbjct: 266 LISTGMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTK-H 324
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 325 PNNHIYFENGKTIYQIVQEL 344
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 76/138 (55%), Positives = 101/138 (73%)
Query: 193 LCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCG 252
+ +N E K +KK N +P+NVK LLSTGI DG VKY+S SRE+ L GI+ G GYLC
Sbjct: 426 IVVKNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCS 485
Query: 253 CPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKV 312
C CN SK ++A+EFE+HAGAKT+HPNNHIY ENGK IY+++QELK +L + ++ V
Sbjct: 486 CDNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNV 545
Query: 313 AGSSFNEGSFRVWKASHH 330
GS+ N+ +FR+WKAS+
Sbjct: 546 TGSTINQKNFRIWKASYQ 563
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRF-GCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
L G+ DG ++ Y+ + G +G G +C C CN K ++ +FE HAG A +
Sbjct: 451 LLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTK 509
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 510 HPNNHIYFENGKTIYAVVQEL 530
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 176/416 (42%), Gaps = 63/416 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC++ I+ S+FE HAG QP +IY G +L + Q
Sbjct: 597 DGIHCSCCDEVITISKFEMHAGSRVG-QPLENIYVHTGSSLLQCLLESWNKQNEPQCKGY 655
Query: 503 ------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+ CP FH +CLD P W C C
Sbjct: 656 NFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------ 709
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR------TVIYCDQCEKEFHVGC 610
C +C DD + CD CE+++H C
Sbjct: 710 --------------------SCKVCGQVTIGLHPMDDHHEAAADVLCKCDLCEEKYHPIC 749
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 670
++ N + + FC C +H LQ + R S + I R ++ +
Sbjct: 750 VQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRRSDVDSDVSL 807
Query: 671 DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRN 727
N+V +K +C E L+ A + ECF P+I G +LI ++Y G N
Sbjct: 808 ---CNEVA----QKIKCNSE-----LAVALFVMDECFLPVIDHRSGINLIHNILYNCGSN 855
Query: 728 ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIER 787
+ F G Y+ IL V+ A LRI G E+AE+P + T Y+ +G + S IE
Sbjct: 856 FTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIES 915
Query: 788 LLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
+L SLNVE LV+PA + W FGF+ + E + R L +F G ML+K
Sbjct: 916 VLSSLNVEKLVIPAISEVRDTWISVFGFKPLD-ETTKQRMRKMSLLVFPGVEMLQK 970
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/438 (27%), Positives = 196/438 (44%), Gaps = 72/438 (16%)
Query: 430 ERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL 489
+++ Y + + + G +GI C CC+K ++ S+FE HAG + QPY++IY +G++L
Sbjct: 636 QKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQNIYLESGVSL 694
Query: 490 ------------HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDP 537
H I +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 695 LQCQIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDI 754
Query: 538 LLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVI 597
++P W CPNC C C S + D +
Sbjct: 755 QMLPLGEWHCPNCT--------------------------CKFCGI--ASGNSEKDDASV 786
Query: 598 Y----CDQCEKEFHVGCLRKNGLCDLKEIPKD-----KWFCCDDCNRIHAALQDFVSNRA 648
Y C+ CEK++H C + ++ +P + FC +C + L+ ++ +
Sbjct: 787 YVLQICNLCEKKYHDSCTK-----EMDNLPNNINTSSLSFCGKECKELSEHLKKYLGTKH 841
Query: 649 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFD 708
+ S S I+R D + ++ +C S L+ A + ECF
Sbjct: 842 ELEAGFSWSLIHR--------IDEDSEAACRGISQRVEC-----NSKLAIALTVMDECFL 888
Query: 709 PII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPL 765
P+I G +LI ++Y G N S + G Y+ L ++++ +R G ++AE+P
Sbjct: 889 PVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTATLERGDEIIASASIRFHGTQIAEMPF 948
Query: 766 VATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLK 825
+ T Y+ +G + LFS IE LCSL VE LV+PA + + WT FGF + E L +
Sbjct: 949 IGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAELTNTWTTVFGFTHLD-ESLRQ 1007
Query: 826 YQRDFQLTIFKGTSMLEK 843
+ + +F G ML K
Sbjct: 1008 EMKSLNMMVFPGIDMLMK 1025
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 9/81 (11%)
Query: 212 PTNVKKLLSTGILDGACVKYISTSRERQ-----LDGIVNGGGYLCGCPLCNFSKVVSAHE 266
P K+ + + ++D V+ + R+ L+G + G CGC C SK+++ +
Sbjct: 615 PYMGKRTVLSWLIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGC--C--SKILTVSK 670
Query: 267 FEQHAGAKTRHPNNHIYLENG 287
FE HAG+K P +IYLE+G
Sbjct: 671 FELHAGSKLPQPYQNIYLESG 691
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 176/416 (42%), Gaps = 63/416 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRR------ 502
+GI C CC++ I+ S+FE HAG QP +IY G +L + Q
Sbjct: 570 DGIHCSCCDEVITISKFEMHAGSRVG-QPLENIYVHTGSSLLQCLLESWNKQNEPQCKGY 628
Query: 503 ------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+ CP FH +CLD P W C C
Sbjct: 629 NFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------ 682
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR------TVIYCDQCEKEFHVGC 610
C +C DD + CD CE+++H C
Sbjct: 683 --------------------SCKVCGQVTIGLHPMDDHHEAAADVLCKCDLCEEKYHPIC 722
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 670
++ N + + FC C +H LQ + R S + I R ++ +
Sbjct: 723 VQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRRSDVDSDVSL 780
Query: 671 DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--GRN 727
N+V +K +C E L+ A + ECF P+I G +LI ++Y G N
Sbjct: 781 ---CNEVA----QKIKCNSE-----LAVALFVMDECFLPVIDHRSGINLIHNILYNCGSN 828
Query: 728 ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIER 787
+ F G Y+ IL V+ A LRI G E+AE+P + T Y+ +G + S IE
Sbjct: 829 FTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIES 888
Query: 788 LLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
+L SLNVE LV+PA + W FGF+ + E + R L +F G ML+K
Sbjct: 889 VLSSLNVEKLVIPAISEVRDTWISVFGFKPLD-ETTKQRMRKMSLLVFPGVEMLQK 943
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 189/418 (45%), Gaps = 63/418 (15%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC S F HAG ++ +P I+ +G +L D + + R
Sbjct: 716 GIRCTCCQNLYGLSGFANHAGGSSNCRPSACIFLKDGRSLLDCMMEVMQDHRTREITEKP 775
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D++C VC G L+LC+ CP A+H CL+ IP+ W CP+CR
Sbjct: 776 HNDLFEGENDNICSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPSCR------- 828
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
C IC + +D + C QCE ++HV CLR NG D
Sbjct: 829 -------------------CGICGQNKIEET--EDGHFLTCIQCEHKYHVECLR-NGEKD 866
Query: 619 LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDGTMNDV 677
WFC ++C R++ LQ+ + + + A +L+ K++ G +D+
Sbjct: 867 DSRRCMKNWFCGEECERVYTGLQNLLG-KPVLVGADNLTWTLVKYVNSETCGVGGAESDL 925
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVYGR--NISGQEFG 734
+ K LS A ++ ECF+P+ RD++ +++ + ++ F
Sbjct: 926 VVENYSK-----------LSVALSVMHECFEPLHNPFSSRDIVEDVIFNQRSELNRLNFQ 974
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+V+L ++S +RIFG ++AE+PLV T +Y+ G + L +E+ L L V
Sbjct: 975 GFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRFQYRRLGMCRVLMDELEKKLKQLGV 1034
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGTSMLEKKVQCLP 849
E LVLPA WT FGF +M+ R + L D+ F+GT M +K + P
Sbjct: 1035 ERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFL----DYSFLDFQGTVMCQKLLTRFP 1088
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 75/130 (57%), Positives = 100/130 (76%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
++K KKV N +P+NV+ LLSTG+LDG VKYI+ SRE +L G++ G GYLCGC CNF
Sbjct: 269 DLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKYIAWSRE-ELRGVIKGSGYLCGCQTCNF 327
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HA KT+HPNNHIY ENGK +Y I+QEL++ P +L EV++ + GS N
Sbjct: 328 SKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLFEVIQTITGSPIN 387
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 388 QKSFRLWKES 397
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
L G+ DG + YI + G +G+G +C C CN K I+ +FE HA +
Sbjct: 288 LLSTGMLDGVPVKYIAWSREELRGVIKGSGYLCGCQTCNFSKVINAYEFERHAD-CKTKH 346
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 347 PNNHIYFENGKTVYGIVQEL 366
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 74/131 (56%), Positives = 100/131 (76%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
E+K +KK PN +P+NV+ L+STGILDG VKY+S SRE +L GI+ G GYLCGC CN
Sbjct: 315 QELKTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSRE-ELRGIIKGSGYLCGCQSCN 373
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V G+
Sbjct: 374 YTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGAPI 433
Query: 318 NEGSFRVWKAS 328
N+ +FR WK S
Sbjct: 434 NQKAFRNWKES 444
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 335 LISTGILDGVPVKYVSVSREELR-GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTK- 392
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 393 HPNNHIYFENGKTIYQIVQEL 413
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 100/129 (77%), Gaps = 1/129 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K +K PN +P+NV+ L+STG+LDG VKYI+ SRE +L G++ G GYLC C CN+
Sbjct: 284 EVKTGRKEAPNSFPSNVRSLISTGMLDGVPVKYIALSRE-ELRGVIKGSGYLCSCQSCNY 342
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L +V++ V G+ N
Sbjct: 343 SKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPIN 402
Query: 319 EGSFRVWKA 327
+ SFR+WKA
Sbjct: 403 QKSFRIWKA 411
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 421 FLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQ 476
+ G+ DG + YI + G +G+G +C C CN K ++ +FE HAG +
Sbjct: 303 LISTGMLDGVPVKYIALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKT-KH 361
Query: 477 PYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 362 PNNHIYFENGKTIYQIVQEL 381
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 181/413 (43%), Gaps = 60/413 (14%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K ++ S+FE HAG + RQP+++I +G++L + Q +
Sbjct: 429 DGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERSGF 487
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+GCP FH +CL+ ++P W CPNC
Sbjct: 488 HPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCT----- 542
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTV---IYCDQCEKEFHVGCLRK 613
C C D S A DD TV + C CEK++H C++
Sbjct: 543 ---------------------CKFCGMADGSNAE-DDTTVSELVTCSLCEKKYHTSCIQG 580
Query: 614 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
FC C + LQ F+ + + S S I+R G
Sbjct: 581 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHRTDPGSDTSVRGF 640
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVYGR--NISG 730
V+ S L+ A + ECF I+ +LI ++Y R N +
Sbjct: 641 PQRVE-------------SNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNR 687
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
+ G Y+ IL ++ A +RI G ++AE+P + T Y+ +G + LF IE LC
Sbjct: 688 LNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALC 747
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
SL VE L++PA + WT FGF + E + R + +F GT ML+K
Sbjct: 748 SLKVEMLIIPAISELMHTWTVGFGFNPL-EESHKQELRSLNMLVFPGTDMLQK 799
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 181/413 (43%), Gaps = 60/413 (14%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K ++ S+FE HAG + RQP+++I +G++L + Q +
Sbjct: 597 DGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDAWNRQEESERSGF 655
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG +L+ C+GCP FH +CL+ ++P W CPNC
Sbjct: 656 HPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCT----- 710
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTV---IYCDQCEKEFHVGCLRK 613
C C D S A DD TV + C CEK++H C++
Sbjct: 711 ---------------------CKFCGMADGSNAE-DDTTVSELVTCSLCEKKYHTSCIQG 748
Query: 614 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT 673
FC C + LQ F+ + + S S I+R G
Sbjct: 749 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHRTDPGSDTSVRGF 808
Query: 674 MNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVYGR--NISG 730
V+ S L+ A + ECF I+ +LI ++Y R N +
Sbjct: 809 PQRVE-------------SNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNR 855
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
+ G Y+ IL ++ A +RI G ++AE+P + T Y+ +G + LF IE LC
Sbjct: 856 LNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALC 915
Query: 791 SLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
SL VE L++PA + WT FGF + E + R + +F GT ML+K
Sbjct: 916 SLKVEMLIIPAISELMHTWTVGFGFNPL-EESHKQELRSLNMLVFPGTDMLQK 967
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/409 (30%), Positives = 184/409 (44%), Gaps = 56/409 (13%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 556 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 614
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 615 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 667
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ A G + G + +++ C CE+ +H CL
Sbjct: 668 ---TCKFCDAAVASGGKDGNSI---------------SLLSCGMCERRYHQLCLNDEAH- 708
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
++ FC C + LQ ++ + + S S I+R T +D
Sbjct: 709 KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD---------TDSDT 759
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFG 734
QM AQ E K L+ AI ECF PI+ G DLI ++Y G N + +
Sbjct: 760 NSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYT 815
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+ IL ++SA LR G ++AE+P + T Y+ +G + LF IE + SL V
Sbjct: 816 GFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 875
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
E LV+PA WT FGF + + + K R +F G ML+K
Sbjct: 876 EKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDMLQK 923
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 177/398 (44%), Gaps = 75/398 (18%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL-------HDIAISLAMGQR 501
+GIVC+CC + S FEAHAG R +P I +G +L D S +G+
Sbjct: 417 DGIVCNCCRVNFTVSGFEAHAG-CTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEA 475
Query: 502 RTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
+ +D++C +CG G +L+LC+ CP AFH CL +P+ W CP C
Sbjct: 476 NSEANNDNVCSICGFGGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTC----------- 524
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAATFD--DRTVIYCDQCEKEFHVGCLRKNGLCDL 619
C IC D + + C QCE++FH GC++
Sbjct: 525 ---------------CCKICYRPKCKQECADGNENNFLVCVQCEQKFHFGCVKTTRFGSS 569
Query: 620 ---KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND 676
I K WFC C + L+ + + ++
Sbjct: 570 HTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKV-----------------------ADN 606
Query: 677 VQWQMLKKAQCFEE----------KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 725
+ W +LK ++ +EK L++A + E F+P I A GR+LI +V+
Sbjct: 607 INWTLLKNVSSDDDGGDFTSNEFSQEKHKLNAALGVLYEGFNPTIDALSGRELIKDLVFS 666
Query: 726 RNISGQE--FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
R+ + F G Y+VIL V+S +RIFG++VAE+ VAT +++G+G + L
Sbjct: 667 RDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMCRLLMD 726
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
+E L L V LVL ++E A + WTK FGF +M+ E
Sbjct: 727 ELEEQLTRLGVGRLVLHSSEDAINTWTKSFGFARMTSE 764
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 74/136 (54%), Positives = 101/136 (74%), Gaps = 1/136 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N E+K SKK+ PN +P+NVK LLSTG+LDG VKY+S SRE+ L G + G GYLC C
Sbjct: 406 ASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKTLRGTIKGTGYLCSC 465
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
+C +KV++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P +L ++ V
Sbjct: 466 KVCG-NKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFNAIETVT 524
Query: 314 GSSFNEGSFRVWKASH 329
GS+ N+ +F WKAS+
Sbjct: 525 GSAINQKNFLSWKASY 540
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYIV--KGQRLRFGCKQGNGIVCDC---CNKEISPSQFEAHAGMAARR 475
L GL DG + Y+ + + LR G +G G +C C NK ++ +FE HA +
Sbjct: 430 LLSTGLLDGVAVKYVSWSREKTLR-GTIKGTGYLCSCKVCGNKVLNAYEFERHAN-CKTK 487
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ + L
Sbjct: 488 HPNNHIYFENGKTIYAVVQEL 508
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 124/409 (30%), Positives = 184/409 (44%), Gaps = 56/409 (13%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 556 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 614
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 615 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 667
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ A G + G + +++ C CE+ +H CL
Sbjct: 668 ---TCKFCDAAVASGGKDGNFI---------------SLLSCGMCERRYHQLCLNDEAH- 708
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
++ FC C + LQ ++ + + S S I+R T +D
Sbjct: 709 KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD---------TDSDT 759
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFG 734
QM AQ E K L+ AI ECF PI+ G DLI ++Y G N + +
Sbjct: 760 NSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYT 815
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+ IL ++SA LR G ++AE+P + T Y+ +G + LF IE + SL V
Sbjct: 816 GFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 875
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
E LV+PA WT FGF + + + K R +F G ML+K
Sbjct: 876 EKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDMLQK 923
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 124/425 (29%), Positives = 192/425 (45%), Gaps = 69/425 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAMG---QRR 502
+GI C CCN+ S +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 930 DGINCSCCNEVYSVLEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDVERQ 987
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
+DD C +CGDG NL+ C+GCP FH +CL ++P W C NC
Sbjct: 988 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCCANC---- 1043
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRSH--DFSAATFD-DRTVIYCDQCEKEFHVGCL 611
C C H D + T D D ++ C QCE+++H C
Sbjct: 1044 ----------------------SCKFCHEHSSDGAEDTADVDYSLHTCSQCEEQYHEACS 1081
Query: 612 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
+ FC C + LQ+ ++ + P S + R H
Sbjct: 1082 PETDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIH-------- 1133
Query: 672 GTMNDVQWQML---KKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--G 725
DV ++L K+ +C S ++ A ++ ECF PII + G +LI +VY G
Sbjct: 1134 ---EDVPEEVLALDKRVEC-----NSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCG 1185
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
N + +F G Y IL +++A +RI G ++AE+P + T Y+ +G + L I
Sbjct: 1186 SNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGI 1245
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK- 844
E +L SLNVE L++PA + WT KFGF + + + + +F GT +L+K
Sbjct: 1246 EMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEKQEV-KSISMLVFPGTGLLQKPL 1304
Query: 845 VQCLP 849
++ LP
Sbjct: 1305 LKALP 1309
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 100/130 (76%), Gaps = 2/130 (1%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KM +K PN +P+NV+ L+STG+LDG VKY+S +RE +L GI+ G GYLCGC CNF
Sbjct: 315 EYKM-RKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTRE-ELRGIIKGSGYLCGCQSCNF 372
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SK+++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ + G+ N
Sbjct: 373 SKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPIN 432
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 433 QKSFRIWKES 442
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 333 LISTGMLDGVPVKYVSVTREELR-GIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTK- 390
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 391 HPNNHIYFENGKTIYQIVQEL 411
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 100/130 (76%), Gaps = 2/130 (1%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E KM +K PN +P+NV+ L+STG+LDG VKY+S +RE +L GI+ G GYLCGC CNF
Sbjct: 321 EYKM-RKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTRE-ELRGIIKGSGYLCGCQSCNF 378
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SK+++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ + G+ N
Sbjct: 379 SKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPIN 438
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 439 QKSFRIWKES 448
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + Y+ V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 339 LISTGMLDGVPVKYVSVTREELR-GIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTK- 396
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 397 HPNNHIYFENGKTIYQIVQEL 417
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 121/409 (29%), Positives = 183/409 (44%), Gaps = 56/409 (13%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG---- 505
GI CDCC+K ++ S+FE HAG A QP+++IY +G +L + Q+ T
Sbjct: 539 GIHCDCCSKILTVSRFEIHAGSKAC-QPFQNIYLESGASLLQCQVRAWNMQKDATNVGLH 597
Query: 506 --------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 598 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPNC------- 650
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ A G + G + +++ C CE+ +H CL
Sbjct: 651 ---TCKFCDAAVASGGKDGNFL---------------SLLSCSMCERRYHQLCLSDEAQ- 691
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
++ FC C + LQ ++ + + S S I+R T +D+
Sbjct: 692 KVQSFGSASSFCGPKCLELFEKLQKYLGVKNEIEGGYSWSLIHRVD---------TDSDI 742
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFG 734
Q+ AQ E K L+ AI ECF PI+ G +LI ++Y G N + +
Sbjct: 743 NSQL--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINYT 798
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+ IL ++SA LR G ++AE+P + T Y+ +G + LF IE + SL V
Sbjct: 799 GFYTAILERGDEIISAASLRFHGTQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 858
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
E LV+PA WT FGF + + + K R +F G ML+K
Sbjct: 859 EKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDMLQK 906
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 116/417 (27%), Positives = 183/417 (43%), Gaps = 68/417 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL------------HDIAISL 496
+GI C CC+K ++ S+FE HAG + RQP+++I+ +G++L +S
Sbjct: 516 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSF 574
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ +DD C +CGDG +L+ C+GCP FH +CLD L+ P W CPNC +
Sbjct: 575 HTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKYCG 634
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC------ 610
S+D+ G +E+ C++ CEK+FH C
Sbjct: 635 VA--SIDICQGDNTSVSEISTCIL---------------------CEKKFHESCNLEMDT 671
Query: 611 -LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
+ +GL FC C + +LQ + + + S S I R + +
Sbjct: 672 PVHSSGLV--------TSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRRTSEDSDVS 723
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GR 726
G ++ S L+ A + ECF PI+ G +LI ++Y G
Sbjct: 724 VRGLSQRIE-------------SNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGS 770
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
N + G Y+ IL ++SA +R G ++AE+P + T Y+ +G + LF IE
Sbjct: 771 NFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIE 830
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
L VE L++PA + W FGF + L + R + +F GT ML+K
Sbjct: 831 SALRVFKVEKLIIPAIAELMHTWNVIFGFSPL-EPSLKQEMRLMNMLVFPGTDMLQK 886
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 20/119 (16%)
Query: 185 SSGANDGVLCARN-----MEIKMSKKVVPNE----YPTNVKKLLSTGILDGACV------ 229
SSG+N VL R + ++ S + + +E P K+ L + ++D V
Sbjct: 438 SSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVPYTGKRTLLSWLIDSGTVQLSQKV 497
Query: 230 KYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+Y++ + R L+G + G CGC C SK+++ +FE HAG+K R P +I+LE+G
Sbjct: 498 RYMNRRQTRVMLEGWITRDGIHCGC--C--SKILTVSKFEIHAGSKLRQPFQNIFLESG 552
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/131 (55%), Positives = 99/131 (75%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
E+K +K PN +P+NV+ L+STGILDG VKYIS SRE +L GI+ G GYLCGC CN
Sbjct: 354 QELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSRE-ELRGIIKGSGYLCGCQSCN 412
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++KV++A+EFE+HAG KT+HPNNHIY ENGK IY I+QEL++ P +L + ++ V G+
Sbjct: 413 YTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGAPI 472
Query: 318 NEGSFRVWKAS 328
++ +FR WK S
Sbjct: 473 HQKAFRNWKES 483
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 7/81 (8%)
Query: 421 FLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARR 475
+ G+ DG + YI V + LR G +G+G +C C CN K ++ +FE HAG +
Sbjct: 374 LISTGILDGVPVKYISVSREELR-GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTK- 431
Query: 476 QPYRHIYTSNGMTLHDIAISL 496
P HIY NG T++ I L
Sbjct: 432 HPNNHIYFENGKTIYQIVQEL 452
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/417 (28%), Positives = 184/417 (44%), Gaps = 68/417 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL------------HDIAISL 496
+GI C CC+K ++ S+FE HAG + RQP+++I+ +G++L +S
Sbjct: 634 DGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIFLESGLSLLQCQRDAWNRQEESKLLSF 692
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+ +DD C +CGDG +L+ C+GCP FH +CLD L+ P W CPNC +
Sbjct: 693 HTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKYCG 752
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC------ 610
S+D+ G +E+ C++ CEK+FH C
Sbjct: 753 VA--SIDICQGDNTSVSEISTCIL---------------------CEKKFHESCNLEMDT 789
Query: 611 -LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
+ +GL FC C + +LQ + + + S S I R
Sbjct: 790 PVHSSGLV--------TSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------- 833
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GR 726
T D + +Q E K L+ A + ECF PI+ G +LI ++Y G
Sbjct: 834 ---TSEDSDVSVRGLSQRIESNSK--LAVALTVMDECFLPIVDRRSGINLIHNVLYNCGS 888
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
N + G Y+ IL ++SA +R G ++AE+P + T Y+ +G + LF IE
Sbjct: 889 NFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIE 948
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
L VE L++PA + W FGF + L + R + +F GT ML+K
Sbjct: 949 SALRVFKVEKLIIPAIAELMHTWNVIFGFSPL-EPSLKQEMRLMNMLVFPGTDMLQK 1004
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 62/119 (52%), Gaps = 20/119 (16%)
Query: 185 SSGANDGVLCARN-----MEIKMSKKVVPNE----YPTNVKKLLSTGILDGACV------ 229
SSG+N VL R + ++ S + + +E P K+ L + ++D V
Sbjct: 556 SSGSNSRVLHGRKGRKLGLLVRGSSRGLDSENDGYVPYTGKRTLLSWLIDSGTVQLSQKV 615
Query: 230 KYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+Y++ + R L+G + G CGC C SK+++ +FE HAG+K R P +I+LE+G
Sbjct: 616 RYMNRRQTRVMLEGWITRDGIHCGC--C--SKILTVSKFEIHAGSKLRQPFQNIFLESG 670
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 260 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 318
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 319 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 378
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 379 TVFGSPINQKAFRIWKES 396
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 224 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 280
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 281 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 339
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 340 GCKTK-HPNNHIYFENGKTIYQIVQEL 365
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGKTIYQIVQEL 375
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 99/130 (76%), Gaps = 1/130 (0%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
E+K ++K PN +P+NV+ L+STG+LDG VKYIS SR ++L GI+ G GYLCGC CN+
Sbjct: 310 ELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYISLSR-KELRGIIKGSGYLCGCQSCNY 368
Query: 259 SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFN 318
SKV++A+EFE+HAG KT+HPNNHI ENGK IY I+QEL+ P +L + ++ V G+ N
Sbjct: 369 SKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTVFGAPIN 428
Query: 319 EGSFRVWKAS 328
+ SFR+WK S
Sbjct: 429 QKSFRIWKES 438
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 134/490 (27%), Positives = 206/490 (42%), Gaps = 81/490 (16%)
Query: 385 EMLEERKRGVKRPFMHQKRTAEGGTKKRDNDLH-----RLLFLPNGLPDGERLTYIVKGQ 439
E LE+ + G KR RTA G R + + N + GE+++Y +G
Sbjct: 481 ETLEKPRDGQKR---QSSRTAMQGVTPRSSKCKPRCALSWMIDNNLVSPGEKVSY--RGS 535
Query: 440 R----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA-- 493
+ L G GI C+CC K + + FE+HAG R P +I +G +L D
Sbjct: 536 KGPGELTRGRITREGIECNCCQKIFTLTGFESHAGSTNHR-PAANIILEDGRSLLDCQRK 594
Query: 494 -------------ISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI 540
Q + G +D +C VC DG +L++C+ CP FH C+ I
Sbjct: 595 KKPRIKMQRVTREAKWKGRQNQHQGETDYICSVCHDGGDLIVCDHCPSTFHKNCVGLEDI 654
Query: 541 PESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCD 600
PE W + K ++ P D ++ CD
Sbjct: 655 PEGEW----FCPPCCCGICGENKFKYNVQEP--------------------KDSRLLSCD 690
Query: 601 QCEKEFHVGCLRKNGLCDLK-EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
QCE+++H+GCLR G+ LK + PKD WFC + C I LQ + P + T+
Sbjct: 691 QCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVGPDNLTWTL 750
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEK---EKSLLSSATAIFRECFDPII-AECG 715
W+ + C E + S L A + ECF+P G
Sbjct: 751 -------------------WKFMDSDSCDVEAPTGKHSKLDLAVEVIHECFEPATETYTG 791
Query: 716 RDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQ 773
RD+ +++ R N++ F G Y+V+L +++ +R+FG +VAE+PLV T ++
Sbjct: 792 RDIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGTRFLFR 851
Query: 774 GKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLT 833
G + L +E+ L +L VE L+LPA W FGF K++ ++Y D
Sbjct: 852 RLGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYL-DHTFL 910
Query: 834 IFKGTSMLEK 843
F GT +K
Sbjct: 911 DFPGTIKCQK 920
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/138 (54%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENGK IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGKTIYQIVQEL 375
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/426 (29%), Positives = 180/426 (42%), Gaps = 71/426 (16%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQR----RTT 504
+GI CC + SP FEAHAG + + S+ + SL GQR R T
Sbjct: 278 DGIKNSCCQEVFSPRGFEAHAGSS--------FHQSDANIFLEDEGSLLEGQRQMVHRIT 329
Query: 505 GGS------------------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
G S DD+C VC G +L+LC+ CP FH +CL +PE W
Sbjct: 330 GKSFTKESSHGKKSNGDQCNNDDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWF 389
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEF 606
CP+C C IC + F + +D C QCE ++
Sbjct: 390 CPSC--------------------------CCRICGENRFDEYSEEDNFKFSCHQCELQY 423
Query: 607 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 666
HVGCLRK L+ P FC C +I L + + T+ + I +
Sbjct: 424 HVGCLRKQRHVKLETYPDGTRFCSTQCEKIFLGLLKLLGKPIPVGVDNLTWTLLKPTISE 483
Query: 667 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY- 724
D N ++ K L+ A + ECF+PI GRDL+ +++
Sbjct: 484 WFDMDVPDNKALTEVYSK-----------LNIALNVMHECFEPIKEPHTGRDLVEDVIFC 532
Query: 725 -GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
G ++ F G Y V+L ++S +R+ G +VAE+PLV T +Y+ G + L +
Sbjct: 533 RGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEKVAEVPLVGTRSQYRRLGMCRILIN 592
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
IE+ L L VE L LPAA W FGF KM+ L + D+ F+ T M +K
Sbjct: 593 EIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMTDSERLTF-LDYTFLDFQDTVMCQK 651
Query: 844 KVQCLP 849
+ +P
Sbjct: 652 LLMKIP 657
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/441 (29%), Positives = 195/441 (44%), Gaps = 77/441 (17%)
Query: 431 RLTYIVKGQR-----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
++ Y VKG+ L G +GI C+CC S FE HA ++ +P I+ +
Sbjct: 644 KVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHASGSSTCRPSARIFLED 703
Query: 486 GMTLHDIAISLAMGQ--RRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAAC 534
G +L D I + R TTG S D +C VC G L+LC+ CP +FH C
Sbjct: 704 GRSLLDCQIKMMHDHKTRETTGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTC 763
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR 594
L IP W CP+C C IC DD
Sbjct: 764 LGLEDIPNGDWFCPSC--------------------------CCGICGQRKIDG---DDE 794
Query: 595 T--VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI----HAALQDFVSNRA 648
++ C QCE ++HV CL +NG D+ WFC DC +I H L + VS
Sbjct: 795 VGQLLPCIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGV 853
Query: 649 QTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFD 708
+ + + IN E D + +D+ + K L+ A ++ ECF+
Sbjct: 854 DNLTWTLVKFINPDSCEH----DSSKSDLLAESYSK-----------LNLAISVMHECFE 898
Query: 709 PIIAE-CGRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPL 765
P+ RDL+ +++ R ++ F G Y+V+L ++S +R++G++VAE+PL
Sbjct: 899 PLKESLTNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPL 958
Query: 766 VATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RER 822
V T +Y+ +G L +E+ L L VE LVLPA WT+ FGF KM+ R +
Sbjct: 959 VGTRLQYRRRGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQ 1018
Query: 823 LLKYQRDFQLTIFKGTSMLEK 843
L D+ F+G M +K
Sbjct: 1019 FL----DYTFLDFQGAIMCQK 1035
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 186/425 (43%), Gaps = 67/425 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS-----------NGMTLHDIAISLA 497
+GI C CC+K +S +F AHAG PYR+I N + A
Sbjct: 931 DGIDCSCCSKVLSVLEFVAHAGSEV-NTPYRNILVDGQDIDLLHCLINAWNMQSDAEKQD 989
Query: 498 MGQRRTTGG--SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
G +DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 990 FFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWCCANC----- 1044
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSH--DFSAATFD-DRTVIYCDQCEKEFHVGCLR 612
C C H D + T D D ++ C QCE+++H C
Sbjct: 1045 ---------------------SCKFCHEHSNDGAEDTADVDSSLHTCSQCEEQYHEACSP 1083
Query: 613 KNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
+N FC C + LQ+ ++ + P S + R H
Sbjct: 1084 ENDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIH--------- 1134
Query: 673 TMNDVQWQML---KKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GR 726
DV ++L + +C S ++ A ++ ECF PI+ + G +LI +VY G
Sbjct: 1135 --EDVPEEVLPLDTRVEC-----NSKIAVALSLMDECFLPIVDQRTGINLIRNVVYSCGS 1187
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
N + +F G Y IL +++A +RI G ++AE+P + T Y+ +G + L IE
Sbjct: 1188 NFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIE 1247
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK-V 845
+L SLNVE L++PA + WT +FGF + + + + +F GT +L+K +
Sbjct: 1248 MILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSEKEEV-KSISMLVFPGTGLLQKPLL 1306
Query: 846 QCLPE 850
+ LP+
Sbjct: 1307 KALPK 1311
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 198/422 (46%), Gaps = 63/422 (14%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA 497
++ G +G+VC CCN+ +S S+F+ HAG + P +++ +G A S
Sbjct: 636 VKTGLVTKDGVVCTCCNRTVSLSEFKNHAGFN-QNCPCLNLFMGSGKPFASCQLEAWSAE 694
Query: 498 MGQRRTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR S DD C VCGDG L+ C+ CP FH ACL ++PE W
Sbjct: 695 YKARRNGWRSEEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWY-- 752
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
S S + L + E RS DF C QC ++H
Sbjct: 753 --------CSSCSCQICSELVSDNGE-------RSQDFK-----------CSQCAHKYHG 786
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP--ASSLSTINRKHIEK 666
CL+ G+ +++ + +FC +C +++ L S+R I A LS I K
Sbjct: 787 ICLQ--GISKRRKLFPETYFCGKNCEKVYTGL----SSRVGVINPNADGLSW----SILK 836
Query: 667 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY- 724
DG ++ + ++ KA+C S L+ A +I E F ++ G D+IP ++Y
Sbjct: 837 CFQEDGKVHSAR-RLALKAEC-----NSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYN 890
Query: 725 -GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
G N + +F G Y+++L V++S +R+ G VAE+PLVATC +Y+ +G + L +
Sbjct: 891 WGSNFARLDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVATCSKYRRQGMCRILVA 950
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
IE +L SL VE LV+ A WT+ FGF+ M E +R L +F GT +L K
Sbjct: 951 AIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLMVFPGTILLMK 1009
Query: 844 KV 845
+
Sbjct: 1010 TL 1011
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 202/443 (45%), Gaps = 70/443 (15%)
Query: 426 LPDGERLTYI--VKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT 483
+P ++ Y+ K + L G +GI C CC++ + S+FE HAGM +P ++I
Sbjct: 154 VPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLC-EPSQNIIL 212
Query: 484 SNGMTLHDIAISLAMGQRRTTGG------------SDDMCHVCGDGENLLLCNGCPLAFH 531
G++L + Q + +DD C +CGDG +L+ C+GCP FH
Sbjct: 213 ETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFH 272
Query: 532 AACLDPLLIPESGWRCPNCR-------QGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH 584
+CLD P W C C G++ M+ ++D+
Sbjct: 273 QSCLDIQKFPSGDWHCIYCSCKFCGMFSGNTDQMNYNLDV-------------------- 312
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCLR-KNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
+D ++ C CE+++H C + ++ + D P FC C + LQ
Sbjct: 313 -------NDSALLTCQLCEEKYHHMCTQGEDSILDDSSSPS---FCGKTCRELFEQLQML 362
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIF 703
+ + + S + + R + FD ++N + +K +C S L+ A +I
Sbjct: 363 LGVKHELEDGFSWTLVQRTEVG----FDISLNGIP----QKVEC-----NSKLAVALSIM 409
Query: 704 RECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREV 760
ECF PI+ + G +LI ++Y G N + + G ++ IL ++SA +RI G ++
Sbjct: 410 DECFLPIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKL 469
Query: 761 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 820
AE+P + T Y+ +G + L + IE L SLNVE LV+PA + WT FGF+ +
Sbjct: 470 AEMPFIGTRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEV 529
Query: 821 ERLLKYQRDFQLTIFKGTSMLEK 843
K R+ + +F GT ML+K
Sbjct: 530 SS-RKEMRNMNMLVFHGTDMLQK 551
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/434 (28%), Positives = 188/434 (43%), Gaps = 84/434 (19%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG----MTLHDIAISL 496
++ G GIVC+CC K +S S F AHAG + Q ++ +G + L + +
Sbjct: 880 IKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS-HPQSSLGLFLESGKSYTLCLVEAWSAE 938
Query: 497 AMGQRRTTGG--------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+M +R G SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 939 SMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCH 998
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIY-CDQCEKEFH 607
NC C +C FS + I+ C QC +H
Sbjct: 999 NCT--------------------------CQVCGG-PFSEKEVSTFSAIFKCFQCGDAYH 1031
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
C+ + L L++ WFC C I L R H+
Sbjct: 1032 DTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGL--------------------RSHVGTD 1070
Query: 668 ILFDGTMNDVQWQMLK------------KAQCFEEKEKSLLSSATAIFRECFDPII-AEC 714
+ D +D+ W +L+ K C E L + A + ECF ++
Sbjct: 1071 NILD---SDLSWSILRCNNDGQKLHSVQKIACLAECNMKL-AVALTLLEECFIRMVDPRT 1126
Query: 715 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREY 772
G D+IP ++Y G N + ++ G Y+VIL ++ +R+ G + AELP +AT ++
Sbjct: 1127 GVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDF 1186
Query: 773 QGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-SRERLLKYQRDFQ 831
+ +G + L S IE++LCS NV+ LVL A + S W FGF+ + ER K +
Sbjct: 1187 RRQGMCRILMSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAER--KQLHNVN 1244
Query: 832 LTIFKGTSMLEKKV 845
L +F GTS+L K++
Sbjct: 1245 LMLFPGTSLLTKRL 1258
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/125 (64%), Positives = 96/125 (76%), Gaps = 1/125 (0%)
Query: 722 MVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 781
MVYGRN+ GQEFGGMY +L V S VVSA +LRIFG +VAELPLVAT GKG FQ L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 782 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDF-QLTIFKGTSM 840
FSCIERLL +NV++LVLPAAE+AESIWT KFGF +M + L Y+++F Q+ FKGT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 841 LEKKV 845
L K V
Sbjct: 121 LHKLV 125
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/138 (53%), Positives = 99/138 (71%), Gaps = 1/138 (0%)
Query: 191 GVLCARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYL 250
G L E K SKK +P+NV+ L+STG+LDG VKY+S SRE +L G++ G GYL
Sbjct: 270 GSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE-ELRGVIKGSGYL 328
Query: 251 CGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CGC C+F+KV++A+ FE+HAG KT+HPNNHIY ENG+ IY I+QEL+ P IL +V++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQ 388
Query: 311 KVAGSSFNEGSFRVWKAS 328
V GS N+ +FR+WK S
Sbjct: 389 TVFGSPINQKAFRIWKES 406
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 234 VQDPIGTLDIVYDQETGSSQTSSGVVSEQQ---VAKPSLGSLPKTKAEAKSSKKEASTSF 290
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHA 469
+ + G+ DG + Y+ V + LR G +G+G +C C C+ K ++ FE HA
Sbjct: 291 PSNVRSLISTGMLDGVPVKYVSVSREELR-GVIKGSGYLCGCQTCDFTKVLNAYAFERHA 349
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 350 GCKTK-HPNNHIYFENGRTIYQIVQEL 375
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/411 (30%), Positives = 185/411 (45%), Gaps = 56/411 (13%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 38 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 96
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 97 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 149
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ A G + G + +++ C CE+ +H CL
Sbjct: 150 ---TCKFCDAAVASGGKDGNFI---------------SLLSCGMCERRYHQLCLNDEAH- 190
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
++ FC C + LQ ++ + + S S I+R T +D
Sbjct: 191 KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD---------TDSDT 241
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFG 734
QM AQ E K L+ AI ECF PI+ G DLI ++Y G N + +
Sbjct: 242 NSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYT 297
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+ IL ++SA LR G ++AE+P + T Y+ +G + LF IE + SL V
Sbjct: 298 GFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 357
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E LV+PA WT FGF + + + K R +F G ML+K +
Sbjct: 358 EKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDMLQKPL 407
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/411 (30%), Positives = 185/411 (45%), Gaps = 56/411 (13%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----------HDIAISLAMG 499
GI CDCC+K ++ S+FE HAG + QP+++IY +G +L A +LA+
Sbjct: 18 GIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALH 76
Query: 500 Q--RRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
Q +DD C +CGDG +L+ C+GCP +H CL ++P W CPNC
Sbjct: 77 QVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------- 129
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ A G + G + +++ C CE+ +H CL
Sbjct: 130 ---TCKFCDAAVASGGKDGNFI---------------SLLSCGMCERRYHQLCLNDEAH- 170
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
++ FC C + LQ ++ + + S S I+R T +D
Sbjct: 171 KVQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVD---------TDSDT 221
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEFG 734
QM AQ E K L+ AI ECF PI+ G DLI ++Y G N + +
Sbjct: 222 NSQM--SAQRIENNSK--LAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYT 277
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+ IL ++SA LR G ++AE+P + T Y+ +G + LF IE + SL V
Sbjct: 278 GFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 337
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E LV+PA WT FGF + + + K R +F G ML+K +
Sbjct: 338 EKLVIPAIPDFLHAWTGNFGFTPLD-DSVRKEMRSLNTLVFPGIDMLQKPL 387
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 189/427 (44%), Gaps = 76/427 (17%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 727 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 785
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 786 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 845
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH---DFSAATFDDRTVIYCDQCEKEFH 607
C C+ H D D ++ C QCE+++H
Sbjct: 846 --------------------------SCKFCQEHSRQDAQDIAEVDSSLCTCSQCEEKYH 879
Query: 608 VGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
GC + + CDL FC C + L++ ++ + P S I
Sbjct: 880 PGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKDLEPEFSCRII 931
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 718
R H + + + ++ +C S ++ A ++ ECF PI+ + G +L
Sbjct: 932 QRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 978
Query: 719 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
I +VY G N +F G Y +L +++A +RI G ++AE+P + T Y+ +G
Sbjct: 979 IRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQG 1038
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L IE +L SLNVE L++PA + WT KFGF + + + L +F
Sbjct: 1039 MCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEVKSTSML-VFP 1097
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 1098 GTGLLQK 1104
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 142/280 (50%), Gaps = 30/280 (10%)
Query: 578 CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIH 637
CV+C DF F D+TV+ CDQCEKE+H+GCLR++ + D++ +P+ +WFC D+C RI
Sbjct: 1512 CVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVRIR 1571
Query: 638 AALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLS 697
AL + V+ +P + +WQ+L + + ++ LS
Sbjct: 1572 DALGEDVAAGEVLMPGNPA--------------------YRWQIL-RGKNGRQQTWHALS 1610
Query: 698 SATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVIL-------TVKSVVVS 749
+ I +E FDPII G DL+P MV +F GMYS++L + V
Sbjct: 1611 TVLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVL 1670
Query: 750 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 809
A L R+ G +AE+PLVAT + + +G +AL + L L V +VLPA A+ W
Sbjct: 1671 AALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAW 1730
Query: 810 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCLP 849
++ F+ + + + ++ IF T++L + + +P
Sbjct: 1731 -RQLQFQDLDEPSTRVARSEHRMVIFPHTTVLARPLIRVP 1769
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 48/88 (54%), Gaps = 9/88 (10%)
Query: 418 RLLF--LPNGLPDGERLTYIVKGQRLRFGCKQ-------GNGIVCDCCNKEISPSQFEAH 468
R LF LP GL + + Y ++G+RL G Q +GI+C C+ IS S FEAH
Sbjct: 1351 RTLFDGLPGGLKHDDIVWYTLQGKRLLRGVVQLQACGAATSGILCGHCDTIISASAFEAH 1410
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISL 496
AG RR PY I T +GMTL IA L
Sbjct: 1411 AGHKHRRNPYEAILTDDGMTLKRIAERL 1438
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 189/427 (44%), Gaps = 76/427 (17%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 882 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 940
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 941 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 1000
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH---DFSAATFDDRTVIYCDQCEKEFH 607
C C+ H D D ++ C QCE+++H
Sbjct: 1001 --------------------------SCKFCQEHSRQDAQDIAEVDSSLCTCSQCEEKYH 1034
Query: 608 VGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
GC + + CDL FC C + L++ ++ + P S I
Sbjct: 1035 PGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKDLEPEFSCRII 1086
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 718
R H + + + ++ +C S ++ A ++ ECF PI+ + G +L
Sbjct: 1087 QRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 1133
Query: 719 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
I +VY G N +F G Y +L +++A +RI G ++AE+P + T Y+ +G
Sbjct: 1134 IRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQG 1193
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L IE +L SLNVE L++PA + WT KFGF + + + L +F
Sbjct: 1194 MCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEVKSTSML-VFP 1252
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 1253 GTGLLQK 1259
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 191/425 (44%), Gaps = 66/425 (15%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG--MTLHDI-AISLA 497
++ G G+VC+CC K +S S F AHAG + Q ++ +G TL + A S
Sbjct: 904 IKDGLVTWEGVVCNCCKKTLSVSGFMAHAGFS-HPQSSLGLFLESGKSYTLCQVEAWSAE 962
Query: 498 MGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
RR+ SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 963 FMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHPACLSAKELPEGSWYCH 1022
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS--HDFSAATFDDRTVIYCDQCEKEF 606
NC C IC + +TF + C QC +
Sbjct: 1023 NCT--------------------------CQICGGPVSEKEVSTF--SAIFKCFQCGDAY 1054
Query: 607 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLS-TINRKHIE 665
H C+ + L L++ WFC C I L+ V + I S LS +I R +
Sbjct: 1055 HDTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGLRSHVGT--ENILDSELSWSILRCN-- 1109
Query: 666 KGILFDGTMNDVQ-WQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 723
ND Q ++K C E L + A + ECF ++ G D+IP ++
Sbjct: 1110 ---------NDGQKLHSVQKIACLAECNMKL-AVALTLLEECFIRMVDPRTGVDMIPHVL 1159
Query: 724 Y--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 781
Y G N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L
Sbjct: 1160 YNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRIL 1219
Query: 782 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-SRERLLKYQRDFQLTIFKGTSM 840
+ IE++LCS NV+ LVL A + S W FGF+ + ER K + L +F GTS+
Sbjct: 1220 MNIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAER--KQLHNVNLMLFPGTSL 1277
Query: 841 LEKKV 845
L K++
Sbjct: 1278 LTKRL 1282
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 189/427 (44%), Gaps = 76/427 (17%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 863 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 921
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 922 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 981
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH---DFSAATFDDRTVIYCDQCEKEFH 607
C C+ H D D ++ C QCE+++H
Sbjct: 982 --------------------------SCKFCQEHSRQDAQDIAEVDSSLCTCSQCEEKYH 1015
Query: 608 VGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
GC + + CDL FC C + L++ ++ + P S I
Sbjct: 1016 PGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKDLEPEFSCRII 1067
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 718
R H + + + ++ +C S ++ A ++ ECF PI+ + G +L
Sbjct: 1068 QRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 1114
Query: 719 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
I +VY G N +F G Y +L +++A +RI G ++AE+P + T Y+ +G
Sbjct: 1115 IRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQG 1174
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L IE +L SLNVE L++PA + WT KFGF + + + L +F
Sbjct: 1175 MCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEVKSTSML-VFP 1233
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 1234 GTGLLQK 1240
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 105/160 (65%), Gaps = 14/160 (8%)
Query: 182 DLGSSGANDGVLCARNM-------------EIKMSKKVVPNEYPTNVKKLLSTGILDGAC 228
+ GSS + GV+ + + E K SKK +P+NV+ L+STG+LDG
Sbjct: 238 ETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFPSNVRSLISTGMLDGVP 297
Query: 229 VKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGK 288
V Y+S SRE +L G++ G GYLCGC C F+KV++A+ FE+HAG KT+HPNNHIY ENGK
Sbjct: 298 VTYVSISRE-ELRGVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKHPNNHIYFENGK 356
Query: 289 PIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
IY I+QEL+ P IL +V++ V GS N+ +FR+WK S
Sbjct: 357 TIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKES 396
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 16/147 (10%)
Query: 361 LEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQ--KRTAEGGTKKRDNDL-- 416
+++ I + Q+TG QT + E++ V +P + K AE + K++
Sbjct: 224 VQDPIGTLDIVYGQETGSSQTSSGVVSEQQ---VAKPSLEPVPKNKAETKSSKKEASTSF 280
Query: 417 --HRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC----CNKEISPSQFEAHA 469
+ + G+ DG +TY+ + + LR G +G+G +C C K ++ FE HA
Sbjct: 281 PSNVRSLISTGMLDGVPVTYVSISREELR-GVIKGSGYLCGCQTCEFTKVLNAYAFERHA 339
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISL 496
G + P HIY NG T++ I L
Sbjct: 340 GCKT-KHPNNHIYFENGKTIYQIVQEL 365
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 157/612 (25%), Positives = 244/612 (39%), Gaps = 133/612 (21%)
Query: 35 DNVESDSFPN--GKHAKEEALNDDIKSEVSNPVVSPKEFTSSLQDITSQETKLVTESCNQ 92
++ +SDS N G KE+ LN D+ T + D+ + K +S +
Sbjct: 219 NDKQSDSLVNCGGGVEKEKELNGDV--------------TEKMMDVDKTDNKEQLDSLME 264
Query: 93 VLSTTSTGNLSSEETLSDGDERTESSYAETSRTDNNSGDVSKSHVVLEIPKHV------- 145
L+ EE GD +T+S + D + + +P+ V
Sbjct: 265 ---------LAMEELPMSGDCKTKSEAIKA--------DCAYGSALATVPESVENSKVST 307
Query: 146 SSSSGIRKITFKFSKRKEDYVAPLAYEEGRNYTLYDDLGSSGANDGVLCARNMEIKMSKK 205
SS +R+ T K K+ ++ E+ T +D S + +E+KMSKK
Sbjct: 308 SSEKPLRRFTRSCLKPKQQAMSASPAED----TKAEDALESDEASAIGTTSKLEMKMSKK 363
Query: 206 VVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAH 265
V + PT +K LL+TG+L+G V+Y E+ L G++ G G LC C C +KVV+ +
Sbjct: 364 VALVKIPTKLKGLLATGLLEGLPVRYARGRPEKGLQGVIQGSGILCFCQNCGGTKVVTPN 423
Query: 266 EFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVW 325
+FE HAG+ + P +IYL+NGK + ++ K AP L E + R
Sbjct: 424 QFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADAL-------------EAAIRNA 470
Query: 326 KASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMKE 385
+ RK V + K ++ S + C S + S T+ S V D + E
Sbjct: 471 TGAGDARKSTVCLNCKASLPEASFGRPRLQCDS-CMTSKKSQTTPSQVGDANCSRDGQLE 529
Query: 386 ML---------EERKRGV-----------KRPFMHQKRTAEGGT----------KKRDND 415
+ + K G+ + GT ++D
Sbjct: 530 FIFLLNYYWADDLYKLGLPDLRGLQWSPSSNSVLKSTERMSSGTCPPSKVHGRLTRKDLR 589
Query: 416 LHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARR 475
+H+L+F + LPDG L Y V+G+ +SPSQFEAHAG A+RR
Sbjct: 590 MHKLVFEGDVLPDGTALAYYVRGK--------------------VSPSQFEAHAGCASRR 629
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRR-----TTGGSDDMCHVCGDGENLLLCNGCPLAF 530
+P + + + + ++ + G + G D+C +C DG +LL C+ CP AF
Sbjct: 630 KPGWYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADGGDLLCCDNCPRAF 689
Query: 531 HAACLDPLLIPESGWRCPNCRQGH-SSSMSRSVDLKGGLEAPG----------------- 572
H C+ IP W C C RS + G
Sbjct: 690 HTECVSLPNIPRGTWYCKYCENMFLKEKFDRSANAIAAGRVAGIDALEQITKRSIRIVDT 749
Query: 573 --AEVGGCVICR 582
AEVG CV+CR
Sbjct: 750 LHAEVGVCVLCR 761
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 189/427 (44%), Gaps = 76/427 (17%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 335 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 393
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 394 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 453
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH---DFSAATFDDRTVIYCDQCEKEFH 607
C C+ H D D ++ C QCE+++H
Sbjct: 454 --------------------------SCKFCQEHSRQDAQDIAEVDSSLCTCSQCEEKYH 487
Query: 608 VGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
GC + + CDL FC C + L++ ++ + P S I
Sbjct: 488 PGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKDLEPEFSCRII 539
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 718
R H + + + ++ +C S ++ A ++ ECF PI+ + G +L
Sbjct: 540 QRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 586
Query: 719 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
I +VY G N +F G Y +L +++A +RI G ++AE+P + T Y+ +G
Sbjct: 587 IRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQG 646
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L IE +L SLNVE L++PA + WT KFGF + + + + +F
Sbjct: 647 MCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEV-KSTSMLVFP 705
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 706 GTGLLQK 712
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 72/91 (79%), Positives = 82/91 (90%)
Query: 237 ERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
+R+LDGI++GGGYLCGC CNFSKV+SA+EFEQHAGAKTRHPNNHIYLENGKPIYSIIQE
Sbjct: 4 QRELDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQE 63
Query: 297 LKTAPLGILEEVVKKVAGSSFNEGSFRVWKA 327
LKTAPL +++ V+K VAGSS NE FRVWK
Sbjct: 64 LKTAPLSMIDGVIKDVAGSSINEEFFRVWKG 94
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 42/98 (42%), Gaps = 23/98 (23%)
Query: 439 QRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI-- 492
QR G G G +C C CN K +S +FE HAG A R P HIY NG ++ I
Sbjct: 4 QRELDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAG-AKTRHPNNHIYLENGKPIYSIIQ 62
Query: 493 -----AISLAMGQRRTTGGSD-----------DMCHVC 514
+S+ G + GS D+C +C
Sbjct: 63 ELKTAPLSMIDGVIKDVAGSSINEEFFRVWKGDICFIC 100
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/420 (28%), Positives = 186/420 (44%), Gaps = 67/420 (15%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT--- 504
GNGI C CC+K + S FE HAG + P ++IY G +L + Q +
Sbjct: 494 GNGIHCGCCDKIFTISDFELHAG-SKLADPLKNIYVGEGTSLLQCLLDSWNKQDESERKG 552
Query: 505 ---------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+DD C VCGDG +L+ C+GCP FH CLD P W C C
Sbjct: 553 FHFVDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIYC----- 607
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR------TVIYCDQCEKEFHVG 609
C C S S+ DD ++ C CE+++H
Sbjct: 608 ---------------------CCKFCGSVSGSSNQRDDNDELIVSKLLTCQLCEEKYHRS 646
Query: 610 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
C+ N + +D +FC + C + L+ + + + S + I R +
Sbjct: 647 CIEANDAN--TDDSRDVFFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRRSDVG---- 700
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVYGR-- 726
FD + ++ QM+ +C S L+ A +I ECF P I G +LI ++Y R
Sbjct: 701 FDASQ--IKPQMV---EC-----NSKLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGS 750
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
N + + G + IL ++SA +RI G ++AE+P + T Y+ +G + L + +E
Sbjct: 751 NFNRLNYSGFVTAILERGDEIISAASIRIRGNQLAEMPFIGTRYMYRRQGMCRRLLNAVE 810
Query: 787 RLLCSLNVENLVLPAAEKAESIWTKKFGFRKM-SRERLLKYQRDFQLTIFKGTSMLEKKV 845
L SLNVE LV+PA + WT FGF + S + + + ++ L +F ML+KK+
Sbjct: 811 WGLGSLNVELLVIPAISELRETWTSVFGFESLESTSKQILHNKN--LLVFPHVDMLQKKI 868
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/412 (28%), Positives = 181/412 (43%), Gaps = 58/412 (14%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + + FE HA A +I+ +G +L + + +++
Sbjct: 635 GIKCSCCRRVFTINGFEVHAN-GASCSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDML 693
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D C VC G L+LC+GCP AFHA CL +P+ W C +C G
Sbjct: 694 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 753
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
++ + I C QCE ++H CLR +G CD
Sbjct: 754 FLKT------------------------TSTNAKEEKFISCKQCELKYHPSCLRYDGACD 789
Query: 619 -LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
L +I +KWFC DC I L D + + + + L+ + +E + D
Sbjct: 790 SLDKILGEKWFCSKDCEEIFVILYDLIG-KPREVSVEKLTWRLVQSLEPNMYGDDA---- 844
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISGQEFG 734
K + E +LS A + E F+P+ GRDL +++ R F
Sbjct: 845 -----SKIEAAAENH-CILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFKRLNFS 898
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+V+L + +VS +RI G++VAE+P + T +++ +G + L + +E++L L V
Sbjct: 899 GFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELEKVLIDLGV 958
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGTSMLEK 843
E LVLPA + W FGF KM+ R+ LK F L F T + EK
Sbjct: 959 ERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK----FTLLEFGRTILCEK 1006
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 67/137 (48%), Positives = 99/137 (72%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+N E K +KK N +P+NVK LLSTGI DG VKY + SRE+ L G++ G GYLC C
Sbjct: 384 VVKNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYCTWSREKNLQGVIKGTGYLCSC 443
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
+C K ++A+EFE+HAGAK++HPN+HI+ ENGK +Y+++QELK +P +L + ++ V
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 314 GSSFNEGSFRVWKASHH 330
G++ N+ +FR WK+S+
Sbjct: 504 GATINQRNFRTWKSSYQ 520
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 189/427 (44%), Gaps = 76/427 (17%)
Query: 444 GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT--LHDIAISL---AM 498
G +GI C CC+K + +F AHAG + PYR++ T LH + + +
Sbjct: 198 GVTTRDGINCRCCSKVFTVLEFVAHAGGPVSK-PYRNVLVDGLDTDLLHCLINAWDKQSD 256
Query: 499 GQRR--------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+R+ T +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC C
Sbjct: 257 SERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKC 316
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH---DFSAATFDDRTVIYCDQCEKEFH 607
C C+ H D D ++ C QCE+++H
Sbjct: 317 --------------------------SCKFCQEHSRQDAQDIAEVDSSLCTCSQCEEKYH 350
Query: 608 VGCLRK--------NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
GC + + CDL FC C + L++ ++ + P S I
Sbjct: 351 PGCSPETTNTSNVSSQACDL--------FCQQSCRLLFEGLRNLLAVKKDLEPEFSCRII 402
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDL 718
R H + + + ++ +C S ++ A ++ ECF PI+ + G +L
Sbjct: 403 QRIH--------ENVPETVVALDERVEC-----NSKIAVALSLMDECFLPIVDQRTGINL 449
Query: 719 IPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKG 776
I +VY G N +F G Y +L +++A +RI G ++AE+P + T Y+ +G
Sbjct: 450 IRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQG 509
Query: 777 CFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFK 836
+ L IE +L SLNVE L++PA + WT KFGF + + + L +F
Sbjct: 510 MCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSEKQEVKSTSML-VFP 568
Query: 837 GTSMLEK 843
GT +L+K
Sbjct: 569 GTGLLQK 575
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 189/421 (44%), Gaps = 73/421 (17%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAM---GQRR 502
+GI C CC++ IS +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 935 DGINCSCCSEVISVPEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDAERQ 992
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
+DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 993 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNC---- 1048
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRSH--DFSAATFD-DRTVIYCDQCEKEFHVGCL 611
C C H D + T D D ++ C QCE++ C
Sbjct: 1049 ----------------------SCKFCHEHSSDDAEDTADVDSSLHTCSQCEEQCTEACS 1086
Query: 612 RKNGLCDLKEIPKD------KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
D+ I + FC C + LQ+ ++ + P S + R H E
Sbjct: 1087 P-----DIDSIATNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEE 1141
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY 724
+ + + K+ +C S ++ A ++ ECF PI+ + G +LI +VY
Sbjct: 1142 --------VPEEVLALDKRVEC-----NSKIAVALSLMDECFLPIVDQRTGINLIRNVVY 1188
Query: 725 --GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
G N + +F G Y +IL +++A +RI G ++AE+P + T Y+ +G + L
Sbjct: 1189 NCGSNFARLDFRGFYIIILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLV 1248
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
IE +L SLN+E L++PA + WT KFGF + + + + +F GT +L+
Sbjct: 1249 DGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEV-KSVSMLVFPGTGLLQ 1307
Query: 843 K 843
K
Sbjct: 1308 K 1308
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/418 (28%), Positives = 188/418 (44%), Gaps = 65/418 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS--NGMTLHDIAISL---AMGQRR- 502
+GI C CC+K ++ +F AHAG +PYR+I + LH + I+ + +R+
Sbjct: 958 DGINCSCCSKVVTVLEFVAHAG-GQLSKPYRNILVDGLDNDLLHCLIIAWDKQSDSERQA 1016
Query: 503 ----TTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
+T G +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC NC
Sbjct: 1017 FFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCTNC----- 1071
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSH-DFSAATFDDRTVIY-CDQCEKEFHVGCLRK 613
C +C H + A + ++ C QCEK++H C +
Sbjct: 1072 ---------------------SCKLCHEHLNHDAPDNAEIDPLHSCSQCEKKYHPSCSPE 1110
Query: 614 NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASS---LSTINRKHIEKGILF 670
FC C + LQ+ ++ P + + I+ E +
Sbjct: 1111 TEKLSSVSSQAGNHFCQQSCRLLFEELQNLLAVEKDLGPEYACRIIQCIHEDAPETVLDL 1170
Query: 671 DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRN 727
DG + +C S ++ A ++ ECF PI+ + G +LI +VY G N
Sbjct: 1171 DG-----------RVEC-----NSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSN 1214
Query: 728 ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIER 787
+F G Y IL +VSA +RI G ++AE+P + T Y+ +G + L IE
Sbjct: 1215 FLRLDFRGFYIFILERGDEIVSAASVRIHGTKLAEMPFIGTRHMYRRQGMCRRLLDGIEM 1274
Query: 788 LLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+L SL VE L++PA + WT KFGF + + + + +F GT +L+K +
Sbjct: 1275 ILSSLKVEKLIIPAINELVDTWTSKFGFSPLEVSDKQEV-KSINMLVFPGTGLLQKSL 1331
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/125 (63%), Positives = 96/125 (76%), Gaps = 1/125 (0%)
Query: 722 MVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 781
MVYGR++ GQEFGGMY IL V S VVSA +LR+FG+++AELPLVAT GKG FQ L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 782 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSM 840
FSCIERLL L V+ LVLPAAE+AESIWT+KFGF ++ ++L Y+R Q+ FKGTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 841 LEKKV 845
L+K V
Sbjct: 121 LQKTV 125
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 119/413 (28%), Positives = 185/413 (44%), Gaps = 54/413 (13%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI CDCC + + S FEAHAG + QP ++I+ NG +L + Q +
Sbjct: 17 DGIQCDCCGETFAISDFEAHAG-SKSCQPLKNIFLENGPSLLHCQLESWHRQDESDRKGF 75
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG NL+ C+ CP FH +CL+ +P W C C
Sbjct: 76 HFVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYCS----- 130
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
+ + GG C+ + AA ++ C CE+++H C+
Sbjct: 131 --CKFCGMAGG-----------DACQMDENDAAA--RPALLTCCLCEEKYHHSCIPAED- 174
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND 676
+ + FC C +H LQ + + + + + + R + I G
Sbjct: 175 -TINDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRFDVGSDITLSG---- 229
Query: 677 VQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEF 733
M +K +C S ++ A I ECF P+ G +LI +VY G N + +
Sbjct: 230 ----MHRKVEC-----NSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNY 280
Query: 734 GGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLN 793
G + IL V+SA +RI G ++AE+P + T Y+ +G + L IE LCSLN
Sbjct: 281 CGFLTAILERGDEVISAASIRIHGNQLAEMPFIGTRHMYRRQGMCRRLLGAIETALCSLN 340
Query: 794 VENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ-RDFQLTIFKGTSMLEKKV 845
VE LV+PA + WT FGF+++ E L K + R ++ F G ML+K +
Sbjct: 341 VEKLVIPAISELRETWTSVFGFKQL--EGLSKQKMRYMKMVAFPGVDMLQKPL 391
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 128/423 (30%), Positives = 189/423 (44%), Gaps = 60/423 (14%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRH-IYTSNGMTLHDI-AISLAM 498
L+ G +GI+C CC+K +S S F+AHA ++ R + + TL I A +
Sbjct: 577 LKDGWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSLNLCLQSGKSFTLCQIEAWNAEY 636
Query: 499 GQRRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
RR+ G+DD C CGDG LL C+ CP +H +CL +P+ W C N
Sbjct: 637 MDRRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHN 696
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFH 607
C + GC + FSA +I C QC H
Sbjct: 697 CI---------------------CRICGCPVTEKEISSFSA-------IIKCLQCGAAHH 728
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
C+ + G +E+ D+WFC C I+ L V SSL I +
Sbjct: 729 DTCV-EMGATAFEEMDSDEWFCGTHCKEIYLGLHGCVGVE------SSLGDGLSWTILRC 781
Query: 668 ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY-- 724
M+ VQ K A E K L+ A + ECF ++ G ++IP ++Y
Sbjct: 782 NSGGQKMHSVQ----KIAHAIECNSK--LAVALTLMEECFAQMVDTRTGINMIPHVLYNQ 835
Query: 725 GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSC 784
G + + G Y+VIL ++ A +R+ G + AELP +ATCRE++ KG + L +
Sbjct: 836 GSKYARLNYQGFYTVILEKGEEILCAASIRVHGMKAAELPFIATCREHRRKGMCRRLINT 895
Query: 785 IERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEK 843
IE +L S +V+ LVL A + S W FGF+ + E + Q D L +F GTS+L K
Sbjct: 896 IEEMLKSFHVKMLVLSAIPELVSTWVSGFGFKPI--EEYERKQLDTINLMLFPGTSLLIK 953
Query: 844 KVQ 846
++
Sbjct: 954 SLE 956
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 68/131 (51%), Positives = 97/131 (74%), Gaps = 1/131 (0%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCN 257
+IK ++K PN +PTNV+ L+STG+LDG VKY+S +RE +L GI+ G YLCGC CN
Sbjct: 296 QDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVARE-ELRGIIKGTTYLCGCQSCN 354
Query: 258 FSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSF 317
++K ++A EFE+HAG K++HPNNHIY ENGK IY I+QEL++ P L + ++ + G+
Sbjct: 355 YAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTIFGAPI 414
Query: 318 NEGSFRVWKAS 328
N+ +FR+WK S
Sbjct: 415 NQKAFRIWKES 425
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 64/150 (42%), Gaps = 16/150 (10%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKK 411
S + S+ A E+ + ++ V T T E + + K+ +K T +
Sbjct: 256 SSVHVSTTAHEKELEASNSDVVASTPLVTTKKPESVSKNKQDIKSTRKESPNTFPTNVRS 315
Query: 412 RDNDLHRLLFLPNGLPDGERLTYI-VKGQRLRFGCKQGNGIVCDC--CN--KEISPSQFE 466
+ G+ DG + Y+ V + LR G +G +C C CN K ++ +FE
Sbjct: 316 ---------LISTGMLDGVPVKYVSVAREELR-GIIKGTTYLCGCQSCNYAKGLNAFEFE 365
Query: 467 AHAGMAARRQPYRHIYTSNGMTLHDIAISL 496
HAG ++ P HIY NG T++ I L
Sbjct: 366 KHAGCKSK-HPNNHIYFENGKTIYQIVQEL 394
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 115/421 (27%), Positives = 188/421 (44%), Gaps = 80/421 (19%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQP---YRHIYTSNGMTLHDIAISLAMGQRRTT 504
G+GI C+CC K + F+ HA R + +I+ +G +L D + + +
Sbjct: 353 GSGIKCNCCGKVYTLCGFDYHASGKQGRTATSIFSNIFLEDGRSLLDCQMQIMHDHTKNL 412
Query: 505 GG--------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
G +D +C VC G L+LC+ CP +FH +CL + +P+ W C +C
Sbjct: 413 GEEPLERWQSSKDQVENDHICSVCHYGGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSC 472
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
++ G + R D S +D V+ C QCE+++HV C
Sbjct: 473 ---------------------CCKICGQCLKRDSDLS---MEDDGVLDCTQCERKYHVVC 508
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 670
L L+ PK+ WFC C +I L + + + IP
Sbjct: 509 LGNKREECLEYFPKEHWFCSKRCQQIFLGLHELLGKK---IPVG---------------- 549
Query: 671 DGTMNDVQWQMLKKAQCFEEKEKS----------LLSSATAIFRECFDPII-AECGRDLI 719
++++ W +LK Q ++ E S +L+ A + E FDP+ RDL+
Sbjct: 550 ---LHNLTWTLLKSIQFNDQCEASDIEALSENYSMLNIALDMMHEFFDPVEEPHTKRDLL 606
Query: 720 PVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGC 777
+++ + ++ F G Y+V+L +S +R++G +VAE+PLV T +Y+ G
Sbjct: 607 KDVIFSKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGTRFQYRRLGM 666
Query: 778 FQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQ-RDFQLT 833
L + +E+ L L V+ L+LPA A + W FGF K++ R +LL Y DFQ T
Sbjct: 667 CCILMNVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKLTELDRLQLLDYTFLDFQDT 726
Query: 834 I 834
I
Sbjct: 727 I 727
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 184/429 (42%), Gaps = 83/429 (19%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI-----SLAMGQRRT 503
NGI C CCNK + + FE H + R HI +G +L D I + + + +
Sbjct: 828 NGIKCCCCNKLYTINGFEIHVSGTSSRSA-AHILLEDGKSLLDCQILWNKKTRSFKNQAS 886
Query: 504 TGGSDD--------MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
T G D +C +C G L+LC+ CP +FH +CL +PE W CP+C
Sbjct: 887 TCGKGDYSKDENDYICSICHFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSC----- 941
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSA-ATFDDRTVIYCDQCEKEFHVGCLRKN 614
C IC + S A D + C QCE ++HV CLR
Sbjct: 942 ---------------------CCGICGQNKLSEHANIVDGPFLTCYQCECKYHVQCLR-- 978
Query: 615 GLCDLKEIPKDKWFCCDDCNRIHAALQDFV--------SNRAQTI---PASSLSTINRKH 663
G K WFC C +I+ LQ + N ++ P+S + N H
Sbjct: 979 GTKKFGSCSKPHWFCNKHCKQIYWGLQKLLGKSIPVGGDNLTWSLLKSPSSDTNYFNPPH 1038
Query: 664 IEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVM 722
+E T+ + Q S L+ A + ECF+P+ + RD++ +
Sbjct: 1039 LE-------TLTENQ---------------SKLNVALRVMHECFEPVREQHTRRDIVEDV 1076
Query: 723 VYGRN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
++ R + F G Y+V+L +++ +R++G +VAE+PLV T +Y+ G
Sbjct: 1077 IFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGTRFQYRRLGMCHI 1136
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQ-RDFQLTIFK 836
L + +E L L V+ LVLPA WT FGF KM+ R L Y +FQ T+
Sbjct: 1137 LMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFLNYTFLNFQETVMC 1196
Query: 837 GTSMLEKKV 845
+L+ V
Sbjct: 1197 QKFLLKNTV 1205
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 153 bits (387), Expect = 4e-34, Method: Composition-based stats.
Identities = 70/99 (70%), Positives = 76/99 (76%)
Query: 451 IVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDM 510
IVC CC EISPSQFE+HAGM+ARRQPYRHIYTSNG++LHDIAISLA GQ TTG DDM
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
C GDG +L+ C CP AFHAACLD PE W CPN
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHCPN 102
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 113/416 (27%), Positives = 187/416 (44%), Gaps = 65/416 (15%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI C CC+K+ + +F AHAG + PYR++ +G+ + ++ +++
Sbjct: 904 DGIQCRCCSKDFALLEFVAHAGGQVSK-PYRNVLV-DGLDKDLLHCLISAWDKQSDSERQ 961
Query: 504 ------TGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGH 554
T G +DD C +CGDG NL+ C+GCP FH +CL+ +P WRC NC
Sbjct: 962 SFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCANC---- 1017
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C C+ H D + + ++ C QCE+ +H C
Sbjct: 1018 ----------------------CCKFCQEHSNDDAPDIAEVDSLCTCSQCEENYHPVCSP 1055
Query: 613 KNGLCDLKEIPKD--KWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILF 670
+ + +P FC C + LQ+ ++ + P + I H +
Sbjct: 1056 ETE--NPSSVPSQAGDLFCQQSCRLLFEELQNLLAVKKDLEPEFACRIIKCIHED----- 1108
Query: 671 DGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRN 727
+ + + ++ +C S ++ A ++ ECF PII + G +LI +VY G N
Sbjct: 1109 ---VPETALALDERVEC-----NSKIAVALSLMDECFLPIIDQRTGINLIRNVVYNCGSN 1160
Query: 728 ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIER 787
+F G Y IL +VSA +RI G + AE+P + T Y+ +G + L IE
Sbjct: 1161 FLRLDFRGFYIFILERGDEIVSAASVRIHGTKCAEMPFIGTRNMYRRQGMCRRLLDGIEM 1220
Query: 788 LLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
+L SL V+ L++PA + WT KFGF + + + + +F GT +L+K
Sbjct: 1221 ILSSLKVQKLIIPAISELVDTWTSKFGFSPLEVSEKQEV-KSISMLVFPGTGLLQK 1275
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 186/425 (43%), Gaps = 65/425 (15%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL---HDIAISLA 497
L+ G +GI C CC + S+F+ HAG+ + P +++ G + A S+
Sbjct: 905 LKDGNITASGIQCQCCGTTFTMSKFKCHAGLR-QEVPSLNLFLDTGKSYSLCQLQAWSIE 963
Query: 498 MGQRR---------TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
R+ +DD C CGDG L+ C+ CP ++H ACL IP+ W C
Sbjct: 964 QKVRKEHAKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCS 1023
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
+C C IC S + C QCE+++HV
Sbjct: 1024 SCL--------------------------CDICGEVIDSKELVTSLPALDCSQCERQYHV 1057
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
C+ C+ E WFC C+ I+ + V +P H++ +
Sbjct: 1058 KCVSAKVPCN--EDGSGTWFCGRKCHEIYMTFRSRVG-----VP---------DHMDDDL 1101
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSL-LSSATAIFRECFDPII-AECGRDLIPVMVYG- 725
F N+ ++ + E ++ L AT+I ECF PI+ G D+IP ++Y
Sbjct: 1102 CFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNW 1161
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
R+ + G Y+V+L +VS +R+ G +AE+PLVAT +E + +G + L I
Sbjct: 1162 RSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYI 1221
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLE 842
E +L SL VE L+L A WT FGFR++ ++RL K +L GT +L+
Sbjct: 1222 EEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRLSK----VRLAAVPGTVLLK 1277
Query: 843 KKVQC 847
K + C
Sbjct: 1278 KDLLC 1282
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/459 (25%), Positives = 204/459 (44%), Gaps = 81/459 (17%)
Query: 426 LPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAG-------------- 470
L +G ++Y+ K ++ G +GI+C CCN+ S + F+ HAG
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 471 ----MAARRQPYRHIYTS------NGMTLHDIAISLAMGQRRT--------TGGSDDMCH 512
+ ++Q + I + NG D AM + + +DD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VCGDG L+ C+ CP FH CL +PE W CP C +S RS+
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC---CCASCGRSL---------- 179
Query: 573 AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD 632
+D + T ++YCDQCE+E+H C+ + + K D FC
Sbjct: 180 -----------YDPTIQT----EILYCDQCEREYHSNCVPGSAM---KYESSDNQFCSRK 221
Query: 633 CNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE 692
C +I L+ V + S + + +H +D + + + + +
Sbjct: 222 CLKIFRGLRKLVGRVNKVDDMYSWTLLRSEH------YDQSEENSKLESVADL------- 268
Query: 693 KSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVS 749
+ L+ A + +ECF P+I D++ ++Y R + +F G Y+V+L + ++S
Sbjct: 269 NTRLALALTVIQECFRPMIDPRSNIDMVSHILYNRRGEDKRMDFRGFYTVVLEKEQELIS 328
Query: 750 AGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIW 809
+R+ G AE+P + T +Y+ +G + L + I+++L +L V+ LVLPA + W
Sbjct: 329 VASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETW 388
Query: 810 TKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
T FGF+K++ + ++ + + F G+S+L+K + L
Sbjct: 389 TSAFGFQKLTAAQGIQLM-ELNIVTFPGSSVLQKPLTWL 426
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/426 (28%), Positives = 182/426 (42%), Gaps = 88/426 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPY-RHIYTSNGMTLHDI-AISLAMGQRRTTG-- 505
GI+C+CC K +S S F+AHAG R + + TL + A S + RR+
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLGLFLQSGKSYTLCQVEAWSAELMSRRSDAYG 1026
Query: 506 -------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
+DD C CGDG LL C+ CP +H ACL +PE W C NC
Sbjct: 1027 RKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYCHNC-------- 1078
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRS-----HDFSAATFDDRTVIYCDQCEKEFHVGCLRK 613
CRS ++ ++F D ++ C QC +H C+ +
Sbjct: 1079 ---------------------TCRSCGNPVNEKEVSSFSD--ILKCLQCGDAYHNTCIDR 1115
Query: 614 NGL-CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
L D K D WFC C I L V + IN
Sbjct: 1116 VMLPSDGKR--SDTWFCGRYCKEIFMGLHSQVGVE---------NVIN------------ 1152
Query: 673 TMNDVQWQMLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-AECGRDLIP 720
ND+ W +L+ Q +K + L+ A + ECF ++ G D+IP
Sbjct: 1153 --NDLSWTILRCNSDGQRLHSAQKIGLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIP 1210
Query: 721 VMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 778
++Y G N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G
Sbjct: 1211 HVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIRLHGTKAAELPFIATSVDYRRQGMC 1270
Query: 779 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 838
+ L IE++L S +VE LVL A + + W FGF+ + + K R+ L +F G
Sbjct: 1271 RRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFGFKPIEDDE-KKQLRNVNLMLFPGA 1329
Query: 839 SMLEKK 844
S+L K+
Sbjct: 1330 SLLTKR 1335
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 95/137 (69%), Gaps = 1/137 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N +I +KK + +P NVK LLSTG+ DG VKY+S +RE+ + G++ G GYLC C
Sbjct: 456 VSKNKQIN-TKKDPSSHFPLNVKSLLSTGMFDGVPVKYVSWTREKSVRGVIKGSGYLCSC 514
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN S ++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P L EV++ V
Sbjct: 515 KDCNSSNCLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVT 574
Query: 314 GSSFNEGSFRVWKASHH 330
G N+ +F+ WKAS+
Sbjct: 575 GCPINQKNFQTWKASYQ 591
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 177/403 (43%), Gaps = 74/403 (18%)
Query: 438 GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS-NGMTLHDI---- 492
G+++ G G+GI C CCN I+ S+F+ HAG R PY+ I+ S G++L
Sbjct: 179 GEQILQGVLTGDGIWCSCCNTVITVSEFQLHAGDEPNR-PYQRIFISETGLSLLTCQAEA 237
Query: 493 ----------AISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPE 542
L + + DD C VC DG NL+ C+ CP +H +CL P+
Sbjct: 238 WNQQGIPELQGYHLIEPREDVSDKYDDACVVCADGGNLICCDKCPSTYHISCLQMEDEPQ 297
Query: 543 SGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQC 602
WRCP C C C +H F D +V C QC
Sbjct: 298 GEWRCPAC--------------------------ACKFCHTHAF------DISVFTCSQC 325
Query: 603 EKEFHVGCLRKNG--LCDLK-EIPKDKW-FCCDDCNRIHAALQDFVSNRAQTIPASSLST 658
+K++H C R+N L DL + P FC C++I+ L+ V R + + +
Sbjct: 326 DKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDEGLTWTL 385
Query: 659 INRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR-D 717
+ R E G+ + + + + C S ++ A A+ ECF+P+I + +
Sbjct: 386 LRRMDPEAGVYLEESYD--------RTLC-----NSKIAVAVAVMEECFEPVIDRHTQIN 432
Query: 718 LIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
++ ++Y G N F G Y+ IL +S +RI G ++AE+P +AT Y+
Sbjct: 433 VVRSVIYNCGANFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIATRPSYRRL 492
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 818
G L IE +V+ LV+P+ E+ W + +GF+ +
Sbjct: 493 GMCHKLLVAIE------SVQYLVIPSIEQRVRRWEESYGFQAI 529
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 151/547 (27%), Positives = 231/547 (42%), Gaps = 110/547 (20%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGG--T 409
++ SCS+ + P +C + + + + + R V RP Q R EG T
Sbjct: 224 TMASCSNNNNKRPAPPAACKTATSSKKNKKKKASLQQAR---VLRP---QPRNEEGNALT 277
Query: 410 KKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIV---CDCC----NK---- 458
R L LL L ++L Y K G G+G+V C C NK
Sbjct: 278 TARARTLLSLLIDKKILAPRDQLIYTTKR-----GLITGDGMVKCMCGGCINNNNKRRVA 332
Query: 459 EISPSQFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISLAMG--------------- 499
E + ++F H ++ RQP+ ++ +G +L + L M
Sbjct: 333 EYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKY 392
Query: 500 ----------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
+R+ D +C VC D LL+C+ CP FH AC+ P+ W CP
Sbjct: 393 LPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPA 452
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
C + ++ S DL + P + FS+ DR VI C+QC +E+HVG
Sbjct: 453 C----TCAICGSSDLD---DPPATTT-------TQGFSS----DRMVISCEQCRREYHVG 494
Query: 610 CLRK--NGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP-ASSLSTINRKHIEK 666
C+R+ NGL + + W C + C++I+ L++ +A AS LS + + +
Sbjct: 495 CMRERDNGLWYPEADGEGPWLCSEACSKIYLRLEELAVVQAPCRSVASGLSLVV---LRR 551
Query: 667 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 725
G DG EE+E + L A + RECF +I DL +V+
Sbjct: 552 GAARDG----------------EEEEHAKLCMALDVLRECFVTLIEPRTQTDLTADIVFN 595
Query: 726 --RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
+ +F G Y V L +++ LR++G EVAE+PLV T + +G + L
Sbjct: 596 TESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFARRRQGMCRLLMD 655
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWT-KKFGFRKMSRERLLKYQRDFQ------LTIFK 836
I++LL + VE LVLPA + + WT FGFR+M Q D Q + F+
Sbjct: 656 EIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFREMG-------QADRQDVAHHAILRFQ 708
Query: 837 GTSMLEK 843
GT M K
Sbjct: 709 GTIMCHK 715
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 68/137 (49%), Positives = 95/137 (69%), Gaps = 1/137 (0%)
Query: 194 CARNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
++N +I +KK + +P NVK LLSTG+ DG VKY+S +RE+ + G++ G GYLC C
Sbjct: 431 VSKNKQIN-TKKDPSSHFPLNVKSLLSTGMFDGVPVKYVSWTREKSVRGVIKGSGYLCSC 489
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVA 313
CN S ++A+EFE+HA KT+HPNNHIY ENGK IY+++QELK P L EV++ V
Sbjct: 490 KDCNSSNCLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVT 549
Query: 314 GSSFNEGSFRVWKASHH 330
G N+ +F+ WKAS+
Sbjct: 550 GCPINQKNFQTWKASYQ 566
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 140/492 (28%), Positives = 211/492 (42%), Gaps = 104/492 (21%)
Query: 401 QKRTAEGGTKKRDND------LHRLLFLP------NGLPDGERLTYIVKGQRLRFGCKQG 448
Q R A GT +R+ + R L P N DG +T+ R
Sbjct: 221 QPRKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTW-------RGSITNE 273
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMG----QRRTT 504
I CDCC+ S+FEAH G + + +P +I+ +G +L D L Q+ T
Sbjct: 274 GFIKCDCCSNLFPISKFEAHTG-STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETK 332
Query: 505 GG-------------------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
+D +C VC G L+LC+ CP AFH +CL IP W
Sbjct: 333 ATEKKVNHNDNADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNW 392
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC--RSHDF--SAATFDDRTVIYCDQ 601
CP+C C IC ++DF ++FD + + C Q
Sbjct: 393 YCPSC--------------------------CCKICGQVTYDFDDQVSSFDT-SFVRCVQ 425
Query: 602 CEKEFHVGCLRKNGLCD--LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
CE+ H+GC++ + + + I ++ WFC C IH LQ+
Sbjct: 426 CEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNL---------------- 469
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQC-FEEKEKSLLSSATAIFRECFDPIIAECGR-D 717
+ K I ++ W ++K E + L+ A + + F P+ + D
Sbjct: 470 ----LWKQIPVGDARENLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKND 525
Query: 718 LIP-VMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
LI V + R+ S + F G Y+ IL K+ VV+ +R++G EVAE+PLVAT +Y+
Sbjct: 526 LIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRH 585
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQ-RDFQ 831
G + L + +E L + V+ L LPA +A + WTK FGF KM+ R L+KY FQ
Sbjct: 586 GMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIKYTFLGFQ 645
Query: 832 LTIFKGTSMLEK 843
T+ +LEK
Sbjct: 646 HTVRCQKDLLEK 657
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 149 bits (376), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 140/492 (28%), Positives = 211/492 (42%), Gaps = 104/492 (21%)
Query: 401 QKRTAEGGTKKRDND------LHRLLFLP------NGLPDGERLTYIVKGQRLRFGCKQG 448
Q R A GT +R+ + R L P N DG +T+ R
Sbjct: 198 QPRKAAKGTSRRNQRTALSYLVDRELISPGDRVHCNVTRDGRLVTW-------RGSITNE 250
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMG----QRRTT 504
I CDCC+ S+FEAH G + + +P +I+ +G +L D L Q+ T
Sbjct: 251 GFIKCDCCSNLFPISKFEAHTG-STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETK 309
Query: 505 G-------------------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGW 545
+D +C VC G L+LC+ CP AFH +CL IP W
Sbjct: 310 ATEKKVNHNDNADSDTHGLDKNDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNW 369
Query: 546 RCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC--RSHDF--SAATFDDRTVIYCDQ 601
CP+C C IC ++DF ++FD + + C Q
Sbjct: 370 YCPSC--------------------------CCKICGQVTYDFDDQVSSFDT-SFVRCVQ 402
Query: 602 CEKEFHVGCLRKNGLCD--LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI 659
CE+ H+GC++ + + + I ++ WFC C IH LQ+
Sbjct: 403 CEQNVHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNL---------------- 446
Query: 660 NRKHIEKGILFDGTMNDVQWQMLKKAQC-FEEKEKSLLSSATAIFRECFDPIIAECGR-D 717
+ K I ++ W ++K E + L+ A + + F P+ + D
Sbjct: 447 ----LWKQIPVGDARENLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKND 502
Query: 718 LIP-VMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
LI V + R+ S + F G Y+ IL K+ VV+ +R++G EVAE+PLVAT +Y+
Sbjct: 503 LIEDVFLSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRH 562
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQ-RDFQ 831
G + L + +E L + V+ L LPA +A + WTK FGF KM+ R L+KY FQ
Sbjct: 563 GMCRRLLNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIKYTFLGFQ 622
Query: 832 LTIFKGTSMLEK 843
T+ +LEK
Sbjct: 623 HTVRCQKDLLEK 634
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 149 bits (375), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 119/442 (26%), Positives = 188/442 (42%), Gaps = 73/442 (16%)
Query: 434 YIVKGQR---LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLH 490
Y +KG L G +GI C+CC K S S F+AH +P +++ NG +L
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 491 DIAISLAMGQ----------RRTTGGS----------------DDMCHVCGDGENLLLCN 524
+ L + R T GS D +C +C G +L+ C+
Sbjct: 109 SCQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH 584
CP +FHA CL+ +PE W D
Sbjct: 169 RCPSSFHATCLNIERVPEGDWF------CPCCCCGICGD------------------SQF 204
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 644
D A F D +++ C QCE++FH C ++ G+ +E WFCC C + LQ +
Sbjct: 205 DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLL 260
Query: 645 SNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR 704
+ + I + + V + + A+ + S LS A +
Sbjct: 261 GKPILVGHNLTCTLIKPMQYQ-------AEDRVDYDLAAMAENY-----SKLSVALEVMH 308
Query: 705 ECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
ECFDP+ + RDL+ +++ G N++ F G Y+V+L +++ LLRI+G +VA
Sbjct: 309 ECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVA 368
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
E+PL+ T +++ G + L + IE+ L +L V+ LVLPA+ + WT FGF M+
Sbjct: 369 EMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPMTES 428
Query: 822 RLLKYQRDFQLTIFKGTSMLEK 843
L + F F T M +K
Sbjct: 429 DRLDFL-GFTFLDFHDTIMCKK 449
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 65/106 (61%), Positives = 83/106 (78%)
Query: 223 ILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHI 282
+LDG VKYI+ SRE++L GI+ G GYLCGC CNFSKV++A+EFE+HAG KT+HPNNHI
Sbjct: 1 MLDGVPVKYIAWSREKELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHI 60
Query: 283 YLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKAS 328
Y ENGK IY I+QELK+ P L +V++ + GS N+ SFR+WK S
Sbjct: 61 YFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSFRLWKES 106
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 40/75 (53%), Gaps = 8/75 (10%)
Query: 428 DGERLTYIV--KGQRLRFGCKQGNGIVCDC--CN--KEISPSQFEAHAGMAARRQPYRHI 481
DG + YI + + LR G +G+G +C C CN K I+ +FE HAG + P HI
Sbjct: 3 DGVPVKYIAWSREKELR-GIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTK-HPNNHI 60
Query: 482 YTSNGMTLHDIAISL 496
Y NG T++ I L
Sbjct: 61 YFENGKTIYGIVQEL 75
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 109/417 (26%), Positives = 181/417 (43%), Gaps = 69/417 (16%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC------- 692
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRT---VIYCDQCEKEFHVGCLRKN 614
C C + AA + T + C CE+++H C+ ++
Sbjct: 693 -------------------SCKFCEKDE--AAKHETSTLPSLSSCRLCEEKYHQACINQD 731
Query: 615 GLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
G +P ++ FC C + LQ F+ + S S + R + +
Sbjct: 732 GT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFELPSEV--- 783
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--GRNI 728
D+ ++ A+ ++ A ++ ECF P++ G +L+ +VY G N
Sbjct: 784 -ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNF 833
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
+F + +L +++ +RI G ++AE+P + T Y+ +G + L IE
Sbjct: 834 HRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESA 893
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L SL V+ LV+PA + WT FGF ++ + K ++ L +F G ML K +
Sbjct: 894 LGSLKVDKLVIPAVPELIDTWTSGFGFAPVN-DSEKKTIKNLNLLVFPGVDMLGKSL 949
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 166/357 (46%), Gaps = 64/357 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C +CGDG L+ C+ CP ++H ACL IP+ W C +CR
Sbjct: 1000 NDDTCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR--------------- 1044
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIY---CDQCEKEFHVGCLRKNGLCDLKEIP 623
C +C ++ + RT ++ C QCE+++H+ C+ LC+ + P
Sbjct: 1045 -----------CDVCGE---VVSSKEPRTPLHAFECSQCERQYHIKCISGKVLCNEESGP 1090
Query: 624 KDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG----ILFDGTMNDVQW 679
WFC C +I+ +L+ V IP H++ G IL + D +
Sbjct: 1091 G-TWFCGRRCQQIYTSLRSRVG-----IP---------DHLDDGFSCTILHNN--GDQKV 1133
Query: 680 QMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG--RNISGQEFGGM 736
+M E L+ A +I ECF PI G D++P+++Y N ++ G
Sbjct: 1134 RMAADIALLAECNMKLII-ALSILEECFLPIFDPRTGMDIMPLILYNWRSNFVHLDYKGF 1192
Query: 737 YSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVEN 796
Y+++L ++S +R+ G VAE+PL+ATC E + +G + + IE++L SL VE
Sbjct: 1193 YTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMCRRIVDYIEQMLKSLKVEM 1252
Query: 797 LVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLEKKVQCLPE 850
L+L A WT FGFR + +++L K +L GT +L+K + E
Sbjct: 1253 LLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSK----IRLASVPGTVLLKKDLHEFSE 1305
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/421 (26%), Positives = 176/421 (41%), Gaps = 90/421 (21%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARR-------QPYRHIYTSNGMTLHDIAISLAMGQR 501
+GI C CC K + S FE HAG+ + R +P I+ +G +L + I + +
Sbjct: 618 DGIKCSCCGKVYTLSGFELHAGIKSCRSGASKYCKPAASIFLDDGRSLLECQIQMMRDKE 677
Query: 502 RTT---------------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ G+D +C VC G L+LC+ CP +FH CL +P+ W
Sbjct: 678 MSNHKAETPDSLKGSWDRDGNDHVCSVCHYGGELILCDHCPSSFHKRCLGMKDVPDGDWF 737
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFD-DRTVIYCDQCEKE 605
CP+C C IC + T D V+ C QCE +
Sbjct: 738 CPSC--------------------------CCKICGQNKLKKDTKDFIDGVLNCTQCEHQ 771
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIE 665
+H+ CL + K+ PK+ FC C + ++ + + A T+ +
Sbjct: 772 YHIMCLSNSWTDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDAFDDETLVETY-- 822
Query: 666 KGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY 724
S L A + ECF+PI GRDL+ +++
Sbjct: 823 ----------------------------SKLKIALDVVHECFEPIEEPRTGRDLMKDVIF 854
Query: 725 --GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALF 782
G ++ F G Y+++L +VS +RI G +VAE+PLV T +++ G + L
Sbjct: 855 SNGSELNRLNFQGFYTILLEKNDELVSVATVRIHGDKVAEIPLVGTRFQFRQLGMCRILM 914
Query: 783 SCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLE 842
+E+ L L V+ LVLPA + WT FGF KM+ L++ D+ F+ T M +
Sbjct: 915 DVLEKKLMELGVQRLVLPAVPGVLNTWTGSFGFSKMTDSERLQFV-DYTFLDFQDTVMCQ 973
Query: 843 K 843
K
Sbjct: 974 K 974
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 175/397 (44%), Gaps = 58/397 (14%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + S FE HA + R +I+ +G +L + + +++
Sbjct: 635 GIKCSCCRRIFTISGFEVHANGGSCRAAA-NIFLDDGRSLLECQVEAYETRKKAQPPDIL 693
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D +C VC G L+LC+GCP AFHA CL +P+ W C +C
Sbjct: 694 KMKLRQGENDVICSVCHYGGKLILCDGCPSAFHANCLGLEEVPDGDWFCESC-------- 745
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT--FDDRTVIYCDQCEKEFHVGCLRKNGL 616
C C A + + I C QCE ++H CLR +G
Sbjct: 746 ------------------CCGACGQFFLKATSKYAKEEKFISCKQCELKYHPSCLRYDGA 787
Query: 617 CD-LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMN 675
D L +KWFC DC I L + + + + + L+ + E + D +
Sbjct: 788 GDSLDTFLGEKWFCSKDCEEIFVNLCELI-GKPREVGVEKLTWRLVQSFEPNMYGD---D 843
Query: 676 DVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISGQE 732
+ + + + C LS A + E F+P+ GRDL +++ R
Sbjct: 844 AYKIEAVAENHC-------KLSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFKRLN 896
Query: 733 FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSL 792
F G Y+V+L +V+ +RI G++VAE+P + T +++ +G + L + +E++L L
Sbjct: 897 FSGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELEKVLIDL 956
Query: 793 NVENLVLPAAEKAESIWTKKFGFRKMS---RERLLKY 826
VE LVLPA + W FGF KM+ R+ LK+
Sbjct: 957 GVERLVLPAVPCVLNTWINSFGFTKMTISERKEFLKF 993
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 186/427 (43%), Gaps = 62/427 (14%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G GI CDCC+K ++ ++FE HAG + +QPY +I+ +G ++L +
Sbjct: 766 KGTRARLVGRVTREGICCDCCSKILTVAKFELHAG-SKEQQPYANIFLEDGGLSLFQCLL 824
Query: 495 SL--AMGQRRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
+ Q G DD C +CGDG +LL C+ C FH ACL + +P
Sbjct: 825 NAWDKQAQNEKKGFYKIDPADDPDDDTCGICGDGGDLLCCDRCTSTFHVACLG-IEMPSG 883
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCE 603
W C NC C C S + T ++ C QC
Sbjct: 884 DWFCRNCI--------------------------CKFCGSAE--ERTSSPAELLSCLQCS 915
Query: 604 KEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINR 661
+++H C + + P FC C +I+ L+ + + S S +
Sbjct: 916 RKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTKIYKRLKRLLGLKNDLEAGFSWSLVR- 974
Query: 662 KHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIP 720
D + KKAQ K+ L A ++ ECF P I E G ++I
Sbjct: 975 ------CFADTEATSTK----KKAQLVHCNSKTAL--AFSVLDECFLPRIDERSGINIIH 1022
Query: 721 VMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 778
+VY G + S F G Y+ IL V+SA +RI G + AE+P + T Y+ +G
Sbjct: 1023 NVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVRIHGTDFAEMPFIGTRGMYRHQGMC 1082
Query: 779 QALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGT 838
L IE LCSLNV LV+PA + ++ W+ FGF+ + + K + L I GT
Sbjct: 1083 HRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFGFKPVGPTKKQKI-KSVNLLIIHGT 1141
Query: 839 SMLEKKV 845
+LEK++
Sbjct: 1142 GLLEKRL 1148
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 5/62 (8%)
Query: 227 ACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLE 285
A VKY++ R +L G V G C C C SK+++ +FE HAG+K + P +I+LE
Sbjct: 758 AKVKYMNKKGTRARLVGRVTREGICCDC--C--SKILTVAKFELHAGSKEQQPYANIFLE 813
Query: 286 NG 287
+G
Sbjct: 814 DG 815
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 146 bits (368), Expect = 5e-32, Method: Composition-based stats.
Identities = 72/114 (63%), Positives = 84/114 (73%)
Query: 732 EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCS 791
EFGGMY ILTV VVSA R+ G+EVAELPLVAT + QG+G FQAL++CIERLLC
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 792 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L V +LVLPAAE AES+W KF F KM +E L RDFQ+ F+GTSML+K V
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPV 130
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 116/439 (26%), Positives = 190/439 (43%), Gaps = 95/439 (21%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA-------- 497
+ IVCDCC+ + + FE+HAG R +P I +G +L D A+S +
Sbjct: 36 DAIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSV 94
Query: 498 ---------------------MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
+ + + +D++C +CG G +L LC+ CP AFH CL
Sbjct: 95 VNENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSICGFGGDLALCDRCPSAFHLGCLG 154
Query: 537 PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFD--DR 594
+P W CP C C IC D D
Sbjct: 155 LNRVPIGEWFCPTC--------------------------CCKICYRPKCKQECKDHKDN 188
Query: 595 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 654
++ C QCE+++H GC++ G+ + + + WFC C + L+ + +
Sbjct: 189 NILVCVQCEQKYHFGCVKAVGI-EFNHM--ENWFCSVVCGNMFLCLKKLLGKPIKV---- 241
Query: 655 SLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE----KSLLSSATAIFRECFDPI 710
+++ W ++K ++KE +S L+ A + E F+P
Sbjct: 242 -------------------ADNLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPT 282
Query: 711 I-AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 767
A GR+LI +V+ R F G Y+VIL V+S +RI+G++VAE+ VA
Sbjct: 283 FDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVA 342
Query: 768 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ 827
T +Y+ +G L IE+ L L VE L+L ++E A +IWTK FGF +M+ + ++
Sbjct: 343 TKEQYRRQGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWTKSFGFARMASKDKCQFI 402
Query: 828 RDFQLTIFKGTSMLEKKVQ 846
D F+ ++M K ++
Sbjct: 403 -DHTFLEFQNSTMCLKALK 420
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 117/445 (26%), Positives = 190/445 (42%), Gaps = 79/445 (17%)
Query: 434 YIVKGQR---LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLH 490
Y +KG L G +GI C+CC K S + F+AH +P +++ NG +L
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 491 DIAISLAMGQ----------RRTTGGS----------------DDMCHVCGDGENLLLCN 524
+ L + R GS D +C +C G +L+ C+
Sbjct: 109 SCQVELMRKKIMMFDQGPAVRAAGTGSRSKFRSLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 525 GCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSH 584
CP +FHAACL+ +PE W D
Sbjct: 169 RCPSSFHAACLNIESVPEGDWF------CPCCCCGICGD------------------SQF 204
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 644
D A F D +++ C QCE++FH C ++ G+ +E WFCC C + LQ +
Sbjct: 205 DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLL 260
Query: 645 SNR---AQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATA 701
Q + + + + + ++ + + A+ + S LS A
Sbjct: 261 GKPILVGQNLTCTLIKPMQYQAEDRE----------DYDLAAMAENY-----SKLSVALE 305
Query: 702 IFRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGR 758
+ ECFDP+ + RDL+ +++ G N++ F G Y+V+L +++ LLRI+G
Sbjct: 306 VMHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGD 365
Query: 759 EVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 818
+VAE+PL+ T +++ G + L + IE+ L +L V+ LVLPA+ + WT FGF M
Sbjct: 366 KVAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPM 425
Query: 819 SRERLLKYQRDFQLTIFKGTSMLEK 843
+ L + F F T M +K
Sbjct: 426 TESDRLDFL-GFTFLDFHDTIMCKK 449
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 173/412 (41%), Gaps = 84/412 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTT----- 504
GI C CC + + + FE HA A +I+ +G +L + + +++
Sbjct: 626 GIKCSCCRRVFTINGFEVHAN-GASCSGAANIFLDDGRSLLECQVEAYKKRKKAQPPDML 684
Query: 505 ------GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
G +D C VC G L+LC+GCP AFHA CL +P+ W C +C G
Sbjct: 685 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 744
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
++ + I C QCE ++H CLR +G CD
Sbjct: 745 FLKT------------------------TSTNAKEEKFISCKQCELKYHPSCLRYDGACD 780
Query: 619 -LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV 677
L +I +KWFC DC + + + A I A++
Sbjct: 781 SLDKILGEKWFCSKDCE--ESLEPNMYGDDASKIEAAA---------------------- 816
Query: 678 QWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISGQEFG 734
+ +LS A + E F+P+ GRDL +++ R F
Sbjct: 817 -------------ENHCILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFKRLNFS 863
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+V+L + +VS +RI G++VAE+P + T +++ +G + L + +E++L L V
Sbjct: 864 GFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELEKVLIDLGV 923
Query: 795 ENLVLPAAEKAESIWTKKFGFRKMS---RERLLKYQRDFQLTIFKGTSMLEK 843
E LVLPA + W FGF KM+ R+ LK F L F T + EK
Sbjct: 924 ERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK----FTLLEFGRTILCEK 971
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 184/409 (44%), Gaps = 68/409 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 915 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 971
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ R+ +DD C +CGDG L+ C+ CP ++H CL IP+ W
Sbjct: 972 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWY 1031
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEF 606
C C + +L+ L A + C QCE+++
Sbjct: 1032 CYRCLCDICGEVINLKELRSSLPA--------------------------LECAQCERQY 1065
Query: 607 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 666
H C+ LC+ + P WFC C +I+ L+ V TI S T+ R + ++
Sbjct: 1066 HAKCIYGKLLCNEEGGPC-AWFCGRRCQQIYMNLRSRVGIPIHTIDGFS-CTVLRNNGDQ 1123
Query: 667 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 725
+ T D+ A+C + L A +I ECF PII A G D+IP ++Y
Sbjct: 1124 RV---STAADIAIL----AECNMK-----LVIALSIMEECFLPIIDARTGIDIIPPILYN 1171
Query: 726 --RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
+ ++ G Y+V+L ++S +R+ G VAE+PL+ATC E + +G + L
Sbjct: 1172 WRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMD 1231
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGF--------RKMSRERLL 824
IE++L SL VE L+L A WT FGF + +SR RL+
Sbjct: 1232 YIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLRLV 1280
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 123/426 (28%), Positives = 188/426 (44%), Gaps = 70/426 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDIAIS 495
L+ G +GI C CC + S+F+ HAG+ + P +++ G + L +I
Sbjct: 894 LKDGNITTSGIRCRCCGTTFTMSKFKCHAGLR-QEVPSLNLFLGTGKSYSLCLLQAWSIE 952
Query: 496 LAMGQRRT-------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+ + R +DD C CGDG L+ C+ CP ++H ACL IP+ W C
Sbjct: 953 QKVRKERVKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLPCQDIPDGNWYCS 1012
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHV 608
+C + S +L+ L A + C QCE+++HV
Sbjct: 1013 SCLCNICGEVITSKELRTSLPA--------------------------LECSQCERQYHV 1046
Query: 609 GCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA---SSLS-TINRKHI 664
C+ C+ E WFC C +I+ + V +P + LS TI R +
Sbjct: 1047 KCVSAKVSCN--EDGPGTWFCGRKCQQIYMIFRSRVG-----VPDHVDNDLSCTILRNNG 1099
Query: 665 EKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMV 723
+K + G ++ A+C + L A +I ECF PI+ G D+IP ++
Sbjct: 1100 DKKVRTAG-------EIALMAECNMK-----LMIALSIMEECFLPILDPRTGIDIIPSIL 1147
Query: 724 YGRNISGQEFG--GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQAL 781
Y F G Y+V+L +VS +R+ G VAE+PLVAT E + +G + L
Sbjct: 1148 YNWRSDFIHFNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVATSTENRQQGMCRRL 1207
Query: 782 FSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM--SRERLLKYQRDFQLTIFKGTS 839
IE +L SL VE L+L A WT FGFR++ S ++ L R L GT
Sbjct: 1208 MDYIEEMLKSLKVEMLLLSAIPHLVETWTSTFGFREIDDSDKKRLSMVR---LAAVPGTV 1264
Query: 840 MLEKKV 845
+L+K +
Sbjct: 1265 LLKKNL 1270
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 178/416 (42%), Gaps = 66/416 (15%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDI----AISLAMGQ 500
GI C CC+K ++ +FE HAG + ++PY +I+ G L D + G
Sbjct: 665 GIFCGCCSKILTVGKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGF 723
Query: 501 RRTTGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ G D D C +CGDG +L+ C+ C FH CL + +P W C +C
Sbjct: 724 YKIDKGEDEHDDTCAICGDGGDLVCCDHCASTFHLDCLG-IKLPSGDWYCRSC------- 775
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDF--SAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+CR F + ++ C QC +++H C G
Sbjct: 776 ----------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCSSGTG 813
Query: 616 L---CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
C + D FC C +I+ L + + S S ++ F
Sbjct: 814 TDFDCTIPGTSID-CFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLVH--------CFPN 864
Query: 673 TMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNIS 729
K AQC S ++ A + ECF P I E G ++I + Y G + S
Sbjct: 865 DQAMPPKNKEKMAQC-----NSKIALAFTVLDECFQPHIDERSGINMIHNVAYNCGSDFS 919
Query: 730 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
+F G Y+ IL V+SA +RI G ++AE+P + T Y+ +G + L + IE L
Sbjct: 920 RLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGMYRHQGMLRRLLNGIESAL 979
Query: 790 CSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
CSLNV+ LV+ A + E+ WT FGF+ + + + + L I GT +LEK++
Sbjct: 980 CSLNVQKLVVSAVTEMENTWTTVFGFKPVQPSKKQRI-KSLNLLIMNGTGLLEKRL 1034
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 7/85 (8%)
Query: 207 VPNEYPTNVKK-LLSTGIL-DGACVKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVS 263
+P E+ V ++ G++ + VKY++ +R R+ L G + G CGC C SK+++
Sbjct: 621 IPYEWKRTVYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITREGIFCGC--C--SKILT 676
Query: 264 AHEFEQHAGAKTRHPNNHIYLENGK 288
+FE HAG+K + P +I+LE GK
Sbjct: 677 VGKFELHAGSKEKKPYANIFLEGGK 701
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/439 (25%), Positives = 190/439 (43%), Gaps = 95/439 (21%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDI---AISLA-------- 497
+ IVCDCC+ + + FE+HAG R +P I +G +L D A+S +
Sbjct: 36 DAIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSV 94
Query: 498 ---------------------MGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
+ + ++ +D++C +CG G +L LC+ CP AFH CL
Sbjct: 95 VNENQKKNHSIVKENRKKNHCVVKEKSEANNDNVCSICGFGGDLALCDRCPSAFHLGCLG 154
Query: 537 PLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFD--DR 594
+P W CP C C IC D D
Sbjct: 155 LNRVPIGEWFCPTC--------------------------CCKICYRPKCKQECKDHKDN 188
Query: 595 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPAS 654
++ C QCE+++H GC++ G+ + + + WFC C + L+ + +
Sbjct: 189 NILVCVQCEQKYHFGCVKAVGI-EFNHM--ENWFCSVVCGNMFLCLKKLLGKPIKV---- 241
Query: 655 SLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE----KSLLSSATAIFRECFDPI 710
+++ W ++K ++KE +S L+ A + E F+P
Sbjct: 242 -------------------ADNLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPT 282
Query: 711 I-AECGRDLIPVMVYGRNISGQE--FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 767
A GR+LI +V+ R F G Y+VIL V+S +RI+G++VAE+ VA
Sbjct: 283 FDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVA 342
Query: 768 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQ 827
T +Y+ +G L IE+ L L VE L+L ++E A + WT+ FGF +M+ + ++
Sbjct: 343 TKEQYRRQGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWTRSFGFARMASKDKCQFI 402
Query: 828 RDFQLTIFKGTSMLEKKVQ 846
D F+ ++M K ++
Sbjct: 403 -DHTFLEFQNSTMCLKALK 420
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 184/409 (44%), Gaps = 68/409 (16%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 1022
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWR 546
+ R+ +DD C +CGDG L+ C+ CP ++H CL IP+ W
Sbjct: 1023 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWY 1082
Query: 547 CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEF 606
C C + +L+ L A + C QCE+++
Sbjct: 1083 CYRCLCDICGEVINLKELRSSLPA--------------------------LECAQCERQY 1116
Query: 607 HVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 666
H C+ LC+ + P WFC C +I+ L+ V TI S T+ R + ++
Sbjct: 1117 HAKCIYGKLLCNEEGGPC-AWFCGRRCQQIYMNLRSRVGIPIHTIDGFS-CTVLRNNGDQ 1174
Query: 667 GILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYG 725
+ T D+ A+C + L A +I ECF PII A G D+IP ++Y
Sbjct: 1175 RV---STAADIAIL----AECNMK-----LVIALSIMEECFLPIIDARTGIDIIPPILYN 1222
Query: 726 --RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
+ ++ G Y+V+L ++S +R+ G VAE+PL+ATC E + +G + L
Sbjct: 1223 WRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMD 1282
Query: 784 CIERLLCSLNVENLVLPAAEKAESIWTKKFGF--------RKMSRERLL 824
IE++L SL VE L+L A WT FGF + +SR RL+
Sbjct: 1283 YIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDRKNLSRLRLV 1331
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 144 bits (363), Expect = 2e-31, Method: Composition-based stats.
Identities = 98/324 (30%), Positives = 156/324 (48%), Gaps = 44/324 (13%)
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEV------GGCVICRSHDFSAATFDD 593
+ E G C C S SV L G E +E CV+C +F F D
Sbjct: 1750 VGEGGEDCEGC--------SDSVTLTGRDEEHVSEALAADLASSCVLCHQPEFDREGFSD 1801
Query: 594 RTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 653
+TV+ CDQCEKE+H+GCLRK+ + D++ +P+ +WFC D+C RI L
Sbjct: 1802 QTVLICDQCEKEYHIGCLRKHKMVDMQAVPEGEWFCSDECVRIRELLT------------ 1849
Query: 654 SSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE 713
K +E+G +WQ ++ + ++ L + I +E FDPII
Sbjct: 1850 --------KSLEEGETTMSGNPAYRWQFIRGRDGTKATARA-LKTVLEILQESFDPIIDN 1900
Query: 714 -CGRDLIPVMVYGRNISGQEFGGMYSVILTVKSV-------VVSAGLLRIFGREVAELPL 765
G DL+P MV+ + +F GMYS++L + V AGL+R+ G +AE+PL
Sbjct: 1901 GSGEDLLPRMVHAESAGDYDFQGMYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPL 1960
Query: 766 VATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLK 825
VAT + + +G +AL + L +L V +VLPA A W ++ F + +
Sbjct: 1961 VATRYDCRRQGHLRALVEGLRHRLIALGVRAMVLPATADALPAW-RQLAFMDLDEGSVRV 2019
Query: 826 YQRDFQLTIFKGTSMLEKKVQCLP 849
+ + ++ IF TS++ +++ +P
Sbjct: 2020 ARGEHRMIIFPHTSVVVRQLIRVP 2043
Score = 79.0 bits (193), Expect = 1e-11, Method: Composition-based stats.
Identities = 40/77 (51%), Positives = 48/77 (62%), Gaps = 6/77 (7%)
Query: 423 PNGLPDGERLTYIVKGQRLRFGC------KQGNGIVCDCCNKEISPSQFEAHAGMAARRQ 476
P+ L DGER+ Y ++GQRL G G+GI+CDCC+K IS S FE+HAG RR
Sbjct: 1532 PDSLQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHAGHKHRRN 1591
Query: 477 PYRHIYTSNGMTLHDIA 493
PY I T NG TL IA
Sbjct: 1592 PYESIMTDNGTTLKRIA 1608
Score = 39.3 bits (90), Expect = 8.6, Method: Composition-based stats.
Identities = 38/116 (32%), Positives = 50/116 (43%), Gaps = 16/116 (13%)
Query: 198 MEIKMSKKVVPNEYPTNVKKLLSTG---ILDGACVKYISTSRERQLDGIV------NGGG 248
M ++ P + L G + DG V Y + +R L G V G G
Sbjct: 1507 MRASAARTSASQRRPIRSRTLFDGGPDSLQDGERVHY-TIQGQRLLSGTVVIVQRTAGSG 1565
Query: 249 YLCGCPLCNFSKVVSAHEFEQHAGAK-TRHPNNHIYLENGKPIYSIIQELKTAPLG 303
LC C C SKV+SA FE HAG K R+P I +NG + I + + T PL
Sbjct: 1566 ILCDC--C--SKVISASAFESHAGHKHRRNPYESIMTDNGTTLKRIAERI-TPPLA 1616
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 112/438 (25%), Positives = 188/438 (42%), Gaps = 78/438 (17%)
Query: 426 LPDGERLTYIVKGQRLRF---GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIY 482
+P ++ Y+ K ++ RF G +GI CDCCNK + ++FEAHAG QP+ +IY
Sbjct: 774 VPLDGKVQYL-KRRKARFVTKGSITTDGIQCDCCNKTFTSAEFEAHAG-GKSCQPFENIY 831
Query: 483 TSNGMTLHDIAISLAMGQRRTT------------GGSDDMCHVCGDGENLLLCNGCPLAF 530
G +L + + + +DD C +CGDG +L+ C+ CP F
Sbjct: 832 LETGSSLLQCQLDSWYKEDDSAHKGFHFIDIDGEDPNDDTCGICGDGGDLICCDSCPSTF 891
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT 590
H +CL+ P W C C L VGG R + +A +
Sbjct: 892 HQSCLEIRKFPSGLWHCMYC-----------------LCKFCGMVGGNTCQRDGNMAAVS 934
Query: 591 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQT 650
++ C CE ++H C ++ + + P FC ++C ++ LQ + +
Sbjct: 935 ---HALVTCHLCEDKYHHSCFQEKDIINAD--PGSPSFCGNNCQELYERLQMLFGVKQEL 989
Query: 651 IPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPI 710
S + + R + I G + W K C S ++ A I ECF P+
Sbjct: 990 EAGFSWTFVRRFDVSSDISVSG----MSW----KVDC-----NSKVAVALQIMDECFVPM 1036
Query: 711 IA-ECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVA 767
+ + G +LI +VY G N + + G ++ +L +++A +R F +P+
Sbjct: 1037 VDHKSGVNLIRNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF----YSMPV-- 1090
Query: 768 TCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM--SRERLLK 825
+ S + LCSLNV LV+PA + WT FGF+ + S ++++
Sbjct: 1091 ------------SFHSSLSMGLCSLNVGKLVIPAISELTGTWTSVFGFKHLEGSDKQIM- 1137
Query: 826 YQRDFQLTIFKGTSMLEK 843
R+ + +F G ML+K
Sbjct: 1138 --RNMNMMVFPGVDMLQK 1153
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 6/73 (8%)
Query: 219 LSTGILDGACVKYISTSRERQL-DGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRH 277
L T LDG V+Y+ + R + G + G C C CN K ++ EFE HAG K+
Sbjct: 771 LGTVPLDGK-VQYLKRRKARFVTKGSITTDGIQCDC--CN--KTFTSAEFEAHAGGKSCQ 825
Query: 278 PNNHIYLENGKPI 290
P +IYLE G +
Sbjct: 826 PFENIYLETGSSL 838
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 114/415 (27%), Positives = 183/415 (44%), Gaps = 58/415 (13%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRT----- 503
+GI CDCC + + FE+HAG + QP ++I NG +L + Q +
Sbjct: 17 DGIQCDCCGETFAILDFESHAG-SKSCQPLKNICLENGHSLLQCQLESWNKQDESDRKGF 75
Query: 504 -------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSS 556
+DD C +CGDG NL+ C+ CP FH +CL+ P W C C
Sbjct: 76 HFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYCS----- 130
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
+ + GG E +D +A ++ C CE+++H C+
Sbjct: 131 --CKFCGMAGGDTCQMDE---------NDTAAQP----ALLACCLCEEKYHHSCILAEN- 174
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND 676
+ + FC C ++ LQ + + + + + + R + I G
Sbjct: 175 -TVNDGYSSVSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRFDVGSDISLSG---- 229
Query: 677 VQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQEF 733
M +K +C S ++ A I ECF P+ G +LI +VY G N + +
Sbjct: 230 ----MHRKVEC-----NSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNY 280
Query: 734 GGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLN 793
G + IL ++SA +RI G +AE+P + T Y+ +G + L S IE LCSLN
Sbjct: 281 SGFLTAILERGDEIISAASIRIHGNHLAEMPFIGTRHMYRRQGMCRRLLSAIETALCSLN 340
Query: 794 VENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLEKKV 845
VE LV+PA + WT FGF+ + S++++ R+ ++ F G ML+K +
Sbjct: 341 VEKLVIPAISELRETWTSVFGFKPLEGSSKQKM----RNMKMVAFPGIDMLQKPL 391
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 189/444 (42%), Gaps = 96/444 (21%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 649 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 707
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 708 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 766
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRT----VIYC 599
W C +C ICR F +T + T ++ C
Sbjct: 767 DWHCSSC-----------------------------ICR---FCGSTQEITTSSAELLSC 794
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRAQTIPASSLS 657
QC +++H C +K FC C +I+ L+ + +
Sbjct: 795 LQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------- 844
Query: 658 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSLLSSATAIFR 704
IE G W +++ CF +K S + A ++
Sbjct: 845 ----NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSKTAVAFSVMD 888
Query: 705 ECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
ECF P I E G ++I ++Y G + + F Y+ IL V+SA +RI G ++A
Sbjct: 889 ECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLA 948
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
E+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT FGF+ +
Sbjct: 949 EMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS 1008
Query: 822 RLLKYQRDFQLTIFKGTSMLEKKV 845
+ K + + I GT +LEK++
Sbjct: 1009 KRQKI-KSLNILIIHGTGLLEKRL 1031
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 6/72 (8%)
Query: 218 LLSTGIL-DGACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKT 275
++ G++ + A VKY++ R +L+G + G CGC C SK+++ +FE HAG+K
Sbjct: 631 MIDMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGC--C--SKILTVAKFELHAGSKE 686
Query: 276 RHPNNHIYLENG 287
+ P +I+LE+G
Sbjct: 687 QQPYENIFLEDG 698
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 189/444 (42%), Gaps = 96/444 (21%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 648 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 706
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 707 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 765
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRT----VIYC 599
W C +C ICR F +T + T ++ C
Sbjct: 766 DWHCSSC-----------------------------ICR---FCGSTQEITTSSAELLSC 793
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRAQTIPASSLS 657
QC +++H C +K FC C +I+ L+ + +
Sbjct: 794 LQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------- 843
Query: 658 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSLLSSATAIFR 704
IE G W +++ CF +K S + A ++
Sbjct: 844 ----NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSKTAVAFSVMD 887
Query: 705 ECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
ECF P I E G ++I ++Y G + + F Y+ IL V+SA +RI G ++A
Sbjct: 888 ECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLA 947
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
E+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT FGF+ +
Sbjct: 948 EMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS 1007
Query: 822 RLLKYQRDFQLTIFKGTSMLEKKV 845
+ K + + I GT +LEK++
Sbjct: 1008 KRQKI-KSLNILIIHGTGLLEKRL 1030
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 45/72 (62%), Gaps = 6/72 (8%)
Query: 218 LLSTGIL-DGACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKT 275
++ G++ + A VKY++ R +L+G + G CGC C SK+++ +FE HAG+K
Sbjct: 630 MIDMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGC--C--SKILTVAKFELHAGSKE 685
Query: 276 RHPNNHIYLENG 287
+ P +I+LE+G
Sbjct: 686 QQPYENIFLEDG 697
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 189/444 (42%), Gaps = 96/444 (21%)
Query: 437 KGQRLRF-GCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG-MTLHDIAI 494
KG R R G +GI C CC+K ++ ++FE HAG + +QPY +I+ +G TL +
Sbjct: 226 KGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYENIFLEDGGATLSQCLV 284
Query: 495 SLAMGQRRT-----------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES 543
Q ++ DD C +CGDG +LL C+ CP FH ACL + +P
Sbjct: 285 DAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLG-IKMPSG 343
Query: 544 GWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRT----VIYC 599
W C +C ICR F +T + T ++ C
Sbjct: 344 DWHCRSC-----------------------------ICR---FCGSTQEITTSSAELLSC 371
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAALQDFVSNRAQTIPASSLS 657
QC +++H C +K FC C +I+ L+ + +
Sbjct: 372 LQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------- 421
Query: 658 TINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKE-------------KSLLSSATAIFR 704
IE G W +++ CF +K S + A ++
Sbjct: 422 ----NAIEAGF---------SWSLVR---CFPDKLAAPPKGKAHLIHCNSKTAVAFSVMD 465
Query: 705 ECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
ECF P I E G ++I ++Y G + + F Y+ IL V+SA +RI G ++A
Sbjct: 466 ECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLA 525
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
E+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT FGF+ +
Sbjct: 526 EMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS 585
Query: 822 RLLKYQRDFQLTIFKGTSMLEKKV 845
+ K + + I GT +LEK++
Sbjct: 586 KRQKI-KSLNILIIHGTGLLEKRL 608
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 10/109 (9%)
Query: 185 SSGANDGVLCARNMEIKMSKKV---VPNEYP-TNVKKLLSTGIL-DGACVKYISTSRER- 238
SSG L R M V VP + T + ++ G++ + A VKY++ R
Sbjct: 171 SSGVRGCALLVRGSTHSMEGNVDGYVPYRWKRTVLSWMIDMGVVSEDAKVKYMNKKGTRA 230
Query: 239 QLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
+L+G + G CGC SK+++ +FE HAG+K + P +I+LE+G
Sbjct: 231 RLEGRITRDGIHCGC----CSKILTVAKFELHAGSKEQQPYENIFLEDG 275
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 106/417 (25%), Positives = 176/417 (42%), Gaps = 59/417 (14%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L HD + Q +
Sbjct: 678 GIRCNCCDEVFSVLDFEVHAG-GKRNQPFKSLYLEGGNSLLQCLHDFMNKQSESQHKGYH 736
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 737 FVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEK 796
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICR---SHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ ++ L + + C +C S+ + D H C+ ++
Sbjct: 797 VEAAIHDTSALHS----LSSCRLCEEKCSNHYPHTLAD--------------HQACINQD 838
Query: 615 GLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
G +P ++ FC C + LQ + + S S + R + +
Sbjct: 839 GT-----VPGERSTDSFCGKYCQELFEELQLLIGVKHPLPEGFSWSFLRRFELPSEV--- 890
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVYG--RNI 728
D+ ++ A+ ++ A ++ ECF P++ G +L+ +VY N
Sbjct: 891 -ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFWSNF 940
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
F + +L +++ +RI G ++AE+P + T Y+ +G + L IE
Sbjct: 941 HRLNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESA 1000
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L SL V LV+PA + WT FGF ++ E K ++ L +F G ML K +
Sbjct: 1001 LGSLKVAKLVIPAVPELIDTWTSGFGFTPVN-ESEKKTIKNLNLLVFPGVDMLGKSL 1056
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/420 (25%), Positives = 179/420 (42%), Gaps = 65/420 (15%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC S
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC----SCK 695
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEF------HVGCL 611
+ + + C +C ++C K + H C+
Sbjct: 696 FCEKDEAAKHETSTLPSLSSCRLCE-----------------EKCSKHYPHTLADHQACI 738
Query: 612 RKNGLCDLKEIPKDKW---FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
++G +P ++ FC C + LQ F+ + S S + R + +
Sbjct: 739 NQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFELPSEV 793
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--G 725
D+ ++ A+ ++ A ++ ECF P++ G +L+ +VY G
Sbjct: 794 ----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFG 840
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
N +F + +L +++ +RI G ++AE+P + T Y+ +G + L I
Sbjct: 841 SNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGI 900
Query: 786 ERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
E L SL V+ LV+PA + WT FGF ++ + K ++ L +F G ML K +
Sbjct: 901 ESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVN-DSEKKTIKNLNLLVFPGVDMLGKSL 959
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 176/425 (41%), Gaps = 88/425 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPY-RHIYTSNGMTLHDI-AISLAMGQRRTTGG- 506
GI+C+CC K S S F+ H G + + + + TL + A S R+
Sbjct: 956 GILCNCCTKTFSISDFKVHGGCSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDASG 1015
Query: 507 --------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSM 558
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 1016 RKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNC-------- 1067
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRS-----HDFSAATFDDRTVIYCDQCEKEFHVGCLRK 613
CRS + +TF ++ C QC +H C+ +
Sbjct: 1068 ---------------------TCRSCGNPLSEKEVSTFS--AILKCLQCGDSYHDTCIDQ 1104
Query: 614 NGL-CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
L C K+ + WFC C I L + V IE D
Sbjct: 1105 EMLPCGDKQ--SNIWFCGRYCKEIFIGLHNHVG------------------IEN--FLD- 1141
Query: 673 TMNDVQWQMLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-AECGRDLIP 720
N++ W +LK Q +K + L+ A I ECF ++ G D+IP
Sbjct: 1142 --NELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIP 1199
Query: 721 VMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
++ N + ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G +
Sbjct: 1200 HVL--SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRR 1257
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 840
L IE +L S +VE LVL A + + W FGF+ + + K R+ L +F GTS+
Sbjct: 1258 LMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSL 1316
Query: 841 LEKKV 845
L K++
Sbjct: 1317 LTKRL 1321
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 153/355 (43%), Gaps = 81/355 (22%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VCGDG LL C+ CP FH ACL + +P+ W C CR
Sbjct: 452 SDDACGVCGDGGELLCCDSCPSTFHPACLA-MKVPQGWWACHYCR--------------- 495
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
CV+C ++D D+ + C C ++H C R+ L + + I
Sbjct: 496 -----------CVLCMAND-------DQGLSTCQHCSLKYHEVC-RRPSLSNGRGIGA-- 534
Query: 627 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK--K 684
+C + C ++ A L D V H E G W +LK K
Sbjct: 535 -YCSETCKKVSARLSDMVGVT--------------NHTEDGF---------SWALLKIQK 570
Query: 685 AQCFEEKEKSL-------LSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQEFG 734
+ ++ + L+ A + ECF+P + D++ VY G +
Sbjct: 571 DEAVSSQDTAAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAVYSLGSEFKRVSYE 630
Query: 735 GMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNV 794
G Y+++L ++A LLRI G +VAE+P AT Y+ +G + L + +E++L S+ V
Sbjct: 631 GFYTMVLDKDGETIAAALLRIHGTKVAEMPFAATLPAYRKQGMMRRLVNAVEQVLASVQV 690
Query: 795 ENLVLPAAEKAESIWTKKFGFRKM----SRERLLKYQRDFQLTIFKGTSMLEKKV 845
+ LV+PA WT+ F FR + SRE + R L + GT++L K V
Sbjct: 691 DKLVIPAIAALVDTWTRSFSFRPLLDPESREEI----RRRSLVVIAGTTLLHKPV 741
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 115/465 (24%), Positives = 182/465 (39%), Gaps = 111/465 (23%)
Query: 448 GNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS 507
G+GI C CCN+ ++ S F AHAG + + P ++IYT +L + Q + S
Sbjct: 846 GDGIHCGCCNEIVTISDFGAHAG-SKQSDPLKNIYTEEETSLLQCLLDSWNKQDESELKS 904
Query: 508 ------------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS 555
DD C VCGDG +L+ C+GCP FH +CLD P W C C
Sbjct: 905 FHFFDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYC----- 959
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICR------SHDFSAATFDDRTVIYCDQCEKEFHVG 609
C CR + F ++ C CE++FH+
Sbjct: 960 ---------------------CCKFCRLVGGSSNQSVVNDEFTMPALLTCHLCEEKFHIS 998
Query: 610 CLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL 669
C+ NG + KD FC + C + L+ + + + S S I R +
Sbjct: 999 CVEANG--GKTDDSKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWSFIRRSDVG---- 1052
Query: 670 FDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--GR 726
D ++ + Q +C S L+ A +I ECF P I G +L+ ++Y G
Sbjct: 1053 CDLSLTNPQL-----VEC-----NSKLAVALSIMNECFMPYIDHRSGTNLLRSILYNCGS 1102
Query: 727 NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIE 786
N ++ G +VIL ++ +R+ G +AE+P + T Y+ +G + L + IE
Sbjct: 1103 NFKRLDYSGFITVILERGDEIICVASIRVHGNRLAEMPYIGTRYMYRRQGMCRRLLNAIE 1162
Query: 787 ----------------------------------------------RLLCSLNVENLVLP 800
+ L SL+VE LV+P
Sbjct: 1163 SEAVYGLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDWGGPKALSSLDVELLVIP 1222
Query: 801 AAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
A + WT FGF + ++ + + L +F +L+KK+
Sbjct: 1223 AISELRETWTSVFGFEPL-KQTSKQITNNMNLLVFPHVDLLQKKI 1266
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 148/361 (40%), Gaps = 87/361 (24%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ CP FH ACL + +PE W C CR
Sbjct: 511 SDDACGVCADGGELLCCDSCPSTFHPACLA-MKVPEGLWACHYCR--------------- 554
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLK 620
CV+C ++ DD+ + C C ++H C R NG
Sbjct: 555 -----------CVLCMAN-------DDQGLSRCQHCTLKYHEICRPSLSNGRGNGA---- 592
Query: 621 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 680
+C + C ++ A L D + H E G W
Sbjct: 593 -------YCSETCKKVSAQLSDMIG--------------ITNHTEDGF---------SWA 622
Query: 681 MLK--KAQCFEEKEKS-------LLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNI 728
+LK K + + L+ A + ECF+P+ + D++ VY G
Sbjct: 623 LLKIQKDEPVSSQNSPDVLECNVKLAVALGVLNECFNPVKDRRTKIDMLHQAVYSLGSEF 682
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
+ G Y+++L ++SA LLRI G +VAE+P T Y+ +G + L + +E++
Sbjct: 683 KRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRKQGMMRRLVNAVEQV 742
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
L S+ VE LV+PA W K F F+ + E L + R L + GT++L+K V
Sbjct: 743 LASVQVEKLVIPAIAALVDTWKKSFSFKALDPE-LKEEIRRRSLVVITGTTLLQKPVVAA 801
Query: 849 P 849
P
Sbjct: 802 P 802
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 156/347 (44%), Gaps = 52/347 (14%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG L+ C+ CP ++H ACL IP+ W C +C
Sbjct: 8 NDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCL--------------- 52
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C IC S + C QCE+++HV C+ C+ E
Sbjct: 53 -----------CDICGEVIDSKELVTSLPALDCSQCERQYHVKCVSAKVPCN--EDGSGT 99
Query: 627 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ 686
WFC C+ I+ + V +P H++ + F N+ ++ +
Sbjct: 100 WFCGRKCHEIYMTFRSRVG-----VP---------DHMDDDLCFTVLRNNGDKKVRTAEE 145
Query: 687 CFEEKEKSL-LSSATAIFRECFDPII-AECGRDLIPVMVYG-RNISGQEFGGMYSVILTV 743
E ++ L AT+I ECF PI+ G D+IP ++Y R+ + G Y+V+L
Sbjct: 146 IALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLES 205
Query: 744 KSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAE 803
+VS +R+ G +AE+PLVAT +E + +G + L IE +L SL VE L+L A
Sbjct: 206 DDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIP 265
Query: 804 KAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLEKKVQC 847
WT FGFR++ ++RL K +L GT +L+K + C
Sbjct: 266 HLAETWTSTFGFREIDESDKKRLSK----VRLAAVPGTVLLKKDLLC 308
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 150/351 (42%), Gaps = 78/351 (22%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ C FH CL + +PE W C CR
Sbjct: 526 SDDACGVCADGGELLCCDSCTSTFHPECLA-IKVPEGSWSCHYCR--------------- 569
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC--LRKNGLCDLKEIPK 624
CV+C S+D D + + C QC +++H C L NG CD+
Sbjct: 570 -----------CVLCMSND------DLQGLSTCQQCARKYHESCRPLPGNG-CDIGT--- 608
Query: 625 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK- 683
+C + C ++ + L + + T + W +L+
Sbjct: 609 ---YCGETCKKLFSQLAQVTG-----------------------VTNPTGDGFWWALLRI 642
Query: 684 ---KAQCFEEKEKSL-----LSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQE 732
+ EE L L+ A +F ECF+P+ + D++ VY G
Sbjct: 643 QKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGSQFKRLS 702
Query: 733 FGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSL 792
+ G Y+++L +VSA LLRI G +VAE+P T Y+ +G + L S +E++L S+
Sbjct: 703 YEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRRLVSAVEQVLASV 762
Query: 793 NVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
VE LV+PA + W + F FR + +L + + L + GT++L K
Sbjct: 763 QVEKLVIPAIDSLVDTWKRSFFFRPVD-PQLREELKRLSLVVITGTTLLHK 812
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 164/344 (47%), Gaps = 65/344 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSRSVDLKGG 567
D+C VC G +LLLC+GCP AFH ACL +PE W D+ G
Sbjct: 257 DVCCVCHWGGDLLLCDGCPSAFHHACLGLSSLPEEDLW----------FCPCCCCDICGS 306
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
+E+P + ++ C+QC++ FH+ CL+++ + W
Sbjct: 307 MESPA--------------------NSKLMACEQCQRRFHLTCLKEDSCI----VSSRGW 342
Query: 628 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK---K 684
FC CNR+ +AL++ + ++ G D+ W +++ +
Sbjct: 343 FCSSQCNRVFSALENLLGSKIAV---------------------GNDGDLVWTLMRAPNE 381
Query: 685 AQCFEEKEKSLLSSATAIFRECFDPI-IAECGRDLIPVMVYGRNISGQEFG-GMYSVILT 742
+ +++++ S L SA I + F+P GRDL+ ++Y ++ +G G G Y+V++
Sbjct: 382 GEHYDDEQISKLESAVEILHQGFEPTNDVFSGRDLVEELIYRKDRTG--VGRGFYTVLIE 439
Query: 743 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 802
K+ ++ +R+ ++V E+PLVAT Y+ G + L +E+ + + V LVLPAA
Sbjct: 440 RKNEPITVAAVRV-DKDVVEIPLVATLSSYRRSGMCRVLMDELEKQMSQMGVCRLVLPAA 498
Query: 803 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
++ + WT++FGF M+ L+ + L F GT M K +Q
Sbjct: 499 KEVVTTWTERFGFSVMNSSERLELVKHGMLD-FVGTIMCHKFLQ 541
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 146/313 (46%), Gaps = 57/313 (18%)
Query: 212 PTNVKKLLSTGILDGACVKYISTS-RERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFEQH 270
P N K+L++TG+L+G Y+ S R QL GI G +C C +C ++VVS FE H
Sbjct: 312 PRNAKELMATGLLEG---HYVHCSCRGEQLTGIFQDMGVVCNCRICKGTQVVSISAFEAH 368
Query: 271 AGAKTRHPNNHIYLENGKPIYSIIQELKTAP------LGILEEVVKKVAGSSFNEGSFRV 324
+G+ + HP+++IYLENGK + I+ + + L L+ + ++ G S
Sbjct: 369 SGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRALQHAIGEIQGIS-------- 420
Query: 325 WKASHHLRKGLVEHDEKHNMKLPSLPHSIISCSSLALEESISPTSCSFVQDTGREQTHMK 384
K ++ G E E ISC ++ + G + H++
Sbjct: 421 -KEMTCVKCGKHEGGE------------FISCKGAKC------SAAYHAECVGVKSPHLE 461
Query: 385 EML----EERKRGVKRPFMHQKRTAEGGTKK----------------RDNDLHRLLFLPN 424
+ E+ + +P + KR+ G K+ RD LH+ LFLP
Sbjct: 462 DWFCAKCEKTQARKPQPLLKVKRSPAGTDKEDARSKGKEQTMIARSARDAHLHKALFLPG 521
Query: 425 GLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS 484
GL DG L Y + Q + G KQG GI C CCN+EI+ S FE HA AR+ PY I +
Sbjct: 522 GLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHARCEARQNPYGSILLA 581
Query: 485 NGMTLHDIAISLA 497
+G +L D+ LA
Sbjct: 582 DGRSLKDMCKELA 594
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 164/375 (43%), Gaps = 59/375 (15%)
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+IYTS G+TLHDIA+ L + G + D NL
Sbjct: 2 NIYTSEGVTLHDIALQLHRLNLNSNGFGNASVSSFSDYPNL------------------- 42
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
SG C + S++ V LK L+ + C CR D D T+ +C
Sbjct: 43 -TSSG-----CGKEPSTTSGPIVPLKRTLQERVVQTESCYFCRYGDTEFGKLDPNTIFFC 96
Query: 600 DQCEKEFHVGC-------LRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIP 652
+QCE+ HV C ++K L LKE ++ CC++C + A L+
Sbjct: 97 NQCERPCHVRCYNSRDRDVKKVPLEILKEYMCFRFLCCEECQSLRARLEG---------- 146
Query: 653 ASSLSTINRKHIEKG---ILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDP 709
+EKG +++ W++L KA K +S A IF++ F
Sbjct: 147 -----------VEKGEEIAFLRQIRSNICWRLLSKADA-SRDVKLYMSQAIDIFKDAFVE 194
Query: 710 IIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 768
+ D+ MVYG+N +G+ +F GMY V+LT + VVSA +L++ + AEL L+AT
Sbjct: 195 -STDAHSDIFSDMVYGKNGAGEKDFRGMYCVVLTASTHVVSAAILKVRVEQFAELVLIAT 253
Query: 769 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 828
E + KG F+ L IE L + NV L+ P + IW++K GF +S E
Sbjct: 254 RSECRKKGYFRLLLKSIEANLRACNVSLLMAPVDPEMAQIWSEKLGFTILSAEEKKSMLE 313
Query: 829 DFQLTIFKGTSMLEK 843
L +F+ +++K
Sbjct: 314 SHPLVMFQDLVLVQK 328
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 145/347 (41%), Gaps = 73/347 (21%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
SD C +C +G L+ C CPL FH C+ L +P+ W C C H R+
Sbjct: 264 AAANSDQCCGICNEGGELVCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGEPLRTQ 323
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C+QCE+ FH GC CD +
Sbjct: 324 P-----------------------------------CEQCERCFHPGC------CDDAIL 342
Query: 623 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQML 682
D +FC C I L + V+ +++ + R +++ W +L
Sbjct: 343 AGDFFFCSSGCWNIFQRLAEMVA---------TVNPLGR-------------SELSWSLL 380
Query: 683 KKAQCFEEKEKSLLSSATAIFRECFDPIIAECGR--DLIPVMVYGRNISGQ--EFGGMYS 738
++ +C + LL+ A + FDP++ +C D + MV+ R+ +F G Y+
Sbjct: 381 RRGRC----DDKLLAEALQVISSRFDPVL-DCWTQLDYLDAMVFSRSHHSPRLDFSGFYT 435
Query: 739 VILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLV 798
+L + VV +LRI G +AE+P +AT +G+G ++LF+ +E +L L VE +V
Sbjct: 436 AVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVEMMV 495
Query: 799 LPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L AA+ E +W F F M R +L L G L+K V
Sbjct: 496 LLAAKDTEKMWKNSFEFHAMDR-KLKARTVALGLVALNGAGFLQKSV 541
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 426 LPDGERLTYIVKGQRLRFGCKQG----NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHI 481
+ + E++ YI +R R K+G +G+VC CC++ + S FEAH G RR P +I
Sbjct: 67 IAEQEKVVYIGGTKRFRHLLKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRR-PAANI 125
Query: 482 YTSNGMTL 489
+ + L
Sbjct: 126 FVGDEAQL 133
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 107/431 (24%), Positives = 174/431 (40%), Gaps = 115/431 (26%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL-----HDIAI- 494
L+ G +G+VC CC++ + S FEAH G RR P +I+ + L ++A
Sbjct: 2 LKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRR-PAANIFVGDEAQLSIADCQNVAFK 60
Query: 495 --------SLAMGQRR--------------------------TTGGSDDMCHVCGDGENL 520
L + +RR SD C +C +G L
Sbjct: 61 METLESLPGLPVARRRKFDSYCQSDEDSGLTTVSSGSDVDYEAAANSDQCCGICNEGGEL 120
Query: 521 LLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVI 580
+ C CPL FH C+ L +P+ W C C H R+
Sbjct: 121 VCCETCPLTFHMECVSLLEVPKDAWFCFRCLCCHCGEPLRTQP----------------- 163
Query: 581 CRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
C+QCE+ FH GC CD + D +FC C + L
Sbjct: 164 ------------------CEQCERCFHPGC------CDDAILAGDFFFCSSGCWNLFQRL 199
Query: 641 QDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSAT 700
+ V+ +++ + R +++ W +L++ +C + LL+ A
Sbjct: 200 AEMVA---------TVNPLGR-------------SELSWSLLRRGRC----DDKLLAEAL 233
Query: 701 AIFRECFDPIIAECGR--DLIPVMVYGRNISGQ--EFGGMYSVILTVKSVVVSAGLLRIF 756
+ FDP++ +C D + MV+ R+ +F G Y+ +L + VV +LRI
Sbjct: 234 QLISSRFDPVL-DCWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIH 292
Query: 757 GREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFR 816
+AE+P +AT +G+G ++LF+ +E +L L VE + L AA+ E +W F F
Sbjct: 293 AAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVETMALLAAKDTEKMWKNSFEFH 352
Query: 817 KMSRERLLKYQ 827
+ R+ LK Q
Sbjct: 353 AVDRK--LKAQ 361
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 150/354 (42%), Gaps = 67/354 (18%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNC---------------- 47
Query: 567 GLEAPGAEVGGCVICRS-----HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL-CDLK 620
CRS + +TF ++ C QC +H C+ + L C K
Sbjct: 48 -------------TCRSCGNPLSEKEVSTFS--AILKCLQCGDSYHDTCIDQEMLPCGDK 92
Query: 621 EIPKDKWFCCDDCNRI------HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTM 674
+ + WFC C I H +++F+ N + + H K I
Sbjct: 93 Q--SNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTEC 150
Query: 675 NDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVY--GRNISGQ 731
N + L+ A I ECF ++ G D+IP ++Y G N +
Sbjct: 151 N------------------TKLAVALTILEECFVRMVDPRTGVDMIPHVLYNKGSNFARL 192
Query: 732 EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCS 791
++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L IE +L S
Sbjct: 193 DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRS 252
Query: 792 LNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+VE LVL A + + W FGF+ + + K R+ L +F GTS+L K++
Sbjct: 253 FHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSLLTKRL 305
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 158/341 (46%), Gaps = 65/341 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSRSVDLKGG 567
D+C VC G +LLLC+GCP AFH CL +PE W CP C SM V+ K
Sbjct: 255 DVCCVCHWGGDLLLCDGCPSAFHHTCLGLSSLPEEDLWFCPCCCCDICGSMESPVNSK-- 312
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN-GLCDLKEIPKDK 626
++ C+QC++ FH+ CL++ G+ +
Sbjct: 313 ----------------------------LMACEQCQRRFHLKCLKEEPGIVSCR-----G 339
Query: 627 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLK--- 683
WFC CNR+ +AL++ + + G D+ W +++
Sbjct: 340 WFCSSQCNRVSSALENLIGCKIAV---------------------GNNGDLVWTLMRAPN 378
Query: 684 KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILT 742
+ + +++++ S L SA I + F+P GRDL+ +++ ++ +G G Y+V++
Sbjct: 379 EGEHYDDEQISKLESAVEILHQGFEPTKDVFSGRDLVEELIFRKDRTGVG-RGFYTVLIE 437
Query: 743 VKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAA 802
K ++ +R+ ++V E+PLVAT Y+ G + L +E+ + + V LVLPAA
Sbjct: 438 RKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMCRVLVDELEKQMSQMGVCRLVLPAA 496
Query: 803 EKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
++ S WT++FGF M L+ + L F GT M K
Sbjct: 497 KEVVSTWTQRFGFSVMESSERLELVKHGMLD-FVGTVMCHK 536
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 149/344 (43%), Gaps = 61/344 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
SDD C VC DG LL C+ C FH CL + +P+ W C CR
Sbjct: 436 SDDACGVCADGGELLCCDFCTSTFHPECLA-IEVPDGSWSCHYCR--------------- 479
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC--LRKNGLCDLKEIPK 624
C +C S+D D+ + C +C ++H C L NG D+
Sbjct: 480 -----------CTLCMSND-------DQDLSTCQECACKYHESCRPLLGNGR-DIGA--- 517
Query: 625 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKK 684
+C + C ++ A L + + T S S + R H ++ G M V + +K
Sbjct: 518 ---YCGEICKKLSAKLSEVIGVMNSTEDGFSWSLL-RIHEDEPASSQG-MPAVLERNVK- 571
Query: 685 AQCFEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQEFGGMYSVIL 741
L+ A + +CF+P+ + D++ VY G + G Y++IL
Sbjct: 572 -----------LAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGSQFKRLSYEGFYTMIL 620
Query: 742 TVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPA 801
+VS LLRI GR+VAE+P T Y+ +G + S +E++L S+ VE L++PA
Sbjct: 621 EKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAVEQVLASVQVETLIIPA 680
Query: 802 AEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
W + F FR + +L + + L + GT+ML K V
Sbjct: 681 IASMVDTWKRSFSFRPVD-PQLREELKRLSLVVITGTTMLHKPV 723
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 147/345 (42%), Gaps = 86/345 (24%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
D +C VC DG LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 564 DSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------------- 607
Query: 568 LEAPGAEVGGCVICRSHDF---SAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
C +C D +A F D+T+IYC+ + IP
Sbjct: 608 ----------CGVCGGSDLDDDTAEGFTDKTIIYCEA------------------RSIP- 638
Query: 625 DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDV-QWQMLK 683
T+ SLST+ R+ + T+ + +WQ +
Sbjct: 639 ------------------------TTVEGVSLSTLRRRR------YMSTVTRITRWQHEE 668
Query: 684 KAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGR--NISGQEFGGMYSVI 740
+ + + L +A + ECFD ++ DL +V+ + + F G Y V
Sbjct: 669 E----DAADHGQLCAALDVLHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVG 724
Query: 741 LTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLP 800
L +++ G LR+FG +VAELPLV T ++ +G + L + +E++L + V LVLP
Sbjct: 725 LEKAGELITVGTLRVFGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLP 784
Query: 801 AAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
A + +WT GF M+R +++ + + FKGT+M +K +
Sbjct: 785 AVPELLPMWTASLGFHAMTRSDVMEMAVEHAILSFKGTTMCQKTL 829
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 151/357 (42%), Gaps = 77/357 (21%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNC---------------- 47
Query: 567 GLEAPGAEVGGCVICRS-----HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL-CDLK 620
CRS + +TF ++ C QC +H C+ + L C K
Sbjct: 48 -------------TCRSCGNPLSEKEVSTFS--AILKCLQCGDSYHDTCIDQEMLPCGDK 92
Query: 621 EIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQ 680
+ + WFC C I L + V IE D N++ W
Sbjct: 93 Q--SNIWFCGRYCKEIFIGLHNHVG------------------IEN--FLD---NELSWS 127
Query: 681 MLK---KAQCFEEKEK--------SLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNI 728
+LK Q +K + L+ A I ECF ++ G D+IP ++ N
Sbjct: 128 ILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNF 185
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
+ ++ G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L IE +
Sbjct: 186 ARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMM 245
Query: 789 LCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
L S +VE LVL A + + W FGF+ + + K R+ L +F GTS+L K++
Sbjct: 246 LRSFHVETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSLLTKRL 301
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 122 bits (307), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 148/352 (42%), Gaps = 67/352 (19%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
+DD C CGDG LL C+ CP +H CL +PE W C NC
Sbjct: 4 NDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYCHNC---------------- 47
Query: 567 GLEAPGAEVGGCVICRS-----HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL-CDLK 620
CRS + +TF ++ C QC +H C+ + L C K
Sbjct: 48 -------------TCRSCGNPLSEKEVSTFS--AILKCLQCGDSYHDTCIDQEMLPCGDK 92
Query: 621 EIPKDKWFCCDDCNRI------HAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTM 674
+ + WFC C I H +++F+ N + + H K I
Sbjct: 93 Q--SNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGRKLHSSKKIAHMTEC 150
Query: 675 NDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEF 733
N + L+ A I ECF ++ G D+IP ++ N + ++
Sbjct: 151 N------------------TKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDY 190
Query: 734 GGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLN 793
G Y+VIL ++ +R+ G + AELP +AT +Y+ +G + L IE +L S +
Sbjct: 191 QGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRSFH 250
Query: 794 VENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
VE LVL A + + W FGF+ + + K R+ L +F GTS+L K++
Sbjct: 251 VETLVLSAIPELVNTWVSGFGFKPI-EDNEKKQLRNVNLMLFPGTSLLTKRL 301
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 133/264 (50%), Gaps = 33/264 (12%)
Query: 593 DRTVIYCDQCEKEFHVGCLRKNGLCDL-KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
D ++ CDQC ++FHV C R GL L ++ WFC D C + + LQ + +++
Sbjct: 44 DGFILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHLL---GKSV 100
Query: 652 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ--CFE----EKEKSLLSSATAIFRE 705
P + +++ W +LK+ + CF+ S L A + E
Sbjct: 101 PVGT-------------------DNLTWTLLKRVEPDCFDLEVLSANNSKLKLALEVMHE 141
Query: 706 CFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAE 762
CF+P A G+DL+ +++ G N++ F G Y+V+L + + + +R+FG +VAE
Sbjct: 142 CFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVRVFGDKVAE 201
Query: 763 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRER 822
+P VAT +Y+ G + L + +ER L +L VE LVLPAA W K FGF M+
Sbjct: 202 VPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFGFSVMTYSD 261
Query: 823 LLKYQRDFQLTIFKGTSMLEKKVQ 846
K D+ + F+GT + +K ++
Sbjct: 262 -KKAHSDYPILFFQGTVLCQKFLK 284
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 122 bits (306), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 102/407 (25%), Positives = 167/407 (41%), Gaps = 88/407 (21%)
Query: 437 KGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYT--SNGMTLH---D 491
K + L G + G+ C+CCN+ ++ FE HAG +R PY HI+ SN L +
Sbjct: 125 KKEVLMKGRARREGVRCECCNEVMTVWDFETHAGSVLQR-PYEHIHVARSNSSLLQCQFE 183
Query: 492 IAISLAMGQRRT-------TGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP 541
+ S +RRT G SD D C +C DG +L+ C C H C+ IP
Sbjct: 184 VWQSNVEVERRTFNEIVPRNGASDKHDDACLICADGGDLICCEKCWSTSHLKCMGLERIP 243
Query: 542 ESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQ 601
+ W CP C H + +D+ + C Q
Sbjct: 244 QGDWICPYCVCKHCNK----------------------------------NDKDLQTCVQ 269
Query: 602 CEKEFHVGCLRKNGLCDLKEIPKDKWFCCDD-CNRIHAALQDFVSNRAQTIPASSLSTIN 660
C+K++H CL N DL + CD C ++ LQ V + +
Sbjct: 270 CDKKYHCQCLVSNKELDLN--ASGETLACDSHCGEVYEKLQSLVGVKHE----------- 316
Query: 661 RKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSL---------LSSATAIFRECFDPII 711
+E G W +L++ + K L ++ A + ECF II
Sbjct: 317 ---LEGGFC---------WTLLQRMEPDNLDFKDLHLITECNSKIALAWEVLDECFTTII 364
Query: 712 AECGR-DLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 768
+ +++ + Y R N++ F G Y+ IL ++SA +R+ G ++AE+P + T
Sbjct: 365 DRHTQINVVQSVAYSRGSNLNRINFRGFYTAILEKNDDIISAATIRVHGTDLAEMPFIGT 424
Query: 769 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF 815
Y+ G + L +E + + VE+L++P+ ++ +W K GF
Sbjct: 425 RHLYRQNGMSRMLLVTLESIFSVMGVEHLIIPSVQELTEMWEGKCGF 471
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/241 (33%), Positives = 118/241 (48%), Gaps = 33/241 (13%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEA 467
+K RD LH L+F +GL D LTY +K G+ L+ G K+G I+C+CCN+E SPS FE
Sbjct: 341 SKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEE 400
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCP 527
HAGM RRQPY +IYT G++LH +A+ L ++ L NG
Sbjct: 401 HAGMGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFD 439
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFS 587
A ++ D + SG C + S++ V LK L+ E C C +
Sbjct: 440 NASVSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHTT 494
Query: 588 AATFDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 641
+ T+I+C+QCE+ H+ C +K L LKE + CC +C + A L+
Sbjct: 495 IGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE 554
Query: 642 D 642
+
Sbjct: 555 E 555
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 72/125 (57%), Gaps = 3/125 (2%)
Query: 214 NVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF-SKVVSAHEFEQHAG 272
+V+ LLSTG+L+G V Y+ E G +NG GY CGC CN+ S +++A EFE+H G
Sbjct: 68 DVRGLLSTGLLEGFRVTYMKDEVEEV--GRINGQGYSCGCSKCNYNSNIMNACEFEEHYG 125
Query: 273 AKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVKKVAGSSFNEGSFRVWKASHHLR 332
+ +HI+L+ G ++ +++ LK L +L + +++ G N + WKAS R
Sbjct: 126 QSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKASFQKR 185
Query: 333 KGLVE 337
K ++
Sbjct: 186 KDYLD 190
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/433 (24%), Positives = 175/433 (40%), Gaps = 87/433 (20%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL--------HDIAISLAMGQR 501
GI C+CC++ S FE HAG R QP++ +Y G +L + + S G
Sbjct: 428 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 486
Query: 502 RTTGGS----DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
GS DD C +CGDG +L+ C+GCP FH +CLD P W C NC
Sbjct: 487 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC------- 539
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRT---VIYCDQCEKEFHVGCLRKN 614
C C + AA + T + C CE+++H C+ ++
Sbjct: 540 -------------------SCKFCEKDE--AAKHETSTLPSLSSCRLCEEKYHQACINQD 578
Query: 615 GLCDLKEIPKDK---WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
G +P ++ FC C + LQ F+ + S S + R + +
Sbjct: 579 GT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFELPSEV--- 630
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--GRNI 728
D+ ++ A+ ++ A ++ ECF P++ G +L+ +VY G N
Sbjct: 631 -ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNF 680
Query: 729 SGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERL 788
+F + +L +++ +RI G ++AE+P + T Y+ +G + L IE
Sbjct: 681 HRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESF 740
Query: 789 LCSLNVENLVLPAAEKAESIWT----------------KKFGFRKMSRERLLKYQRDFQL 832
+ + + L +E +W FGF ++ + K ++ L
Sbjct: 741 VAYFS--QMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVN-DSEKKTIKNLNL 797
Query: 833 TIFKGTSMLEKKV 845
+F G ML K +
Sbjct: 798 LVFPGVDMLGKSL 810
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/107 (58%), Positives = 81/107 (75%), Gaps = 1/107 (0%)
Query: 740 ILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVL 799
I+ + VV+AGLLR+FGREVAELPLVAT + KG FQ LFSCIE+LL SLNV ++V+
Sbjct: 80 IIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIVV 139
Query: 800 PAAEKAESIWTKKFGFRKMSRERLLKYQRD-FQLTIFKGTSMLEKKV 845
PAAE+ E +W KFGFRK++ E+L KY + +Q+ FKG SML+K V
Sbjct: 140 PAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPV 186
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 619 LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTI--NRKHIEKGIL 669
LKE+PK WFC DC RI++ LQ + A+ + S I N + G+L
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSCFGIIQTNATVVAAGLL 93
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 145/356 (40%), Gaps = 72/356 (20%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
+RR + SD +C C G +L+LC+ CP FH CL+ +P W CP+C
Sbjct: 135 KRRNSSLSDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC--------- 185
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
C +C D S +T C QC + +HV CL K+G
Sbjct: 186 -----------------CCELCGKGDSSTSTNA------CLQCARAYHVHCLTKDGCLLP 222
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQW 679
+ P + FC C + A L + GI +++ + W
Sbjct: 223 TDYPSEN-FCSKSCYELCAQLHQLL----------------------GISNPTSVDGLTW 259
Query: 680 QMLKKAQCFEEKEKSLLSSATA-------IFRECFDPII-AECGRDLIPVMVY--GRNIS 729
+ + ++ SS + ECF + +D++ ++Y G
Sbjct: 260 TLTRSSKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDMVTDLIYNSGSKFK 319
Query: 730 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
F G Y+V+L +VS LRI G + AE+PLVAT ++ +G + L + +LL
Sbjct: 320 RLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQGMCRLLMQEVLKLL 379
Query: 790 CSLNVENLVLPAAEKAESIWTKKFGFRKM---SRERLLKYQRDFQLTIFKGTSMLE 842
VE L+LPA + +W FGF +M R++L Y F+GT ML+
Sbjct: 380 NKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQLSGY----SFVGFQGTMMLQ 431
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 116/461 (25%), Positives = 173/461 (37%), Gaps = 122/461 (26%)
Query: 449 NGIVCDCCNKEISPSQFEAHA----------------------------------GMAAR 474
GI C CCN + + FE HA M AR
Sbjct: 411 TGIRCRCCNTVVPVAVFETHARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAR 470
Query: 475 R--------------QPYRHIYTSNGMTLHD----IAISLAMGQRRTTGG----SDDMCH 512
Q R + + M L D ++ S Q + GG SDD C
Sbjct: 471 EKAMASLEQEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACG 530
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VC DG LL C+ CP FH CL + +S W + RQ ++ + L+ APG
Sbjct: 531 VCADGGQLLCCDTCPSTFHPDCLAIQFMIKS-WLLFD-RQQLTTIYGQQPWLQ---TAPG 585
Query: 573 AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE-FHVGCLRKNGLCDLKEIPKDKWFCCD 631
A I H YC + F +G +C +
Sbjct: 586 A-----AISADHQ------------YCRPLQSPGFEIGA-----------------YCSE 611
Query: 632 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ----C 687
C ++ + L D + +N H E G F + +Q L ++
Sbjct: 612 TCKKMSSHLSDMIG------------VMN--HTEDG--FSWALLKIQKDELVTSEDMPVI 655
Query: 688 FEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQEFGGMYSVILTVK 744
E K L+ A + ECF+P+ + D++ VY G + G Y+++L
Sbjct: 656 LESNVK--LAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKD 713
Query: 745 SVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 804
++S LLR GR++AE+P T YQ +G + L +E++L SL VENLV+PA
Sbjct: 714 GEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVAD 773
Query: 805 AESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
W + F FR M E + + L GT++L+K +
Sbjct: 774 LVETWKRSFSFRPMQAE-VRDEAKKLSLVAITGTTLLQKPI 813
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 167/385 (43%), Gaps = 58/385 (15%)
Query: 471 MAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF 530
M RRQPY +IYT G++LH +A+ L ++ L NG A
Sbjct: 1 MGRRRQPYHNIYTLEGLSLHKLALQL---------------------QDHLNPNGFDNAS 39
Query: 531 HAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT 590
++ D + SG C + S++ V LK L+ E C C +
Sbjct: 40 VSSVSDYHNLTSSG-----CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHTTIGN 94
Query: 591 FDDRTVIYCDQCEKEFHVGCL------RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 644
+ T+I+C+QCE+ H+ C +K L LKE + CC +C + A L++
Sbjct: 95 INPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLEE-- 152
Query: 645 SNRAQTIPASSLSTINRKHIEK--GILFDGTM-NDVQWQMLKKAQCFEEKEKSLLSSATA 701
+EK GI F + +++ W++L + K +
Sbjct: 153 ------------------GLEKCVGITFLRRIRSNICWRLLSGMDASRDV-KLYMPQVID 193
Query: 702 IFRECFDPIIAECGRDLIPVMVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREV 760
IF++ F E D+I MV G+N + +F GMY +LT + VVSA +L++ ++
Sbjct: 194 IFKDAFMDSTDE-HSDIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQI 252
Query: 761 AELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSR 820
AEL L+AT E + KG F L IE L + NV L P + IW++K GF +S
Sbjct: 253 AELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSA 312
Query: 821 ERLLKYQRDFQLTIFKGTSMLEKKV 845
E L +FK +++K +
Sbjct: 313 EEKESMLESHPLVMFKNLVLVQKSL 337
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 116/461 (25%), Positives = 173/461 (37%), Gaps = 122/461 (26%)
Query: 449 NGIVCDCCNKEISPSQFEAHA----------------------------------GMAAR 474
GI C CCN + + FE HA M AR
Sbjct: 413 TGIRCRCCNTVVPVAVFETHARCERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAR 472
Query: 475 R--------------QPYRHIYTSNGMTLHD----IAISLAMGQRRTTGG----SDDMCH 512
Q R + + M L D ++ S Q + GG SDD C
Sbjct: 473 EKAMASLEQEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRHQVKKNGGGKDCSDDACG 532
Query: 513 VCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPG 572
VC DG LL C+ CP FH CL + +S W + RQ ++ + L+ APG
Sbjct: 533 VCADGGQLLCCDTCPSTFHPDCLAIQFMIKS-WLLFD-RQQLTTIYGQQPWLQ---TAPG 587
Query: 573 AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE-FHVGCLRKNGLCDLKEIPKDKWFCCD 631
A I H YC + F +G +C +
Sbjct: 588 A-----AISADHQ------------YCRPLQSPGFEIGA-----------------YCSE 613
Query: 632 DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ----C 687
C ++ + L D + +N H E G F + +Q L ++
Sbjct: 614 TCKKMSSHLSDMIG------------VMN--HTEDG--FSWALLKIQKDELVTSEDMPVI 657
Query: 688 FEEKEKSLLSSATAIFRECFDPIIAECGR-DLIPVMVY--GRNISGQEFGGMYSVILTVK 744
E K L+ A + ECF+P+ + D++ VY G + G Y+++L
Sbjct: 658 LESNVK--LAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKD 715
Query: 745 SVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEK 804
++S LLR GR++AE+P T YQ +G + L +E++L SL VENLV+PA
Sbjct: 716 GEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVAD 775
Query: 805 AESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
W + F FR M E + + L GT++L+K +
Sbjct: 776 LVETWKRSFSFRPMQAE-VRDEAKKLSLVAITGTTLLQKPI 815
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 99/436 (22%), Positives = 174/436 (39%), Gaps = 83/436 (19%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTL----HDIAISLAMGQRR--- 502
GI C+CC++ S FE HAG R QP++ +Y G +L H+ + Q +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 503 -----TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ +DD C +CGDG +L+ C+GCP FH +CLD P W C NC S
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNC----SCK 695
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEF------HVGCL 611
+ + + C +C ++C K + H C+
Sbjct: 696 FCEKDEAAKHETSTLPSLSSCRLCE-----------------EKCSKHYPHTLADHQACI 738
Query: 612 RKNGLCDLKEIPKDKW---FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
++G +P ++ FC C + LQ F+ + S S + R + +
Sbjct: 739 NQDGT-----VPGERSTDSFCGKYCQELFEELQLFIGVKHPLPEGFSWSFLRRFELPSEV 793
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIA-ECGRDLIPVMVY--G 725
D+ ++ A+ ++ A ++ ECF P++ G +L+ +VY G
Sbjct: 794 ----ADCDISEKIAYNAK---------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFG 840
Query: 726 RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCI 785
N +F + +L +++ +RI G ++AE+P + T Y+ +G + L I
Sbjct: 841 SNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGI 900
Query: 786 ERLLCSLNVENLVLPAAEKAESIWT----------------KKFGFRKMSRERLLKYQRD 829
E + + + L +E +W FGF ++ + K ++
Sbjct: 901 ESFVAYFS--QMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVN-DSEKKTIKN 957
Query: 830 FQLTIFKGTSMLEKKV 845
L +F G ML K +
Sbjct: 958 LNLLVFPGVDMLGKSL 973
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/461 (24%), Positives = 196/461 (42%), Gaps = 92/461 (19%)
Query: 426 LPDGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTS 484
L +G ++Y+ K ++ G +GI+C CCN+ S + F+ HAG R T+
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHR-------TA 65
Query: 485 NGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAF--HAACLDPLLIPE 542
+TL D G ++L C L A C +P
Sbjct: 66 ALLTLED-------------------------GRSVLECQKQALKKIEQAKCDEP----- 95
Query: 543 SGWRCPNCRQGHSSSMSRSVDLKGG-LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQ 601
N + + ++++LK L E+ + D A D ++ CD
Sbjct: 96 -----ANGQLTVDETALKAMELKESELVVDDVEMD-----ENDDTCAVCGDGGQLVCCDH 145
Query: 602 CEKEFHVGCLRKNGLCDLKEIPKDKWFC----CDDCNR------IHAALQDFVSNR---- 647
C FH+ CLR L+ +P+ WFC C C R I + + SN
Sbjct: 146 CPSTFHLKCLR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHSNCVPGC 199
Query: 648 AQTIPASSLSTINRK--HIEKGIL-FDGTMNDVQ----WQMLKKAQCFEEKEKSLLSS-- 698
A +S +RK I +G+ G +N V W +L+ + E S L S
Sbjct: 200 AMKYESSDNQFCSRKCFKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSAENSKLESVA 259
Query: 699 --------ATAIFRECFDPII-AECGRDLIPVMVYGRNISGQ--EFGGMYSVILTVKSVV 747
A + +ECF P+I D++ ++Y R + +F G Y+V+L + +
Sbjct: 260 DLNTRLALALTVIQECFRPMIDPRSNIDMVSHILYNRRGEDKRMDFRGFYTVVLEKEQEL 319
Query: 748 VSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAES 807
+S +R+ G AE+P + T +Y+ +G + L + I+++L +L V+ LVLPA +
Sbjct: 320 ISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIE 379
Query: 808 IWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQCL 848
WT FGF+K++ + ++ + + F G+S+L+K + L
Sbjct: 380 TWTSAFGFQKLTAAQGIQLM-ELNIVTFPGSSVLQKPLTWL 419
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/88 (61%), Positives = 66/88 (75%), Gaps = 1/88 (1%)
Query: 759 EVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKM 818
++AELPLVAT GKG FQ LFSCIERLL L V+ LVLPAAE+AESIWT+KFGF ++
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 819 SRERLLKYQRD-FQLTIFKGTSMLEKKV 845
++L Y+R Q+ FKGTSML+K V
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTV 223
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 120/257 (46%), Gaps = 38/257 (14%)
Query: 592 DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
D +I CD C FH CL D++ + LQ + + +
Sbjct: 294 DGGDLICCDGCPSTFHQSCL------DIQ---------------LFEQLQMLLGVKHELE 332
Query: 652 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 711
S + + R + FD ++N + +K +C S L+ A +I ECF PI+
Sbjct: 333 DGFSWTLVQRTEVG----FDISLNGIP----QKVEC-----NSKLAVALSIMDECFLPIV 379
Query: 712 AE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 768
+ G +LI ++Y G N + + G ++ IL ++SA +RI G ++AE+P + T
Sbjct: 380 DQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGT 439
Query: 769 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQR 828
Y+ +G + L + IE L SLNVE LV+PA + WT FGF+ + K R
Sbjct: 440 RHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEVSS-RKEMR 498
Query: 829 DFQLTIFKGTSMLEKKV 845
+ + +F GT ML+K +
Sbjct: 499 NMNMLVFHGTDMLQKPL 515
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 23/170 (13%)
Query: 389 ERKRGVKRPFMHQKRTAEGGTKKRDN--------DLHRLLFLPNGLPDGERLTYI--VKG 438
+R + KR + + + EG T D L + +P ++ Y+ K
Sbjct: 146 QRTQNTKRFALLARHSKEGLTTDTDGYVPYSGKRTLLSWMVDLGTVPLNAKVQYMNRRKT 205
Query: 439 QRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAM 498
+ L G +GI C CC++ + S+FE HAGM +P ++I G++L +
Sbjct: 206 RALLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLC-EPSQNIILETGISLLQCQLDSWN 264
Query: 499 GQRRTTGG------------SDDMCHVCGDGENLLLCNGCPLAFHAACLD 536
Q + +DD C +CGDG +L+ C+GCP FH +CLD
Sbjct: 265 KQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCLD 314
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 133/342 (38%), Gaps = 81/342 (23%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG----MTLHDIAISL 496
++ G GIVC+CC K +S S F AHAG + Q ++ +G + L + +
Sbjct: 880 IKDGMVTWEGIVCNCCKKNLSVSDFMAHAGRS-HPQSSLGLFLESGKSYTLCLVEAWSAE 938
Query: 497 AMGQRRTTGG--------SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCP 548
+M +R G SDD C CGDG LL C+ CP +H ACL +PE W C
Sbjct: 939 SMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCH 998
Query: 549 NCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIY-CDQCEKEFH 607
NC C +C FS + I+ C QC +H
Sbjct: 999 NCT--------------------------CQVCGG-PFSEKEVSTFSAIFKCFQCGDAYH 1031
Query: 608 VGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKG 667
C+ + L L++ WFC C I L R H+
Sbjct: 1032 DTCIEQEKL-PLEDQISQTWFCGKYCKEIFIGL--------------------RSHVGTD 1070
Query: 668 ILFDGTMNDVQWQMLK------------KAQCFEEKEKSLLSSATAIFRECFDPII-AEC 714
+ D +D+ W +L+ K C E L+ A + ECF ++
Sbjct: 1071 NILD---SDLSWSILRCNNDGQKLHSVQKIACLAECNMK-LAVALTLLEECFIRMVDPRT 1126
Query: 715 GRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR 754
G D+IP ++Y G N + ++ G Y+VIL ++ +R
Sbjct: 1127 GVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIR 1168
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 149/360 (41%), Gaps = 65/360 (18%)
Query: 450 GIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT-----LHDI----AISLAMGQ 500
GI C CC+K ++ ++FE HAG + ++PY +I+ G L D S G
Sbjct: 685 GIFCGCCSKILTIAKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLDAWEKHTQSENKGF 743
Query: 501 RRTTGGSD---DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSS 557
+ G D D C +CGDG +L+ C+ C FH CL + +P W C +C
Sbjct: 744 YKIDKGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLG-IKLPSGDWYCRSC------- 795
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDF--SAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+CR F + ++ C QC +++H C G
Sbjct: 796 ----------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCSSGTG 833
Query: 616 L---CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDG 672
C + D FC C +I+ L + + S S ++ D
Sbjct: 834 TDSGCTMPGTSID-CFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLVH------CFANDQ 886
Query: 673 TMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNIS 729
M + + L AQC S + A + ECF P I + G ++I + Y G + S
Sbjct: 887 AMPNKNKEKL--AQC-----NSKTALAFTVLDECFQPHIDDRSGINMIHNVAYNCGSDFS 939
Query: 730 GQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLL 789
+F G Y+ IL V++A +RI G ++AE+P + T Y+ +G + L + IE ++
Sbjct: 940 RLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIGTRGMYRHQGMCRRLLNGIESVI 999
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 5/61 (8%)
Query: 229 VKYISTSRERQ-LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENG 287
VKY++++R R+ L G + G CGC SK+++ +FE HAG+K + P +I+LE G
Sbjct: 665 VKYMNSNRTREMLAGKITREGIFCGC----CSKILTIAKFELHAGSKEKKPYANIFLEGG 720
Query: 288 K 288
K
Sbjct: 721 K 721
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 112/229 (48%), Gaps = 24/229 (10%)
Query: 628 FCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQML---KK 684
FC C + LQ+ ++ + P S + R H DV ++L K+
Sbjct: 27 FCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIH-----------EDVPEEVLALDKR 75
Query: 685 AQCFEEKEKSLLSSATAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVIL 741
+C S ++ A ++ ECF PII + G +LI +VY G N + +F G Y IL
Sbjct: 76 VEC-----NSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFIL 130
Query: 742 TVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPA 801
+++A +RI G ++AE+P + T Y+ +G + L IE +L SLNVE L++PA
Sbjct: 131 ERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPA 190
Query: 802 AEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKK-VQCLP 849
+ WT KFGF + + + + +F GT +L+K ++ LP
Sbjct: 191 ITELVDTWTSKFGFSPLEDSEKQEV-KSISMLVFPGTGLLQKPLLKALP 238
>gi|224090647|ref|XP_002309044.1| predicted protein [Populus trichocarpa]
gi|222855020|gb|EEE92567.1| predicted protein [Populus trichocarpa]
Length = 467
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/65 (56%), Positives = 52/65 (80%)
Query: 199 EIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNF 258
+IK +KK+ N +P+NV+ LLSTG+LDG VKY++ S+E++L G++ G GYLCGC CNF
Sbjct: 382 DIKTAKKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWSQEKELRGVIKGSGYLCGCQTCNF 441
Query: 259 SKVVS 263
SKV+S
Sbjct: 442 SKVIS 446
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 705 ECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVA 761
ECF PII + G +LI +VY G N + +F G Y IL +++A +RI G ++A
Sbjct: 3 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 62
Query: 762 ELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRE 821
E+P + T Y+ +G + L IE +L SLNVE L++PA + WT KFGF +
Sbjct: 63 EMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDS 122
Query: 822 RLLKYQRDFQLTIFKGTSMLEKK-VQCLP 849
+ + + +F GT +L+K ++ LP
Sbjct: 123 EKQEV-KSISMLVFPGTGLLQKPLLKALP 150
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 10/156 (6%)
Query: 694 SLLSSATAIFRECFDPIIAEC-GRDLIPVMVYGR--NISGQEFGGMYSVILTVKSVVVSA 750
S L A ++ ECF+P+ RDL+ +++ R ++ F G Y+V+L ++S
Sbjct: 977 SKLHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISV 1036
Query: 751 GLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT 810
+R++G++VAE+PLV T +Y+ G L +E+ L L VE LVLPA WT
Sbjct: 1037 ATVRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWT 1096
Query: 811 KKFGFRKMS---RERLLKYQRDFQLTIFKGTSMLEK 843
+ FGF KM+ R + L D+ F+G M +K
Sbjct: 1097 RSFGFAKMTNLERSQFL----DYTFLDFQGAIMCQK 1128
Score = 82.4 bits (202), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 92/224 (41%), Gaps = 48/224 (21%)
Query: 431 RLTYIVKGQR-----LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSN 485
++ Y VKG+ L G +GI C+CC S FE HA + +P I+ +
Sbjct: 548 KVYYKVKGRHRKVCTLADGKITRDGIKCNCCMGIYSFVGFENHASGNSTCRPSASIFLED 607
Query: 486 GMTLHDIAISLAMGQR--RTTGGS---------DDMCHVCGDGENLLLCNGCPLAFHAAC 534
G +L D I + + T+G S D +C VC G L+LC+ CP +FH C
Sbjct: 608 GRSLLDCLIKMMHDHKTMETSGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTC 667
Query: 535 LDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR 594
L IP W CP+C C IC DD
Sbjct: 668 LGLEDIPNGDWFCPSC--------------------------CCGICGQRKIDR---DDE 698
Query: 595 T--VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 636
++ C QCE ++HV CL +NG D+ WFC DC ++
Sbjct: 699 VEQLLPCIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKL 741
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 76/140 (54%), Gaps = 11/140 (7%)
Query: 696 LSSATAIFRECFDPII-AECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGL 752
L A +I ECF PII A G D+IP ++Y + ++ G Y+V+L ++S
Sbjct: 1122 LVIALSIMEECFLPIIDARTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVAS 1181
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
+R+ G VAE+PL+ATC E + +G + L IE++L SL VE L+L A WT
Sbjct: 1182 IRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMA 1241
Query: 813 FGF--------RKMSRERLL 824
FGF + +SR RL+
Sbjct: 1242 FGFVPIDDLDRKNLSRLRLV 1261
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 17/109 (15%)
Query: 441 LRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQ--PYRHIYTSNG--MTLHDI-AIS 495
L+ G GI C CC+ + S F+ HAG+ RQ P +++ +G TL + A S
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGL---RQEIPSLNLFLGSGKSYTLCQLQAWS 1022
Query: 496 LAMGQRRTTGG---------SDDMCHVCGDGENLLLCNGCPLAFHAACL 535
+ R+ +DD C +CGDG L+ C+ CP ++H CL
Sbjct: 1023 IEHKARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCL 1071
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 88.2 bits (217), Expect = 2e-14, Method: Composition-based stats.
Identities = 40/71 (56%), Positives = 56/71 (78%)
Query: 775 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 834
+G FQALF CIERLL SL V++ VLPAA++AESIWT++FGF K++++ L +Y + + T+
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTV 62
Query: 835 FKGTSMLEKKV 845
F+GTS L K V
Sbjct: 63 FQGTSTLHKLV 73
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 66/102 (64%), Gaps = 1/102 (0%)
Query: 210 EYPTNVKKLLSTGILDGACVKYI-STSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHEFE 268
++P N+++LL+TG+L+G V YI S++ L G++ G C C CN SK VSA+ FE
Sbjct: 405 KHPGNIRELLNTGLLEGMPVMYIIPHSKKAVLKGVITGCNIRCFCLSCNGSKAVSAYYFE 464
Query: 269 QHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
QHAG+ +HP ++IYL NG + +++ +PL LE+ ++
Sbjct: 465 QHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIR 506
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 79/160 (49%), Gaps = 10/160 (6%)
Query: 696 LSSATAIFRECFDPIIAECGRDLIPVMVYGRNIS---------GQEFGGMYSVILTVKSV 746
+ A +F+ F P++ + GRDL+ ++ G G F G + +L +
Sbjct: 221 IDQALRLFKSSFSPLLMDNGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRGA 280
Query: 747 VVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAE 806
VV+A LR+FGR AELP VAT Y+ G + L +E LL S V LV+P+ +
Sbjct: 281 VVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPLL 340
Query: 807 SIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKVQ 846
+W KFGF ++ + + + D+ + + TS L V+
Sbjct: 341 PMWAAKFGFTPLTEQEVAAIE-DWVVDTDRETSTLSTVVR 379
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 120/269 (44%), Gaps = 46/269 (17%)
Query: 596 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC----CDDCNRIHAALQDFVSNRAQTI 651
++ CD+C FH C+ L+ P+ WFC C C S+
Sbjct: 423 LLMCDRCPSMFHHACV------GLESTPQGDWFCPACTCAICG----------SSDLDDP 466
Query: 652 PASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPII 711
PA++ + +G D + + Q ++++ EE+E + L A + RECF +I
Sbjct: 467 PATTTT--------QGFSSDRMVISCE-QCRRESRDGEEEEHAKLCMALDVLRECFVTLI 517
Query: 712 -AECGRDLIPVMVYG--RNISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVAT 768
DL +V+ + +F G Y V L +++ LR++G EVAE+PLV T
Sbjct: 518 EPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 577
Query: 769 CREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWT-KKFGFRKMSRERLLKYQ 827
+ +G + L I++LL + VE LVLPA + + WT FG R+M Q
Sbjct: 578 RFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIREMG-------Q 630
Query: 828 RDFQ------LTIFKGTSMLEKKVQCLPE 850
D Q + F+GT M K++ P+
Sbjct: 631 ADRQDVAHHAILRFQGTIMCHKQLPPQPQ 659
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 116/300 (38%), Gaps = 71/300 (23%)
Query: 352 SIISCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGG--T 409
++ SCS+ + P +C + +++ + + R V RP Q R EG T
Sbjct: 224 TMASCSNNNNKRPAPPAACKTATSSKKKKKKKASLQQAR---VLRP---QPRNEEGNALT 277
Query: 410 KKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIV---CDCC----NK---- 458
R L LL L ++L Y K G G+G+V C C NK
Sbjct: 278 PARARTLLSLLIDKKILAPRDQLIYTTKR-----GLITGDGMVKCMCGGCINNNNKRRVA 332
Query: 459 EISPSQFEAHA----GMAARRQPYRHIYTSNGMTLHDIAISLAMG--------------- 499
E + ++F H ++ RQP+ ++ +G +L + L M
Sbjct: 333 EYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKKY 392
Query: 500 ----------QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPN 549
+R+ D +C VC D LL+C+ CP FH AC+ P+ W CP
Sbjct: 393 LPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPA 452
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
C + ++ S DL + P + FS+ DR VI C+QC +E G
Sbjct: 453 C----TCAICGSSDLD---DPPATTT-------TQGFSS----DRMVISCEQCRRESRDG 494
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
Query: 196 RNMEIKMSKKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQ--LDGIVNGGGYLCGC 253
+N+E+KMSKK+ + P VK+L TG+L+G V Y+ + + L G + G LC C
Sbjct: 67 KNLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSC 126
Query: 254 PLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLGILEEVVK 310
CN +V+ +FE HA + R +I ENGK + ++ +TAPL LE ++
Sbjct: 127 AFCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQ 183
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 109/451 (24%), Positives = 178/451 (39%), Gaps = 74/451 (16%)
Query: 355 SCSSLALEESISPTSCSFVQDTGREQTHMKEMLEERKRGVKRPFMHQKRTAEGGTKKRDN 414
S S L + S S S D G+ H++ L V+RP +G +
Sbjct: 476 SASPLHHQTEKSTGSSSHHVDGGKSSKHVRSTLS-----VRRPVRGDNSEGDGFVPSSEK 530
Query: 415 DLHRLLFLPNG-LPDGERLTYIVKGQRLRFGCKQG----NGIVCDCCNKEISPSQFEAHA 469
+ +G L E++ Y+ QR +G +GI C CC+K +S S+FE HA
Sbjct: 531 QTILAWLIDSGTLKLSEKVMYM--NQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHA 588
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMC--HVCGDGENLLLCNGCP 527
G + RQP+++I+ + G++L I Q+ G+ C V D N C C
Sbjct: 589 G-SKLRQPFQNIFLNTGVSLFQCQIDAWDKQK--GAGNIGFCSVDVIADDPNDDACGICG 645
Query: 528 LAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFS 587
C D CP S+ R +D++G L D+
Sbjct: 646 DGGDLVCCDG---------CP------STFHQRCLDIRGHLMP--------------DWI 676
Query: 588 AATFDDRTVIYCDQCEKEFH---VGCLRKNGLCDLKEIPKDKWFCCDDCNR--IHAALQD 642
F+ R + H VG L K L +IP C R + ++
Sbjct: 677 FLRFNYRCFLLVIGIAPIVHANSVGQLLKMLLRPRMQIPAK-------CVRKNLSEGVKK 729
Query: 643 FVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAI 702
+V + + S S ++R+ + + + V+ S L+ A +
Sbjct: 730 YVGVKHELEAGFSWSLVHRECADSDLFLGEHPHIVE-------------NNSKLALALTV 776
Query: 703 FRECFDPII-AECGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLRIFGRE 759
ECF PI+ G +++ ++Y G N + FGG Y+ +L VV++ +R G
Sbjct: 777 MDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVASASIRFHGNH 836
Query: 760 VAELPLVATCREYQGKGCFQALFSCIERLLC 790
+AE+P + T Y+ +G + LFS IE + C
Sbjct: 837 LAEMPFIGTRHVYRHQGMCRRLFSVIESVKC 867
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 11/87 (12%)
Query: 212 PTNVKKLLSTGILDGACVK------YISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVSA 264
P++ K+ + ++D +K Y++ R L+G + G CGC C SK++S
Sbjct: 526 PSSEKQTILAWLIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGC--C--SKILSV 581
Query: 265 HEFEQHAGAKTRHPNNHIYLENGKPIY 291
+FE HAG+K R P +I+L G ++
Sbjct: 582 SKFEIHAGSKLRQPFQNIFLNTGVSLF 608
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 78/151 (51%), Gaps = 4/151 (2%)
Query: 696 LSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGL 752
L A I ECF II D+ +V+ R + F G Y ++L +VS G
Sbjct: 8 LCIAVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGT 67
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
RI G++ AELPL+ T Y+ +G + L + +E+LL L VE L+LPA + WT
Sbjct: 68 FRICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCS 127
Query: 813 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
FGF MS L+ + L+ F+GT+M +K
Sbjct: 128 FGFTVMSNSERLELAGNSILS-FQGTTMCQK 157
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 69/125 (55%), Gaps = 1/125 (0%)
Query: 722 MVYGRNISGQ-EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQA 780
MV ++ +G+ +F G+Y +LT + VVSA +L++ EVAEL L+AT E + KG F
Sbjct: 109 MVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSL 168
Query: 781 LFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSM 840
L S IE L + NV L P + IW++K G+ +S E+ L +F S+
Sbjct: 169 LLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTILSDEQKHSMLMAHPLVMFANLSL 228
Query: 841 LEKKV 845
++K +
Sbjct: 229 VQKSL 233
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 75.1 bits (183), Expect = 2e-10, Method: Composition-based stats.
Identities = 50/156 (32%), Positives = 79/156 (50%), Gaps = 4/156 (2%)
Query: 691 KEKSLLSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVV 747
+E L A + ECF ++ DL +V+ R + F G Y + L +
Sbjct: 4 EEHGKLCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGEL 63
Query: 748 VSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAES 807
++ G LR++G++VAELPLV T ++ +G L + +E LL VE LVLPA +
Sbjct: 64 ITVGTLRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQ 123
Query: 808 IWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
WT FGF+ M++ + L + + F+GT+M K
Sbjct: 124 TWTGSFGFQVMTQSQKLDIAQH-TIMCFQGTTMCHK 158
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 735 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDEDEE 794
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 795 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 841
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 842 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGLPPPKPLEGIPERE 901
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 902 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 931
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 324 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 383
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 384 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 430
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 431 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPKPLEGIPERE 490
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 491 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 520
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 74.3 bits (181), Expect = 2e-10, Method: Composition-based stats.
Identities = 52/206 (25%), Positives = 88/206 (42%), Gaps = 30/206 (14%)
Query: 627 WFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQ 686
+FC +C R + AL+ R++ T H ++
Sbjct: 67 YFCGKECQRSYEALEAATGRRSRIRDEPEQYTFELVHYKQD------------------- 107
Query: 687 CFEEKEKSLLSSATAIFRECFDPIIAECGRDLI---------PVMVYGRNISGQEFGGMY 737
+ +S + +A +FR F P+I E GRDL+ P G F
Sbjct: 108 --DRTVRSAVETAMRMFRTSFAPLIMENGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFR 165
Query: 738 SVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENL 797
IL + +++A LR+FG + AE+P V+T ++ G + L +E LL + V L
Sbjct: 166 LAILRMGGTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCL 225
Query: 798 VLPAAEKAESIWTKKFGFRKMSRERL 823
V+P+ + +WT KFGF K++ + +
Sbjct: 226 VIPSINELLPMWTNKFGFAKIATDEV 251
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 396
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 397 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 443
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 444 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPKPLEGIPERE 503
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 504 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 533
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDEDDE 430
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 431 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 477
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 478 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPKPLEGIPERE 537
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 538 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 567
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 440
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 441 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 487
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 488 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPKPLEGIPERE 547
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 548 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 577
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 330 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 389
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 390 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 436
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 437 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPKPLEGIPERE 496
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 497 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 526
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 105/250 (42%), Gaps = 58/250 (23%)
Query: 640 LQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLLSSA 699
LQ+ ++ + P S + R H E + + + K+ +C S ++ A
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEE--------VPEEVLALDKRVEC-----NSKIAVA 1288
Query: 700 TAIFRECFDPIIAE-CGRDLIPVMVY--GRNISGQEFGGMYSVILTVKSVVVSAGLLR-- 754
++ ECF PI+ + G +LI +VY G N + +F G Y +IL +++A +R
Sbjct: 1289 LSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLK 1348
Query: 755 ---------------------------------------IFGREVAELPLVATCREYQGK 775
I G ++AE+P + T Y+ +
Sbjct: 1349 EKNILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQ 1408
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 835
G + L IE +L SLN+E L++PA + WT KFGF + + + + +F
Sbjct: 1409 GMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEV-KSVSMLVF 1467
Query: 836 KGTSMLEKKV 845
GT +L+K +
Sbjct: 1468 PGTGLLQKPL 1477
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 60/116 (51%), Gaps = 16/116 (13%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMT---LHDIAISLAM---GQRR 502
+GI C CC++ IS +F HAG + PYR+I +G+ LH + + M +R+
Sbjct: 935 DGINCSCCSEVISVPEFVTHAGSEVNK-PYRNILV-DGLDIDLLHCLINAWNMQSDAERQ 992
Query: 503 T--------TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
+DD C +CGDG NL+ C+GCP FH +CL +P W C NC
Sbjct: 993 DFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNC 1048
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 482 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKEDDEEE 541
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 542 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLAEI 588
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + + PA + + +E GI +
Sbjct: 589 PNGEWLCPRCTCPPLKGKVQRILHWKWTEPPAPFMVGLPGPEVEPGIPPPKPLEGIPERE 648
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 649 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 678
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 320 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEEEEE 379
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 380 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 426
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 427 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPGVPPPKPLEGIPERE 486
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 487 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 516
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
LRI G +VAE+PLVAT +Y+ +G Q L +E++L L+VE LVLPA + +W
Sbjct: 95 LRIHGHKVAEMPLVATAFKYRRQGMCQVLVHELEKMLSQLHVERLVLPAISERSELWQSL 154
Query: 813 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
FGF +MS L+ R F F+GT+M +K
Sbjct: 155 FGFSEMSSAERLELLR-FPFLGFQGTTMFQK 184
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 19/136 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS---RSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +S+S + +
Sbjct: 330 DYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHCEKEGITSVSKGNKECE 389
Query: 564 LKGGLEAPGAEVGGCVICRSH----------DFSAATFDDRTVIYCDQCEKEFHVGCLRK 613
G A ++ G ++ H +F D +I CD C +H+ CL
Sbjct: 390 DSGSEPAVTSDKDGKIVSAVHPTSPEKDEHQEFCTECRDGGDLICCDNCPASYHIACL-- 447
Query: 614 NGLCDLKEIPKDKWFC 629
+ L IP+ W C
Sbjct: 448 --IPPLANIPEGVWLC 461
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C DG +L+ C+ CP ++H ACL P L IPE W CP C
Sbjct: 421 EFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRC 464
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 342 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 401
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 402 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 448
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 449 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPRPLEGIPERE 508
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 509 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 538
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 70 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 129
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 130 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 176
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 177 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPEVEPGMPPPRPLEGIPERE 236
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 237 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 266
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 54/123 (43%), Gaps = 25/123 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP AFH CLDP L PE W CPNC +G D
Sbjct: 349 DYCEVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWSCPNC-EGEGIPEPEPAD--- 404
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ C +C D ++ C+QC +H+ CL L++IP D
Sbjct: 405 ------EHMEFCRVCH---------DGGELLCCEQCPSSYHIFCLNP----PLRKIPDDD 445
Query: 627 WFC 629
W C
Sbjct: 446 WVC 448
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C CP ++H CL+P L IP+ W CP C
Sbjct: 408 EFCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPRC 451
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 72.8 bits (177), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 405
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 406 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 452
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 453 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPRPLEGIPERE 512
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 513 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 405
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 406 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 452
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 453 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPRPLEGIPERE 512
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 513 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 81/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDEEE 414
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 415 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 461
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E G+ +
Sbjct: 462 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFVVGLPGPEVEPGMPPPRPLEGIPERE 521
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 522 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 551
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 8/123 (6%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 413 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREEGSEG 472
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E +VG +F D ++ CD C +H+ CL L EIP +
Sbjct: 473 --EDDNGDVGEMEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNP----PLPEIPNGE 526
Query: 627 WFC 629
W C
Sbjct: 527 WIC 529
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 489 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 532
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 69/130 (53%), Gaps = 1/130 (0%)
Query: 717 DLIPVMVYGRN-ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGK 775
D+I MV G+N ++F GMY +LT + VVSA +L++ ++AEL L+AT E + K
Sbjct: 76 DIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKK 135
Query: 776 GCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTIF 835
G F L IE L + NV L P + IW++K GF +S E L +F
Sbjct: 136 GYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESMLESHPLVMF 195
Query: 836 KGTSMLEKKV 845
K +++K +
Sbjct: 196 KNLVLVQKSL 205
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 815 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 874
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 875 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 921
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 922 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 981
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 982 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 1011
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 447 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 506
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 507 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 553
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 554 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 613
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 614 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 643
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 65/150 (43%), Gaps = 9/150 (6%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R +G
Sbjct: 257 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREEGSEG 316
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ +VG +F D ++ CD C +H+ CL L EIP +
Sbjct: 317 --DEDNGDVGEMEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNP----PLPEIPNGE 370
Query: 627 WFCCD-DCNRIHAALQDFVSNRAQTIPASS 655
W C C + +Q ++ R P +
Sbjct: 371 WICPRCTCPSMKGKVQKILTWRWGEPPTPT 400
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEEEEE 427
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 428 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 474
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 475 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSVPPPKPLEGIPERE 534
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 535 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 564
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 57/125 (45%), Gaps = 11/125 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 413
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E E GG H +F D ++ CD C +H+ CL L EIP
Sbjct: 414 GEE--DNEAGGEAEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLN----PPLPEIPN 467
Query: 625 DKWFC 629
+W C
Sbjct: 468 GEWIC 472
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 432 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 475
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 594 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 653
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 654 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 700
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 701 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 760
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 761 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 790
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 403
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 404 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 450
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 451 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 510
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 511 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 403
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 404 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 450
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 451 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 510
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 511 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 88/213 (41%), Gaps = 40/213 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C + SS++ + + G
Sbjct: 154 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHCVR---SSVNIAFEENG 210
Query: 567 GLEAPGAE-VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
+E E + C +C+ D ++ CD C +HV CL N + +P
Sbjct: 211 PVEQDDDEHMEFCRVCK---------DGGELLCCDTCTSAYHVFCLAPN----VANVPDG 257
Query: 626 KWFCCD-DCNRIHAALQDFVSNRAQ-----------TIPAS--SLSTINRKHIEKGILFD 671
W C C + A +Q + R + P S + I R+H E+
Sbjct: 258 IWHCPRCSCQPLKARVQKILFWRWKVPMKVEDELDHVTPHSPNKKAQIEREHKEREFF-- 315
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C E L A+FR
Sbjct: 316 -----VKWHELSYWHCEWISELQLDVYHPAMFR 343
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 403
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 404 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 450
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 451 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 510
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 511 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 403
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 404 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 450
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 451 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 510
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 511 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 329 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 388
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 389 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 435
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 436 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 495
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 496 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 525
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 344 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 403
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 404 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 450
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 451 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 510
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 511 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 540
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 414 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 473
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 474 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 520
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 521 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 580
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 581 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 610
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 80/210 (38%), Gaps = 27/210 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 54 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 113
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 114 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 160
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND----- 676
P +W C C + +Q + R PA + + +E + +
Sbjct: 161 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPERE 220
Query: 677 --VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 221 FFVKWAGLSYWHCSWVKELQLELYHTVMYR 250
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 57/125 (45%), Gaps = 11/125 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 412
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E E GG H +F D ++ CD C +H+ CL L EIP
Sbjct: 413 GEE--DNEAGGEAEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPP----LPEIPN 466
Query: 625 DKWFC 629
+W C
Sbjct: 467 GEWIC 471
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 474
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDAEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 358 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 417
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 418 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 465
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 466 ---PLPEIPNGEWLC 477
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 437 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 480
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 306 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 365
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 366 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 413
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 414 ---PLPEIPNGEWLC 425
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 385 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 428
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 452
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 453 ---PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 352 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 411
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 412 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 459
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 460 ---PLPEIPNGEWLC 471
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 474
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 427
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 428 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 475
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 476 ---PLPEIPNGEWLC 487
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 447 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 490
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 424
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 425 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 472
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 473 ---PLPEIPNGEWLC 484
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 422 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 481
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 482 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 529
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 530 ---PLPEIPNGEWLC 541
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 501 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 544
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 353 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 412
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 413 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 460
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 461 ---PLPEIPNGEWLC 472
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 432 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 475
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 427
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 428 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 475
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 476 ---PLPEIPNGEWLC 487
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 447 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 490
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 428
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 429 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 476
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 477 ---PLPEIPNGEWLC 488
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 448 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 491
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 423
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 424 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 471
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 472 ---PLPEIPNGEWLC 483
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 443 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 486
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 451
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 452 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 499
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 500 ---PLPEIPNGEWLC 511
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 471 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 514
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 434
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 435 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 482
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 483 ---PLPEIPNGEWLC 494
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 454 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 497
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 361 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 420
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 421 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 468
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 469 ---PLPEIPNGEWLC 480
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 430
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 431 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 478
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 479 ---PLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 452
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 453 ---PLPEIPNGEWLC 464
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 225 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 284
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 285 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 332
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 333 ---PLPEIPNGEWLC 344
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 304 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 347
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 57/125 (45%), Gaps = 11/125 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D
Sbjct: 354 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEARE-DASE 412
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E E GG H +F D ++ CD C +H+ CL L EIP
Sbjct: 413 GEE--DNEAGGEAEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNPP----LPEIPN 466
Query: 625 DKWFC 629
+W C
Sbjct: 467 GEWIC 471
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 431 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 474
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 58/132 (43%), Gaps = 25/132 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C R+ S
Sbjct: 367 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDGSDG 426
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ D G +E + C +C+ D ++ CD C +H+ CL
Sbjct: 427 EEDNGDA-GDMEEDDHHMEFCRVCK---------DGGELLCCDSCPSSYHIHCLNP---- 472
Query: 618 DLKEIPKDKWFC 629
L EIP +W C
Sbjct: 473 PLPEIPNGEWIC 484
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 487
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 449 NGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD 508
+GI C CC+K ++ S+FE HAG + RQP+++I+ ++G + I +D
Sbjct: 582 DGIHCGCCSKILAVSKFEIHAG-SKLRQPFQNIFLNSGGAGN---IGFCSVDVIADDPND 637
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLD 536
D C +CGDG +L+ C+GCP FH CLD
Sbjct: 638 DACGICGDGGDLVCCDGCPSTFHQRCLD 665
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 23/128 (17%)
Query: 694 SLLSSATAIFRECFDPIIAE--------CGRDLIPVMVYG---------------RNISG 730
S L+ A + ECF PII C R+ V+ +G N +
Sbjct: 767 SKLALALTVMDECFLPIIDRRSGHCKKFCLRNFTTVIFFGISLCWFVCLYIAFRRSNFNR 826
Query: 731 QEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLC 790
FGG Y+ +L +V++ +R G +AE+P + T Y+ +G + LFS +E +
Sbjct: 827 LNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESVSS 886
Query: 791 SLNVENLV 798
+ +V L
Sbjct: 887 TADVAKLT 894
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 206 VVPNEYPTNVKKLLSTGILD-GACVKYISTSRER-QLDGIVNGGGYLCGCPLCNFSKVVS 263
V +E T + L+ +G L V Y++ R R L+G + G CGC C SK+++
Sbjct: 539 VPSSEKRTVLAWLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGC--C--SKILA 594
Query: 264 AHEFEQHAGAKTRHPNNHIYLENG 287
+FE HAG+K R P +I+L +G
Sbjct: 595 VSKFEIHAGSKLRQPFQNIFLNSG 618
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 57/132 (43%), Gaps = 24/132 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++ +S
Sbjct: 362 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDNSEI 421
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
D G E + C +C+ D ++ CD C +H+ CL
Sbjct: 422 DDDMDDTVGDPEEEDHHMEFCRVCK---------DGGELLCCDACPSSYHIHCLNP---- 468
Query: 618 DLKEIPKDKWFC 629
L EIP +W C
Sbjct: 469 PLPEIPNGEWLC 480
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 290 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 349
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 350 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 397
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 398 ---PLPEIPNGEWLC 409
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 369 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 412
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 57/132 (43%), Gaps = 24/132 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++ +S
Sbjct: 361 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDNSEI 420
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
D G E + C +C+ D ++ CD C +H+ CL
Sbjct: 421 DDDMDDTVGDPEEEDHHMEFCRVCK---------DGGELLCCDACPSSYHIHCLNP---- 467
Query: 618 DLKEIPKDKWFC 629
L EIP +W C
Sbjct: 468 PLPEIPNGEWLC 479
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 439 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 482
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 68/166 (40%), Gaps = 15/166 (9%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK- 565
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + SM V+ K
Sbjct: 51 DYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGSWSCPHCEK-EGISMGSQVEGKA 109
Query: 566 -GGLEAPGAEVGGCVIC-----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
G AP +F D +I C+ C +H+ CL + L
Sbjct: 110 TGTKMAPDKSAKQVAAASPEKDEHQEFCTECHDGGDLICCENCPVSYHLDCL----IPPL 165
Query: 620 KEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHI 664
IP+ W C C + A + ++ R P S + HI
Sbjct: 166 TNIPEGVWLCPRCGCKPLKARVSKILTWRWFEPPKESDELDHTHHI 211
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 116 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 175
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 176 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 223
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 224 ---PLPEIPNGEWLC 235
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 195 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 238
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 57/124 (45%), Gaps = 9/124 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R D+
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEARE-DVSD 439
Query: 567 GLEAPGAEVGGCVICRSH-DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
E G E G H +F D ++ CD C +H+ CL L EIP
Sbjct: 440 AEEDNG-ETGEMEEDDHHMEFCRVCKDGGELLCCDSCPSSYHIHCLNP----PLPEIPNG 494
Query: 626 KWFC 629
+W C
Sbjct: 495 EWIC 498
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 501
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 57/132 (43%), Gaps = 24/132 (18%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHSSS 557
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++ +S
Sbjct: 362 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDNSEL 421
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
D G E + C +C+ D ++ CD C +H+ CL
Sbjct: 422 DDDLDDAVGDPEEEDHHMEFCRVCK---------DGGELLCCDVCPSSYHIHCLNP---- 468
Query: 618 DLKEIPKDKWFC 629
L EIP +W C
Sbjct: 469 PLPEIPNGEWLC 480
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 440 EFCRVCKDGGELLCCDVCPSSYHIHCLNPPLPEIPNGEWLCPRC 483
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 9/125 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ D
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE-DNSE 424
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E VG H +F D ++ CD C +H+ CL L EIP
Sbjct: 425 GEETMEDAVGDAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNP----PLPEIPN 480
Query: 625 DKWFC 629
+W C
Sbjct: 481 GEWLC 485
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 445 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 78/206 (37%), Gaps = 20/206 (9%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + + D +
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCEKEGIQWEPKDDDEED 406
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E CR D ++ CD C +H+ CL L EIP +
Sbjct: 407 EEGGEEEEDDHMEFCR------VCKDGGELLCCDTCPSSYHLHCLNP----PLPEIPNGE 456
Query: 627 WFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND-------VQ 678
W C C + +Q + + PA ++ + + I + V+
Sbjct: 457 WLCPRCTCPPLKGKVQRILHWKWTEPPAPFITVLPPPDADPSIPLPKPLEGIPEREFFVK 516
Query: 679 WQMLKKAQCFEEKEKSLLSSATAIFR 704
W L C KE L T ++R
Sbjct: 517 WAGLSYWHCSWVKELQLELYHTVMYR 542
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 54/110 (49%), Gaps = 15/110 (13%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS---SSMSRS 561
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG + + +
Sbjct: 303 NEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCLQGRALQDTPHAEE 362
Query: 562 VDLKGGLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ LE+PGA G C +C D + C C FH+ C
Sbjct: 363 PRPQEPLESPGAAPGARCGVC---------GDGAEALRCAHCAAAFHLRC 403
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 9/125 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ D
Sbjct: 372 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE-DNSE 430
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E VG H +F D ++ CD C +H+ CL L EIP
Sbjct: 431 GEEILEDVVGDAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNP----PLPEIPN 486
Query: 625 DKWFC 629
+W C
Sbjct: 487 GEWLC 491
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 451 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 1/92 (1%)
Query: 754 RIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKF 813
RI G ++AE+P + T Y+ +G L + IE L SLNV LV+PA + ++ WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 814 GFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
GF+ + + K + + I GT +LEK++
Sbjct: 61 GFKPVEPSKRQKI-KSLNILIIHGTGLLEKRL 91
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 9/125 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ D
Sbjct: 372 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE-DNSE 430
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E VG H +F D ++ CD C +H+ CL L EIP
Sbjct: 431 GEEILEDVVGDAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNP----PLPEIPN 486
Query: 625 DKWFC 629
+W C
Sbjct: 487 GEWLC 491
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 451 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 9/125 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ D
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKE-DNSE 423
Query: 567 GLEAPGAEVGGCVICRSH--DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G E VG H +F D ++ CD C +H+ CL L EIP
Sbjct: 424 GEEILEDVVGDAEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNP----PLPEIPN 479
Query: 625 DKWFC 629
+W C
Sbjct: 480 GEWLC 484
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 444 EFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDDE 440
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
D G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 441 DEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 487
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q + R PA
Sbjct: 488 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 519
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 328 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDDE 387
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
D G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 388 DEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 434
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q + R PA
Sbjct: 435 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 466
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 432 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDDE 491
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
D G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 492 DEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 538
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q + R PA
Sbjct: 539 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 570
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G
Sbjct: 260 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 319
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 320 --EEILEEVGGDPEEEDDHHMEFCRVCK---------DGGELLCCDPCPSSYHIHCLNPP 368
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 369 ----LPEIPNGEWLC 379
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP+C
Sbjct: 339 EFCRVCKDGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 318 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDDE 377
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
D G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 378 DEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 424
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q + R PA
Sbjct: 425 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 456
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 62/152 (40%), Gaps = 20/152 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 432 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDDE 491
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
D G E + C +C+ D ++ CD C +H+ CL L EI
Sbjct: 492 DEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPEI 538
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q + R PA
Sbjct: 539 PNGEWLCPRCTCPPLKGKVQRILHWRWTEPPA 570
>gi|334821760|gb|AEG90850.1| chromodomain-helicase-DNA-binding protein 5 [Apostichopus
japonicus]
Length = 110
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 54/123 (43%), Gaps = 20/123 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C G + + V+ +
Sbjct: 5 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEAAPEGKWSCPQCEAGAAPPSAHVVEQEQ 64
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+F D ++ CDQC +H+ CL LK+IP +
Sbjct: 65 D--------------EHMEFCRVRQDGGELLCCDQCPSSYHIFCLNPP----LKKIPDGE 106
Query: 627 WFC 629
W C
Sbjct: 107 WLC 109
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 81/211 (38%), Gaps = 28/211 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQG----HSSSMSRS 561
D C VC G ++LC+ CP A+H CLDP L PE W CP+C ++G
Sbjct: 346 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 405
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
+ E + C +C+ D ++ CD C +H+ CL L E
Sbjct: 406 EEGGCEEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCLNP----PLPE 452
Query: 622 IPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND---- 676
IP +W C C + +Q + R PA + + +E + +
Sbjct: 453 IPNGEWLCPRCTCPPLKGKVQRILHWRWTEPPAPFMVGLPGPDVEPSLPPPKPLEGIPER 512
Query: 677 ---VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 513 EFFVKWAGLSYWHCSWVKELQLELYHTVMYR 543
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 74/151 (49%), Gaps = 18/151 (11%)
Query: 696 LSSATAIFRECFDPII-AECGRDLIPVMVYGRN--ISGQEFGGMYSVILTVKSVVVSAGL 752
L A + E F II RDL +V+ R + F G Y+++
Sbjct: 8 LCIALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL------------ 55
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
+ G++ AELPL+ T +Y+ +G + L + +E+LL L VE L+LP + WT
Sbjct: 56 --VCGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGS 113
Query: 813 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEK 843
FGF +MS +Y + L+ F+GT+M +K
Sbjct: 114 FGFTEMSYSDRFQYAANIILS-FQGTTMCQK 143
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 57/136 (41%), Gaps = 29/136 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R DL
Sbjct: 305 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARD-DLSD 363
Query: 567 G-------------LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRK 613
G E + C +C+ D ++ CD C +H+ CL
Sbjct: 364 GEGEDEEDRRDEGVEEEDDHHIEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP 414
Query: 614 NGLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 415 ----PLPEIPNGEWIC 426
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ C VC DG LL C+ CP ++H CL+P L IP W CP C+
Sbjct: 386 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 430
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 56/125 (44%), Gaps = 19/125 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H C DP L PE W CP+C +G + + +V K
Sbjct: 416 DYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHC-EGEGITAA-TVTEKA 473
Query: 567 GLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G A + C IC+ D ++ CD C +H CL L EIP
Sbjct: 474 GRNAADDDEHSEFCRICK---------DGGELLCCDSCTSAYHTFCLNP----PLSEIPD 520
Query: 625 DKWFC 629
W C
Sbjct: 521 GDWKC 525
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 485 EFCRICKDGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 63/152 (41%), Gaps = 12/152 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + +R DL
Sbjct: 387 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARD-DLSE 445
Query: 567 GLEAPGAEVGGCVICRSHD----FSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ + D F D ++ CD C +H+ CL L EI
Sbjct: 446 AEGEDDDDRRDEGMEEEDDHHIEFCRVCKDGGELLCCDTCPSSYHIHCLNP----PLPEI 501
Query: 623 PKDKWFCCD-DCNRIHAALQDFVSNRAQTIPA 653
P +W C C + +Q ++ R PA
Sbjct: 502 PNGEWICPRCKCPPMKGKVQKVLTWRWGEAPA 533
>gi|297820632|ref|XP_002878199.1| hypothetical protein ARALYDRAFT_907299 [Arabidopsis lyrata subsp.
lyrata]
gi|297324037|gb|EFH54458.1| hypothetical protein ARALYDRAFT_907299 [Arabidopsis lyrata subsp.
lyrata]
Length = 173
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/50 (60%), Positives = 38/50 (76%)
Query: 204 KKVVPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGC 253
+ ++ YP+NVK LL TGIL+GA VKYIST R+L GI++ GGYLCGC
Sbjct: 21 QGILDLSYPSNVKNLLETGILEGAPVKYISTPHVRELQGIIHSGGYLCGC 70
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 15/155 (9%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP AFH CLDP L PE W CP+C + + SRS
Sbjct: 366 DYCEVCQQGGEIMLCDTCPRAFHLVCLDPELEEAPEGSWSCPHCEKEGVVAASRSTTPAT 425
Query: 567 GLEA--------PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
G + A+ +F D +I C +C +H CL
Sbjct: 426 GGDMSQNPQNIRKSAQPNEEEKDEHQEFCNECKDGGDLICCAKCPVSYHPECL----YPP 481
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIP 652
L EIP+ W C C + A + ++ R P
Sbjct: 482 LSEIPEGPWLCPRCGCGPLKAKVHKILTWRWMEPP 516
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 59/135 (43%), Gaps = 29/135 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLD + PE W CP+C + ++ + +G
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG 404
Query: 567 GLEAPGAEVGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
E EVGG C +C+ D ++ CD C +H+ CL
Sbjct: 405 --EEILEEVGGDLEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 452
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 453 ---PLPEIPNGEWLC 464
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 55/126 (43%), Gaps = 22/126 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + P W CP+C + + D
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCE----NDLVNDNDAVT 395
Query: 567 GLEAPGAEVGG---CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
EA A+ G C +CR D ++ CD C +H CL + L IP
Sbjct: 396 SKEAAPAKAGNMEFCRLCR---------DGGELLCCDSCPSSYHRYCL----IPPLTTIP 442
Query: 624 KDKWFC 629
+ W C
Sbjct: 443 EGDWHC 448
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
G+ + C +C DG LL C+ CP ++H CL P L IPE W CP C
Sbjct: 404 AGNMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 45/99 (45%), Gaps = 29/99 (29%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
D +C VC DG LLLC+ CP AFH AC+ PE W CP CR
Sbjct: 465 DSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------------- 508
Query: 568 LEAPGAEVGGCVICRSHDF---SAATFDDRTVIYCDQCE 603
C +C D +A F D+T+IYC+QCE
Sbjct: 509 ----------CGVCGGSDLDDDTAEGFTDKTIIYCEQCE 537
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHE 266
V E+ + K LL +G+L+G V+Y+ E G+V G LC C C + +A
Sbjct: 530 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLGSGVVKSGVILCNCRHCKSHQGFNASS 589
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
FE+H G+ RHP++ I+L+NG+ + ++++
Sbjct: 590 FEKHVGSTARHPSDFIFLDNGRRLREVLED 619
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-----GHSSSMSRS 561
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 457 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 516
Query: 562 VDLKGGLEAPGAE---VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +GG E E + C +C+ D ++ CD C +H+ CL
Sbjct: 517 EEERGGKERRREEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 563
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 564 LPDIPNGEWLC 574
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RR + C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 525 RRREEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%)
Query: 207 VPNEYPTNVKKLLSTGILDGACVKYISTSRERQLDGIVNGGGYLCGCPLCNFSKVVSAHE 266
V E+ + K LL +G+L+G V+Y+ E G+V G LC C C + +A
Sbjct: 503 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLGSGVVKSGVILCNCRHCKSHQGFNASS 562
Query: 267 FEQHAGAKTRHPNNHIYLENGKPIYSIIQE 296
FE+H G+ RHP++ I+L+NG+ + ++++
Sbjct: 563 FEKHVGSTARHPSDFIFLDNGRRLREVLED 592
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 71/173 (41%), Gaps = 43/173 (24%)
Query: 460 ISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN 519
++P + + G+ +R+P R + S G+++ D C VC G
Sbjct: 318 VAPRKAKTKFGVKNKRKPKRKMNDSKS------------GEQQE---HQDYCEVCQQGGE 362
Query: 520 LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGG 577
++LC+ CP A+H CLDP L PE W CP+C + +G E E
Sbjct: 363 IILCDTCPRAYHLVCLDPELEETPEGKWSCPHC------------EAEGTQEQDDDEHNE 410
Query: 578 -CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
C +C+ D ++ CD C +H+ CL L EIP W C
Sbjct: 411 FCRLCK---------DGGELLCCDSCTSAYHIFCLNP----PLSEIPDGDWKC 450
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 409 NEFCRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-----GHSSSMSRS 561
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 203 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 262
Query: 562 VDLKGGLEAPGAE---VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +GG E E + C +C+ D ++ CD C +H+ CL
Sbjct: 263 EEERGGKERRREEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 309
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 310 LPDIPNGEWLC 320
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RR + C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 271 RRREEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 51/111 (45%), Gaps = 16/111 (14%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG----HSSSMSR 560
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG + S
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRIQQNLSQPEE 352
Query: 561 SVDLKGGLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
S L+ E PG + C +C D V+ C C FH C
Sbjct: 353 SRPLEPSAETPGPTLSARCGVC---------GDSTDVLRCAHCAAAFHWRC 394
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 59/143 (41%), Gaps = 35/143 (24%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------------- 552
D C VC G ++LC+ CP A+H CL+P L PE W CP+C +
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEDFED 438
Query: 553 GHSSSMSRSVDLKGGLEAPGAE------VGGCVICRSHDFSAATFDDRTVIYCDQCEKEF 606
S R + G A GAE + C +C+ D ++ CD C +
Sbjct: 439 FEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVCK---------DGGELLCCDTCTSSY 489
Query: 607 HVGCLRKNGLCDLKEIPKDKWFC 629
H+ CL L EIP +W C
Sbjct: 490 HIHCLNP----PLPEIPNGEWLC 508
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWR 546
+ ++ + +A G + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 448 ISEVGVRVATGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWL 507
Query: 547 CPNC 550
CP C
Sbjct: 508 CPRC 511
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 56/124 (45%), Gaps = 7/124 (5%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+GCP A+H CL+P L PE W CP C + R +
Sbjct: 345 DYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTCVKNGIKPKVRGAEKDE 404
Query: 567 GLEAPGAEVGGCVICRSH-DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
+ E H +F + D ++ CD C +H+ CL ++++P+
Sbjct: 405 DYDDLEEEEEAEENMDEHMEFCSRCKDGGDLLICDTCPHSYHLNCLNP----PVEKVPEG 460
Query: 626 KWFC 629
+W C
Sbjct: 461 EWSC 464
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C DG +LL+C+ CP ++H CL+P + +PE W CP C
Sbjct: 424 EFCSRCKDGGDLLICDTCPHSYHLNCLNPPVEKVPEGEWSCPRC 467
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + + +
Sbjct: 328 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEAKDDEEEE 387
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 388 EEAPGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEIP 434
Query: 624 KDKWFC 629
+W C
Sbjct: 435 NGEWLC 440
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 400 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 443
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 51/113 (45%), Gaps = 20/113 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 356
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SR +L E PG A C +C D V+ C C FH C
Sbjct: 357 SRPPELPA--ETPGPAPSARCSVC---------GDGTEVLRCAHCAAAFHWRC 398
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + + +
Sbjct: 352 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEAKDDEEEE 411
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 412 DEVAGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEIP 458
Query: 624 KDKWFC 629
+W C
Sbjct: 459 NGEWLC 464
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 424 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 467
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 72/162 (44%), Gaps = 22/162 (13%)
Query: 501 RRTTGGSD--DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
RR G D ++C CG L+LC+ CP+ +H CLDP L + + W CP C ++
Sbjct: 1415 RRGLGALDHNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLCVMDRTT 1474
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
+ + S + G + C CR +I CD C K FH+ C +
Sbjct: 1475 NGADSEEEMGSNDGEIEHEDVCSRCRHGG---------ELICCDTCPKAFHMECCKPV-- 1523
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLST 658
L+++PK W C++C + + + R T PA S+
Sbjct: 1524 --LRKVPKGHWE-CENCKKGTKS----AAIRVPTGPAKKFSS 1558
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 51/113 (45%), Gaps = 20/113 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 355
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SR +L E PG A C +C D V+ C C FH C
Sbjct: 356 SRPPELPA--ETPGPAPSARCSVC---------GDGTEVLRCAHCAAAFHWRC 397
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 26/44 (59%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D CHVC DG LL C+ CPL++H CL+P L IPE WRCP C
Sbjct: 1 DFCHVCKDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRC 44
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 91/245 (37%), Gaps = 56/245 (22%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H C DP L PE W CP+C +G D
Sbjct: 326 DYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHC-EGEGIKEQEEDD--- 381
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ C +C+ D ++ CD C +HV CL +K IP +
Sbjct: 382 -------HMEFCRVCK---------DGGELLCCDTCPSAYHVHCLNP----PMKMIPDGE 421
Query: 627 WFCCD-DCNRIHAALQDFVSNRAQT-----------------IPASSLSTINRKHIEKGI 668
W C C + + ++ R +PAS+ +NRK + +
Sbjct: 422 WHCPRCSCEPLKGRVAKILTWRWTEPIQEEGKMEELDHTHPHLPASTRKLMNRKPLREFF 481
Query: 669 LFDGTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNI 728
V+W L C E L A++R I + D P + G +
Sbjct: 482 --------VKWHDLSYWHCSWISELQLDVYHPAMYRA----YIRKNDMDEPPPLEDGSSY 529
Query: 729 SGQEF 733
G+++
Sbjct: 530 GGKDY 534
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 51/113 (45%), Gaps = 20/113 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 352
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SR +L E PG A C +C D V+ C C FH C
Sbjct: 353 SRPPELPA--ETPGPAPSARCSVC---------GDGTEVLRCAHCAAAFHWRC 394
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 51/113 (45%), Gaps = 20/113 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH------SSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG +
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRVQQNLSQPEV 351
Query: 559 SRSVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SR +L E PG A C +C D V+ C C FH C
Sbjct: 352 SRPPELPA--ETPGPAPSARCSVC---------GDGTEVLRCAHCAAAFHWRC 393
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + + +
Sbjct: 505 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEAKDEEEDE 564
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 565 EEPVGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEIP 611
Query: 624 KDKWFC 629
+W C
Sbjct: 612 NGEWLC 617
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 577 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 620
>gi|301776989|ref|XP_002923917.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Ailuropoda melanoleuca]
Length = 1948
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 77/209 (36%), Gaps = 28/209 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 345 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 404
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ C +L +
Sbjct: 405 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCXTPTPTLNLGTL 455
Query: 623 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND------ 676
C C + +Q + R PA + + +E G+ +
Sbjct: 456 ------LCPQCPPLKGKVQRILHWRWTEPPAPFMVGLPGSDVEPGVPPPKPLEGIPEREF 509
Query: 677 -VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 510 FVKWAGLSYWHCSWVKELQLELYHTVMYR 538
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 52/130 (40%), Gaps = 40/130 (30%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMS 559
G D C VC GD +LL C GC L +HA CLD P +GW+CP C+
Sbjct: 211 AGEDSWCAVCDSAGDLTDLLFCTGCGLHYHATCLDTGATPILRAGWQCPECKV------- 263
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
C CR +D ++ CD CEK H CL+ +
Sbjct: 264 ------------------CQTCRQ------PGEDSKMLVCDSCEKGCHTFCLQPA----M 295
Query: 620 KEIPKDKWFC 629
+P D+W C
Sbjct: 296 DSVPSDRWKC 305
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 20/136 (14%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q S+S
Sbjct: 878 VCEVCGEASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSPGFH 937
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C +CR + ++ ++ C C++ H C L
Sbjct: 938 CEWQNNYTHCGPCASLVTCPVCREN-----FMEEELLLQCQYCDRWVHAVC---ESLYTE 989
Query: 620 KEIPK--DKWFCCDDC 633
E+ + D+ F C C
Sbjct: 990 DEVEQASDEGFACTSC 1005
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 56/125 (44%), Gaps = 17/125 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQG-HSSSMSRSVDL 564
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++G +
Sbjct: 356 DYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCEKEGIQWEPKDDDEED 415
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+ E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 416 EDLCEEADDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEIPN 462
Query: 625 DKWFC 629
+W C
Sbjct: 463 GEWLC 467
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 427 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 470
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 57/125 (45%), Gaps = 17/125 (13%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQG-HSSSMSRSVDL 564
D C VC G ++LC+ CP A+H CLDP L PE W CP+C ++G +
Sbjct: 276 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKEEDEEE 335
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+ G E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 336 EEGGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP----PLPEIPN 382
Query: 625 DKWFC 629
+W C
Sbjct: 383 GEWLC 387
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 347 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 56/126 (44%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +
Sbjct: 335 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKEEEDEE 394
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+GG E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 395 EEGGEEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP----PLPEIP 441
Query: 624 KDKWFC 629
+W C
Sbjct: 442 NGEWLC 447
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 56/126 (44%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CLDP L PE W CP+C + +
Sbjct: 320 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKEEEDEE 379
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+GG E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 380 EEGGEEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP----PLPEIP 426
Query: 624 KDKWFC 629
+W C
Sbjct: 427 NGEWLC 432
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 51/124 (41%), Gaps = 28/124 (22%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +G
Sbjct: 374 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN------------EG 421
Query: 567 GLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
E E C IC+ D ++ CD C +H CL L EIP
Sbjct: 422 PAEQDDDEHQEFCRICK---------DGGELLCCDSCPSAYHTHCLNP----PLVEIPDG 468
Query: 626 KWFC 629
W C
Sbjct: 469 DWKC 472
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ W+CP C
Sbjct: 432 EFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 475
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 51/124 (41%), Gaps = 28/124 (22%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +G
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN------------EG 424
Query: 567 GLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
E E C IC+ D ++ CD C +H CL L EIP
Sbjct: 425 PAEQDDDEHQEFCRICK---------DGGELLCCDSCPSAYHTHCLNP----PLVEIPDG 471
Query: 626 KWFC 629
W C
Sbjct: 472 DWKC 475
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ W+CP C
Sbjct: 435 EFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 478
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 52/130 (40%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 439 DYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHCEKEGVQWEPKEEEDEY 498
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
E + C +C+ D ++ CD C +H+ CL L
Sbjct: 499 EGEMDDAEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 545
Query: 620 KEIPKDKWFC 629
EIP +W C
Sbjct: 546 PEIPNGEWLC 555
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 515 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPEIPNGEWLCPRC 558
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 60/144 (41%), Gaps = 34/144 (23%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVD-- 563
C +C GD + LLLC+ C FH CL P L +P+ W CP C+ + SR D
Sbjct: 1092 CKICRKKGDEDKLLLCDECNQPFHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDYA 1151
Query: 564 -LKGGL-----------EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
L GG E+ C +C DD+ ++YC +C +H C
Sbjct: 1152 ELNGGSDSDEYDETDSDESEAEHDEMCCMCD---------DDQELVYCSRCPAAYHREC- 1201
Query: 612 RKNGLCDLKEIPKDKWFC--CDDC 633
L+ P+ KW C C +C
Sbjct: 1202 ---HDPPLRNFPRGKWVCSACTNC 1222
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D+MC +C D + L+ C+ CP A+H C DP L P W C C
Sbjct: 1175 DEMCCMCDDDQELVYCSRCPAAYHRECHDPPLRNFPRGKWVCSAC 1219
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 53/129 (41%), Gaps = 21/129 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR------QGHSSSMSR 560
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKEDDEDE 399
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
+ E + C +C+ D ++ CD C +H+ CL L
Sbjct: 400 EDGAEEEEEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHLHCLNP----PLP 446
Query: 621 EIPKDKWFC 629
EIP +W C
Sbjct: 447 EIPNGEWLC 455
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 415 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|170592228|ref|XP_001900871.1| CHD4 protein [Brugia malayi]
gi|158591738|gb|EDP30342.1| CHD4 protein, putative [Brugia malayi]
Length = 1846
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 52/125 (41%), Gaps = 25/125 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G + +
Sbjct: 257 DYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCESTGATKDDEEEKKIT 316
Query: 566 GGLEA-PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+E + GG ++C CD C +H CL + L EIP+
Sbjct: 317 TNMEYCRTCKEGGWLLC-----------------CDTCPSSYHAYCLNPS----LTEIPE 355
Query: 625 DKWFC 629
W C
Sbjct: 356 GDWSC 360
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 320 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 363
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 49/128 (38%), Gaps = 36/128 (28%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 457 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCE--------------- 501
Query: 567 GLEAPGAEVGG-----CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
E P E C IC+ D ++ CD C +H CL L +
Sbjct: 502 -AEGPADEDDDEHQEFCRICK---------DGGELLCCDNCPSAYHTFCLNP----PLDD 547
Query: 622 IPKDKWFC 629
IP W C
Sbjct: 548 IPDGDWRC 555
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 515 EFCRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPRC 558
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 55/126 (43%), Gaps = 18/126 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---GHSSSMSRSVD 563
D C VC G ++LC+ CP A+H CL+P L PE W CP+C + + +
Sbjct: 330 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCEKEGIQWEAKGEEEEE 389
Query: 564 LKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
+ E + C +C+ D ++ CD C +H+ CL L EIP
Sbjct: 390 EEAAGEEEDDHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEIP 436
Query: 624 KDKWFC 629
+W C
Sbjct: 437 NGEWLC 442
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 402 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 445
>gi|198428271|ref|XP_002124973.1| PREDICTED: similar to RING and PHD-finger domain-containing protein
KIAA1542 [Ciona intestinalis]
Length = 1966
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 6/76 (7%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--I 540
G+ +H++ + + T +D C VCG GEN LLLC+GC L +H ACL+P L +
Sbjct: 121 GVFVHELVVEKKINVVET-HENDTNCQVCGSGENEETLLLCDGCDLGYHCACLNPPLEQV 179
Query: 541 PESGWRCPNCRQGHSS 556
P W C NCR H++
Sbjct: 180 PSDEWFCINCRPSHAA 195
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 33/63 (52%), Gaps = 11/63 (17%)
Query: 577 GCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRI 636
C +C S + ++ T++ CD C+ +H CL L+++P D+WFC +C
Sbjct: 144 NCQVCGSGE------NEETLLLCDGCDLGYHCACLN----PPLEQVPSDEWFCI-NCRPS 192
Query: 637 HAA 639
HAA
Sbjct: 193 HAA 195
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 61/143 (42%), Gaps = 35/143 (24%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------------- 552
D C VC G ++LC+ CP A+H CL+P L PE W CP+C +
Sbjct: 381 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEDFED 440
Query: 553 GHSSSMSRSV-DLKGGLEAPGAE-----VGGCVICRSHDFSAATFDDRTVIYCDQCEKEF 606
S R + ++ G+ A G + + C +C+ D ++ CD C +
Sbjct: 441 FEEDSEDRVISEVSSGVPAGGDDEDDDHMEFCRVCK---------DGGELLCCDTCTSSY 491
Query: 607 HVGCLRKNGLCDLKEIPKDKWFC 629
H+ CL L EIP +W C
Sbjct: 492 HIHCLNP----PLPEIPNGEWLC 510
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 8/54 (14%)
Query: 505 GGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
GG D+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 460 GGDDEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 513
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2590 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2642
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2643 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2683
Query: 626 KWFC 629
KW+C
Sbjct: 2684 KWYC 2687
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2650 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2702
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2703 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2743
Query: 626 KWFC 629
KW+C
Sbjct: 2744 KWYC 2747
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2636 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2688
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2689 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2729
Query: 626 KWFC 629
KW+C
Sbjct: 2730 KWYC 2733
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2641 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2693
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2694 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2734
Query: 626 KWFC 629
KW+C
Sbjct: 2735 KWYC 2738
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 50/128 (39%), Gaps = 36/128 (28%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 414 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCEA-------------- 459
Query: 567 GLEAPGAEVGG-----CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
E P E C +C+ D ++ CD C +H CL L +
Sbjct: 460 --EGPADEDDDEHQEFCRVCK---------DGGELLCCDNCPSAYHTFCLNP----PLDD 504
Query: 622 IPKDKWFC 629
IP +W C
Sbjct: 505 IPDGEWRC 512
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 472 EFCRVCKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGEWRCPRC 515
>gi|449683391|ref|XP_002153791.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Hydra
magnipapillata]
Length = 1699
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 21/124 (16%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
+ +C VCG+G ++LLC+ C +H CLDP L +PE W CP C S + D
Sbjct: 164 NSVCDVCGEGGDILLCDTCTCVWHLTCLDPPLDEVPEGDWSCPKCEDELSGPVEEDED-- 221
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C ICR D ++ CD C +H+ C++ L +P+
Sbjct: 222 ----DENFHGDYCKICR---------DGGELLCCDFCPGTYHMRCVKPQ----LITVPEG 264
Query: 626 KWFC 629
+W C
Sbjct: 265 EWKC 268
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLI--PESGWRCPNCR 551
D C +C DG LL C+ CP +H C+ P LI PE W+CP C+
Sbjct: 228 DYCKICRDGGELLCCDFCPGTYHMRCVKPQLITVPEGEWKCPLCK 272
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2604 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2656
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2657 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2697
Query: 626 KWFC 629
KW+C
Sbjct: 2698 KWYC 2701
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2568 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2620
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2621 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2661
Query: 626 KWFC 629
KW+C
Sbjct: 2662 KWYC 2665
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2873 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2925
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2926 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2966
Query: 626 KWFC 629
KW+C
Sbjct: 2967 KWYC 2970
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2542 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2594
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2595 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2635
Query: 626 KWFC 629
KW+C
Sbjct: 2636 KWYC 2639
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 2509 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 2562
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2563 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2602
Query: 626 KWFC 629
KW+C
Sbjct: 2603 KWYC 2606
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 2454 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 2507
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2508 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2547
Query: 626 KWFC 629
KW+C
Sbjct: 2548 KWYC 2551
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 56/135 (41%), Gaps = 27/135 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQGHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP+C ++G +
Sbjct: 386 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKDELSEG 445
Query: 566 GGL-----------EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
G E + C +C+ D ++ CD C +H+ CL
Sbjct: 446 EGEDEEDRRDEGVEEEDDHHIEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP- 495
Query: 615 GLCDLKEIPKDKWFC 629
L EIP +W C
Sbjct: 496 ---PLPEIPNGEWIC 507
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ C VC DG LL C+ CP ++H CL+P L IP W CP C+
Sbjct: 467 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 511
>gi|402590896|gb|EJW84826.1| chromodomain-helicase-DNA-binding protein 4 [Wuchereria bancrofti]
Length = 1519
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 52/125 (41%), Gaps = 25/125 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G + +
Sbjct: 171 DYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCESTGAAKDDEEEKKIT 230
Query: 566 GGLEA-PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+E + GG ++C CD C +H CL + L EIP+
Sbjct: 231 TNMEYCRTCKEGGWLLC-----------------CDTCPSSYHAYCLNPS----LTEIPE 269
Query: 625 DKWFC 629
W C
Sbjct: 270 GDWSC 274
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 234 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|242036737|ref|XP_002465763.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
gi|241919617|gb|EER92761.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
Length = 158
Score = 63.5 bits (153), Expect = 4e-07, Method: Composition-based stats.
Identities = 31/72 (43%), Positives = 43/72 (59%), Gaps = 4/72 (5%)
Query: 247 GGYLCGCPL---CNF-SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPL 302
GGY C CP C + KV+SA +FE+HAGA++ +PN HI L NGK +Y + +L+ P
Sbjct: 3 GGYACACPASAGCGYHGKVLSARQFEKHAGAESNNPNGHILLRNGKSLYQLFHDLRHVPA 62
Query: 303 GILEEVVKKVAG 314
L + AG
Sbjct: 63 EALAAKFLEFAG 74
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 49/128 (38%), Gaps = 36/128 (28%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP C
Sbjct: 379 DYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTCEA-------------- 424
Query: 567 GLEAPGAEVGG-----CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
E P E C +C+ D ++ CD C +H CL L +
Sbjct: 425 --EGPADEDDDEHQEFCRVCK---------DGGEMLCCDSCPSAYHTWCL----TPPLDD 469
Query: 622 IPKDKWFC 629
IP W C
Sbjct: 470 IPDGDWRC 477
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG +L C+ CP A+H CL P L IP+ WRCP C
Sbjct: 437 EFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRC 480
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 52/123 (42%), Gaps = 20/123 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + K
Sbjct: 257 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWSCPSCEAAGIPQKDEEEEKK- 315
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ C +C+ D ++ CD C +H C+ L E+P+ +
Sbjct: 316 ----VATNMEYCRVCK---------DVGWLLCCDTCPSSYHAYCMNP----PLTEVPEGE 358
Query: 627 WFC 629
W C
Sbjct: 359 WSC 361
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC D LL C+ CP ++HA C++P L +PE W CP C
Sbjct: 321 EYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPRC 364
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 2657 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 2710
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2711 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2750
Query: 626 KWFC 629
KW+C
Sbjct: 2751 KWYC 2754
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2620 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2672
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2673 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2713
Query: 626 KWFC 629
KW+C
Sbjct: 2714 KWYC 2717
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 2720 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 2773
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2774 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2813
Query: 626 KWFC 629
KW+C
Sbjct: 2814 KWYC 2817
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 17/156 (10%)
Query: 494 ISLAMGQRRTTGG---SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
++ A + +TG S+D C VC + E L+ C CP +FH C+ P WRC C
Sbjct: 975 VTSAEDEEESTGEVPESNDFCEVCLNDEQLISCGSCPRSFHLICIQMKRAPRRDWRCLAC 1034
Query: 551 RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSA------ATFDDRTVIYCDQCEK 604
G DLK +E E DFS R I C C +
Sbjct: 1035 TAGVKKYKQELKDLKKIIEE--KEAFEAKDSNEEDFSINQCLKCGELLSRGHIECIGCGR 1092
Query: 605 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
++H+ C DL + PK W+C C + +L
Sbjct: 1093 KYHLAC------ADLTKRPKGDWYCKKRCEPGYVSL 1122
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 50/130 (38%), Gaps = 13/130 (10%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS 559
G ++ C VC + ++ C C L FH C+ P L IP W C C
Sbjct: 910 GLTNARCKVCRGKATPDRMIRCETCDLVFHLPCIKPALREIPRGEWFCKACTPETVPDSP 969
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
R E G + S+DF +D +I C C + FH+ C++ +
Sbjct: 970 RKKPKVTSAEDEEESTGE--VPESNDFCEVCLNDEQLISCGSCPRSFHLICIQ------M 1021
Query: 620 KEIPKDKWFC 629
K P+ W C
Sbjct: 1022 KRAPRRDWRC 1031
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 65/161 (40%), Gaps = 26/161 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W C C Q + D
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYC-QAEGNQEQEDDDEHQ 434
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C IC+ D ++ CD C +H CL L+E+P +
Sbjct: 435 EF---------CRICK---------DGGELLCCDSCPSAYHRFCLNP----PLEEVPDGE 472
Query: 627 WFCCD-DCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 666
W C C + + ++ R + PA S + +R+ K
Sbjct: 473 WKCPRCSCPPLDGKVAKILTWRWKEQPAKSKAPRSREFFVK 513
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 2877 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 2930
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2931 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2970
Query: 626 KWFC 629
KW+C
Sbjct: 2971 KWYC 2974
>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
Length = 2123
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 51/124 (41%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++
Sbjct: 1633 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERK------ 1686
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 1687 ------------CIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 1726
Query: 626 KWFC 629
KW+C
Sbjct: 1727 KWYC 1730
>gi|393911013|gb|EJD76123.1| LET-418 protein [Loa loa]
Length = 1755
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 51/125 (40%), Gaps = 25/125 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G +
Sbjct: 171 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCESTGAPKEDEEEKKVT 230
Query: 566 GGLEA-PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+E + GG ++C CD C +H CL + L EIP+
Sbjct: 231 TNMEYCRTCKEGGWLLC-----------------CDTCPSSYHAYCLNPS----LTEIPE 269
Query: 625 DKWFC 629
W C
Sbjct: 270 GDWSC 274
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 234 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
Length = 3086
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 2585 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 2637
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 2638 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 2678
Query: 626 KWFC 629
KW+C
Sbjct: 2679 KWYC 2682
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 64/144 (44%), Gaps = 21/144 (14%)
Query: 428 DGERLTYIVK-GQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNG 486
DGE L Y K GQ + G + GI +K + + FE AG R P H +T+ G
Sbjct: 20 DGELLRYKSKHGQLMAVGRAREGGIEVAGTSKLLGYTAFEDIAGSKYHR-PAEHTHTTTG 78
Query: 487 MTLHDIAISLAMGQRRTTGG-------------------SDDMCHVCGDGENLLLCNGCP 527
TL +A+ G G +DD+CH+CG G +L+ C CP
Sbjct: 79 RTLQSLALLAGGGAGGGAGKRGGKAAPPPEPVVELLEDDNDDLCHICGLGGDLMCCETCP 138
Query: 528 LAFHAACLDPLLIPESGWRCPNCR 551
FHAACL PE + CP CR
Sbjct: 139 GVFHAACLGLAAPPEGDYHCPLCR 162
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 60/144 (41%), Gaps = 37/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQGHS---------- 555
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++G
Sbjct: 275 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEEFED 334
Query: 556 ----------SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
S +S V + G E + C +C+ D ++ CD C
Sbjct: 335 FEEDSEDRVISEVSLGVPM-GAEEEDDDHMEFCRVCK---------DGGELLCCDTCTSS 384
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFC 629
+H+ CL L EIP +W C
Sbjct: 385 YHIHCLNP----PLPEIPNGEWLC 404
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGW 545
+ ++++ + MG + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 343 VISEVSLGVPMGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEW 402
Query: 546 RCPNC 550
CP C
Sbjct: 403 LCPRC 407
>gi|312077956|ref|XP_003141528.1| CHromoDomain protein family member [Loa loa]
Length = 1696
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 51/125 (40%), Gaps = 25/125 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-GHSSSMSRSVDLK 565
D C VC G ++LC+ CP A+H CLDP + PE W CP C G +
Sbjct: 112 DYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCESTGAPKEDEEEKKVT 171
Query: 566 GGLEA-PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
+E + GG ++C CD C +H CL + L EIP+
Sbjct: 172 TNMEYCRTCKEGGWLLC-----------------CDTCPSSYHAYCLNPS----LTEIPE 210
Query: 625 DKWFC 629
W C
Sbjct: 211 GDWSC 215
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C C +G LL C+ CP ++HA CL+P L IPE W CP C
Sbjct: 175 EYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 218
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 59/144 (40%), Gaps = 37/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-RQGHS---------- 555
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++G
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCEKEGIQWEAKDEEFED 451
Query: 556 ----------SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKE 605
S +S V G E + C +C+ D ++ CD C
Sbjct: 452 FEEDSEDRVISEVSLGVPT-GAEEEDDDHMEFCRVCK---------DGGELLCCDTCTSS 501
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFC 629
+H+ CL L EIP +W C
Sbjct: 502 YHIHCLNP----PLPEIPNGEWLC 521
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGW 545
+ ++++ + G + C VC DG LL C+ C ++H CL+P L IP W
Sbjct: 460 VISEVSLGVPTGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEW 519
Query: 546 RCPNC 550
CP C
Sbjct: 520 LCPRC 524
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 74/192 (38%), Gaps = 30/192 (15%)
Query: 477 PYRHIYTSNGMTLHDIAISLAMGQRRTTG---GSDDMCHVCGDGENLLLCNGCPLAFHAA 533
P R T G L A++ G+ G D C VC G ++LC+ CP A+H
Sbjct: 254 PIRRAKTKEGKVLGCPAVA---GEEEVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLV 310
Query: 534 CLDPLL--IPESGWRCPNCRQ--------GHSSSMSRSVDLKGGLEAPGAEVGGCVICRS 583
CLDP L PE W CP+C + + +G E + C +C+
Sbjct: 311 CLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCK- 369
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD-DCNRIHAALQD 642
D ++ CD C +H+ CL L +IP +W C C + +Q
Sbjct: 370 --------DGGELLCCDACISSYHIHCLNP----PLPDIPNGEWLCPRCTCPVLKGRVQK 417
Query: 643 FVSNRAQTIPAS 654
+ R P S
Sbjct: 418 ILHWRWGEPPVS 429
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 63/156 (40%), Gaps = 23/156 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ +G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 440 EEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 486
Query: 620 KEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
+IP +W C C + +Q + R P S
Sbjct: 487 PDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 522
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 66/146 (45%), Gaps = 29/146 (19%)
Query: 699 ATAIFRECFDPII-AECGRDLIPVMVYGRNISGQEFGGMYSVILTVKSVVVSAGLLRIFG 757
A I ECF II D+ +V+ R I GQ+F
Sbjct: 774 AVDILHECFVTIIEPRTQSDISEDIVFNREICGQKF------------------------ 809
Query: 758 REVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRK 817
AELPL+ T Y+ +G + L + +E+LL L VE L+LPA + WT FGF
Sbjct: 810 ---AELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGFTV 866
Query: 818 MSRERLLKYQRDFQLTIFKGTSMLEK 843
MS L+ + L+ F+GT+M +K
Sbjct: 867 MSNSERLELAGNSILS-FQGTTMCQK 891
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 10/127 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP + PE W CP+C Q + S
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCEKEGIQWEAREESSEG 435
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ + +F D ++ CD C +H+ CL L +I
Sbjct: 436 EEENDDGRRDDGDVEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHLHCLNP----PLPDI 491
Query: 623 PKDKWFC 629
P +W C
Sbjct: 492 PNGEWIC 498
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 62/142 (43%), Gaps = 28/142 (19%)
Query: 500 QRRTTGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC--- 550
+R+ GGS + C VC DG +L+LC+ CP +FH +C++ IP+ W CP C
Sbjct: 443 ERKEGGGSGNESDHFWYCEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTGE 502
Query: 551 ---RQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFH 607
G SS+ S + E C +C+ VI CD C FH
Sbjct: 503 GLPEDGDSSN---SAQEEEEGEEETEHDQFCKVCKRGG---------DVILCDFCSCVFH 550
Query: 608 VGCLRKNGLCDLKEIPKDKWFC 629
+ CL L E+P+ W C
Sbjct: 551 LRCLNP----PLGEVPEGDWKC 568
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV 562
D C VC G +++LC+ C FH CL+P L +PE W+CP C+ S + V
Sbjct: 527 DQFCKVCKRGGDVILCDFCSCVFHLRCLNPPLGEVPEGDWKCPRCKISEKSPKGKVV 583
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 10/127 (7%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP + PE W CP+C Q + S
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCEKEGIQWEAREESSEG 435
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ + +F D ++ CD C +H+ CL L +I
Sbjct: 436 EEENDDGRRDDGDVEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHLHCLNP----PLPDI 491
Query: 623 PKDKWFC 629
P +W C
Sbjct: 492 PNGEWIC 498
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 436
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 437 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDTCISSYHIHCLNP----P 483
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P S
Sbjct: 484 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 520
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 255 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 314
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 315 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 361
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P S
Sbjct: 362 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 398
>gi|255081825|ref|XP_002508131.1| SNF2 super family [Micromonas sp. RCC299]
gi|226523407|gb|ACO69389.1| SNF2 super family [Micromonas sp. RCC299]
Length = 2064
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 94/252 (37%), Gaps = 91/252 (36%)
Query: 511 CHVCGDGE---NLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--------------- 550
C CG G+ NL+LC+ CP +H CL P L +P W CP C
Sbjct: 28 CERCGLGDDEPNLVLCDDCPRGWHVYCLRPKLPHVPRGSWSCPRCAPAKKGSGGGGGGDD 87
Query: 551 -----------------RQGHSSSMSRS-----VDLKGGLEAP---------GAE----- 574
R G +RS V G +AP GAE
Sbjct: 88 ARGSPAAPASGDRAKRRRTGTRDETTRSTRRSGVAAAGSTKAPDDDEMSDREGAEDDEMR 147
Query: 575 VGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCC---- 630
CV C D D+ ++ CD C+ H+ CLR L ++P+ +WFC
Sbjct: 148 TMRCVTCDLGD------DENKMVLCDGCDAGHHLYCLRPK----LSQVPRGRWFCPACEI 197
Query: 631 -DDCNRIHA-------ALQDFVSNR------------AQTIPASSLSTINRKHIEKGILF 670
+D R A AL+ V+N ++ +P + L+T+ R +G F
Sbjct: 198 REDARRRSAEATAATKALRVAVANEYADGRVVRAILGSRLVPETRLATL-RADATRGPAF 256
Query: 671 DGTMNDVQWQML 682
+G +V++ L
Sbjct: 257 EGRSAEVEFLAL 268
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 203 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 262
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 263 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 309
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 310 PDIPNGEWLC 319
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 279 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|328717947|ref|XP_001943997.2| PREDICTED: hypothetical protein LOC100159693, partial
[Acyrthosiphon pisum]
Length = 2904
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 52/131 (39%), Gaps = 42/131 (32%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD NL++C C +H CL L+P +GW+C NCR
Sbjct: 388 ICSTLGDVSNLMMCTACGSHYHGVCLGLALLPGVRAGWQCGNCR---------------- 431
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
C +CR + V+ C+ C+K +H GCLR + IPK W
Sbjct: 432 ---------ICQVCRQ------PAEQTKVMLCEGCDKAYHPGCLRPQ----VTTIPKIGW 472
Query: 628 FC-----CDDC 633
C C DC
Sbjct: 473 KCKCCRVCTDC 483
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 56/140 (40%), Gaps = 36/140 (25%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D L+LC+ C +++H C DP L +P W+C C Q L
Sbjct: 773 VCEGCGQRNDESRLILCDECDISYHIYCTDPKLDYVPRGTWKCKWCAQC----------L 822
Query: 565 KGGLEAPG---------AEVGGCV---ICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
G PG E G C IC S S D++ +I C QC++ H
Sbjct: 823 TCGSNDPGFNCSWLNNYTECGPCASRSICPSCQESYT--DNQLIIKCSQCDRWLH----- 875
Query: 613 KNGLCDLKEIPKDKWFCCDD 632
G CD E D C +D
Sbjct: 876 --GKCDKIENEDDAEKCAED 893
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 37/176 (21%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSN--GMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGE 518
+P E G+A R+ TS M L+ + S+A + C C G+
Sbjct: 1889 APDPNEIPKGLAVWREAVNRAQTSAQLAMCLYSLESSIAWDKSIMKAN----CQFCHSGD 1944
Query: 519 N---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGA 573
N LLLC+GC +H C P + IPE W C C M+++ +
Sbjct: 1945 NEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHEC-------MNKATGER-------- 1989
Query: 574 EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
C++C ++ T +I C+ C + +H C+ + ++P+ KW+C
Sbjct: 1990 ---NCIVCGKKSSTSGT----RLILCELCPRAYHTDCIHP----IMHKVPRGKWYC 2034
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 474 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 533
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 534 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 580
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P S
Sbjct: 581 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 617
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 33/50 (66%), Gaps = 3/50 (6%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
+DD C VC DG L+ C+GCP AFH ACLDP L IP W+C CR GH
Sbjct: 259 NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCR-GH 307
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 336 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 395
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 396 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 442
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 443 PDIPNGEWLC 452
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 412 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 455
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 377 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 436
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 437 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDTCISSYHIHCLNP----P 483
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P S
Sbjct: 484 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 520
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDDDY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 433 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 492
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 493 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 539
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 540 PDIPNGEWLC 549
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 509 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 552
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDDDY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 339 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 398
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 399 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 445
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 446 PDIPNGEWLC 455
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 415 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 458
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 396
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 397 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 443
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 444 PDIPNGEWLC 453
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 337 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 396
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 397 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 443
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 444 PDIPNGEWLC 453
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 413 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 398 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 457
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 458 EEDGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 504
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 505 LPDIPNGEWLC 515
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 475 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 518
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 59/149 (39%), Gaps = 30/149 (20%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP C + G
Sbjct: 400 DFCEVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWSCPTC------------EADG 447
Query: 567 GLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G+ + C IC+ D ++ CD C +H CL L +IP
Sbjct: 448 GVAEDDDDEHQEFCRICK---------DGGELLCCDMCPSAYHTFCL----TPPLDDIPD 494
Query: 625 DKWFCCD-DCNRIHAALQDFVSNRAQTIP 652
W C C + +Q ++ R P
Sbjct: 495 GDWRCPRCSCPPLQYKVQKILTWRWTDKP 523
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 447 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 506
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 507 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 553
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 554 PDIPNGEWLC 563
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 523 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 566
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 437
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 438 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 484
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 485 LPDIPNGEWLC 495
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 397 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 456
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 457 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----PL 503
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 504 PDIPNGEWLC 513
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 473 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 516
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 52/124 (41%), Gaps = 32/124 (25%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGG 567
CH + LLLC+GC +H C P + IP+ W C CR + +
Sbjct: 1613 FCHSGDKEDQLLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQRN-------- 1664
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTV-IYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C++C ++T+ + CDQC K +H+ CL+ L ++P+ K
Sbjct: 1665 ----------CIVC-------GKPGNKTISVLCDQCPKAYHIECLQP----PLAKVPRGK 1703
Query: 627 WFCC 630
W C
Sbjct: 1704 WLCV 1707
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 436
>gi|358334996|dbj|GAA53428.1| histone-lysine N-methyltransferase MLL3, partial [Clonorchis
sinensis]
Length = 3518
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 52/133 (39%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES----GWRCPNCRQGHSSSMSRSVDLK 565
+C GD LL C GC +H +CL+P L P GW+C C+
Sbjct: 43 LCECLGDISELLFCTGCGSHYHGSCLEPSLQPNPTIRIGWQCAECK-------------- 88
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C+IC D+ ++ CD C+K FH CLR C IP++
Sbjct: 89 -----------ACLICNESK------DENKMLVCDVCDKGFHTYCLRPPVSC----IPRN 127
Query: 626 KWFC-----CDDC 633
+ C C DC
Sbjct: 128 GFKCERCRVCSDC 140
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 62/150 (41%), Gaps = 34/150 (22%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCP------NCRQGHSSSM 558
+C CG N LLLC+ C +++H CLDP L +P+ GW+C NC Q
Sbjct: 559 VCEGCGGTTNESLLLLCDDCDISYHTYCLDPPLQEVPKGGWKCSECVVCTNCGQ------ 612
Query: 559 SRSVDLKGGLE------APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC-- 610
R L G AP A + C +C + A + +I C C + H GC
Sbjct: 613 -RDPGLNGKWHANYSMCAPCASLATCPVC-----TLAYREGELLIRCALCSRWSHAGCDQ 666
Query: 611 LRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
LR D E+ D + C C A +
Sbjct: 667 LRTE---DELELATDMGYNCLLCREAGAEM 693
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 61/163 (37%), Gaps = 48/163 (29%)
Query: 509 DMCHVCGD---GENLLLCNGCPLAFHAACLDPLLIP----ESGWRCPNCRQGHSSSMSRS 561
DMC CG LL C C +H C D I E GWRC
Sbjct: 509 DMCVACGSFGLDTVLLACAQCGQCYHPFCADVPKITRTMLEKGWRC-------------- 554
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
+D C +C + T ++ ++ CD C+ +H CL L+E
Sbjct: 555 LD--------------CTVC---EGCGGTTNESLLLLCDDCDISYHTYCLDP----PLQE 593
Query: 622 IPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C +C + L + +N + P +SL+T
Sbjct: 594 VPKGGWKCSECVVCTNCGQRDPGLNGKWHANYSMCAPCASLAT 636
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 436
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 531 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 590
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 591 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 637
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 638 LPDIPNGEWLC 648
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 608 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 651
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 355 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 414
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 415 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 461
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 462 LPDIPNGEWLC 472
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 432 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 475
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 24/161 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 395 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 454
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 455 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 501
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASSLST 658
L +IP +W C C + +Q + R P ++ ++
Sbjct: 502 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAAPAS 542
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 386 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 445
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 446 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 492
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 493 LPDIPNGEWLC 503
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 463 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 506
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 64/158 (40%), Gaps = 24/158 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEDEEY 434
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 435 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 481
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPASS 655
L +IP +W C C + +Q + R P ++
Sbjct: 482 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVAA 519
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 56/138 (40%), Gaps = 12/138 (8%)
Query: 442 RFGCKQGNGIVCDCCNKEI-SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQ 500
RF G C E +P +FE +G R P T ++ +S G
Sbjct: 178 RFPLGNGGSKKCIQVGGEFYTPGKFEEPSGKNKTRSPKPPTRTKATQGRAELQLSQQSGG 237
Query: 501 RRTTG---------GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
T +DD C VC DG L+ C+GCP AFH ACL+P L IP WRC
Sbjct: 238 PATPAQPPELHLHQKNDDECAVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWRCGC 297
Query: 550 CRQGHSSSMSRSVDLKGG 567
C G R D + G
Sbjct: 298 CIVGKVHQDGRHGDQRDG 315
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 437
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 438 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 484
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 485 LPDIPNGEWLC 495
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 406
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 407 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 453
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 454 LPDIPNGEWLC 464
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 424 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 62/140 (44%), Gaps = 28/140 (20%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHS 555
+C CGD + LLLC+ C +++H CLDP L +P+ W+C C G
Sbjct: 1071 VCEACGDASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKWCVWCVQCGSTSPGVH 1130
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
S R+ L G P + C C+ A +D ++ C QC++ H C G
Sbjct: 1131 SDWQRNYSLCG----PCCSLSRCPACQQ-----AYAEDDLILQCQQCDRWVHATC---QG 1178
Query: 616 LC--DLKEIPKDKWFCCDDC 633
LC D E+ D+ F C C
Sbjct: 1179 LCTEDEVEVAADEGFDCSLC 1198
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 69/176 (39%), Gaps = 52/176 (29%)
Query: 500 QRRTTGGSD-DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES----GWRCPNCRQGH 554
Q TG S +C GD LL+C+ C +H +CLDP + P GW+CP CR
Sbjct: 455 QHARTGSSQCRLCAGSGDSGGLLMCSCCGSCYHGSCLDPPVTPSPLSRVGWQCPQCR--- 511
Query: 555 SSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ RS L+G D V+ C +C+K +H CL
Sbjct: 512 ---VCRSCSLQG--------------------------DSGVLLCARCDKAYHAHCLTP- 541
Query: 615 GLCDLKEIPKDKWFC----CDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEK 666
L + P W C C D +AA + ++ PAS S ++EK
Sbjct: 542 ---PLDDAPHAAWTCKAETCPDLVLTNAA-------QLKSGPASLQSAAANDNVEK 587
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 432 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCIQGTTA 483
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 376 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 435
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 436 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 482
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 483 LPDIPNGEWLC 493
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 453 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 496
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 347 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 406
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 407 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 453
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 454 LPDIPNGEWLC 464
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 424 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 212 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 271
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 272 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 318
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 319 LPDIPNGEWLC 329
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 289 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 438 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 497
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 498 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 544
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 545 LPDIPNGEWLC 555
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 515 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 558
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 374 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 433
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 434 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 480
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 481 LPDIPNGEWLC 491
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 451 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 494
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 439 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 498
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 499 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 545
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 546 LPDIPNGEWLC 556
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 516 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 559
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
++D C C DG LL C+GCP AFH ACL P L IP WRC NC QG ++
Sbjct: 303 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTA 354
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 425 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 484
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 485 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 531
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 532 LPDIPNGEWLC 542
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 502 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 545
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 50/119 (42%), Gaps = 28/119 (23%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEA 570
C VCG G LL C+GCP AFH C+ IPE+ W C C ++ G +
Sbjct: 1195 CSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC----------NLQTCGACKK 1244
Query: 571 PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
+ VIC S D + C++ FH+ C + L +P D W+C
Sbjct: 1245 NRIRLDSHVICGSEDGTKG------------CDRVFHLKCAK------LDAVPADDWYC 1285
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACL--DPLLIPESGWRCPNC 550
C++C DG LL C+ CP AFH CL +IP+S W C C
Sbjct: 161 CNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMC 202
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
C++C +G L+ C+GCP FH +C+ +P C C
Sbjct: 1062 CYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 425
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 426 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 472
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 473 LPDIPNGEWLC 483
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 443 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 486
>gi|281346162|gb|EFB21746.1| hypothetical protein PANDA_013140 [Ailuropoda melanoleuca]
Length = 1814
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 75/209 (35%), Gaps = 26/209 (12%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 215 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDDDEE 274
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ G E + C +C+ D ++ CD C +H+ C L
Sbjct: 275 EEGGCEEEEDDHMEFCRVCK---------DGGELLCCDACPSSYHLHCXTPTPSPALNL- 324
Query: 623 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMND------ 676
C C + +Q + R PA + + +E G+ +
Sbjct: 325 ---GTLLCPQCPPLKGKVQRILHWRWTEPPAPFMVGLPGSDVEPGVPPPKPLEGIPEREF 381
Query: 677 -VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
V+W L C KE L T ++R
Sbjct: 382 FVKWAGLSYWHCSWVKELQLELYHTVMYR 410
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 486
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 487 LPDIPNGEWLC 497
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 457 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 391 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEDY 450
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 451 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 497
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 498 LPDIPNGEWLC 508
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 468 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 511
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 65/160 (40%), Gaps = 30/160 (18%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVD 563
G D C C G L+LC+ CPL+FH C+DP L +P W C C
Sbjct: 999 GHSDRCARCRRGGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLC------------- 1045
Query: 564 LKGGLEAPGAEVGGCVI-CRSH-DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
LEA + + GC SH D A + +I CD C FH+ C L +
Sbjct: 1046 ---VLEAESSPLEGCSDGTDSHCDVCARCYKHGQLILCDVCPLAFHLRCTDP----PLLK 1098
Query: 622 IPKDKW---FCCDDCNRIHAALQDFVSNRAQTIPASSLST 658
+P KW C DC + + FV NR S ST
Sbjct: 1099 VPSGKWTCQICVKDCQPVSSG--GFV-NRGPAKSKSGKST 1135
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 51/144 (35%), Gaps = 36/144 (25%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------------- 550
D C CG L+LC CP A+H C +P L IP W C C
Sbjct: 912 DECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVCTDKSQKKPTGIKFKGK 971
Query: 551 -RQGHSSSMSRSVDLKGGLEAPGAEVGG----CVICRSHDFSAATFDDRTVIYCDQCEKE 605
R+G + S L LE G G C CR +I CD C
Sbjct: 972 HRKGLLPTSSSPSSLSSDLETLGIVADGHSDRCARCRRGG---------ELILCDSCPLS 1022
Query: 606 FHVGCLRKNGLCDLKEIPKDKWFC 629
FH+ C+ L +P D W C
Sbjct: 1023 FHLDCVDP----PLLGVPPDIWLC 1042
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD E +LLC+GC H CL P + IP W C +CR
Sbjct: 778 CRICRRKGDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDCR 823
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 56/127 (44%), Gaps = 29/127 (22%)
Query: 517 GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAE 574
G ++LC+ CP A+H CLDP + PE W CP+C + ++ + +G E E
Sbjct: 290 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEG--EEILEE 347
Query: 575 VGG------------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
VGG C +C+ D ++ CD C +H+ CL L EI
Sbjct: 348 VGGDPEEEDDHHMEFCRVCK---------DGGELLCCDTCPSSYHIHCLNP----PLPEI 394
Query: 623 PKDKWFC 629
P +W C
Sbjct: 395 PNGEWLC 401
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 361 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 404
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 45/99 (45%), Gaps = 11/99 (11%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA----ISLAMGQRRTTGGSDDMCHVCGD 516
+P +FE G + R P + + S GQ+ +DD C VC D
Sbjct: 236 TPGKFEDAGGKSKNRSPKPPVRPKASQVPQHVRSLSRFSSLWGQK-----NDDECAVCRD 290
Query: 517 GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G L+ C+GCP AFH CL P L IP WRC C +G
Sbjct: 291 GGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRCVEG 329
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 55/131 (41%), Gaps = 23/131 (17%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 429
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 430 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 476
Query: 619 LKEIPKDKWFC 629
L +IP +W C
Sbjct: 477 LPDIPNGEWLC 487
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 447 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|255646689|gb|ACU23818.1| unknown [Glycine max]
Length = 296
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 7/86 (8%)
Query: 763 LPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGF---RKMS 819
+PL+ATC +Y+ +G + L S IE++L S VE LV+ A WTK FGF +
Sbjct: 1 MPLIATCSQYRRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIE 60
Query: 820 RERLLKYQRDFQLTIFKGTSMLEKKV 845
R+RL K L +F GT +L K +
Sbjct: 61 RQRLNK----INLMVFPGTVLLVKSL 82
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like
[Oreochromis niloticus]
Length = 4907
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 53/130 (40%), Gaps = 40/130 (30%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMS 559
G + C VC G+ +LL C GC L +HAACL+ P +GW+CP C+
Sbjct: 212 AGEESWCAVCDSAGELTDLLFCTGCGLHYHAACLEIGATPIQRAGWQCPECKV------- 264
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
C CR +D ++ CD C+K +H CL+ +
Sbjct: 265 ------------------CQTCRQ------PGEDSKMLVCDACDKGYHTFCLQPA----M 296
Query: 620 KEIPKDKWFC 629
+P D W C
Sbjct: 297 DSLPSDPWKC 306
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 62/136 (45%), Gaps = 20/136 (14%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q S+S
Sbjct: 897 VCEVCGKASDPSRLLLCDDCDVSYHTYCLDPPLHTVPKGGWKCKWCVCCVQCGSNSPGFH 956
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C +CR + ++ ++ C C++ H C L
Sbjct: 957 CEWQNNYTHCGPCASLVTCPVCREN-----FMEEELLLQCQYCDRWVHAVC---ESLYTE 1008
Query: 620 KEIPK--DKWFCCDDC 633
E+ + D+ F C C
Sbjct: 1009 DEVEQASDEGFACTSC 1024
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GEN LLLC+GC +H C P + IP+ W C C ++++ + +
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYEC-------VNKATNER 245
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C H S +IYCD C + +H C + L ++P+
Sbjct: 246 -----------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----IPPLLKVPRG 286
Query: 626 KWFC 629
KW+C
Sbjct: 287 KWYC 290
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 47/123 (38%), Gaps = 26/123 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCESEGGQEQEED----- 429
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL + ++P
Sbjct: 430 ------EHQEFCRVCK---------DGGELLCCDSCPAAYHTFCLSP----PITDVPDGD 470
Query: 627 WFC 629
W C
Sbjct: 471 WKC 473
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C VC DG LL C+ CP A+H CL P + +P+ W+CP C
Sbjct: 434 FCRVCKDGGELLCCDSCPAAYHTFCLSPPITDVPDGDWKCPRC 476
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG S
Sbjct: 307 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQGRS 357
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 53/133 (39%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 381 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCKI------------- 427
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR DD ++ CD C+K +H CL+ + IPK+
Sbjct: 428 ------------CQTCRQ------PGDDNKMLVCDTCDKGYHTFCLKPAMIT----IPKN 465
Query: 626 KWFC-----CDDC 633
W C C DC
Sbjct: 466 GWKCKTCRVCTDC 478
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 22/137 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C S + +
Sbjct: 817 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGEN 876
Query: 562 VDLKGGLE--APGAEVGGCVIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC- 617
D P A + C C RS+ D+ + C C++ H C N L
Sbjct: 877 ADWMNNYTQCGPCASMTHCAYCYRSYR------DNELLCQCSHCQRWEHALC---NSLYT 927
Query: 618 -DLKEIPKDKWFCCDDC 633
D E DK F C C
Sbjct: 928 EDETERAMDKGFICTLC 944
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 53/133 (39%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 398 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCKI------------- 444
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR DD ++ CD C+K +H CL+ + IPK+
Sbjct: 445 ------------CQTCRQ------PGDDNKMLVCDTCDKGYHTFCLKPAMIT----IPKN 482
Query: 626 KWFC-----CDDC 633
W C C DC
Sbjct: 483 GWKCKTCRVCTDC 495
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 22/137 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C S + +
Sbjct: 834 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGEN 893
Query: 562 VDLKGGLE--APGAEVGGCVIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC- 617
D P A + C C RS+ D+ + C C++ H C N L
Sbjct: 894 ADWMNNYTQCGPCASMTHCAYCYRSYR------DNELLCQCSHCQRWEHALC---NSLYT 944
Query: 618 -DLKEIPKDKWFCCDDC 633
D E DK F C C
Sbjct: 945 EDETERAMDKGFICTLC 961
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 53/133 (39%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C + G+ L C C +H +CLDP + + +GW+CPNC+
Sbjct: 382 ICDLPGNLSESLFCTSCGQHYHGSCLDPPVSIDPVVRAGWQCPNCKI------------- 428
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR DD ++ CD C+K +H CL+ + IPK+
Sbjct: 429 ------------CQTCRQ------PGDDNKMLVCDTCDKGYHTFCLKPAMIT----IPKN 466
Query: 626 KWFC-----CDDC 633
W C C DC
Sbjct: 467 GWKCKTCRVCTDC 479
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 58/137 (42%), Gaps = 22/137 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C S + +
Sbjct: 837 VCEGCGKSSDEARLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCTHCGSVTPGEN 896
Query: 562 VDLKGGLE--APGAEVGGCVIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC- 617
D P A + C C RS+ D+ + C C++ H C N L
Sbjct: 897 ADWMNNYTQCGPCASMTHCAYCYRSYR------DNELLCQCSHCQRWEHALC---NSLYT 947
Query: 618 -DLKEIPKDKWFCCDDC 633
D E DK F C C
Sbjct: 948 EDETERAMDKGFICTLC 964
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 13/130 (10%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR----QGHSSSMSRSV 562
D C VC G ++LC+ CP A+H CL+P L PE W CP+C Q +
Sbjct: 388 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCEKEGIQWEAKEEDFEE 447
Query: 563 DLKGGLEAPGAEVGGCVICRSHD---FSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + E G D F D ++ CD C +H+ CL L
Sbjct: 448 FEEECDDVRDVESGLGGEEEEDDHMEFCRVCKDGGELLCCDSCPSSYHIHCLNP----PL 503
Query: 620 KEIPKDKWFC 629
EIP +W C
Sbjct: 504 PEIPNGEWLC 513
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP ++H CL+P L IP W CP C
Sbjct: 473 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWLCPRC 516
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 11/107 (10%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNCRQGHSS 556
G T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C ++
Sbjct: 413 GSDDTDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPECLMQRNN 472
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCE 603
L G E G + G + F A ++ V+ D C+
Sbjct: 473 GSRNMAKLGRGAEVLGIDPHGRLY-----FGACSY----VLVVDSCD 510
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 332 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQGRA 382
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 23/154 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CL+P L P+ W CP+C +
Sbjct: 402 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELERAPQGKWSCPHCEKEGVQWEAKELEEEE 461
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ E + C +C+ D ++ CD C +H+ CL L
Sbjct: 462 MEEPKRERREEEDDHMEFCRVCK---------DGGELLCCDACVSSYHIHCLNP----PL 508
Query: 620 KEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIP 652
+IP +W C C ++ +Q + R P
Sbjct: 509 PDIPHGEWLCPRCTCPQLKGKVQKILHWRWGVPP 542
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQGRA 343
>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
Short=CHD-3
gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
Length = 1787
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 21/121 (17%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C VC L+LC+ C A+H AC+D + PE W CP+C + H + L
Sbjct: 268 CEVCNQDGELMLCDTCTRAYHVACIDENMEQPPEGDWSCPHCEE-HGPDV-----LIVEE 321
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
E A + C IC+ + ++ CD C +H C+ L EIP+ +W
Sbjct: 322 EPAKANMDYCRICK---------ETSNILLCDTCPSSYHAYCIDP----PLTEIPEGEWS 368
Query: 629 C 629
C
Sbjct: 369 C 369
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 29/44 (65%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C + N+LLC+ CP ++HA C+DP L IPE W CP C
Sbjct: 329 DYCRICKETSNILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPRC 372
>gi|281352213|gb|EFB27797.1| hypothetical protein PANDA_013886 [Ailuropoda melanoleuca]
Length = 640
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 380 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 439
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 440 EEEGEDEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNPP---- 486
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P S
Sbjct: 487 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVS 523
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSCLQG 347
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 375 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 434
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 435 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNPA----LDTIPDGD 472
Query: 627 WFC 629
W C
Sbjct: 473 WRC 475
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 435 EFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 233 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 292
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 293 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNPA----LDTIPDGD 330
Query: 627 WFC 629
W C
Sbjct: 331 WRC 333
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 293 EFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 59.7 bits (143), Expect = 7e-06, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 146
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 429 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 466
Query: 627 WFC 629
W C
Sbjct: 467 WRC 469
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 438 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 475
Query: 627 WFC 629
W C
Sbjct: 476 WRC 478
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 439 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 476
Query: 627 WFC 629
W C
Sbjct: 477 WRC 479
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 428
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 429 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 466
Query: 627 WFC 629
W C
Sbjct: 467 WRC 469
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 429 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 438 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 475
Query: 627 WFC 629
W C
Sbjct: 476 WRC 478
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 438 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 475
Query: 627 WFC 629
W C
Sbjct: 476 WRC 478
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 365 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 424
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 425 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 462
Query: 627 WFC 629
W C
Sbjct: 463 WRC 465
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 425 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHS 555
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG +
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSCLQGRA 349
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 28/119 (23%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGGLEA 570
C VCG G LL C+GCP AFH C+ IP++ W C C ++ G +
Sbjct: 1242 CSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC----------NLQTCGACKK 1291
Query: 571 PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
+ VIC S D + C++ FH+ C + L +P D W+C
Sbjct: 1292 NKIRLDSHVICGSEDGTKG------------CDRVFHLKCAK------LDAVPADDWYC 1332
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 62/173 (35%), Gaps = 56/173 (32%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC-------------------- 550
C++C +G LL C+GCP FH +C+ IP C C
Sbjct: 1117 CYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVKPVFPVNGAKKNGKAA 1176
Query: 551 ----RQGHSSSMSRSVDLKGGL----------EAPGAEVGGCVICRSHDFSA----ATFD 592
R+ +S + R + L E GAE V S A A D
Sbjct: 1177 SKRPRRSNSPTSRRRPRKQAKLGKAKSDDSESEDSGAESDSAVSTTSSARPASRPKAPED 1236
Query: 593 DRTV-----------IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 634
V + CD C + FHV C+ L EIP +WF C++CN
Sbjct: 1237 QWDVDCSVCGLGGELLCCDGCPRAFHVNCI------GLAEIPDTEWF-CNECN 1282
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACL--DPLLIPESGWRCPNC 550
C++C DG LL C+ CP AFH +CL +IP+S W C C
Sbjct: 161 CNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMC 202
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 363 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 422
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 423 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 460
Query: 627 WFC 629
W C
Sbjct: 461 WRC 463
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 423 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 384 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 443
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 444 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 481
Query: 627 WFC 629
W C
Sbjct: 482 WRC 484
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 444 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 487
>gi|268557732|ref|XP_002636856.1| C. briggsae CBR-LET-418 protein [Caenorhabditis briggsae]
Length = 1849
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 3/50 (6%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IP-ESGWRCPNC 550
T +DD C +C + ENLLLC+ C AFHA C+DP L +P E W CP C
Sbjct: 309 TKQNDDFCKICKETENLLLCDSCVCAFHAYCIDPPLTQVPQEETWACPRC 358
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 56/127 (44%), Gaps = 22/127 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
++D C C G L+LC+ CP A+H C+D PE W CP+C + H + + K
Sbjct: 254 NNDFCETCKVGGELVLCDTCPRAYHTGCMDE-DPPEGDWSCPHCIE-HGPEVIKEEPTKQ 311
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ C IC+ + ++ CD C FH C+ L ++P+++
Sbjct: 312 NDDF-------CKICKETE---------NLLLCDSCVCAFHAYCIDP----PLTQVPQEE 351
Query: 627 WFCCDDC 633
+ C C
Sbjct: 352 TWACPRC 358
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 55/138 (39%), Gaps = 45/138 (32%)
Query: 506 GSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSR 560
G + C VC G+ +LL C GC +HAACL+ P +GW+CP C+
Sbjct: 210 GEEACCAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKV-------- 261
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
C CR +D ++ CD C+K +H CL+ +
Sbjct: 262 -----------------CQTCRK------PGEDSKMLVCDACDKGYHTFCLQPA----MD 294
Query: 621 EIPKDKWFC-----CDDC 633
+P D W C C DC
Sbjct: 295 SLPTDPWKCKRCRVCTDC 312
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 58/132 (43%), Gaps = 13/132 (9%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRSVD 563
+MC D LLLC+ C +++H CLDP L +P+ GW+C C Q S++ +
Sbjct: 740 EMCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTPGFHCE 799
Query: 564 LKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
+ P A + C +CR + ++ ++ C C++ H C D E
Sbjct: 800 WQNNYTHCGPCASLVTCPVCREN-----FMEEELLLQCQYCDRWVHAVC-ESLYTEDEVE 853
Query: 622 IPKDKWFCCDDC 633
D+ F C C
Sbjct: 854 QASDEGFACTYC 865
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 373 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 433 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 470
Query: 627 WFC 629
W C
Sbjct: 471 WRC 473
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 373 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 432
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 433 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 470
Query: 627 WFC 629
W C
Sbjct: 471 WRC 473
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 433 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 18/135 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 992 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1051
Query: 562 VDLKGGLE--APGAEVGGCVIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ + AP A + C IC R++ D+ +I C QC++ H C N +
Sbjct: 1052 CEWQNNYTQCAPCASLSTCPICYRTYR------DEELIIQCRQCDRWMHAICQNLNTEEE 1105
Query: 619 LKEIPKDKWFCCDDC 633
++ I D F C C
Sbjct: 1106 VENI-ADMGFDCTIC 1119
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 64/154 (41%), Gaps = 30/154 (19%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + + + + G C VC G+ +L+C+ C +H CL
Sbjct: 362 TTCGQHYHGMCLDIQVTPLKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 421
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS----HDFSAAT 590
P++ +P +GW+C NCR V + G C++C S D +
Sbjct: 422 PVMDSVPTNGWKCKNCR----------VCAECGTRTSCQWHHNCLVCDSCYQQQDNLSCP 471
Query: 591 FDD--------RTVIYCDQCEKEFHVGCLRKNGL 616
F D + +++C C++ H+ C R G+
Sbjct: 472 FCDKLCLQDFQKDMLHCHMCKRWIHMECDRSPGI 505
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 53/133 (39%), Gaps = 45/133 (33%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC GD + L C C +H CLD + P +GW+CP+C+
Sbjct: 346 CAVCDSPGDLLDQLFCTTCGQHYHGMCLDIQVTPLKRAGWQCPDCK-------------- 391
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
+C++ S +D ++ CD C+K +H CL+ + +P +
Sbjct: 392 --------------VCQNCKHSG---EDNKMLVCDTCDKGYHTFCLQPV----MDSVPTN 430
Query: 626 KWFC-----CDDC 633
W C C +C
Sbjct: 431 GWKCKNCRVCAEC 443
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 941 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 987
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 988 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1025
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C L+ ++ +N Q P +SLST
Sbjct: 1026 TVPKGGWKCKWCVWCRHCGATSPGLRCEWQNNYTQCAPCASLST 1069
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 65/134 (48%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C +C Q +++
Sbjct: 1041 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTPGLR 1100
Query: 562 VDLKGG--LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + L AP A + C IC D+S T ++ C QC++ FH C + D+
Sbjct: 1101 CEWQNNYTLCAPCASLSTCPICLV-DYSEGTI----IVQCRQCDRWFHASCQSLHSEEDI 1155
Query: 620 KEIPKDKWFCCDDC 633
++ D F C C
Sbjct: 1156 EKA-ADSSFDCTMC 1168
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 30/178 (16%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H I + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 360 TSCGLHYHGICLDMAVTPLRRAGWQCPECKVCQTCKNPGEDTKMLVCDMCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR-----------QGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS 583
P++ +P +GWRC NCR Q H +S+ ++ A + C++
Sbjct: 420 PVIDTLPTNGWRCQNCRVCLQCGTRTGGQWHHTSLLCENCVQNQDPALCCPMCSCILDPE 479
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQ 641
H + +++C C++ H+ C R+N EI + + C +C R AA Q
Sbjct: 480 H--------HKDLVFCHTCKRWLHLECERQN--SGQAEIHPREDYVCSNC-RSPAAEQ 526
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 48/122 (39%), Gaps = 37/122 (30%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C L +H CLD + P +GW+CP C+
Sbjct: 346 LCDSPGDLLDQLFCTSCGLHYHGICLDMAVTPLRRAGWQCPECKV--------------- 390
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
C C++ +D ++ CD C+K +H CL+ + +P + W
Sbjct: 391 ----------CQTCKN------PGEDTKMLVCDMCDKGYHTFCLQPV----IDTLPTNGW 430
Query: 628 FC 629
C
Sbjct: 431 RC 432
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 61/165 (36%), Gaps = 51/165 (30%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMS 559
DMC VCG +G LL C C +H C+ ++ GWRC
Sbjct: 990 DMCVVCGSFGLGAEGR-LLACAQCGQCYHPFCVGIKITKVVLSKGWRC------------ 1036
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
LE C +C + D ++ CD C+ +H CL L
Sbjct: 1037 --------LE--------CTVCEA---CGQATDPGRLLLCDDCDISYHTYCLDP----PL 1073
Query: 620 KEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+ +PKD W C C C L+ ++ +N P +SLST
Sbjct: 1074 QNVPKDSWKCKWCVSCTQCGATTPGLRCEWQNNYTLCAPCASLST 1118
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 54/130 (41%), Gaps = 22/130 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-------GHSSSMS 559
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 392 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 451
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+G E + C +C+ D ++ CD C +H+ CL L
Sbjct: 452 EEEGEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNPP----L 498
Query: 620 KEIPKDKWFC 629
+IP +W C
Sbjct: 499 PDIPNGEWLC 508
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C ++H CL+P L IP W CP C
Sbjct: 468 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 511
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1486 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1545
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1546 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1600
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1601 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1646
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1435 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1481
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1482 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1516
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1517 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1544
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+DD C VC DG L+ C+GCP AFH CL+P L IP W+C CR
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERCR 202
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1414 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1473
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1474 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1528
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1529 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1574
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1363 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1409
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1410 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1444
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1445 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1472
Score = 39.7 bits (91), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|146421865|ref|XP_001486876.1| hypothetical protein PGUG_00253 [Meyerozyma guilliermondii ATCC
6260]
Length = 709
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 60/144 (41%), Gaps = 27/144 (18%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
TKKR H L P G +T ++G+RL G K+ K ISP + A
Sbjct: 197 TKKRRKRTHNKLETPKQPTRGSSVTPALEGRRLIRGSKRI---------KVISPKKHAA- 246
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
+A +SNG I+ A + TG +DD C CG + C+ CP
Sbjct: 247 TNLAP---------SSNG------TITAAASESEQTGDNDDFCSTCGGSGVFICCDSCPK 291
Query: 529 AFHAACLDPLL--IPESGWRCPNC 550
+FH C DP + PE W C C
Sbjct: 292 SFHFLCCDPPIEECPEDNWNCREC 315
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 438
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 439 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNP----PLDTIPDGD 476
Query: 627 WFC 629
W C
Sbjct: 477 WRC 479
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 439 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 58/130 (44%), Gaps = 24/130 (18%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G L+LC+ CP A+H C+D + PE W CP+C + H + +
Sbjct: 254 NNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCME-HGPEIVK---- 308
Query: 565 KGGLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
E P + C IC+ + ++ CD C FH C+ L +L P
Sbjct: 309 ----EEPAKQNDDFCKICK---------ETENLLLCDNCTCSFHAYCM-DPPLLELP--P 352
Query: 624 KDKWFCCDDC 633
+D+ + C C
Sbjct: 353 QDESWACPRC 362
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 18/135 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 962 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1021
Query: 562 VDLKGGLE--APGAEVGGCVIC-RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ + AP A + C IC R++ D+ +I C QC++ H C N +
Sbjct: 1022 CEWQNNYTQCAPCASLSTCPICYRTYR------DEELIIQCRQCDRWMHAICQNLNTEEE 1075
Query: 619 LKEIPKDKWFCCDDC 633
++ I D F C C
Sbjct: 1076 VENI-ADMGFDCTIC 1089
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 53/133 (39%), Gaps = 45/133 (33%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC GD + L C C +H CLD + P +GW+CP+C+
Sbjct: 314 CAVCDSPGDLLDQLFCTTCGQHYHGMCLDIQVTPLKRAGWQCPDCK-------------- 359
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
+C++ S +D ++ CD C+K +H CL+ + +P +
Sbjct: 360 --------------VCQNCKHSG---EDNKMLVCDTCDKGYHTFCLQPV----MDAVPTN 398
Query: 626 KWFC-----CDDC 633
W C C +C
Sbjct: 399 GWKCKNCRVCAEC 411
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 63/153 (41%), Gaps = 30/153 (19%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + + + + G C VC G+ +L+C+ C +H CL
Sbjct: 330 TTCGQHYHGMCLDIQVTPLKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 389
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS----HDFSAAT 590
P++ +P +GW+C NCR V + G C++C S D +
Sbjct: 390 PVMDAVPTNGWKCKNCR----------VCAECGTRTSCQWHHNCLVCDSCYQQQDNLSCP 439
Query: 591 FDD--------RTVIYCDQCEKEFHVGCLRKNG 615
F + + +++C C++ H+ C R G
Sbjct: 440 FCEKLCLQDFQKDMLHCHMCKRWIHIDCDRSPG 472
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 911 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 957
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 958 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 995
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C L+ ++ +N Q P +SLST
Sbjct: 996 TVPKGGWKCKWCVWCRHCGATSPGLRCEWQNNYTQCAPCASLST 1039
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 58/130 (44%), Gaps = 24/130 (18%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G L+LC+ CP A+H C+D + PE W CP+C + H + +
Sbjct: 254 NNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCME-HGPEIVK---- 308
Query: 565 KGGLEAPGAEVGG-CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIP 623
E P + C IC+ + ++ CD C FH C+ L +L P
Sbjct: 309 ----EEPAKQNDDFCKICKETE---------NLLLCDNCTCSFHAYCM-DPPLLELP--P 352
Query: 624 KDKWFCCDDC 633
+D+ + C C
Sbjct: 353 QDESWACPRC 362
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1392 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1451
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1452 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1506
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1507 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1552
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1341 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1387
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1388 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1422
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1423 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1450
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 55/138 (39%), Gaps = 45/138 (32%)
Query: 506 GSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSR 560
G + C VC G+ +LL C GC +HAACL+ P +GW+CP C+
Sbjct: 81 GEEACCAVCDSAGELSDLLFCTGCGQHYHAACLEIGATPIQRAGWQCPECKV-------- 132
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
C CR +D ++ CD C+K +H CL+ +
Sbjct: 133 -----------------CQTCRK------PGEDSKMLVCDACDKGYHTFCLQPA----MD 165
Query: 621 EIPKDKWFC-----CDDC 633
+P D W C C DC
Sbjct: 166 SLPTDPWKCKRCRVCTDC 183
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 12/109 (11%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRSVD 563
+MC D LLLC+ C +++H CLDP L +P+ GW+C C Q S++ +
Sbjct: 470 EMCGKASDPSRLLLCDDCDVSYHTYCLDPPLHNVPKGGWKCKWCVCCVQCGSNTPGFHCE 529
Query: 564 LKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ P A + C +CR + ++ ++ C C++ H C
Sbjct: 530 WQNNYTHCGPCASLVTCPVCREN-----FMEEELLLQCQYCDRWVHAVC 573
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH-----APYMEEDLLIQCRHCERWMHAGCESLFTEDDV 1543
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1544 EQA-ADEGFDCVSC-------QPYVVKPAVPVAPPELVPMKVKEPEPQYFRFEG 1589
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1378 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1424
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1425 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1459
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1460 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1487
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1268
Query: 620 KEIPKDKWFCCDDCN 634
P D+ F C C
Sbjct: 1269 DHAP-DEGFDCVSCQ 1282
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1103 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1149
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1150 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1184
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1185 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1212
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 62/135 (45%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 849 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 908
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 909 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 963
Query: 620 KEIPKDKWFCCDDCN 634
P D+ F C C
Sbjct: 964 DHAP-DEGFDCVSCQ 977
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 798 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 844
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 845 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 879
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 880 PLLTVPKGGWKCKWCVSCMQCGAASPGF 907
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1008 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1067
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1068 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1122
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1123 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1168
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 957 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1003
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1004 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1038
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1039 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1066
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 867 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 926
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 927 CEWQNSYTHCGPCASLVTCPICH-----APYMEEDLLIQCRHCERWMHAGCESLFTEDDV 981
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 982 EQA-ADEGFDCVSC-------QPYVVKPAVPVAPPELVPMKVKEPEPQYFRFEG 1027
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 816 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 862
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 863 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 897
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 898 PLLTVPKGGWKCKWCVSCMQCGAASPGF 925
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 256 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 315
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 316 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 362
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P +
Sbjct: 363 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVA 399
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 59/153 (38%), Gaps = 35/153 (22%)
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
M R + + +C + E +LLC+GC +H CL P L IP+ W C C
Sbjct: 967 VMWMRSVSNAACRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQ 1026
Query: 555 SSSMSRS---VDLKGGLEAPGAEVGG---------------CVICRSHDFSAATFDDRTV 596
S R+ V++ E +V C IC S +
Sbjct: 1027 LSPRKRTKAPVEVSSEEEDDNEKVDEDGDEDEEEEDLNQEVCNICESPG---------EL 1077
Query: 597 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
I CD C K FH+ C+ DLK +P+ W C
Sbjct: 1078 ILCDFCPKSFHLDCI------DLKRLPRGTWKC 1104
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQG 553
++C++C L+LC+ CP +FH C+D +P W+CP C G
Sbjct: 1066 EVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPCVLG 1110
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 63/157 (40%), Gaps = 24/157 (15%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ--------GHSSSM 558
D C VC G ++LC+ CP A+H CLDP L PE W CP+C +
Sbjct: 256 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEEEEY 315
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ +G E + C +C+ D ++ CD C +H+ CL
Sbjct: 316 EEEGEEEGEKEEEDDHMEYCRVCK---------DGGELLCCDACISSYHIHCLNP----P 362
Query: 619 LKEIPKDKWFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
L +IP +W C C + +Q + R P +
Sbjct: 363 LPDIPNGEWLCPRCTCPVLKGRVQKILHWRWGEPPVA 399
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 31/47 (65%), Gaps = 3/47 (6%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IP-ESGWRCPNC 550
+DD C +C + ENLLLC+ C AFHA C+DP L +P E W CP C
Sbjct: 311 NDDFCKICKETENLLLCDTCVCAFHAYCMDPPLTQVPQEETWNCPRC 357
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 57/132 (43%), Gaps = 23/132 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C G LLLC+ CP A+H C+D + PE W CP+C + H + +
Sbjct: 250 NNDFCEECKQGGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHCIE-HGPEIVKE--- 305
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
E C IC+ + ++ CD C FH C+ L ++P+
Sbjct: 306 ----EPQKVNDDFCKICK---------ETENLLLCDTCVCAFHAYCMDP----PLTQVPQ 348
Query: 625 DKWFCCDDCNRI 636
++ + C C +
Sbjct: 349 EETWNCPRCELV 360
>gi|194874449|ref|XP_001973400.1| GG13363 [Drosophila erecta]
gi|190655183|gb|EDV52426.1| GG13363 [Drosophila erecta]
Length = 711
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 49/123 (39%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L PE W CP+C ++ D
Sbjct: 378 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDDEHQ 437
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C +H CL L IP
Sbjct: 438 EF---------CRVCK---------DGGELLCCDSCPSAYHTFCLNPP----LDTIPDGD 475
Query: 627 WFC 629
W C
Sbjct: 476 WRC 478
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP A+H CL+P L IP+ WRCP C
Sbjct: 438 EFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C QG
Sbjct: 293 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSCLQG 341
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 75/177 (42%), Gaps = 31/177 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H CLDP L +P+ GW+C C S + +
Sbjct: 1120 VCEACGKATDPGRLLLCDDCDISYHTFCLDPPLQTVPKGGWKCKWC-----VSCTNCKAI 1174
Query: 565 KGGLE----------APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
GL AP A + C +C + ++ ++ C QC + H C N
Sbjct: 1175 TPGLRCEWQNNYTQCAPCASLSACPVCCQN-----YIEEELILQCRQCIRWSHASCQNLN 1229
Query: 615 GLCDLKEIPKDKWFCCDDCNRI---HAALQDFVSNRAQTIPASSLSTINRKHIEKGI 668
++ E+ D F C C H ++ + AQ IP + S + + + G+
Sbjct: 1230 TEAEV-ELAADSGFDCAACKPFVVSHEVIEPTID--AQIIPKTRDSDVLKTFTQDGV 1283
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 52/131 (39%), Gaps = 42/131 (32%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C +H CLD + P +GW+CP+C+
Sbjct: 671 LCDSSGDLLDQLFCTTCGQHYHGMCLDIAVTPLKRAGWQCPDCK---------------- 714
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
+C++ S DD ++ CD C+K +H CL+ + +P + W
Sbjct: 715 ------------VCQNCKHSG---DDNQMLVCDTCDKGYHTFCLQPV----MDSVPTNGW 755
Query: 628 FC-----CDDC 633
C C +C
Sbjct: 756 KCKNCRICTEC 766
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC GD +L+C+ C +H CL
Sbjct: 685 TTCGQHYHGMCLDIAVTPLKRAGWQCPDCKVCQNCKHSGDDNQMLVCDTCDKGYHTFCLQ 744
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 745 PVMDSVPTNGWKCKNCR 761
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1390 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1449
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1450 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1504
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q ++ I L I K E F+G
Sbjct: 1505 EQA-ADEGFDCVSC-------QPYIVKPVVPIAPPELVPIKVKEPEPQYFRFEG 1550
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 53/140 (37%), Gaps = 45/140 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CTVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC-----CDDCNRIHAAL 640
W C C C A L
Sbjct: 314 SWKCKACRICRTCGASSAEL 333
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1339 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1385
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1386 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1420
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1421 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1448
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 55/127 (43%), Gaps = 16/127 (12%)
Query: 508 DDMCHVCGDGEN-----LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D +C +C ++ +L C+GCPLA H C IP+ W C C++ ++ +
Sbjct: 288 DPVCEICTKPDSKRPNLILFCDGCPLAVHQKCYSVPKIPDGDWFCKKCQRNRVAAEAARA 347
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+ L E+ C +CR D +I C+ C+ H C D+ +
Sbjct: 348 NENDALNDSDDEI-KCAVCRGLDSKKPN----EIILCENCDYAVHQTC------GDIPKK 396
Query: 623 PKDKWFC 629
P+++W C
Sbjct: 397 PREEWLC 403
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 54/124 (43%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IPE W C C M+++ +
Sbjct: 1791 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHEC-------MNKATGER 1843
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C ++ T +I C+ C + +H C+ + ++P+
Sbjct: 1844 -----------NCIVCGKKSSTSGT----RLILCELCPRAYHTDCIHP----IMHKVPRG 1884
Query: 626 KWFC 629
KW+C
Sbjct: 1885 KWYC 1888
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1420 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1479
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1480 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1534
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V I L + K E F+G
Sbjct: 1535 EQA-ADEGFDCVSC-------QPYVVKPVAPIAPPELVPMKVKEPEPQYFRFEG 1580
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 49/124 (39%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC 629
W C
Sbjct: 314 SWKC 317
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1369 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1415
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1416 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1450
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1451 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1478
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 753 LRIFGREVAELPLVATCREYQGKGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKK 812
L+ G +AE+ + T Y+ +G + LFS +E L +L VE LV+PA +W K
Sbjct: 248 LQFHGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATADLSHVWISK 307
Query: 813 FGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
FGF K + L K R L F G +L+K++
Sbjct: 308 FGF-KYVEDSLKKELRSMNLLAFPGIDVLQKEL 339
>gi|432114496|gb|ELK36344.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
Length = 3462
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1416 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1475
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1476 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1530
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1531 EQA-ADEGFDCVSC-------QPYVIKPAAPVAPPELVPMKVKEPEPQYFRFEG 1576
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 261 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 307
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 308 ------------CQACRK------PGNDSKMLVCETCDKGYHTFCLKP----PMEELPAH 345
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 346 SWKCKACRVCRSCGAGSAELNPN 368
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1365 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1411
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1412 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1446
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1447 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1474
Score = 39.7 bits (91), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 277 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 336
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 337 PPMEELPAHSWKCKACR 353
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 302 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCCLQG 350
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 75/174 (43%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1411 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1470
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC + ++ +I C CE+ H GC D+
Sbjct: 1471 CEWQNSYTHCGPCASLVTCPICHT-----PYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1525
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1526 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1571
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1360 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1406
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1407 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1441
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1442 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1469
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1447 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1506
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1507 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1561
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1562 EQA-ADEGFDCVSCQ 1575
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRK------PGNDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1396 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1442
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1443 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1477
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1478 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1505
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1268
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V I L + K E F+G
Sbjct: 1269 EQA-ADEGFDCVSC-------QPYVVKPVAPIAPPELVPMKVKEPEPQYFRFEG 1314
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 49/124 (39%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC 629
W C
Sbjct: 314 SWKC 317
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1103 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1149
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1150 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1184
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1185 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1212
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 431 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 490
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 491 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 545
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 546 EQA-ADEGFDCVSCQ 559
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 380 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 426
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 427 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 461
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 462 PLLTVPKGGWKCKWCVSCMQCGAASPGF 489
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 56/130 (43%), Gaps = 42/130 (32%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C+ GD NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 370 CYGMGDVSNLVMCSVCGQHYHGSCVGLALLPGVRAGWQCVSCRV---------------- 413
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C +CR + D V+ C++C+K +H GCLR + IPK W
Sbjct: 414 ---------CQVCRQPE------DVSKVMLCERCDKAYHPGCLRP----IVTSIPKYGWK 454
Query: 629 C-----CDDC 633
C C DC
Sbjct: 455 CKCCRVCTDC 464
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 53/115 (46%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C Q G +
Sbjct: 753 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAQCQTCGSNDPGFN 812
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SS +S G P A CV+C+ A + +I C QCE+ H GC
Sbjct: 813 SSWQKSYTQCG----PCASHTACVVCQE-----AYQEGDLIIQCVQCERWLHCGC 858
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 67/184 (36%), Gaps = 61/184 (33%)
Query: 510 MCHVCGDGENL---LLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C VC E++ +LC C A+H CL P++ IP+ GW+C CR V
Sbjct: 413 VCQVCRQPEDVSKVMLCERCDKAYHPGCLRPIVTSIPKYGWKCKCCR----------VCT 462
Query: 565 KGGLEAPGAEVG---------------------GCVICRSHDFSAATFDDRTVIYCDQCE 603
G PGA + C +CR + AA + R ++ C C+
Sbjct: 463 DCGSRTPGAGLSSRWHSHYTVCDSCYQQRNKGFSCPLCRKA-YRAAAY--REMVQCHGCK 519
Query: 604 KEFHVGCLRKNGLCD----------LKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPA 653
K H G CD KE D + C C I V+ R +I
Sbjct: 520 KFVH-------GTCDPEADPLTYQQRKEAKPDYEYVCLHCKSIA-----MVARRKDSIDE 567
Query: 654 SSLS 657
LS
Sbjct: 568 GDLS 571
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1420 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1479
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1480 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1534
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1535 EQA-ADEGFDCVSCQ 1548
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 49/124 (39%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC 629
W C
Sbjct: 314 SWKC 317
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1369 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1415
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1416 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1450
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1451 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1478
Score = 39.7 bits (91), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1543
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1544 EQA-ADEGFDCVSCQ 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 49/124 (39%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE N+ C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCNMFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC 629
W C
Sbjct: 314 SWKC 317
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1378 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1424
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1425 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1459
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1460 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1487
>gi|115473879|ref|NP_001060538.1| Os07g0661500 [Oryza sativa Japonica Group]
gi|113612074|dbj|BAF22452.1| Os07g0661500 [Oryza sativa Japonica Group]
Length = 1752
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q M
Sbjct: 426 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQATVQDM 479
>gi|50510226|dbj|BAD31424.1| PHD finger transcription factor-like protein [Oryza sativa Japonica
Group]
Length = 1696
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG LL C+GCP AFH ACL P L IP WRC +C Q M
Sbjct: 293 NEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQATVPVM 346
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 59/146 (40%), Gaps = 45/146 (30%)
Query: 498 MGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C C GD NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 367 LGQVPLLLNGDVTCMQCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCASCR- 425
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C +CR + D V+ C++CEK +H CLR
Sbjct: 426 ------------------------VCQVCRQPE------DVSKVMLCERCEKAYHPSCLR 455
Query: 613 KNGLCDLKEIPKDKWFC-----CDDC 633
+ IPK W C C DC
Sbjct: 456 P----IVTSIPKYGWKCKCCRVCTDC 477
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 766 VCEGCGERNDEGRLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 825
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SS ++ G P A C+ C+ A + +I C QCE+ H C
Sbjct: 826 SSWQKNYTQCG----PCASHTACISCQE-----AYNEGDLIIQCIQCERWLHCAC 871
>gi|125559491|gb|EAZ05027.1| hypothetical protein OsI_27209 [Oryza sativa Indica Group]
Length = 1696
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 438 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 252 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 299
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1428 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1487
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1488 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1542
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1543 EQA-ADEGFDCISCQ 1556
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1377 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1423
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1424 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1458
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1459 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1486
Score = 39.7 bits (91), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|222637620|gb|EEE67752.1| hypothetical protein OsJ_25457 [Oryza sativa Japonica Group]
Length = 1646
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
T G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 423 TEDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 472
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1154 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1213
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1214 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1268
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1269 EQA-ADEGFDCVSCQ 1282
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 49/124 (39%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC 629
W C
Sbjct: 314 SWKC 317
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1103 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1149
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1150 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1184
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1185 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1212
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|399218583|emb|CCF75470.1| unnamed protein product [Babesia microti strain RI]
Length = 1643
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 15/111 (13%)
Query: 507 SDDMCHVCGD--GENLLLCNGCPLAFHAACLDPLLIPES-GWRCPNCR-QGHSSSMSRSV 562
+DD C +C + E LLLC+GCP ++H +CL P++ W CP C+ + H R +
Sbjct: 70 NDDRCKICKEINEETLLLCDGCPNSYHLSCLALETEPDADNWYCPMCKPEDHKGHEIRRL 129
Query: 563 DLKGGLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
+ + E+ V C +C+ H ++ CD C FH CL
Sbjct: 130 NRRPRTESTTDHVNSTTCYVCQRHG---------KLLGCDFCSNSFHHSCL 171
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1429 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1488
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1489 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1543
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1544 EQA-ADEGFDCVSCQ 1557
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1378 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1424
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1425 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1459
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1460 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1487
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|297262270|ref|XP_001099471.2| PREDICTED: histone-lysine N-methyltransferase MLL2 [Macaca mulatta]
Length = 5505
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1376 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1435
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1436 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1490
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1491 EQA-ADEGFDCISCQ 1504
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1325 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1371
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1372 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1406
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1407 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1434
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 343
>gi|302855516|ref|XP_002959250.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
gi|300255380|gb|EFJ39692.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
Length = 358
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 114/254 (44%), Gaps = 20/254 (7%)
Query: 597 IYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSL 656
+ CD C H+GC+ G+ +++P WF C C + L+ P+S+
Sbjct: 123 LRCDTCGCWVHLGCV---GVEVAEQVPPRPWFHCRACRSTYLRLEAAAERNPH--PSSAS 177
Query: 657 STINRKHIEKGILFDGTMNDVQWQM-LKKAQCFEEKEKSLLSSATAIFRECFDPIIAECG 715
T L+ T +D Q M + + + + P++A+ G
Sbjct: 178 PT--------HALYVLTPSDHQAAMAVALNRGLTPAVAAGGGAGGRGPVGAVLPLLAQ-G 228
Query: 716 RDLIPVMVYGRNISGQEF-GGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQG 774
+ + +G+ +E+ GG YS +L + V+A +FG + A+L L+AT +++
Sbjct: 229 FNEDAIRGFGQ--PAREYDGGKYSAVLLNRGQPVAAATFNVFGAD-AQLCLLATAVQHRL 285
Query: 775 KGCFQALFSCIERLLCSLNVENLVLPAAEKAESIWTKKFGFRKMSRERLLKYQRDFQLTI 834
KG AL + +E LL + V L++ + A +W + G+R + + L+ R +
Sbjct: 286 KGNGSALVADLEALLADVGVSRLLVQSRGVALPLWLGRLGYRLVPPQEALQLHRTLPIAY 345
Query: 835 FKGTSMLEKKVQCL 848
+ ++++K+++ L
Sbjct: 346 YD-CALMQKQLRPL 358
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1261 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1320
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1321 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1375
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1376 EQA-ADEGFDCVSCQ 1389
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1210 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1256
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1257 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1291
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1292 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1319
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|351706869|gb|EHB09788.1| PHD finger protein 21B [Heterocephalus glaber]
Length = 482
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NLL C+ CP A+H CL P L P+ GW CP C+Q
Sbjct: 303 DELCAACRRGSNLLCCSACPGAYHLGCLHPPLKTAPKGGWLCPKCQQ 349
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 345
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1157 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1216
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1217 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1271
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1272 EQA-ADEGFDCVSCQ 1285
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1106 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1152
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1153 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1187
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1188 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1215
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 343
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1414 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1473
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1474 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1528
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1529 EQA-ADEGFDCVSCQ 1542
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 55/143 (38%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + SN
Sbjct: 314 SWKCKACRVCRACGAGSAELNSN 336
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1363 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1409
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1410 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1444
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1445 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1472
Score = 39.7 bits (91), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1441 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1500
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1501 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1555
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1556 EQA-ADEGFDCVSCQ 1569
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1390 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1436
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1437 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1471
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1472 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1499
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 242 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCCLQG 290
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 340
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 345
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 341
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 344
>gi|195999462|ref|XP_002109599.1| hypothetical protein TRIADDRAFT_53787 [Trichoplax adhaerens]
gi|190587723|gb|EDV27765.1| hypothetical protein TRIADDRAFT_53787 [Trichoplax adhaerens]
Length = 1866
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 51/133 (38%), Gaps = 35/133 (26%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
C +C G+N LLLC+ C H C P L +PE W CP C
Sbjct: 1630 FCQICRKGDNESLLLLCDKCDCGTHTYCCTPKLTSVPEGDWFCPKC-------------- 1675
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
A + C IC D A I C++C +H+ CL +K PK
Sbjct: 1676 -----ASECDKDYCCICEIDDNEKA-------IACERCNSVYHLACLDPV----IKRYPK 1719
Query: 625 DKWFCCDDCNRIH 637
W+C D RI+
Sbjct: 1720 TGWYCRDCRKRIY 1732
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 509 DMCHVC--GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C +C D E + C C +H ACLDP++ P++GW C +CR+
Sbjct: 1682 DYCCICEIDDNEKAIACERCNSVYHLACLDPVIKRYPKTGWYCRDCRK 1729
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 77/177 (43%), Gaps = 31/177 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q ++S
Sbjct: 791 VCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVCCVQCGAASPGFH 850
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C CR +D +I C C++ H C + L
Sbjct: 851 CEWQNNYTHCAPCASLVVCPFCREK-----YVEDDLLIQCRHCDRWLHAAC---DSLFTE 902
Query: 620 KEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDGT 673
+E+ + D+ F C C Q +V + +PA S I K E F+G
Sbjct: 903 EEVEQAADEGFDCSAC-------QPYV---VKPVPAPSAEMIKAKDPEPQYFRFEGV 949
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 52/142 (36%), Gaps = 46/142 (32%)
Query: 508 DDMCHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G GE +L C C FH ACLD L P SGW+CP C+
Sbjct: 123 DARCSVCDGPGELRDLAFCTSCGQHFHGACLDISLTPRKRSGWQCPQCK----------- 171
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+C++ D ++ C+ C+K +H C + +
Sbjct: 172 -----------------VCQNLQ----PGQDSAMLVCETCDKGYHTSCTEPAA----QGL 206
Query: 623 PKDKWFC-----CDDCNRIHAA 639
P W C C DC + A
Sbjct: 207 PTTSWKCKNCWVCSDCGQRPAG 228
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 35/75 (46%), Gaps = 7/75 (9%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC-----GDGENLLLCNGCPLAFHAACLDP 537
TS G H + +++ R+ +G C VC G +L+C C +H +C +P
Sbjct: 142 TSCGQHFHGACLDISLTPRKRSGWQCPQCKVCQNLQPGQDSAMLVCETCDKGYHTSCTEP 201
Query: 538 LL--IPESGWRCPNC 550
+P + W+C NC
Sbjct: 202 AAQGLPTTSWKCKNC 216
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 67/182 (36%), Gaps = 58/182 (31%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 740 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 786
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 787 ----VE--------------CIVC---EVCGKASDPSRLLLCDDCDISYHTYCLDP---- 821
Query: 618 DLKEIPKDKW-----FCCDDCNRIHAALQ-DFVSNRAQTIPASSLST---INRKHIEKGI 668
L+ +PK W CC C ++ +N P +SL K++E +
Sbjct: 822 PLQTVPKGGWKCKWCVCCVQCGAASPGFHCEWQNNYTHCAPCASLVVCPFCREKYVEDDL 881
Query: 669 LF 670
L
Sbjct: 882 LI 883
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 344
>gi|397629214|gb|EJK69257.1| hypothetical protein THAOC_09499 [Thalassiosira oceanica]
Length = 734
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D+C VCGDG +LL C+GC ++H CL+P + +PE W CP+C
Sbjct: 672 DVCEVCGDGGDLLCCDGCINSYHQRCLNPPMEQVPEGQWFCPSC 715
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 342
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1402 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1461
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1462 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1516
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1517 EQX-ADEGFDCISCQ 1530
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 314 SWKCKACRVCRACGAGSAELNPN 336
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1351 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1397
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1398 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1432
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1433 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1460
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPMEELPAHSWKCKACR 321
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 380 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 427
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 412 GSAEVDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 412 GSAEVDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 465
>gi|242041381|ref|XP_002468085.1| hypothetical protein SORBIDRAFT_01g039297 [Sorghum bicolor]
gi|241921939|gb|EER95083.1| hypothetical protein SORBIDRAFT_01g039297 [Sorghum bicolor]
Length = 134
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 39/59 (66%), Gaps = 4/59 (6%)
Query: 247 GGYLCGCPL---CNF-SKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAP 301
GGY C CP C + KV+SA +FE+HAG+++++ N HI L NGK +Y + EL+ P
Sbjct: 3 GGYACACPASARCGYRDKVLSALQFEKHAGSESKNQNGHILLLNGKSLYDLFHELRDVP 61
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQG 340
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + P A + C IC A ++ +I C CE+ H GC
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGC 1490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++++P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PIEDLPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A D N
Sbjct: 314 SWKCKTCRICRACGAGSADLNPN 336
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1334 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1380
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1381 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1415
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1416 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1443
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPIEDLPAHSWKCKTCR 321
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + P A + C IC A ++ +I C CE+ H GC
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGC 1490
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 229 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 275
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++++P
Sbjct: 276 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PIEDLPAH 313
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A D N
Sbjct: 314 SWKCKTCRICRACGAGSADLNPN 336
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1334 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1380
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1381 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1415
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1416 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1443
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 245 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 304
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 305 PPIEDLPAHSWKCKTCR 321
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C QG
Sbjct: 306 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCCLQG 354
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 342
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 45/88 (51%), Gaps = 5/88 (5%)
Query: 468 HAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG--GSDDMCHVCGDGENLLLCNG 525
H +R+P + G+ + +A+ Q+ T +DD C VC DG L+ C+G
Sbjct: 251 HTSSHQQREPSSRGLATPGLQ-YVVAVLTCYPQKMTFAVEHNDDECAVCKDGGELICCDG 309
Query: 526 CPLAFHAACLDPLL--IPESGWRCPNCR 551
CP AFH +CL P L IP WRC C+
Sbjct: 310 CPRAFHLSCLVPPLTSIPRGTWRCQLCQ 337
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 64/130 (49%), Gaps = 15/130 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C C Q +++
Sbjct: 721 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVTCTQCGATTPGLR 780
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C IC D+S T T++ C QC++ FH C + D+
Sbjct: 781 CEWQKNYTQCAPCASLMTCPICLV-DYSEGT----TILQCRQCDRWFHASCQSLHSEEDV 835
Query: 620 KEIPKDKWFC 629
++ ++ + C
Sbjct: 836 EKAAENGFNC 845
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 60/165 (36%), Gaps = 51/165 (30%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDPLL---IPESGWRCPNCRQGHSSSMS 559
DMC VCG +G LL C C +H C+ + + GWRC
Sbjct: 670 DMCVVCGSFGLGAEGR-LLACAQCGQCYHPYCVGIKINKVVLSKGWRC------------ 716
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
LE C +C + D ++ CD C+ +H CL L
Sbjct: 717 --------LE--------CTVCEA---CGQATDPGRLLLCDDCDISYHTYCLDP----PL 753
Query: 620 KEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+ +PKD W C C C L+ ++ N Q P +SL T
Sbjct: 754 QNVPKDSWKCKWCVTCTQCGATTPGLRCEWQKNYTQCAPCASLMT 798
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 50/122 (40%), Gaps = 37/122 (30%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD + L C C L +H CLD + P +GW+CP C+
Sbjct: 346 LCDSPGDLLDQLFCTSCGLHYHGMCLDMAVTPLRRAGWQCPECKV--------------- 390
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
C C++H DD ++ CD C+K +H CL+ ++ +P + W
Sbjct: 391 ----------CQTCKNHG------DDTKMLVCDMCDKGYHTFCLQPA----MESLPTNGW 430
Query: 628 FC 629
C
Sbjct: 431 RC 432
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 64/130 (49%), Gaps = 15/130 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ W+C +C Q +++
Sbjct: 1015 VCEACGQATDPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQCGATTPGLR 1074
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C IC ++S T ++ C QC++ FH C + DL
Sbjct: 1075 CEWQSNYTQCAPCASLSTCPICLV-NYSEGT----VIVQCRQCDRWFHASCQGLHSEDDL 1129
Query: 620 KEIPKDKWFC 629
++ ++ + C
Sbjct: 1130 EKAAENSFDC 1139
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 77/182 (42%), Gaps = 30/182 (16%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H + + +A+ R G C VC GD +L+C+ C +H CL
Sbjct: 360 TSCGLHYHGMCLDMAVTPLRRAGWQCPECKVCQTCKNHGDDTKMLVCDMCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR-----------QGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS 583
P + +P +GWRC NCR H +S+ + A + C++
Sbjct: 420 PAMESLPTNGWRCKNCRVCIQCGTRTSGHWHHNSLLCENCFQNQDPALCCSMCSCILDPE 479
Query: 584 HDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
H + +++C C++ H+ C R+N EI + + C +C R AA Q
Sbjct: 480 H--------HKDLLFCQTCKRWLHLECERQN--SGQTEINPREDYVCFNC-RSPAAEQAL 528
Query: 644 VS 645
++
Sbjct: 529 LT 530
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 62/165 (37%), Gaps = 51/165 (30%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDPLL---IPESGWRCPNCRQGHSSSMS 559
DMC VCG +G LL C C +H C+ + + GWRC
Sbjct: 964 DMCVVCGSFGLGAEGR-LLACAQCGQCYHPFCVGIKINKVVLSKGWRC------------ 1010
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
LE C +C + D ++ CD C+ +H CL L
Sbjct: 1011 --------LE--------CTVCEA---CGQATDPGRLLLCDDCDISYHTYCLDP----PL 1047
Query: 620 KEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+ +PKD W C C C L+ ++ SN Q P +SLST
Sbjct: 1048 QNVPKDSWKCKWCVSCTQCGATTPGLRCEWQSNYTQCAPCASLST 1092
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC C Q
Sbjct: 274 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQ 321
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 955 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1014
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C H ++ ++ C QC++ H C N ++
Sbjct: 1015 CEWQNNYTQCAPCASLSTCPVCYRH-----YREEDLILQCRQCDRWMHAICQNLNTEEEV 1069
Query: 620 KEIPKDKWFCCDDC 633
+ + D F C C
Sbjct: 1070 ENV-ADIGFDCSMC 1082
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 308 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 367
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 368 PVMKSVPTNGWKCKNCR 384
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 289 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 338
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 339 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 373
Query: 623 PKDKWFC 629
P + W C
Sbjct: 374 PTNGWKC 380
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 904 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 950
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 951 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 988
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C L+ ++ +N Q P +SLST
Sbjct: 989 TVPKGGWKCKWCVWCRHCGATSPGLRCEWQNNYTQCAPCASLST 1032
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSM 558
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q S M
Sbjct: 77 NEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSCLQATVSEM 130
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPL--LIPESGWRCPNC 550
G G+ D C++CG NLL C+GCP AFH+ C+ + L+PE W CP C
Sbjct: 417 GSAEMDDGNSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 470
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 45/146 (30%)
Query: 498 MGQRRTTGGSDDMC-HVCGDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C H CG G+ NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 358 LGQVPLLLNGDITCMHCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCVSCR- 416
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C +CR + D V+ C++CEK +H CLR
Sbjct: 417 ------------------------VCQVCRQPE------DVSKVMLCERCEKAYHPSCLR 446
Query: 613 KNGLCDLKEIPKDKWFC-----CDDC 633
+ IPK W C C DC
Sbjct: 447 P----IVTSIPKYGWKCKCCRVCTDC 468
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 49/115 (42%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 816
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SS ++ G P A C+ C+ + +I C QCE+ H C
Sbjct: 817 SSWQKNYTQCG----PCASHAACISCQE-----TYTEGDLIIQCIQCERWLHCAC 862
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 45/146 (30%)
Query: 498 MGQRRTTGGSDDMC-HVCGDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C H CG G+ NL++C+ C +H +C+ L+P +GW+C +CR
Sbjct: 358 LGQVPLLLNGDITCMHCCGMGDVSNLVMCSICGQHYHGSCVGLALLPGVRAGWQCVSCR- 416
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C +CR + D V+ C++CEK +H CLR
Sbjct: 417 ------------------------VCQVCRQPE------DVSKVMLCERCEKAYHPSCLR 446
Query: 613 KNGLCDLKEIPKDKWFC-----CDDC 633
+ IPK W C C DC
Sbjct: 447 P----IVTSIPKYGWKCKCCRVCTDC 468
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 49/115 (42%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ---------GHS 555
+C CG D L+LC+ C +++H C+DP L +P W+C C G +
Sbjct: 757 VCEGCGERNDEARLILCDDCDISYHIYCMDPPLDYVPHGTWKCKWCAHCQTCGSNDPGFN 816
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
SS ++ G P A C+ C+ + +I C QCE+ H C
Sbjct: 817 SSWQKNYTQCG----PCASHTACISCQE-----TYTEGDLIIQCIQCERWLHCAC 862
>gi|431901376|gb|ELK08402.1| Histone-lysine N-methyltransferase MLL2 [Pteropus alecto]
Length = 5640
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 24/174 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1430 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1489
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H C D+
Sbjct: 1490 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHASCESLFTEDDV 1544
Query: 620 KEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
++ D+ F C C Q +V A + L + K E F+G
Sbjct: 1545 EQA-ADEGFDCVSC-------QPYVVKPAAPVAPPELVPMKVKEPEPQYFRFEG 1590
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 54/143 (37%), Gaps = 42/143 (29%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C+
Sbjct: 248 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKV------------- 294
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 295 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PMEELPPH 332
Query: 626 KWFC--CDDCNRIHAALQDFVSN 646
W C C C A + N
Sbjct: 333 SWKCKACRVCRACGAGSAELNPN 355
Score = 40.4 bits (93), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1379 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1425
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1426 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1460
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1461 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1488
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G H + A+ R+ G C VC G+ +L+C C +H CL
Sbjct: 264 TSCGHHYHGACLDTALTARKRAGWQCPECKVCQACRKPGNDSKMLVCETCDKGYHTFCLK 323
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 324 PPMEELPPHSWKCKACR 340
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
+DD C VC DG L+ C+GCP AFH ACL+P L IP WRC C G
Sbjct: 286 NDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCCIVG 334
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 960 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1019
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C + +D ++ C QC++ H C N ++
Sbjct: 1020 CEWQNNYTQCAPCASLSSCPVCYRN-----YREDDLILQCRQCDRWMHAVCQNLNTEEEV 1074
Query: 620 KEIPKDKWFCCDDC 633
+ + D F C C
Sbjct: 1075 ENV-ADIGFDCSMC 1087
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 316 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 375
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 376 PVMKSVPTNGWKCKNCR 392
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 297 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 346
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 347 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 381
Query: 623 PKDKWFC 629
P + W C
Sbjct: 382 PTNGWKC 388
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 909 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 955
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 956 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 993
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 994 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1037
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 65/143 (45%), Gaps = 22/143 (15%)
Query: 480 HIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAA 533
Y + +TLHD S +G+ G C VC GD NLLLC+ C +++H
Sbjct: 450 QTYHTYCVTLHDKLNSAVVGR----GWRCLDCTVCEGCGTGGDEANLLLCDECDVSYHIY 505
Query: 534 CLDPLL--IPESGWR---CPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC-RSHDFS 587
C+ PLL IP+ WR C CR+ + + S + GL P A + C C R++
Sbjct: 506 CMKPLLDKIPQGPWRCQWCSRCRRCNHKAASGNDLTSQGLCFPCASLRKCPRCERNYQL- 564
Query: 588 AATFDDRTVIYCDQCEKEFHVGC 610
+ +I C QC K H C
Sbjct: 565 -----NEKLIRCSQCSKWQHGAC 582
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 58/151 (38%), Gaps = 50/151 (33%)
Query: 510 MCHVCGD-GE----NLLLCNGCPLAFHAACL---DPL--LIPESGWRCPNCRQGHSSSMS 559
MC VCG G+ +++ C+ C +H C+ D L + GWRC +C
Sbjct: 427 MCLVCGSIGKGPEGSMVACSNCAQTYHTYCVTLHDKLNSAVVGRGWRCLDC--------- 477
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
G GG D+ ++ CD+C+ +H+ C++ L
Sbjct: 478 --------TVCEGCGTGG--------------DEANLLLCDECDVSYHIYCMKPL----L 511
Query: 620 KEIPKDKWFC-----CDDCNRIHAALQDFVS 645
+IP+ W C C CN A+ D S
Sbjct: 512 DKIPQGPWRCQWCSRCRRCNHKAASGNDLTS 542
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 48/122 (39%), Gaps = 37/122 (30%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C GD +LL C C L +H CL+ + P SGW+CP C+
Sbjct: 223 LCDKPGDLIDLLFCTSCGLHYHGTCLEITVSPLKRSGWQCPECK---------------- 266
Query: 568 LEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW 627
C CR +D ++ CD C+K +H CL+ C +P D W
Sbjct: 267 ---------VCQTCRQ------PGEDTMMLVCDACDKGYHTFCLKPAIEC----LPTDSW 307
Query: 628 FC 629
C
Sbjct: 308 KC 309
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 71/158 (44%), Gaps = 28/158 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q + +
Sbjct: 859 VCEVCGKATDPSRLLLCDDCDISYHTYCLDPPLHTVPKGGWKCRWCVSCMQCGAVTPGFR 918
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C + +I C CE+ H C L
Sbjct: 919 SEWQNNYTHCAPCASLVSCPVCH-----LKYLEGDLLIQCRHCERWLHAVC---ENLFTE 970
Query: 620 KEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASS 655
+E+ + D+ F C C Q F+ + +IPAS+
Sbjct: 971 EEVEQAADEGFDCSSC-------QPFIP-KPISIPAST 1000
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H + + + + +G C VC G+ +L+C+ C +H CL
Sbjct: 237 TSCGLHYHGTCLEITVSPLKRSGWQCPECKVCQTCRQPGEDTMMLVCDACDKGYHTFCLK 296
Query: 537 PLL--IPESGWRCPNCR 551
P + +P W+C CR
Sbjct: 297 PAIECLPTDSWKCKTCR 313
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 56/150 (37%), Gaps = 51/150 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 808 DMCVVCGSFGRGSEG-HLLACSQCSQCYHPYCVNSRITKVMLL--KGWRC---------- 854
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+E C++C + D ++ CD C+ +H CL
Sbjct: 855 ----------VE--------CIVC---EVCGKATDPSRLLLCDDCDISYHTYCLDP---- 889
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDFVS 645
L +PK W C C C + A F S
Sbjct: 890 PLHTVPKGGWKCRWCVSCMQCGAVTPGFRS 919
>gi|340371689|ref|XP_003384377.1| PREDICTED: hypothetical protein LOC100637285 [Amphimedon
queenslandica]
Length = 400
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D CH+C DG LL C+ CP A+H CL P L IP+ W CP C
Sbjct: 318 DYCHICQDGGELLCCDRCPKAYHLQCLYPPLKKIPDGTWHCPRC 361
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 74/176 (42%), Gaps = 28/176 (15%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q + S
Sbjct: 969 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAVSPGFH 1028
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C CR A ++ +I C CE+ H GC L
Sbjct: 1029 CEWQNSYTHCGPCASLVTCPACR-----APYVEEDLLIQCRHCERWMHAGC---ESLFTE 1080
Query: 620 KEIPK--DKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGIL-FDG 672
+E+ + D+ F C C Q +V I L + K E F+G
Sbjct: 1081 EEVEQAADEGFDCASC-------QPYVVKPVVPIAPPELVPVKVKEPEPQYFRFEG 1129
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 55/143 (38%), Gaps = 45/143 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G+ +LL C C +H ACLD L +GW+CP+C+
Sbjct: 225 DAHCVVCDGLGELRDLLFCTSCGQHYHGACLDTALTARKRAGWQCPDCKV---------- 274
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C CR +D ++ C+ C+K +H CL+ ++ +
Sbjct: 275 ---------------CQTCRQPG------EDSMMLVCEACDKGYHTFCLKPA----IQSL 309
Query: 623 PKDKWFC-----CDDCNRIHAAL 640
P D W C C C A L
Sbjct: 310 PPDSWKCKTCRVCRACGACPAEL 332
>gi|410046801|ref|XP_003313790.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Pan
troglodytes]
Length = 2476
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1026 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1085
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + P A + C IC A ++ +I C CE+ H GC D+
Sbjct: 1086 CEWQNSYTHCGPCASLVTCPICH-----APYVEEDLLIQCRHCERWMHAGCESLFTEDDV 1140
Query: 620 KEIPKDKWFCCDDCN 634
++ D+ F C C
Sbjct: 1141 EQA-ADEGFDCVSCQ 1154
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 975 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1021
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1022 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1056
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1057 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1084
>gi|196003152|ref|XP_002111443.1| hypothetical protein TRIADDRAFT_23466 [Trichoplax adhaerens]
gi|190585342|gb|EDV25410.1| hypothetical protein TRIADDRAFT_23466 [Trichoplax adhaerens]
Length = 1296
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 5/75 (6%)
Query: 481 IYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDP 537
+Y + ++ A + + R +D C +CG G+N LLLC+GC ++H CL P
Sbjct: 256 LYFIVIIIMYQNAFTFYLANYRKENYADTACMLCGLGDNEEFLLLCDGCDDSYHTYCLIP 315
Query: 538 LL--IPESGWRCPNC 550
L IP WRCP C
Sbjct: 316 PLQSIPPGDWRCPKC 330
>gi|344254289|gb|EGW10393.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 4002
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1799 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1858
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + P A + C +C A ++ +I C CE+ H GC
Sbjct: 1859 CEWQNSYTHCGPCASLVTCPVCH-----APYVEEDLLIQCRHCERWMHAGC 1904
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 50/124 (40%), Gaps = 40/124 (32%)
Query: 511 CHVC-GDGE--NLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G GE +L C C +H ACLD L +GW+CP C++
Sbjct: 642 CAVCEGPGELCDLFFCTSCGHHYHGACLDTALTARKRAGWQCPECKE------------- 688
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR +D ++ C+ C+K +H CL+ ++E+P
Sbjct: 689 ------------CQACRKPG------NDSKMLVCETCDKGYHTFCLKP----PIEELPAH 726
Query: 626 KWFC 629
W C
Sbjct: 727 SWKC 730
Score = 39.7 bits (91), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1748 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1794
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1795 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1829
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1830 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1857
>gi|449486846|ref|XP_004174324.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Taeniopygia guttata]
Length = 2088
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CLDP L PE W CP+C Q
Sbjct: 340 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCPQ 385
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + P A + C +C A ++ +I C CE+ H GC
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPVCH-----APYVEEDLLIQCRHCERWMHAGC 1490
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1334 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1380
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1381 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1415
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1416 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1443
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDP--LLIPESGWRCP---NCRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C +C Q ++S
Sbjct: 1385 VCEVCGQASDPSRLLLCDDCDISYHTYCLDPPLLTVPKGGWKCKWCVSCMQCGAASPGFH 1444
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + P A + C +C A ++ +I C CE+ H GC
Sbjct: 1445 CEWQNSYTHCGPCASLVTCPVCH-----APYVEEDLLIQCRHCERWMHAGC 1490
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 51/148 (34%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 1334 DMCVVCGSFGRGAEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1380
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1381 ----VE--------------CIVC---EVCGQASDPSRLLLCDDCDISYHTYCLDP---- 1415
Query: 618 DLKEIPKDKWFC--CDDCNRIHAALQDF 643
L +PK W C C C + AA F
Sbjct: 1416 PLLTVPKGGWKCKWCVSCMQCGAASPGF 1443
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C +G
Sbjct: 409 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSCLRG 457
>gi|291231854|ref|XP_002735870.1| PREDICTED: PHD finger protein 21B-like [Saccoglossus kowalevskii]
Length = 143
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
G DD+C VC LL+C+ C L +H CLDP L IP W CP C+ G+
Sbjct: 4 GHDDICAVCKQSGELLMCDTCSLVYHLGCLDPPLSFIPAGIWICPQCQAGN 54
>gi|17562600|ref|NP_504523.1| Protein LET-418 [Caenorhabditis elegans]
gi|403399446|sp|G5EBZ4.1|LE418_CAEEL RecName: Full=Protein let-418; AltName: Full=Lethal protein 418
gi|11095333|gb|AAG29838.1| LET-418 [Caenorhabditis elegans]
gi|351020697|emb|CCD62685.1| Protein LET-418 [Caenorhabditis elegans]
Length = 1829
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 56/132 (42%), Gaps = 23/132 (17%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C C LLLC+ CP A+H C+D + PE W C +C + H + +
Sbjct: 255 NNDYCEECKQDGELLLCDTCPRAYHTVCIDENMEEPPEGDWSCAHCIE-HGPEVVKEEPA 313
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
K E C IC+ + ++ CD C FH C+ L E+PK
Sbjct: 314 KQNDEF-------CKICK---------ETENLLLCDSCVCSFHAYCIDP----PLTEVPK 353
Query: 625 DKWFCCDDCNRI 636
++ + C C +
Sbjct: 354 EETWSCPRCETV 365
>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 4990
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 57/146 (39%), Gaps = 47/146 (32%)
Query: 500 QRRTTGGSDDMCHVCGDGENL---LLCNGCPLAFHAACLDP--LLIPE--SGWRCPNCRQ 552
Q T G + C +C + + L C C +H CL P L PE +GW+CP+C+
Sbjct: 1819 QAETIAGEEAFCVLCCQADKIGKQLFCTSCGHHYHGGCLHPSVALSPEVRAGWQCPDCKV 1878
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C +CR +D ++ CD C+K +H CL+
Sbjct: 1879 -------------------------CQMCRQPG------EDSKMLVCDTCDKGYHTFCLK 1907
Query: 613 KNGLCDLKEIPKDKWFC-----CDDC 633
+ IPK+ W C C DC
Sbjct: 1908 PV----MTAIPKNGWKCKNCRVCGDC 1929
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 50/121 (41%), Gaps = 35/121 (28%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H CLDP L +P+ W+C C + +
Sbjct: 2196 VCEGCGKPHDEGRLLLCDECDISYHIYCLDPPLDQVPKGTWKCKWC----------VMCI 2245
Query: 565 KGGLEAPG---------AEVGG------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
G PG + G C +CR H + DD +I C QC + H
Sbjct: 2246 NCGTTTPGFGCNWQNNYTQCGPCRSKIDCPVCR-HKYQ----DDEMIIQCLQCNRWLHAL 2300
Query: 610 C 610
C
Sbjct: 2301 C 2301
>gi|449665037|ref|XP_002164686.2| PREDICTED: uncharacterized protein LOC100213470 [Hydra
magnipapillata]
Length = 1382
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 52/132 (39%), Gaps = 20/132 (15%)
Query: 511 CHVCGDG------ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-----RQGHSSS 557
C +C G ++L++C+ C LA+H AC+ P L +PES W CP C R+ S+
Sbjct: 965 CKICNRGKGRYHKDHLVICDACSLAYHLACIKPPLKEVPESTWECPECTSNSQREKISNG 1024
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
SR + D D VI CD C +H C
Sbjct: 1025 KSRE---RSSKTTESESESSNEDSDHEDICHVCETDGLVILCDTCPLSYHFDCHNP---- 1077
Query: 618 DLKEIPKDKWFC 629
L+ P+ W C
Sbjct: 1078 PLRHAPRGPWSC 1089
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
+D+CHVC ++LC+ CPL++H C +P L P W C CR+ +S
Sbjct: 1048 EDICHVCETDGLVILCDTCPLSYHFDCHNPPLRHAPRGPWSCYECRKNSKNS 1099
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 68/179 (37%), Gaps = 43/179 (24%)
Query: 502 RTTGGS--DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
R +G S +D C C +G +LL C CP AFH C DP L IP W C C+ +
Sbjct: 36 RRSGRSFNNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIPAGEWICKKCKAEYQ 95
Query: 556 SSMSRSVDLKGGLEAPGAEV-------------------------GGCVICRSHDFSAAT 590
+ +R + + L +P A + G +S+ +AT
Sbjct: 96 DN-TRPIAKQSELRSPFAVLVKDNVKQNPSEFRLPKEMQYHIPLPGSKKRKKSNGHGSAT 154
Query: 591 FDDR-------TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQD 642
R +I+CD C FH+ C+ L P W C + A+QD
Sbjct: 155 KGGRRGNSFLKPLIHCDYCPLSFHLDCMDP----PLTTTPSGLWMCPNHAEHFLPAMQD 209
>gi|169596086|ref|XP_001791467.1| hypothetical protein SNOG_00794 [Phaeosphaeria nodorum SN15]
gi|160701226|gb|EAT92289.2| hypothetical protein SNOG_00794 [Phaeosphaeria nodorum SN15]
Length = 1085
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 63/151 (41%), Gaps = 23/151 (15%)
Query: 487 MTLHDIAISLAMGQRRTTGG--SDDMCHVCGDGE-----NLLLCNGCPLAFHAACLDPLL 539
++ D+ + L + RT G D C +C DG+ ++ C+GC LA H C
Sbjct: 353 VSYPDLVVRLLSMESRTGQGEEQDTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPF 412
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHD--FSAATFDDRTVI 597
IPE W C C+ G P +E+ GC+ C + D F T +
Sbjct: 413 IPEGQWLCRRCQL-------------VGRGTPASELPGCIFCPNVDGAFKQTTAMKWAHL 459
Query: 598 YCDQCEKEFHVG-CLRKNGLCDLKEIPKDKW 627
C E +G + + D++++PK +W
Sbjct: 460 LCAMWIPEVSLGNATFQEPVQDVEKVPKTRW 490
>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
Length = 2934
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 55/130 (42%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD NL++C+ C +H C+ +P SGW+C +C++
Sbjct: 294 CSALGDVGNLMMCSICGDHYHGTCVGLAQLPGVRSGWQCGSCKK---------------- 337
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D S + RTV C+QC+K +H CLR + IPK W
Sbjct: 338 ---------CQICRVPDSS----EGRTV-GCEQCDKIYHASCLRP----IMTSIPKYGWK 379
Query: 629 C-----CDDC 633
C C DC
Sbjct: 380 CRCCRICSDC 389
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 25/116 (21%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D L+LC+ C +++H C+DP L +P+ W+C C ++ L
Sbjct: 680 ICEGCGQRNDEGRLILCDDCDISYHTYCMDPPLEQVPQGNWKCKWC----------AICL 729
Query: 565 KGGLEAPG---------AEVGGCVICRSHDFSAATF-DDRTVIYCDQCEKEFHVGC 610
K G PG +E G C S + D +I C+QCE+ H GC
Sbjct: 730 KCGTNDPGYNCAWLNNYSECGPCASQVSCPCCGEGYADGELIIQCNQCERWLHCGC 785
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C CGDG L+ C+GCP AFH ACL P L +P WRC +C
Sbjct: 246 NEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSC 291
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%)
Query: 240 LDGIVNGGGYLCGCPLCNFSKVVSAHEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 299
L G++ G LC C LC +V+S +FE HA + R +I ENGK + +++ +
Sbjct: 35 LRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRG 94
Query: 300 APLGILEEVVKKVAGSSFNEGSF 322
APL LE ++ + S E F
Sbjct: 95 APLHDLEATIQNIVCSPPEEKYF 117
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 35/59 (59%), Gaps = 7/59 (11%)
Query: 497 AMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
AM Q+ ++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 2 AMAQK-----NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 55
>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
Length = 1566
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 52/133 (39%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDLK 565
+C GD L C C +H +CLDP + + +GW+CP C+
Sbjct: 377 VCDSPGDMMEQLFCTSCGQHYHGSCLDPPVDVNPVVRAGWQCPECKI------------- 423
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C CR DD ++ CD C+K +H CLR ++ IPK+
Sbjct: 424 ------------CQTCRQ------PGDDNKMLVCDTCDKGYHTFCLRPV----MQTIPKN 461
Query: 626 KWFC-----CDDC 633
W C C DC
Sbjct: 462 GWKCKNCRICTDC 474
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 88/220 (40%), Gaps = 40/220 (18%)
Query: 510 MCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR-----------QG 553
+C C GD +L+C+ C +H CL P++ IP++GW+C NCR G
Sbjct: 423 ICQTCRQPGDDNKMLVCDTCDKGYHTFCLRPVMQTIPKNGWKCKNCRICTDCGSRTPGSG 482
Query: 554 HSSSMSRSVDLKGGLEAPGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVG 609
SS + + + C IC R H A +I C+ C+K HV
Sbjct: 483 PSSRWHLNYSVCDSCYQQRNKGLCCPICGKAYRQHTAHNA------MIQCESCKKWVHVD 536
Query: 610 CLRKNGLCDLKEIPKDKW---FCCDDC-----NRIHAALQDF-----VSNRAQTIPASSL 656
C + +++ DK + C DC N+ H L D +++ PA+++
Sbjct: 537 CDESIDISVYQQLKDDKLTTIYNCVDCRNKDPNQQHEILSDSENLLEITDSNSGYPAANV 596
Query: 657 STINRKHIEKGILFDGTMNDVQWQMLKKAQCFEEKEKSLL 696
T + E I+ D D + + E+SLL
Sbjct: 597 DT-DSNMTETDIISDTQQIDDDPDYIPIGLHMDMPEQSLL 635
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHS 555
+C CG D LLLC+ C +++H CL+P L +P+ GW+C C G +
Sbjct: 850 VCEGCGKASDEGRLLLCDDCDISYHTYCLEPPLQNVPKGGWKCKWCVCCTKCGATSPGFN 909
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
S + G P + + C +C F +D +I C QC + H C
Sbjct: 910 SEWQNNYTQCG----PCSSLLTCPVC----FKEYK-EDELIIQCVQCYRWLHAEC 955
>gi|403223808|dbj|BAM41938.1| SWI/SNF-related chromatin remodelling factor [Theileria orientalis
strain Shintoku]
Length = 1002
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 23/131 (17%)
Query: 507 SDDMCHVC-----GDGENLLLCNGCPLAFHAACLDPLLIPES-GWRCPNCR--QGHSSSM 558
+D+ C +C + + LLLC+GCP ++H +CL+ + P+S W CP CR ++
Sbjct: 90 NDERCKICRERNRDESDVLLLCDGCPNSYHMSCLELQVEPDSEKWYCPMCRPEAFERVNL 149
Query: 559 SRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
R L E + + C +C+ H ++ CD C FH CL + D
Sbjct: 150 RRRNPLMEASEHVNSSI--CYVCQRHG---------KLLGCDFCSNSFHYDCLPE---FD 195
Query: 619 LKEIPKDKWFC 629
+ I D W C
Sbjct: 196 IDTIS-DTWEC 205
>gi|84997071|ref|XP_953257.1| SWI/SNF-related chromatin remodelling factor (ISWI homologue)
[Theileria annulata strain Ankara]
gi|65304253|emb|CAI76632.1| SWI/SNF-related chromatin remodelling factor (ISWI homologue),
putative [Theileria annulata]
Length = 1972
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 17/113 (15%)
Query: 507 SDDMCHVC-----GDGENLLLCNGCPLAFHAACLDPLLIPES-GWRCPNCRQGHSSSMSR 560
+D+ C +C + + LLLC+GCP ++H +CLD + P+S W CP C+ ++
Sbjct: 79 NDERCKICKERNRDESDILLLCDGCPNSYHMSCLDLHVEPDSEKWYCPMCKPESFEGVNI 138
Query: 561 SVDLKGGLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
+ L V C +C+ H ++ CD C FH CL
Sbjct: 139 RAHRRNPLLDTSEHVNSSICYVCQRHG---------KLLGCDFCSNSFHYDCL 182
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1060 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGPR 1119
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C S ++ ++ C QC++ H C N ++
Sbjct: 1120 CEWQNNYTQCAPCASLSTCPVC-----SRNYREEDLILQCRQCDRWMHAVCQNLNTEDEV 1174
Query: 620 KEIPKDKWFCCDDC 633
+ + D F C C
Sbjct: 1175 ENV-ADIGFDCTMC 1187
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 81/205 (39%), Gaps = 33/205 (16%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 404 TTCGQHYHGMCLDIAVTALKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHTFCLQ 463
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC----RSHDFSAAT 590
P++ +P +GW+C NCR + + G C++C + D +A
Sbjct: 464 PVMDSVPTNGWKCKNCR----------ICAECGTRTSSQWHHNCLVCDNCYQPQDNTACP 513
Query: 591 F--------DDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKW--FCCDDCNRIHAAL 640
F + +++C C++ H+ C K +L K+ + CC +
Sbjct: 514 FCGKCYQPDFQKDMLHCQMCKRWIHIEC-DKPADTELDSQLKEDYVCMCCKQLGDLEPVQ 572
Query: 641 QDFVSNRAQTIPASSLSTINRKHIE 665
++T+P S ++N E
Sbjct: 573 PCDEMETSETVPQSGTDSVNEMETE 597
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 985 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1044
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C IC R D ++ C QC++ H C N
Sbjct: 1045 CEWQNNYTQCAPCASLSSCPICCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1095
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 1096 EEEVENV-ADIGFDCSMCRPYMPA 1118
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/213 (22%), Positives = 87/213 (40%), Gaps = 32/213 (15%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 343 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 402
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC----RSHD----F 586
P++ +P +GW+C NCR + ++ G + C+IC + D F
Sbjct: 403 PVMKSVPTNGWKCKNCR----------ICIECGTRSSAQWHHNCLICDTCYQQQDNLCPF 452
Query: 587 SAATFD---DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDF 643
++ + ++YC+ C++ H+ C + +I +D + C C + A +
Sbjct: 453 CGKCYNPEFQKDMLYCNMCKRWVHLECDKPTDHELDSQIKED--YICMYCKHLGAEIDSL 510
Query: 644 VSNRAQTIPASSLSTINRKHIEKGILF-DGTMN 675
+P + IE +LF D T N
Sbjct: 511 HPGNELEMPELPTDYSSGMEIEDEVLFLDPTAN 543
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 324 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 373
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 374 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 408
Query: 623 PKDKWFC 629
P + W C
Sbjct: 409 PTNGWKC 415
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 934 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKISKVVLSKGWRC------------- 980
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 981 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1018
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1019 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1062
>gi|71029212|ref|XP_764249.1| DNA-dependent helicase [Theileria parva strain Muguga]
gi|68351203|gb|EAN31966.1| DNA-dependent helicase, putative [Theileria parva]
Length = 2026
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 51/113 (45%), Gaps = 17/113 (15%)
Query: 507 SDDMCHVC-----GDGENLLLCNGCPLAFHAACLDPLLIPES-GWRCPNCRQGHSSSMSR 560
+D+ C +C + + LLLC+GCP ++H +CLD + P+S W CP C+ ++
Sbjct: 79 NDERCKICKERNRDESDILLLCDGCPNSYHMSCLDLHVEPDSEKWYCPMCKPESFEGVNV 138
Query: 561 SVDLKGGLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
+ L V C +C+ H ++ CD C FH CL
Sbjct: 139 RAHRRNPLLDTSEHVNSSICYVCQRHG---------KLLGCDFCSNSFHYDCL 182
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 52/134 (38%), Gaps = 31/134 (23%)
Query: 511 CHVCGD----GENLLLCNGCPLAFHAACLD-PLLIPESGWRCPNCRQGHSSSMSRSVDL- 564
C C D G+ ++ C+ CP FH CLD LL SG + ++ VDL
Sbjct: 8 CKYCLDDYRQGDIIVRCSSCPRTFHRECLDHELLTNTSG------SAYYQNNYDNPVDLS 61
Query: 565 --------KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
+ E C IC+ + + ++ CD C +H+ CL
Sbjct: 62 DFTCDSCQQYAQEVEIVNDERCKICKERNRDESDI----LLLCDGCPNSYHMSCL----- 112
Query: 617 CDLKEIP-KDKWFC 629
DL P +KW+C
Sbjct: 113 -DLHVEPDSEKWYC 125
>gi|444519036|gb|ELV12526.1| PHD finger protein 21B [Tupaia chinensis]
Length = 489
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C GCP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DEHCAACTRGANLQPCGGCPGAYHLSCLDPPLKTAPKGTWLCPKCQQ 356
>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
Length = 1458
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C C FH+ C+ +P+ SGW C C +
Sbjct: 212 CSSLGDLSKLIMCCSCGDHFHSTCIGLANLPDTRSGWSCARCTK---------------- 255
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR H+ + F I C+QC+K +H CLR + IPK W
Sbjct: 256 ---------CQICRQHETNDIKF-----IKCEQCQKMYHAMCLRPT----ISSIPKYGWK 297
Query: 629 C-----CDDC 633
C C DC
Sbjct: 298 CNRCRVCTDC 307
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H C+ P L +P W+C C + + +
Sbjct: 582 VCEGCGKKNDEARLLLCDECDISYHIYCVKPPLETVPHGNWKCSFCTICQKCGRNPTEKV 641
Query: 565 KGGLEA-----PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
K + P A C +CR A D +I C+QCE+ H C
Sbjct: 642 KNSDASLSECLPCASQNSCPLCRK-----AYSDGEMIIQCEQCEQWSHFLC 687
>gi|432909970|ref|XP_004078255.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1882
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 2/61 (3%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP + PE W CP+C + R + +
Sbjct: 385 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCCPPMKGKVQRVITWRW 444
Query: 567 G 567
G
Sbjct: 445 G 445
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 615 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 674
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 675 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 725
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 726 EEEVENV-ADIGFDCSMCRPYMPA 748
Score = 45.8 bits (107), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 545 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 602
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 603 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 631
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 632 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 687
Query: 654 SSLST 658
+SLS+
Sbjct: 688 ASLSS 692
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 515 GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
G+ +L+C+ C +H CL P++ +P +GW+C NCR
Sbjct: 4 GEDSKMLVCDTCDKGYHTFCLQPVMKSVPTNGWKCKNCR 42
>gi|431905346|gb|ELK10391.1| Chromodomain-helicase-DNA-binding protein 4 [Pteropus alecto]
Length = 1854
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CLDP + PE W CP+C +
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEK 416
>gi|190344475|gb|EDK36155.2| hypothetical protein PGUG_00253 [Meyerozyma guilliermondii ATCC
6260]
Length = 709
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 58/144 (40%), Gaps = 27/144 (18%)
Query: 409 TKKRDNDLHRLLFLPNGLPDGERLTYIVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAH 468
TKKR H L P G +T +G+R G K+ K ISP + A
Sbjct: 197 TKKRRKRTHNKLETPKQPTRGSSVTPASEGRRSIRGSKRI---------KVISPKKHAA- 246
Query: 469 AGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPL 528
+A +SNG I+ A + TG +DD C CG + C+ CP
Sbjct: 247 TNLAP---------SSNG------TITAAASESEQTGDNDDFCSTCGGSGVFICCDSCPK 291
Query: 529 AFHAACLDPLL--IPESGWRCPNC 550
+FH C DP + PE W C C
Sbjct: 292 SFHFLCCDPPIEECPEDNWNCREC 315
>gi|156379809|ref|XP_001631648.1| predicted protein [Nematostella vectensis]
gi|156218692|gb|EDO39585.1| predicted protein [Nematostella vectensis]
Length = 824
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDP--LLIPESGWRCPNCR 551
D C VC DG LL C+ CP+A+H CL P + +P W+CP C+
Sbjct: 12 DYCRVCKDGGQLLCCDKCPMAYHLKCLIPPMMRVPTGEWKCPRCQ 56
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 180
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 181 EEEVENV-ADIGFDCSMC 197
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 71/184 (38%), Gaps = 51/184 (27%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLI 540
+H+ + + + T + DMC VCG G+ LL C+ C +H C+ ++
Sbjct: 1 MHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVV 58
Query: 541 PESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCD 600
GWRC LE C +C + D ++ CD
Sbjct: 59 LSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLCD 87
Query: 601 QCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPAS 654
C+ +H CL L+ +PK W C C C A L+ ++ +N Q P +
Sbjct: 88 DCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCA 143
Query: 655 SLST 658
SLS+
Sbjct: 144 SLSS 147
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1119
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1120 EEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 46.2 bits (108), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 939 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 996
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 997 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1025
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1026 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1081
Query: 654 SSLST 658
+SLS+
Sbjct: 1082 ASLSS 1086
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1119
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1120 EEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 939 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 996
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 997 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1025
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1026 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1081
Query: 654 SSLST 658
+SLS+
Sbjct: 1082 ASLSS 1086
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1119
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1120 EEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 46.2 bits (108), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 939 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 996
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 997 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1025
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1026 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1081
Query: 654 SSLST 658
+SLS+
Sbjct: 1082 ASLSS 1086
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 180
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 181 EEEVENV-ADIGFDCSMC 197
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 71/184 (38%), Gaps = 51/184 (27%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLI 540
+H+ + + + T + DMC VCG G+ LL C+ C +H C+ ++
Sbjct: 1 MHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVV 58
Query: 541 PESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCD 600
GWRC LE C +C + D ++ CD
Sbjct: 59 LSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLCD 87
Query: 601 QCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPAS 654
C+ +H CL L+ +PK W C C C A L+ ++ +N Q P +
Sbjct: 88 DCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCA 143
Query: 655 SLST 658
SLS+
Sbjct: 144 SLSS 147
>gi|332838463|ref|XP_508960.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 isoform 3 [Pan troglodytes]
Length = 1825
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP + PE W CP+C
Sbjct: 364 DYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC 407
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 941 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1000
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1001 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1051
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1052 EEEVENV-ADIGFDCSMC 1068
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 292 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 351
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 352 PVMKSVPTNGWKCKNCR 368
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 273 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 322
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 323 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 357
Query: 623 PKDKWFC 629
P + W C
Sbjct: 358 PTNGWKC 364
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 871 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 928
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 929 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 957
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 958 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1013
Query: 654 SSLST 658
+SLS+
Sbjct: 1014 ASLSS 1018
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1046 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1105
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1106 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1156
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1157 EEEVENV-ADIGFDCSMC 1173
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 398 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 457
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 458 PVMKSVPTNGWKCKNCR 474
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 379 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 428
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 429 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 463
Query: 623 PKDKWFC 629
P + W C
Sbjct: 464 PTNGWKC 470
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 976 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 1033
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 1034 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1062
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1063 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1118
Query: 654 SSLST 658
+SLS+
Sbjct: 1119 ASLSS 1123
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1119
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1120 EEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 45.8 bits (107), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 939 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 996
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 997 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1025
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1026 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1081
Query: 654 SSLST 658
+SLS+
Sbjct: 1082 ASLSS 1086
>gi|345494441|ref|XP_001603951.2| PREDICTED: lysine-specific demethylase lid [Nasonia vitripennis]
Length = 1704
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 379 ICHNCGKGDNEENMLLCDGCDDSYHTFCLLPPLTEIPKGDWRCPKC 424
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 50/127 (39%), Gaps = 27/127 (21%)
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSV 562
G D C VC G ++LC+ CP A+H CL+P L W CP+C
Sbjct: 366 GDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCEN---------- 415
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
G LE + C +C+ D ++ CD C +H CL L EI
Sbjct: 416 --DGALEDDDEHMEFCRVCK---------DGGELLCCDSCTSAYHTHCLNP----PLTEI 460
Query: 623 PKDKWFC 629
P W C
Sbjct: 461 PDGDWKC 467
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 427 EFCRVCKDGGELLCCDSCTSAYHTHCLNPPLTEIPDGDWKCPRC 470
>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
Length = 906
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 81/211 (38%), Gaps = 46/211 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIP----ESGWRCPNCRQGHSSSMSRSVDL 564
+ C G+ +LL C C HA CL+ +I +GW+C C+
Sbjct: 163 EFCSQVGELSDLLFCTTCGAHSHARCLNEGIIVTGEVRAGWQCYTCKI------------ 210
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
C CR D DD +I C+ C+K +H CL N + D +PK
Sbjct: 211 -------------CQQCRKSD------DDAQMIICETCDKGWHTYCL--NPVMD--SVPK 247
Query: 625 DKWFC--CDDC----NRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGTMNDVQ 678
D W C C +C N+IH Q+ S + S N+ + E +L + ++
Sbjct: 248 DGWSCTNCRNCIECGNKIHLEWQNDYSTCPVCWSKQACSVCNKDYKEDDVLLKCS-ECLR 306
Query: 679 WQMLKKAQCFEEKEKSLLSSATAIFRECFDP 709
WQ Q + E + ++ + C P
Sbjct: 307 WQHALHEQVYSESDADSMAEQNFKCKLCRPP 337
>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Harpegnathos saltator]
Length = 1948
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 31/55 (56%), Gaps = 8/55 (14%)
Query: 504 TGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
TG +DD C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 418 TGAADDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 472
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------ITG 419
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
+ + C +C+ D ++ CD C +H CL L EIP
Sbjct: 420 AADDDDEHMEFCRVCK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 466
Query: 627 WFC 629
W C
Sbjct: 467 WKC 469
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+DD C C DG L+ C+GCP AFH CLDP L IP W+C C
Sbjct: 251 NDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWC 296
>gi|356510796|ref|XP_003524120.1| PREDICTED: uncharacterized protein LOC100793933 [Glycine max]
Length = 2325
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 2/70 (2%)
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP 548
++A +++ G C +C G NLL C+ CP +H CLDP L IP W+CP
Sbjct: 53 EVATDQISSKKKGNDGYYYECVICDVGGNLLCCDSCPRTYHLQCLDPPLKRIPNGKWQCP 112
Query: 549 NCRQGHSSSM 558
+C +G M
Sbjct: 113 SCFEGKDQRM 122
>gi|156088833|ref|XP_001611823.1| SNF2 family N-terminal domain containing protein [Babesia bovis]
gi|154799077|gb|EDO08255.1| SNF2 family N-terminal domain containing protein [Babesia bovis]
Length = 1744
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 26/117 (22%)
Query: 507 SDDMCHVC-----GDGENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSR 560
+DD C +C D + LLLC+GCP ++H CL+ + P+ W CP C+ + +S
Sbjct: 76 NDDRCKLCKEKNRNDNDVLLLCDGCPNSYHIRCLELVAEPDGDQWFCPMCKPDNFQVVSF 135
Query: 561 SVDLKGGLEAPGAEVGG------CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
+ P A+ G C +C+ H ++ CD C FH GCL
Sbjct: 136 RKK-----QHPLADTGDHVNSSICYVCQRHG---------KLLGCDFCSNSFHHGCL 178
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 29/137 (21%), Positives = 55/137 (40%), Gaps = 40/137 (29%)
Query: 511 CHVCGD----GENLLLCNGCPLAFHAACLDPLLIPESG--------------WRCPNCRQ 552
C +C D G++++ C CP ++H CL+ + G ++C NC +
Sbjct: 8 CKLCQDDYKSGDSIIKCTSCPKSYHRECLEHDISTNVGHNAELVDGHYDFRTFQCDNCLR 67
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
++ + D C +C+ + + D+ ++ CD C +H+ CL
Sbjct: 68 YEDENVIENDD-------------RCKLCKEKNRN----DNDVLLLCDGCPNSYHIRCLE 110
Query: 613 KNGLCDLKEIPKDKWFC 629
+ E D+WFC
Sbjct: 111 L-----VAEPDGDQWFC 122
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 454 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 513
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 514 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 564
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 565 EEEVENV-ADIGFDCSMC 581
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 384 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 441
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 442 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 470
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 471 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 526
Query: 654 SSLST 658
+SLS+
Sbjct: 527 ASLSS 531
>gi|414590323|tpg|DAA40894.1| TPA: hypothetical protein ZEAMMB73_603732 [Zea mays]
Length = 345
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 43/94 (45%), Gaps = 4/94 (4%)
Query: 460 ISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGEN 519
+SPS E A P H T + HD IS + + R G C C G N
Sbjct: 29 VSPSN-ENEATSPTSDSPMSHGSTKKKLK-HDTNISPSAKKIRGHDGYFFECVECDLGGN 86
Query: 520 LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
LL C+ CP +H CL+P L +P W+CP CR
Sbjct: 87 LLCCDSCPRVYHLECLNPPLKRLPPGKWQCPRCR 120
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1009 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1068
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1069 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1119
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1120 EEEVENV-ADIGFDCSMC 1136
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 45.8 bits (107), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 939 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 996
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 997 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1025
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1026 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1081
Query: 654 SSLST 658
+SLS+
Sbjct: 1082 ASLSS 1086
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 956 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1015
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1016 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1066
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 1067 EEEVENV-ADIGFDCSMCRPYMPA 1089
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 307 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 366
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 367 PVMKSVPTNGWKCKNCR 383
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 288 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 337
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 338 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 372
Query: 623 PKDKWFC 629
P + W C
Sbjct: 373 PTNGWKC 379
Score = 45.8 bits (107), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 886 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 943
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 944 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 972
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 973 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1028
Query: 654 SSLST 658
+SLS+
Sbjct: 1029 ASLSS 1033
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 180
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 181 EEEVENV-ADIGFDCSMCRPYMPA 203
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 71/184 (38%), Gaps = 51/184 (27%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLI 540
+H+ + + + T + DMC VCG G+ LL C+ C +H C+ ++
Sbjct: 1 MHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVV 58
Query: 541 PESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCD 600
GWRC LE C +C + D ++ CD
Sbjct: 59 LSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLCD 87
Query: 601 QCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPAS 654
C+ +H CL L+ +PK W C C C A L+ ++ +N Q P +
Sbjct: 88 DCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCA 143
Query: 655 SLST 658
SLS+
Sbjct: 144 SLSS 147
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1113
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1114 EEEVENV-ADIGFDCSMC 1130
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 933 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 990
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 991 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1019
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1020 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1075
Query: 654 SSLST 658
+SLS+
Sbjct: 1076 ASLSS 1080
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 454 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 513
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 514 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 564
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 565 EEEVENV-ADIGFDCSMC 581
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 384 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 441
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 442 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 470
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 471 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 526
Query: 654 SSLST 658
+SLS+
Sbjct: 527 ASLSS 531
>gi|444728247|gb|ELW68711.1| Chromodomain-helicase-DNA-binding protein 5 [Tupaia chinensis]
Length = 2128
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 366 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 409
>gi|351713762|gb|EHB16681.1| Chromodomain-helicase-DNA-binding protein 5 [Heterocephalus glaber]
Length = 2263
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 80/230 (34%), Gaps = 54/230 (23%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 412 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC---------------- 455
Query: 567 GLEAPGAEVGGCVI------------CRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
P E GCV+ RS R ++ QC + G L +
Sbjct: 456 ---VPSFEAQGCVLSVLGGALCYPATLRSPSILGPQCVSR-LMETGQCTRVGPGGSLARG 511
Query: 615 GL---CDLK-----EIPKDKWFCC-----DDCNRIHAALQDFVSNRAQTIPASSLSTINR 661
G CD K E P+ C + C + +Q + R PA + +
Sbjct: 512 GRGWPCDNKLTAKCEGPRTAMSCSGPEDGETCPPLKGKVQRILHWRWTEPPAPFMVGLPG 571
Query: 662 KHIEKGILFDGTMND-------VQWQMLKKAQCFEEKEKSLLSSATAIFR 704
+E G+ + V+W L C KE L T ++R
Sbjct: 572 PDVEPGVPPPKPLEGIPEREFFVKWAGLSYWHCSWVKELQLELYHTVMYR 621
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 960 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1019
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1020 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1070
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1071 EEEVENV-ADIGFDCSMC 1087
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 314 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 373
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 374 PVMKSVPTNGWKCKNCR 390
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 295 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 344
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 345 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 379
Query: 623 PKDKWFC 629
P + W C
Sbjct: 380 PTNGWKC 386
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 909 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKISKVVLSKGWRC------------- 955
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 956 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 993
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 994 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1037
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 580 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 639
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 640 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 690
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 691 EEEVENV-ADIGFDCSMC 707
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T DMC VCG G+ LL C+ C +H C+ +
Sbjct: 510 SMHNTVVLFSSSDKFTL--QQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 567
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 568 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 596
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 597 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 652
Query: 654 SSLST 658
+SLS+
Sbjct: 653 ASLSS 657
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 521 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 580
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 581 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 631
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 632 EEEVENV-ADIGFDCSMC 648
Score = 46.6 bits (109), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCG---DGEN--LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG GE LL C+ C +H C+ ++ GWRC
Sbjct: 470 DMCVVCGSFGQGEEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 516
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 517 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 554
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 555 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 598
>gi|341890749|gb|EGT46684.1| hypothetical protein CAEBREN_05565 [Caenorhabditis brenneri]
Length = 1728
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 47/121 (38%), Gaps = 22/121 (18%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGL 568
C C L++C+ C +H C+DP P+ W C +C +K G
Sbjct: 247 CAECDQAGELVMCSTCERMYHCVCIDPNSDEPPKGVWSCVHC-------------MKNG- 292
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
PG + VI R H D+ + C C +H CL L E+P D WF
Sbjct: 293 --PGFPIDPNVIVRKHTNCQICKDNDHTLLCATCPNAYHAYCLNP----PLDEMPDDDWF 346
Query: 629 C 629
C
Sbjct: 347 C 347
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C +C D ++ LLC CP A+HA CL+P L +P+ W CP C
Sbjct: 309 CQICKDNDHTLLCATCPNAYHAYCLNPPLDEMPDDDWFCPRC 350
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 64/135 (47%), Gaps = 18/135 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 895 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 954
Query: 562 VDLKGGLE--APGAEVGGC-VICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ + AP A + C V CR++ ++ ++ C QC++ H C N +
Sbjct: 955 CEWQNNYTQCAPCASLSSCPVCCRNYR------EEDLILQCRQCDRWMHAVCQNLNTEEE 1008
Query: 619 LKEIPKDKWFCCDDC 633
++ + D F C C
Sbjct: 1009 VENV-ADIGFDCSMC 1022
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 291 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 350
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 351 PVMKSVPTNGWKCKNCR 367
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 50/127 (39%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 272 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECK----------- 320
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+C++ S +D ++ CD C+K +H CL+ +K +
Sbjct: 321 -----------------VCQNCKQSG---EDSKMLVCDTCDKGYHTFCLQPV----MKSV 356
Query: 623 PKDKWFC 629
P + W C
Sbjct: 357 PTNGWKC 363
Score = 45.8 bits (107), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GE----NLLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T DMC VCG G+ LL C+ C +H C+ +
Sbjct: 825 SMHNTVVLFSSSDKFTL--QQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 882
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 883 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 911
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 912 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 967
Query: 654 SSLST 658
+SLS+
Sbjct: 968 ASLSS 972
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ CP AFH CL+P L +P W CP C
Sbjct: 476 EFCRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPRC 519
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 48/123 (39%), Gaps = 26/123 (21%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C ++
Sbjct: 418 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWSCPHCEGEGIQEQEEDEHME- 476
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C +C+ D ++ CD C FH CL LK +P K
Sbjct: 477 ----------FCRVCK---------DGGELLCCDSCPAAFHTFCLNP----PLKNVPTGK 513
Query: 627 WFC 629
W C
Sbjct: 514 WNC 516
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1002 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1061
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1062 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1112
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1113 EEEVENV-ADIGFDCSMC 1129
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 390 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 424
Query: 623 PKDKWFC 629
P + W C
Sbjct: 425 PTNGWKC 431
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T DMC VCG G+ LL C+ C +H C+ +
Sbjct: 932 SMHNTVVLFSSSDKFTL--QQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 989
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 990 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1018
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1019 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1074
Query: 654 SSLST 658
+SLS+
Sbjct: 1075 ASLSS 1079
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1113
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1114 EEEVENV-ADIGFDCSMC 1130
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 390 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 424
Query: 623 PKDKWFC 629
P + W C
Sbjct: 425 PTNGWKC 431
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T DMC VCG G+ LL C+ C +H C+ +
Sbjct: 933 SMHNTVVLFSSSDKFTL--QQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 990
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 991 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 1019
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 1020 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 1075
Query: 654 SSLST 658
+SLS+
Sbjct: 1076 ASLSS 1080
>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
Length = 1482
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 209 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 252
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H CLR + IPK W
Sbjct: 253 ---------CQICRQQDS-----NDTKYVKCEQCQKTYHASCLRPV----ISAIPKYGWK 294
Query: 629 C-----CDDC 633
C C DC
Sbjct: 295 CNRCRVCTDC 304
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC----RQGHSSSMSR 560
+C CG D LLLC+ C +++H C++P L +P W+C C + G + +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKS 639
Query: 561 SVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
LE P C +C+ + + +I C+ CE H C
Sbjct: 640 EFGDSNMLECPSCTSQSSCPVCK-----VSYSNGEMIIQCEHCELWAHFHC 685
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 61/135 (45%), Gaps = 18/135 (13%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG+ + LLLC+ C +++H CLDP L +P+ W+C C Q SSS
Sbjct: 201 VCEACGEASDPGRLLLCDDCDISYHTYCLDPPLHTVPKGAWKCKWCVRCVQCGSSSPGVR 260
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
D + P + C +C+ D ++ C QC++ H C +N +C+
Sbjct: 261 CDWQDNYSCCGPCGSLRRCPLCQR-----PYAHDELIMQCQQCDRWVHATC--QNLMCEE 313
Query: 620 K-EIPKDKWFCCDDC 633
E D+ F C C
Sbjct: 314 DVEAAADEGFDCSLC 328
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 925 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 984
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 985 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1035
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1036 EEEVENV-ADIGFDCSMC 1052
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 276 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 335
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 336 PVMKSVPTNGWKCKNCR 352
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 257 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 306
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 307 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 341
Query: 623 PKDKWFC 629
P + W C
Sbjct: 342 PTNGWKC 348
Score = 45.8 bits (107), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 855 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 912
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 913 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 941
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 942 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 997
Query: 654 SSLST 658
+SLS+
Sbjct: 998 ASLSS 1002
>gi|157125686|ref|XP_001660731.1| hypothetical protein AaeL_AAEL002037 [Aedes aegypti]
gi|108882584|gb|EAT46809.1| AAEL002037-PA [Aedes aegypti]
Length = 2884
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 53/124 (42%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GE LLLC+GC +H C P + IP+ W C C+ + ++ R +
Sbjct: 2335 CQFCQSGEQEDKLLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECK--NKATGDRKCIVC 2392
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
GGL P +G ++YC+ C + +H C + L + P+
Sbjct: 2393 GGLRPP--PLG------------------KMVYCELCPRAYHQDCY----IPPLLKYPRG 2428
Query: 626 KWFC 629
KW+C
Sbjct: 2429 KWYC 2432
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 70 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 129
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 130 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 180
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 181 EEEVENV-ADIGFDCSMCRPYMPA 203
Score = 44.3 bits (103), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 19 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 65
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 66 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 103
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 104 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 147
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 31/48 (64%), Gaps = 2/48 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 98 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQ 145
>gi|307209985|gb|EFN86754.1| Histone demethylase JARID1A [Harpegnathos saltator]
Length = 1625
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 306 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 351
>gi|427795843|gb|JAA63373.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1933
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 53/130 (40%), Gaps = 37/130 (28%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C ++++ D K
Sbjct: 1664 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYEC-------LNKTQDEK 1716
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C ++ CD C K FH CL L + PK
Sbjct: 1717 -----------VCILCGKKG---------KLVRCDACPKVFHHTCLDP----PLSKPPKG 1752
Query: 626 KWFCCDDCNR 635
KW CC C +
Sbjct: 1753 KW-CCSGCAK 1761
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
+ +C +CG L+ C+ CP FH CLDP L P+ W C C +G
Sbjct: 1715 EKVCILCGKKGKLVRCDACPKVFHHTCLDPPLSKPPKGKWCCSGCAKG 1762
>gi|332026170|gb|EGI66312.1| Lysine-specific demethylase 5A [Acromyrmex echinatior]
Length = 1637
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1003 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1062
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1063 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1113
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1114 EEEVENV-ADIGFDCSMC 1130
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLRRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C CR + +D ++ CD C+K +H CL+ +K +
Sbjct: 390 ---------------CQNCRQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 424
Query: 623 PKDKWFC 629
P + W C
Sbjct: 425 PTNGWKC 431
Score = 46.2 bits (108), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCG---DGEN--LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG GE LL C+ C +H C+ ++ GWRC
Sbjct: 952 DMCVVCGSFGQGEEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 998
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 999 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1036
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1037 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1080
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1004 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1063
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1064 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1114
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1115 EEEVENV-ADIGFDCSMC 1131
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ R G C VC G+ +L+C+ C +H CL
Sbjct: 359 TTCGQHYHGMCLDIAVTPLRRAGWQCPECKVCQNCRQSGEDSKMLVCDTCDKGYHTFCLQ 418
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 419 PVMKSVPTNGWKCKNCR 435
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 340 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLRRAGWQCPECKV---------- 389
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C CR + +D ++ CD C+K +H CL+ +K +
Sbjct: 390 ---------------CQNCRQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 424
Query: 623 PKDKWFC 629
P + W C
Sbjct: 425 PTNGWKC 431
Score = 46.2 bits (108), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCG---DGEN--LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG GE LL C+ C +H C+ ++ GWRC
Sbjct: 953 DMCVVCGSFGQGEEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 999
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 1000 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1037
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1038 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1081
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 49/124 (39%), Gaps = 34/124 (27%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C+ + +
Sbjct: 1684 CQFCQSGDNEDKLLLCDGCDRGYHMYCFKPKMESIPDGDWYCHECKNKSNGEKN------ 1737
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C + + C+ C + +H+ CL L ++P+
Sbjct: 1738 ------------CIVCGKRPI-------KNYVICEHCPRIYHIECLNP----PLSKVPRA 1774
Query: 626 KWFC 629
KW C
Sbjct: 1775 KWNC 1778
>gi|358412582|ref|XP_003582344.1| PREDICTED: PHD finger protein 21B isoform 1 [Bos taurus]
gi|359066204|ref|XP_003586213.1| PREDICTED: PHD finger protein 21B isoform 2 [Bos taurus]
Length = 477
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 298 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 344
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 91/221 (41%), Gaps = 35/221 (15%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 922 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 981
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHV----GCL 611
+ + AP A + C +C R D ++ C QC+ V G +
Sbjct: 982 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDSSMKVFLLIGIV 1032
Query: 612 RKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFD 671
R+ G K +D + +L V R ++ P L IN+ + +L
Sbjct: 1033 RRKGQDPPKTYTQDGVCLTESGMTQLQSLTVTVPRRKRSKPKLKLKIINQNSV--AVL-- 1088
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLS----SATAIFRECFD 708
T D+Q + + + + +E L+ S ++ RE D
Sbjct: 1089 QTPPDIQSEHSRDGEMDDSREGELMDCDGKSESSPEREAVD 1129
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 265 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 324
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 325 PVMKSVPTNGWKCKNCR 341
>gi|328786362|ref|XP_003250774.1| PREDICTED: lysine-specific demethylase lid isoform 1 [Apis
mellifera]
Length = 1643
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|390367018|ref|XP_003731167.1| PREDICTED: PHD finger protein 21A-like isoform 1
[Strongylocentrotus purpuratus]
gi|390367020|ref|XP_003731168.1| PREDICTED: PHD finger protein 21A-like isoform 2
[Strongylocentrotus purpuratus]
Length = 501
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+ G +D+C VC LL+C+ C L +H CLDP L +P W CP C+
Sbjct: 351 SDGHEDICAVCRQIGELLMCDTCNLVYHLTCLDPPLAAVPPGAWSCPECK 400
>gi|307166621|gb|EFN60647.1| Histone demethylase JARID1A [Camponotus floridanus]
Length = 1566
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 308 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 353
>gi|328786364|ref|XP_003250775.1| PREDICTED: lysine-specific demethylase lid isoform 2 [Apis
mellifera]
Length = 1617
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 304 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 349
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 62/160 (38%), Gaps = 24/160 (15%)
Query: 501 RRTTGGSDDMCHVCGDGENLLLCNG-CPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
R T +DD C +CG+ L+ C+G C AFH CL L P +RC C G+ + +
Sbjct: 432 RTTMTANDDTCRICGELGQLICCDGGCRGAFHLECLSILQPPTGEFRCDECSTGNHTCYT 491
Query: 560 RSVDLKGGLEAPGAEVGGCVI----------CRSHDFSAATF-----DDRTVIYCDQCEK 604
+ GA++ C C F A F ++ CD C
Sbjct: 492 --------CDKVGADLIKCQFPHCNKLYHRGCAEKQFKADNFCLVCGTGGDLVVCDGCPG 543
Query: 605 EFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAALQDFV 644
+H C++ K + +WFC D + + D V
Sbjct: 544 AYHAACIKSTFAFTGKPDEQGQWFCHDCLTGTKSMIHDVV 583
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 6/67 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLI------PESGWRCPNCRQGHSSSMSR 560
+D+ C VCG G +L++C+GCP A+HAAC+ + W C +C G S +
Sbjct: 522 ADNFCLVCGTGGDLVVCDGCPGAYHAACIKSTFAFTGKPDEQGQWFCHDCLTGTKSMIHD 581
Query: 561 SVDLKGG 567
V K G
Sbjct: 582 VVWAKYG 588
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 85 SISLAKGKVKSEGHRDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 144
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 145 KRIPMGKWHCPTCNQ 159
>gi|118341427|gb|AAI27573.1| LOC568230 protein [Danio rerio]
gi|118341429|gb|AAI27575.1| LOC568230 protein [Danio rerio]
Length = 430
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D C VC G ++LC+ CP A+H CL+P L PE W CP+C +
Sbjct: 384 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCEK 429
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1007 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1066
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1067 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1117
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1118 EEEVENV-ADIGFDCSMC 1134
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 360 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 419
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 420 PVMKSVPTNGWKCKNCR 436
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 341 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 390
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 391 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 425
Query: 623 PKDKWFC 629
P + W C
Sbjct: 426 PTNGWKC 432
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 956 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 1002
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 1003 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1040
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1041 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1084
>gi|380014866|ref|XP_003691437.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
lid-like [Apis florea]
Length = 1643
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEENMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|219111827|ref|XP_002177665.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410550|gb|EEC50479.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1255
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 40/89 (44%), Gaps = 16/89 (17%)
Query: 478 YRHIYTSNGMTLHDIAISL-------------AMGQRRTTGGSDDMCHVCGDGENLLLCN 524
YR +YT NG L D S+ GS D C+VC + NLL C+
Sbjct: 608 YRTVYTPNGFFLIDQVTSVDDFGEIMPKENVATSAASLIEDGSVDYCNVCRNHGNLLCCD 667
Query: 525 GCPLAFHAACL---DPLLIPESGWRCPNC 550
CP AFH+ C+ D L E+ W CP C
Sbjct: 668 YCPRAFHSECIHVKDEELDSEAPWECPVC 696
>gi|119893354|ref|XP_876176.2| PREDICTED: PHD finger protein 21B isoform 2 [Bos taurus]
gi|297475482|ref|XP_002688025.1| PREDICTED: PHD finger protein 21B isoform 1 [Bos taurus]
gi|296486907|tpg|DAA29020.1| TPA: PHD finger protein 21B [Bos taurus]
Length = 489
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 356
>gi|170053718|ref|XP_001862805.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874114|gb|EDS37497.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 3017
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 54/126 (42%), Gaps = 31/126 (24%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GE LLLC+GC +H C P + IP+ W C C+ + ++ R +
Sbjct: 2471 CQFCQSGEQEDKLLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECK--NKATGDRKCIVC 2528
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
GGL P +G ++YC+ C + +H C + + + P+
Sbjct: 2529 GGLRPP--PLG------------------KMVYCELCPRAYHQDCY----IPPMLKYPRG 2564
Query: 626 KWFCCD 631
KW+C +
Sbjct: 2565 KWYCTN 2570
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 237 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 282
>gi|429329536|gb|AFZ81295.1| helicase family member protein [Babesia equi]
Length = 1736
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 17/113 (15%)
Query: 507 SDDMCHVCGD-----GENLLLCNGCPLAFHAACLDPLLIPESG-WRCPNCRQGHSSSMSR 560
+D+ C +C + + LLLC+GCP ++H +CL+ + P+S W CP CR + ++
Sbjct: 79 NDERCKLCKEKNRDSSDILLLCDGCPNSYHMSCLELQVEPDSDKWYCPMCRPDNFKGINV 138
Query: 561 SVDLKGGLEAPGAEVGG--CVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCL 611
+ L V C +C+ H ++ CD C FH CL
Sbjct: 139 RKSRRNPLTDSSDHVNSSICYVCQRHG---------KLLGCDFCSNAFHHDCL 182
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 224 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 269
>gi|339244153|ref|XP_003378002.1| putative PHD finger protein [Trichinella spiralis]
gi|316973126|gb|EFV56753.1| putative PHD finger protein [Trichinella spiralis]
Length = 864
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 78/184 (42%), Gaps = 38/184 (20%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWR---CPNCRQGHSSSMSRS 561
+C CG G++ LLLC+ C +++H CLDP L IP+ WR C C+ ++ +
Sbjct: 266 VCEGCGTGDDEQHLLLCDECDVSYHMYCLDPPLDSIPQGAWRCKWCSTCQFCGATPPNGM 325
Query: 562 VDLKGGLEA--PGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC-- 617
+D L A A + C C ++ +I CD C + H C NGLC
Sbjct: 326 LDSIKNLRACFKCASLYSCCFCH-----LQYKEEDMIILCDICHRWSHANC---NGLCAE 377
Query: 618 DLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTIPASSLSTINRKHIEKGILFDGT-MND 676
D+ + D F C C P + S+ + +G+L + ++
Sbjct: 378 DILKKGLDAGFICVYCR-----------------PGDACSSAAMHFVIEGVLLTKSGLST 420
Query: 677 VQWQ 680
VQW+
Sbjct: 421 VQWR 424
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 85 SISLAKGKVKSEGHRDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 144
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 145 KRIPMGKWHCPTCNQ 159
>gi|391331997|ref|XP_003740425.1| PREDICTED: uncharacterized protein LOC100907650 [Metaseiulus
occidentalis]
Length = 1922
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 50/124 (40%), Gaps = 35/124 (28%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDP-LLIPESGWRCPNCRQGHSSSMSRSVDLKG 566
C CG GEN LLLC+ C +H C P +++P+ W C C SV+L
Sbjct: 1538 CQFCGSGENEESLLLCDACDKGYHTYCFKPEMVVPQGDWFCFEC--------INSVNLN- 1588
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
CVIC + + CD C K FH C+ +L ++P K
Sbjct: 1589 ---------KVCVIC-------GKPGNTKMAQCDVCPKLFHPVCV------NLAKVPPRK 1626
Query: 627 WFCC 630
W C
Sbjct: 1627 WTCV 1630
>gi|321456775|gb|EFX67875.1| hypothetical protein DAPPUDRAFT_229111 [Daphnia pulex]
Length = 1870
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 47/164 (28%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAF-----HAACLDPLLIPESGWR-----CPNCRQ 552
++ G+ C +C LL G + F H CLD +L W CPN R+
Sbjct: 51 SSTGNSATCPIC-----LLKFKGQAIGFPEVCGHPFCLDCIL----EWSKTVQTCPNDRR 101
Query: 553 GHSSSMSRSVDLKGGL---------------EAPGAEVGGCVICRSHDFSAATFDDRTVI 597
+ + R +DL G + E P ++ GC +CR+ D + T++
Sbjct: 102 KFDNILVR-LDLDGEIVRIVPVNDNQSKDEEEDPFPDITGCQVCRNGDR------EETML 154
Query: 598 YCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC--CDDCNRIHAA 639
CD+C++ +H+ CL L E+P + WFC CD I A
Sbjct: 155 LCDRCDQGYHMDCLDP----PLDEVPIEDWFCPQCDGTQAIAGA 194
>gi|158297171|ref|XP_317442.4| AGAP008017-PA [Anopheles gambiae str. PEST]
gi|157015066|gb|EAA12387.4| AGAP008017-PA [Anopheles gambiae str. PEST]
Length = 2930
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GE+ LLLC+GC +H C P + IP+ W C C+ + ++ R +
Sbjct: 2387 CQFCQSGESEDKLLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECK--NKATGDRKCIVC 2444
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
GGL P +G ++YC+ C + +H C + + + P+
Sbjct: 2445 GGLRPP--PLG------------------KMVYCELCPRAYHQDCY----IPPMLKYPRG 2480
Query: 626 KWFC 629
KW+C
Sbjct: 2481 KWYC 2484
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 15/75 (20%)
Query: 493 AISLAMGQRRTTGGSDDM-------------CHVCGDGENLLLCNGCPLAFHAACLDPLL 539
+ISLA G+ ++ G D C VC G NLL C+ CP +H CL+P L
Sbjct: 92 SISLAKGKVKSEGHHDQFSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPL 151
Query: 540 --IPESGWRCPNCRQ 552
IP W CP C Q
Sbjct: 152 KRIPMGKWHCPTCNQ 166
>gi|47206538|emb|CAF92234.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C VC G ++LC+ CP A+H CLDP L PE W CP+C
Sbjct: 299 DYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC 342
>gi|195347259|ref|XP_002040171.1| GM16061 [Drosophila sechellia]
gi|194135520|gb|EDW57036.1| GM16061 [Drosophila sechellia]
Length = 1476
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 203 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 246
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H CLR + IPK W
Sbjct: 247 ---------CQICRQQDS-----NDTKYVKCEQCQKIYHASCLRPV----ISAIPKYGWK 288
Query: 629 C-----CDDC 633
C C DC
Sbjct: 289 CNRCRVCTDC 298
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC----RQGHSSSMSR 560
+C CG D LLLC+ C +++H C++P L +P W+C C + G + +
Sbjct: 574 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKS 633
Query: 561 SVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
LE P C +C+ + + T+I C+ CE H C
Sbjct: 634 EFGDSNMLECPSCTSQSSCPVCK-----VSYSNGETIIQCEHCELWAHFHC 679
>gi|339242099|ref|XP_003376975.1| putative bromodomain protein [Trichinella spiralis]
gi|316974284|gb|EFV57780.1| putative bromodomain protein [Trichinella spiralis]
Length = 1670
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 51/125 (40%), Gaps = 34/125 (27%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
MC +C D N LLLC+GC + +H C P + +PE W CP C
Sbjct: 1389 MCQICRDDCNESQLLLCDGCDMGYHTYCFRPKMTKVPEEDWYCPEC-------------- 1434
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
A C++C H + ++ C +C +E+H C+ L +K + K
Sbjct: 1435 ----VAKATRRSCCLVCCRH-------SSQPMLGCAECGREYHAECV---DLDPVKSV-K 1479
Query: 625 DKWFC 629
W C
Sbjct: 1480 SNWRC 1484
>gi|449690917|ref|XP_004212502.1| PREDICTED: nucleosome-remodeling factor subunit BPTF-like, partial
[Hydra magnipapillata]
Length = 588
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMS 559
SDD C CG +LL C CP +H CL P L +PE+ W CP C + H +S
Sbjct: 280 SDDYCRKCGRMGDLLCCELCPAVYHLECLSPPLLEVPENEWFCPICAEQHVKGVS 334
>gi|444322063|ref|XP_004181687.1| hypothetical protein TBLA_0G02280 [Tetrapisispora blattae CBS 6284]
gi|387514732|emb|CCH62168.1| hypothetical protein TBLA_0G02280 [Tetrapisispora blattae CBS 6284]
Length = 769
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR 551
++D C CG + + C+ CP +FH CLDP L +PE W CPNC+
Sbjct: 375 NEDFCSSCGQTGSFVCCDTCPKSFHFLCLDPPLDPDNLPEGNWSCPNCQ 423
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 213 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 258
>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
Length = 1481
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 209 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 252
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H CLR + IPK W
Sbjct: 253 ---------CQICRQQDS-----NDTKYVKCEQCQKIYHASCLRPV----ISAIPKYGWK 294
Query: 629 C-----CDDC 633
C C DC
Sbjct: 295 CNRCRVCTDC 304
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC----RQGHSSSMSR 560
+C CG D LLLC+ C +++H C++P L +P W+C C + G + +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKN 639
Query: 561 SVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
LE P C +C+ + + +I C+ CE H C
Sbjct: 640 EFGESNMLECPSCTSQSSCPVCK-----VSYSNGEMIIQCEHCELWAHFHC 685
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
+DD C VC DG L+ C+GCP +FH +CL P L IP WRC C +S
Sbjct: 27 SNDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDACNTQRPTS 80
>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
Length = 1481
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 209 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 252
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H CLR + IPK W
Sbjct: 253 ---------CQICRQQDS-----NDTKYVKCEQCQKIYHASCLRPV----ISAIPKYGWK 294
Query: 629 C-----CDDC 633
C C DC
Sbjct: 295 CNRCRVCTDC 304
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC----RQGHSSSMSR 560
+C CG D LLLC+ C +++H C++P L +P W+C C + G + +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKS 639
Query: 561 SVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
LE P C +C+ + + +I C+ CE H C
Sbjct: 640 EFGDSNMLECPSCTSQSSCPVCK-----VSYSNGEMIIQCEHCELWAHFHC 685
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/188 (26%), Positives = 73/188 (38%), Gaps = 41/188 (21%)
Query: 451 IVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIA-----ISLAMGQRRTTG 505
+ C C++ + E G RRQ R +++G HD I + R
Sbjct: 1114 LYCKHCSEGDTKGAEEKSVGGDGRRQSLR--LSADGR--HDDVEENDEIRMKSSNRELES 1169
Query: 506 GS----DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRS 561
G+ D C +C LL C+GCP AFH AC+ P+ W C C R+
Sbjct: 1170 GAVGPWDVECFICKLYGELLGCDGCPKAFHLACIGIKSWPQEEWFCDECDMQTCGVCGRN 1229
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
++ VIC S D S C+K FH+ C++ L++
Sbjct: 1230 ----------KIKLNSHVICGSEDGSKG------------CDKVFHLKCVK------LEK 1261
Query: 622 IPKDKWFC 629
+P+ WFC
Sbjct: 1262 VPESDWFC 1269
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 66/159 (41%), Gaps = 48/159 (30%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHS------------ 555
++ C+VC +G ++ C+GC FH +CL+ +P C +C +G +
Sbjct: 1077 EEECYVCREGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCSEGDTKGAEEKSVGGDG 1136
Query: 556 -------SSMSRSVDLKGGLE-------------APGAEVGGCVICRSHDFSAATFDDRT 595
S+ R D++ E A G C IC+ +
Sbjct: 1137 RRQSLRLSADGRHDDVEENDEIRMKSSNRELESGAVGPWDVECFICKLYG---------E 1187
Query: 596 VIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCN 634
++ CD C K FH+ C+ +K P+++WF CD+C+
Sbjct: 1188 LLGCDGCPKAFHLACI------GIKSWPQEEWF-CDECD 1219
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 62/134 (46%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1012 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1071
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C + ++ ++ C QC++ H C N ++
Sbjct: 1072 CEWQNNYTQCAPCASLSTCPVCYRN-----YREEELILQCRQCDRWMHAICQNLNTEEEV 1126
Query: 620 KEIPKDKWFCCDDC 633
+ I D F C C
Sbjct: 1127 ENI-ADIGFDCTMC 1139
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 355 TTCGQHYHGMCLDIAITPLKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 414
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 415 PVIDSVPTNGWKCKNCR 431
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 961 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 1007
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 1008 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1045
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C L+ ++ +N Q P +SLST
Sbjct: 1046 TVPKGGWKCKWCVWCRHCGATSPGLRCEWQNNYTQCAPCASLST 1089
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 63/137 (45%), Gaps = 20/137 (14%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN---CRQGHSSSMSRS 561
+C VCG D LLLC+ C +++H CLDP L +P+ GW+C C Q + S
Sbjct: 1022 VCEVCGKASDPSRLLLCDDCDISYHTYCLDPPLNTVPKGGWKCKWCVCCVQCGAVSPGFH 1081
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C IC+ ++ +I C CE+ H C + L
Sbjct: 1082 CEWQNNYTHCAPCASLVTCPICQ-----VKYVEEDLLIQCQHCERWMHAVC---DNLFTE 1133
Query: 620 KEIPK--DKWFCCDDCN 634
+E+ + D+ F C C
Sbjct: 1134 EEVEQAADEGFDCTSCQ 1150
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 66/164 (40%), Gaps = 48/164 (29%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC G+ +LL C C L +H CL+ + P SGW+C C+
Sbjct: 231 DSRCMVCDAPGELRDLLFCTSCGLHYHGTCLEITVTPRKRSGWQCHECKV---------- 280
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C CR + +D ++ C+ CEK +H CL+ ++ +
Sbjct: 281 ---------------CQTCR------LSGEDSRMLVCEACEKCYHTYCLKPA----IESV 315
Query: 623 PKDKWFCCDDCNRIHAALQDFVSNRAQTIPA-SSLSTINRKHIE 665
P D W C + L DF ++ P + + ++ +H+E
Sbjct: 316 PADSWKCKTE-------LADFGTHHTTLFPNYTPAADVSVEHME 352
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 69/182 (37%), Gaps = 58/182 (31%)
Query: 509 DMCHVCG------DGENLLLCNGCPLAFHAACLDP-----LLIPESGWRCPNCRQGHSSS 557
DMC VCG +G +LL C+ C +H C++ +L+ GWRC
Sbjct: 971 DMCVVCGSFGRGSEG-HLLACSQCSQCYHPYCVNSKITKVMLL--KGWRC---------- 1017
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
V+ C++C + D ++ CD C+ +H CL
Sbjct: 1018 ----VE--------------CIVC---EVCGKASDPSRLLLCDDCDISYHTYCLDP---- 1052
Query: 618 DLKEIPKDKW-----FCCDDCNRIHAALQ-DFVSNRAQTIPASSLST---INRKHIEKGI 668
L +PK W CC C + ++ +N P +SL T K++E+ +
Sbjct: 1053 PLNTVPKGGWKCKWCVCCVQCGAVSPGFHCEWQNNYTHCAPCASLVTCPICQVKYVEEDL 1112
Query: 669 LF 670
L
Sbjct: 1113 LI 1114
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 8/73 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
TS G+ H + + + R+ +G C VC G+ +L+C C +H CL
Sbjct: 250 TSCGLHYHGTCLEITVTPRKRSGWQCHECKVCQTCRLSGEDSRMLVCEACEKCYHTYCLK 309
Query: 537 PLL--IPESGWRC 547
P + +P W+C
Sbjct: 310 PAIESVPADSWKC 322
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C VC DG L+ C+GCP AFH ACL P L IP WRC +C Q
Sbjct: 4 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQA 52
>gi|76155653|gb|AAX26942.2| SJCHGC07786 protein [Schistosoma japonicum]
Length = 234
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 41/95 (43%), Gaps = 7/95 (7%)
Query: 464 QFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENL 520
+ MAA R + T + L + L R D C VC D +NL
Sbjct: 91 ELRTRTWMAAWRTEVENARTLTRLNL--LHACLDACVRWEKSVEDARCRVCRRKTDDDNL 148
Query: 521 LLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
LLC+GC LAFH CL P L +P W CP CR
Sbjct: 149 LLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTCRPA 183
>gi|357624184|gb|EHJ75056.1| hypothetical protein KGM_20601 [Danaus plexippus]
Length = 1448
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
MCH+CG G E +LLC+GC ++H CL P L +P+ WRCP C
Sbjct: 322 MCHICGRGDIEEQMLLCDGCDDSYHTFCLVPPLADVPKGDWRCPVC 367
>gi|432937609|ref|XP_004082462.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Oryzias latipes]
Length = 1475
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
C +C GD +N+LLC+GC H CL P L +PE W CP+CR SS
Sbjct: 1140 CRICRRKGDADNMLLCDGCDRGHHTHCLRPRLKSVPEGDWFCPDCRPKQRSS 1191
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++D C VCGDG L+ C+GCP AFH CL P L +P W+C +C
Sbjct: 264 NEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSC 309
>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
abelii]
Length = 1215
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 15/115 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 671 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 730
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ + AP A + C +C + ++ ++ C QC++ H C N
Sbjct: 731 CEWQNNYTQCAPCASLSSCPVCYRN-----YREEDLILQCRQCDRWMHAVCQNLN 780
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 64/147 (43%), Gaps = 29/147 (19%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 22 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 81
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS----HD----F 586
P++ +P +GW+C NCR + ++ G + C+IC S D F
Sbjct: 82 PVMKSVPTNGWKCKNCR----------ICIECGTRSSSQWHHNCLICDSCYQQQDNLCPF 131
Query: 587 SAATFD---DRTVIYCDQCEKEFHVGC 610
+ + +++C+ C++ H+ C
Sbjct: 132 CGKCYHPELQKDMLHCNMCKRWVHLEC 158
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 50/127 (39%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 3 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECK----------- 51
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
+C++ S +D ++ CD C+K +H CL+ +K +
Sbjct: 52 -----------------VCQNCKQSG---EDSKMLVCDTCDKGYHTFCLQPV----MKSV 87
Query: 623 PKDKWFC 629
P + W C
Sbjct: 88 PTNGWKC 94
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DM VCG G+ LL C+ C +H C+ +
Sbjct: 601 SMHNTVVLFSSSDKFTL--NQDMYVVCGSFGQGAEGRLLACSQCCQCYHPYCVSIKITKV 658
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 659 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 687
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 688 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 743
Query: 654 SSLST 658
+SLS+
Sbjct: 744 ASLSS 748
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 1028 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGPR 1087
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1088 CEWQNNYTQCAPCASLSICPVCCRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1138
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ + D F C C
Sbjct: 1139 EDEVENV-ADIGFDCTMC 1155
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 369 TTCGQHYHGMCLDIAVTPLKRAGWQCPDCKVCQNCKHSGEDSKMLVCDTCDKGYHTFCLQ 428
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 429 PIMDSVPTNGWKCKNCR 445
>gi|427795587|gb|JAA63245.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1435
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 53/130 (40%), Gaps = 37/130 (28%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C ++++ D K
Sbjct: 1166 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYEC-------LNKTQDEK 1218
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C ++ CD C K FH CL L + PK
Sbjct: 1219 -----------VCILCGKKG---------KLVRCDACPKVFHHTCLDP----PLSKPPKG 1254
Query: 626 KWFCCDDCNR 635
KW CC C +
Sbjct: 1255 KW-CCSGCAK 1263
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
+C +CG L+ C+ CP FH CLDP L P+ W C C +G
Sbjct: 1219 VCILCGKKGKLVRCDACPKVFHHTCLDPPLSKPPKGKWCCSGCAKG 1264
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 15/115 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 745 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 804
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ + AP A + C +C + ++ ++ C QC++ H C N
Sbjct: 805 CEWQNNYTQCAPCASLSSCPVCYRN-----YREEDLILQCRQCDRWMHAVCQNLN 854
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/147 (22%), Positives = 64/147 (43%), Gaps = 29/147 (19%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 96 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 155
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVIC----RSHD----F 586
P++ +P +GW+C NCR + ++ G + C+IC + D F
Sbjct: 156 PVMKSVPTNGWKCKNCR----------ICIECGTRSSSQWHHNCLICDNCYQQQDNLCPF 205
Query: 587 SAATFD---DRTVIYCDQCEKEFHVGC 610
+ + +++C+ C++ H+ C
Sbjct: 206 CGKCYHPELQKDMLHCNMCKRWVHLEC 232
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 44/200 (22%), Positives = 71/200 (35%), Gaps = 49/200 (24%)
Query: 435 IVKGQRLRFGCKQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAI 494
+V G R + G CC ++ + A A Q + HI+ +
Sbjct: 13 VVSGSTERCAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIF---------LLC 63
Query: 495 SLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPN 549
+ Q D C VC GD + C C +H CLD + P +GW+CP
Sbjct: 64 PEHIDQAPERSKEDANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPE 123
Query: 550 CRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
C+ +C++ S +D ++ CD C+K +H
Sbjct: 124 CK----------------------------VCQNCKQSG---EDSKMLVCDTCDKGYHTF 152
Query: 610 CLRKNGLCDLKEIPKDKWFC 629
CL+ +K +P + W C
Sbjct: 153 CLQPV----MKSVPTNGWKC 168
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T + DMC VCG G+ LL C+ C +H C+ +
Sbjct: 675 SMHNTVVLFSSSDKFTL--NQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 732
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 733 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 761
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 762 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 817
Query: 654 SSLST 658
+SLS+
Sbjct: 818 ASLSS 822
>gi|328773887|gb|EGF83924.1| hypothetical protein BATDEDRAFT_84646 [Batrachochytrium
dendrobatidis JAM81]
Length = 828
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 55/143 (38%), Gaps = 43/143 (30%)
Query: 510 MCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC-----------RQG 553
+C VC GD + LL C+ C +H CL+P L +PE W C C +Q
Sbjct: 623 LCTVCNNAGDDDKLLFCDTCDRGYHMYCLNPPLEVLPEGSWLCSECAVCKSCKKRPEKQE 682
Query: 554 HSSSMSRSVDLKGGLE--------APGAEVGG------CVICRSHDFSAATF-------- 591
+ M R V + L P G C C H F A F
Sbjct: 683 GTEDMWRHVVIPPSLSLQEIQIKPPPATSALGTYLCTYCTDCYDH-FEADRFCPLCIHVY 741
Query: 592 ----DDRTVIYCDQCEKEFHVGC 610
DD ++ CD+C++ HVGC
Sbjct: 742 SEDSDDLAMVCCDECDRWVHVGC 764
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 961 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSPGLR 1020
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C IC R D ++ C QC++ H C N
Sbjct: 1021 CEWQNNYTQCAPCASLSTCPICCCNYREEDL---------ILQCRQCDRWMHTVCQNLNT 1071
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ D F C C
Sbjct: 1072 EEEVEST-ADNGFDCTMC 1088
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 910 DMCVVCGSFGKGAEGRLLSCSQCGQCYHPYCVSIKITKVVLHKGWRC------------- 956
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 957 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 994
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C L+ ++ +N Q P +SLST
Sbjct: 995 TVPKGGWKCKWCVWCRHCGATSPGLRCEWQNNYTQCAPCASLST 1038
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + + + + G C VC G+ +L+C+ C +H CL
Sbjct: 335 TTCGQHYHGMCLDIQVTALKRAGWQCPDCKVCQNCKHSGEDNKMLVCDTCDKGYHTFCLQ 394
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C CR
Sbjct: 395 PVMDSVPTNGWKCKYCR 411
>gi|403175396|ref|XP_003334222.2| hypothetical protein PGTG_15759 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171584|gb|EFP89803.2| hypothetical protein PGTG_15759 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 392
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 13/80 (16%)
Query: 476 QPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHA 532
+P R TS+ T HD+ S+ G D+C +CG D N+LLC+ C +H
Sbjct: 289 RPIRRPSTSHPSTSHDLDTSMDHG--------GDICEICGSDEDDPNILLCDSCDKGYHL 340
Query: 533 ACLDP--LLIPESGWRCPNC 550
CL P L +PE W C C
Sbjct: 341 QCLTPPLLTVPEGNWYCDAC 360
>gi|196015577|ref|XP_002117645.1| hypothetical protein TRIADDRAFT_61645 [Trichoplax adhaerens]
gi|190579814|gb|EDV19903.1| hypothetical protein TRIADDRAFT_61645 [Trichoplax adhaerens]
Length = 533
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
DD+C VC G LL+C+ C +H CLDP L IP+ W CP+C
Sbjct: 394 DDVCCVCNKGGELLICDTCNSVYHLRCLDPPLSSIPDGMWMCPDC 438
>gi|25412286|pir||B84645 hypothetical protein At2g25170 [imported] - Arabidopsis thaliana
Length = 1359
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|301775083|ref|XP_002922956.1| PREDICTED: PHD finger protein 21B-like [Ailuropoda melanoleuca]
Length = 484
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W+CP C+Q
Sbjct: 305 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWQCPKCQQ 351
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 57/159 (35%), Gaps = 37/159 (23%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
Q T +D C VC LL C+GCP +H C+ P L +PE W CP C
Sbjct: 709 AQDSPTTSHEDYCIVCQQSGELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPAC------ 762
Query: 557 SMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGL 616
A C IC R V C QC H C + G
Sbjct: 763 ----------------ARNDPCTIC-GKPLPWKPMSQRRV--CVQCGVPMHPACFKATGA 803
Query: 617 CDLKEIPKDKW-FCCDDC---NRIHA----ALQDFVSNR 647
KE P ++ C DC R H +LQ+ V +R
Sbjct: 804 P--KEFPAEQLQRRCPDCCRFQRPHVLEAHSLQEIVGHR 840
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC--LDPLLIPESGWRCPNCR-QGHSS 556
Q+ T+ + D C +CG NL+ C+GCP A+H+ C + +PE W CP C Q ++
Sbjct: 464 QKHTSDWNSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPECTLQKIAT 523
Query: 557 SMSRSVDLKGGLEAPGAEVGG 577
++R+ + G E G + G
Sbjct: 524 EVTRARKVVRGFECFGIDTYG 544
>gi|405973316|gb|EKC38036.1| Lysine-specific demethylase 5A [Crassostrea gigas]
Length = 1883
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%), Gaps = 5/46 (10%)
Query: 510 MCHVC--GDGEN-LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C GDGE +LLC+GC AFH CL P + +P+ WRCP C
Sbjct: 289 ICHMCNRGDGEEYMLLCDGCDDAFHTYCLIPPMPEVPKGDWRCPKC 334
>gi|18400745|ref|NP_565587.1| CHD3-type chromatin-remodeling factor PICKLE [Arabidopsis thaliana]
gi|75193642|sp|Q9S775.1|PKL_ARATH RecName: Full=CHD3-type chromatin-remodeling factor PICKLE;
AltName: Full=Protein GYMNOS
gi|6318930|gb|AAF07084.1|AF185578_1 GYMNOS/PICKLE [Arabidopsis thaliana]
gi|6478518|gb|AAF13875.1|AF185577_1 chromatin remodeling factor CHD3 [Arabidopsis thaliana]
gi|330252572|gb|AEC07666.1| CHD3-type chromatin-remodeling factor PICKLE [Arabidopsis thaliana]
Length = 1384
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESTNLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|440895709|gb|ELR47837.1| PHD finger protein 21B, partial [Bos grunniens mutus]
Length = 476
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 297 DELCAACKRGTNLQPCGTCPGAYHLSCLDPPLKTAPKGMWVCPKCQQ 343
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 63/135 (46%), Gaps = 18/135 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CL P L +P+ GW+C CR ++S
Sbjct: 974 VCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1033
Query: 562 VDLKGGLE--APGAEVGGC-VICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCD 618
+ + AP A + C V CR++ +D ++ C QC++ H C N +
Sbjct: 1034 CEWQNNYTQCAPCASLSSCPVCCRNYR------EDDLILQCRQCDRWMHAVCQNLNTEEE 1087
Query: 619 LKEIPKDKWFCCDDC 633
++ + D F C C
Sbjct: 1088 VENV-ADIGFDCSMC 1101
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 923 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 969
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 970 -------LE--------CTVCEA---CGKASDPGRLLLCDDCDISYHTYCLAP----PLQ 1007
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1008 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1051
>gi|347838360|emb|CCD52932.1| hypothetical protein [Botryotinia fuckeliana]
Length = 886
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 16/125 (12%)
Query: 510 MCHVCGDGEN-----LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDL 564
+C +C ++ +L C+GCPL H C IPE W C C++ ++ +
Sbjct: 293 VCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARAAE 352
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G+ E+ C +C+ D +I C+ C+ H C ++ + P+
Sbjct: 353 NDGVTDSDDEI-SCAVCQGLDSEKPN----EIILCENCDYAVHQSC------GNIPKKPR 401
Query: 625 DKWFC 629
+W C
Sbjct: 402 GEWLC 406
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 967 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1026
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + AP A + C +C R D ++ C QC++ H C
Sbjct: 1027 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVC 1072
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 31/177 (17%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 320 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 379
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRS----HD----F 586
P++ +P +GW+C NCR + ++ G + C++C S D F
Sbjct: 380 PVMKSVPTNGWKCKNCR----------ICVECGTRSSSQWHHNCLVCDSCYQQQDNLCPF 429
Query: 587 SAATFD---DRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCDDCNRIHAAL 640
+ + +++C+ C++ H+ C + DL K+++ C C + AAL
Sbjct: 430 CGKCYHPELQKDMLHCNMCKRWVHLECDKPTDP-DLDSQFKEEYICM-YCKHLAAAL 484
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 301 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 350
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 351 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 385
Query: 623 PKDKWFC 629
P + W C
Sbjct: 386 PTNGWKC 392
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 916 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 962
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 963 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 1000
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1001 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1044
>gi|330805108|ref|XP_003290529.1| hypothetical protein DICPUDRAFT_81259 [Dictyostelium purpureum]
gi|325079359|gb|EGC32963.1| hypothetical protein DICPUDRAFT_81259 [Dictyostelium purpureum]
Length = 1463
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 5/58 (8%)
Query: 507 SDDMCHVCGDG-----ENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMS 559
SDD+C VC DG ++ C+GC +A H C LLIPE W C C S +S
Sbjct: 448 SDDVCAVCFDGTSDDTNQIVYCDGCDIAVHQDCYGILLIPEGQWFCQKCESPEKSQIS 505
>gi|297821893|ref|XP_002878829.1| PKL/SSL2 [Arabidopsis lyrata subsp. lyrata]
gi|297324668|gb|EFH55088.1| PKL/SSL2 [Arabidopsis lyrata subsp. lyrata]
Length = 1399
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 504 TGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES--GWRCPNC 550
T ++ C CG+ NL+ CN C AFHA CL P L S WRCP C
Sbjct: 45 TDAKENACQACGESANLVSCNTCTYAFHAKCLVPPLKDASVENWRCPEC 93
>gi|209880517|ref|XP_002141698.1| PHD / zinc finger (C3HC4 type) family protein [Cryptosporidium
muris RN66]
gi|209557304|gb|EEA07349.1| PHD / zinc finger (C3HC4 type) family protein [Cryptosporidium
muris RN66]
Length = 855
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 9/82 (10%)
Query: 478 YRHIYTSNGMTLHDIAISLAMGQRRTTGGSD-------DMCHVCGDGENLLLCNGCPLAF 530
Y HI T N + + I + G D ++CH+ E LLLC+GC +
Sbjct: 224 YDHIPTENSIAIITIPVERRTLNNDELDGDDPFADFACEVCHLNDHEEVLLLCDGCDCGY 283
Query: 531 HAACLDPLL--IPESGWRCPNC 550
H CLDP L +P W CP C
Sbjct: 284 HTYCLDPPLDSVPSGEWFCPRC 305
>gi|396467513|ref|XP_003837957.1| hypothetical protein LEMA_P120040.1 [Leptosphaeria maculans JN3]
gi|312214522|emb|CBX94513.1| hypothetical protein LEMA_P120040.1 [Leptosphaeria maculans JN3]
Length = 1476
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 58/141 (41%), Gaps = 22/141 (15%)
Query: 508 DDMCHVCGDG--EN---LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D C +C DG EN ++ C+GC LA H C IPE W C C+
Sbjct: 416 DTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRRCQL---------- 465
Query: 563 DLKGGLEAPGAEVGGCVICRSHD--FSAATFDDRTVIYCDQCEKEFHVG-CLRKNGLCDL 619
G P +E GC+ C + D F T + C E +G + + D+
Sbjct: 466 ---VGRGTPVSEHPGCIFCPNKDGAFKQTTTMKWAHLLCAMWIPEVSLGNATFQEPVQDV 522
Query: 620 KEIPKDKW-FCCDDCNRIHAA 639
+++PK +W CC C + A
Sbjct: 523 EKVPKTRWKLCCYICKQKMGA 543
>gi|340710729|ref|XP_003393938.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase
lid-like [Bombus terrestris]
Length = 1644
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|428173481|gb|EKX42383.1| hypothetical protein GUITHDRAFT_73973, partial [Guillardia theta
CCMP2712]
Length = 94
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 56/116 (48%), Gaps = 32/116 (27%)
Query: 516 DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGA 573
D ENLLLC+GC FH C P L IP+ W C +C + R+++
Sbjct: 5 DEENLLLCDGCNKGFHIFCHQPALEEIPDGEWLCSSC------AFVRNIE---------- 48
Query: 574 EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
C +CR D + +I CD+C+K +H+ CL L+ +P+++WFC
Sbjct: 49 ----CEVCRRRD------GENELILCDRCDKGWHMKCLDPP----LRCVPQEEWFC 90
>gi|351700053|gb|EHB02972.1| Bromodomain adjacent to zinc finger domain protein 1A [Heterocephalus
glaber]
Length = 1857
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSS 556
MC GD EN++LC+GC H C+ P L +PE W CP CR S
Sbjct: 1454 MCRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECRPKQRS 1502
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 28/45 (62%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCR 551
+DD C VCG +LL C+GCP +FH C+ +PE W CP C+
Sbjct: 1052 NDDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPECK 1096
>gi|350412813|ref|XP_003489772.1| PREDICTED: lysine-specific demethylase lid-like isoform 1 [Bombus
impatiens]
Length = 1644
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 330 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 375
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 369 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 419
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C IC+ D ++ CD C +H CL L EIP
Sbjct: 420 AAEDDDEHMEFCRICK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 466
Query: 627 WFC 629
W C
Sbjct: 467 WKC 469
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 429 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 472
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 60/149 (40%), Gaps = 25/149 (16%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 379 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 429
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C IC+ D ++ CD C +H CL L EIP
Sbjct: 430 AAEDDDEHMEFCRICK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 476
Query: 627 WFCCD-DCNRIHAALQDFVSNRAQTIPAS 654
W C C I + ++ R + P +
Sbjct: 477 WKCPRCSCPPIRGKVAKILTWRWKECPET 505
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 371 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------ITG 421
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C IC+ D ++ CD C +H CL L EIP
Sbjct: 422 AAEDDDEHMEFCRICK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 468
Query: 627 WFC 629
W C
Sbjct: 469 WKC 471
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 8/55 (14%)
Query: 504 TGGSDD------MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
TG ++D C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 420 TGAAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 474
>gi|320167629|gb|EFW44528.1| hypothetical protein CAOG_02553 [Capsaspora owczarzaki ATCC 30864]
Length = 1716
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 75/208 (36%), Gaps = 60/208 (28%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
++D C+ C N+L C+ CP ++H CL P + P W+CP C+ S RS
Sbjct: 1434 NEDHCNACKARGNVLCCDYCPRSYHLKCLKPPMSKPPRGDWKCPICKSATRHSQRRS--- 1490
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRT----------------------------- 595
G P + G +S D AA D +
Sbjct: 1491 -EGRARPSTDEDGADDDQSGDSDAAGHSDASESEADEGEGEEADDDDDEAQEGEEEEEED 1549
Query: 596 ---VIYCDQCEKEFHVG-------CL-----RKNGLCDLKEIPKDKWFCCDDCNR----I 636
++ C C + VG C+ G+C K P+D W C DC + +
Sbjct: 1550 EQPIVMCHICNTDVEVGGEGRPLVCVCCAHAAHRGVCAKKNSPRDVWL-CQDCTQPKPVV 1608
Query: 637 HAALQDFVSNRAQTIPASSLSTINRKHI 664
A L+ F PA L T +RK +
Sbjct: 1609 WAKLRGF-----PYWPARVLETKDRKSL 1631
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 499 GQRRT--TGGSDDMCHVCGDGENLLLCNGCPLAFHAAC--LDPLLIPESGWRCPNC 550
G RR T ++D C VC LL C+GCP +HA C LD +P+ W CP C
Sbjct: 416 GARRAGPTDLNNDSCEVCSQTGELLCCDGCPRVYHATCLKLDTASLPQGDWFCPTC 471
>gi|219112141|ref|XP_002177822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410707|gb|EEC50636.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 462
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +CGDG +LL+C+GC +H C+ P L IPE W C +C
Sbjct: 347 DACAICGDGGSLLICDGCEGEYHMDCVQPSLAEIPEGHWECDDC 390
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 973 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGPR 1032
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + AP A + C +C + ++ ++ C QC++ H C
Sbjct: 1033 GEWQNNYTQCAPCASLSACPVCHRN-----YREEDLILQCRQCDRWMHAVC 1078
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 331 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 390
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GWRC NCR
Sbjct: 391 PVMKSVPTNGWRCKNCR 407
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 312 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 361
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 362 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 396
Query: 623 PKDKWFC 629
P + W C
Sbjct: 397 PTNGWRC 403
>gi|350412816|ref|XP_003489773.1| PREDICTED: lysine-specific demethylase lid-like isoform 2 [Bombus
impatiens]
Length = 1618
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 304 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPKC 349
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 420
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C IC+ D ++ CD C +H CL L EIP
Sbjct: 421 AAEDDDEHMEFCRICK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 467
Query: 627 WFC 629
W C
Sbjct: 468 WKC 470
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 430 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 473
>gi|154309635|ref|XP_001554151.1| hypothetical protein BC1G_07288 [Botryotinia fuckeliana B05.10]
Length = 765
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 16/125 (12%)
Query: 510 MCHVCGDGEN-----LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDL 564
+C +C ++ +L C+GCPL H C IPE W C C++ ++ +
Sbjct: 293 VCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARAAE 352
Query: 565 KGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPK 624
G+ E+ C +C+ D +I C+ C+ H C ++ + P+
Sbjct: 353 NDGVTDSDDEI-SCAVCQGLDSEKPN----EIILCENCDYAVHQSC------GNIPKKPR 401
Query: 625 DKWFC 629
+W C
Sbjct: 402 GEWLC 406
>gi|383853505|ref|XP_003702263.1| PREDICTED: lysine-specific demethylase lid-like isoform 1
[Megachile rotundata]
Length = 1642
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 328 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPRC 373
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 51/123 (41%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W CP+C +G + G
Sbjct: 370 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC-EGEG--------IAG 420
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C IC+ D ++ CD C +H CL L EIP
Sbjct: 421 AAEDDDEHMEFCRICK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 467
Query: 627 WFC 629
W C
Sbjct: 468 WKC 470
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C +C DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 430 EFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 473
>gi|14626491|gb|AAK70213.1| MLL3-like protein [Mus musculus]
Length = 420
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 17/116 (14%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 46 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 105
Query: 562 VDLKGGLE--APGAEVGGC-VICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ + AP A + C V CR++ ++ ++ C QC++ H C N
Sbjct: 106 CEWQNNYTQCAPCASLSSCPVCCRNY------REEDLILQCRQCDRWMHAVCQNLN 155
>gi|301625847|ref|XP_002942113.1| PREDICTED: PHD finger protein 21B-like [Xenopus (Silurana)
tropicalis]
Length = 565
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H C+DP L +P+ W CP C+Q
Sbjct: 400 DEYCSACKRGSNLQTCGACPRAYHLDCVDPPLKVLPKGVWLCPKCQQ 446
>gi|195130615|ref|XP_002009747.1| GI15065 [Drosophila mojavensis]
gi|193908197|gb|EDW07064.1| GI15065 [Drosophila mojavensis]
Length = 886
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR---- 551
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPSGQWLCHSCRMAKV 104
Query: 552 -QGHSSSMSRSVD 563
Q SSS + SV+
Sbjct: 105 SQPTSSSKASSVE 117
>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
Length = 1503
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 208 CSSLGDLSKLIMCSSCGDHFHSTCIGLANLPDTRSGWCCARCTK---------------- 251
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D + F + C+QC+K +H CLR + IPK W
Sbjct: 252 ---------CQICRQQDSNDIKF-----VKCEQCQKIYHASCLRPV----ISSIPKYGWK 293
Query: 629 C-----CDDC 633
C C DC
Sbjct: 294 CNRCRVCTDC 303
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 56/130 (43%), Gaps = 15/130 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ-----GHSSSMS 559
+C CG D LLLC+ C +++H C++P L +P+ W+C C + + S
Sbjct: 582 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPQGTWKCSFCTMCQKCGRNPTEKS 641
Query: 560 RSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+VD P A C +C + + +I C++CE+ H C N +
Sbjct: 642 DNVDSNMSECPPCASQTACSVC-----TNPYANGEMIIQCEKCEQWSHFLCDSVNAQLTI 696
Query: 620 KEIPKDKWFC 629
K+ + C
Sbjct: 697 DYYDKNIYKC 706
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+DD C C DG +LL C+ C +FH CL+P L IPE W C +C+
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSCK 111
Score = 40.0 bits (92), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 4/45 (8%)
Query: 585 DFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
DF ++ D ++ CD CEK FH+ CL L+EIP+ W+C
Sbjct: 67 DFCSSCKDGGDLLCCDSCEKSFHLMCLNP----PLEEIPEGDWYC 107
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 16/134 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++
Sbjct: 60 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATCAGLR 119
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDL 619
+ + AP A + C +C F DD ++ C QC++ H C N ++
Sbjct: 120 CEWQNNYTQCAPCASLSACPVC----FRNYREDD-LILQCRQCDRWMHAVCQNLNTEEEV 174
Query: 620 KEIPKDKWFCCDDC 633
+ + D F C C
Sbjct: 175 ESV-ADIGFDCSMC 187
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 68/287 (23%), Positives = 107/287 (37%), Gaps = 65/287 (22%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 9 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 55
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 56 -------LE--------CTVCEA---CGKATDPGRLLLCDDCDISYHTYCLDP----PLQ 93
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLSTIN---RKHIEKGILFD 671
+PK W C C C A L+ ++ +N Q P +SLS R + E ++
Sbjct: 94 TVPKGGWKCKWCVWCRHCGATCAGLRCEWQNNYTQCAPCASLSACPVCFRNYREDDLILQ 153
Query: 672 GTMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDLIPVMVYGRNISGQ 731
D +W A C + + S I +C + C R +P + +
Sbjct: 154 CRQCD-RWM---HAVCQNLNTEEEVESVADIGFDC-----SMC-RPYMPTSHGEKAVP-- 201
Query: 732 EFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCF 778
GG+++ L +K + GL+ + R A++P A R CF
Sbjct: 202 TAGGLHTGPLCLK-LSSGQGLMSPWLRPCAQMPSAAVGRAAAPGACF 247
>gi|156392707|ref|XP_001636189.1| predicted protein [Nematostella vectensis]
gi|156223290|gb|EDO44126.1| predicted protein [Nematostella vectensis]
Length = 1071
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 33/67 (49%), Gaps = 10/67 (14%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSSMSRSVDL 564
D C CG+G +LL C+ CP AFH +C DP L IP+ W C CR D
Sbjct: 49 DNCDSCGEGGDLLCCDQCPCAFHLSCCDPPLEEDDIPDGEWLCIECRNKQKQ------DQ 102
Query: 565 KGGLEAP 571
K G E P
Sbjct: 103 KNGKEVP 109
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 510 MCHVCGDGE---NLLLCNGCPLAFHAACLD-PLLIPESG-WRCPN 549
+C C E L+ C+ CPL FH C++ PL P SG W CPN
Sbjct: 212 LCFTCSRTELFGKLVQCDFCPLGFHMDCINPPLTTPPSGMWMCPN 256
>gi|383853507|ref|XP_003702264.1| PREDICTED: lysine-specific demethylase lid-like isoform 2
[Megachile rotundata]
Length = 1616
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G+N +LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 302 ICHNCGRGDNEESMLLCDGCDDSYHTFCLMPPLTEIPKGDWRCPRC 347
>gi|357139082|ref|XP_003571114.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Brachypodium distachyon]
Length = 924
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 44/89 (49%), Gaps = 14/89 (15%)
Query: 481 IYTSNGMT-------LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAF 530
Y SNG LHDI +S A + D +C VCG D +++LLC+ C +
Sbjct: 26 FYLSNGNAGSEIHDELHDI-LSAATNLPKAPW-EDGVCKVCGIDRDDDSVLLCDKCDSEY 83
Query: 531 HAACLDPLL--IPESGWRCPNCRQGHSSS 557
H CL+P L IP+ W CP+C G S
Sbjct: 84 HTYCLNPPLARIPQGNWYCPSCMSGQKKS 112
Score = 39.7 bits (91), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 12/68 (17%)
Query: 564 LKGGLEAPGA--EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
L P A E G C +C DD +V+ CD+C+ E+H CL L
Sbjct: 45 LSAATNLPKAPWEDGVCKVC------GIDRDDDSVLLCDKCDSEYHTYCLNP----PLAR 94
Query: 622 IPKDKWFC 629
IP+ W+C
Sbjct: 95 IPQGNWYC 102
>gi|50290691|ref|XP_447778.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527089|emb|CAG60725.1| unnamed protein product [Candida glabrata]
Length = 671
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 10/65 (15%)
Query: 496 LAMGQRRTTGGS------DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGW 545
+AM +++++ G+ DD C C + L C+ CP +FH CL+P L +PE W
Sbjct: 255 IAMAKKKSSDGTTQEIENDDYCSACFQTGSFLCCDTCPKSFHFLCLNPPLDPDHLPEGDW 314
Query: 546 RCPNC 550
CP C
Sbjct: 315 SCPQC 319
>gi|8978355|dbj|BAA98208.1| unnamed protein product [Arabidopsis thaliana]
Length = 1515
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCV 579
KG A G + G V
Sbjct: 459 KKGPKIAHGTSLRGAV 474
>gi|255565329|ref|XP_002523656.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
gi|223537108|gb|EEF38742.1| chromodomain helicase DNA binding protein, putative [Ricinus
communis]
Length = 2257
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 2/64 (3%)
Query: 491 DIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP 548
++ L+ +++ G C +C G NLL C+ CP +H CLDP L IP W+CP
Sbjct: 58 ELGSDLSSSKKKGNDGYYYECVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCP 117
Query: 549 NCRQ 552
C Q
Sbjct: 118 KCYQ 121
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNC 550
D C++C DG +LL C+ CP +FH C+ IPE W C C
Sbjct: 122 DHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
>gi|297815628|ref|XP_002875697.1| hypothetical protein ARALYDRAFT_905616 [Arabidopsis lyrata subsp.
lyrata]
gi|297321535|gb|EFH51956.1| hypothetical protein ARALYDRAFT_905616 [Arabidopsis lyrata subsp.
lyrata]
Length = 1570
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 414 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 463
Query: 564 LKGGLEAPGAEVGGCV 579
KG A G + G V
Sbjct: 464 KKGPKIAHGTSLRGAV 479
>gi|348551108|ref|XP_003461372.1| PREDICTED: PHD finger protein 21B [Cavia porcellus]
Length = 513
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G +LL C+ CP A+H CL P L P GW CP C+Q
Sbjct: 334 DEHCAACRRGADLLHCSACPDAYHLGCLHPPLKTAPRGGWLCPKCQQ 380
>gi|334188015|ref|NP_198371.3| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
gi|332006563|gb|AED93946.1| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
Length = 1539
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCV 579
KG A G + G V
Sbjct: 459 KKGPKIAHGTSLRGAV 474
>gi|451852457|gb|EMD65752.1| hypothetical protein COCSADRAFT_140082 [Cochliobolus sativus
ND90Pr]
Length = 1106
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 54/128 (42%), Gaps = 21/128 (16%)
Query: 508 DDMCHVCGDG--EN---LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D C +C DG EN ++ C+GC LA H C IPE W C C+
Sbjct: 395 DTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRRCQ----------- 443
Query: 563 DLKGGLEAPGAEVGGCVICRSHD--FSAATFDDRTVIYCDQCEKEFHVG-CLRKNGLCDL 619
G P +E+ GC+ C + D F T + C E +G + + D+
Sbjct: 444 --LVGRGTPASELPGCIFCPNVDGAFKQTTTMKWAHLLCAMWIPEVSLGNATFQEPVQDV 501
Query: 620 KEIPKDKW 627
+++PK +W
Sbjct: 502 EKVPKTRW 509
>gi|395755470|ref|XP_003779950.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like, partial
[Pongo abelii]
Length = 225
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 54/118 (45%), Gaps = 19/118 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGW------RCPNCRQGHSSSM 558
+C CG D LLLC+ C +++H CLDP L +P+ GW RC CR ++S
Sbjct: 17 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWYSRCVWCRHCGATSA 76
Query: 559 SRSVDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKN 614
+ + AP A + C +C + + + ++ C QC++ H C N
Sbjct: 77 GLRCEWQNNYTQCAPCASLSSCPVCYRN------YREDLILQCRQCDRWMHAVCQNLN 128
>gi|312376807|gb|EFR23792.1| hypothetical protein AND_12238 [Anopheles darlingi]
Length = 3049
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 53/124 (42%), Gaps = 31/124 (25%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C GE+ LLLC+GC +H C P + IP+ W C C + ++ R +
Sbjct: 2427 CQFCQSGESEDKLLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECN--NKATGERKCIVC 2484
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
GGL P +G ++YC+ C + +H C + + + P+
Sbjct: 2485 GGLRPP--PLG------------------KMVYCELCPRAYHQDCY----IPPMLKYPRG 2520
Query: 626 KWFC 629
KW+C
Sbjct: 2521 KWYC 2524
>gi|115444767|ref|NP_001046163.1| Os02g0192400 [Oryza sativa Japonica Group]
gi|46389826|dbj|BAD15389.1| PHD finger-like protein [Oryza sativa Japonica Group]
gi|50726413|dbj|BAD34024.1| PHD finger-like protein [Oryza sativa Japonica Group]
gi|113535694|dbj|BAF08077.1| Os02g0192400 [Oryza sativa Japonica Group]
Length = 929
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPES 543
LHDI A D +C VCG D +++LLC+ C +H CL+P L IPE
Sbjct: 41 LHDILT--AANSLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPPLARIPEG 98
Query: 544 GWRCPNCRQGHSSS 557
W CP+C G + +
Sbjct: 99 NWYCPSCMLGQTKA 112
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 10/56 (17%)
Query: 574 EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
E G C +C DD +V+ CD+C+ E+H CL L IP+ W+C
Sbjct: 57 EDGVCKVC------GIDRDDDSVLLCDKCDSEYHTYCLNP----PLARIPEGNWYC 102
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSR 560
+D+C C G L+ C+ CP AFH C P+L +P+ W C NC++G S+ R
Sbjct: 1422 EDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCKKGTKSAAIR 1476
>gi|451997219|gb|EMD89684.1| hypothetical protein COCHEDRAFT_1177489 [Cochliobolus
heterostrophus C5]
Length = 1105
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 54/128 (42%), Gaps = 21/128 (16%)
Query: 508 DDMCHVCGDG--EN---LLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D C +C DG EN ++ C+GC LA H C IPE W C C+
Sbjct: 394 DTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRRCQ----------- 442
Query: 563 DLKGGLEAPGAEVGGCVICRSHD--FSAATFDDRTVIYCDQCEKEFHVG-CLRKNGLCDL 619
G P +E+ GC+ C + D F T + C E +G + + D+
Sbjct: 443 --LVGRGTPASELPGCIFCPNVDGAFKQTTTMKWAHLLCAMWIPEVSLGNATFQEPVQDV 500
Query: 620 KEIPKDKW 627
+++PK +W
Sbjct: 501 EKVPKTRW 508
>gi|82596119|ref|XP_726130.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481408|gb|EAA17695.1| SNF2 family N-terminal domain, putative [Plasmodium yoelii yoelii]
Length = 1312
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 57/140 (40%), Gaps = 21/140 (15%)
Query: 498 MGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLLIPES-GWRCPNCRQG 553
M + T ++D C +C + + LLLC+GCP ++H +CL PES W CP C+
Sbjct: 59 MNEEDDTPENEDRCKICREKSSNLILLLCDGCPNSYHVSCLGLQTEPESEKWYCPVCKPE 118
Query: 554 HSSSMSRSVDLKG----GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
++ KG + C +C+ ++ CD C FH
Sbjct: 119 EHKNLDIRRMRKGFAIDNMNGDHVNSSSCYVCQRPG---------KLLGCDFCPNSFHPT 169
Query: 610 CLRKNGLCDLKEIPKDKWFC 629
CL D I D+W C
Sbjct: 170 CLPD---LDFDNIS-DQWEC 185
>gi|71028128|ref|XP_763707.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350661|gb|EAN31424.1| hypothetical protein TP04_0072 [Theileria parva]
Length = 964
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 29/48 (60%), Gaps = 5/48 (10%)
Query: 508 DDMCHVCGDGENL---LLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
DD C +CG+ +N LLC+ C FH CL+P L IPES W C +C
Sbjct: 484 DDGCQICGNDDNWNQQLLCDNCDKGFHTYCLNPPLTRIPESNWYCQHC 531
>gi|358336343|dbj|GAA54879.1| tyrosine-protein kinase BAZ1B, partial [Clonorchis sinensis]
Length = 1921
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 29/51 (56%), Gaps = 5/51 (9%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
D C +C D +NLLLC+GC AFH CL P L +P W CP+CR
Sbjct: 1423 DARCRICRHKSDDDNLLLCDGCNRAFHLYCLRPPLRRVPAGDWYCPSCRPA 1473
>gi|356564737|ref|XP_003550605.1| PREDICTED: uncharacterized protein LOC100794210 [Glycine max]
Length = 1608
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGH-SSSMSRSV 562
G+ D C +CG LL C+GCP +H+ C+ + IPE W CP C+ +++R
Sbjct: 381 GNGDECRLCGMDGTLLCCDGCPAVYHSRCIGVMKMHIPEGAWYCPECKIDMIGPTIARGT 440
Query: 563 DLKGGLEAPGAEVGGCVI 580
LKG E G ++ G V
Sbjct: 441 SLKGA-EVFGKDLYGQVF 457
>gi|281344197|gb|EFB19781.1| hypothetical protein PANDA_012011 [Ailuropoda melanoleuca]
Length = 487
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W+CP C+Q
Sbjct: 308 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWQCPKCQQ 354
>gi|260793791|ref|XP_002591894.1| hypothetical protein BRAFLDRAFT_125529 [Branchiostoma floridae]
gi|229277106|gb|EEN47905.1| hypothetical protein BRAFLDRAFT_125529 [Branchiostoma floridae]
Length = 1570
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 57/147 (38%), Gaps = 40/147 (27%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C VC G+ E LLLC+ C A+H CL P L +P WRCP C + SR + +
Sbjct: 1188 CKVCRKKGEEERLLLCDECNQAYHIFCLRPALSCVPPGEWRCPACIPRQARVWSRGRNYR 1247
Query: 566 GGLEA-----------PGAEVGG----------CVICRSHDFSAATFDDRTVIYCDQCEK 604
E P G CV+C +++ ++ C C
Sbjct: 1248 EMDEGDYQSSDDDEDKPRTRCGRRDQGIIHREECVVC---------YEEGELVPCSTCPL 1298
Query: 605 EFHVGCLRKNGLCDLKEIPK-DKWFCC 630
FH C + L+ P+ + W CC
Sbjct: 1299 VFHKEC----HIPALRNFPRGNNWVCC 1321
>gi|296086550|emb|CBI32139.3| unnamed protein product [Vitis vinifera]
Length = 1789
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 7/62 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSV 562
+C VCG D +N+LLC+ C +H CL+P L IPE W CP+C Q S SRS
Sbjct: 939 LCKVCGVDKDDDNVLLCDACDSEYHTYCLNPPLARIPEGNWYCPSCVAAQRLSQGTSRSA 998
Query: 563 DL 564
++
Sbjct: 999 EV 1000
>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C D +NLLLC+GC LAFH CL P L +P W CP C
Sbjct: 1250 DARCRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 508 DDMCHVCGDGE-NLLLCNGCPLAFHAACLDPLL--IPES-GWRCPNCRQGHSSS 557
D C VC D +++LC+ CP FH C DP L IP GW+C CR S
Sbjct: 1391 DTSCLVCSDSTGDIVLCSNCPNIFHLDCHDPPLHHIPRGYGWQCSICRSNKKRS 1444
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 25/58 (43%), Gaps = 10/58 (17%)
Query: 572 GAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
E C ICR DD ++ CD C FH+ CLR LK +P WFC
Sbjct: 1247 SVEDARCRICRRKT------DDDNLLLCDGCNLAFHLYCLRP----PLKRVPTGDWFC 1294
>gi|334188013|ref|NP_851094.2| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
gi|332006562|gb|AED93945.1| DNA binding and zinc-finger domain-containing protein [Arabidopsis
thaliana]
Length = 1706
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCRQGHSSSMSRSVD 563
G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +++
Sbjct: 409 GNSDECRICGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWFCPEC----------TIN 458
Query: 564 LKGGLEAPGAEVGGCV 579
KG A G + G V
Sbjct: 459 KKGPKIAHGTSLRGAV 474
>gi|224071533|ref|XP_002303505.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222840937|gb|EEE78484.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2327
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 40/81 (49%), Gaps = 13/81 (16%)
Query: 472 AARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFH 531
AA+R+P +++ L+ +++ G C +C G NLL C+ CP +H
Sbjct: 47 AAKRRPK-----------SELSSDLSTSKKKGNDGYYYECVICDLGGNLLCCDSCPRVYH 95
Query: 532 AACLDPLL--IPESGWRCPNC 550
CLDP L IP W+CP C
Sbjct: 96 LQCLDPPLKRIPMGKWQCPKC 116
>gi|359473443|ref|XP_002264500.2| PREDICTED: methyl-CpG-binding domain-containing protein 9-like [Vitis
vinifera]
Length = 2060
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 7/62 (11%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSV 562
+C VCG D +N+LLC+ C +H CL+P L IPE W CP+C Q S SRS
Sbjct: 1042 LCKVCGVDKDDDNVLLCDACDSEYHTYCLNPPLARIPEGNWYCPSCVAAQRLSQGTSRSA 1101
Query: 563 DL 564
++
Sbjct: 1102 EV 1103
>gi|349603659|gb|AEP99439.1| Histone-lysine N-methyltransferase MLL3-like protein, partial
[Equus caballus]
Length = 452
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 112 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 171
Query: 562 VDLKGGLE--APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + AP A + C +C + ++ ++ C QC++ H C
Sbjct: 172 CEWQNNYTQCAPCASLSSCPVCYRN-----YREEDLILQCRQCDRWMHAVC 217
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 71/185 (38%), Gaps = 51/185 (27%)
Query: 488 TLHDIAISLAMGQRRTTGGSDDMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLL 539
++H+ + + + T DMC VCG G+ LL C+ C +H C+ +
Sbjct: 42 SMHNTVVLFSSSDKFTL--HQDMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKV 99
Query: 540 IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYC 599
+ GWRC LE C +C + D ++ C
Sbjct: 100 VLSKGWRC--------------------LE--------CTVCEA---CGKATDPGRLLLC 128
Query: 600 DQCEKEFHVGCLRKNGLCDLKEIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPA 653
D C+ +H CL L+ +PK W C C C A L+ ++ +N Q P
Sbjct: 129 DDCDISYHTYCLDP----PLQTVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPC 184
Query: 654 SSLST 658
+SLS+
Sbjct: 185 ASLSS 189
>gi|224125208|ref|XP_002329920.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222871157|gb|EEF08288.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 2332
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
C +C G NLL C+ CP +H CLDP L IP W+CP C Q
Sbjct: 65 CVICDLGGNLLCCDSCPRVYHLQCLDPPLKRIPMGKWQCPKCSQ 108
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CLDP L +P+ GW+C CR ++S
Sbjct: 948 VCEACGKATDPGRLLLCDDCDISYHTYCLDPPLQTVPKGGWKCKWCVWCRHCGATSSGPR 1007
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
+ + AP A + C +C R D ++ C QC++ H C
Sbjct: 1008 CEWQNNYTQCAPCASLSSCPVCCRNYREEDL---------ILQCRQCDRWMHAVC 1053
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 306 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 365
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 366 PVMKSVPTNGWKCKNCR 382
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 287 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 336
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 337 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 371
Query: 623 PKDKWFC 629
P + W C
Sbjct: 372 PTNGWKC 378
>gi|395538828|ref|XP_003771376.1| PREDICTED: lysine-specific demethylase 5A [Sarcophilus harrisii]
Length = 1479
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A S+ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 63 AFSMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 122
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 123 DWRCPKC 129
>gi|307180140|gb|EFN68184.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Camponotus floridanus]
Length = 1960
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+ C VC DG LL C+ C A+H CL+P L IP+ W+CP C
Sbjct: 428 EFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 471
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 50/123 (40%), Gaps = 24/123 (19%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACLDPLLIPESG--WRCPNCRQGHSSSMSRSVDLKG 566
D C VC G ++LC+ CP A+H CL+P L W C +C +G + G
Sbjct: 368 DYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCAHC-EGEG--------IAG 418
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
E + C +C+ D ++ CD C +H CL L EIP
Sbjct: 419 AAEDDDEHMEFCRVCK---------DGGELLCCDSCTSAYHTHCLNP----PLSEIPDGD 465
Query: 627 WFC 629
W C
Sbjct: 466 WKC 468
>gi|195048447|ref|XP_001992528.1| GH24800 [Drosophila grimshawi]
gi|193893369|gb|EDV92235.1| GH24800 [Drosophila grimshawi]
Length = 887
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR---- 551
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPTGQWLCHSCRMAKV 104
Query: 552 -QGHSSSMSRSVD 563
Q S+S S SV+
Sbjct: 105 AQPTSNSKSNSVE 117
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 38 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
Length = 1690
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
D C +C D +NLLLC+GC LAFH CL P L +P W CP C
Sbjct: 1250 DARCRICRRKTDDDNLLLCDGCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 508 DDMCHVCGDGE-NLLLCNGCPLAFHAACLDPLL--IPES-GWRCPNCRQGHSSS 557
D C VC D +++LC+ CP FH C DP L IP GW+C CR S
Sbjct: 1391 DTSCLVCSDSTGDIVLCSNCPNIFHLDCHDPPLHHIPRGYGWQCSICRSNKKRS 1444
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 25/58 (43%), Gaps = 10/58 (17%)
Query: 572 GAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
E C ICR DD ++ CD C FH+ CLR LK +P WFC
Sbjct: 1247 SVEDARCRICRRKT------DDDNLLLCDGCNLAFHLYCLRP----PLKRVPTGDWFC 1294
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 10/126 (7%)
Query: 674 MNDVQWQMLKKAQCFEEKEK-------SLLSSATAIFRECFDPIIAECGR-DLIPVMVYG 725
+N++ W +LK + K S LS A + ECF P+ + D + +++
Sbjct: 27 LNNLTWTLLKSNHSSDHKPDASDIENYSKLSIALHVMHECFQPVEEPRTKGDFLKDVIFR 86
Query: 726 RN--ISGQEFGGMYSVILTVKSVVVSAGLLRIFGREVAELPLVATCREYQGKGCFQALFS 783
+ ++ F G Y+V+L ++ +R++G +VAE+PLV T +Y+ G L +
Sbjct: 87 KRSELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVAEIPLVGTRVQYRRLGMCGILMN 146
Query: 784 CIERLL 789
+E+ L
Sbjct: 147 VLEKNL 152
>gi|297808265|ref|XP_002872016.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317853|gb|EFH48275.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1562
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 499 GQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLD--PLLIPESGWRCPNCR-QGHS 555
G G+ D C +CG LL C+GCPLA+H+ C+ + IP+ W CP C +
Sbjct: 401 GDSSDLDGNSDECRLCGMDGTLLCCDGCPLAYHSRCIGVVKMYIPDGPWYCPECTIKKMG 460
Query: 556 SSMSRSVDLKGGL 568
+++ L+G +
Sbjct: 461 PTVAHKTSLRGAV 473
>gi|296192024|ref|XP_002743885.1| PREDICTED: PHD finger protein 21B [Callithrix jacchus]
Length = 489
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 310 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 356
>gi|402226259|gb|EJU06319.1| hypothetical protein DACRYDRAFT_60996 [Dacryopinax sp. DJM-731 SS1]
Length = 1730
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 39/83 (46%), Gaps = 17/83 (20%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDM----------CHVC---GDGENLLLCNGCPLAFHA 532
GMT D+ A QR S D+ C +C D N+LLC+GC AFH
Sbjct: 411 GMT--DLMPPFAQSQREERAQSPDIRLEPLQPGDACEICRIDNDNTNMLLCDGCDAAFHM 468
Query: 533 ACLDPLL--IPESGWRCPNCRQG 553
CLDP L IP+S W C C G
Sbjct: 469 YCLDPPLTYIPKSQWFCHACLFG 491
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 36/82 (43%), Gaps = 13/82 (15%)
Query: 548 PNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFH 607
P Q ++S D++ PG C ICR + D+ ++ CD C+ FH
Sbjct: 417 PPFAQSQREERAQSPDIRLEPLQPG---DACEICRIDN------DNTNMLLCDGCDAAFH 467
Query: 608 VGCLRKNGLCDLKEIPKDKWFC 629
+ CL L IPK +WFC
Sbjct: 468 MYCLDP----PLTYIPKSQWFC 485
>gi|118091868|ref|XP_426440.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Gallus gallus]
Length = 1571
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1163 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKIIPEGDWFCPECR 1208
>gi|115472215|ref|NP_001059706.1| Os07g0497100 [Oryza sativa Japonica Group]
gi|113611242|dbj|BAF21620.1| Os07g0497100, partial [Oryza sativa Japonica Group]
Length = 306
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 58 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 117
Query: 548 PNCR 551
P CR
Sbjct: 118 PRCR 121
>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
Length = 1486
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 51/130 (39%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 209 CSSLGDLSKLIMCSTCGDHFHSTCVGLANLPDTRSGWNCARCTK---------------- 252
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H CLR + IPK W
Sbjct: 253 ---------CQICRVQDS-----NDLKYVKCEQCQKIYHASCLRPV----ISAIPKYGWK 294
Query: 629 C-----CDDC 633
C C DC
Sbjct: 295 CNRCRVCTDC 304
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC----RQGHSSSMSR 560
+C CG D LLLC+ C +++H C++P L +P W+C C + G + +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPTGNWKCSFCTLCQKCGRNPTEKS 639
Query: 561 SVDLKGGLEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
E P A C +C+S A + +I C+ CE H C
Sbjct: 640 EFGDSNMPECPPCASQSACNVCKS-----AYANGEMIIQCEHCELWSHFLC 685
>gi|222622353|gb|EEE56485.1| hypothetical protein OsJ_05715 [Oryza sativa Japonica Group]
Length = 1949
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 7/74 (9%)
Query: 489 LHDIAISLAMGQRRTTGGSDDMCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPES 543
LHDI A D +C VCG D +++LLC+ C +H CL+P L IPE
Sbjct: 1061 LHDILT--AANSLPKAPWEDGVCKVCGIDRDDDSVLLCDKCDSEYHTYCLNPPLARIPEG 1118
Query: 544 GWRCPNCRQGHSSS 557
W CP+C G + +
Sbjct: 1119 NWYCPSCMLGQTKA 1132
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 10/56 (17%)
Query: 574 EVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
E G C +C DD +V+ CD+C+ E+H CL L IP+ W+C
Sbjct: 1077 EDGVCKVC------GIDRDDDSVLLCDKCDSEYHTYCLNP----PLARIPEGNWYC 1122
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/185 (22%), Positives = 74/185 (40%), Gaps = 40/185 (21%)
Query: 454 DCCNKEISPSQFEAHAGMAARRQPYRHIYTSN--GMTLHDIAISLAMGQRRTTGGSDDMC 511
D N+ P + G+ R+ +TS M L+ + S+A + C
Sbjct: 1934 DQSNQTPVPQEETTPRGLNNWREATARAHTSAQLAMALYMLEASIAWDKSIMKAN----C 1989
Query: 512 HVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKG 566
C G+N LLLC+GC +H C P + IP+ W C C M+++ +
Sbjct: 1990 QFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHEC-------MNKATGER- 2041
Query: 567 GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDK 626
C++C + ++ C+ C + +H C N + + ++P+ K
Sbjct: 2042 ----------NCLVC-------GKRVGKNLVLCELCPRAYHTDC--HNPV--MPKMPRGK 2080
Query: 627 WFCCD 631
W+C +
Sbjct: 2081 WYCSN 2085
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 511 CHVCGD--GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSVDL 564
C VCG G+NL+LC CP A+H C +P++ +P W C NC +Q + SR
Sbjct: 2043 CLVCGKRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHT 2102
Query: 565 KGG 567
KGG
Sbjct: 2103 KGG 2105
>gi|338721435|ref|XP_001488154.3| PREDICTED: PHD finger protein 21B [Equus caballus]
Length = 494
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 305 DEHCAACKRGANLQACGTCPGAYHLSCLDPPLRTAPKGVWVCPKCQQ 351
>gi|410898760|ref|XP_003962865.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Takifugu rubripes]
Length = 1329
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSS 557
C +C GD +N+LLC+GC H CL P L +P+ W CP+CR SS
Sbjct: 1051 CRICRRKGDADNMLLCDGCDRGHHTHCLRPRLKAVPQGDWFCPDCRPKQRSS 1102
>gi|224051421|ref|XP_002200551.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Taeniopygia guttata]
Length = 1559
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1156 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKAIPEGDWFCPECR 1201
>gi|326427650|gb|EGD73220.1| hypothetical protein PTSG_04935 [Salpingoeca sp. ATCC 50818]
Length = 2055
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 47/101 (46%), Gaps = 14/101 (13%)
Query: 461 SPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGS------DDMCHVC 514
+P+ A A A R R T+ G I+L+ G R G+ DDMC C
Sbjct: 276 NPASVSAGASRRASRAISRSASTAPGSGAE---IALSHGGHRPPLGTEDHSLGDDMCEEC 332
Query: 515 GDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
G E LLLC+ C +H CL+P L +P WRCP+C
Sbjct: 333 KHGDYAELLLLCDKCDKGYHTFCLNPPLSKVPSGDWRCPSC 373
>gi|158293040|ref|XP_314337.4| AGAP004854-PA [Anopheles gambiae str. PEST]
gi|157016915|gb|EAA09709.5| AGAP004854-PA [Anopheles gambiae str. PEST]
Length = 1494
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 9/56 (16%)
Query: 504 TGGSDDM----CHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
T G D M CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 346 THGYDPMAKYICHMCNRGDVEESMLLCDGCDASYHTFCLMPPLQDIPKGDWRCPKC 401
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 35 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|194687197|ref|XP_001787707.1| PREDICTED: PHD finger protein 21B-like, partial [Bos taurus]
Length = 211
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D++C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 32 DELCAACKRGTNLQPCGTCPRAYHLSCLDPPLKTAPKGVWVCPKCQQ 78
>gi|225431531|ref|XP_002281774.1| PREDICTED: histone-lysine N-methyltransferase ATXR5 [Vitis
vinifera]
gi|296088576|emb|CBI37567.3| unnamed protein product [Vitis vinifera]
Length = 374
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 39/68 (57%), Gaps = 8/68 (11%)
Query: 507 SDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---RQGHSSSM 558
SD C CG G++ LLLC+ C FH +CL P++ IP W CP+C R+G S S
Sbjct: 58 SDVRCVQCGSGDHDDELLLCDKCDRGFHMSCLRPIVVRIPIGTWLCPSCSGQRRGRSLSQ 117
Query: 559 SRSVDLKG 566
++ +D G
Sbjct: 118 TKIIDFFG 125
>gi|428169481|gb|EKX38414.1| hypothetical protein GUITHDRAFT_144200 [Guillardia theta CCMP2712]
Length = 994
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 4/47 (8%)
Query: 509 DMCHVCGDGENLLLCNGCPLAFHAACL---DPLLIPESGWRCPNCRQ 552
D C CGD +LL C+GCP A+HAAC+ DP +P+ W C C++
Sbjct: 435 DYCDDCGDAGDLLCCDGCPAAYHAACIGVADPSKLPDP-WYCNACKR 480
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 43/82 (52%), Gaps = 8/82 (9%)
Query: 503 TTGGSDDMCHVCGDGENLLLCNGCPLAFHAACL----DPLLIPESGWRCPNCRQG-HSSS 557
TT + D C +C NL+ C+GCP AFH+ C+ D L PE W CP C G H +S
Sbjct: 791 TTDLNSDECCLCKMDGNLICCDGCPAAFHSRCVGIASDSL--PEGDWYCPECAIGTHRAS 848
Query: 558 MSRSVDLKGGLEAPGAEVGGCV 579
M L+G + G + GC+
Sbjct: 849 MKSRRSLRGA-DLLGMDPHGCL 869
>gi|312381860|gb|EFR27503.1| hypothetical protein AND_05764 [Anopheles darlingi]
Length = 1549
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 413 ICHMCNRGDVEESMLLCDGCDASYHTFCLMPPLHDIPKGDWRCPKC 458
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 61/144 (42%), Gaps = 24/144 (16%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CL P L +P+ GW+C CR ++S
Sbjct: 856 VCEACGKATDPGRLLLCDDCDISYHTYCLXPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 915
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 916 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 966
Query: 616 LCDLKEIPKDKWFCCDDCNRIHAA 639
+++ + D F C C A
Sbjct: 967 EEEVENV-ADIGFDCSMCRPYMPA 989
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 276 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 335
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 336 PVMKSVPTNGWKCKNCR 352
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 257 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 306
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 307 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 341
Query: 623 PKDKWFC 629
P + W C
Sbjct: 342 PTNGWKC 348
>gi|326920735|ref|XP_003206624.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Meleagris gallopavo]
Length = 1567
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C VC GD E+++LC+GC +H C+ P L IPE W CP CR
Sbjct: 1159 CKVCRKKGDAESMVLCDGCDRGYHTYCIRPKLKIIPEGDWFCPECR 1204
>gi|157128338|ref|XP_001661408.1| hypothetical protein AaeL_AAEL011092 [Aedes aegypti]
gi|108872617|gb|EAT36842.1| AAEL011092-PA [Aedes aegypti]
Length = 1354
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 335 ICHMCNRGDVEESMLLCDGCDASYHTFCLLPPLQEIPKGDWRCPKC 380
>gi|328713195|ref|XP_001950157.2| PREDICTED: hypothetical protein LOC100158992 [Acyrthosiphon pisum]
Length = 1118
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 508 DDMCHVCGDGEN---LLLCNGCPLAFHAACLDP--LLIPESGWRCPNCR 551
DDMC C + +LLC+ C +HA+CL P ++IP+ W CP CR
Sbjct: 890 DDMCQKCNKSDQPEWILLCDTCNQGWHASCLRPPLMVIPDGDWYCPPCR 938
>gi|224092566|ref|XP_002309665.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222855641|gb|EEE93188.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 1340
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAAC-LDPLLIP-ESGWRCPNC 550
+D C CG+ ENLL C C A+H+ C L PL P S WRCP C
Sbjct: 48 AKEDSCQACGESENLLNCETCTYAYHSKCLLPPLKAPFPSNWRCPEC 94
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 35 EEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
Length = 1486
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 50/130 (38%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 208 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 251
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H C R + IPK W
Sbjct: 252 ---------CQICRQQDS-----NDLKYVKCEQCQKIYHASCFRPV----ISAIPKYGWK 293
Query: 629 C-----CDDC 633
C C DC
Sbjct: 294 CNRCRVCTDC 303
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H C++P L +P W+C C ++ +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKS 639
Query: 565 KGG----LEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
G E P C +CR + +S +I C+ CE H C
Sbjct: 640 DYGDSNMPECPSCTSQSSCSVCR-NPYSTGEM----IIQCETCELWSHFLC 685
>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
Length = 1486
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 50/130 (38%), Gaps = 41/130 (31%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPE--SGWRCPNCRQGHSSSMSRSVDLKGGL 568
C GD L++C+ C FH+ C+ +P+ SGW C C +
Sbjct: 208 CSSLGDLSKLIMCSTCGDHFHSTCIGLANLPDTRSGWNCARCTK---------------- 251
Query: 569 EAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWF 628
C ICR D +D + C+QC+K +H C R + IPK W
Sbjct: 252 ---------CQICRQQDS-----NDLKYVKCEQCQKIYHASCFRPV----ISAIPKYGWK 293
Query: 629 C-----CDDC 633
C C DC
Sbjct: 294 CNRCRVCTDC 303
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG D LLLC+ C +++H C++P L +P W+C C ++ +
Sbjct: 580 VCEGCGKKNDEARLLLCDECDISYHIYCVNPPLETVPSGNWKCSFCTLCQKCGLNPTEKS 639
Query: 565 KGG----LEAPG-AEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
G E P C +CR + +S +I C+ CE H C
Sbjct: 640 DYGDSNMPECPSCTSQSSCSVCR-NPYSTGEM----IIQCETCELWSHFLC 685
>gi|170046220|ref|XP_001850672.1| jumonji/ARID domain-containing protein 1A [Culex quinquefasciatus]
gi|167869058|gb|EDS32441.1| jumonji/ARID domain-containing protein 1A [Culex quinquefasciatus]
Length = 1443
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH+C G E++LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 232 ICHMCNRGDVEESMLLCDGCDASYHTFCLLPPLQEIPKGDWRCPKC 277
>gi|431899989|gb|ELK07924.1| PHD finger protein 21B [Pteropus alecto]
Length = 454
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 234 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLRTAPKGVWVCPRCQQ 280
>gi|224143281|ref|XP_002324903.1| chromatin remodeling complex subunit [Populus trichocarpa]
gi|222866337|gb|EEF03468.1| chromatin remodeling complex subunit [Populus trichocarpa]
Length = 1334
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAAC-LDPLLIP-ESGWRCPNC 550
+D C CG+ ENLL C C A+H C L PL P S WRCP C
Sbjct: 49 AKEDSCQACGESENLLNCETCTYAYHPKCLLPPLKAPFPSNWRCPEC 95
>gi|427797709|gb|JAA64306.1| Putative lysine-specific demethylase lid, partial [Rhipicephalus
pulchellus]
Length = 1499
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G E++LLC+GC ++H CL P L IP WRCP C
Sbjct: 33 VCHTCGRGDDEESMLLCDGCDDSYHTFCLLPPLPEIPRGDWRCPRC 78
>gi|395504252|ref|XP_003756470.1| PREDICTED: bromodomain adjacent to zinc finger domain protein
1A-like, partial [Sarcophilus harrisii]
Length = 1378
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
MC GD E+++LC+GC H C+ P L IPE W CP CR
Sbjct: 974 MCRKKGDAESMVLCDGCDRGHHTYCVRPKLKTIPEGDWFCPECR 1017
>gi|242013407|ref|XP_002427399.1| FYVE-containing protein, putative [Pediculus humanus corporis]
gi|212511773|gb|EEB14661.1| FYVE-containing protein, putative [Pediculus humanus corporis]
Length = 1102
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 31/56 (55%), Gaps = 4/56 (7%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCR 551
+R G + D C CG+G NLL C+ CP +FH C DP L IP W C +C+
Sbjct: 46 KRPGKGHNHDSCDSCGEGGNLLCCDKCPASFHLQCHDPPLEEDDIPIGQWLCHSCK 101
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 43/111 (38%), Gaps = 27/111 (24%)
Query: 446 KQGNGIVCDCCNKEISPSQFEAHAGMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTG 505
K GNG+ P + + +A+++PY NGM + +
Sbjct: 260 KAGNGV---------GPGKITSRRNSSAKKKPYE---LDNGMVPLPVKVCYE-------- 299
Query: 506 GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGH 554
CH L+ C+ C L FH CLDP L +P W CPN Q +
Sbjct: 300 -----CHRSCRVAPLVSCDYCSLLFHLDCLDPPLASMPTGKWMCPNHPQHY 345
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CL P L +P+ GW+C CR ++S
Sbjct: 1021 VCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 1080
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 1081 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 1131
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ I D F C C
Sbjct: 1132 EEEVENI-ADIGFDCSMC 1148
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 375 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 434
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 435 PVMKSVPTNGWKCKNCR 451
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 356 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 405
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 406 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 440
Query: 623 PKDKWFC 629
P + W C
Sbjct: 441 PTNGWKC 447
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 970 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 1016
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 1017 -------LE--------CTVCEA---CGKASDPGRLLLCDDCDISYHTYCLAP----PLQ 1054
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 1055 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 1098
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 24/138 (17%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCP---NCRQGHSSSMSRS 561
+C CG D LLLC+ C +++H CL P L +P+ GW+C CR ++S
Sbjct: 877 VCEACGKASDPGRLLLCDDCDISYHTYCLAPPLQTVPKGGWKCKWCVWCRHCGATSAGLR 936
Query: 562 VDLKGGLE--APGAEVGGCVIC----RSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNG 615
+ + AP A + C +C R D ++ C QC++ H C N
Sbjct: 937 CEWQNNYTQCAPCASLSSCPVCYRNYREEDL---------ILQCRQCDRWMHAVCQNLNT 987
Query: 616 LCDLKEIPKDKWFCCDDC 633
+++ I D F C C
Sbjct: 988 EEEVENI-ADIGFDCSMC 1004
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 8/77 (10%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDMCHVC------GDGENLLLCNGCPLAFHAACLD 536
T+ G H + + +A+ + G C VC G+ +L+C+ C +H CL
Sbjct: 231 TTCGQHYHGMCLDIAVTPLKRAGWQCPECKVCQNCKQSGEDSKMLVCDTCDKGYHTFCLQ 290
Query: 537 PLL--IPESGWRCPNCR 551
P++ +P +GW+C NCR
Sbjct: 291 PVMKSVPTNGWKCKNCR 307
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 49/127 (38%), Gaps = 40/127 (31%)
Query: 508 DDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQGHSSSMSRSV 562
D C VC GD + C C +H CLD + P +GW+CP C+
Sbjct: 212 DANCAVCDSPGDLLDQFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPECKV---------- 261
Query: 563 DLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEI 622
C C+ + +D ++ CD C+K +H CL+ +K +
Sbjct: 262 ---------------CQNCKQ------SGEDSKMLVCDTCDKGYHTFCLQPV----MKSV 296
Query: 623 PKDKWFC 629
P + W C
Sbjct: 297 PTNGWKC 303
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 509 DMCHVCGD-GEN----LLLCNGCPLAFHAACLD---PLLIPESGWRCPNCRQGHSSSMSR 560
DMC VCG G+ LL C+ C +H C+ ++ GWRC
Sbjct: 826 DMCVVCGSFGQGAEGRLLACSQCGQCYHPYCVSIKITKVVLSKGWRC------------- 872
Query: 561 SVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLK 620
LE C +C + D ++ CD C+ +H CL L+
Sbjct: 873 -------LE--------CTVCEA---CGKASDPGRLLLCDDCDISYHTYCLAP----PLQ 910
Query: 621 EIPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST 658
+PK W C C C A L+ ++ +N Q P +SLS+
Sbjct: 911 TVPKGGWKCKWCVWCRHCGATSAGLRCEWQNNYTQCAPCASLSS 954
>gi|218199663|gb|EEC82090.1| hypothetical protein OsI_26092 [Oryza sativa Indica Group]
Length = 2275
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 58 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 117
Query: 548 PNCR 551
P CR
Sbjct: 118 PRCR 121
>gi|157109809|ref|XP_001650834.1| set domain protein [Aedes aegypti]
gi|108878936|gb|EAT43161.1| AAEL005378-PA [Aedes aegypti]
Length = 1458
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 60/146 (41%), Gaps = 44/146 (30%)
Query: 498 MGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLLIP--ESGWRCPNCRQ 552
+GQ D C C GD NL++C+ C +H C+ +P +GW+C +C++
Sbjct: 404 LGQVPLVCSDDINCRQCSGLGDVGNLMMCSICGDHYHGKCVGLAQLPGVRAGWQCSSCKK 463
Query: 553 GHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLR 612
C ICR D S + RTV C+QC+K +H CLR
Sbjct: 464 -------------------------CQICRVPDSS----EGRTV-GCEQCDKIYHASCLR 493
Query: 613 KNGLCDLKEIPKDKWFC-----CDDC 633
+ IPK W C C DC
Sbjct: 494 PV----MTSIPKYGWKCKCCRVCSDC 515
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 51/115 (44%), Gaps = 23/115 (20%)
Query: 510 MCHVCG---DGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC---------RQGHS 555
+C CG D L+LC+ C +++H C+DP L +P+ W+C C GH+
Sbjct: 806 ICEGCGQRNDEGRLILCDDCDISYHIYCMDPPLEHVPQGNWKCKWCAICLKCGSSNPGHN 865
Query: 556 SSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC 610
S+ + G P A C +C + + +I C+ CE+ H GC
Sbjct: 866 SNWLNNYSECG----PCASQVNCPVC-----AEGYVEGELIIQCNTCERWLHCGC 911
>gi|222637088|gb|EEE67220.1| hypothetical protein OsJ_24338 [Oryza sativa Japonica Group]
Length = 2258
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 32/64 (50%), Gaps = 2/64 (3%)
Query: 490 HDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRC 547
HD IS + + R G C C G NLL C+ CP +H CL+P L P W+C
Sbjct: 57 HDTNISPSTKKIRGHDGYFYECVECDLGGNLLCCDSCPRTYHLECLNPPLKRAPPGNWQC 116
Query: 548 PNCR 551
P CR
Sbjct: 117 PRCR 120
>gi|384254264|gb|EIE27738.1| hypothetical protein COCSUDRAFT_39323 [Coccomyxa subellipsoidea
C-169]
Length = 1967
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 5/45 (11%)
Query: 511 CHVCGD---GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C CG GE +LLC+GC +H CLDP L IPE W CP+C
Sbjct: 796 CEECGKNDRGEEMLLCDGCDHGYHTDCLDPPLKEIPEGDWFCPSC 840
>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
Length = 265
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 54/132 (40%), Gaps = 42/132 (31%)
Query: 505 GGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHSSS 557
GG++ C++CG+ + +L C C +H CLDP + + GW+CP+C+
Sbjct: 169 GGAEAQCYLCGEAKEIAEMLFCTSCGRHYHGRCLDPAVEITSLVRMGWQCPDCK------ 222
Query: 558 MSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLC 617
+ G PG DD ++ CD C++ +H CL
Sbjct: 223 ------VCQGCRQPG-------------------DDNKMLVCDVCDRGYHTFCLDPP--- 254
Query: 618 DLKEIPKDKWFC 629
+ IPK W C
Sbjct: 255 -MTTIPKTGWKC 265
>gi|332022570|gb|EGI62872.1| Bromodomain adjacent to zinc finger domain protein 2B [Acromyrmex
echinatior]
Length = 2202
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/157 (22%), Positives = 65/157 (41%), Gaps = 37/157 (23%)
Query: 483 TSNGMTLHDIAISLAMGQRRTTGGSDDM---CHVCGDGEN---LLLCNGCPLAFHAACLD 536
T+ T +A++L M + M C C G+N LLLC+GC +H C
Sbjct: 1888 TARAHTSAQLAMALYMLEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR 1947
Query: 537 PLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAATFDDR 594
P + IP+ W C C M+++ + C++C +
Sbjct: 1948 PKMENIPDGDWYCHEC-------MNKATGER-----------NCLVC-------GKRAGK 1982
Query: 595 TVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFCCD 631
++ C+ C + +H C N + + ++P+ KW+C +
Sbjct: 1983 NLVLCELCPRAYHTDC--HNPV--MPKMPRGKWYCSN 2015
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 511 CHVCGD--GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSVDL 564
C VCG G+NL+LC CP A+H C +P++ +P W C NC +Q + SR
Sbjct: 1973 CLVCGKRAGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHT 2032
Query: 565 KGG 567
KGG
Sbjct: 2033 KGG 2035
>gi|71014560|ref|XP_758729.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
gi|46098519|gb|EAK83752.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
Length = 2289
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--I 540
G++ H A S Q + MC +C DG N+LLC+ C +H CL P L I
Sbjct: 517 GVSPHLEADSYLRAQAGNQAQEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPALTSI 576
Query: 541 PESGWRCPNCRQG 553
P+S W CP C G
Sbjct: 577 PKSQWFCPPCLVG 589
>gi|9937325|gb|AAG02418.1|AF286030_1 regulator Ustilago maydis 1 protein [Ustilago maydis]
Length = 2289
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 5/73 (6%)
Query: 486 GMTLHDIAISLAMGQRRTTGGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--I 540
G++ H A S Q + MC +C DG N+LLC+ C +H CL P L I
Sbjct: 517 GVSPHLEADSYLRAQAGNQAQEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPALTSI 576
Query: 541 PESGWRCPNCRQG 553
P+S W CP C G
Sbjct: 577 PKSQWFCPPCLVG 589
>gi|307208076|gb|EFN85607.1| PHD finger protein 12 [Harpegnathos saltator]
Length = 633
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAAC----LDPLLIPESGWRCPNCR 551
+RR G + D C C DG L+ C+ CP ++H C +DP+ IP W C CR
Sbjct: 48 KRRGRGHNHDFCDACKDGGELICCDRCPASYHLQCHYPAVDPMDIPNGEWLCYTCR 103
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
+C CG L+ C+ CPL FH CLDP L P W CPN
Sbjct: 183 LCFECGRSCRKAPLIACDYCPLYFHQDCLDPPLTTFPIGRWMCPN 227
>gi|358374395|dbj|GAA90987.1| PHD finger domain protein [Aspergillus kawachii IFO 4308]
Length = 846
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNC 550
++D CH C LL C+GCP +FH +CL+P L PE W CP C
Sbjct: 459 NNDFCHNCNGSGQLLCCDGCPNSFHFSCLNPPLDPANPPEGDWFCPKC 506
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 753 LRIF-GREVAELPLVATCREYQGKGCFQALFSCIERL------LCSLNVENLVLPAAEKA 805
L++F G +AE+ + T Y+ +G + LFS +E + L +L VE LV+PA
Sbjct: 247 LQLFHGNRLAEMQFIGTRHVYRHQGMCRRLFSVVESMSFDVKTLQNLKVELLVIPATADL 306
Query: 806 ESIWTKKFGFRKMSRERLLKYQRDFQLTIFKGTSMLEKKV 845
+W KFGF K + L K R L F G +L+K++
Sbjct: 307 SHVWISKFGF-KYVEDSLKKELRSMNLLAFPGIDVLQKEL 345
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|145259007|ref|XP_001402242.1| PHD finger domain protein [Aspergillus niger CBS 513.88]
gi|134074859|emb|CAK38971.1| unnamed protein product [Aspergillus niger]
gi|350631896|gb|EHA20265.1| hypothetical protein ASPNIDRAFT_51426 [Aspergillus niger ATCC 1015]
Length = 848
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNC 550
++D CH C LL C+GCP +FH +CL+P L PE W CP C
Sbjct: 461 NNDFCHNCNGSGQLLCCDGCPNSFHFSCLNPPLDPANPPEGDWFCPKC 508
>gi|311254947|ref|XP_003126022.1| PREDICTED: PHD finger protein 21B [Sus scrofa]
Length = 495
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 316 DEHCATCKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPKCQQ 362
>gi|357476459|ref|XP_003608515.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
gi|355509570|gb|AES90712.1| Chromodomain-helicase-DNA-binding protein, partial [Medicago
truncatula]
Length = 1406
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
C +C G NLL C+ CP +H CLDP L IP W+CP+C
Sbjct: 73 CVICDLGGNLLCCDSCPRTYHLQCLDPPLKRIPMGKWQCPSC 114
>gi|302803789|ref|XP_002983647.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
gi|300148484|gb|EFJ15143.1| hypothetical protein SELMODRAFT_155996 [Selaginella moellendorffii]
Length = 1296
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
+D C +C G +L C+ C +H CLDP + +P+ WRCP C +
Sbjct: 47 EDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKCEE 93
>gi|302817780|ref|XP_002990565.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
gi|300141733|gb|EFJ08442.1| hypothetical protein SELMODRAFT_185367 [Selaginella moellendorffii]
Length = 1296
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
+D C +C G +L C+ C +H CLDP + +P+ WRCP C +
Sbjct: 47 EDFCTICKSGGKVLCCDACTAVYHLQCLDPPMKSVPKGSWRCPKCEE 93
>gi|355785070|gb|EHH65921.1| hypothetical protein EGM_02787, partial [Macaca fascicularis]
Length = 493
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 314 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 360
>gi|195402015|ref|XP_002059606.1| GJ14859 [Drosophila virilis]
gi|194147313|gb|EDW63028.1| GJ14859 [Drosophila virilis]
Length = 884
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 37/82 (45%), Gaps = 4/82 (4%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
+R G + D+C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSCRMAKV 104
Query: 556 SSMSRSVDLKGGLEAPGAEVGG 577
S + S P A G
Sbjct: 105 SQPACSSKASSVERVPSAGSGS 126
>gi|68069317|ref|XP_676569.1| iswi protein [Plasmodium berghei strain ANKA]
gi|56496328|emb|CAI04933.1| iswi protein homologue, putative [Plasmodium berghei]
Length = 1164
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 57/140 (40%), Gaps = 21/140 (15%)
Query: 498 MGQRRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHAACLDPLLIPES-GWRCPNCRQG 553
M + T ++D C +C + + LLLC+GCP ++H +CL PES W CP C+
Sbjct: 59 MNEEDDTPENEDRCKICREKASNLILLLCDGCPNSYHVSCLGLQTEPESEKWYCPVCKPE 118
Query: 554 HSSSMSRSVDLKG----GLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVG 609
++ KG + C +C+ ++ CD C FH
Sbjct: 119 EHKNLDIRRMRKGFAIDNMNGDHVNSSSCYVCQRPG---------KLLGCDFCPNSFHPI 169
Query: 610 CLRKNGLCDLKEIPKDKWFC 629
CL D I D+W C
Sbjct: 170 CLPD---LDFDNIS-DQWEC 185
>gi|440799762|gb|ELR20806.1| PHD-finger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 482
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 63/159 (39%), Gaps = 41/159 (25%)
Query: 479 RHIYTSNGMTLHDIAISLAMGQ---RRTTGGSDDMCHVCGDGEN---LLLCNGCPLAFHA 532
R S +L +A+ LA+ + + + C +C +LLC+GC FH
Sbjct: 206 RFCRDSVATSLSQVALVLAVFEGAIQWQQAKQSEKCQICRRSTQPGCMLLCDGCDRGFHT 265
Query: 533 ACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFSAAT 590
CL+P L +P W C +C S+ E GG ++C
Sbjct: 266 FCLNPRLKSVPSGEWYCKSCLANSKSACE------------VCEGGGRLLC--------- 304
Query: 591 FDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
C+ C + +H+ CL LK++PK+KW C
Sbjct: 305 --------CEVCPRVYHLKCLDP----PLKQVPKEKWTC 331
Score = 46.6 bits (109), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 505 GGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
S C VC G LL C CP +H CLDP L +P+ W CP
Sbjct: 287 ANSKSACEVCEGGGRLLCCEVCPRVYHLKCLDPPLKQVPKEKWTCPQ 333
>gi|395838235|ref|XP_003792024.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Otolemur garnettii]
Length = 1525
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1120 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1165
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 36 EEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|440909285|gb|ELR59209.1| Bromodomain adjacent to zinc finger domain protein 1A, partial [Bos
grunniens mutus]
Length = 1532
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1126 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1171
>gi|300793885|ref|NP_001179869.1| bromodomain adjacent to zinc finger domain protein 1A [Bos taurus]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1153 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|403283154|ref|XP_003932992.1| PREDICTED: PHD finger protein 21B [Saimiri boliviensis boliviensis]
Length = 678
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 499 DEHCAACKRGANLQPCGACPGAYHLSCLDPPLKTAPKGVWVCPRCQQ 545
>gi|241841259|ref|XP_002415325.1| Jumonji/ARID domain-containing protein, putative [Ixodes
scapularis]
gi|215509537|gb|EEC18990.1| Jumonji/ARID domain-containing protein, putative [Ixodes
scapularis]
Length = 1356
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH CG G E++LLC+GC ++H CL P L IP WRCP C
Sbjct: 57 VCHTCGRGDDEESMLLCDGCDDSYHTFCLLPPLQEIPRGDWRCPRC 102
>gi|73962719|ref|XP_537409.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Canis lupus familiaris]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|413935348|gb|AFW69899.1| hypothetical protein ZEAMMB73_132865 [Zea mays]
Length = 387
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 499 GQRRTTGGSDDMCHVCGDGE---NLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G D C CG G+ L+LC+GC FH CL P+L +P W CP+CR
Sbjct: 20 GPEAAAADDDVRCEACGSGDAAAELMLCDGCDCGFHIFCLRPILPRVPAGDWYCPSCRAP 79
Query: 554 HSS 556
SS
Sbjct: 80 ASS 82
>gi|395838233|ref|XP_003792023.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Otolemur garnettii]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1152 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1197
>gi|344298527|ref|XP_003420943.1| PREDICTED: PHD finger protein 21B, partial [Loxodonta africana]
Length = 405
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 298 DEHCVACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPQCQQ 344
>gi|330925213|ref|XP_003300958.1| hypothetical protein PTT_12344 [Pyrenophora teres f. teres 0-1]
gi|311324662|gb|EFQ90949.1| hypothetical protein PTT_12344 [Pyrenophora teres f. teres 0-1]
Length = 1124
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 53/128 (41%), Gaps = 21/128 (16%)
Query: 508 DDMCHVCGDGE-----NLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSV 562
D C +C DG+ ++ C+GC LA H C IPE W C C+
Sbjct: 414 DTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRRCQL---------- 463
Query: 563 DLKGGLEAPGAEVGGCVICRSHD--FSAATFDDRTVIYCDQCEKEFHVG-CLRKNGLCDL 619
G P +E+ GC+ C + D F T + C E +G + + D+
Sbjct: 464 ---VGRGTPVSELPGCIFCPNIDGAFKQTTAMKWAHLLCAMWIPEVSLGNATFQEPVQDV 520
Query: 620 KEIPKDKW 627
+++PK +W
Sbjct: 521 EKVPKTRW 528
>gi|444722488|gb|ELW63180.1| Bromodomain adjacent to zinc finger domain protein 1A [Tupaia
chinensis]
Length = 1836
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1461 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1506
>gi|291392893|ref|XP_002712830.1| PREDICTED: retinoblastoma binding protein 2 isoform 2 [Oryctolagus
cuniculus]
Length = 1648
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
++ C VC DG +LL C+ CP +H CL P L IP+ W CP C
Sbjct: 36 EEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|62088090|dbj|BAD92492.1| bromodomain adjacent to zinc finger domain, 1A isoform b variant
[Homo sapiens]
Length = 1188
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 759 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 802
>gi|403263951|ref|XP_003924260.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Saimiri boliviensis boliviensis]
Length = 1240
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 835 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 880
>gi|281342876|gb|EFB18460.1| hypothetical protein PANDA_011886 [Ailuropoda melanoleuca]
Length = 1527
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1121 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1166
>gi|410965818|ref|XP_003989437.1| PREDICTED: PHD finger protein 21B [Felis catus]
Length = 545
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H +CLDP L P+ W CP C+Q
Sbjct: 366 DEHCAACKRGANLQPCGTCPGAYHLSCLDPPLKTAPKGVWVCPKCQQ 412
>gi|410962140|ref|XP_003987633.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Felis catus]
Length = 1416
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1012 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1055
>gi|119609377|gb|EAW88971.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_b
[Homo sapiens]
gi|162319392|gb|AAI56462.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
gi|225000544|gb|AAI72533.1| Jumonji, AT rich interactive domain 1A [synthetic construct]
Length = 1641
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|380798887|gb|AFE71319.1| lysine-specific demethylase 5A, partial [Macaca mulatta]
Length = 1653
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 237 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 296
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 297 DWRCPKC 303
>gi|346421429|ref|NP_001231088.1| bromodomain adjacent to zinc finger domain protein 1A [Sus scrofa]
Length = 1557
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|326912468|ref|XP_003202572.1| PREDICTED: PHD finger protein 21B-like, partial [Meleagris
gallopavo]
Length = 445
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQ 552
D+ C C G NL C CP A+H CLDP L P+ W CP C+Q
Sbjct: 264 DEYCTACKRGVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQQ 310
>gi|29387265|gb|AAH48307.1| JARID1A protein [Homo sapiens]
gi|31753201|gb|AAH53893.1| JARID1A protein [Homo sapiens]
gi|83406060|gb|AAI10917.1| JARID1A protein [Homo sapiens]
Length = 1102
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|397501128|ref|XP_003846181.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A [Pan paniscus]
Length = 1240
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 835 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 880
>gi|256081465|ref|XP_002576990.1| myst-related protein [Schistosoma mansoni]
gi|353229452|emb|CCD75623.1| myst-related protein [Schistosoma mansoni]
Length = 1074
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 54/133 (40%), Gaps = 44/133 (33%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLLIPES----GWRCPNCRQGHSSSMSRSVDLK 565
+C GD LL C GC +HA+CL+P L P GW+C C+
Sbjct: 315 LCESPGDLTELLFCTGCGSHYHASCLEPPLQPSPTIRIGWQCAECKT------------- 361
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C+IC + D+ ++ CD C+K +H CL+ + IPK+
Sbjct: 362 ------------CLICNE------SKDENKMLVCDVCDKGYHTYCLKP----PVSSIPKN 399
Query: 626 KWFC-----CDDC 633
+ C C DC
Sbjct: 400 GFRCERCRVCSDC 412
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 25/155 (16%)
Query: 510 MCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDL 564
+C CG N LLLC+ C ++FH CLDP L +P+ GW+C +C ++ + L
Sbjct: 820 VCEGCGGTSNESLLLLCDDCNISFHTYCLDPPLKEVPKGGWKCTDCVIC-TNCGQKDPGL 878
Query: 565 KGGLE------APGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGC--LRKNGL 616
G AP A + C IC + A ++ ++ C C + H C LR
Sbjct: 879 NGKWHANYSVCAPCASLTTCPIC-----NLAYREEELLVRCALCTRWAHANCDQLRTE-- 931
Query: 617 CDLKEIPKDKWFCCDDCNRIHAALQDFVSNRAQTI 651
D EI D + C C + A D + AQ +
Sbjct: 932 -DELEIATDLGYNCLLCRELGA---DIGTGHAQVL 962
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 84/226 (37%), Gaps = 57/226 (25%)
Query: 509 DMCHVCGD---GENLLLCNGCPLAFHAACLDPLLIP----ESGWRCPNCRQGHSSSMSRS 561
D+C CG LL C+ C +H+ C + I E GWRC +C
Sbjct: 770 DICIACGSIGLDTPLLACSQCGQCYHSFCAEVPKITRTMIEKGWRCLDC----------- 818
Query: 562 VDLKGGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKE 621
+C T ++ ++ CD C FH CL LKE
Sbjct: 819 -----------------TVCEG---CGGTSNESLLLLCDDCNISFHTYCLDP----PLKE 854
Query: 622 IPKDKWFC-----CDDCNRIHAALQ-DFVSNRAQTIPASSLST---INRKHIEKGILFDG 672
+PK W C C +C + L + +N + P +SL+T N + E+ +L
Sbjct: 855 VPKGGWKCTDCVICTNCGQKDPGLNGKWHANYSVCAPCASLTTCPICNLAYREEELLVRC 914
Query: 673 TMNDVQWQMLKKAQCFEEKEKSLLSSATAIFRECFDPIIAECGRDL 718
+ +W A C + + + L AT + C + E G D+
Sbjct: 915 ALC-TRW---AHANCDQLRTEDELEIATDLGYNCL--LCRELGADI 954
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLKGG 567
+C+ D +L+C+ C +H CL P + IP++G+RC CR RS L G
Sbjct: 364 ICNESKDENKMLVCDVCDKGYHTYCLKPPVSSIPKNGFRCERCRVCSDCGGGRSSTLS-G 422
Query: 568 LEAPGA 573
LE P A
Sbjct: 423 LEGPVA 428
>gi|355671930|gb|AER94956.1| bromodomain adjacent to zinc finger domain, 1A [Mustela putorius
furo]
Length = 1221
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 815 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 860
>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
Length = 2182
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 60/149 (40%), Gaps = 34/149 (22%)
Query: 470 GMAARRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNG-CPL 528
G R +P +H T G A+ Q SDD+C +CGDG +LLC+G C
Sbjct: 1626 GSRPRGRPPKHPRTEGGKP------PTALFQAGAINDSDDLCTLCGDGGLILLCDGPCHR 1679
Query: 529 AFHAACLDPLLIP-ESGWRCPNCRQGHSSSMSRSVDLKGGLEAPGAEVGGCVICRSHDFS 587
+FH C+ P + W CP+C +G C+IC+
Sbjct: 1680 SFHLECVGMKDEPNDEQWLCPDCAEGRHM---------------------CLICKQVGEM 1718
Query: 588 AATFDDRTVIYCD--QCEKEFHVGCLRKN 614
F V C +C + +H GCL K+
Sbjct: 1719 GVEFG---VTQCSVAKCGRFYHKGCLAKS 1744
>gi|443695479|gb|ELT96377.1| hypothetical protein CAPTEDRAFT_147496 [Capitella teleta]
Length = 1490
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
MC++CG G E++LLC+ C AFH CL P L +P+ WRCP C
Sbjct: 282 MCNLCGRGDGEESMLLCDSCDDAFHMHCLIPPLHEVPKGDWRCPKC 327
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1713 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREVVWVKVG 1768
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2113 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2163
>gi|332838263|ref|XP_003313474.1| PREDICTED: lysine-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|410354793|gb|JAA44000.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1688
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|307169034|gb|EFN61879.1| Bromodomain adjacent to zinc finger domain protein 2B [Camponotus
floridanus]
Length = 2352
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 54/126 (42%), Gaps = 34/126 (26%)
Query: 511 CHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSVDLK 565
C C G+N LLLC+GC +H C P + IP+ W C C M+++ +
Sbjct: 2069 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHEC-------MNKATGER 2121
Query: 566 GGLEAPGAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKD 625
C++C + ++ C+ C + +H C N + + ++P+
Sbjct: 2122 -----------NCLVC-------GKRAGKNLVLCELCPRAYHTDC--HNPV--MPKMPRG 2159
Query: 626 KWFCCD 631
KW+C +
Sbjct: 2160 KWYCSN 2165
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 6/63 (9%)
Query: 511 CHVCGD--GENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC--RQGHSSSMSRSVDL 564
C VCG G+NL+LC CP A+H C +P++ +P W C NC +Q + SR
Sbjct: 2123 CLVCGKRAGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKRNSSRRSHT 2182
Query: 565 KGG 567
KG
Sbjct: 2183 KGA 2185
>gi|355785795|gb|EHH65978.1| Lysine-specific demethylase 5A [Macaca fascicularis]
Length = 1690
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|355778515|gb|EHH63551.1| hypothetical protein EGM_16543, partial [Macaca fascicularis]
Length = 1426
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1114 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1159
>gi|332842506|ref|XP_003314441.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Pan troglodytes]
Length = 1524
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|426227070|ref|XP_004007650.1| PREDICTED: lysine-specific demethylase 5A [Ovis aries]
Length = 1793
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 377 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 436
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 437 DWRCPKC 443
>gi|7920301|gb|AAF70601.1|AF213467_1 ATP-dependent chromatin remodelling protein [Homo sapiens]
Length = 1556
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|194207285|ref|XP_001490944.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Equus caballus]
Length = 1601
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1195 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1240
>gi|441595313|ref|XP_004087232.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Nomascus leucogenys]
Length = 1533
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1128 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1173
>gi|431907355|gb|ELK11328.1| Bromodomain adjacent to zinc finger domain protein 1A [Pteropus
alecto]
Length = 1550
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1144 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1189
>gi|32967603|ref|NP_038476.2| bromodomain adjacent to zinc finger domain protein 1A isoform a [Homo
sapiens]
gi|116241266|sp|Q9NRL2.2|BAZ1A_HUMAN RecName: Full=Bromodomain adjacent to zinc finger domain protein 1A;
AltName: Full=ATP-dependent chromatin-remodeling protein;
AltName: Full=ATP-utilizing chromatin assembly and
remodeling factor 1; Short=hACF1; AltName: Full=CHRAC
subunit ACF1; AltName: Full=Williams syndrome
transcription factor-related chromatin-remodeling factor
180; Short=WCRF180; AltName: Full=hWALp1
gi|6942227|gb|AAF32366.1|AF221130_1 chromatin remodeling factor WCRF180 [Homo sapiens]
gi|119586304|gb|EAW65900.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_a [Homo
sapiens]
gi|162319394|gb|AAI56492.1| Bromodomain adjacent to zinc finger domain, 1A [synthetic construct]
Length = 1556
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|403263949|ref|XP_003924259.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Saimiri boliviensis boliviensis]
Length = 1524
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|390467343|ref|XP_002752254.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
partial [Callithrix jacchus]
Length = 1595
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 152 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 211
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 212 DWRCPKC 218
>gi|355693214|gb|EHH27817.1| hypothetical protein EGK_18107, partial [Macaca mulatta]
Length = 1519
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1114 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1159
>gi|332249134|ref|XP_003273720.1| PREDICTED: lysine-specific demethylase 5A [Nomascus leucogenys]
Length = 1690
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|301774883|ref|XP_002922870.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Ailuropoda melanoleuca]
Length = 1641
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1235 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1280
>gi|126340225|ref|XP_001373115.1| PREDICTED: lysine-specific demethylase 5A [Monodelphis domestica]
Length = 1689
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 273 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 332
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 333 DWRCPKC 339
>gi|397499371|ref|XP_003820427.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A [Pan
paniscus]
Length = 1717
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|355563873|gb|EHH20373.1| Lysine-specific demethylase 5A [Macaca mulatta]
gi|383420467|gb|AFH33447.1| lysine-specific demethylase 5A [Macaca mulatta]
Length = 1690
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|119586306|gb|EAW65902.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_c [Homo
sapiens]
gi|119586309|gb|EAW65905.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_c [Homo
sapiens]
Length = 1555
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1150 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1195
>gi|119586308|gb|EAW65904.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_e [Homo
sapiens]
Length = 1560
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1155 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1200
>gi|426371192|ref|XP_004052536.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Gorilla gorilla gorilla]
Length = 1589
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 257 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 316
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 317 DWRCPKC 323
>gi|426248832|ref|XP_004018162.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A [Ovis aries]
Length = 1647
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1242 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1285
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1712 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREVVWVKVG 1767
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 2112 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 2162
>gi|241859183|ref|XP_002416180.1| fetal alzheimer antigen, putative [Ixodes scapularis]
gi|215510394|gb|EEC19847.1| fetal alzheimer antigen, putative [Ixodes scapularis]
Length = 2457
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 508 DDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQGHSSSMSRSV-DL 564
DD C VC +LL C CP FH ACLDP L +P W C C+ S ++ + D+
Sbjct: 321 DDHCRVCHKLGDLLCCETCPAVFHLACLDPPLTDVPTEDWICTVCQANQVSGVTDCISDI 380
Query: 565 -KGGL 568
KGGL
Sbjct: 381 EKGGL 385
>gi|119609376|gb|EAW88970.1| Jumonji, AT rich interactive domain 1A (RBBP2-like), isoform CRA_a
[Homo sapiens]
Length = 1315
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|32967605|ref|NP_872589.1| bromodomain adjacent to zinc finger domain protein 1A isoform b [Homo
sapiens]
gi|119586307|gb|EAW65903.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_d [Homo
sapiens]
Length = 1524
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|435778|gb|AAB28544.1| retinoblastoma binding protein 2 [Homo sapiens]
Length = 1722
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|410223430|gb|JAA08934.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410300510|gb|JAA28855.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
gi|410354795|gb|JAA44001.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|302816250|ref|XP_002989804.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
gi|300142370|gb|EFJ09071.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
Length = 2150
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 501 RRTTG--GSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+RT G GS C VC G NLL C+ CP +H CL P L P WRCP+C
Sbjct: 236 KRTRGSDGSFFECMVCQSGGNLLCCDHCPRVYHLHCLSPPLKRAPTGKWRCPDC 289
>gi|119586305|gb|EAW65901.1| bromodomain adjacent to zinc finger domain, 1A, isoform CRA_b [Homo
sapiens]
Length = 1523
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1118 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1163
>gi|410261294|gb|JAA18613.1| lysine (K)-specific demethylase 5A [Pan troglodytes]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|351710793|gb|EHB13712.1| Lysine-specific demethylase 5A [Heterocephalus glaber]
Length = 1694
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|332842504|ref|XP_003314440.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Pan troglodytes]
Length = 1556
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|194375257|dbj|BAG62741.1| unnamed protein product [Homo sapiens]
Length = 1049
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 233 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 292
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 293 DWRCPKC 299
>gi|354476339|ref|XP_003500382.1| PREDICTED: lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1671
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 255 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 314
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 315 DWRCPKC 321
>gi|354498657|ref|XP_003511431.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Cricetulus griseus]
gi|344245398|gb|EGW01502.1| Bromodomain adjacent to zinc finger domain protein 1A [Cricetulus
griseus]
Length = 1525
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1121 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1166
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 32/57 (56%), Gaps = 1/57 (1%)
Query: 511 CHVCGDGENLLLCNGCPLAFHAACLDPLLIPESGWRCPNCRQGHSSSMSRSVDLKGG 567
C VC +G +LL C+ CP AFH CL+ + IPE W C +C+ G V +K G
Sbjct: 1523 CFVCSEGGSLLCCDSCPAAFHRECLN-IDIPEGNWYCNDCKAGKKPHYREIVWVKVG 1578
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 24/51 (47%), Gaps = 2/51 (3%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCN--GCPLAFHAACLDPLLIPESGWRCP 548
Q T +D C CGD L+ C GCP +HA CL+ P W CP
Sbjct: 1923 QGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP 1973
>gi|121718370|ref|XP_001276189.1| PHD transcription factor (Rum1), putative [Aspergillus clavatus
NRRL 1]
gi|119404387|gb|EAW14763.1| PHD transcription factor (Rum1), putative [Aspergillus clavatus
NRRL 1]
Length = 1707
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 5/50 (10%)
Query: 509 DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
D C +CG GE+ +L+C+ C +H CLDP L +PE W CP C G
Sbjct: 441 DKCEICGKGEDRPSILVCDSCDQGYHKNCLDPPLTTVPEYDWHCPKCLVG 490
>gi|110618244|ref|NP_001036068.1| lysine-specific demethylase 5A [Homo sapiens]
gi|215274124|sp|P29375.3|KDM5A_HUMAN RecName: Full=Lysine-specific demethylase 5A; AltName: Full=Histone
demethylase JARID1A; AltName: Full=Jumonji/ARID
domain-containing protein 1A; AltName:
Full=Retinoblastoma-binding protein 2; Short=RBBP-2
gi|168275608|dbj|BAG10524.1| histone demethylase JARID1A [synthetic construct]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|417406655|gb|JAA49977.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1649
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 229 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 288
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 289 DWRCPKC 295
>gi|348552013|ref|XP_003461823.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A-like
[Cavia porcellus]
Length = 1635
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 359 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 418
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 419 DWRCPKC 425
>gi|242004184|ref|XP_002423004.1| Jumonji/ARID domain-containing protein 1D, putative [Pediculus
humanus corporis]
gi|212505920|gb|EEB10266.1| Jumonji/ARID domain-containing protein 1D, putative [Pediculus
humanus corporis]
Length = 1680
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 510 MCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNC 550
+CH C G EN+LLC+GC ++H CL P L IP+ WRCP C
Sbjct: 256 ICHNCERGDAEENMLLCDGCDDSYHTFCLLPPLTEIPKGDWRCPKC 301
>gi|417406677|gb|JAA49985.1| Putative lysine-specific demethylase 5a [Desmodus rotundus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|74141547|dbj|BAE38548.1| unnamed protein product [Mus musculus]
Length = 1093
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|417406599|gb|JAA49949.1| Putative chromatin remodeling complex wstf-iswi large subunit
[Desmodus rotundus]
Length = 1557
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1152 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1197
>gi|301756621|ref|XP_002914152.1| PREDICTED: lysine-specific demethylase 5A-like [Ailuropoda
melanoleuca]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|344253284|gb|EGW09388.1| Lysine-specific demethylase 5A [Cricetulus griseus]
Length = 1608
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 192 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 251
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 252 DWRCPKC 258
>gi|432091446|gb|ELK24528.1| Lysine-specific demethylase 5A [Myotis davidii]
Length = 1097
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 507 SDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
++D C C DG +LL C C AFH CLDP + +PE W C +C Q
Sbjct: 88 NNDFCDACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSCEQN 136
>gi|449273319|gb|EMC82842.1| PHD finger protein 21B, partial [Columba livia]
Length = 446
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 5/73 (6%)
Query: 482 YTSNGMTLHDIAISLAMGQRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL-- 539
Y +N + L AI L + ++ D+ C C G NL C CP A+H CLDP L
Sbjct: 242 YLNNPLFLSTRAILLLL---QSEIHHDEHCTSCKRGINLQPCGTCPRAYHLNCLDPPLKT 298
Query: 540 IPESGWRCPNCRQ 552
P+ W CP C+Q
Sbjct: 299 APKGVWVCPKCQQ 311
>gi|431892152|gb|ELK02599.1| Lysine-specific demethylase 5A [Pteropus alecto]
Length = 1692
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDRLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|74201318|dbj|BAE26113.1| unnamed protein product [Mus musculus]
Length = 1094
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|359323071|ref|XP_854690.3| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Canis lupus familiaris]
Length = 1688
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|443894534|dbj|GAC71882.1| DNA-binding protein jumonji/RBP2/SMCY [Pseudozyma antarctica T-34]
Length = 2474
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Query: 505 GGSDDMCHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCRQG 553
G + MC +C DG N+LLC+ C +H CL P L IP+S W CP C G
Sbjct: 532 GAEEQMCEICLRGEDGPNMLLCDECNRGYHMYCLQPPLTSIPKSQWFCPPCLVG 585
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 10/58 (17%)
Query: 572 GAEVGGCVICRSHDFSAATFDDRTVIYCDQCEKEFHVGCLRKNGLCDLKEIPKDKWFC 629
GAE C IC + D ++ CD+C + +H+ CL+ L IPK +WFC
Sbjct: 532 GAEEQMCEICLRGE------DGPNMLLCDECNRGYHMYCLQP----PLTSIPKSQWFC 579
>gi|6683494|dbj|BAA89209.1| bromodomain adjacent to zinc finger domain 1A [Homo sapiens]
Length = 1674
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1269 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1314
>gi|355697494|gb|AES00689.1| lysine -specific demethylase 5A [Mustela putorius furo]
Length = 1504
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 294 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 353
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 354 DWRCPKC 360
>gi|387541282|gb|AFJ71268.1| bromodomain adjacent to zinc finger domain protein 1A isoform a
[Macaca mulatta]
Length = 1556
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1151 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1196
>gi|51593641|gb|AAH80691.1| Jarid1a protein, partial [Mus musculus]
Length = 1102
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLLPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|329663691|ref|NP_001192554.1| lysine-specific demethylase 5A [Bos taurus]
gi|296487060|tpg|DAA29173.1| TPA: JARID1A variant protein-like [Bos taurus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|291403724|ref|XP_002718181.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1A isoform 2
[Oryctolagus cuniculus]
Length = 1525
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1119 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1164
>gi|410963533|ref|XP_003988319.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A
[Felis catus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|395743750|ref|XP_002822777.2| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5A,
partial [Pongo abelii]
Length = 1613
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 314 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 373
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 374 DWRCPKC 380
>gi|390469001|ref|XP_002807272.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Callithrix jacchus]
Length = 1552
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%), Gaps = 5/46 (10%)
Query: 511 CHVC---GDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
C +C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 1148 CKICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 1193
>gi|291392891|ref|XP_002712829.1| PREDICTED: retinoblastoma binding protein 2 isoform 1 [Oryctolagus
cuniculus]
Length = 1690
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 9/67 (13%)
Query: 493 AISLAMGQRRTTGGSD----DMCHVCGDGEN---LLLCNGCPLAFHAACLDPLL--IPES 543
A ++ M QR+ T + +C CG G N LLLC+GC ++H CL P L +P+
Sbjct: 274 AFNMQMRQRKGTLSVNFVDLYVCMFCGRGNNEDKLLLCDGCDDSYHTFCLIPPLPDVPKG 333
Query: 544 GWRCPNC 550
WRCP C
Sbjct: 334 DWRCPKC 340
>gi|4884108|emb|CAB43261.1| hypothetical protein [Homo sapiens]
Length = 811
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 510 MCHVCGDGENLLLCNGCPLAFHAACLDPLL--IPESGWRCPNCR 551
+C GD EN++LC+GC H C+ P L +PE W CP CR
Sbjct: 408 ICRKKGDAENMVLCDGCDRGHHTYCVRPKLKTVPEGDWFCPECR 451
>gi|195449007|ref|XP_002071907.1| GK24911 [Drosophila willistoni]
gi|194167992|gb|EDW82893.1| GK24911 [Drosophila willistoni]
Length = 884
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 35/81 (43%), Gaps = 4/81 (4%)
Query: 500 QRRTTGGSDDMCHVCGDGENLLLCNGCPLAFHAACLDPLL----IPESGWRCPNCRQGHS 555
+R G + D C C +G NLL C+ CP +FH C DP L IP W C +CR
Sbjct: 45 RRPGRGHNHDFCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPTGQWLCHSCRMAKM 104
Query: 556 SSMSRSVDLKGGLEAPGAEVG 576
S S P A G
Sbjct: 105 SHPPSSSKASSVERVPSAGSG 125
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 43/106 (40%), Gaps = 18/106 (16%)
Query: 462 PSQFEAHAGMAA--RRQPYRHIYTSNGMTLHDIAISLAMGQRRTTGGSD----------- 508
P++ E HA + QP + ++T NG A QRR + +
Sbjct: 199 PAELELHAQFPGHGKVQPVQPVHTDNGHHGRKTGAGSAGNQRRNSKPFELDSQGLVPLPA 258
Query: 509 DMCHVCGDG---ENLLLCNGCPLAFHAACLDPLL--IPESGWRCPN 549
C C L+ C+ CPL +H CLDP L +P W CPN
Sbjct: 259 KTCFYCKRSCKRAPLISCDYCPLYYHQDCLDPPLTALPAGLWMCPN 304
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,515,414,837
Number of Sequences: 23463169
Number of extensions: 584944263
Number of successful extensions: 1463702
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1388
Number of HSP's successfully gapped in prelim test: 4044
Number of HSP's that attempted gapping in prelim test: 1445242
Number of HSP's gapped (non-prelim): 17244
length of query: 850
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 698
effective length of database: 8,792,793,679
effective search space: 6137369987942
effective search space used: 6137369987942
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)