BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002450
(920 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/931 (50%), Positives = 599/931 (64%), Gaps = 103/931 (11%)
Query: 1 MKRELDYELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIVYSRVKR 60
MKRE+ G E+ +Q QA D + N CKRFKV VNGF VYSR+++
Sbjct: 1 MKREVGAFDGGIQFESESQ-------QAQD----SNNNNNCKRFKV--VNGFFVYSRLRK 47
Query: 61 SRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQMTVENV 120
++ S+ + H+ + K + I+++V E E V++V
Sbjct: 48 NKPSSRE---------------CHDDKDRKCQQ-------IIQTVSEVETVNKDPQVKDV 85
Query: 121 IEE-TVKGKKAPICKEEPISK-VECFPRKEGGSE-------------VSNGLNKKCLKRP 165
E ++ + PICK E S+ EGG+E SN L + L R
Sbjct: 86 SRELSLCNVQLPICKIESFSEPSRVILANEGGTEDTERKLAHVGTEGKSNKLRQ--LTR- 142
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S KVEPVEV V E +E +S ++VE IAEGSALT PKKNLELKMSKKI+L+ P
Sbjct: 143 SNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKIALDNIP 202
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
MTV ELFETGLL+GV VVYMGG K A LRG I+D GILC CS C GCRVIPPS+FEIH
Sbjct: 203 MTVKELFETGLLEGVPVVYMGGKK--AFCLRGTIKDVGILCYCSFCKGCRVIPPSQFEIH 260
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
A KQYRRA+QYICFENGKSLL+VL ACR+ PL L+AT+QSA+S LP+EK+F C RCKGT
Sbjct: 261 AIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKTFTCKRCKGT 320
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNN 405
+P VGK GP LC+SCV+SK+ G+ T I+ R+S P VS+S + +S +N
Sbjct: 321 YPTILVGKVGP--LCSSCVESKESNGSPACETNIKSRSSKPATVSKSLNSALEGVSSEN- 377
Query: 406 KRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSR-PGLIANSTPVTSVHK 464
+ W++T R L+ + +
Sbjct: 378 --------------------------------KCQWKITTKDQRLHKLVFEDGGLPDGTE 405
Query: 465 SSQSQRQRKITKKSKKTVLISKPFENAS-PPLSFPNKSRWNITPKDQRLHKLVFDESGLP 523
+ R +K+ K+ I N P +F + W K + ++ +G+
Sbjct: 406 VAYYARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKP---YAYIYTSNGVS 462
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEV----SPSQFEAHAECASLSSIPQGDWY 579
+ G+K + L I+C S + P F H CASLSSIP+G W+
Sbjct: 463 LHELAISLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAF--HKGCASLSSIPRGKWF 520
Query: 580 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 639
C++CQNMF+R++F++H+ANAV AGR+SGVD +EQIT+RCIRIVKN+EAEL+GC+LCRG D
Sbjct: 521 CQFCQNMFQREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYD 580
Query: 640 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 699
FS+SGFGPRTI+LCDQC +EFHVGCL+ HK+A+L+ELPKGKWFCC DC RI+S L+ LL
Sbjct: 581 FSRSGFGPRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLA 640
Query: 700 QEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 758
+EAE +P L + KK LETV++IDVRW+LL+GK+A+PET+LLLSQA+AIF +CF
Sbjct: 641 REAEIIPNKLLEVVMKKNEEKGLETVNNIDVRWKLLTGKSASPETKLLLSQALAIFQECF 700
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 818
DPIVD+ +GRDLIP MVYG+N +GQ++GGMYCA+L VNS VVSA I+R+FGQEVAELPLV
Sbjct: 701 DPIVDT-TGRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLV 759
Query: 819 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 878
ATS NHGKGYFQLLF+ IEKLL++L+V SIVLPAAEEAESIWTDKFGF+KI P+ LS Y
Sbjct: 760 ATSNGNHGKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSKY 819
Query: 879 RKRCSQLVTFKGTSMLQKRVPACRIGSSSTD 909
RK C Q++TFKGTSMLQK VP CRI + +T+
Sbjct: 820 RKSCCQILTFKGTSMLQKAVPPCRIVNQNTE 850
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/969 (48%), Positives = 605/969 (62%), Gaps = 103/969 (10%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P S
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSA 392
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S K K K E +S+S +S +R K +S ++ + S +P L+
Sbjct: 393 SVSISSRYKRKWVTKAKSKSSEYTSISRSP-RSAPMRIPSKNKSALKMRKKSLKPALMLK 451
Query: 456 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
S+ S SS ++ Q KIT K ++ L FE P+ + + Q+L +
Sbjct: 452 SSQSASKCSSSLAKNQWKITTKDQR--LHKLVFEEDG----LPDGTEVAYFARGQKLLQG 505
Query: 516 VFDESGL----------PDGTEV--GY------YAC------------------GQKLLE 539
SG+ P EV G+ YA G+K
Sbjct: 506 YKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSA 565
Query: 540 GYKNGLGIIC----HCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQH 595
+ L IIC + + P F H ECASLSS P+GDWYCK+CQNMF+R++F++H
Sbjct: 566 KDNDDLCIICLDGGNLLLCDGCPRAF--HKECASLSSTPRGDWYCKFCQNMFQREKFVEH 623
Query: 596 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
+ NAV AGRV GVD +EQITKRCIRIV+N+E +LSGC+LCRG DFSKSGFGPRTI+LCDQ
Sbjct: 624 NVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQ 683
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK 715
CE+EFHVGCLK HKMA L+ELP+GKWFC + C+RI+S LQ LL++ EKLP L A+ +
Sbjct: 684 CEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNR 743
Query: 716 YAGNSLETVS-DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 774
G + + D+DV WRL+SGK A+PETRLLLS+A+AIFHD FDPIVD SGRDLIP+M
Sbjct: 744 KLGENCSDIQVDVDVSWRLISGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAM 803
Query: 775 VYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 834
VYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGKGYFQ LF
Sbjct: 804 VYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTLF 863
Query: 835 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 894
+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSML
Sbjct: 864 SCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSML 923
Query: 895 QKRVPACRI 903
QK VP+CR+
Sbjct: 924 QKTVPSCRV 932
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/972 (47%), Positives = 601/972 (61%), Gaps = 100/972 (10%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P S
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSA 392
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S K K K E +S+S +S +R K +S ++ + S +P L+
Sbjct: 393 SVSISSRYKRKWVTKAKSKSSEYTSISRSP-RSAPMRIPSKNKSALKMRKKSLKPALMLK 451
Query: 456 STPVTSVHKSSQSQRQRKITKKSK---KTVLISKPFENASPPLSFPNKSRW--------- 503
S+ S SS ++ Q KIT K + K V + + F +
Sbjct: 452 SSQSASKCSSSLAKNQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSG 511
Query: 504 --------NITPKDQRLHK----------LVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
++P +H ++ +G+ + G+K + L
Sbjct: 512 ILCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDL 571
Query: 546 GIIC----HCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
IIC + + P F H ECASLSSIP+GDWYCK+CQNMF+R++F++H+ NAV
Sbjct: 572 CIICLDGGNLLLCDGCPRAF--HKECASLSSIPRGDWYCKFCQNMFQREKFVEHNVNAVA 629
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
AGRV GVD +EQITKRCIRIV+N+E +LSGC+LCRG DFSKSGFGPRTI+LCDQCE+EFH
Sbjct: 630 AGRVHGVDPIEQITKRCIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFH 689
Query: 662 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 721
VGCLK HKMA L+ELP+GKWFC + C+RI+S LQ LL++ EKLP L A+ + G +
Sbjct: 690 VGCLKDHKMAFLKELPRGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENC 749
Query: 722 ETVS-DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNL 780
+ D+DV WRL+SGK A+PETRLLLS+A+AIFHD FDPIVD SGRDLIP+MVYGR++
Sbjct: 750 SDIQVDVDVSWRLISGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDV 809
Query: 781 RGQEFGGMYCAILTVNSSVVSAGILRVFGQ---------EVAELPLVATSKINHGKGYFQ 831
GQEFGGMYCAIL VNS VVSA +LRVFGQ ++AELPLVATS NHGKGYFQ
Sbjct: 810 GGQEFGGMYCAILIVNSFVVSAAMLRVFGQYCRAAIGCXDIAELPLVATSNGNHGKGYFQ 869
Query: 832 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGT 891
LF+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGT
Sbjct: 870 TLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGT 929
Query: 892 SMLQKRVPACRI 903
SMLQK VP+CR+
Sbjct: 930 SMLQKTVPSCRV 941
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/751 (52%), Positives = 502/751 (66%), Gaps = 72/751 (9%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI--RIRASGPKLV--- 389
++F C RCKG FP CVGK GP LCN C +SK+ T+T + I R + P L+
Sbjct: 121 RTFTCKRCKGIFPSICVGKIGP--LCNLCAESKESHPTLTIGSSIISRYCQNLPSLILIS 178
Query: 390 ------SRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWEL 443
+ +S + +++ + + ++L + +S + S N L+ +
Sbjct: 179 WIINLKTITSGQFLLMLAHCSFRLSFLSPEQVLALEYFKPASLSTFSQDNTLRKKK---- 234
Query: 444 TRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRW 503
R P LIA + V SVH S + ++ +KI+ + ++ L FE P+ +
Sbjct: 235 -RKPEEPDLIAKPSKVASVHLSPRKRKYKKISPRDQR--LHRLVFEEGG----LPDGTEL 287
Query: 504 NITPKDQRL---HKLVFD----------------ESGLPDGTEVGYYAC----------- 533
+ Q+L +K F E+ T YAC
Sbjct: 288 AYYARGQKLLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHD 347
Query: 534 -------GQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYC 580
+K + L IIC C+ P F H CASLS++P GDWYC
Sbjct: 348 LAISLSKSRKYSSQDNDDLCIICADGGDLLLCDG--CPRAF--HKGCASLSTVPSGDWYC 403
Query: 581 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 640
++CQN F+R++F++H+ANA AGRVS +DS+EQITKRC RIVKN+EAEL+GC LCRG DF
Sbjct: 404 QHCQNTFQREKFVEHNANAFAAGRVSEIDSIEQITKRCFRIVKNVEAELTGCALCRGYDF 463
Query: 641 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 700
+SGFGPRTI+LCDQCE+EFHVGCL+ HKMA+L+ELPKG WFCCMDCSRI+S LQ LL++
Sbjct: 464 MRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELPKGNWFCCMDCSRIHSTLQKLLIR 523
Query: 701 EAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 759
AEKLP+ LN IKK + L + IDVRW LLSGK A+PE +LLLS+A++IF +CFD
Sbjct: 524 GAEKLPDSLLNDIKKKHEEKGLNISNSIDVRWTLLSGKIASPENKLLLSRALSIFQECFD 583
Query: 760 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 819
PIVDS GRDLIP MVYG+N +GQ++GGMYCA+L VNS +VSAGILRVFG+EVAELPLVA
Sbjct: 584 PIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLIVNSCIVSAGILRVFGEEVAELPLVA 643
Query: 820 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 879
T +HGKGYFQLLF+CIEKLL+FL V+++VLPAAEEAESIW +KFGF+KI PE LS YR
Sbjct: 644 TRNGDHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWIEKFGFQKIKPEQLSKYR 703
Query: 880 KRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 910
K C Q+V F+GTSMLQK VP C+I + S +S
Sbjct: 704 KNCCQMVRFEGTSMLQKAVPTCKIVNQSIES 734
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 54/118 (45%), Gaps = 11/118 (9%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQA 252
E + IA+ S + S + + KKIS + + E GL DG + Y G K
Sbjct: 239 EPDLIAKPSKVASVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKL-- 296
Query: 253 SGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF--ENGKSLLEV 308
L G R GILC C CN C V+ PS FE HA R+ Y C NG SL ++
Sbjct: 297 --LGGYKRGFGILCHC--CN-CEVVSPSTFEAHAGWATRK-KPYACIYTSNGVSLHDL 348
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/864 (49%), Positives = 544/864 (62%), Gaps = 98/864 (11%)
Query: 45 KVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILES 104
KV+ VNG+IVY+R KRS S NG E
Sbjct: 21 KVSVVNGYIVYTRAKRSLDSC---------------------------------NGFSEH 47
Query: 105 VVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLKR 164
++N V++ EN EC K +EV K+ R
Sbjct: 48 AELKDNAEVEVKTENG---------------------ECEKLKNESTEVVARTRKR--SR 84
Query: 165 PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
SA++ KVE + +V +E+ ++ AL +P+ +ELKMSKKI +N+K
Sbjct: 85 RSALEAKVECCDQMVV------SETEQVVANGGSGINGALGAPRNKMELKMSKKIVVNRK 138
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
PMTV +LF+TG LDGVSVVYMGGIK +ASGLRG+IRDGGILCSC LCNG RVIPPS+FEI
Sbjct: 139 PMTVKKLFDTGFLDGVSVVYMGGIK-KASGLRGVIRDGGILCSCCLCNGRRVIPPSQFEI 197
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HACKQYRRA+QYIC ENGKSLL++LRACR L L+ T+Q+ + S EE+ F C RCKG
Sbjct: 198 HACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYFTCKRCKG 257
Query: 345 TFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQN 404
FP + V + GP +C SCV+S+K + + G R+R+ P ++S S + +S Q
Sbjct: 258 CFPSSFVERVGP--ICRSCVESRKSEESSNNVVGKRVRSPRPVVLSNPSSTSELSVSSQV 315
Query: 405 NKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHK 464
KR RKK KL+ +S SS L L+ + + + T V +
Sbjct: 316 -KRHRKKRTKLV---FISISSVLDQRLHKLVFEENG------------LPDGTEVAYYAR 359
Query: 465 SSQSQRQRK----ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDES 520
+ K I + T + FE + S + T LH+L
Sbjct: 360 GQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVSLHELAIS-- 417
Query: 521 GLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYC 580
L + + + G ++C C P F H ECA+LSSIP+GDWYC
Sbjct: 418 -LSKDRKYSAKDNDDLCIVCWDGGNLLLCDGC-----PRAF--HKECAALSSIPRGDWYC 469
Query: 581 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 640
++CQNMF+R++F+ H+ANAV AGRV GVD +EQI RCIRIVK++EA+LS C LCRG DF
Sbjct: 470 QFCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDF 529
Query: 641 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 700
S+SGFGPRTI+LCDQCE+E+HVGCL+ HKMA L+ELP+G W CC DC+RI+S L+NLLV+
Sbjct: 530 SRSGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVK 589
Query: 701 EAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 759
AE+LPE L IKK LE + IDVRWRLL+GK A+PETR LL +AV+IFH+CF+
Sbjct: 590 GAERLPESLLGVIKKKQEEKGLEPI--IDVRWRLLNGKIASPETRPLLLEAVSIFHECFN 647
Query: 760 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 819
PIVD+ SGRDLIP+MVYGRN+RGQEFGGMYCA+L VNSSVVSAG+LR+FG +VAELPLVA
Sbjct: 648 PIVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVA 707
Query: 820 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 879
TS NHGKGYFQ LF+CIE+LL+FL VK++VLPAAEEAESIWTDKFGF K++P+ L+ YR
Sbjct: 708 TSNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSKMNPDELTNYR 767
Query: 880 KRCSQLVTFKGTSMLQKRVPACRI 903
K C Q+V+FKGT+ML K VP+CR+
Sbjct: 768 KNCHQMVSFKGTNMLHKMVPSCRV 791
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/752 (52%), Positives = 485/752 (64%), Gaps = 115/752 (15%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L PMTV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
++F C RCK + + P +S SS+
Sbjct: 121 RTFTCKRCKEQV--------------------------------LALEYFKPASLSTSSQ 148
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+++ RKK RK E D ++K S KS S+ + R +++ P L+
Sbjct: 149 DNT----------PRKKKRKPEEQDSITKPS-KSASVYLSSRKRKYKKIS-----PRLVC 192
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
P+ + K V I K + SP F +DQRLH+
Sbjct: 193 FFYPIDILFGLVMLS-PFPFLWLVKIFVFIRK-YAYLSPFCPFSGYQS-----QDQRLHR 245
Query: 515 LVFDESGLPDGTEVGYYACGQ-------------------------KLLEGY-----KNG 544
LVF+E GLPDGTE+ YYA GQ KLL GY NG
Sbjct: 246 LVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQKLLGGYAYIYTSNG 305
Query: 545 LGI--------------------ICHCCNS-------EVSPSQFEAHAECASLSSIPQGD 577
+ + +C C + P F H CAS+ ++P GD
Sbjct: 306 VSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPRAF--HKGCASIPTVPSGD 363
Query: 578 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 637
WYC+YCQN FER++ ++H+ANA AGR SG+DS+EQITKRC RIVKN+EAEL+GC LCRG
Sbjct: 364 WYCQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKRCFRIVKNIEAELTGCALCRG 423
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
DF +SGFGPRTI+LCDQCE+EFHVGCL+ HKM +L+ELPKG WFCCMDCSRI+S LQ L
Sbjct: 424 YDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKGNWFCCMDCSRIHSTLQKL 483
Query: 698 LVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 756
L++ AEKLP+ LN IKK + L ++IDVRW LLSGK A+PE +LLLS+A++IF +
Sbjct: 484 LIRGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSGKIASPENKLLLSRALSIFQE 543
Query: 757 CFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 816
CFDPIVDS GRDLIP MVYG+N +GQ++GGMYCA+LT+NSS+VSAGILRVFG+EVAELP
Sbjct: 544 CFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTINSSIVSAGILRVFGEEVAELP 603
Query: 817 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 876
LVAT HGKGYFQLLF+CIEKLL+FL V+++VLPAAEEAESIWT+KFGF+KI PE L+
Sbjct: 604 LVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWTEKFGFQKIKPEQLN 663
Query: 877 IYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 908
YRK C Q+V F+GTSMLQK VP CRI + T
Sbjct: 664 KYRKSCCQMVRFEGTSMLQKAVPTCRIVNQRT 695
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/700 (50%), Positives = 470/700 (67%), Gaps = 59/700 (8%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I ++KKP TV ELF+TGLLDGV VVY+G K + LRG I+DGGILCSCSLCNG RV
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
IPPS+FEIHAC Y+RA+QYIC ENGKS+LE++RACR+ PL L+AT+Q+ ++S PEEK
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C C+G FP + V + G LC SCV+S+K + + + G RIR+S + +++
Sbjct: 207 FTCKNCRGCFPSSNVERVGL--LCLSCVESRKSEKSSIHAVGKRIRSSKLSVKLKTAPIT 264
Query: 397 SMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSR-PGLIAN 455
S C+S QN +S W +++ R LI
Sbjct: 265 SKCLSPQN---------------------------------KSQWRISKRYQRLHKLIFE 291
Query: 456 STPVTSVHKSSQSQRQRKITKKSKK-TVLISKPFENASPPLSFPNKSRWN---------I 505
+ + + + R +K+ + K + ++ + P F + W
Sbjct: 292 EDGLPNGAEVAYYARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIY 351
Query: 506 TPKDQRLHKL-VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
T LH+L +F T+ Y C + + G ++C C P F H
Sbjct: 352 TSNGVSLHELAIFLSKDHKCTTKQNDYVC----VVCWDGGNLLLCDGC-----PRAF--H 400
Query: 565 AECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 624
ECAS+SSIP+G+WYC+ CQ+ F R+R + H+A+AV AGRV GVD +EQI KRCIRIVK+
Sbjct: 401 KECASVSSIPRGEWYCQICQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKD 460
Query: 625 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 684
+ AE+ GC+LCR DFS+SGFGPRTI++CDQCE+E+HVGCL+ HKMA L+ELP+G WFCC
Sbjct: 461 IGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCC 520
Query: 685 MDCSRINSVLQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPET 743
DC+RI+S L+NLL++ AE+LPE L+ IKK G LE +++IDVRW+LL+GK A+PET
Sbjct: 521 NDCTRIHSTLENLLIRVAERLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGKIASPET 580
Query: 744 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 803
R LL +AV++FH+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L VNSSVVSAG
Sbjct: 581 RPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAG 640
Query: 804 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 863
++R+FG+++AELPLVAT N GKGYFQ LFACIE+LL+FL VK++VLPAAEEA SIWT+
Sbjct: 641 MVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTE 700
Query: 864 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
KFGF K+ P L+ YR C Q++ FKGT+ML K VP CR+
Sbjct: 701 KFGFSKMKPNQLTNYRMNCHQIMAFKGTNMLHKTVPQCRV 740
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 637 bits (1642), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/713 (48%), Positives = 465/713 (65%), Gaps = 60/713 (8%)
Query: 208 KKNLELKMSKKI-SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILC 266
K LK +KKI ++KKP+TV ELF+TGLLDGV VVY+G K + LRG I+DGGILC
Sbjct: 85 KTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILC 144
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
SC LCNG RVIPPS+FEIHAC Y+RA+QYIC ENGKSLL+++RACR+ PL L+AT+Q+
Sbjct: 145 SCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQN 204
Query: 327 ALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGP 386
++S PEEK F C C+G P S + + + +++R SG
Sbjct: 205 FINSPPEEKYFTCKSCRG-----------PLGQYYSPIHVHVVLLNLNSVSLLKLRNSGR 253
Query: 387 KLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRN 446
+ S SS K + KL + SK S + +S W +++
Sbjct: 254 QEQSWSS----------------KLSVKLKTVPITSKCLSP--------QNKSQWRISKR 289
Query: 447 SSR-PGLIANSTPVTSVHKSSQSQRQRKITKKSK-KTVLISKPFENASPPLSFPNKSRWN 504
R LI + + + + R +K+ + K + ++ + P F + W
Sbjct: 290 YQRLHKLIFEEDGLPNGAEVAYYARGQKLLEGIKTRCGIVCRCCNTEVSPSQFEVHAGWA 349
Query: 505 ---------ITPKDQRLHKL-VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
T LH+L +F T+ Y C + + G ++C C
Sbjct: 350 SRRKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVC----VVCWDGGNLLLCDGC-- 403
Query: 555 EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 614
P F H ECAS+SSIP+G+WYC+ CQ+ F R+R + ++A+AV AGRV GVD +EQI
Sbjct: 404 ---PRAF--HKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPIEQI 458
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
KRCIRIVK++ AE+ GC+LCR DFS+SGFGPRTI++CDQCE+E+HVGCL+ HK A L+
Sbjct: 459 AKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKAYLK 518
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRL 733
ELP+G WFCC DC+ I+S L+NLL++ AE+LPE L+ IKK LE +++IDVRW+L
Sbjct: 519 ELPEGDWFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVRWKL 578
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
L+GK A+PETR LL +AV++FH+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L
Sbjct: 579 LNGKIASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALL 638
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
VNSSVVSAG++R+FG+++AELPLVAT N GKGYFQ LFACIE+LL+FL VK++VLPA
Sbjct: 639 IVNSSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPA 698
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 906
AEEAESIWT+KFGF K+ + L+ YR C Q++ FKGT+ML K VP CR+ +S
Sbjct: 699 AEEAESIWTEKFGFSKMKLDQLTNYRMNCHQIMAFKGTNMLHKTVPRCRVTNS 751
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 397/970 (40%), Positives = 549/970 (56%), Gaps = 140/970 (14%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDD-------LLEDDVIDKRINSK----IHEGRI- 88
CKR K T+VNGFIVY+R ++++F+ + LLE+ + + SK + G I
Sbjct: 42 CKRIKTTQVNGFIVYTRTRKTKFTKLHEQGDENAGLLENRMSNHLEESKPTIGVTNGSIG 101
Query: 89 ------NKVVKNVLNENGILESVVEEE----------------NQLVQMTVENVIEETVK 126
N +KN E+ + VEE + LV + ++++ +
Sbjct: 102 ETNVSGNSCIKNTFVESPAGKIAVEERLVTGSLAESPAVETDSSSLVDVVIDDINFVELL 161
Query: 127 GKKAPICKEEPISKVEC-FPRKEGGSEV-----SNGLNKKCLKRPSAMKPKVE------- 173
++ P+ E +S+ F K G++V S +++K KR + K +
Sbjct: 162 HEEIPV---EILSEGSLDFEVKRLGTKVRTMGKSYSVSEK--KRHGSFKRTAQIYKSILR 216
Query: 174 --------PVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
P V V FG E L ++ L K I + ++P
Sbjct: 217 MKKVNNLVPENVEVLSEPDFGRE--------------GLDEQSHSVSLA-DKSILIRRRP 261
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
TV ELFETG+LDG+SVVYMG +K QA GLRGII+DGGILCSCS C+ VI SKFEIH
Sbjct: 262 ETVRELFETGILDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHVISTSKFEIH 321
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
ACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK F C RCKG
Sbjct: 322 ACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKCFTCKRCKGA 381
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGT---MTYTTGIRIRASGP-----KLVSRSSENDS 397
FP + +G G LC SC + + Q + M +T + P K+ + SE+ S
Sbjct: 382 FPFSSLGHRGF--LCMSCSEVETSQASPAAMWTSTSSPACIASPVKSRLKITRKPSESMS 439
Query: 398 MCISYQN--NKRERKKTRKLLEADLVSK---SSSKSVSLRNLLKTRSPWELTRNSSRPGL 452
+ + + R TRK L LV K S+S ++S +N +++ LT+ S P
Sbjct: 440 ISPVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSKFKKMLTQYSVTP-- 497
Query: 453 IANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPP----LSFPNKS------- 501
+ S +R ++T+K + L FE P L + +
Sbjct: 498 ---KAVKSVSLSVSSKKRSYRLTRKDQG--LHKLVFERGGLPEGTELGYYARGQKLLGGY 552
Query: 502 -----------RWNITPKDQRLHK----------LVFDESGLPDGTEVGYYACGQKLLEG 540
+ ++P H ++ +G+ ++ G+K
Sbjct: 553 KMGAGIYCYCCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSAN 612
Query: 541 YKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQ 594
N L +IC C+S P F H EC SL SIP+G+W+CKYC+N F + +
Sbjct: 613 DNNDLCVICADGGNLLLCDS--CPRAF--HIECVSLPSIPRGNWHCKYCENKFTSEIAGE 668
Query: 595 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 654
++ N+ G++ GVD V+Q RCIR+VKN+EAE +GC+LC G DF +SGFGPRTI++CD
Sbjct: 669 YNVNSSAVGQLEGVDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICD 728
Query: 655 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI- 713
QCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AE L + L I
Sbjct: 729 QCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSLGIIQ 788
Query: 714 KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 773
+K + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SGR+LIP
Sbjct: 789 RKQERTDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGRNLIPR 848
Query: 774 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 833
MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT + KGYFQLL
Sbjct: 849 MVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLL 908
Query: 834 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 893
F+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C Q+V FKG SM
Sbjct: 909 FSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASM 968
Query: 894 LQKRVPACRI 903
LQK V A +I
Sbjct: 969 LQKPVHAHQI 978
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 630 bits (1624), Expect = e-177, Method: Compositional matrix adjust.
Identities = 348/740 (47%), Positives = 462/740 (62%), Gaps = 67/740 (9%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGP---------- 386
F C RCKG FP + +G G LC SC + + Q ++ T R S P
Sbjct: 373 FTCKRCKGPFPFSSLGHRGF--LCKSCSEVETSQASLAAT---RTSTSAPACITSPVKSR 427
Query: 387 -KLVSRSSENDSMCISYQNN--KRERKKTRKLLEADLVSK---SSSKSVSLRNLLKTRSP 440
K+ + SE+ S+ + ++ RK TRK L LV K S+S +VS + +++
Sbjct: 428 LKITRKPSESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFK 487
Query: 441 WELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENAS-------- 492
LT++S P A + SV +S R + + K V +
Sbjct: 488 KMLTQHSVTPK--ALKSVSLSVSSKKRSYRLARKDQGLHKLVFDRGGLPEGTELGYYARG 545
Query: 493 ----------------------PPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY 530
P F + W ++ + ++ +G+
Sbjct: 546 QKLLGGYKMGAGIYCYCCKCEVSPSLFEAHAGW---ASRRKPYFYIYTSNGVSLHEWATT 602
Query: 531 YACGQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQ 584
++ G+K N L +IC C+S P F H EC SL SIP+G+W+CKYC+
Sbjct: 603 FSHGRKYSANDNNDLCVICADGGNLLLCDS--CPRAF--HIECVSLPSIPRGNWHCKYCE 658
Query: 585 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 644
N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +GC+LC G DF +SG
Sbjct: 659 NKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCVLCSGSDFCRSG 718
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
FGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AEK
Sbjct: 719 FGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEK 778
Query: 705 LPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 763
L + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD
Sbjct: 779 LSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVD 838
Query: 764 SISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
+SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT
Sbjct: 839 PLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMC 898
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
+ KGYFQLLF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C
Sbjct: 899 SREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICY 958
Query: 884 QLVTFKGTSMLQKRVPACRI 903
Q+V FKG SMLQK V + +I
Sbjct: 959 QMVRFKGASMLQKPVDSHQI 978
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGK 128
S V E N V+N + E+ GK
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGK 122
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 347/748 (46%), Positives = 442/748 (59%), Gaps = 132/748 (17%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C RCK F CV + R SE+
Sbjct: 373 FTCKRCKEGF----------------CVSHAR----------------------RPSEST 394
Query: 397 SMCISYQNN--KRERKKTRKLLEADLVSKSS-SKSVSLRNLLKTRSPWELTRNSSRPGLI 453
S+ + ++ RK TRK L LV K+ S S ++ + K RS ++
Sbjct: 395 SISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFK----------- 443
Query: 454 ANSTPVTSVHKSSQSQRQRKITKKSKKTV-LISKPFENASPPLSFPNKSRWNITPKDQRL 512
V +H + + KS + L+S P P KDQ L
Sbjct: 444 --KMLVFLLHNDVLMLAEPTLMIKSLILLHLVSLYVLKVDPTFCDPQGFE-----KDQGL 496
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLL---------------------------------- 538
HKLVFD GLP+GTE+GYYA GQ +
Sbjct: 497 HKLVFDRGGLPEGTELGYYARGQTYITVDRNCSEATKWALEYIVTVASASYFYIYTSNGV 556
Query: 539 ----------EGYK------NGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQG 576
G K N L +IC C+S P F H EC SL SIP+G
Sbjct: 557 SLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDS--CPRAF--HIECVSLPSIPRG 612
Query: 577 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 636
+W+CKYC+N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +G
Sbjct: 613 NWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNG----- 667
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN 696
SGFGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ
Sbjct: 668 ------SGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQK 721
Query: 697 LLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 755
LL+ AEKL + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFH
Sbjct: 722 LLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFH 781
Query: 756 DCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 815
DCFDPIVD +SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAEL
Sbjct: 782 DCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAEL 841
Query: 816 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 875
PLVAT + KGYFQLLF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE L
Sbjct: 842 PLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQL 901
Query: 876 SIYRKRCSQLVTFKGTSMLQKRVPACRI 903
S Y K C Q+V FKG SMLQK V + +I
Sbjct: 902 SKYIKICYQMVRFKGASMLQKPVDSHQI 929
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 294/515 (57%), Positives = 356/515 (69%), Gaps = 79/515 (15%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + N+S NKS IT KDQR
Sbjct: 1903 NNSPATSIHRTSERARLLKPIPVTKSSGSALY------NSSE-----NKSLGKITKKDQR 1951
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLS 571
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHA AS
Sbjct: 1952 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 2011
Query: 572 -------------------SIPQGDWY--------------------CKYCQNMFER--- 589
S+ +G Y C C F R
Sbjct: 2012 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 2071
Query: 590 ---------------------KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 628
++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 2072 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 2130
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 2131 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 2190
Query: 689 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 747
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 2191 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 2250
Query: 748 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 807
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 2251 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 2310
Query: 808 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
FGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPAAEEAE IWT KFGF
Sbjct: 2311 FGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGF 2370
Query: 868 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
KKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2371 KKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2405
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 140/380 (36%), Positives = 191/380 (50%), Gaps = 63/380 (16%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1589 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1628
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKE--- 148
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+
Sbjct: 1629 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1684
Query: 149 GGS-------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
GS E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1685 DGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1743
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G + +
Sbjct: 1744 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDG--RKKGY 1800
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1801 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1860
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1861 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1909
Query: 374 TYTTGIRIRASGPKLVSRSS 393
+ T R R P V++SS
Sbjct: 1910 IHRTSERARLLKPIPVTKSS 1929
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 293/515 (56%), Positives = 354/515 (68%), Gaps = 79/515 (15%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + S NKS IT KDQR
Sbjct: 371 NNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSE-----------NKSLGKITKKDQR 419
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLS 571
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHA AS
Sbjct: 420 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 479
Query: 572 -------------------SIPQGDWY--------------------CKYCQNMFER--- 589
S+ +G Y C C F R
Sbjct: 480 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 539
Query: 590 ---------------------KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 628
++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 540 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 598
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 599 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 658
Query: 689 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 747
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 659 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 718
Query: 748 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 807
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 719 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 778
Query: 808 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
FGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPAAEEAE IWT KFGF
Sbjct: 779 FGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGF 838
Query: 868 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
KKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 839 KKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 873
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 136/369 (36%), Positives = 188/369 (50%), Gaps = 61/369 (16%)
Query: 47 TKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVV 106
+ +G I YSR KR + LE+ D+R + + E ES
Sbjct: 68 NRWDGVIQYSRNKRLK------RLEESKNDER--------------RTIAEEPKDDESTT 107
Query: 107 EEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKE---GGS-------- 151
+EE Q T EN V+E+ G PIC+EEP S+ + K+ GS
Sbjct: 108 DEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESNDGSLKLQTAGL 163
Query: 152 -----EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSAL 204
E+ + +K KR SA+K K + VE L + F N ++ + +L
Sbjct: 164 IDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGVDEKTNGAVRSL 222
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
TSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G + + L+G I+ GI
Sbjct: 223 TSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDG--RKKGYRLQGTIKGNGI 279
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+ PL L+AT+
Sbjct: 280 LCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATI 339
Query: 325 QSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRAS 384
QSA+ S P ++S P K PL NSC+K T + T R R
Sbjct: 340 QSAIGSFPVKRS---------LPADEAAKM--DPLGNSCIKRNNSPATSIHRTSERARLL 388
Query: 385 GPKLVSRSS 393
P V++SS
Sbjct: 389 KPIPVTKSS 397
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 328/786 (41%), Positives = 468/786 (59%), Gaps = 67/786 (8%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
+ +QSA+ +++ C+ CKG P + G LC SC+ SKKPQ +
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIAML--LCCSCMDSKKPQAIDLLSL 458
Query: 378 GIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSS---SKSVSLRNL 434
+ D + I+ + N ++SKSS +KSVS R
Sbjct: 459 S--------HYYMKEFWADHLIITPKPN--------------VLSKSSDTITKSVSTRGK 496
Query: 435 LK---TRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENA 491
+ TR L + ++ + T V + ++ Q+ + K + + +
Sbjct: 497 IHGRITRKDLRLHKLVFEEDILPDGTEV-----AYYARGQKLLVGYKKGSGIFCSCCNSE 551
Query: 492 SPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICH- 550
P F + W + + LH ++ +G+ + G+K + L IC
Sbjct: 552 VSPSQFEAHAGW-ASRRKPYLH--IYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICAD 608
Query: 551 -----CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 605
CC+ P F H +C L IP G WYCKYCQN+F++++F++H+ANAV AGRV
Sbjct: 609 GGDLLCCDG--CPRSF--HRDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRV 664
Query: 606 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 665
+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++LCDQCE+EFHVGCL
Sbjct: 665 AGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCL 724
Query: 666 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETV 724
K++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L ++ KK ++
Sbjct: 725 KENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASI 784
Query: 725 SDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 783
+D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD IPSM+YGRN+RGQ
Sbjct: 785 NDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQ 844
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
EFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+GYFQ L+ACIE+ L F
Sbjct: 845 EFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGF 904
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
L VK++VLPAA+EAES+W +KFGF K+ PE + + KR Q++ F+GTSMLQK VP R+
Sbjct: 905 LNVKNLVLPAADEAESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGTSMLQKEVPKYRV 963
Query: 904 GSSSTD 909
+S+ +
Sbjct: 964 INSAAN 969
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 325/790 (41%), Positives = 466/790 (58%), Gaps = 75/790 (9%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ------- 370
+ +QSA+ +++ C+ CKG P + G LC SC+ S+KPQ
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIAML--LCCSCMDSRKPQVSSSPSP 458
Query: 371 ---GTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSK 427
+ T + R P ++S+SS D ++KS S
Sbjct: 459 SPSPSPTPIVFSKDRTPKPNVLSKSS-------------------------DTITKSVST 493
Query: 428 SVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKP 487
+ + TR L + ++ + T V + ++ Q+ + K + +
Sbjct: 494 RGKIHGRI-TRKDLRLHKLVFEEDILPDGTEV-----AYYARGQKLLVGYKKGSGIFCSC 547
Query: 488 FENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGI 547
+ P F + W + + LH ++ +G+ + G+K + L
Sbjct: 548 CNSEVSPSQFEAHAGW-ASRRKPYLH--IYTSNGVSLHELSISLSKGRKFSLTDNDDLCS 604
Query: 548 ICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
IC CC+ P F H +C L IP G WYCKYCQN+F++++F++H+ANAV
Sbjct: 605 ICADGGDLLCCDG--CPRSF--HRDCVPLPCIPTGIWYCKYCQNLFQKEKFVEHNANAVA 660
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
AGRV+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++LCDQCE+EFH
Sbjct: 661 AGRVAGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFH 720
Query: 662 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNS 720
VGCLK++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L ++ KK
Sbjct: 721 VGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQG 780
Query: 721 LETVSDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN 779
+++D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD IPSM+YGRN
Sbjct: 781 SASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRN 840
Query: 780 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 839
+RGQEFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+GYFQ L+ACIE+
Sbjct: 841 IRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIER 900
Query: 840 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
L FL VK++VLPAA+EAES+W +KFGF K+ PE + + KR Q++ F+GTSMLQK VP
Sbjct: 901 FLGFLNVKNLVLPAADEAESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGTSMLQKEVP 959
Query: 900 ACRIGSSSTD 909
R+ +S+ +
Sbjct: 960 KYRVINSAAN 969
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 267/477 (55%), Positives = 317/477 (66%), Gaps = 87/477 (18%)
Query: 490 NASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIIC 549
N SP S S +DQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C
Sbjct: 1968 NNSPATSIHRTSE---RARDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFC 2024
Query: 550 HCCNSEVSPSQFEAHAECASLS-------------------SIPQGDWY----------- 579
CC+ EVS SQFEAHA AS S+ +G Y
Sbjct: 2025 WCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSI 2084
Query: 580 ---------CKYCQNMFER------------------------KRFLQHDANAVEAGRVS 606
C C F R ++F++H+ANAV AGRVS
Sbjct: 2085 CGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVS 2144
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 666
GVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGFGPRTI+LCDQ
Sbjct: 2145 GVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQ----------- 2192
Query: 667 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVS 725
ELP GKWFCC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++
Sbjct: 2193 --------ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIA 2244
Query: 726 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 785
D +VRWRLLSGK A+PETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F
Sbjct: 2245 DYNVRWRLLSGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDF 2304
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G+YCA++TVNS VVSAGILRVFGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL
Sbjct: 2305 SGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLN 2364
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
V+S VLPAAEEAE IWT KFGFKKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2365 VRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2421
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 137/369 (37%), Positives = 184/369 (49%), Gaps = 63/369 (17%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1654 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1693
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKE--- 148
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+
Sbjct: 1694 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1749
Query: 149 GGS-------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
GS E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1750 DGSLKLQTAXLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1808
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1809 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1865
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1866 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1925
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1926 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1974
Query: 374 TYTTGIRIR 382
+ T R R
Sbjct: 1975 IHRTSERAR 1983
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 239/364 (65%), Positives = 301/364 (82%), Gaps = 8/364 (2%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+ G ++C C P F H ECASLSSIP+GDWYC++CQNMF+R++F+ ++ NA
Sbjct: 369 WDGGNLLLCDGC-----PRAF--HKECASLSSIPRGDWYCQFCQNMFQREKFVAYNVNAF 421
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
AGRV GVD +EQITKRCIRIVK+++AELS C LCRG DFSKSGFGPRTI+LCDQCE+E+
Sbjct: 422 AAGRVEGVDPIEQITKRCIRIVKDIDAELSACALCRGVDFSKSGFGPRTIILCDQCEKEY 481
Query: 661 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK-YAGN 719
HVGCL+ HKM L+ELPKG W CC DC+RI+S L+N+LV+ AE+LP+ L IKK
Sbjct: 482 HVGCLRDHKMTFLKELPKGNWLCCNDCTRIHSTLENVLVRGAERLPKSLLAVIKKKQEEK 541
Query: 720 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN 779
L+ ++DI+VRWRLLSGK A+PETR LL +AV+IFH+CFDPIVD++SGRDLI +MVYG++
Sbjct: 542 GLDPINDINVRWRLLSGKKASPETRPLLLEAVSIFHECFDPIVDAVSGRDLIRAMVYGKS 601
Query: 780 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 839
+RGQEFGGMYCA+L VNSSVVSAG+LR+FG ++AELPLVATS HGKGYFQ LF+CIE+
Sbjct: 602 VRGQEFGGMYCALLIVNSSVVSAGMLRIFGTDIAELPLVATSNSQHGKGYFQALFSCIER 661
Query: 840 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
LL+F++VK++VLPAAEEA+SIWTDKFGF KI P+ L+ YR+ C+Q VTF+GT+ML K VP
Sbjct: 662 LLAFMKVKNLVLPAAEEAQSIWTDKFGFSKIKPDELANYRRNCNQFVTFQGTNMLHKMVP 721
Query: 900 ACRI 903
CR+
Sbjct: 722 PCRV 725
Score = 248 bits (634), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 199/517 (38%), Positives = 270/517 (52%), Gaps = 91/517 (17%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
C RCKG FP +C+ + GP+C+SCV+S K + + RIR+ P LVS+SS
Sbjct: 118 TCKRCKGRFPSSCMERV--GPICSSCVESSKSEESSKNVVSKRIRSPRPVLVSKSSCASE 175
Query: 398 MCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANST 457
M IS + +R RK+ + + + S S SV + K + S + +NS
Sbjct: 176 MSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILPRRKVTPKTKKKSLSVKLKTTSNSN 235
Query: 458 PVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVF 517
+ S Q + + KITKK L+ P SFP T D RLHKLVF
Sbjct: 236 CL-----SPQIKSEWKITKK-----LV---------PYSFP-------TCGDNRLHKLVF 269
Query: 518 DESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLS 571
+E+GLPDG+E+ YYA G QKLLEG+K G GI+C CCN+E+SPSQFE HA AS
Sbjct: 270 EENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRK 329
Query: 572 SIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG 631
P Y ++ E L D + S D+ +
Sbjct: 330 K-PYAYIYTSNGVSLHELSISLSKDR------KYSANDNDDL------------------ 364
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C++C + +LLCD C R FH C A L +P+G W+C
Sbjct: 365 CVVC---------WDGGNLLLCDGCPRAFHKEC------ASLSSIPRGDWYC-------- 401
Query: 692 SVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 728
QN+ + EK +++NA +A +E V I+
Sbjct: 402 QFCQNMF--QREKFVAYNVNA---FAAGRVEGVDPIE 433
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 311/793 (39%), Positives = 434/793 (54%), Gaps = 142/793 (17%)
Query: 184 GFGNESMSLIEVEAIA---EGSALTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDG 239
G E L++V+ A E SP K + K L P + ++F+ G+L+G
Sbjct: 333 GNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEG 392
Query: 240 VSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
+ V Y+ G K + +G L+G+I+ G+LC CS C G +V+ P+ FE+HA +R +Y
Sbjct: 393 LIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIGIQVVSPAMFELHASSNNKRPPEY 452
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGP 356
I E+G +L +V+ AC+ PL L+ L+ + + +KS C+ C+G C T
Sbjct: 453 ILLESGFTLRDVMNACKENPLATLEEKLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKS 509
Query: 357 GPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
+C SC++SK+P+ + N+ +
Sbjct: 510 LVVCKSCLESKEPE-------------------------------FHNSPSKAN------ 532
Query: 417 EADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITK 476
D ++ SS SV +++L+ R+ S P QS R+ + T+
Sbjct: 533 --DALNGSSRPSVDPKSILR--------RSKSSP---------------RQSNRREQPTR 567
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQK 536
KS + ++ + S S + S +T KD RLHKLVF++ LPDGTEVGY+ G+K
Sbjct: 568 KSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEK 627
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECAS------------------LS------- 571
+L GYK G GI C CCN VSPS FEAHA CAS LS
Sbjct: 628 MLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQ 687
Query: 572 --SIPQGDWYCKYCQ------------------------------------NMFERKRFL 593
SI + D C C+ NM ER++F+
Sbjct: 688 RFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSCKYCVNMVEREKFV 747
Query: 594 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILL 652
+ NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG F + GF RT+++
Sbjct: 748 DSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVII 807
Query: 653 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 712
CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL+V+ EKL LN
Sbjct: 808 CDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNF 867
Query: 713 IKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISG 767
++K + E D D+RWR+LSGK ++ +T++LL++A++I H+ FDPI +S +
Sbjct: 868 LRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALSILHERFDPISESGTK 927
Query: 768 RDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 827
DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+AELPLVATSK G+
Sbjct: 928 GDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQ 987
Query: 828 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 887
GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+ E + YRK S ++
Sbjct: 988 GYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI- 1046
Query: 888 FKGTSMLQKRVPA 900
F GTSML+K VPA
Sbjct: 1047 FHGTSMLRKSVPA 1059
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 493 bits (1270), Expect = e-136, Method: Compositional matrix adjust.
Identities = 291/732 (39%), Positives = 412/732 (56%), Gaps = 109/732 (14%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVI 277
L P + E+F G+L+G++V Y+ G K + +G L+G+I+ G+LC C C G +V+
Sbjct: 363 LRNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVV 422
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPE-EKS 336
+ +E+HA +R +YI E+G +L +V+ AC+ P L+ L+ + P +KS
Sbjct: 423 STAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVG--PNLKKS 480
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIRIRASGPKLVSRSSE 394
C+ C+G C T +C SC++SK+P+ + + G +S P + +S
Sbjct: 481 SLCLNCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSIL 538
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+ S Q+N++E+ TR S+ PG++
Sbjct: 539 SRSKSSPRQSNRQEQP---------------------------------TRKSTEPGVVP 565
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
+ S S +S Q K+T+ KD RLHK
Sbjct: 566 GTILSESKSSSIKSNSQGKLTR-------------------------------KDVRLHK 594
Query: 515 LVFDESGLPDGTEVGYYACGQ----------------KLLEGYKNGLGIICHCCNSEVSP 558
LVF++ LPDGTEVGY+ G+ K + G+ H + +S
Sbjct: 595 LVFEDDILPDGTEVGYFVAGEVSPSSFEAHAGCASRRKPFQHIYTTNGVSLHELSVALSM 654
Query: 559 SQ-FEAHAE-----------CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
Q F H CASLSS+P W CKYC NM ER++F+ + NA+ AGRV
Sbjct: 655 DQRFSIHENDDLCSICRDGVCASLSSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQ 714
Query: 607 GVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 665
GVD++ +IT RCIR+V + EL S C+LCRG F + GF RT+++CDQCE+EFHVGCL
Sbjct: 715 GVDAIAEITNRCIRVVSSFGTELPSVCVLCRGHSFCRLGFNSRTVIICDQCEKEFHVGCL 774
Query: 666 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 725
K+H +ADL+ELP+ KWFC +DC +IN+ L NL+++ EKL LN I+ + E+
Sbjct: 775 KEHNIADLKELPEEKWFCSVDCEKINTTLGNLIIRGEEKLTNNILNFIRTKEKPNEESCP 834
Query: 726 DI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNL 780
D D+RWR+LSGK ++ ET++LL++AV+I H+ FDPI ++ + DLIP+MVYGR
Sbjct: 835 DDNTTPDIRWRVLSGKLTSSDETKILLAKAVSILHERFDPISETGTRGDLIPAMVYGRQA 894
Query: 781 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 840
+GQ+F GMYC +L V+ +VS GI RVFG E+AELPLVATSK G+GYFQ LFACIE+L
Sbjct: 895 KGQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERL 954
Query: 841 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
L FL VK IVLPAA+EA+SIWTDKFGF K+ E + YRK S ++ F GTSML+K VPA
Sbjct: 955 LGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI-FHGTSMLRKSVPA 1013
Query: 901 CRIGSSSTDSTE 912
S + S E
Sbjct: 1014 PSAPSKTEGSKE 1025
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 285/738 (38%), Positives = 424/738 (57%), Gaps = 52/738 (7%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
E EA AE S L +P + + S+ L K P + +L TG+L+G+ V+YM G K +
Sbjct: 286 ETEASAEASLLMTPPSSAKFSNSR---LKKFPSKLKDLLATGILEGLPVMYMKGAKVLFA 342
Query: 254 G---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLE 307
G L+G+I+D G+LC C +CNG V+ P+ FE+HA +R +YI +G K+L +
Sbjct: 343 GEKGLQGVIQDSGVLCFCKICNGVEVVTPTVFELHAGSANKRPPEYIYIHDGNCGKTLRD 402
Query: 308 VLRACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVK 365
V+ AC PL + +Q L +KS C+ C+G C G + +C+ C+
Sbjct: 403 VMNACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSKL--VCDLCLA 456
Query: 366 SKKPQGTMT----YTTGIRIRASGPKLVSRSSENDSMCISYQN----NKRERKKTRKLLE 417
S PQ M + ++ R+ P ++ +S +N+ S N N + + L+
Sbjct: 457 SP-PQTAMASRKVISQPVQPRSPEPVVIQKSLDNEVQPNSLDNEVPPNSLDNEVQPNSLD 515
Query: 418 ADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPV--TSVHKSSQSQRQRKIT 475
+ KS S + + + +S LTR R + V + + Q+ +
Sbjct: 516 TGVQPKSFSNGMK-HSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLV 574
Query: 476 KKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP-DGTEVGYYACG 534
K + F + W + + LH ++ +G+ +
Sbjct: 575 GYKKGCGIFCTCCNEQVSASQFEAHAGW-ASRRKPYLH--IYTSNGISLHELSISLSKDH 631
Query: 535 QKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFE 588
++ + L IIC CC+ P F H +C L IP G WYCKYCQN+F+
Sbjct: 632 RRFSNNDNDDLCIICEDGGDLLCCDG--CPRAF--HIDCVPLPCIPSGSWYCKYCQNVFQ 687
Query: 589 RKRFLQHDANAVEA-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 647
+ R QH+ NA+ A GR++G D +E + KRCIR+VK +E + GC LC +FSKS FGP
Sbjct: 688 KDRHGQHEVNALAAAGRIAGPDILELMNKRCIRVVKTVEVDHGGCALCSRPNFSKS-FGP 746
Query: 648 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 707
RT+++CDQCE+E+HVGCLK+H M +L +LP+G WFC +CS I++ L +L+ + + +P+
Sbjct: 747 RTVIICDQCEKEYHVGCLKEHNMENLEKLPEGNWFCSGNCSHIHTALTDLVASKEKDVPD 806
Query: 708 FHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATP-----ETRLLLSQAVAIFHDCFDPI 761
L+ IKK + SLE + +DV+WR+++ K + ETR LLS+AVAIFH+ FDPI
Sbjct: 807 PLLSLIKKKHEEKSLEIGAGLDVKWRVMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPI 866
Query: 762 VDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 821
VDS SGRD IP+M++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+
Sbjct: 867 VDSTSGRDFIPTMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATT 926
Query: 822 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR 881
+ G+GYFQ LF+CIE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y+K
Sbjct: 927 ADHQGQGYFQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKYKKF 986
Query: 882 CSQLVTFKGTSMLQKRVP 899
+++ F+GTS+LQK VP
Sbjct: 987 -YRMMIFQGTSVLQKPVP 1003
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 315/860 (36%), Positives = 472/860 (54%), Gaps = 83/860 (9%)
Query: 98 ENGILESVVEEENQLVQMTV---ENVIEETVKGKKAPICKEEPISKVECFPRKEGG---S 151
E +L+ V+ EE +V T+ E ++ ET+K + E+P+ E + G +
Sbjct: 142 EPKVLDDVINEEEAIVAETLKEQEPIVPETLKEEVVDEMAEQPLCIEESEEKDSNGVALA 201
Query: 152 EVSNGLN-----KKCLKRP--------SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAI 198
V++G KK L+RP SA+K K E E G +S
Sbjct: 202 LVNDGAKGKKSMKKRLERPQSERRFTRSALKVKSEET----NDGEHVGVAGISDGVKRET 257
Query: 199 AEGSAL--TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIK--FQAS- 253
G++L T+P +K S + L K P + +L TG+L+G+ V+YM G+K F
Sbjct: 258 EAGASLVMTTPSS---VKFSNRGKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEK 314
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLEVLR 310
GL+G+I+D G+LC C +C G V+ P+ FE+HA +R +YI +G K+L +V+
Sbjct: 315 GLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMN 374
Query: 311 ACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKK 368
AC PL + +Q L +KS C+ C+G C G + +C+SC+ S
Sbjct: 375 ACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSRL--VCDSCLVSPA 428
Query: 369 PQGTMTYTTGI----RIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKL--------L 416
Q + GI + R+ P ++ +S +N+ S N + K + +
Sbjct: 429 -QTAVASNKGISQPVQPRSPEPVVIQKSLDNEVQPNSLHNEVQPNKLDTGMQPNSLDNGM 487
Query: 417 EADLVSKSSSKSVSLRNLLKT-----RSPWELTRNSSRPGLIANSTPV--TSVHKSSQSQ 469
E D ++ +S K S N +K +S LTR R + V + +
Sbjct: 488 EPDSLN-NSMKPKSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAH 546
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP-DGTEV 528
Q+ + K + F + W + + LH ++ +G+ +
Sbjct: 547 GQKLLVGYKKGYGIFCTCCNEQVSASQFEAHAGW-ASRRKPYLH--IYTSNGISLHELSI 603
Query: 529 GYYACGQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKY 582
++ + L IIC CC+ P F H +C L IP G WYCKY
Sbjct: 604 SLSKDHRRFSNNDNDDLCIICEDGGDLLCCDG--CPRAF--HIDCVPLPCIPSGTWYCKY 659
Query: 583 CQNMFERKRFLQHDANAVEA-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 641
CQN+F++ R QH+ NA+ A GR++G D +E + KRCIR+V+ LE + GC LC +FS
Sbjct: 660 CQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSRPNFS 719
Query: 642 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 701
KS FGP+T+++CDQCE+E+HVGCLK H M +L ELP G WFC +CS+I++ L +L+ +
Sbjct: 720 KS-FGPQTVIICDQCEKEYHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDLVASK 778
Query: 702 AEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCF 758
+ +P+ LN IKK + SL+ + +DV+WR+++ K + + ETR LLS+AVAIFH+ F
Sbjct: 779 EKDVPDPLLNLIKKKHEEKSLDIGAGLDVKWRVINWKLDSDSVETRKLLSKAVAIFHERF 838
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 818
DPIVDS SGRD IP+M++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLV
Sbjct: 839 DPIVDSTSGRDFIPAMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIAELPLV 898
Query: 819 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 878
AT+ + G+GYFQ LF+CIE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y
Sbjct: 899 ATTADHQGQGYFQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKY 958
Query: 879 RKRCSQLVTFKGTSMLQKRV 898
K+ +++ F+GTS+LQK V
Sbjct: 959 -KKFYRMMIFQGTSVLQKPV 977
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 468 bits (1203), Expect = e-128, Method: Compositional matrix adjust.
Identities = 245/474 (51%), Positives = 306/474 (64%), Gaps = 72/474 (15%)
Query: 500 KSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS 559
KS+ +T KD R+HKLVF+E LPDGTEV YY+ GQKLL GYK G GI C CCN+EVSPS
Sbjct: 564 KSQGRLTVKDLRMHKLVFEEDVLPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSPS 623
Query: 560 QFEAHAECA----------------------SLS-----SIPQGDWYCKYCQ-------- 584
QFEAHA A SLS S Q D C+ C+
Sbjct: 624 QFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLCC 683
Query: 585 ----------------------------NMFERKRFLQHDANAVEAGRVSGVDSVEQITK 616
N F++++F++H+ANA+ AGRV+GVD ++QIT+
Sbjct: 684 DVCPRAYHKDCLALPEIPTGRWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITR 743
Query: 617 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
RCIRIVK ++A+ GC+ CRG DF K FGPRT+LLCDQCE+EFHVGCLK H M DL+EL
Sbjct: 744 RCIRIVKTMDADFGGCVFCRGHDFDKI-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKEL 802
Query: 677 PKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV--SDIDVRWRLL 734
PKG WFCC DC RI+S L+ L+++ E+L + LN I K + S+IDVRWRLL
Sbjct: 803 PKGNWFCCSDCCRIHSALEKLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLL 862
Query: 735 SGKA-ATPETRLLLSQAVAIFHDCFDPIV----DSISGRDLIPSMVYGRNLRGQEFGGMY 789
+ K +T LLS+A+AI H+ F+PI+ S + RDLI SMV+G NL+GQEFGGMY
Sbjct: 863 NDKINPAGDTAALLSEALAILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGMY 922
Query: 790 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 849
CA+L +N +VVS I+R FG E+AELPLVATS GKGYFQ LF CIEKLL FL +K++
Sbjct: 923 CAVLMINQAVVSCAIIRFFGLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNL 982
Query: 850 VLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
VLPAAEEAESIW +KFGF+K+ E +RK Q++ F+GTSML K VP RI
Sbjct: 983 VLPAAEEAESIWINKFGFRKLTHEEFLKFRKD-YQMMVFQGTSMLHKPVPKIRI 1035
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 91/180 (50%), Gaps = 5/180 (2%)
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRG 257
GS + LK+ K + K P + +L ++G+L+G V YM G K + +G L+G
Sbjct: 346 GSPSAASNSGTMLKVWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQG 405
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I ILC C C G V+ PS FE+HA +R +YI ENG +L +V+ AC++ L
Sbjct: 406 VISGSAILCFCRSCRGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASL 465
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
L L + + S C++C+G G++ LC+ C+ K Q ++ TT
Sbjct: 466 ETLDEALWLSTGCSSLKNSTFCLKCRGKLAEASTGRS--MTLCSQCMVLKDSQASIPATT 523
Score = 39.7 bits (91), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 26/148 (17%)
Query: 519 ESGLPDGTEVGYY------ACGQKLLEGYKNGLGIICHC----CNSEVSPSQFEAHAECA 568
+SG+ +G +V Y G+ +L+G +G I+C C N V+PS FE HA
Sbjct: 378 DSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSCRGNEVVTPSIFEVHA--G 435
Query: 569 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 628
S + P Y E L+ NA + + +D ++ C + +
Sbjct: 436 SANKRPPEYIY-------LENGNTLRDVMNACKNASLETLDEALWLSTGCSSLKNS---- 484
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQC 656
+ CL CRG +++ G R++ LC QC
Sbjct: 485 -TFCLKCRG-KLAEASTG-RSMTLCSQC 509
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 247/468 (52%), Positives = 303/468 (64%), Gaps = 71/468 (15%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD R HKLVF+E LPDGTEVGYY G+KLL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 501 VTKKDIRTHKLVFEEEVLPDGTEVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAH 560
Query: 565 AECAS---------------------------LSSIPQGDWYCKYCQ------------- 584
A AS SI + D C+ C+
Sbjct: 561 AGWASRRKPYLNIYTSNGVSLHELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPR 620
Query: 585 -----------------------NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 621
N FE+++F++ +ANA+ AGRV+GVD +EQIT+RCIRI
Sbjct: 621 AFHQECLSLPSIPRGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRI 680
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
VK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+HKM DL+ELPKGKW
Sbjct: 681 VKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKW 739
Query: 682 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 741
FCC C RI+S LQ L+++ EKLP+ LN IKK+ ++ E+ DVRWRLLS K +
Sbjct: 740 FCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKHEESASESGCSDDVRWRLLSKKTDSS 799
Query: 742 E-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEFGGMYCAILTV 795
+ T LLS AVAIFH+CFDPI S R D IPSMV G N++GQ+ GGMYCA+L V
Sbjct: 800 DVTEALLSDAVAIFHECFDPITVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLV 859
Query: 796 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 855
N VVS ++R+FGQE+AELP+VATS G+GYFQ LF CIEKLL FL VK++VLPAAE
Sbjct: 860 NHVVVSVAVVRIFGQELAELPIVATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAE 919
Query: 856 EAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
E SIW +KFGF I + L YR+R Q++ F+G MLQK VP CRI
Sbjct: 920 EVGSIWKNKFGFGAITQDELMEYRRR-YQIMVFQGALMLQKPVPKCRI 966
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/350 (29%), Positives = 163/350 (46%), Gaps = 42/350 (12%)
Query: 68 DLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVV--EEENQLVQMTVENVIEETV 125
++ E++V+ + ++ EG K+ E GI V+ E+ N +V E + +
Sbjct: 157 EMEENEVLGSKSGVEVKEG-----YKDHPCEEGISGLVLMDEDSNAIVNRAFERKNDCEL 211
Query: 126 KGKKAPICKEEPISKVECFPRKEG-GSEVSNGLNK-------KC---LKRP------SAM 168
K A +EE S + K G G +V+N L+ KC ++P SA+
Sbjct: 212 KKDDA---REEGTSGLSSVLVKNGEGGDVNNSLHPVVVDGDIKCKVEAEKPFRRFTRSAL 268
Query: 169 KPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTV 228
KPK+E V++ + S+G ++V+ SA + N KM K P +
Sbjct: 269 KPKIETVDI--SSSDG--------VKVDDRGSSSAAAATTTNTPTKMFSIDGSKKFPTKL 318
Query: 229 TELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
+L ++G+L+G V Y+ G K + GL G++R+ GILC C C G V+ P+ F +H
Sbjct: 319 KDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVTPAIFVLH 378
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
A +R +YIC ENG +L +V+ AC++ L L ++ + P +KS C C+G+
Sbjct: 379 AGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNFCWNCRGS 438
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEN 395
IT G LC+ C K Q T + R + P V SS N
Sbjct: 439 --ITGAGSRKSKVLCSQCFGLKDFQAGSAPKTAKKERTAKPHSVPESSCN 486
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 295/792 (37%), Positives = 418/792 (52%), Gaps = 138/792 (17%)
Query: 154 SNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGN-------ESMSLIEVEAIA---EGSA 203
S G++KK + + KP LV Q N E L++V+ A E
Sbjct: 297 SQGVDKKAVN-DTVDKPLRRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDG 355
Query: 204 LTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDG 262
SP K + K L P + ++F+ G+L+G+ V Y+
Sbjct: 356 FQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEGLIVYYV----------------- 398
Query: 263 GILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKA 322
G +V+ P+ FE+HA +R +YI E+G +L +V+ AC+ PL L+
Sbjct: 399 ---------RGAKVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEE 449
Query: 323 TLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIR 382
L+ + + +KS C+ C+G C T +C SC++SK+P+
Sbjct: 450 KLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKSLVVCKSCLESKEPE------------ 494
Query: 383 ASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWE 442
+ N+ + D ++ SS SV +++L+
Sbjct: 495 -------------------FHNSPSKAN--------DALNGSSRPSVDPKSILR------ 521
Query: 443 LTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSR 502
R+ S P QS R+ + T+KS + ++ + S S + S
Sbjct: 522 --RSKSSP---------------RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSH 564
Query: 503 WNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ----------------KLLEGYKNGLG 546
+T KD RLHKLVF++ LPDGTEVGY+ G+ K + G
Sbjct: 565 GKLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEVSPSTFEAHAGCASRRKPFQHIYTTNG 624
Query: 547 IICHCCNSEVSPSQ-FEAHAE-----------CASLSSIPQGDWYCKYCQNMFERKRFLQ 594
+ H + +S Q F H CASL S+P W CKYC NM ER++F+
Sbjct: 625 VSLHELSVALSMDQRFSIHENDDLCSICRDGVCASLPSLPSERWSCKYCVNMVEREKFVD 684
Query: 595 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLC 653
+ NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG F + GF RT+++C
Sbjct: 685 SNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIIC 744
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI 713
DQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL+V+ EKL LN +
Sbjct: 745 DQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFL 804
Query: 714 KKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGR 768
+K + E D D+RWR+LSGK ++ +T++LL++A++I H+ FDPI +S +
Sbjct: 805 RKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALSILHERFDPISESGTKG 864
Query: 769 DLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKG 828
DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+AELPLVATSK G+G
Sbjct: 865 DLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQG 924
Query: 829 YFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTF 888
YFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+ E + YRK S ++ F
Sbjct: 925 YFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI-F 983
Query: 889 KGTSMLQKRVPA 900
GTSML+K VPA
Sbjct: 984 HGTSMLRKSVPA 995
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 457 bits (1176), Expect = e-125, Method: Compositional matrix adjust.
Identities = 242/480 (50%), Positives = 307/480 (63%), Gaps = 77/480 (16%)
Query: 499 NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSP 558
+ S+ +T KD R+HKLVF+E LPDGTEVGYY+ G+KLL GYK G GI C CCN+EVSP
Sbjct: 471 SNSQGRLTKKDIRMHKLVFEEEVLPDGTEVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSP 530
Query: 559 SQFEAHAECAS---------------------------LSSIPQGDWYCKYCQ------- 584
SQFEAHA AS S + D C+ C+
Sbjct: 531 SQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKCRRHSTKENDDLCQICRDGGKLLC 590
Query: 585 -----------------------------NMFERKRFLQHDANAVEAGRVSGVDSVEQIT 615
N FE+++F++ +ANA+ AGRV+G D +EQIT
Sbjct: 591 CDVCPRAFHQECLSLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQIT 650
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR- 674
+RCIRIVK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+H+M DL+
Sbjct: 651 RRCIRIVKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHQMQDLKA 709
Query: 675 --ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRW 731
ELP GKWFCC C RI+S LQ L+++ EKLP+ LN I KK+ ++ E+ D+RW
Sbjct: 710 ICELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKKHEESASESGGGDDIRW 769
Query: 732 RLLSGKAATPE-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEF 785
RLLS K + T LLS+AVAIFH+ F PI S R D IPSMV G +++GQ+
Sbjct: 770 RLLSKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQDL 829
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
GGMYCA+L VN VVSA ++R+FGQE+AELP+VATS + G+GYFQ LF CIEKLL FL
Sbjct: 830 GGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKSQGQGYFQTLFTCIEKLLGFLN 889
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKI--DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
VK++VLPAAEE ESIWT+KFGF I D L YRK Q++ F+G+ MLQK VP CR+
Sbjct: 890 VKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKS-YQIMEFQGSLMLQKPVPKCRV 948
Score = 119 bits (298), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 120/231 (51%), Gaps = 17/231 (7%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+EP+++ + S+G + V AI T+P K + L K P
Sbjct: 244 SALKPKIEPLDI--SSSDGVKVDDTGSSSVAAIT-----TTPTKMFAID-----GLKKFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++++ GILC C C G V+ P+ F
Sbjct: 292 TKLKDLLDSGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIF 351
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA +R +YI ENG +L +V+ AC++ L +L ++ ++ P +KS C+ C
Sbjct: 352 ELHAGSANKRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSC 411
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
+G+ IT G LC+ C++ K Q + T + R P V SS
Sbjct: 412 RGS--ITGAGTRKSKVLCSQCLELKDSQAILAPETDTKERTPRPSPVPESS 460
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 239/466 (51%), Positives = 295/466 (63%), Gaps = 67/466 (14%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
P+K +T KD R+HKLVF+ LPDGT + YY G+KLLEGYK G I C+CC SEVS
Sbjct: 581 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGKKLLEGYKKGGAIFCYCCQSEVS 640
Query: 558 PSQFEAHAECAS---------------------------LSSIPQGDWYCKYCQN----- 585
PSQFEAHA CAS SS + D C C +
Sbjct: 641 PSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGDLL 700
Query: 586 -------------------------------MFERKRFLQHDANAVEAGRVSGVDSVEQI 614
MF +++F ANA+ AGRV+G+D++EQI
Sbjct: 701 CCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQI 759
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
TK IRIV L AE+ C+LCR DFS SGFGP+T+++CDQCE+E+HV CL++H M DL+
Sbjct: 760 TKCSIRIVDTLHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLK 819
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 733
ELPK KWFCC +C+ I+ LQ L+ + LP+ + I +K +LE S DV+WRL
Sbjct: 820 ELPKDKWFCCKECNSIHYALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSINDVKWRL 879
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR-DLIPSMVYGRNLRGQEFGGMYCAI 792
LSGK +T ETR+ LS AV+IFHD FDPI DS + R DLIP+MVYGRN + Q+FGGM CAI
Sbjct: 880 LSGKNSTEETRVWLSGAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLCAI 939
Query: 793 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 852
L VNS VVSAG++R+FG+EVAELPLVATS GKGYFQ LF IE LL L VK +VLP
Sbjct: 940 LMVNSLVVSAGVIRIFGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLVLP 999
Query: 853 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
AAEEAESIWT KFGF+ I PE L Y+ QL+ F+GT+MLQK+V
Sbjct: 1000 AAEEAESIWTKKFGFQHITPEELKHYKDN-YQLMIFQGTAMLQKQV 1044
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 105/176 (59%), Gaps = 6/176 (3%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLR 256
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y+ K + GL+
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQ 404
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
G+I+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 405 GVIQGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAP 464
Query: 317 LPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 465 ADALEAAIRNATGAGDARKSTFCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 518
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/688 (39%), Positives = 373/688 (54%), Gaps = 135/688 (19%)
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI E+G +L +V+ AC+ PL L+ L+ + + ++ S
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSS----- 56
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
LC SC GP + +++ +C S
Sbjct: 57 -----------------LCLSC--------------------QGPMIEPCDTKSLVVCKS 79
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+K D ++ SS SV +++L+ R+ S P
Sbjct: 80 CLESKEPEFHNSPSKANDALNGSSRPSVDPKSILR--------RSKSSP----------- 120
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
QS R+ + T+KS + ++ + S S + S +T KD RLHKLVF++
Sbjct: 121 ----RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDI 176
Query: 522 LPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECAS------------ 569
LPDGTEVGY+ G+K+L GYK G GI C CCN VSPS FEAHA CAS
Sbjct: 177 LPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTN 236
Query: 570 ------LS---------SIPQGDWYCKYCQ------------------------------ 584
LS SI + D C C+
Sbjct: 237 GVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERW 296
Query: 585 ------NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRG 637
NM ER++F+ + NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG
Sbjct: 297 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRG 356
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL
Sbjct: 357 HSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNL 416
Query: 698 LVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVA 752
+V+ EKL LN ++K + E D D+RWR+LSGK ++ +T++LL++A++
Sbjct: 417 IVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALS 476
Query: 753 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 812
I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+
Sbjct: 477 ILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSEL 536
Query: 813 AELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP 872
AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+
Sbjct: 537 AELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTD 596
Query: 873 ELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
E + YRK S ++ F GTSML+K VPA
Sbjct: 597 EEVKEYRKDYSVMI-FHGTSMLRKSVPA 623
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 206/351 (58%), Positives = 264/351 (75%), Gaps = 8/351 (2%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ P F H EC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV+GVD
Sbjct: 479 CCDG--CPRVF--HKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDP 534
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+EQITKRCIRIV E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL++H M
Sbjct: 535 IEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDM 594
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDV 729
DL+E+PKGKWFCC DC RINS LQ L+V E+LP L IK KY N D D+
Sbjct: 595 DDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDI 654
Query: 730 RWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGM 788
+WRL+ G +A++ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G++ R +FGGM
Sbjct: 655 KWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHGKSTREWDFGGM 713
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
YCAILT++S VVSA R+FG+EVAELPLVAT G+GYFQ LF+C+E LL L V+S
Sbjct: 714 YCAILTISSQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRS 773
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+VLPAAE AESIWT+KFGF K+ E + +R R Q+VTF+GT MLQK VP
Sbjct: 774 LVLPAAEGAESIWTNKFGFNKVTQEQRNNFR-RDYQMVTFQGTLMLQKLVP 823
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/210 (40%), Positives = 101/210 (48%), Gaps = 52/210 (24%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD LHKLVF E+GLP+GTEVGYY GQ+LL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 366 VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQLLVGYKRGSGIFCTCCNSEVSPSQFEAH 425
Query: 565 AECASLSSIPQGDWYCKYCQNMFERKRFLQ-HDANAVEAGRVSGVDSVEQITKRCIRIVK 623
A AS RK +L + +N V S+ R I +
Sbjct: 426 AGWAS------------------RRKPYLHIYTSNGVSLHEF----SISLSRGREISVSD 463
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
N + LC C +L CD C R FH C+ L +PKGKWFC
Sbjct: 464 NDD-------LCSIC------LDGGNLLCCDGCPRVFHKECVS------LANIPKGKWFC 504
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI 713
N++LQ EK E + NA+
Sbjct: 505 KF----CNNMLQK------EKFVEHNANAV 524
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 91/167 (54%), Gaps = 10/167 (5%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKT 354
V+ A ++ L L ++ A+ +KS C+ CKG +G +
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS 341
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 230/493 (46%), Positives = 296/493 (60%), Gaps = 74/493 (15%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 368 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 420
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLS----------------------- 571
++LL G+K G I+C CCNSEVSPSQFEAH+ AS
Sbjct: 421 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 480
Query: 572 ----SIPQGDWYCKYC-----------------------QNMFERKRF------------ 592
SI D C C +N+ + K F
Sbjct: 481 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 540
Query: 593 -LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 651
++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 541 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 600
Query: 652 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 711
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 601 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 660
Query: 712 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 769
IKK G S + D++WRLL G +A+ E LLSQA+++FH+ F+PI D+ GRD
Sbjct: 661 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFHEQFNPIADA-EGRD 719
Query: 770 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 829
L+ MV+ + EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+GY
Sbjct: 720 LLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGY 779
Query: 830 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 889
FQ L+ CIE+LL FL+V S+VLPAAE AES+W +KF F K++ E L+ + R Q++TF+
Sbjct: 780 FQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQ 838
Query: 890 GTSMLQKRVPACR 902
GTSMLQK VP R
Sbjct: 839 GTSMLQKPVPEYR 851
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 65 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 119
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 120 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 173
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 174 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 211
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 330
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 331 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 385
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 212/459 (46%), Positives = 276/459 (60%), Gaps = 66/459 (14%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD LHKLVF LP+GT+VGYY G++LL+GY GI CHCCN+ VSPSQFE H
Sbjct: 281 LTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYIKEPGIHCHCCNTVVSPSQFEGH 340
Query: 565 AECA----------------------SLS-----SIPQGDWYCKYCQNMFE--------- 588
A A SLS S Q D C C + E
Sbjct: 341 AGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQSDDLCSICSDGGELLLCDTCPR 400
Query: 589 ---------------------------RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 621
R+ L ++ NA+ AGR+ G+DS+EQI R IRI
Sbjct: 401 AFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQIFTRSIRI 460
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
E GC LC+ DF K F RT+LLCDQC RE+HVGCLK+H MADL LP+G W
Sbjct: 461 ATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLTALPEGAW 520
Query: 682 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 740
+C DC RI+ +++LL AE +P + I KK L D+DVRWR+L K++
Sbjct: 521 YCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDLDVRWRVLRDKSSE 580
Query: 741 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 800
+++L+LS+AVAIFH+ FDPI+ + +GRDLIP+MVYGR++R Q++ GMYCA+LTV ++VV
Sbjct: 581 -DSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGRSVRDQDYTGMYCAVLTVGNTVV 639
Query: 801 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 860
SAG+ R+ G+E AELPLVATS+ N G GYFQ LF CIE+LL+ L+VK VLPAA+EA SI
Sbjct: 640 SAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVLPAADEAVSI 699
Query: 861 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
WT +FGF KI + L + + ++ F+GTS L K +P
Sbjct: 700 WTQRFGFSKISRDEL-LEHLKGARTTVFQGTSTLHKLIP 737
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 258/706 (36%), Positives = 367/706 (51%), Gaps = 68/706 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 184 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 241
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 242 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 301
Query: 330 SLPEEKSFACVRCKGTFP----ITCVGKTGPGPLCNSCVKSKKPQGTMTY-TTGIRIRAS 384
S + F K +F +T + L N PQ +++ + +
Sbjct: 302 SSVNMECFKAW--KASFHQNNGVTEADENYHAQLLN------HPQSIVSFPVQAVEDSFT 353
Query: 385 GPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELT 444
G +L + E + +ERK K + + S +K R +L
Sbjct: 354 GSRLPLKQKE------LMKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDN-DLH 406
Query: 445 RNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN 504
R P + + + K QR + + ++ ++ P F + W
Sbjct: 407 RLLFMPNGLPDGAELAYYVKG-----QRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGW- 460
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN---------GLGIICHCCNSE 555
++ ++ ++ +GL A GQ G + G I+C C
Sbjct: 461 --AARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGC--- 515
Query: 556 VSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 615
P F H C L +P+GDW C C F R + I
Sbjct: 516 --PRAF--HPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------ARPIR 554
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
+ R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ + DL+E
Sbjct: 555 IQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKE 614
Query: 676 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRL 733
LPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D++W +
Sbjct: 615 LPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DIQWCI 673
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 674 LSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMYCVVL 732
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
S+VVSAG++RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++VLPA
Sbjct: 733 LAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPA 792
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
AEEAE+IWT+K GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 793 AEEAEAIWTNKLGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 837
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 258/706 (36%), Positives = 367/706 (51%), Gaps = 68/706 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 239 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 296
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 297 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 356
Query: 330 SLPEEKSFACVRCKGTFP----ITCVGKTGPGPLCNSCVKSKKPQGTMTY-TTGIRIRAS 384
S + F K +F +T + L N PQ +++ + +
Sbjct: 357 SSVNMECFKA--WKASFHQNNGVTEADENYHAQLLN------HPQSIVSFPVQAVEDSFT 408
Query: 385 GPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELT 444
G +L + E + +ERK K + + S +K R +L
Sbjct: 409 GSRLPLKQKE------LMKEMTQERKHAAKKPSSYIYGSGLQHKKSSEGAIKKRDN-DLH 461
Query: 445 RNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN 504
R P + + + K QR + + ++ ++ P F + W
Sbjct: 462 RLLFMPNGLPDGAELAYYVKG-----QRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGW- 515
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN---------GLGIICHCCNSE 555
++ ++ ++ +GL A GQ G + G I+C C
Sbjct: 516 --AARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGC--- 570
Query: 556 VSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 615
P F H C L +P+GDW C C F R + I
Sbjct: 571 --PRAF--HPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------ARPIR 609
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
+ R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ + DL+E
Sbjct: 610 IQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKE 669
Query: 676 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRL 733
LPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D++W +
Sbjct: 670 LPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DIQWCI 728
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 729 LSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMYCVVL 787
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
S+VVSAG++RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++VLPA
Sbjct: 788 LAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPA 847
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
AEEAE+IWT+K GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 848 AEEAEAIWTNKLGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 892
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 394 bits (1013), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/442 (47%), Positives = 270/442 (61%), Gaps = 60/442 (13%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQ-------------------------------- 535
+D+ +HK+VF L +GT+VGYY G+
Sbjct: 443 RDKGMHKVVFMSGVLQEGTDVGYYVGGKVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHE 502
Query: 536 ---KLLEGYK---------------NGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGD 577
LL+G K G ++C C P F H EC SLSS P+G
Sbjct: 503 LSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTC-----PRAF--HRECVSLSSAPKGT 555
Query: 578 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 637
W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI E GC LC+
Sbjct: 556 WCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTPETGFGGCALCKL 615
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C DC RIN LQ+L
Sbjct: 616 HDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCSTDCVRINQTLQDL 675
Query: 698 LVQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 756
L E +P L+ IKK +D+DVRWR+L K++ +++L+LS+AVAIFH+
Sbjct: 676 LNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDKSSD-DSKLVLSKAVAIFHE 734
Query: 757 CFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 816
FDPI+ +GRDLIP+MVYGR+ R Q++ GMYCA+LTVN++VVSAG+ R+ G E+AELP
Sbjct: 735 TFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNTVVSAGLFRIMGNEIAELP 794
Query: 817 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 876
LVATS+ + G GYFQ LF+CIE+LL+ L VK VLPAAEEAESIWT++FGF KI + L
Sbjct: 795 LVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEEAESIWTERFGFTKISQDELR 854
Query: 877 IYRKRCSQLVTFKGTSMLQKRV 898
Y K + F+GTS L K V
Sbjct: 855 EYLK-GGRTTVFQGTSNLHKLV 875
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 57/93 (61%), Gaps = 2/93 (2%)
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
+V+ P FE+HA + S YI ENG +L +VLRAC +V L ML++ ++ A+ P++
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
++F C CK +F C GK C+SC++SK
Sbjct: 406 RTFRCKGCKSSFSTLCSGKF--ALFCDSCLESK 436
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 258/703 (36%), Positives = 353/703 (50%), Gaps = 64/703 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ N P V +L TG+LDG V Y+ + L GII GG LC C
Sbjct: 201 NMELKMSKKVLPNTFPSNVKKLLSTGILDGARVKYISPQR----ELYGIIDGGGYLCGCP 256
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN RV+ +FE+HA + R + +I ENGK + +++ ++ PL + ++ A
Sbjct: 257 SCNFSRVLTAYEFELHAGAKTRHPNNHIYLENGKPICSIIQELKAAPLGAVDEVIKDAAG 316
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLV 389
S E+ F + C G G C S + P +Y++ + P
Sbjct: 317 SSINEEFFQVWKASLH---QCNGIIGADEKCYSMLP-YSPHSLGSYSSQGLEESGCPPCS 372
Query: 390 SRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSS- 448
S N Y ++ E K+ + + SL + KT R++
Sbjct: 373 SFVHSNPFRRQKYMDSSEEHKRAFR------------RPSSLSHPKKTNEGGTRRRDNDL 420
Query: 449 -RPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITP 507
R + N P + + Q+ + + ++ + P F ++ +
Sbjct: 421 HRLLFMPNGLP-DGAELAYYIKGQKMLAGYKQGNGIVCSCCDREISPSQF--EAHAGMAA 477
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN---------GLGIICHCCNSEVSP 558
+ Q ++ ++ +GL A GQ L G + G I C C P
Sbjct: 478 RRQP-YRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLIFCESC-----P 531
Query: 559 SQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC 618
F H C L +P W+C C F + I R
Sbjct: 532 RAF--HLVCLGLKYVPSDVWHCPNCNKFGHGGNFSR------------------SIVIRL 571
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
R+VK E E+ GC+ CR DFS F RT++LCDQCEREFHVGCL+ + + DL+E+PK
Sbjct: 572 TRVVKTPEYEVGGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKEIPK 631
Query: 679 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI-DVRWRLLSG 736
WFC DC+RI LQN + + +P LN I K+A L D +WR+L G
Sbjct: 632 DNWFCSNDCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWRILMG 691
Query: 737 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVN 796
K+ E LLS A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V
Sbjct: 692 KSRYQEDLSLLSAAAAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVLLLVK 750
Query: 797 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 856
+ VVSAG+LR+FG++VAELPLVATS+ + GKGYFQ LF+CIE+LL L V +VLPAAEE
Sbjct: 751 NVVVSAGLLRIFGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEE 810
Query: 857 AESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
AESIWT +FGF+K+ E LS Y + QL FKGTSML+K VP
Sbjct: 811 AESIWTRRFGFRKMTEEQLSQYTREL-QLTIFKGTSMLEKEVP 852
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 206/440 (46%), Positives = 262/440 (59%), Gaps = 53/440 (12%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY GQK+L GYK G GI+C CC E+SPSQFE+HA
Sbjct: 427 RDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGM 486
Query: 568 A---------------SLSSIP-----------------------QGD-WYCKYCQNMFE 588
+ +L I GD +C+ C F
Sbjct: 487 SARRQPYRHIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFH 546
Query: 589 RKRFLQHDA--------NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 640
HD N + G G + I R R+VK E ++ GC +CR DF
Sbjct: 547 AACLDLHDTPEGAWHCPNCNKLGH--GGNFARPIVIRLTRVVKTPEYDVGGCAVCRAHDF 604
Query: 641 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 700
S F RT++LCDQCE+EFHVGCL++ + DL+E+PK WFCC DC+ I L+N +
Sbjct: 605 SGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVST 664
Query: 701 EAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 758
+ +P LN I K L + DV+W++L GK+ E LLS A AIF +CF
Sbjct: 665 GVQTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECF 724
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 818
DPIV + +GRDLIP MVYGRN+ GQEFGGMYC +LTV VVSAG+LR+FG+EVAELPLV
Sbjct: 725 DPIV-AKTGRDLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLV 783
Query: 819 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 878
AT++ + GKGYFQ LF+CIE+LL L V+ +VLPAAEEAESIWT +FGF+K+ L Y
Sbjct: 784 ATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFGFRKMSEGQLLKY 843
Query: 879 RKRCSQLVTFKGTSMLQKRV 898
R QL FKGTSML+K V
Sbjct: 844 -TREFQLTIFKGTSMLEKEV 862
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 202 ERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 258
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS C+ +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 259 GCSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 318
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 319 VAGSSINEEFF 329
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/353 (52%), Positives = 240/353 (67%), Gaps = 37/353 (10%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ P F H EC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV+GVD
Sbjct: 514 CCDG--CPRVF--HKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDP 569
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+EQITKRCIRIV E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL++H M
Sbjct: 570 IEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDM 629
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDV 729
DL+E+PKGKWFCC DC RINS LQ L+V E+LP L IK KY N D D+
Sbjct: 630 DDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDI 689
Query: 730 RWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGM 788
+WRL+ G +A++ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G
Sbjct: 690 KWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHG----------- 737
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
S VVSA R+FG+EVAELPLVAT G+GYFQ LF+C+E LL L V+S
Sbjct: 738 --------SQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRS 789
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPE--LLSIYRKRCSQLVTFKGTSMLQKRVP 899
+VLPAAE AESIWT+KFGF K+ E ++ ++ Q QKR+P
Sbjct: 790 LVLPAAEGAESIWTNKFGFNKVTQEQYIMDLFGIAAEQ---------FQKRLP 833
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/241 (38%), Positives = 114/241 (47%), Gaps = 55/241 (22%)
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ- 535
+S K IS+ E+ S S +KS +T KD LHKLVF E+GLP+GTEVGYY GQ
Sbjct: 370 RSPKPSTISRSAESVSKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQV 429
Query: 536 --KLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFL 593
+LL GYK G GI C CCNSEVSPSQFEAHA AS RK +L
Sbjct: 430 VTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWAS------------------RRKPYL 471
Query: 594 Q-HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 652
+ +N V S+ R I + N + LC C +L
Sbjct: 472 HIYTSNGVSLHEF----SISLSRGREISVSDNDD-------LCSIC------LDGGNLLC 514
Query: 653 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 712
CD C R FH C+ L +PKGKWFC N++LQ EK E + NA
Sbjct: 515 CDGCPRVFHKECVS------LANIPKGKWFCKF----CNNMLQK------EKFVEHNANA 558
Query: 713 I 713
+
Sbjct: 559 V 559
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 114/210 (54%), Gaps = 15/210 (7%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
V+ A ++ L L ++ A+ +KS C+ CKG +G + LC SC++ K
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS--KVLCLSCLQLK 352
Query: 368 KPQGTMTYTTG---IRIRASGPKLVSRSSE 394
+ Q + + TG +R+ P +SRS+E
Sbjct: 353 ESQASPSQVTGSSDSHLRSPKPSTISRSAE 382
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 182/336 (54%), Positives = 233/336 (69%), Gaps = 4/336 (1%)
Query: 566 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 625
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 64 ETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCST 123
Query: 686 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKY--AGNSLETVSDIDVRWRLLSGKAATPET 743
DC RIN LQ+LL E + L+ IKK + +D+DVRWR+L K+ + ++
Sbjct: 124 DCVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDKS-SDDS 182
Query: 744 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 803
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYC +LTVN+ VVSAG
Sbjct: 183 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSAG 242
Query: 804 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 863
+ R+ G E+AELPLVATS+ G GYFQ LF+CIE+LLS L VK VLPAAEEAESIWT+
Sbjct: 243 LFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWTE 302
Query: 864 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+FGF KI + L Y K + F+GTS L K V
Sbjct: 303 RFGFAKISQDELREYLKG-GRTTVFQGTSNLHKLVA 337
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 211/483 (43%), Positives = 270/483 (55%), Gaps = 71/483 (14%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
SK F PL+ S +T KD LHKLVF L DGTEV YY GQ+ ++GY
Sbjct: 546 SKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKD 603
Query: 545 LGIICHCCNSEVSPSQFEAHA---------------ECASLSSIPQ-------------- 575
I C+ CN VSPS FEAHA SL +
Sbjct: 604 QRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDMELSERETD 663
Query: 576 ---------GDWY-CKYCQNMF-----------------------ERKRFLQHDANAVEA 602
GD + CK C F ++++ L + NA A
Sbjct: 664 DLCRECGQGGDIFPCKMCPRSFHPACVGLSGVPSEWYCDNCSNLVQKEKALAENKNAKAA 723
Query: 603 GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 662
GR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++LCDQCE+E+HV
Sbjct: 724 GRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVILCDQCEKEYHV 782
Query: 663 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSL 721
GCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+ L
Sbjct: 783 GCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKGL 842
Query: 722 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--- 778
++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLIP MV GR
Sbjct: 843 SMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPK 902
Query: 779 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
+ GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G GYFQ LF+CI
Sbjct: 903 DGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSCI 962
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 897
E++L L++K +LPAA+EAE IW +KFGF KI E Y + L F GTS L K
Sbjct: 963 ERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLNG-AHLTIFHGTSNLYKA 1021
Query: 898 VPA 900
+P+
Sbjct: 1022 IPS 1024
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 2/150 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V Y+ +A L+G+I I C C CNG + +
Sbjct: 380 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKKAV-LKGVITGCNIRCFCLSCNGSKDVCSY 438
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C+
Sbjct: 439 FFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCL 498
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C + G LC C++ K+ Q
Sbjct: 499 NCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 527
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 211/483 (43%), Positives = 270/483 (55%), Gaps = 71/483 (14%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
SK F PL+ S +T KD LHKLVF L DGTEV YY GQ+ ++GY
Sbjct: 537 SKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKD 594
Query: 545 LGIICHCCNSEVSPSQFEAHA---------------ECASLSSIPQ-------------- 575
I C+ CN VSPS FEAHA SL +
Sbjct: 595 QRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDMELSERETD 654
Query: 576 ---------GDWY-CKYCQNMF-----------------------ERKRFLQHDANAVEA 602
GD + CK C F ++++ L + NA A
Sbjct: 655 DLCRECGQGGDIFPCKMCPRSFHPACVGLSGVPSEWYCDNCSNLVQKEKALAENKNAKAA 714
Query: 603 GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 662
GR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++LCDQCE+E+HV
Sbjct: 715 GRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVILCDQCEKEYHV 773
Query: 663 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSL 721
GCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+ L
Sbjct: 774 GCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKGL 833
Query: 722 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--- 778
++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLIP MV GR
Sbjct: 834 SMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPK 893
Query: 779 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
+ GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G GYFQ LF+CI
Sbjct: 894 DGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSCI 953
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 897
E++L L++K +LPAA+EAE IW +KFGF KI E Y + L F GTS L K
Sbjct: 954 ERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLNG-AHLTIFHGTSNLYKA 1012
Query: 898 VPA 900
+P+
Sbjct: 1013 IPS 1015
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 76/151 (50%), Gaps = 2/151 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYM-GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P + EL TGLL+G+ V Y+ K Q + L+G+I I C C CNG + +
Sbjct: 369 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCS 428
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C
Sbjct: 429 YFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNC 488
Query: 340 VRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
+ C + G LC C++ K+ Q
Sbjct: 489 LNCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 518
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 204/466 (43%), Positives = 265/466 (56%), Gaps = 71/466 (15%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD RLHKLVF+ L DGTEV YY GQ+ ++GY I C+ CN VSPS
Sbjct: 386 SAGKVTNKDNRLHKLVFNV--LLDGTEVAYYVDGQRKVDGYIKDHRIYCNHCNRVVSPSA 443
Query: 561 FEAHA---------------ECASLSSIPQ-----------------------GDWY-CK 581
FEAHA SL + GD + CK
Sbjct: 444 FEAHAGEGSRRKPYDNIFTSNGVSLHELAMKISKDMELSERETDDLCRECGQGGDIFPCK 503
Query: 582 YCQNMF-----------------------ERKRFLQHDANAVEAGRVSGVDSVEQITKRC 618
C F ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 504 ICPRSFHPACVGLSKVPAEWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRA 563
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP+
Sbjct: 564 IRIVP-ISDDLGGCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPE 622
Query: 679 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGK 737
+WFCC CS S L ++ A+ L + L I KK+ L + D++W+LLSGK
Sbjct: 623 EEWFCCSSCSETRSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSGK 682
Query: 738 AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAIL 793
AT + +LLS AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+L
Sbjct: 683 RATEDGSILLSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALL 742
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
TV S+VVSA +LRV G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPA
Sbjct: 743 TVGSTVVSAALLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPA 802
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
A EAE IW +KFGF +I PE L Y + L F GTS + K VP
Sbjct: 803 AHEAEGIWMNKFGFSRISPEELEAYLNG-AHLTIFHGTSYMYKAVP 847
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 78/154 (50%), Gaps = 5/154 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TG+L+G+ V+Y+ +A L+G+I I C C CNG + I
Sbjct: 207 LTKHPGNIRELLNTGMLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGAKAISAY 265
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + C+
Sbjct: 266 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIRSSIDPVVKRSRINCL 325
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
C + + LC C++SK+PQ +T
Sbjct: 326 NCNELV----LPSSHENVLCQVCLESKQPQDPLT 355
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 209/470 (44%), Positives = 266/470 (56%), Gaps = 92/470 (19%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQ-------KLLEGYKNGLGIICHCCNSEVS 557
I D LHKL F E+ LP+G+EV YY G+ +LL G+K G I+C CCNSEVS
Sbjct: 383 IMGGDVGLHKLAFGENDLPEGSEVSYYVRGEVGTMRSKRLLSGHKKGCRILCDCCNSEVS 442
Query: 558 PSQFEAHAECASLS---------------------------SIPQGDWYCKYC------- 583
PSQFEAH+ AS SI D C C
Sbjct: 443 PSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLRGREPSINTNDEICSICLDGGTLL 502
Query: 584 ----------------QNMFERKRF-------------LQHDANAVEAGRVSGVDSVEQI 614
+N+ + K F ++ +ANAV AGR+ GVD +EQI
Sbjct: 503 CCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQI 562
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
KRCIRIVK+ E GC LCR +FS SGFGP T+++CDQCE+EFHVGCLK H + DL+
Sbjct: 563 RKRCIRIVKSQTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLK 622
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRL 733
+PKGKWFCC DC INS L+ ++V+ E+LP+ L IKK G S + D++WRL
Sbjct: 623 AVPKGKWFCCRDCKDINSSLRKIVVRREEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRL 682
Query: 734 LSGKAATP-ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 792
L G+ A+ E LLSQA+++FH+ F+PI D+ GRDL+ MV+ + EFGGMYCAI
Sbjct: 683 LHGRXASATEAGSLLSQALSLFHEQFNPIADA-EGRDLLLDMVHSNSTGELEFGGMYCAI 741
Query: 793 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 852
LTV VVSA RV G+EVAELPLVAT G+ V S+VLP
Sbjct: 742 LTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQ------------------VNSLVLP 783
Query: 853 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
AAE AES+W +KF F K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 784 AAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 832
Score = 100 bits (248), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 88/162 (54%), Gaps = 6/162 (3%)
Query: 220 SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRV 276
+L P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V
Sbjct: 208 TLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNV 267
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
+ P+ FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +
Sbjct: 268 VTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKST 327
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTG 378
F C CKG+ I+ VG + LC SCV K+ + TG
Sbjct: 328 F-CFNCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG 366
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 194/450 (43%), Positives = 258/450 (57%), Gaps = 65/450 (14%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGI-------------------- 547
+D LH+L+F +GLPDGTE+ YY Q ++ G+
Sbjct: 388 RDNDLHRLLFMPNGLPDGTELAYYVKTQISPSQFEAHAGMAARRQPYRHIFISSGLSLHD 447
Query: 548 ------------------ICHCCNSEVS-------PSQFEAHAECASLSSIPQGDWYCKY 582
+C C P F H C S+P+G WYC
Sbjct: 448 IAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAF--HTACLKFQSMPEGTWYCSS 505
Query: 583 CQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRIVKNLEAELSGCLLCRGCD 639
C + + + + + D + I R R+VK E+++ GC+ CR D
Sbjct: 506 CND------------GPISSKKATTTDPSGNARPIVIRLSRVVKAPESDIGGCVFCRSHD 553
Query: 640 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 699
FS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KWFCC +CSRI++ +QN +
Sbjct: 554 FSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRIHTAVQNSVS 613
Query: 700 QEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 758
+ LP L+ I +K + T V WR+LSGK+ PE LLS+A IF +CF
Sbjct: 614 CGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEHLPLLSRAAVIFRECF 673
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 818
DPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA +LR+FGQEVAELP+V
Sbjct: 674 DPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQEVAELPIV 732
Query: 819 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 878
ATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESIWT KFGF K+ + L Y
Sbjct: 733 ATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEY 792
Query: 879 RKRCSQLVTFKGTSMLQKRVPACRIGSSST 908
+K QL FKGTSML+K+VP G S +
Sbjct: 793 QKEV-QLTIFKGTSMLEKKVPKATTGLSES 821
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 151 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 207
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 208 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 267
Query: 330 SLPEE 334
+L EE
Sbjct: 268 ALNEE 272
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 349 bits (895), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 186/376 (49%), Positives = 247/376 (65%), Gaps = 25/376 (6%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQ 584
A GQKL G + + C C + + P + H C L ++P+G W C C+
Sbjct: 478 ASGQKLTTGDSDDM---CAACGNGGDLIFCDRCPRAY--HTGCLHLQNVPEGVWSCPNCR 532
Query: 585 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 644
+ + ++ A+ G +S + I R R+VK E E+ GC++CR DFS +
Sbjct: 533 DK------VGSNSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAK 583
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
F RT+LLCDQCEREFHVGCL+ + DL+ELPK KWFCC +CS I+ LQN ++ A+
Sbjct: 584 FDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQI 643
Query: 705 LPEFHLN-AIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 762
+P+ + I+K+ G L + DVRW++LSGK+ PE LS+A AIF +CFDPIV
Sbjct: 644 IPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECFDPIV 703
Query: 763 DSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 822
+ SGRDLIP MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+EVAELP+VATS+
Sbjct: 704 -AKSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSR 762
Query: 823 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRC 882
+ GKGYFQ+LF+CIE+LLS L V+++VLPAAE+AESIWT K GF+K+ E L Y +
Sbjct: 763 EHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREV 822
Query: 883 SQLVTFKGTSMLQKRV 898
QL F GTSML+K V
Sbjct: 823 -QLTIFNGTSMLEKVV 837
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 99/212 (46%), Gaps = 51/212 (24%)
Query: 484 ISKPFENASPPLS-----FPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLL 538
+S P E +P S + +D LH+L+F +GLPDG E+ Y+ GQ++L
Sbjct: 364 LSHPVERPNPNFSNAVLQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRIL 423
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHD-A 597
G+K G GI+C CN E+SPSQFEAHA A+ + ++++ HD A
Sbjct: 424 GGFKQGNGILCSHCNREISPSQFEAHAGMAARR---------QPYRHIYTTNGLTLHDIA 474
Query: 598 NAVEAG-RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 656
++ +G +++ DS + +C C G G ++ CD+C
Sbjct: 475 ISLASGQKLTTGDSDD---------------------MCAAC-----GNGG-DLIFCDRC 507
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
R +H GCL L+ +P+G W C C D
Sbjct: 508 PRAYHTGCLH------LQNVPEGVWSCPNCRD 533
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E AE S + +ELKMSKK+ N P V +L TG+LDG V Y+ L
Sbjct: 199 EGSAESSRYSLGPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK--L 256
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG +C CS CN ++ +FE HA + R + +I ENG+ + V++ +S
Sbjct: 257 QGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSA 316
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
PL +L + S SF
Sbjct: 317 PLSILDEVIMEVAGSSVNMNSF 338
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 346 bits (888), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 186/376 (49%), Positives = 247/376 (65%), Gaps = 25/376 (6%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQ 584
A GQKL G + + C C + + P + H C L ++P+G W C C+
Sbjct: 417 ASGQKLTTGDSDDM---CAACGNGGDLIFCDRCPRAY--HTGCLHLQNVPEGVWSCPNCR 471
Query: 585 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 644
+ + ++ A+ G +S + I R R+VK E E+ GC++CR DFS +
Sbjct: 472 DK------VGSNSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAK 522
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
F RT+LLCDQCEREFHVGCL+ + DL+ELPK KWFCC +CS I+ LQN ++ A+
Sbjct: 523 FDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQI 582
Query: 705 LPEFHLN-AIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 762
+P+ + I+K+ G L + DVRW++LSGK+ PE LS+A AIF +CFDPIV
Sbjct: 583 IPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECFDPIV 642
Query: 763 DSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 822
+ SGRDLIP MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+EVAELP+VATS+
Sbjct: 643 -AKSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSR 701
Query: 823 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRC 882
+ GKGYFQ+LF+CIE+LLS L V+++VLPAAE+AESIWT K GF+K+ E L Y +
Sbjct: 702 EHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREV 761
Query: 883 SQLVTFKGTSMLQKRV 898
QL F GTSML+K V
Sbjct: 762 -QLTIFNGTSMLEKVV 776
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 89/178 (50%), Gaps = 44/178 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDG E+ Y+ GQ++L G+K G GI+C CN E+SPSQFEAHA
Sbjct: 332 RDNDLHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGM 391
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHD-ANAVEAG-RVSGVDSVEQITKRCIRIVKNL 625
A+ + ++++ HD A ++ +G +++ DS +
Sbjct: 392 AARR---------QPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDD------------- 429
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+C C G G ++ CD+C R +H GCL L+ +P+G W C
Sbjct: 430 --------MCAAC-----GNGG-DLIFCDRCPRAYHTGCLH------LQNVPEGVWSC 467
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 346 bits (887), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 177/355 (49%), Positives = 233/355 (65%), Gaps = 20/355 (5%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQI 614
P F H C S+P+G WYC C + + + + + D + I
Sbjct: 517 PQAF--HTACLKFQSVPEGTWYCSSCND------------GPISSKKATATDPSGNARPI 562
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
R R+VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ + DL+
Sbjct: 563 VIRLSRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGLCDLK 622
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 733
E+P+ KWFCC DCSRI++ +QN + + +P L+ I +K + T + V WR+
Sbjct: 623 EIPQEKWFCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNGDIVEWRI 682
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 683 LSGKSRYPEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVL 741
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPA
Sbjct: 742 IVNSLVVSAALLRIFGQQVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPA 801
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 908
AEEAESIWT KFGF K+ + L Y+K QL FKGTSML+K+VP S ST
Sbjct: 802 AEEAESIWTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKTTSLSEST 855
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 85/178 (47%), Gaps = 44/178 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY QKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 402 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYKQGSGIVCSCCSREISPSQFEAHAGM 461
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHD-ANAVEAGRVSGVDSVEQITKRCIRIVKNLE 626
A+ + +++F HD A ++ G V + +
Sbjct: 462 AARR---------QPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDM------------ 500
Query: 627 AELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C G D +LLC C + FH CLK + +P+G W+C
Sbjct: 501 -----CSICGDGGD----------LLLCAGCPQAFHTACLK------FQSVPEGTWYC 537
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 72/123 (58%), Gaps = 3/123 (2%)
Query: 213 LKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN 272
+KM KKI P V +L ETG+L+G V Y+ + L+GII GG LC C+ C+
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPVRE--LQGIIHSGGYLCGCTTCS 216
Query: 273 GCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALSSL 331
+V+ +FE+HA + R + +I ENG+++ +++ ++ P +L+ +++ A S+L
Sbjct: 217 FSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAGSAL 276
Query: 332 PEE 334
EE
Sbjct: 277 NEE 279
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 176/355 (49%), Positives = 231/355 (65%), Gaps = 20/355 (5%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQI 614
P F H C S+P+G WYC C + + + + + D + I
Sbjct: 487 PQAF--HTACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPI 532
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
R R+VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+
Sbjct: 533 VIRLSRVVKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLK 592
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 733
E+P+ KWFCC +CSRI++ +QN + + LP L+ I +K + T V WR+
Sbjct: 593 EIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRI 652
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 653 LSGKSRYPEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVL 711
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
VNS VVSA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPA
Sbjct: 712 IVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPA 771
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 908
AEEAESIWT KFGF K+ + L Y+K QL FKGTSML+K+VP G S +
Sbjct: 772 AEEAESIWTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSES 825
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 86/178 (48%), Gaps = 44/178 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 372 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 431
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHD-ANAVEAGRVSGVDSVEQITKRCIRIVKNLE 626
A+ + +++F HD A ++ G V + +
Sbjct: 432 AARR---------QPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDM------------ 470
Query: 627 AELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C G D +LLC C + FH CLK + +P+G W+C
Sbjct: 471 -----CSICGDGGD----------LLLCAGCPQAFHTACLK------FQSMPEGTWYC 507
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 160 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 216
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 217 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 276
Query: 330 SLPEE 334
+L EE
Sbjct: 277 ALNEE 281
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 343 bits (880), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 174/344 (50%), Positives = 231/344 (67%), Gaps = 16/344 (4%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
P F H C S+P+G WYC C + + A A + +++ I R
Sbjct: 505 PQAF--HTACLKFQSMPEGTWYCSSCNDGPTSCKI----ATASDP-------NLKPIVIR 551
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
R+VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++++ DL+ +P
Sbjct: 552 LTRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIP 611
Query: 678 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 736
+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY + + V WR+LSG
Sbjct: 612 QDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSG 671
Query: 737 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVN 796
K+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VN
Sbjct: 672 KSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVN 730
Query: 797 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 856
S VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++LPAAEE
Sbjct: 731 SLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEE 790
Query: 857 AESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
AESIWT+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 791 AESIWTNKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 833
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 84/177 (47%), Gaps = 41/177 (23%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAHA
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 627
A P + ++ + L + + G DS +
Sbjct: 449 AGRRQ-PYRRIHISSGLSLHDIAVSLADGGHVITTG-----DSDDM-------------- 488
Query: 628 ELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C G D +LLC C + FH CLK + +P+G W+C
Sbjct: 489 ----CSICGNGGD----------LLLCAGCPQAFHTACLK------FQSMPEGTWYC 525
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQAW--KGSF 300
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 340 bits (873), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/357 (49%), Positives = 232/357 (64%), Gaps = 20/357 (5%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQI 614
P F H C S+P+G WYC C + + + + + D + I
Sbjct: 192 PQAF--HTACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPI 237
Query: 615 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
R R+VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+
Sbjct: 238 VIRLSRVVKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLK 297
Query: 675 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 733
E+P+ KWFCC +CSRI++ +QN + + LP L+ I +K + T V WR+
Sbjct: 298 EIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRI 357
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 793
LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 358 LSGKSRYPEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVL 416
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
VNS VVSA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPA
Sbjct: 417 IVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPA 476
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 910
AEEAESIWT KFGF K+ + L Y+K QL FKGTSML+K+VP G S + +
Sbjct: 477 AEEAESIWTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSESTT 532
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 86/178 (48%), Gaps = 44/178 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 77 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 136
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHD-ANAVEAGRVSGVDSVEQITKRCIRIVKNLE 626
A+ + +++F HD A ++ G V + +
Sbjct: 137 AARR---------QPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDM------------ 175
Query: 627 AELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C G D +LLC C + FH CLK + +P+G W+C
Sbjct: 176 -----CSICGDGGD----------LLLCAGCPQAFHTACLK------FQSMPEGTWYC 212
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 164/271 (60%), Positives = 205/271 (75%), Gaps = 2/271 (0%)
Query: 639 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 698
DFSKSGF RT+++CDQCERE+HVGCLK+HKMADL+ELP+G+WFC DC RI+S LQ LL
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 699 VQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 757
++ A+ LP ++ I+K ++ D+RW+LLSGK A E+RLLLS+AVAIFH+
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAESRLLLSKAVAIFHES 122
Query: 758 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 817
FDPIVD+ +GRDLIP+MVYGR +R Q++GG+YCA+LTV SSVVSAGILRV G E+AELPL
Sbjct: 123 FDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAELPL 182
Query: 818 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 877
VATS+ + G+GYFQ LF+CIE+LL L+VK VLPAA+EAESIWT KFGF KI + L
Sbjct: 183 VATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDELHK 242
Query: 878 YRKRCSQLVTFKGTSMLQKRVPACRIGSSST 908
Y V F+GTS L K V + S T
Sbjct: 243 YLNGARTTV-FQGTSTLHKPVTVPHVSSRET 272
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 335 bits (858), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 184/375 (49%), Positives = 237/375 (63%), Gaps = 29/375 (7%)
Query: 532 ACGQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQN 585
A GQ L G + + +C CN P F HA C L +P W C C
Sbjct: 494 ANGQNLTTGDSDDMCAVCGDGGDLILCNG--CPRAF--HAACLGLQCVPDSGWQCLNC-- 547
Query: 586 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
+ + N GR S + V I R R+ K E E+ GC++CR DFS + F
Sbjct: 548 -------IDNAGN----GRESSI--VRPIMIRLTRVDKTPEVEMGGCVVCREHDFSVAKF 594
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 705
RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI + LQN + AE +
Sbjct: 595 DERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYAALQNSVSAGAEII 654
Query: 706 P-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 763
P F I+K+ L T + D++WR+LSGK+ PE LLS+A AIF +CFDPIV
Sbjct: 655 PASFSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRAAAIFRECFDPIV- 713
Query: 764 SISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
+ISGRDLIP MVYGRN+ GQEFGGMYC +L VN VVSAG+LR+FG+ VAELPLVATS+
Sbjct: 714 AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFGRNVAELPLVATSRA 773
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
+ GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT K GF+K+ + LS + +
Sbjct: 774 HQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMSEDQLSKHLREV- 832
Query: 884 QLVTFKGTSMLQKRV 898
QL F TSML+K V
Sbjct: 833 QLTLFNKTSMLEKTV 847
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 227 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 281
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 282 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIK 341
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 342 NVAGSSVNEESF 353
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 119/266 (44%), Gaps = 59/266 (22%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPNKSRWN-------ITPKDQRLHKL 515
K S Q K+ KS T L+ P N P N +D LH+L
Sbjct: 357 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNSGVLQKRSADGCTKRRDNDLHRL 416
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQ 575
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA A+
Sbjct: 417 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARR---- 472
Query: 576 GDWYCKYCQNMFERKRFLQHD-ANAVEAGR-VSGVDSVEQITKRCIRIVKNLEAELSGCL 633
+ ++++ HD A ++ G+ ++ DS + C
Sbjct: 473 -----QPYRHIYTSNGLTLHDIALSLANGQNLTTGDSDDM------------------CA 509
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS--- 688
+C G G ++LC+ C R FH CL L+ +P W C C+D +
Sbjct: 510 VC--------GDGG-DLILCNGCPRAFHAACLG------LQCVPDSGWQCLNCIDNAGNG 554
Query: 689 RINSVLQNLLVQ--EAEKLPEFHLNA 712
R +S+++ ++++ +K PE +
Sbjct: 555 RESSIVRPIMIRLTRVDKTPEVEMGG 580
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 169/346 (48%), Positives = 229/346 (66%), Gaps = 8/346 (2%)
Query: 564 HAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 623
H C LS P +W+C C N+ ++++ L + NA AGR +GVDS+EQI KR IRIV
Sbjct: 847 HPACVRLSEFPS-EWFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKRAIRIVP 905
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP G+WFC
Sbjct: 906 ICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELPDGEWFC 964
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C CS I S L ++ A+ L L I KK+ L +DID+RW+LL+G++AT +
Sbjct: 965 CSSCSEIRSCLDKMISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAGRSATED 1024
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSS 798
LLLS AV I H FDPI+++ +GRDLIP MV GR + GQ++ GMYCA++T+ S+
Sbjct: 1025 GSLLLSSAVPIIHQSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAVITLGST 1084
Query: 799 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 858
VVSA +LR+ G +VAELPLVATS G GYFQ+LF+C+E++L L++K +LPAA+EAE
Sbjct: 1085 VVSAALLRIMGGDVAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLPAAQEAE 1144
Query: 859 SIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIG 904
+IW KFGF +I E L Y + L F GTS L K VP+ G
Sbjct: 1145 AIWMKKFGFSRIPQEQLEAYLNG-AHLTVFHGTSNLYKAVPSPSPG 1189
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 77/150 (51%), Gaps = 5/150 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
K P V EL +TGLL+G+ V+Y+ +A ++G+I I C C CNG R +
Sbjct: 544 FTKHPGNVKELLQTGLLEGMPVMYIIPNSKKAV-VKGVITGCNIRCFCIKCNGSRALSTY 602
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA + +++I NG SL +VLRAC L L+ T +S++ + C+
Sbjct: 603 FFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCL 662
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C G P + LC+ C+ SK+PQ
Sbjct: 663 NCGGHLPSSETEHF----LCHCCLDSKQPQ 688
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 81/184 (44%), Gaps = 45/184 (24%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQ+ ++GY I C+ C+ VSPS
Sbjct: 723 SSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCSRVVSPSA 780
Query: 561 FEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIR 620
FEAHA S K N+F +N V +S ++
Sbjct: 781 FEAHAGEGSRR---------KPYDNIFT--------SNGVSLHELS------------MK 811
Query: 621 IVKNLE-AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 679
I K++E +E LCR C G G I C C R FH C++ L E P
Sbjct: 812 ISKDMELSERETDDLCREC-----GLG-GDIFPCKMCPRSFHPACVR------LSEFPS- 858
Query: 680 KWFC 683
+WFC
Sbjct: 859 EWFC 862
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 332 bits (852), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 185/375 (49%), Positives = 237/375 (63%), Gaps = 29/375 (7%)
Query: 532 ACGQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQN 585
A GQ L G + + +C CN P F HA C L +P W C C++
Sbjct: 497 ANGQNLTTGDSDDMCAVCGDGGDLILCNG--CPRAF--HAACLGLQCVPDSGWQCLNCRD 552
Query: 586 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
NA GR S + V I R R+ K E E+ GC++CR DFS + F
Sbjct: 553 ------------NAGN-GRESSI--VRPIMIRLTRVDKTPEFEMGGCVVCREHDFSVAKF 597
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 705
RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI LQN + AE +
Sbjct: 598 DERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSVAAGAEII 657
Query: 706 P-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 763
P I+K+ L T + D++WR+LSGK+ PE LLS+A AIF +CFDPIV
Sbjct: 658 PASVSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRAAAIFRECFDPIV- 716
Query: 764 SISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
+ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSAG+LR+FG+ VAELPLVATS+
Sbjct: 717 AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGRNVAELPLVATSRA 776
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
+ GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT K GF+K+ + LS + +
Sbjct: 777 HQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKMSEDQLSKHLREV- 835
Query: 884 QLVTFKGTSMLQKRV 898
QL F TSML+K V
Sbjct: 836 QLTLFNKTSMLEKTV 850
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 230 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 284
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 285 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIK 344
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 345 NVAGSSVNEESF 356
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 122/266 (45%), Gaps = 59/266 (22%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPN------KSRWNITPK-DQRLHKL 515
K S Q K+ KS T L+ P N P N +S T + D LH+L
Sbjct: 360 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNTGVLQKRSADGCTKRRDNDLHRL 419
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQ 575
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA A+
Sbjct: 420 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARR---- 475
Query: 576 GDWYCKYCQNMFERKRFLQHD-ANAVEAGR-VSGVDSVEQITKRCIRIVKNLEAELSGCL 633
+ ++++ HD A ++ G+ ++ DS + C
Sbjct: 476 -----QPYRHIYTSNGLTLHDIALSLANGQNLTTGDSDDM------------------CA 512
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS--- 688
+C G G ++LC+ C R FH CL L+ +P W C C D +
Sbjct: 513 VC--------GDGG-DLILCNGCPRAFHAACLG------LQCVPDSGWQCLNCRDNAGNG 557
Query: 689 RINSVLQNLLVQ--EAEKLPEFHLNA 712
R +S+++ ++++ +K PEF +
Sbjct: 558 RESSIVRPIMIRLTRVDKTPEFEMGG 583
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 332 bits (850), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 179/377 (47%), Positives = 233/377 (61%), Gaps = 32/377 (8%)
Query: 532 ACGQKLLEGYKNGLGIICH------CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQN 585
A GQ L G + + +C CN P F HA C L S+P+ W+C C++
Sbjct: 543 ANGQNLTTGDSDDMCAVCGDGGDLILCNG--CPRAF--HAACLGLHSVPESGWHCLNCED 598
Query: 586 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
+R I R R+ K E E+ GC++CR DFS F
Sbjct: 599 NTGDER------------------GARPIMIRLTRVDKEPEYEVGGCVVCRANDFSVDKF 640
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 705
RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI LQN + A+ +
Sbjct: 641 DDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQNSVSAGADTI 700
Query: 706 PE-FHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 763
P I+K+ L T D+ D++WR+LSGK+ E LLS+A AIF +CFDPIV
Sbjct: 701 PSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAEHLPLLSRAAAIFRECFDPIV- 759
Query: 764 SISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
+ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSAG+LR+FG+ +AELPLVATS+
Sbjct: 760 AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSAGLLRIFGRNIAELPLVATSRE 819
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
+ GKGYFQ LF+CIE+LLS L V+ +VLPAA +AESIWT K GF K+ + L+ + K
Sbjct: 820 HQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFHKMSEDQLTKHLKEV- 878
Query: 884 QLVTFKGTSMLQKRVPA 900
QL F TS+L+K V A
Sbjct: 879 QLTLFNKTSVLEKMVQA 895
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 100/209 (47%), Gaps = 48/209 (22%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 458 RDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGM 517
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHD-ANAVEAGR-VSGVDSVEQITKRCIRIVKNL 625
A+ + ++++ HD A ++ G+ ++ DS +
Sbjct: 518 AARR---------QPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDM------------ 556
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
C +C G G ++LC+ C R FH CL H +P+ W C
Sbjct: 557 ------CAVC--------GDGG-DLILCNGCPRAFHAACLGLHS------VPESGWHCLN 595
Query: 684 CMDCSRINSVLQNLLVQ--EAEKLPEFHL 710
C D + + ++++ +K PE+ +
Sbjct: 596 CEDNTGDERGARPIMIRLTRVDKEPEYEV 624
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 72/127 (56%), Gaps = 12/127 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILC 266
N+ELKMSKK+ N P V +L TG+LDG +V Y+ G ++ L GII DGG LC
Sbjct: 240 NMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVE-----LDGIIGDGGYLC 294
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR----ACRSVPLPMLKA 322
CS+C+ RV+ +FE HA + R + +I ENGK + ++ A S P ++K
Sbjct: 295 GCSMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKN 354
Query: 323 TLQSALS 329
S+++
Sbjct: 355 VAGSSIN 361
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 328 bits (840), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 7/341 (2%)
Query: 564 HAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 623
H C L +P +W+C C + ++++ L + NA AGR +GVDS+EQI KR IRIV
Sbjct: 821 HPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVP 880
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFC
Sbjct: 881 ICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFC 939
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C CS I S L ++ + A L E ++ I KK+ L ++ ++RW+L++G++AT +
Sbjct: 940 CDSCSEIRSSLDKMISEGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRSATED 999
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSS 798
LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ++ GMYCA+LTV S+
Sbjct: 1000 GNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGST 1059
Query: 799 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 858
VVSA +LRV G +VAELPLVATS G GYFQ+LF+CIE+LL L+VK +LPAA EAE
Sbjct: 1060 VVSAALLRVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAE 1119
Query: 859 SIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+IW KFGF KI + + Y L F GT L K +P
Sbjct: 1120 AIWMKKFGFSKIPQDQMEAYLNG-GHLTVFHGTLNLYKAIP 1159
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 103/209 (49%), Gaps = 21/209 (10%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEV-EAIAEGSALTSPK-KNLELKMSKKIS-LN 222
S +KPKVE + SL+ V E + + T P +E+KMSKK++ L+
Sbjct: 472 SLLKPKVE------------APPASSLVVVPEEPVDSTPETPPSVTKMEMKMSKKVAFLS 519
Query: 223 KKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
K P +L TGLL+G+ V+Y+ + L+G+I I C C C+G + I F
Sbjct: 520 KHPGNTRDLLSTGLLEGMPVMYIIP-NSKKPVLKGVIAGCNIRCFCVKCDGSKTITTYFF 578
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA + ++YI NG L +VLRAC S PL L T+QS + + C+ C
Sbjct: 579 ELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCLNC 638
Query: 343 KGTFPITCVGKTGPGPLCNS-CVKSKKPQ 370
G P +T LC+ C +S +PQ
Sbjct: 639 NGELP----SQTEEQFLCHDCCPESNQPQ 663
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 80/184 (43%), Gaps = 44/184 (23%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQK ++GY I C+ CN VSPS
Sbjct: 697 STGKVTTKDTGLHKLVF--KVLLDGTEVNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSA 754
Query: 561 FEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIR 620
FEAHA +G Y N+F +N V +S +
Sbjct: 755 FEAHAG--------EGSRRKPY-DNIFT--------SNGVSLHELS------------MS 785
Query: 621 IVKNLE-AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 679
I K+++ +E LCR C G G I C C R FH C+ L +P
Sbjct: 786 ISKDMQLSERETDDLCREC-----GLG-GDIFPCRMCPRSFHPACVG------LPVVPSE 833
Query: 680 KWFC 683
+WFC
Sbjct: 834 EWFC 837
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 328 bits (840), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 164/317 (51%), Positives = 212/317 (66%), Gaps = 12/317 (3%)
Query: 564 HAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 623
H C LS +P +WYC C N+ ++++ L + NA AGR +GVDS+EQI KR IRIV
Sbjct: 421 HPACVGLSGVP-SEWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVP 479
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
NL EL DF+ S F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFC
Sbjct: 480 NLWIELGQK------DFNNSVFDERTVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFC 533
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C CS I S L ++ A L E ++ I KK+ L ++ D+RWRLL+G+ A+ +
Sbjct: 534 CNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASED 593
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSS 798
LLLS AV I H FDPI++ SGRDLIP MV GR + GQ++ GMYCA+LT+ +S
Sbjct: 594 GDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTS 653
Query: 799 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 858
VVSA +LRV G EVAELPLVATSK G GYFQ LF+CIE++L L++K +LPAA+EAE
Sbjct: 654 VVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAE 713
Query: 859 SIWTDKFGFKKIDPELL 875
IW +KFGF KI E L
Sbjct: 714 GIWMNKFGFTKIPQEQL 730
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 49/119 (41%), Gaps = 22/119 (18%)
Query: 248 IKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF-ENGKSLL 306
+K Q + L+G+I I C C CNG +V PS FE HA + RR F NG SL
Sbjct: 322 LKLQKAVLKGVIAGCNIRCFCLSCNGSKV-SPSAFEAHAGEGTRRKPYDNIFTSNGVSLH 380
Query: 307 EVLRACRSVPLPMLKATLQSALSSLPEE----------KSFACVRCKGTFPITCVGKTG 355
E+ +K + LS + F C C +F CVG +G
Sbjct: 381 EL----------SMKISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSG 429
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 318 bits (816), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 170/352 (48%), Positives = 221/352 (62%), Gaps = 34/352 (9%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
P F H C S+P+G WYC C + G S + T
Sbjct: 505 PQAF--HTACLKFQSMPEGTWYCSSCND---------------------GPTSCKIATAS 541
Query: 618 CIRIVKNLEAEL----SGCLLC----RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
+ NL A + S L R DFS F RT++LCDQCE+E+HVGCL++++
Sbjct: 542 WLYTYFNLNANILVLHSAYSLSPISDRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENE 601
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDID 728
+ DL+ +P+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY + +
Sbjct: 602 LCDLKGIPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNT 661
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGM 788
V WR+LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGM
Sbjct: 662 VEWRMLSGKSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGM 720
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
YC +L VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++
Sbjct: 721 YCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVEN 780
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
++LPAAEEAESIWT+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 781 LLLPAAEEAESIWTNKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 831
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 103/223 (46%), Gaps = 48/223 (21%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAHA
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 627
A P + ++ + L + + G DS +
Sbjct: 449 AGRRQ-PYRRIHISSGLSLHDIAVSLADGGHVITTG-----DSDDM-------------- 488
Query: 628 ELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 686
C +C G D +LLC C + FH CLK + +P+G W+
Sbjct: 489 ----CSICGNGGD----------LLLCAGCPQAFHTACLK------FQSMPEGTWY---- 524
Query: 687 CSRINSVLQNLLVQEAEKL-PEFHLNA--IKKYAGNSLETVSD 726
CS N + + A L F+LNA + ++ SL +SD
Sbjct: 525 CSSCNDGPTSCKIATASWLYTYFNLNANILVLHSAYSLSPISD 567
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQAW--KGSF 300
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 318 bits (815), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 224/341 (65%), Gaps = 7/341 (2%)
Query: 564 HAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 623
H C L +P +W+C C + ++++ L + NA AGR +GVDS+EQI KR IRIV
Sbjct: 46 HPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVP 105
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFC
Sbjct: 106 ICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFC 164
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C CS I S L ++ A L E ++ I KK+ L ++ ++RW+L++G++AT +
Sbjct: 165 CDSCSEIRSSLDKMISGGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRSATED 224
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSS 798
LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ++ GMYCA+LTV S+
Sbjct: 225 GNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGST 284
Query: 799 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 858
VVSA +LRV G +VAELPLVATS G GYFQ+LF+CIE+LL L+VK +LPAA EAE
Sbjct: 285 VVSAALLRVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAE 344
Query: 859 SIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+IW KFGF KI + + Y L F GT L K +P
Sbjct: 345 AIWMKKFGFSKIPQDQMEAYLNG-GHLTVFHGTLNLYKAIP 384
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 315 bits (808), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 168/344 (48%), Positives = 223/344 (64%), Gaps = 12/344 (3%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
P F H C S+P+G WYC C + + A A + S V S+ +
Sbjct: 492 PQAF--HTACLKFQSMPEGTWYCSSCNDGPTSCK----TATATDPNLKSIVGSIAIFSLS 545
Query: 618 C-IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
IR++ + A + R DFS F RT++LCDQCE+E+HVGCL+++ + DL+ +
Sbjct: 546 AHIRVLHS--AYCFSPISDRSLDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLCDLKGI 603
Query: 677 PKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLS 735
P+ KWFCC DCSRI++ LQ+ + +P L+ I +KY + + +V WR+LS
Sbjct: 604 PQDKWFCCSDCSRIHTALQSSASCGPQTIPTVLLDTISRKYREKGICIDNGDNVEWRMLS 663
Query: 736 GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTV 795
GK+ E LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V
Sbjct: 664 GKSRYAEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMV 722
Query: 796 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 855
NS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++LPAAE
Sbjct: 723 NSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAE 782
Query: 856 EAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
EAESIWT KFGF K+ L Y++ QL FKGTSML+K+VP
Sbjct: 783 EAESIWTKKFGFTKMTEHQLQKYQREV-QLTIFKGTSMLEKKVP 825
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 84/177 (47%), Gaps = 41/177 (23%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LH+L+F +GLPDGTE+ Y+ QKLL+GYK G GI+C CC++E+SPSQFEAHA
Sbjct: 376 RDNDLHRLLFLPNGLPDGTELAYFVKSQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGM 435
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 627
A P + ++ + L + + G DS +
Sbjct: 436 AGRRQ-PYRHIHISSGLSLHDIAMSLADGGHVITTG-----DSDDM-------------- 475
Query: 628 ELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C G D +LLC C + FH CLK + +P+G W+C
Sbjct: 476 ----CSICGDGGD----------LLLCAGCPQAFHTACLK------FQSMPEGTWYC 512
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 70/142 (49%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EGS L +KM KI P V +L TG+LDG V Y+ A L
Sbjct: 147 EHTWEGSPSNVASSTLGVKMLDKIDSTNFPSNVKKLLATGILDGARVKYLS--ISPAREL 204
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C++C+ +V+ +FE HA + + + +I ENG+ + +++ R
Sbjct: 205 QGIIHSGGYLCGCTVCDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVA 264
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
P +L+ ++ S E+ F
Sbjct: 265 PPDVLEEVIRKVAGSALSEEGF 286
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 298 bits (764), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 190/520 (36%), Positives = 263/520 (50%), Gaps = 119/520 (22%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 300 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 357
Query: 513 HKLVFDESGLPDGTEVGYYACGQ---KLLEGY---------------KNGLGI------- 547
HKL F LP+GT+VGYY G+ E + NG+ +
Sbjct: 358 HKLAFMSGVLPEGTDVGYYVGGKVSPSQFEAHAGRAARRKPYHNIYMSNGVSLHELSVSL 417
Query: 548 -------------ICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF 587
+C C+ + P F H EC ++IP+G W C+YC+N
Sbjct: 418 SKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAF--HRECVGFTTIPRGTWCCRYCENRQ 475
Query: 588 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC--------- 638
+R+ L ++ NA+ AGR+ G+D +EQI R IRI GC LC
Sbjct: 476 QRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCSMSGFMDKQSVL 535
Query: 639 ------------------DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
DFSK F RT+LLCDQ LP+G
Sbjct: 536 SRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA-------------------LPEGA 576
Query: 681 WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAA 739
W+C DC RI+ L++LL + AE + + IK KY +L D+DVRWR+L K++
Sbjct: 577 WYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGDLDVRWRVLKDKSS 636
Query: 740 TPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSV 799
+++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG ++V
Sbjct: 637 A-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG-------------------NTV 676
Query: 800 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 859
VSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL+ L+VK VLPAA+EAES
Sbjct: 677 VSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAADEAES 736
Query: 860 IWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
IWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 737 IWTQRFGFVKITQDELREYLK-GGRTTVFQGTSTLHKLVP 775
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 201
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 202 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 261
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 262 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 315
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 316 RRSKNESPGSKYCNS 330
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 296 bits (758), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 159/354 (44%), Positives = 211/354 (59%), Gaps = 48/354 (13%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
++C C P F H EC ++IP+G W C+YC+N +R+ L ++ NA+ AGR+
Sbjct: 255 LLCDSC-----PRAF--HRECVGFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRID 307
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 666
G+D +EQI R IRI GC LCR DFSK F RT+LLCDQ
Sbjct: 308 GIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQ----------- 356
Query: 667 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVS 725
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 357 --------ALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDG 408
Query: 726 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 785
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 409 DLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG-------- 459
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL+ L+
Sbjct: 460 -----------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLLASLK 508
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 509 VKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTTVFQGTSTLHKLVP 561
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 6/143 (4%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
LRG+I+ GILCSCS C G V+ P FE+HA + S YI ENG +L ++LRAC
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 315 VPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
L ML++ +Q+A+ P++++F C CK +F GK LC+SC++SK Q + T
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-T 191
Query: 375 YTTGIRIRASGPKLVSRSSENDS 397
T+ I P +R S+N+S
Sbjct: 192 RTSKI---GRNPTSSARRSKNES 211
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 286 bits (732), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 225/422 (53%), Gaps = 68/422 (16%)
Query: 496 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ----------------KLLE 539
++ +KS IT KD+ LHKLVF E+ L DG VGY+ G+ K
Sbjct: 174 TYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGKVSPSKFEAHAGRASRRKPYS 233
Query: 540 GYKNGLGIICH--CCNSEVSPS---------------------QFEAHAECASLSSIPQG 576
+ G+ H N +S S Q H EC L S P+
Sbjct: 234 YIRTADGVSLHELANNRRISMSDSDERCSHCEQVGNLLWCDRCQRSFHLECIPLESPPKR 293
Query: 577 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 636
YC+YC+N F + + ++H N V GR++ D EQIT+ C K E + C LC
Sbjct: 294 KRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITEVCTLSEKQKEVKDGPCALCS 353
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN 696
DF+ + GPRT+++C QCE+EFHV CLK H MA+L ELPK KWFC +DC I+ LQ
Sbjct: 354 ERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVELPKDKWFCGIDCDDIHMKLQK 413
Query: 697 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRL--LLSQAVAIF 754
L+ + +L +D++WRLL+ K P+ + L+S+A AIF
Sbjct: 414 LMARGEAEL--------------------GLDIKWRLLNTKLNNPKHNISPLISKANAIF 453
Query: 755 HDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVA 813
H+ F I D + DLI +M+YG + GQ F GMYCA+L + AGI RV GQEVA
Sbjct: 454 HERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVLYFKKVIACAGIFRVLGQEVA 513
Query: 814 ELPLVATSK------INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
ELPLVAT+ I GYF+ LF+CIE +L L+VK++VLPAA EAES+W DKFGF
Sbjct: 514 ELPLVATTTKYQKRVILFTSGYFRSLFSCIENMLRHLKVKTLVLPAAHEAESMWIDKFGF 573
Query: 868 KK 869
K
Sbjct: 574 TK 575
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 180/486 (37%), Positives = 253/486 (52%), Gaps = 83/486 (17%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN--GLGIICHCC---------- 552
I +DQ LHKLVF E+ L DG VGY+ +K L+G N GI+C CC
Sbjct: 83 INYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQSGILCDCCKEVILEFFFC 142
Query: 553 -----NSEVSPSQFEAHAECASLS-----------------------SIPQGDWYC---- 580
+ +VSPS+FEAHA AS SI D +C
Sbjct: 143 QIKKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAINHRISISNSDEHCSKCK 202
Query: 581 --------------------------------KYCQNMFERKRFLQHDANAVEAGRVSGV 608
+YC+N ++ + ++H N V ++
Sbjct: 203 QRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIES 262
Query: 609 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
D EQI K C VK+ E E S C LC F+ F P T+++CDQCE+++HVGCLK H
Sbjct: 263 DPSEQIAKICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLKDH 322
Query: 669 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI 727
MA+L+++PK WFC +DC I+ L+N + + L + L+ IK K LET +
Sbjct: 323 NMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGL 382
Query: 728 DVRWRLLSGKAATPE--TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-E 784
D++W++ + + + T LLS V IFH+ FD IV + + DLIP+MV GR ++ +
Sbjct: 383 DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKDKYY 442
Query: 785 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 844
FGGMYCA+L VN VVSAGI RVFG+EVAEL L+AT +G+F+ L +CIE +L L
Sbjct: 443 FGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIENVLKEL 502
Query: 845 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKG---TSMLQKRVPAC 901
+V+ +VLPAA EAES+W DKFGF + + L Y +R KG + L ++ C
Sbjct: 503 KVERLVLPAAHEAESMWIDKFGFTEPNQGLGRRYYRRSWSFHLNKGVEQNTHLTGKLEFC 562
Query: 902 RIGSSS 907
+ +SS
Sbjct: 563 QKCASS 568
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 158/381 (41%), Positives = 210/381 (55%), Gaps = 75/381 (19%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
++C C P F H EC ++IP+G W C+YC+N +R+ L ++ NA+ AGR+
Sbjct: 463 LLCDSC-----PRAF--HRECVGFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRID 515
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGC---------------------------D 639
G+D +EQI R IRI GC LC D
Sbjct: 516 GIDPMEQIFTRSIRIATTPVTGFGGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHD 575
Query: 640 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 699
FSK F RT+LLCDQ LP+G W+C DC RI+ L++LL
Sbjct: 576 FSKKKFSARTVLLCDQA-------------------LPEGAWYCTADCVRISETLKDLLS 616
Query: 700 QEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 758
+ AE + + IK KY +L D+DVRWR+L K++ +++L+LS+AVAIFH+ F
Sbjct: 617 RGAEPISSVDVEIIKRKYEQKALNKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESF 675
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 818
DPI+ +GRDLIP+MVYG ++VVSAG+ RV G E+AELPLV
Sbjct: 676 DPIIQIATGRDLIPAMVYG-------------------NTVVSAGLFRVMGSEIAELPLV 716
Query: 819 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 878
ATS+ + G GYFQ LF CIE+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y
Sbjct: 717 ATSRDSQGLGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREY 776
Query: 879 RKRCSQLVTFKGTSMLQKRVP 899
K + F+GTS L K VP
Sbjct: 777 LK-GGRTTVFQGTSTLHKLVP 796
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 202
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 203 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 262
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 263 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 316
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 317 RRSKNESPGSKYCNS 331
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 109/235 (46%), Gaps = 45/235 (19%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 301 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 358
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSS 572
HKL F LP+GT+VGYY G++LL+GY GI CHCCN+ VSPSQFEAHA A+
Sbjct: 359 HKLAFMSGVLPEGTDVGYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRK 418
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
P + Y ++ E L N + N +++
Sbjct: 419 -PYHNIYMSNGVSLHELSVSLSKGRN-----------------------MSNRQSD---- 450
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC C S G +LLCD C R FH C+ +P+G W CC C
Sbjct: 451 DLCSIC--SDGG----ELLLCDSCPRAFHRECV------GFTTIPRGTW-CCRYC 492
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/230 (63%), Positives = 172/230 (74%), Gaps = 4/230 (1%)
Query: 150 GSEVSNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKK 209
G SN K+ K S +K K++ VEV V E E++S ++VE IAEGSALT PKK
Sbjct: 10 GEPNSNNRPKRVTK--SKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKK 67
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
NLELKMSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+
Sbjct: 68 NLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCA 127
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CNG RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S
Sbjct: 128 FCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAIS 187
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
LP E++F C RCKG FP CVGK GPLCN C +SK+ T+T + I
Sbjct: 188 GLPVERTFTCKRCKGIFPSICVGKI--GPLCNLCAESKESHPTLTIGSSI 235
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 141/271 (52%), Positives = 180/271 (66%), Gaps = 6/271 (2%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 693
LCR DF+ + F RT++LCDQCE+E+HVGCLK +L+ELP+G+WFCC CS S
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 694 LQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 752
L ++ A+ L E L IKK + L + D++W+LLSGK T + +LLS AV
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRTTEDGSILLSAAVP 131
Query: 753 IFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGILRVF 808
IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA +LRV
Sbjct: 132 IFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVM 191
Query: 809 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 868
G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KFGF
Sbjct: 192 GGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFGFS 251
Query: 869 KIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+ PE L Y + L F GTS L K VP
Sbjct: 252 RTTPEELEAYLNG-AHLTIFHGTSYLYKAVP 281
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 272 bits (696), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 142/277 (51%), Positives = 183/277 (66%), Gaps = 12/277 (4%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 690
G L R DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP+G+WFCC CS
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 691 NSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 749
S L ++ A+ L E L I KK+ L + D++W+LLSGK AT E +LLS
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKRATEEGSILLSA 174
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGIL 805
AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA ++
Sbjct: 175 AVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALM 234
Query: 806 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 865
RV G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KF
Sbjct: 235 RVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMKKF 294
Query: 866 GFKKIDPELLSIYRKRC---SQLVTFKGTSMLQKRVP 899
GF +I PE L + C + L F GTS L K VP
Sbjct: 295 GFSRIPPEEL----EACLNGAHLTIFHGTSYLYKAVP 327
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 268 bits (686), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 123/196 (62%), Positives = 157/196 (80%)
Query: 713 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 772
+KK LE + +IDVRWRLL+G+ A+PET+ LL +AV++FH+CFDPIVD +GRDLIP
Sbjct: 1 MKKQEERCLEPLREIDVRWRLLNGRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIP 60
Query: 773 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 832
+MV GRNLR Q+FGGMYCA+L VNSSV SA +LR+FG ++AELPL+AT N GKGYFQ
Sbjct: 61 AMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQT 120
Query: 833 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTS 892
LF+CIE+LLSFL VK++VLPAAEEAESIW KFGF K++P+ L+ YRK Q++ FKGT
Sbjct: 121 LFSCIERLLSFLSVKNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTV 180
Query: 893 MLQKRVPACRIGSSST 908
ML K VP CR+ ++ +
Sbjct: 181 MLHKTVPQCRVTNTQS 196
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 254 bits (648), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 190/342 (55%), Gaps = 52/342 (15%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECA---------------SLSSIP----- 574
QK+L GYK G GI+C CC E+SPSQFE+HA + +L I
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 575 ------------------QGD-WYCKYCQNMFERKRFLQHDA--------NAVEAGRVSG 607
GD +C+ C F HD N + G G
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNKLGH--G 134
Query: 608 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 667
+ I R R+VK E ++ GC +CR DFS F RT++LCDQCE+EFHVGCL++
Sbjct: 135 GNFARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRE 194
Query: 668 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVS 725
+ DL+E+PK WFCC DC+ I L+N + + +P LN I K L +
Sbjct: 195 SGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLNIINRKHVEKGLLVDEA 254
Query: 726 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 785
DV+W++L GK+ E LLS A AIF +CFDPIV + +GRDLIP MVYGRN+ GQEF
Sbjct: 255 AYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDPIV-AKTGRDLIPVMVYGRNISGQEF 313
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 827
GGMYC +LTV VVSAG+LR+FG+EVAELPLVAT++ + GK
Sbjct: 314 GGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGK 355
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 251 bits (642), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 191/346 (55%), Gaps = 72/346 (20%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 430 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 482
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLS----------------------- 571
++LL G+K G I+C CCNSEVSPSQFEAH+ AS
Sbjct: 483 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 542
Query: 572 ----SIPQGDWYCKYC-----------------------QNMFERKRF------------ 592
SI D C C +N+ + K F
Sbjct: 543 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 602
Query: 593 -LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 651
++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 603 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 662
Query: 652 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 711
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 663 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 722
Query: 712 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFH 755
IKK G S + D++WRLL G +A+ E LLSQA+++FH
Sbjct: 723 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFH 768
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 127 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 181
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 182 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 235
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 236 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 273
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 274 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 333
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 334 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 392
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 393 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 447
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 198/333 (59%), Gaps = 15/333 (4%)
Query: 575 QGDWYCKYC-QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 633
+G WYC+ C Q+ + + Q + + G + ++ EQ RCIR + E+ GC
Sbjct: 983 EGRWYCRMCRQDSLKVAQNGQKGSEKIMEGMSNIAETNEQ--GRCIRHLDGPR-EVGGCA 1039
Query: 634 LCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 692
+C+ + SK+GF TIL+CDQC RE+HV CLK + DL ELP G+WFC DC I+
Sbjct: 1040 ICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDCKVIDE 1099
Query: 693 VLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDIDVR--WRLLSGKAATPETRLLL 747
+L L+ E L ++ + ++ ++ E + + R W++L GK ++P L
Sbjct: 1100 ILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSPADVQTL 1159
Query: 748 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 807
++A IF DC DPI D +G++LIP MV R + +F G++C +L +N VVSA +L++
Sbjct: 1160 AEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVVSAALLQI 1219
Query: 808 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
FG+E+AE+PL+ATS +HG+ + + L IE+LL L V+ +VLP A+ ES+W +KFGF
Sbjct: 1220 FGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESVWINKFGF 1279
Query: 868 KKIDPELLSIYRKRCS--QLVTFKGTSMLQKRV 898
++ + L + C+ +L TF GTSM+ K +
Sbjct: 1280 SRVQEDQL---KSICTTIRLTTFTGTSMVVKAI 1309
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 31/53 (58%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
+D LHK +F GL DGTE+GYY Q L+G K G GI C CCN E Q
Sbjct: 779 RDAHLHKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNEETRHVQ 831
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 12/129 (9%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L + P EL T L++G V GI+ L G+++D G+ C+C C G ++
Sbjct: 563 LVRAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVQCNCRNCKGSVIVSI 617
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 618 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 676
Query: 335 KSFACVRCK 343
+ C +C+
Sbjct: 677 RKVTCAKCE 685
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 136/422 (32%), Positives = 208/422 (49%), Gaps = 98/422 (23%)
Query: 576 GDWYCKYCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-EL 629
G+++C CQ +R+R + + G + + +++ RC R+++ EA L
Sbjct: 1059 GEFFCPDCQEQRFGGTKDRRRSMTKRRSK---GAAKTLLTKDRMIGRCSRLLQVPEAIVL 1115
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG-------------------------- 663
GC+ C+ DF+K+GFGP+T LLCDQCERE+HVG
Sbjct: 1116 GGCVFCKSGDFAKTGFGPKTTLLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGE 1175
Query: 664 ----CLKKHKMAD------------------------LRELPKGKWFCCMDCSRINSVLQ 695
C+ ++ D +ELP+G+WFC DC I+S+L
Sbjct: 1176 VVFRCVATVRLRDCDRGCESQGIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILS 1235
Query: 696 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI------DVRWRLLSGKAATPETRLLLSQ 749
L+ E L + ++ + + LE D W+LL G+ P L++
Sbjct: 1236 LLVSNGPEPLADSIISKVLRTNQARLERSEDATESSCSGFEWQLLHGRGGDPSNGKALAE 1295
Query: 750 AVAIFH----------------------------DCFDPIVDSISGRDLIPSMVYGRNLR 781
AV IF +CFDPI D +SG DLIP MVY R+LR
Sbjct: 1296 AVQIFSVRNLSDPGFPVRTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLR 1355
Query: 782 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 841
Q+FGG+YC +L ++ VVS ++RVFG+++AELPL+AT+ + G+G+ + L IE+LL
Sbjct: 1356 DQDFGGIYCVVLKYDNRVVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLL 1415
Query: 842 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 901
LRV+ + LPAAE AE IW +KFGF+++ + + + +V F G+ ML+K +P
Sbjct: 1416 GVLRVERLALPAAEGAEGIWLNKFGFRRMAEGQVKQFHSDLNMMV-FTGSFMLEKEIPPL 1474
Query: 902 RI 903
I
Sbjct: 1475 EI 1476
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 127/291 (43%), Gaps = 43/291 (14%)
Query: 58 VKRSRFSNSDDLLEDDVIDKR----------INSKIHEGRINKVVKNVL-------NENG 100
V SR + + L DD +D R N + E + ++ + V+ NG
Sbjct: 386 VAGSRNDATSNPLSDDGLDGREQDGNGGKGLTNGTVEEDKCTELFEPVVLLPEHSGGTNG 445
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKK 160
+ES V+ E+ E+ + G A SK FP+ G + + ++++
Sbjct: 446 FVESAVDSEDLSSDPAREDSLGNQELGGSAGAAVAS--SKDLEFPQHSGSEKRGDSMDRQ 503
Query: 161 C--LKRPSA----------MKPKVEPVEVLVTQSE---------GFGNESMSLIEVEAIA 199
+ PSA +K V P+++ + +G + S ++++
Sbjct: 504 DSDVATPSAATSAQSRSPGLKVGVVPLKIWLWAVGWLGGWAWLVSYGADGNSRLKLKGQG 563
Query: 200 EGSALTSPKKNLELK-MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGI 258
+ S SP N + K++ L + P + L ++GLLDG V YMG + L GI
Sbjct: 564 DVSMRQSPTLNGARGFVVKEVLLKEAPASAKLLLQSGLLDGHHVRYMG--RGGHIMLTGI 621
Query: 259 IRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVL 309
I++GG+LC CS C G +V+ S FE HA R S +I ENGK L ++L
Sbjct: 622 IQEGGVLCDCSSCKGVQVVNVSAFEKHAGSSARHPSDFIFLENGKCLKDIL 672
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 49/102 (48%), Gaps = 22/102 (21%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D TEVGYY GQK L G K G GI+
Sbjct: 784 ETASPPVLSRESSGAN-------LHKALFLPGGLEDDTEVGYYVKGQKSLAGVKKGAGIL 836
Query: 549 CHCC-------------NSEVSPSQFEAHAECASLSSIPQGD 577
C CC ++ V S +E A + SS+P GD
Sbjct: 837 CSCCQQVGWYRKSLILMDNMVCGSMYELVAPYS--SSMPDGD 876
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 222 bits (566), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 185/324 (57%), Gaps = 24/324 (7%)
Query: 570 LSSIPQGDWYCKYCQN------MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 623
L + G WYC+ C+ E+KR +H + G+ ++ + +R R V+
Sbjct: 720 LETDSDGKWYCRMCRQDTLNVAQIEQKRSNKH---------IEGMSNIAETDER-DRCVR 769
Query: 624 NLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
+LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK + DL ELP+G+
Sbjct: 770 HLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPEGE 829
Query: 681 WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-----KKYAGNSLETVSDIDVRWRLLS 735
WFC DC I+ +L L+ E L + ++ + ++ S W++L
Sbjct: 830 WFCQKDCKVIDEILTQLVANGPELLTDSIISELLESRQQQTGAKDKAESSCPSFAWQILC 889
Query: 736 GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTV 795
GK+ L++A+ IF +C DPI D+ +G++LIP MV R + +F G++C +L +
Sbjct: 890 GKSGNTANTQTLAEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVFCIVLKL 949
Query: 796 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 855
N VVSA +L++FG E+AE+PLVATS + G+G+ + L IE+LL L V+ +VLP A+
Sbjct: 950 NEKVVSAALLQIFGGEIAEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERLVLPTAK 1009
Query: 856 EAESIWTDKFGFKKIDPELLSIYR 879
ESIW +KFGF ++ + S R
Sbjct: 1010 NTESIWINKFGFSRVPDDEGSFLR 1033
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 75/178 (42%), Gaps = 31/178 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C ++
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVRCDCRNCKSSVIVSI 441
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 442 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 500
Query: 335 KSFACVR-----------CKGT-----FPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
C + CKG C+G + P C+ C K KKP T+
Sbjct: 501 WKVTCAKCWNSDEGDLIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATV 558
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 218 bits (556), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 114/301 (37%), Positives = 179/301 (59%), Gaps = 12/301 (3%)
Query: 578 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA--ELSGCLLC 635
WYC+ C+N + + + V S+ +I +R R +++LE E+ GC +C
Sbjct: 633 WYCRMCRNDTLK---VAQNGQKVSGKHQEESSSITEIDERG-RCIRHLEGHREVGGCAIC 688
Query: 636 RGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVL 694
+ + SK+GF TIL+CDQC RE+HV CLK M +L ELP+G+WFC C I+ +L
Sbjct: 689 KKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGMDNLNELPEGEWFCQKGCKVIDEIL 748
Query: 695 QNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDID--VRWRLLSGKAATPETRLLLSQ 749
L+ E L ++ + ++ +E I W++L GK ++P L++
Sbjct: 749 TQLVAIGPESLSHSIISELPENRQQKSGVIEKAESISPSFEWQILCGKGSSPANIQTLAE 808
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFG 809
AV IF +C DPI D+ +G++LIP MV R + +F G++C +L +N VVSA +L++FG
Sbjct: 809 AVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDYDFEGVFCVVLKLNGKVVSAALLQIFG 868
Query: 810 QEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK 869
+E AE+PLVATS + G+G+ + L IE+LL L V+ +VLP A++ ES+W +KFGF +
Sbjct: 869 REFAEVPLVATSLPHQGQGFCKALMTTIERLLGVLSVERLVLPTAKDTESLWVNKFGFSR 928
Query: 870 I 870
+
Sbjct: 929 V 929
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/64 (48%), Positives = 37/64 (57%), Gaps = 4/64 (6%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS----QFEA 563
+D LHK +F GL +GTE+GYY Q L+G K G GI C CCN E S +FE
Sbjct: 487 RDAHLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQ 546
Query: 564 HAEC 567
HA C
Sbjct: 547 HAGC 550
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 69/177 (38%), Gaps = 29/177 (16%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C G ++
Sbjct: 286 LLKAPRNAKELMATRLMEGHHVRCSCRGIQ-----LTGMLKDMGVQCDCRNCRGSVIVSI 340
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRA------CRSVPLPMLKATL--------- 324
S FE H+ S I ENGK+L ++L A C L LK +
Sbjct: 341 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDVQGVEKS 400
Query: 325 --QSALSSLPEEKSF---ACVRCKGTFPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
+ A PEE RC CV P C C K+KKP ++
Sbjct: 401 KSKCAKCGNPEEGDLIYCKGARCSVVAHSGCVEIANPHLGDWFCGKCEKTKKPHASV 457
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 99/133 (74%), Positives = 116/133 (87%)
Query: 774 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 833
MVYGRN+RGQEFGGMYCA+L VNSSVVSA +LR+FG +VAELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 834 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 893
F+CIE+LL+F+ VKS+VLPAAEEAESIWTDKFGF ++ P+ LS YRK +Q+VTFKGT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 894 LQKRVPACRIGSS 906
L K VP CRI S+
Sbjct: 121 LHKLVPPCRIISN 133
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 212 bits (540), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 186/332 (56%), Gaps = 19/332 (5%)
Query: 576 GDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI--TKRCIRIVKNLEAELSGCL 633
G CK CQ E A+ RV ++ T R +R+ + ++ SGC
Sbjct: 881 GHPLCKICQKNVE-------GAHKTSKKRVDATANIPATDDTGRNVRLFQAPDSS-SGCA 932
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 693
+C+ K GF T+L+CDQC RE+HVGCL++ + D ELP+ +W+C +C I V
Sbjct: 933 ICKKWTLKKCGFD-MTMLVCDQCGREYHVGCLRESGILD--ELPEAEWYCQPNCQHIVQV 989
Query: 694 LQNLLVQEAEKLPEFHLNAI---KKYAGNSLETV--SDIDVRWRLLSGKAATPETRLLLS 748
L L+ E L + +N + +++ +E S W++L G P L+
Sbjct: 990 LSQLVANGPELLSDNIVNDLLESRQHQQGIVEMAESSSPVFGWQILHGAGENPVNGRTLA 1049
Query: 749 QAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVF 808
QAV IF +C DPI D+ SG+++IP MVY R + +F G+YC +LT+N VVS +L++F
Sbjct: 1050 QAVEIFTECSDPIKDAPSGQNMIPIMVYSRRFKDYDFDGIYCVVLTLNEKVVSTALLQIF 1109
Query: 809 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 868
G+EVAE+PL+ATS + +G+ + L IE+LL L V+ +VLPA++ AE +W ++FGF
Sbjct: 1110 GREVAEVPLIATSVDHQDQGFCKALMTTIERLLGVLNVERLVLPASKNAEFVWVNRFGFS 1169
Query: 869 KIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
+++ L R LV F GT+ML K + A
Sbjct: 1170 RMEDAQLKHIRSMMGLLV-FTGTTMLVKHIDA 1200
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 42/64 (65%)
Query: 506 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 565
+ +D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E+S S FE HA
Sbjct: 738 SARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEISCSAFEQHA 797
Query: 566 ECAS 569
C S
Sbjct: 798 GCES 801
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRA--- 311
L GI++D G++C+C +C G +V+ S FE H+ S I ENGK+L ++L A
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 312 ---CRSVPLPMLKATLQSALSSLPEEKSFAC----------------VRCKGTFPITCVG 352
C L LK + + +P +K AC +C + CVG
Sbjct: 630 SADCGGDILGALKHAI-GEIQGIP-KKEGACGKCGKREGGDFVSCKEPKCSAVYHAECVG 687
Query: 353 KTGPGPLCNSCVKSKKPQGTMTYT 376
P + C K +K Q M T
Sbjct: 688 LPSPHRVDWFCAKCEKAQVKMPKT 711
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 98/130 (75%), Positives = 116/130 (89%)
Query: 774 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 833
MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 834 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 893
F+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 894 LQKRVPACRI 903
LQK VP+CR+
Sbjct: 121 LQKTVPSCRV 130
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 147/421 (34%), Positives = 203/421 (48%), Gaps = 67/421 (15%)
Query: 513 HKLVF--DE-SGLPDGTEVGYYAC-GQKLLEGY-------KNGLGIICHCCNSEVSPSQF 561
HK +F DE GL DG V Y G++LL+G GI+C CCN +S SQF
Sbjct: 445 HKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCACCNGVISCSQF 504
Query: 562 EAHAECASLSSIPQGDWYCKYCQNMFERKRF-LQHDANAVEAGRVSGVDSVEQITKRCIR 620
EAHA +G Y N+F L+ A + A S +
Sbjct: 505 EAHAG--------RGSRRAPY-DNIFTAAGVSLRKLACLMPASEAESPISHRPAALCAVA 555
Query: 621 IVKNLEAEL----------SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ LE EL GC+LC+ DF + GFG RT+++CDQCERE+H+GCL +H
Sbjct: 556 DRRALEPELVTVSGEAALHGGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLAEHGR 615
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA-------IKKYAGNSLET 723
A L ELP+GK + L ++L+ HLN K A E
Sbjct: 616 AHLTELPEGK-----------ASLYDILLT-------LHLNGEWHCSPECKGIATRMRER 657
Query: 724 VSDIDV------RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 777
VS + V W++L GK T T L A I + FDPI+D ++G DL+ +MVY
Sbjct: 658 VSSVPVPLQGEYSWQVLRGKDGTHATTWALKAAQEILTESFDPILDLVTGADLMMAMVYA 717
Query: 778 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
+ L ++ GMY A+L V + + RVFG+++AE+PLVAT +G+ ++L A
Sbjct: 718 QELGDWDYTGMYTAVLRRRGKAVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLMAAF 777
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSMLQ 895
E L V+S+ LPAA+ W FGF I PE + CS+L + F GT +LQ
Sbjct: 778 EDYFRSLGVQSLCLPAAQSTVETWIHGFGFAAITPEEQAAT---CSELRVLIFPGTELLQ 834
Query: 896 K 896
K
Sbjct: 835 K 835
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 195 bits (496), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 196/706 (27%), Positives = 295/706 (41%), Gaps = 104/706 (14%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+R+ R+ + D + + + S S G +ST TS
Sbjct: 281 ------KRRSGRRFRQGDTGTSTPTFS-----------------GSPGKGGFGHSTD-TS 316
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
K ++ + ++ K T +P N S +KS+ + T +D LH L+F E G
Sbjct: 317 EKKGTEETHRLSLSSPVKIT---QRPLRNCS----IDSKSKESKT-RDTTLHPLIFKEDG 368
Query: 522 LPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYC 580
L D T + Y G+ L +GYK G IIC+CCN E SPS FE HA P + Y
Sbjct: 369 LADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQ-PYHNIYT 427
Query: 581 KYCQNMFERKRFLQH--DANAVEAGRVSGVDSVEQITKR---------------CIRIVK 623
++ + LQ + N + VS V +T R ++
Sbjct: 428 LEGLSLHKLALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGREPSTTSGPIVPLKRTLQ 487
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADLRELP 677
E C C + P TI+ C+QCER H+ C KK + L+E
Sbjct: 488 ERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYM 547
Query: 678 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK 737
+ CC +C + + L+ ++K G + ++ WRLLSG
Sbjct: 548 CFHFLCCQECQSLRARLE---------------EGLEKCVGITFLRRIRSNICWRLLSGM 592
Query: 738 AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVN 796
A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMYCA+LT +
Sbjct: 593 DASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTAS 650
Query: 797 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 856
+ VVSA IL+V +++AEL L+AT KGYF LL IE L V + P E
Sbjct: 651 THVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPE 710
Query: 857 AESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQKRV 898
IW++K GF + E +L + LV FK ++QK +
Sbjct: 711 MAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQKSL 751
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 201/719 (27%), Positives = 299/719 (41%), Gaps = 95/719 (13%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP----GLIA 454
+ + R ++ S S K S K + + N+ P G+ +
Sbjct: 281 KRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDGVKS 340
Query: 455 NS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRWNITPK 508
+S T VT+ H S + + +S P + PL S +KS+ + T +
Sbjct: 341 DSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKSKESKT-R 390
Query: 509 DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS FE HA
Sbjct: 391 DTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGM 450
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQH--DANAVEAGRVSGVDSVEQITKR-------- 617
P + Y ++ + LQ + N + VS V +T
Sbjct: 451 GRRRQ-PYHNIYTLEGLSLHKLALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGREPST 509
Query: 618 -------CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL----- 665
R ++ E C C + P TI+ C+QCER H+ C
Sbjct: 510 TSGPIVPLKRTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVV 569
Query: 666 -KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 724
KK + L+E + CC +C + + L+ ++K G +
Sbjct: 570 KKKVPLEILKEYMCFHFLCCQECQSLRARLE---------------EGLEKCVGITFLRR 614
Query: 725 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQ 783
++ WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + +
Sbjct: 615 IRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEK 672
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
+F GMYCA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L
Sbjct: 673 DFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRA 732
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK----RCSQLVTFKGTSMLQKRV 898
V + P E IW++K GF +LS K LV FK ++QK +
Sbjct: 733 WNVSLLTAPVDPEMAQIWSEKLGFT-----ILSAEEKESMLESHPLVMFKNLVLVQKSL 786
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 137/419 (32%), Positives = 195/419 (46%), Gaps = 56/419 (13%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAE 566
+D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE HA
Sbjct: 170 RDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAG 229
Query: 567 CASLSSIPQGDWYCKYCQNMFERKRFLQH---DANAVEAGRVSGVDSVEQITKR------ 617
P + Y + + L H ++N VS +T
Sbjct: 230 MGKRRQ-PYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPNLTSSGCGKEP 288
Query: 618 -----CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PRTILLCDQCEREFHVGCL---- 665
+ + + L+ + C C + + G P I+ C+QCER HV C
Sbjct: 289 SVSGPIVPLKRTLQERVVQTESCYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRV 348
Query: 666 --KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF-HLNAIKKYAGNSLE 722
KK + L++ + CC +C + L+ + EK E L I+
Sbjct: 349 VKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGM-----EKCEEIAFLGRIRS------- 396
Query: 723 TVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG 782
++ WRLLS A+ + +L LSQ + IF D F ++S I MVYG+N G
Sbjct: 397 -----NICWRLLSSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGKNREG 448
Query: 783 Q-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 841
+ +F GMYC +LT ++ VVSA IL+V + AEL L+AT KGYF+LL IE L
Sbjct: 449 EKDFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNL 508
Query: 842 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CSQLVTFKGTSMLQK 896
V ++ P E IW+DK GF +LS K+ LV FK ++QK
Sbjct: 509 RACNVSLLMAPVDPEMAQIWSDKLGFT-----ILSADEKKSMLESHPLVMFKNLVLVQK 562
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 152/483 (31%), Positives = 218/483 (45%), Gaps = 60/483 (12%)
Query: 447 SSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRW 503
SS G+ S + K ++ R + T S P +A PL S + SR
Sbjct: 76 SSDKGISGLSAGTSKEKKGTEETRSAQNTGDLGLISSSSSPVTSAQRPLPSSSVGSNSRE 135
Query: 504 NITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFE 562
+ +D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE
Sbjct: 136 S-KKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFE 194
Query: 563 AHAECASLSSIPQGDWYCKYCQNMFERKRFLQH---DANAVEAGRVSGVDSVEQITKR-- 617
HA P + Y + + L H ++N VS +T
Sbjct: 195 KHAGMGKRRQ-PYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVSSFSDYPNLTSSGC 253
Query: 618 ---------CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PRTILLCDQCEREFHVGCL 665
+ + + L+ + C C + + G P I+ C+QCER HV C
Sbjct: 254 GKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCY 313
Query: 666 ------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF-HLNAIKKYAG 718
KK + L++ + CC +C + L+ + EK E L I+
Sbjct: 314 NSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGM-----EKCEEIAFLGRIRS--- 365
Query: 719 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 778
++ WRLLS A+ + +L LSQ + IF D F ++S I MVYG+
Sbjct: 366 ---------NICWRLLSSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGK 413
Query: 779 NLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
N G+ +F GMYC +LT ++ VVSA IL+V + AEL L+AT KGYF+LL I
Sbjct: 414 NREGEKDFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESI 473
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CSQLVTFKGTSM 893
E L V ++ P E IW+DK GF +LS K+ LV FK +
Sbjct: 474 ETNLRACNVSLLMAPVDPEMAQIWSDKLGFT-----ILSADEKKSMLESHPLVMFKNLVL 528
Query: 894 LQK 896
+QK
Sbjct: 529 VQK 531
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 172 bits (437), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 95/280 (33%), Positives = 145/280 (51%), Gaps = 30/280 (10%)
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 689
+ C+LC DF + GF +T+L+CDQCE+E+H+GCL++HKM D++ +P+G+WFC +C R
Sbjct: 1510 AACVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVR 1569
Query: 690 INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 749
I L + +P GN RW++L GK +T LS
Sbjct: 1570 IRDALGEDVAAGEVLMP-----------GNPA-------YRWQILRGKNGRQQTWHALST 1611
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-------TVNSSVVSA 802
+ I + FDPI+D+ SG DL+P+MV +F GMY +L V A
Sbjct: 1612 VLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVLA 1671
Query: 803 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 862
+ RV G +AE+PLVAT +G+ + L + L L V++IVLPA +A+ W
Sbjct: 1672 ALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAWR 1731
Query: 863 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---RVP 899
+ F+ +D + R ++V F T++L + RVP
Sbjct: 1732 -QLQFQDLDEPSTRVARSE-HRMVIFPHTTVLARPLIRVP 1769
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 134/432 (31%), Positives = 200/432 (46%), Gaps = 46/432 (10%)
Query: 493 PPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHC 551
P S KS+ + T +D LH L+F E GL D T + Y G+ L +GYK G GIIC+C
Sbjct: 91 PNYSIGTKSKESKT-RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNC 149
Query: 552 CNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQH--DANAVEAGRVSGVD 609
C+ E +PS FE HA P + Y + + LQ ++N + VS
Sbjct: 150 CSQEFAPSHFEEHAGMGRRRQ-PYHNIYTPEGSTLHKLALQLQDHLNSNGFDNASVSSFS 208
Query: 610 SVEQITKR-------------CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PRTILLC 653
+T + + + L+ + C C + + G P I+ C
Sbjct: 209 DYPNLTSASGCGRQPSTTSGPIVPLKRTLQGRVVETESCYFCGYGHTTIGNIDPDMIIFC 268
Query: 654 DQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 707
+QCER HV C KK + L+E ++ CC +C +L++ L + EK E
Sbjct: 269 NQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQEC----QLLRDRLEEGLEKCEE 324
Query: 708 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 767
+++ N + WRLLSG A+ + +L + Q + IF D F D S
Sbjct: 325 IAF--LRRIRSN---------ICWRLLSGMDASRDVKLFMPQVIDIFKDAFVESTDEHS- 372
Query: 768 RDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
D+ MV +N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT +
Sbjct: 373 -DIFSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRMEQIAELVLIATRRECRK 431
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 886
KGYF LL IE L V + P E IW++K GF + E + LV
Sbjct: 432 KGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESVLES-HPLV 490
Query: 887 TFKGTSMLQKRV 898
FK ++QK +
Sbjct: 491 MFKNLVLVQKSL 502
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 162 bits (409), Expect = 1e-36, Method: Composition-based stats.
Identities = 101/303 (33%), Positives = 159/303 (52%), Gaps = 31/303 (10%)
Query: 609 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 667
DSV + + + L A+L S C+LC +F + GF +T+L+CDQCE+E+H+GCL+K
Sbjct: 1762 DSVTLTGRDEEHVSEALAADLASSCVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRK 1821
Query: 668 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 727
HKM D++ +P+G+WFC +C RI +L L +E E +GN
Sbjct: 1822 HKMVDMQAVPEGEWFCSDECVRIRELLTKSL-EEGE----------TTMSGNPA------ 1864
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 787
RW+ + G+ T T L + I + FDPI+D+ SG DL+P MV+ + +F G
Sbjct: 1865 -YRWQFIRGRDGTKATARALKTVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQG 1923
Query: 788 MYCAILTVNSS-------VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 840
MY +L + V AG++RV G +AE+PLVAT +G+ + L +
Sbjct: 1924 MYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHR 1983
Query: 841 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---R 897
L L V+++VLPA +A W + F +D + + R +++ F TS++ + R
Sbjct: 1984 LIALGVRAMVLPATADALPAWR-QLAFMDLDEGSVRVARGE-HRMIIFPHTSVVVRQLIR 2041
Query: 898 VPA 900
VP
Sbjct: 2042 VPG 2044
Score = 41.2 bits (95), Expect = 2.9, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 522 LPDGTEVGYYACGQKLLEGY------KNGLGIICHCCNSEVSPSQFEAHA 565
L DG V Y GQ+LL G G GI+C CC+ +S S FE+HA
Sbjct: 1535 LQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHA 1584
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 184/696 (26%), Positives = 286/696 (41%), Gaps = 97/696 (13%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
V L TGLL+G V Y K G I G C CS C ++ +FE H+
Sbjct: 246 VRGLLSTGLLEGFRVTY----KKNEVERIGRINGQGYSCGCSECGYRNIMNACEFEQHSG 301
Query: 288 KQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFP 347
+ + +I ++G SL V++ + L ML + +S P + + K +F
Sbjct: 302 ESSNNQNNHIFLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYE--KWKASFQ 359
Query: 348 IT-CVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNK 406
+ P + C + + T ++ S+ N S +++ + +
Sbjct: 360 LEKDYFDDAPS---DPCSTQSSQESNIALTDSLK----------DSTSNASSILNWSSFR 406
Query: 407 RERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSS 466
R + K R +T +P L+R+ + I++ + TS+ KS
Sbjct: 407 RRSDRQFK-----------------RGGTETSTPI-LSRSPEKE--ISDLSTSTSM-KSE 445
Query: 467 QSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK--DQRLHKLVFDESGLPD 524
++ + + +I P P + + S+ PK D LH ++F E GLPD
Sbjct: 446 ETPSENTAGLLTTDVTVIQDP----PPDHNVDSNSKDLGQPKVRDNTLHPMLFKEGGLPD 501
Query: 525 GTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHAECASLSSIPQGDWYCK 581
T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H P Y
Sbjct: 502 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQ-PYRSIYTS 560
Query: 582 YCQNMFERKRFLQH----DANAVEAGRV-SGVDSVEQITKRCI-----RIVKNLEAELSG 631
+ E LQ + N E + SG T R I R ++ +
Sbjct: 561 DGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKEYSTTSRPIIVPLKRTLQERVLTVES 620
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC----LKKHK--MADLRELPKGKWFCCM 685
C +CR I+ C+QCER HV C L+K K + L E + + CC
Sbjct: 621 CYMCRKPHTVLGVISVDMIVFCNQCERALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCE 680
Query: 686 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRL 745
C + + L +KK + ++ W+LL+G + +
Sbjct: 681 KCQLLRASLH---------------EGLKKREDIAFLRRIRYNICWQLLNGTNMRSDVQ- 724
Query: 746 LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGI 804
Q + IF D F + D I +MV ++ G+ +F G+YCA+LT ++ VVSA I
Sbjct: 725 --HQVIEIFKDAFAET--APQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAI 780
Query: 805 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 864
L+V +EVAEL L+AT KGYF LL + IE L V+ + P E IW++K
Sbjct: 781 LKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEK 840
Query: 865 FGFKKIDPELLSIYRKR----CSQLVTFKGTSMLQK 896
G+ +LS +K LV F S++QK
Sbjct: 841 LGYT-----ILSDEQKHSMLMAHPLVMFANLSLVQK 871
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 101/139 (72%), Gaps = 4/139 (2%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGP 356
C RCKG FP +C+ + GP
Sbjct: 118 TCKRCKGRFPSSCMERVGP 136
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 68/93 (73%), Positives = 83/93 (89%)
Query: 811 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
++AELPLVATS NHGKGYFQ LF+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 871 DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 228
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 116/417 (27%), Positives = 191/417 (45%), Gaps = 88/417 (21%)
Query: 575 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA--ELSGC 632
+G WYC+ C+ + + Q+ + + G+ ++ + ++ R V++LE E+ GC
Sbjct: 852 EGRWYCRMCRQ--DSLKVAQNGHKGTDK-IIEGMSNIAETDEKG-RCVRHLEGPREVGGC 907
Query: 633 LLCRGCDFSKSGF-GPRTILLCDQCEREFHV----------------GCLKKHKMADLRE 675
+C+ + SK+GF TIL+CDQ + L + +E
Sbjct: 908 AICKKWNLSKTGFVDGMTILVCDQVRSLNQMLPGTRITWKVNGFTDPNILMQRHCITSQE 967
Query: 676 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR----- 730
LP+G+WFC DC I+ +L L+ PE N+I S + S + V+
Sbjct: 968 LPEGEWFCQKDCKVIDEILTQLVANG----PELLSNSIISELLESRQQQSSVKVKLESSN 1023
Query: 731 ----WRLLSGKAATPETRLLLSQAVAIF-------------------------------- 754
W++L G+ + L++A IF
Sbjct: 1024 PRFGWQILCGEGGSSANVQTLAEAANIFTSIDDINLPYLWLVVGNYSPSTHTPVSSTGMF 1083
Query: 755 ---HDC-----FDPIVDSISGRDL----IPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 802
C FDP S D+ + +V R + +F G++C +L +N VVSA
Sbjct: 1084 GSNQGCKNWKEFDPSHGSECHEDIHSLNVALVVCSRRAKDHDFEGVFCVVLKLNEKVVSA 1143
Query: 803 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 862
+L++FG+E+AE+PL+ATS + G+G+ + L IE+LL L V+ +VLP A+ ESIW
Sbjct: 1144 ALLQIFGREIAEVPLIATSLPHQGQGFCKALMTTIERLLGVLSVERLVLPTAKNTESIWI 1203
Query: 863 DKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPAC---RIGSSSTDSTECV 914
+KFGF ++ + L ++ C+ +L+TF GT ML K + RI S D C+
Sbjct: 1204 NKFGFSRVPEDQL---KRICTTIRLMTFTGTRMLGKAITPMTLNRIQRQSRDGCVCI 1257
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 37/60 (61%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D L K +F GL DGTE+GYY Q L+G K G GI C CCN E+S +FE HA C
Sbjct: 714 RDAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYEFEQHAGC 773
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 69/158 (43%), Gaps = 13/158 (8%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL T L++G V + L G+++D G+ C+C C G ++ S
Sbjct: 515 LLKAPRNAKELMATKLMEGHFVR----CSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSIS 570
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EEK 335
FE H+ S I ENGK+L +VL A + +L+A L+ A+ + E+
Sbjct: 571 AFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADCGDNILRA-LKMAIGDIQGVEKS 629
Query: 336 SFACVRCKGTFP---ITCVGKTGPGPLCNSCVKSKKPQ 370
C C G+ I C G + CV S PQ
Sbjct: 630 KVTCAECGGSEEGDLIYCKGARCSVVSHSRCVGSANPQ 667
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 184/415 (44%), Gaps = 50/415 (12%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAH 564
+D LH ++F E GLPD T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H
Sbjct: 584 RDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKH 643
Query: 565 AECASLSSIPQGDWYCKYCQNMFERKRFLQH----DANAVEAGRV-SGVDSVEQITKRCI 619
P Y + E LQ + N E + SG T R I
Sbjct: 644 VGMGRRRQ-PYRSIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKEYSTTSRPI 702
Query: 620 -----RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC----LKKHK- 669
R ++ + C +CR I+ C+QCER HV C L+K K
Sbjct: 703 IVPLKRTLQERVLTVESCYMCRKPHTVLGVISVDMIVFCNQCERALHVKCYNNGLQKPKA 762
Query: 670 -MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 728
+ L E + + CC C + + L +KK + +
Sbjct: 763 PLKVLGEYTQFNFMCCEKCQLLRASLHE---------------GLKKREDIAFLRRIRYN 807
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGG 787
+ W+LL+G + + Q + IF D F + D I +MV ++ G+ +F G
Sbjct: 808 ICWQLLNGTNMRSDVQ---HQVIEIFKDAFAET--APQDIDDIRNMVNSKDTTGEKDFRG 862
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
+YCA+LT ++ VVSA IL+V +EVAEL L+AT KGYF LL + IE L V+
Sbjct: 863 IYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVR 922
Query: 848 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CSQLVTFKGTSMLQKRV 898
+ P E IW++K G+ +LS +K LV F S++QK +
Sbjct: 923 LLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAHPLVMFANLSLVQKSL 972
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC----RVIPPSKFEIHACKQYR 291
LL GV V Y + + L G + GG C+C GC +V+ +FE HA +
Sbjct: 31 LLQGVPVTYR--FEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGVTSK 88
Query: 292 RASQYICFENGKSLLEVLRACRSVP 316
+ +I NG+SL E+ R VP
Sbjct: 89 NQNGHIFLRNGRSLYELFHKLREVP 113
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 146 bits (368), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 150/527 (28%), Positives = 227/527 (43%), Gaps = 82/527 (15%)
Query: 421 VSKSSSKSVSLRNLLKTRSP-WELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSK 479
+ +S+S +S N R P W+ R + +T ++ +S + T S
Sbjct: 394 MKESTSNGISNLNWSAFRRPRWQYKRGGT-------ATSTQTLSRSPEKGISGLSTGTSM 446
Query: 480 KTVLISKPFENASPPL--------------SFPNKSRWNITPK--DQRLHKLVFDESGLP 523
K P EN + PL S KS+ + T K D LH+LVF E G+P
Sbjct: 447 KINTEETPSENTAGPLHSEVTIVQEPPRGHSVGPKSKESRTSKVRDNSLHQLVFKEGGVP 506
Query: 524 DGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHAECASLSSIPQGDWYC 580
+ T + Y G+ L +GYK G I+C CC+ EV +PS FE HA P + Y
Sbjct: 507 ELTILTYKLKHGEVLKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGKRRQ-PYRNIYT 565
Query: 581 KYCQNMFERKRFLQH--DANAVEAGRVSGVDSVEQITKRCI---------------RIVK 623
+ E LQ ++N + G D ++ R ++
Sbjct: 566 PEGLTLHELALKLQGGLNSNGNSSANFPGGDEPPNLSSGSSRESSTTYRPSIVPLKRTLQ 625
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC----LKKHK--MADLRELP 677
+ + C LC + I+ C+QCER HV C L+K K + L E
Sbjct: 626 QIADKTESCRLCGDACTTIGTISEDMIVFCNQCERPCHVKCYNNGLQKQKGPLNVLAEYM 685
Query: 678 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK 737
+ +FCC C + + L +L + EK+ + + Y V W++L+G
Sbjct: 686 QFHFFCCQKCQLLRASLHEVL-NKREKIRQ-----KRSY------------VFWQILNGM 727
Query: 738 AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVN 796
+ + Q + IF F S G +I MV +++ G+ +F GMYCA+LT +
Sbjct: 728 NPGINVQKYIHQVIEIFKVAFPKTAASDFG--VIQDMVNAKDVGGEKDFRGMYCAVLTTS 785
Query: 797 SS-VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 855
S VVSA +L+V +EVAEL +VAT KGYF LL IE L + V+ +
Sbjct: 786 SKLVVSAAVLKVRTEEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMNVRLLTALVDP 845
Query: 856 EAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQKRV 898
E ESIW+ K GF + E LL + LV F+ +++QK +
Sbjct: 846 EMESIWSKKLGFTILSGEEKETLLEAH-----PLVMFEDLTLMQKSL 887
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/308 (31%), Positives = 143/308 (46%), Gaps = 46/308 (14%)
Query: 493 PPLSFPNKS-------RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
P L FP + + +D LH+L+F +GLPDGTE+ YY GQK+L GYK G
Sbjct: 407 PSLYFPGSATKQKKTAESGVRKRDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGN 466
Query: 546 GIICHCCNSEVSPSQFEAHAECA---------------SLSSIP------------QGDW 578
GI+C CC E+SPSQFE+HA + +L I GD
Sbjct: 467 GIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLANGQNITTGIGDD 526
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
C C ++ + + ++ G + R R+ K E + C++C
Sbjct: 527 MCAECGDGGDLMWHVWIYRILLKVLGIVQIDGGNFARPTVIRLTRVGKIPEYNVGDCVVC 586
Query: 636 R------GCDFSKSGFGP-RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
R DF+ + ++L ++EFHVGCL++ + DL E+P+ WFCC DC+
Sbjct: 587 RLNLLKFLIDFTLANSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFCCQDCN 646
Query: 689 RINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLL 746
I L+N + +K+P LN I K L + DV+W++L GK+ E L
Sbjct: 647 NIYVALRNSVSTGVQKIPASLLNIINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSL 706
Query: 747 LSQAVAIF 754
LS A AIF
Sbjct: 707 LSGAAAIF 714
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 204 ERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 260
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS CN +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 261 GCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 320
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 321 VAGSSINEEFF 331
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 47/70 (67%), Gaps = 4/70 (5%)
Query: 830 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 889
F LF+CIE+LL L V+ +VLPA AE+IWT +FGF+K+ L Y R QL FK
Sbjct: 736 FNPLFSCIERLLCSLNVEQLVLPA---AETIWTRRFGFRKMSEGQLLKY-TREFQLTIFK 791
Query: 890 GTSMLQKRVP 899
GTSML+K VP
Sbjct: 792 GTSMLEKEVP 801
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 144 bits (362), Expect = 3e-31, Method: Composition-based stats.
Identities = 74/119 (62%), Positives = 89/119 (74%), Gaps = 1/119 (0%)
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+GYFQ L+ CIE+LL F
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
L+V S+VLPAAE AES+W +KF F K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 134
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/113 (63%), Positives = 88/113 (77%)
Query: 791 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 850
I+ N++VV+AG+LRVFG+EVAELPLVAT + KGYFQLLF+CIEKLLS L V SIV
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIV 138
Query: 851 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
+PAAEE E +W +KFGF+K+ PE LS Y K C Q+V FKG SMLQK V + +I
Sbjct: 139 VPAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQI 191
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 28/35 (80%)
Query: 673 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 707
L+ELPKG WFC MDC+RINS LQ LL+ AEKL +
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSD 75
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 177/397 (44%), Gaps = 50/397 (12%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------ECAS--------LSSIPQGDWYC 580
Q LLEG GI+C CC S S FEAH+ CAS L + W
Sbjct: 17 QVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRACASIFNNKGESLLDLQVQAWEL 76
Query: 581 KYCQ-NMFERKRFLQHDAN------AVEAGRVSGVDSVEQITKRCIRIVKNL-EAEL--- 629
+ N E + D N + GR+ D ++K L E E
Sbjct: 77 LDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCP 136
Query: 630 -SGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 686
C +C G +++ G F T+LLCDQCERE+HV CL MA + P WFC
Sbjct: 137 SCRCAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDH 196
Query: 687 CSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL 746
C +I L+ L V + + E + + + L + S ++ + E R
Sbjct: 197 CDKIFEGLRKL-VGISNTIGEGLSWTLLRSGEDDLPSASSMN--------REQMAEHRSK 247
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRGQEFGGMYCAILTVNSSVVSAG 803
L+ A+ + +CF P+VD + DL+ ++Y R + F G Y +L + V+S
Sbjct: 248 LAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVA 307
Query: 804 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 863
+R+ G +AE+PL+ T + +G + L IE LL L ++S VLPA E W +
Sbjct: 308 SIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKN 367
Query: 864 KFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 896
FGF+++ P EL+ + +V+F G ++LQK
Sbjct: 368 AFGFQEMAPTQRLELVKL------SVVSFPGVTLLQK 398
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 177/397 (44%), Gaps = 50/397 (12%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------ECAS--------LSSIPQGDWYC 580
Q LLEG GI+C CC S S FEAH+ CAS L + W
Sbjct: 42 QVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRACASIFNNKGESLLDLQVQAWEL 101
Query: 581 KYCQ-NMFERKRFLQHDAN------AVEAGRVSGVDSVEQITKRCIRIVKNL-EAEL--- 629
+ N E + D N + GR+ D ++K L E E
Sbjct: 102 LDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCP 161
Query: 630 -SGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 686
C +C G +++ G F T+LLCDQCERE+HV CL MA + P WFC
Sbjct: 162 SCRCAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDH 221
Query: 687 CSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL 746
C +I L+ L V + + E + + + L + + ++ + E R
Sbjct: 222 CDKIFQGLRKL-VGISNNIGEGLSWTLLRSGEDDLPSANSMN--------REQMAEHRSK 272
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRGQEFGGMYCAILTVNSSVVSAG 803
L+ A+ + +CF P+VD + DL+ ++Y R + F G Y +L + V+S
Sbjct: 273 LAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVA 332
Query: 804 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 863
+R+ G +AE+PL+ T + +G + L IE LL L ++S VLPA E W +
Sbjct: 333 SIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKN 392
Query: 864 KFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 896
FGF+++ P EL+ + +V+F G ++LQK
Sbjct: 393 AFGFQEMAPTQRLELVKL------SVVSFPGVTLLQK 423
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 152/360 (42%), Gaps = 71/360 (19%)
Query: 548 ICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
IC CN + PS F H C L S P+G WYC C+
Sbjct: 811 ICSICNEGGEILLCDNCPSSF--HHACVGLESTPEGSWYCPSCR---------------- 852
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS--KSGFGPRTILLCDQCER 658
C +C D+ + F +TI+ CDQCER
Sbjct: 853 -------------------------------CSICDSSDYDPDTNKFTEKTIMYCDQCER 881
Query: 659 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG 718
E+HVGC++ +K L P+G WFC CS I LQ L+ + E I ++
Sbjct: 882 EYHVGCMR-NKGDQLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPTPVEGLSCTILRFDR 940
Query: 719 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 778
+ D E L A+ + H+CF I++ + RDL +V+ R
Sbjct: 941 ENASQHGDF--------YNEIIAEQYGKLCIALDVLHECFVTIIEPSTRRDLSEDIVFNR 992
Query: 779 --NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFAC 836
LR F G Y IL + ++S G RV G++ AELPL+ T +G +LL
Sbjct: 993 ESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAELPLIGTRVQYRRQGMCRLLMNE 1052
Query: 837 IEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+EKLLS L V+ +VLPA + WT FGF+ + + S +++F+GT++ QK
Sbjct: 1053 LEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAM--SFSDRFELAESSILSFQGTTICQK 1110
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 132/262 (50%), Gaps = 26/262 (9%)
Query: 620 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-------LKKHKMAD 672
R ++ + C CR D P TI C+QCER HV C +KK +
Sbjct: 63 RTLQERVVQTESCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 673 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWR 732
L+E ++ CC +C + + L+ V++ E++ L I+ ++ WR
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG--VEKGEEIA--FLRQIRS------------NICWR 166
Query: 733 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCA 791
LLS A+ + +L +SQA+ IF D F D+ S D+ MVYG+N G+ +F GMYC
Sbjct: 167 LLSKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGMYCV 224
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
+LT ++ VVSA IL+V ++ AEL L+AT KGYF+LL IE L V ++
Sbjct: 225 VLTASTHVVSAAILKVRVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSLLMA 284
Query: 852 PAAEEAESIWTDKFGFKKIDPE 873
P E IW++K GF + E
Sbjct: 285 PVDPEMAQIWSEKLGFTILSAE 306
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 161/371 (43%), Gaps = 87/371 (23%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
PS F H C L + P+GDW C C+
Sbjct: 460 PSAF--HHACVGLQATPEGDWCCPLCR--------------------------------- 484
Query: 618 CIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTILLCDQCEREFHVGCLKKHKMAD-- 672
C +C G D + GF +TI+ C+QCERE+HVGC+++ +
Sbjct: 485 --------------CGVCGGSDLDDDTAEGFTDKTIIYCEQCEREYHVGCMRRGGSEEES 530
Query: 673 ----LRELPKGK--------WFCCMDCSRINSVLQNLLVQ-EAEKLPEFHLNAI------ 713
R L + + W C +C + LQ L+ A +P + A
Sbjct: 531 AAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQALVASSRARSIPHYSRGAYHSAPCG 590
Query: 714 -KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 772
++Y + TV+ I RW+ AA L A+ + H+CFD +V+ + DL
Sbjct: 591 RRRY----MSTVTRI-TRWQHEEEDAADHGQ---LCAALDVLHECFDDMVEPRTQTDLAA 642
Query: 773 SMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 830
+V+ + LR F G Y L +++ G LRVFG +VAELPLV T + +G
Sbjct: 643 DIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRVFGNQVAELPLVGTRFAHRRQGMC 702
Query: 831 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCSQLVTFK 889
+LL +EK+L + V+ +VLPA E +WT GF + +++ + + +++FK
Sbjct: 703 RLLVTELEKMLRQVGVRRLVLPAVPELMPMWTASLGFHAMTRSDVMEMAVEHA--ILSFK 760
Query: 890 GTSMLQKRVPA 900
GT+M QK + A
Sbjct: 761 GTTMCQKTLLA 771
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 153/361 (42%), Gaps = 71/361 (19%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+ G ++C C PS + H +C L +IP+G+WYC C+
Sbjct: 685 HDGGDLLLCDNC-----PSSY--HHDCVGLEAIPEGNWYCPSCR---------------- 721
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS--KSGFGPRTILLCDQCER 658
C +C D+ S F +TI+ CDQCER
Sbjct: 722 -------------------------------CSICNLSDYDPDTSQFTEKTIVYCDQCER 750
Query: 659 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG 718
E+HVGC + L P+G WFC CS + LQ L+ + E I K+
Sbjct: 751 EYHVGCTRNSD-NQLICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEGVSWTILKFCS 809
Query: 719 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 778
+ D D + L AV I H+CF I++ + D+ +V+ R
Sbjct: 810 GNGSDHGDYD--------DEIMADHYGKLCVAVGILHECFVTIIEPRTQSDISEDIVFNR 861
Query: 779 N--LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFAC 836
LR F G Y +L +S G R+ GQ+ AELPL+ TS +G +LL
Sbjct: 862 ESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTSSPYRRQGMCRLLINE 921
Query: 837 IEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQ 895
+EKLL L V+ ++LPA E WT FGF + + + L + + +++F+GT+M Q
Sbjct: 922 LEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNSDRLEL---AGNSILSFQGTTMCQ 978
Query: 896 K 896
K
Sbjct: 979 K 979
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 135/288 (46%), Gaps = 33/288 (11%)
Query: 620 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADL 673
R ++ E C C + P TI+ C+QCER H+ C KK + L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 674 RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRL 733
+E + CC +C + + L+ ++K G + ++ WRL
Sbjct: 130 KEYMCFHFLCCQECQSLRARLEE---------------GLEKCVGITFLRRIRSNICWRL 174
Query: 734 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAI 792
LSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMYCA+
Sbjct: 175 LSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCAL 232
Query: 793 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 852
LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V + P
Sbjct: 233 LTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAP 292
Query: 853 AAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 896
E IW++K GF + E +L + LV FK ++QK
Sbjct: 293 VDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 335
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 103/173 (59%), Gaps = 5/173 (2%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGII 259
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y G GL+G+I
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYARG--RPEKGLQGVI 402
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 403 QGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADA 462
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 463 LEAAIRNATGAGDARKSTVCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 513
Score = 119 bits (297), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 73/191 (38%), Positives = 96/191 (50%), Gaps = 57/191 (29%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYY--------------ACGQKLLEGYKN 543
P+K +T KD R+HKLVF+ LPDGT + YY C + G+
Sbjct: 576 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGKVSPSQFEAHAGCASRRKPGWYW 635
Query: 544 G----LGI----------------------------ICH------CCNSEVSPSQFEAHA 565
G LG+ IC CC++ P F H
Sbjct: 636 GKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADGGDLLCCDN--CPRAF--HT 691
Query: 566 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 625
EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D++EQITKR IRIV L
Sbjct: 692 ECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQITKRSIRIVDTL 750
Query: 626 EAELSGCLLCR 636
AE+ C+LCR
Sbjct: 751 HAEVGVCVLCR 761
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 62/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P DW C C F ++ Q +A ++ VDS
Sbjct: 817 CCDG--CPSTF--HMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAEVDS 866
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
++ C QCE ++H GC +
Sbjct: 867 --------------------------------------SLCTCSQCEEKYHPGCSPETTN 888
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + L+NLL + + PEF I++ N ETV +D R
Sbjct: 889 TSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDER 948
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 949 V----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGF 998
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L V+
Sbjct: 999 YIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEK 1058
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1059 LIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1104
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 62/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P DW C C F ++ Q +A ++ VDS
Sbjct: 972 CCDG--CPSTF--HMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAEVDS 1021
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
++ C QCE ++H GC +
Sbjct: 1022 --------------------------------------SLCTCSQCEEKYHPGCSPETTN 1043
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + L+NLL + + PEF I++ N ETV +D R
Sbjct: 1044 TSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDER 1103
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 1104 V----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGF 1153
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L V+
Sbjct: 1154 YIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEK 1213
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1214 LIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1259
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 148/348 (42%), Gaps = 63/348 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L +P DW C C + L HDA D+
Sbjct: 1043 CCDG--CPSTF--HMSCLELEELPSDDWRCTNCSCKLCHE-HLNHDA----------PDN 1087
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
E P + C QCE+++H C + +
Sbjct: 1088 AE--------------------------------IDP--LHSCSQCEKKYHPSCSPETEK 1113
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
G FC C + LQNLL E + PE+ I+ ++ ETV D+D R
Sbjct: 1114 LSSVSSQAGNHFCQQSCRLLFEELQNLLAVEKDLGPEYACRIIQCIHEDAPETVLDLDGR 1173
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 1174 V----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFLRLDFRGF 1223
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y IL +VSA +R+ G ++AE+P + T + +G + L IE +LS L+V+
Sbjct: 1224 YIFILERGDEIVSAASVRIHGTKLAEMPFIGTRHMYRRQGMCRRLLDGIEMILSSLKVEK 1283
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF + E+ + ++ F GT +LQK
Sbjct: 1284 LIIPAINELVDTWTSKFGFSPL--EVSDKQEVKSINMLVFPGTGLLQK 1329
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 62/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P DW C C F ++ Q +A ++ VDS
Sbjct: 953 CCDG--CPSTF--HMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAEVDS 1002
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
++ C QCE ++H GC +
Sbjct: 1003 --------------------------------------SLCTCSQCEEKYHPGCSPETTN 1024
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + L+NLL + + PEF I++ N ETV +D R
Sbjct: 1025 TSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDER 1084
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 1085 V----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFRGF 1134
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L V+
Sbjct: 1135 YIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEK 1194
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1195 LIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1240
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 62/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P DW C C F ++ Q +A ++ VDS
Sbjct: 425 CCDG--CPSTF--HMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAEVDS 474
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
++ C QCE ++H GC +
Sbjct: 475 --------------------------------------SLCTCSQCEEKYHPGCSPETTN 496
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + L+NLL + + PEF I++ N ETV +D R
Sbjct: 497 TSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDER 556
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 557 ----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGF 606
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L V+
Sbjct: 607 YIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEK 666
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 667 LIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 712
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 154/601 (25%), Positives = 235/601 (39%), Gaps = 85/601 (14%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPP 279
L+ V L TGLL+G V YM K + + G I G C CS CN ++
Sbjct: 62 LDSDLRDVRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNA 117
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
+FE H + + +I + G SL V+ A + L ML ++ + P +
Sbjct: 118 CEFEEHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN- 176
Query: 340 VRCKGTFP-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
+ K +F + V G C + G M Y+ ++ S VS S
Sbjct: 177 -KWKASFQKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSIS 227
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP- 450
N + S + + R ++ S S K S K + + N+ P
Sbjct: 228 NLNWSASKRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPL 287
Query: 451 ---GLIANS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKS 501
G+ ++S T VT+ H S + + +S P + PL S +KS
Sbjct: 288 SIDGVKSDSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKS 338
Query: 502 RWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
+ + T +D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS
Sbjct: 339 KESKT-RDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSH 397
Query: 561 FEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQH--DANAVEAGRVSGVDSVEQITKRC 618
FE HA P + Y ++ + LQ + N + VS V +T
Sbjct: 398 FEEHAGMGRRRQ-PYHNIYTLEGLSLHKLALQLQDHLNPNGFDNASVSSVSDYHNLTSSG 456
Query: 619 I---------------RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
R ++ E C C + P TI+ C+QCER H+
Sbjct: 457 CGREPSTTSGPIVPLKRTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIK 516
Query: 664 CL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 717
C KK + L+E + CC +C + + L+ ++K
Sbjct: 517 CYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE---------------EGLEKCV 561
Query: 718 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 777
G + ++ WRLLSG A+ + +L + Q + IF D F D S D+I MV G
Sbjct: 562 GITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNG 619
Query: 778 R 778
+
Sbjct: 620 K 620
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 126 bits (316), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 149/348 (42%), Gaps = 62/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P DW C C F ++ Q +A ++ VDS
Sbjct: 288 CCDG--CPSTF--HMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAEVDS 337
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
++ C QCE ++H GC +
Sbjct: 338 --------------------------------------SLCTCSQCEEKYHPGCSPETTN 359
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + L+NLL + + PEF I++ N ETV +D R
Sbjct: 360 TSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVALDER 419
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 420 ----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGF 469
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L V+
Sbjct: 470 YIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEK 529
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 530 LIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 575
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/431 (26%), Positives = 180/431 (41%), Gaps = 94/431 (21%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERK--RFLQ 594
+LEG+ GI C CC+ ++ S+FE HA G + QN+F LQ
Sbjct: 607 MLEGWITRDGIHCGCCSKILAVSKFEIHA----------GSKLRQPFQNIFLNSGVSLLQ 656
Query: 595 HDANAVE----AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 650
+A + AG + G SV+ I + C +C G G +
Sbjct: 657 CQIDAWDKQKGAGNI-GFCSVDVIAD---------DPNDDACGIC--------GDGG-DL 697
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC-SRINSVLQNLLVQEAEK- 704
+ CD C FH CL D+R P G W C C C + I V Q + +
Sbjct: 698 VCCDGCPSTFHQRCL------DIRMFPLGDWHCPNCTCKFCKAVIEDVTQTVGANTCKMC 751
Query: 705 --------LPEFHLN---------------------AIKKYAGNSLETVSDIDVRWRL-- 733
+P+ ++ +KKY G E + W L
Sbjct: 752 EKKYHKSCMPKANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKHEL--EAGFSWSLVH 809
Query: 734 ---------LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 782
LSG E L+ A+ + +CF PI+D SG +++ +++Y G N
Sbjct: 810 RECTNSDLSLSGHPHIVENNSKLALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNR 869
Query: 783 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 842
FGG Y A+L +V++ +R G +AE+P + T + +G + LF+ +E L
Sbjct: 870 LNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESALQ 929
Query: 843 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
L+VK +++PA + +W KFGF++++ L R L+TF G +LQK + A R
Sbjct: 930 HLKVKLLIIPATADFSHVWISKFGFRQVEDSLKK--EMRSMNLLTFPGIDVLQKELLAPR 987
Query: 903 IGSSSTDSTEC 913
S+ D T+C
Sbjct: 988 HTESAVD-TDC 997
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 130/283 (45%), Gaps = 19/283 (6%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C +C F + C QCER++HVGCL+K L P G WFC C +I
Sbjct: 710 CGICGENKFDGGSEQDNVVFSCYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIF 769
Query: 692 SVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQA 750
LQ LL + + +K LE ID L A E L+ A
Sbjct: 770 LGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLE----ID-----LPDIEALTEVYSKLNIA 820
Query: 751 VAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVF 808
+ + H+CF+P+ + + RD++ +++ G +L F G Y +L N ++S +RV+
Sbjct: 821 LGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVY 880
Query: 809 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 868
G++VAE+PL+ T G +L +EK L L V+ +VLPA + WT FGF
Sbjct: 881 GEKVAEVPLIGTRFQYRRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFS 940
Query: 869 KI-DPELLSIYRKRCSQLVTFKGTSMLQK---RVPACRIGSSS 907
K+ D E L + F+ T M QK ++P + S+
Sbjct: 941 KMTDSERLRFLD---YSFLDFQDTVMCQKLLMKIPLAKSNQST 980
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 150/348 (43%), Gaps = 60/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+S PS F H C + P G W+C YC + +V G
Sbjct: 654 CCDS--CPSTF--HQSCLDIKKFPSGPWHCLYC------------------SCKVCG--- 688
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
Q+T + + EA + +LC+ CD CE ++H C++ +
Sbjct: 689 --QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQMNNA 728
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
+ + FC C ++ LQ LL + F I++ SD+D
Sbjct: 729 SG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDVDSD 777
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
L + A + L+ A+ + +CF P++D SG +LI +++Y G N F G
Sbjct: 778 VSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNFSGF 837
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y AIL + V+ A LR+ G E+AE+P + T + +G + + IE +LS L V+
Sbjct: 838 YTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIESVLSSLNVEK 897
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+V+PA E W FGFK +D + R R L+ F G MLQK
Sbjct: 898 LVIPAISEVRDTWISVFGFKPLDE--TTKQRMRKMSLLVFPGVEMLQK 943
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 167/383 (43%), Gaps = 91/383 (23%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 605
GI+C+CC + S FEAHA C P + +++F+ +R
Sbjct: 418 GIVCNCCRVNFTVSGFEAHAGCTRHR--PSISILLEDGRSLFKCQR-------------- 461
Query: 606 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 665
++ +Q CI + + C +C GFG ++LCD+C FH+GCL
Sbjct: 462 ---EARDQKGSHCIGEANSEANNDNVCSIC--------GFGG-DLVLCDRCPSAFHLGCL 509
Query: 666 KKHKMADLRELPKGKWFCCMDCSRI-----------NSVLQNLLV-QEAEKLPEFHLNAI 713
L +P G WFC C +I + N LV + E+ +FH +
Sbjct: 510 ------GLDRVPDGDWFCPTCCCKICYRPKCKQECADGNENNFLVCVQCEQ--KFHFGCV 561
Query: 714 KKYAGNSLETVSDI------------------------------DVRWRLLSGKAAT--- 740
K S T S+I ++ W LL ++
Sbjct: 562 KTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINWTLLKNVSSDDDG 621
Query: 741 --------PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE--FGGMYC 790
+ + L+ A+ + ++ F+P +D++SGR+LI +V+ R+ + F G Y
Sbjct: 622 GDFTSNEFSQEKHKLNAALGVLYEGFNPTIDALSGRELIKDLVFSRDSEHKRLNFRGFYT 681
Query: 791 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 850
IL V+S +R+FGQ+VAE+ VAT + + G+G +LL +E+ L+ L V +V
Sbjct: 682 VILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMCRLLMDELEEQLTRLGVGRLV 741
Query: 851 LPAAEEAESIWTDKFGFKKIDPE 873
L ++E+A + WT FGF ++ E
Sbjct: 742 LHSSEDAINTWTKSFGFARMTSE 764
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 149/359 (41%), Gaps = 70/359 (19%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+ G I+C C PS F H C L IP GDW+C C +R
Sbjct: 744 HYGGELILCDKC-----PSSF--HKTCLGLEDIPNGDWFCPSCCCGICGQR--------- 787
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
++ G D V Q+ L C QCE ++
Sbjct: 788 ---KIDGDDEVGQL------------------------------------LPCIQCEHKY 808
Query: 661 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGN 719
HV CL+ + AD+ G WFC DC +I L LL + + +K +
Sbjct: 809 HVRCLE-NGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWTLVKFINPD 867
Query: 720 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR- 778
S E S S E+ L+ A+++ H+CF+P+ +S++ RDL+ +++ R
Sbjct: 868 SCEHDS---------SKSDLLAESYSKLNLAISVMHECFEPLKESLTNRDLVEDVIFSRW 918
Query: 779 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
L F G Y +L N ++S +RV+G++VAE+PLV T +G +L +
Sbjct: 919 SELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPLVGTRLQYRRRGMCHILIEEL 978
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
EK L L V+ +VLPA WT FGF K+ S + + F+G M QK
Sbjct: 979 EKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQFLDY--TFLDFQGAIMCQK 1035
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 150/348 (43%), Gaps = 60/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+S PS F H C + P G W+C YC + +V G
Sbjct: 681 CCDS--CPSTF--HQSCLDIKKFPSGPWHCLYC------------------SCKVCG--- 715
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
Q+T + + EA + +LC+ CD CE ++H C++ +
Sbjct: 716 --QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQMNNA 755
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
+ + FC C ++ LQ LL + F I++ SD+D
Sbjct: 756 SG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDVDSD 804
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
L + A + L+ A+ + +CF P++D SG +LI +++Y G N F G
Sbjct: 805 VSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNFSGF 864
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y AIL + V+ A LR+ G E+AE+P + T + +G + + IE +LS L V+
Sbjct: 865 YTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIESVLSSLNVEK 924
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+V+PA E W FGFK +D + R R L+ F G MLQK
Sbjct: 925 LVIPAISEVRDTWISVFGFKPLDE--TTKQRMRKMSLLVFPGVEMLQK 970
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 153/365 (41%), Gaps = 63/365 (17%)
Query: 558 PSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
PS F H C L +P+GDW+C C R+ G + ++ ++
Sbjct: 370 PSCF--HQSCLGLKELPEGDWFCPSC------------------CCRICGENRFDEYSEE 409
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
N + FS C QCE ++HVGCL+K + L P
Sbjct: 410 -----DNFK-------------FS-----------CHQCELQYHVGCLRKQRHVKLETYP 440
Query: 678 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK 737
G FC C +I LL + +P N ++ D+DV
Sbjct: 441 DGTRFCSTQCEKI---FLGLLKLLGKPIPVGVDNLTWTLLKPTISEWFDMDV-----PDN 492
Query: 738 AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTV 795
A E L+ A+ + H+CF+PI + +GRDL+ +++ G +L+ F G Y +L
Sbjct: 493 KALTEVYSKLNIALNVMHECFEPIKEPHTGRDLVEDVIFCRGSDLKRLNFRGFYIVLLER 552
Query: 796 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 855
N ++S +RV G++VAE+PLV T G ++L IEK L L V+ + LPAA
Sbjct: 553 NDELISVATIRVHGEKVAEVPLVGTRSQYRRLGMCRILINEIEKKLVELGVERLTLPAAP 612
Query: 856 EAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTECV 914
W FGF K+ D E L+ + F+ T M QK + SS + C
Sbjct: 613 SVLDTWVTSFGFSKMTDSERLTFLD---YTFLDFQDTVMCQKLLMKIPSTKSSQSTVNCA 669
Query: 915 SGVEV 919
+ + +
Sbjct: 670 ASLWI 674
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 111/225 (49%), Gaps = 15/225 (6%)
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
F ++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 211 FADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILV 266
Query: 705 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 764
IK + + V D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 267 GHNLTCTLIKPMQYQAEDRV-DYDL--------AAMAENYSKLSVALEVMHECFDPVKDP 317
Query: 765 ISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 822
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 318 KTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRF 377
Query: 823 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
+ G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 378 QHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 67/170 (39%), Gaps = 45/170 (26%)
Query: 530 YYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNM 586
YY G L EG + GI C+CC S S F+AH G+ C+ +N+
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHV---------TGNNICRPAENL 99
Query: 587 F--------------ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
F RK+ ++ + V R +G S + R++ L +E
Sbjct: 100 FLGNGKSLVSCQVELMRKKIMRFNQEPVV--RATGTGSRSK-----FRLLAPLGSENCND 152
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
+C C + ++ CD+C FH CL ++ +P+G WF
Sbjct: 153 YVCSICHYGGD------LICCDRCPSSFHATCL------NIERVPEGDWF 190
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 110/225 (48%), Gaps = 15/225 (6%)
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
F ++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 211 FADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILV 266
Query: 705 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 764
IK + E D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 267 GQNLTCTLIKPMQYQA-EDREDYDL--------AAMAENYSKLSVALEVMHECFDPVKDP 317
Query: 765 ISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 822
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 318 KTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRF 377
Query: 823 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
+ G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 378 QHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 151/384 (39%), Gaps = 86/384 (22%)
Query: 548 ICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+C CN + PS F H C L + P+GDW+C C+
Sbjct: 506 VCSICNDGGDLLLCDNCPSAF--HHACVGLQATPEGDWFCPSCR---------------- 547
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS-----GFGPRTILLCDQ 655
C +C G DF + GF +TI+ CDQ
Sbjct: 548 -------------------------------CGVCGGSDFDATAAGGGGFTDKTIIYCDQ 576
Query: 656 CEREFHVGCLKKHKMADL-----------RELPKGKWFCCMDCSRINSVLQNLLVQEAEK 704
CERE+HVGC+++ + E + W C +C + LQ L E+
Sbjct: 577 CEREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEVFRHLQGLAAVARER 636
Query: 705 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LSQAVAIFH 755
G SL + R + + + L A+ + H
Sbjct: 637 SIPIPTTVPTTVEGVSLSILRRRRRRPISMVATGSGCQEEEEEEDAAEHGQLCSALDVLH 696
Query: 756 DCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 813
+CF +++ + DL +V+ R LR F G Y L +++ G LRV G EVA
Sbjct: 697 ECFVTLIEPRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGELITVGTLRVLGTEVA 756
Query: 814 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-P 872
ELPLV T + +G LL +EK+L + V+ +VLPA E +WT GF +
Sbjct: 757 ELPLVGTRFAHRRQGMCHLLVTELEKVLRQVGVRRLVLPAVPELLPMWTASLGFHPMTRS 816
Query: 873 ELLSIYRKRCSQLVTFKGTSMLQK 896
+++ I + +++F+GT+M K
Sbjct: 817 DVMEIAAEH--AILSFQGTTMCHK 838
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/251 (31%), Positives = 124/251 (49%), Gaps = 20/251 (7%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
+L CDQCER++H+GCL+ + L R+ PK WFC C I LQ LL + P+
Sbjct: 686 LLSCDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVGPDN 745
Query: 709 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 768
+ K+ + D++ +GK + L AV + H+CF+P ++ +GR
Sbjct: 746 LTWTLWKFMDSD---SCDVEAP----TGKHSK------LDLAVEVIHECFEPATETYTGR 792
Query: 769 DLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
D+ +++ R NL F G Y +L N +++ +RVFG +VAE+PLV T +
Sbjct: 793 DIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGTRFLFRR 852
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQL 885
G ++L +EK L L V+ ++LPA W + FGF K+ D E +
Sbjct: 853 LGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYLD---HTF 909
Query: 886 VTFKGTSMLQK 896
+ F GT QK
Sbjct: 910 LDFPGTIKCQK 920
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 173/395 (43%), Gaps = 53/395 (13%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFE---RKRFL 593
+LEG+ GI C CC+ ++ S+FE HA S + P + Y + ++ + R +
Sbjct: 530 MLEGWITREGIHCDCCSKILTVSRFEIHA--GSKACQPFQNIYLESGASLLQCQVRAWNM 587
Query: 594 QHDANAVEAGRV------------------------SGVDSVEQITKRCIRIVKNLEAEL 629
Q DA V +V G S T ++++ + +
Sbjct: 588 QKDATNVGLHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHC 647
Query: 630 SGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
C C+ CD + + G ++L C CER +H CL ++ FC
Sbjct: 648 PNCT-CKFCDAAVASGGKDGNFLSLLSCSMCERRYHQLCLSDEAQK-VQSFGSASSFCGP 705
Query: 686 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRL 745
C + LQ L + E + + I + ++T SDI+ + LS + ++L
Sbjct: 706 KCLELFEKLQKYLGVKNEIEGGYSWSLIHR-----VDTDSDINSQ---LSAQRIENNSKL 757
Query: 746 LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAG 803
+ +AI +CF PIVD SG +LI +++Y G N + G Y AIL ++SA
Sbjct: 758 AV--GLAIMDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAA 815
Query: 804 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 863
LR G ++AE+P + T I +G + LF IE + L+V+ +V+PA + WT
Sbjct: 816 SLRFHGTQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTG 875
Query: 864 KFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 896
FGF +D + RK L T F G MLQK
Sbjct: 876 NFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 906
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 145/545 (26%), Positives = 229/545 (42%), Gaps = 80/545 (14%)
Query: 393 SENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPW-ELTRNSSRPG 451
SEN S +Y + ER ++ L KS+ K + + W EL + S R
Sbjct: 418 SENKSEGGAYTDTSEERIRS----SIKLGGKSTKKG-------RNGADWDELHKKSKRSL 466
Query: 452 LIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N+ P S S RK K + T+L+ + +P ++ N + K
Sbjct: 467 YYNNARP--SCGSDSHYLHGRKTKKIGRCTLLVRSSKDKKNPAINGFNP----YSGKRTL 520
Query: 512 LHKLVFDESGLPDGTEVGYYA---CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECA 568
L L+ ESG+ + Y + +LEG+ GI C CC+ ++ S+FE HA
Sbjct: 521 LSWLI--ESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHA--G 576
Query: 569 SLSSIPQGDWYCKYCQNMFE---RKRFLQHDANAVEAGRV----------------SGVD 609
S S P + Y + ++ + R +Q DA + +V G D
Sbjct: 577 SKSCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGD 636
Query: 610 SV------EQITKRCI--RIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCE 657
+ + C+ +++ + + C C+ CD + + G ++L C CE
Sbjct: 637 LICCDGCPSTYHQNCLGMQVLPSGDWHCPNCT-CKFCDAAVASGGKDGNFISLLSCGMCE 695
Query: 658 REFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK 715
R +H CL + HK ++ FC C + LQ L + E + + I +
Sbjct: 696 RRYHQLCLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR 752
Query: 716 YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 775
++T SD + + A E L+ +AI +CF PIVD SG DLI +++
Sbjct: 753 -----VDTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVL 802
Query: 776 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 833
Y G N + G Y AIL ++SA LR G ++AE+P + T I +G + L
Sbjct: 803 YNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRL 862
Query: 834 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGT 891
F IE + L+V+ +V+PA + WT FGF +D + RK L T F G
Sbjct: 863 FDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGI 918
Query: 892 SMLQK 896
MLQK
Sbjct: 919 DMLQK 923
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 85/380 (22%)
Query: 537 LLEGYKNGLGIICHCCNS-------EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFER 589
L EG + IC CN + PS + H C +L IP GDW+C C+
Sbjct: 779 LFEGENDN---ICSVCNYGGELILCDQCPSAY--HKNCLNLEGIPDGDWFCPSCR----- 828
Query: 590 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 649
C +C ++ G
Sbjct: 829 ------------------------------------------CGICGQNKIEETEDG--H 844
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
L C QCE ++HV CL+ + D R K WFC +C R+ + LQNLL + +
Sbjct: 845 FLTCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLGKPVLVGADNL 903
Query: 710 LNAIKKYAGNSLETV----SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 765
+ KY + V SD+ V E LS A+++ H+CF+P+ +
Sbjct: 904 TWTLVKYVNSETCGVGGAESDLVV------------ENYSKLSVALSVMHECFEPLHNPF 951
Query: 766 SGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
S RD++ +++ + L F G Y +L N ++S +R+FG+++AE+PLV T
Sbjct: 952 SSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRFQ 1011
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
G ++L +EK L L V+ +VLPA WT+ FGF+++ S +
Sbjct: 1012 YRRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFLD--Y 1069
Query: 884 QLVTFKGTSMLQK---RVPA 900
+ F+GT M QK R P+
Sbjct: 1070 SFLDFQGTVMCQKLLTRFPS 1089
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 174/397 (43%), Gaps = 57/397 (14%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFE---RKRFL 593
+LEG+ GI C CC+ ++ S+FE HA S S P + Y + ++ + R +
Sbjct: 547 MLEGWITREGIHCDCCSKILTVSRFEIHA--GSKSCQPFQNIYLESGASLLQCQVRAWNM 604
Query: 594 QHDANAVEAGRV----------------SGVDSV------EQITKRCI--RIVKNLEAEL 629
Q DA + +V G D + + C+ +++ + +
Sbjct: 605 QKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHC 664
Query: 630 SGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFC 683
C C+ CD + + G ++L C CER +H CL + HK ++ FC
Sbjct: 665 PNCT-CKFCDAAVASGGKDGNSISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFC 720
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPET 743
C + LQ L + E + + I + ++T SD + + A E
Sbjct: 721 GPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIEN 770
Query: 744 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 801
L+ +AI +CF PIVD SG DLI +++Y G N + G Y AIL ++S
Sbjct: 771 NSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIIS 830
Query: 802 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIW 861
A LR G ++AE+P + T I +G + LF IE + L+V+ +V+PA + W
Sbjct: 831 AASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAW 890
Query: 862 TDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 896
T FGF +D + RK L T F G MLQK
Sbjct: 891 TGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 923
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 146/348 (41%), Gaps = 63/348 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L +P DW C C +F Q +N +A ++ VDS
Sbjct: 989 CCDG--CPSTF--HMSCLELEELPSDDWRCANCCC-----KFCQEHSND-DAPDIAEVDS 1038
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
LC C QCE +H C + +
Sbjct: 1039 -----------------------LC----------------TCSQCEENYHPVCSPETEN 1059
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
G FC C + LQNLL + + PEF IK + ET +D R
Sbjct: 1060 PSSVPSQAGDLFCQQSCRLLFEELQNLLAVKKDLEPEFACRIIKCIHEDVPETALALDER 1119
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
E ++ A+++ +CF PI+D +G +LI ++VY G N +F G
Sbjct: 1120 V----------ECNSKIAVALSLMDECFLPIIDQRTGINLIRNVVYNCGSNFLRLDFRGF 1169
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y IL +VSA +R+ G + AE+P + T + +G + L IE +LS L+V+
Sbjct: 1170 YIFILERGDEIVSAASVRIHGTKCAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLKVQK 1229
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF + E+ + ++ F GT +LQK
Sbjct: 1230 LIIPAISELVDTWTSKFGFSPL--EVSEKQEVKSISMLVFPGTGLLQK 1275
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/349 (28%), Positives = 147/349 (42%), Gaps = 63/349 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C + P GDW+C YC F G SG +
Sbjct: 263 CCDG--CPSTF--HQSCLDIQKFPSGDWHCIYCSCKF--------------CGMFSG--N 302
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK-KHK 669
+Q+ NL+ S +L C CE ++H C + +
Sbjct: 303 TDQMN-------YNLDVNDSA------------------LLTCQLCEEKYHHMCTQGEDS 337
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 729
+ D P FC C + LQ LL + E F +++ E DI
Sbjct: 338 ILDDSSSPS---FCGKTCRELFEQLQMLLGVKHELEDGFSWTLVQR-----TEVGFDIS- 388
Query: 730 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 787
L+G E L+ A++I +CF PIVD SG +LI +++Y G N + G
Sbjct: 389 ----LNGIPQKVECNSKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLNYSG 444
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
+ AIL ++SA +R+ G ++AE+P + T I +G + L IE L L V+
Sbjct: 445 FFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESALHSLNVE 504
Query: 848 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+V+PA E WT FGFK + E+ S R ++ F GT MLQK
Sbjct: 505 KLVIPAISELMQTWTSVFGFKPL--EVSSRKEMRNMNMLVFHGTDMLQK 551
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 152/350 (43%), Gaps = 67/350 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S +P+G+WYC C
Sbjct: 1045 CCDN--CPSTF--HQACLSAKELPEGNWYCPNC--------------------------- 1073
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1074 ---TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKEKC- 1112
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
++E+ FC +C I S LQ LL F + + L + D D +
Sbjct: 1113 --VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD-DQK 1161
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGM 788
A E L+ A+ I +CF +VD +G D+IP ++Y R + F G
Sbjct: 1162 VHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNFNGF 1221
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++V+
Sbjct: 1222 YTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEK 1281
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1282 IVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1329
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 152/350 (43%), Gaps = 67/350 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S +P+G+WYC C
Sbjct: 884 CCDN--CPSTF--HQACLSAKELPEGNWYCPNC--------------------------- 912
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 913 ---TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKEKC- 951
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
++E+ FC +C I S LQ LL F + + L + D D +
Sbjct: 952 --VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD-DQK 1000
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGM 788
A E L+ A+ I +CF +VD +G D+IP ++Y R + F G
Sbjct: 1001 VHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNFNGF 1060
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++V+
Sbjct: 1061 YTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEK 1120
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1121 IVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1168
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 152/350 (43%), Gaps = 67/350 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S +P+G+WYC C
Sbjct: 981 CCDN--CPSTF--HQACLSAKELPEGNWYCPNC--------------------------- 1009
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1010 ---TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKEKC- 1048
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
++E+ FC +C I S LQ LL F + + L + D D +
Sbjct: 1049 --VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD-DQK 1097
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGM 788
A E L+ A+ I +CF +VD +G D+IP ++Y R + F G
Sbjct: 1098 VHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNFNGF 1157
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++V+
Sbjct: 1158 YTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEK 1217
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1218 IVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1265
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 49/397 (12%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---ECASLSSIPQGD----WYCK----YC 583
+ +L+G GI C CC + S FEAHA C L +I + +C+ +
Sbjct: 7 RTVLKGKITTDGIQCDCCGETFAISDFEAHAGSKSCQPLKNIFLENGPSLLHCQLESWHR 66
Query: 584 QNMFERKRFLQHDANAVE-----------AGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
Q+ +RK F D + + G + DS + +K L + + C
Sbjct: 67 QDESDRKGFHFVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNC 126
Query: 633 LLCR----------GCDFSKSGFGPRTILL-CDQCEREFHVGCLKKHKMADLRELPKGKW 681
C C ++ R LL C CE ++H C+ + +
Sbjct: 127 TYCSCKFCGMAGGDACQMDENDAAARPALLTCCLCEEKYHHSCIPAEDT--INDYHSSLS 184
Query: 682 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 741
FC C ++ LQ LL + E F ++++ + SDI LSG
Sbjct: 185 FCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRF-----DVGSDI-----TLSGMHRKV 234
Query: 742 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSV 799
E ++ A+ I +CF P+ D SG +LI ++VY G N + G AIL V
Sbjct: 235 ECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNYCGFLTAILERGDEV 294
Query: 800 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 859
+SA +R+ G ++AE+P + T + +G + L IE L L V+ +V+PA E
Sbjct: 295 ISAASIRIHGNQLAEMPFIGTRHMYRRQGMCRRLLGAIETALCSLNVEKLVIPAISELRE 354
Query: 860 IWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
WT FGFK++ E LS + R ++V F G MLQK
Sbjct: 355 TWTSVFGFKQL--EGLSKQKMRYMKMVAFPGVDMLQK 389
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 143/351 (40%), Gaps = 70/351 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ P+ + H C S IP G+WYC C
Sbjct: 997 CCDN--CPASY--HQACLSCQDIPDGNWYCSSCL-------------------------- 1026
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
C +C SK L C QCER++HV C+
Sbjct: 1027 ---------------------CDICGEVIDSKELVTSLPALDCSQCERQYHVKCVSAK-- 1063
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
E G WFC C I ++ + + ++ + T +I
Sbjct: 1064 VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI--- 1120
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-RNLRGQEFGGMY 789
A E + L A +I +CF PI+D +G D+IPS++Y R+ + G Y
Sbjct: 1121 -------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYKGFY 1173
Query: 790 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 849
+L + S+VS +R+ G +AE+PLVATSK N +G + L IE++L L+V+ +
Sbjct: 1174 TVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSLKVEML 1233
Query: 850 VLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 898
+L A WT FGF++ID +KR S+ L GT +L+K +
Sbjct: 1234 LLSAIPHLAETWTSTFGFREIDES----DKKRLSKVRLAAVPGTVLLKKDL 1280
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 123/251 (49%), Gaps = 16/251 (6%)
Query: 649 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ C QCE ++H C ++ G FC C + LQNLL + + PE+
Sbjct: 1066 SLHTCSQCEEQYHEACSPENDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEY 1125
Query: 709 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 768
+++ + E V +D R E ++ A+++ +CF PIVD +G
Sbjct: 1126 SCRVVQRIHEDVPEEVLPLDTRV----------ECNSKIAVALSLMDECFLPIVDQRTGI 1175
Query: 769 DLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
+LI ++VY G N +F G Y IL +++A +R+ G ++AE+P + T +
Sbjct: 1176 NLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRR 1235
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQL 885
+G + L IE +LS L V+ +++PA E WT +FGF + D E + K S L
Sbjct: 1236 QGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSEKEEV--KSISML 1293
Query: 886 VTFKGTSMLQK 896
V F GT +LQK
Sbjct: 1294 V-FPGTGLLQK 1303
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 171/397 (43%), Gaps = 57/397 (14%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFE---RKRFL 593
+LEG+ GI C CC+ ++ S+FE HA S S P + Y + ++ + R +
Sbjct: 9 MLEGWITREGIHCDCCSKILTVSRFEIHA--GSKSCQPFQNIYLESGASLLQCQVRAWNM 66
Query: 594 QHDANAVEAGRV------------------------SGVDSVEQITKRCIRIVKNLEAEL 629
Q DA + +V G S ++++ + +
Sbjct: 67 QKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHC 126
Query: 630 SGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFC 683
C C+ CD + + G ++L C CER +H CL + HK ++ FC
Sbjct: 127 PNCT-CKFCDAAVASGGKDGNFISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFC 182
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPET 743
C + LQ L + E + + I + ++T SD + + A E
Sbjct: 183 GPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIEN 232
Query: 744 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 801
L+ +AI +CF PIVD SG DLI +++Y G N + G Y AIL ++S
Sbjct: 233 NSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIIS 292
Query: 802 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIW 861
A LR G ++AE+P + T I +G + LF IE + L+V+ +V+PA + W
Sbjct: 293 AASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAW 352
Query: 862 TDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 896
T FGF +D + RK L T F G MLQK
Sbjct: 353 TGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 385
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 171/397 (43%), Gaps = 57/397 (14%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFE---RKRFL 593
+LEG+ GI C CC+ ++ S+FE HA S S P + Y + ++ + R +
Sbjct: 29 MLEGWITREGIHCDCCSKILTVSRFEIHA--GSKSCQPFQNIYLESGASLLQCQVRAWNM 86
Query: 594 QHDANAVEAGRV------------------------SGVDSVEQITKRCIRIVKNLEAEL 629
Q DA + +V G S ++++ + +
Sbjct: 87 QKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHC 146
Query: 630 SGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFC 683
C C+ CD + + G ++L C CER +H CL + HK ++ FC
Sbjct: 147 PNCT-CKFCDAAVASGGKDGNFISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFC 202
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPET 743
C + LQ L + E + + I + ++T SD + + A E
Sbjct: 203 GPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIEN 252
Query: 744 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 801
L+ +AI +CF PIVD SG DLI +++Y G N + G Y AIL ++S
Sbjct: 253 NSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIIS 312
Query: 802 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIW 861
A LR G ++AE+P + T I +G + LF IE + L+V+ +V+PA + W
Sbjct: 313 AASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAW 372
Query: 862 TDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 896
T FGF +D + RK L T F G MLQK
Sbjct: 373 TGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 405
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 149/369 (40%), Gaps = 78/369 (21%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ P+ + H C IP G+WYC C
Sbjct: 986 CCDN--CPASY--HQACLPCQDIPDGNWYCSSCL-------------------------- 1015
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
C +C SK L C QCER++HV C+
Sbjct: 1016 ---------------------CNICGEVITSKELRTSLPALECSQCERQYHVKCVSAK-- 1052
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV---SDI 727
E G WFC C +I + ++ + + + N L ++
Sbjct: 1053 VSCNEDGPGTWFCGRKCQQIYMIFRSRV-------------GVPDHVDNDLSCTILRNNG 1099
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG- 786
D + R A E + L A++I +CF PI+D +G D+IPS++Y F
Sbjct: 1100 DKKVRTAGEIALMAECNMKLMIALSIMEECFLPILDPRTGIDIIPSILYNWRSDFIHFNH 1159
Query: 787 -GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G Y +L + S+VS +R+ G VAE+PLVATS N +G + L IE++L L+
Sbjct: 1160 KGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVATSTENRQQGMCRRLMDYIEEMLKSLK 1219
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPACRI 903
V+ ++L A WT FGF++ID +KR S +L GT +L+K + C
Sbjct: 1220 VEMLLLSAIPHLVETWTSTFGFREIDDS----DKKRLSMVRLAAVPGTVLLKKNLCECS- 1274
Query: 904 GSSSTDSTE 912
G TD E
Sbjct: 1275 GVEDTDVAE 1283
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 152/363 (41%), Gaps = 100/363 (27%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H +C L ++P+GDW+C C
Sbjct: 142 CCDH--CPSTF--HLKCLRLENVPEGDWFCPRC--------------------------- 170
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT---ILLCDQCEREFHVGCLKK 667
C C +S + P IL CDQCERE+H C+
Sbjct: 171 -----------------------CCASC--GRSLYDPTIQTEILYCDQCEREYHSNCVPG 205
Query: 668 HKMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKY----AGNSL 721
M + FC C +I L+ L+ V + + + + L + Y + L
Sbjct: 206 SAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSEENSKL 262
Query: 722 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 781
E+V+D++ R L+ A+ + +CF P++D S D++ ++Y R R
Sbjct: 263 ESVADLNTR----------------LALALTVIQECFRPMIDPRSNIDMVSHILYNR--R 304
Query: 782 GQE----FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
G++ F G Y +L ++S +RV G AE+P + T +G + L I
Sbjct: 305 GEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVI 364
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSM 893
+++L L V+++VLPA E WT FGF+K+ +L+ + +VTF G+S+
Sbjct: 365 QQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQGIQLMEL------NIVTFPGSSV 418
Query: 894 LQK 896
LQK
Sbjct: 419 LQK 421
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 137/336 (40%), Gaps = 71/336 (21%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+ G I+C C PS F H C L +P GDW+C C
Sbjct: 437 HYGGELILCDQC-----PSSF--HKSCLGLMDVPDGDWFCSSC----------------- 472
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
C +I G L R D S G +L C QCER++
Sbjct: 473 -----------------CCKIC--------GQCLKRDSDLSMEDDG---VLDCTQCERKY 504
Query: 661 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP----EFHLNAIKKY 716
HV CL + L PK WFC C +I L LL +K+P +K
Sbjct: 505 HVVCLGNKREECLEYFPKEHWFCSKRCQQIFLGLHELL---GKKIPVGLHNLTWTLLKSI 561
Query: 717 AGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 776
N SDI+ A E +L+ A+ + H+ FDP+ + + RDL+ +++
Sbjct: 562 QFNDQCEASDIE----------ALSENYSMLNIALDMMHEFFDPVEEPHTKRDLLKDVIF 611
Query: 777 GR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 834
+ L F G Y +L + +S +RV+G++VAE+PLV T G +L
Sbjct: 612 SKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGTRFQYRRLGMCCILM 671
Query: 835 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
+EK L L V+ ++LPA A + W FGF K+
Sbjct: 672 NVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKL 707
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 149/351 (42%), Gaps = 70/351 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ P+ + H C S IP G+WYC C G V +DS
Sbjct: 23 CCDN--CPASY--HQACLSCQDIPDGNWYCSSCLCDI--------------CGEV--IDS 62
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
E +T +L A L C QCER++HV C+
Sbjct: 63 KELVT--------SLPA-----------------------LDCSQCERQYHVKCVSAK-- 89
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
E G WFC C I ++ + + ++ + T +I
Sbjct: 90 VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI--- 146
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-RNLRGQEFGGMY 789
A E + L A +I +CF PI+D +G D+IPS++Y R+ + G Y
Sbjct: 147 -------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYKGFY 199
Query: 790 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 849
+L + S+VS +R+ G +AE+PLVATSK N +G + L IE++L L+V+ +
Sbjct: 200 TVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSLKVEML 259
Query: 850 VLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 898
+L A WT FGF++ID +KR S+ L GT +L+K +
Sbjct: 260 LLSAIPHLAETWTSTFGFREIDES----DKKRLSKVRLAAVPGTVLLKKDL 306
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 122/251 (48%), Gaps = 16/251 (6%)
Query: 649 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ C QCE ++H C + G FC C + LQNLL + + PE+
Sbjct: 1065 SLHTCSQCEEQYHEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEY 1124
Query: 709 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 768
+++ + E V +D R E ++ A+++ +CF PI+D +G
Sbjct: 1125 SCRVVQRIHEDVPEEVLALDKRV----------ECNSRIAVALSLMDECFLPIIDQRTGI 1174
Query: 769 DLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
+LI ++VY G N +F G Y IL +++A +R+ G ++AE+P + T +
Sbjct: 1175 NLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRR 1234
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQL 885
+G + L IE +LS L V+ +++PA E WT KFGF + D E + K S L
Sbjct: 1235 QGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEKQEV--KSISML 1292
Query: 886 VTFKGTSMLQK 896
V F GT +LQK
Sbjct: 1293 V-FPGTGLLQK 1302
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 148/368 (40%), Gaps = 76/368 (20%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
++C C PS F H C L ++P+GDW+C C
Sbjct: 934 VLCDHC-----PSSF--HKSCLGLKTLPEGDWFCPSC----------------------- 963
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 666
C C +C F + C QCER+ CL+
Sbjct: 964 -----------C-------------CGICGENKFDGGSEQDNVVFSCYQCERQC---CLR 996
Query: 667 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVS 725
K L P G WFC C +I LQ LL + + +K LE
Sbjct: 997 KWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLE--- 1053
Query: 726 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 783
ID L A E L+ A+ + H+CF+P+ + + RD++ +++ G +L
Sbjct: 1054 -ID-----LPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRL 1107
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
F G Y +L N ++S +RV+G++VAE+PL+ T G +L +EK L
Sbjct: 1108 NFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHILMNELEKKLME 1167
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK---RVP 899
L V+ +VLPA + WT FGF K+ D E L R + F+ T M QK ++P
Sbjct: 1168 LGVERLVLPAVPSVLNTWTTSFGFSKMTDSERL---RFLDYSFLDFQDTVMCQKLLMKIP 1224
Query: 900 ACRIGSSS 907
+ S+
Sbjct: 1225 LAKSNQST 1232
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 173/427 (40%), Gaps = 95/427 (22%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNM 586
+V Y + +LEG+ GI C CC+ ++ S+FE HA S P + Y + ++
Sbjct: 621 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHA--GSKLPQPYQNIYLESGVSL 678
Query: 587 FERKRFLQHDA-NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
+ Q DA N E G SV+ + + C +C G
Sbjct: 679 LQ----CQIDAWNRQEHAEKIGFHSVD---------IDGNDPNDDTCGIC--------GD 717
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC---------------------- 683
G ++ CD C FH CL D++ LP G+W C
Sbjct: 718 GG-DLICCDGCPSTFHQSCL------DIQMLPPGEWHCPNCTCKFCGIASETSDKDDASV 770
Query: 684 -----CMDCSR----------------INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLE 722
C+ C + INS + +E ++L E+ +KKY G E
Sbjct: 771 NVLRTCILCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEY----LKKYLGTKHE 826
Query: 723 TVSDIDVRWRLLS-----------GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 771
+ W L+ G E L+ A+ + +CF P++D SG +LI
Sbjct: 827 L--EAGFSWCLIHRSDEDSEAACRGLTQRVECNSKLAIALTVMDECFLPVIDRRSGINLI 884
Query: 772 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 829
+++Y G N + G Y AIL +++A +R G ++AE+P + T I +G
Sbjct: 885 RNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKIAEMPFIGTRHIYRRQGM 944
Query: 830 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 889
+ LF+ IE L L+V+ +V+PA E WT FGF +D L + ++ F
Sbjct: 945 CRRLFSAIELALCSLKVEKLVIPAVAELTHTWTTVFGFTYLDESLRQ--EMKSLNMMVFP 1002
Query: 890 GTSMLQK 896
G MLQK
Sbjct: 1003 GIDMLQK 1009
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/428 (25%), Positives = 173/428 (40%), Gaps = 97/428 (22%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNM 586
+V Y + +LEG+ GI C CC+ ++ S+FE HA S P + Y + ++
Sbjct: 622 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHA--GSKLPQPYQNIYLESGVSL 679
Query: 587 FERKRFLQHDA-NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
+ Q DA N E G SV+ + + C +C G
Sbjct: 680 LQ----CQIDAWNRQEHAEKIGFHSVD---------IDGGDPNDDTCGIC--------GD 718
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS----------------- 688
G ++ CD C FH CL D++ LP G+W C M+C+
Sbjct: 719 GG-DLICCDGCPSTFHQSCL------DIQMLPPGEWRC-MNCTCKFCGIASGTSEKDDAS 770
Query: 689 ---------------------------RINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 721
INS + +E ++L E +KKY G
Sbjct: 771 VCVLHICNLCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSEH----LKKYLGTKH 826
Query: 722 ETVSDIDVRWRLLS-----------GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 770
E S W L+ G + E L+ + + +CF P++D SG +L
Sbjct: 827 ELESGFS--WSLIHRTDDDSEAACRGISQRVECNSKLAITLTVMDECFLPVIDRRSGINL 884
Query: 771 IPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKG 828
I +++Y G N + G Y AIL +++A +R G ++AE+P + T I +G
Sbjct: 885 IRNVLYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHIYRRQG 944
Query: 829 YFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTF 888
+ LF+ IE L L+V+ +V+PA E + WT FGF +D L + ++ F
Sbjct: 945 MCRRLFSAIESTLCSLKVEKLVIPAIAEVTNTWTTVFGFTHLDKSLRQ--EMKSLNMMVF 1002
Query: 889 KGTSMLQK 896
G MLQK
Sbjct: 1003 PGIDMLQK 1010
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 114 bits (286), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/70 (74%), Positives = 63/70 (90%), Gaps = 1/70 (1%)
Query: 586 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 645
MF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGF
Sbjct: 1 MFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGF 59
Query: 646 GPRTILLCDQ 655
GPR I++CDQ
Sbjct: 60 GPRMIIMCDQ 69
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 145/349 (41%), Gaps = 65/349 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C + P GDW+C C + GV S
Sbjct: 718 CCDG--CPSTF--HQSCLDILIPPPGDWHCPNCTCKY------------------CGVAS 755
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-LKKHK 669
++ +C+G + S S I C CE++FH C L+
Sbjct: 756 ID---------------------ICQGDNTSVS-----EISTCILCEKKFHESCNLEMDT 789
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 729
L FC C + LQ L + E F + I++ T D DV
Sbjct: 790 PVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------TSEDSDV 840
Query: 730 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 787
R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N + G
Sbjct: 841 SVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYSG 897
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
Y AIL ++SA +R G ++AE+P + T I +G + LF IE L +V+
Sbjct: 898 FYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKVE 957
Query: 848 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E W FGF ++P L R ++ F GT MLQK
Sbjct: 958 KLIIPAIAELMHTWNVIFGFSPLEPSLKQ--EMRLMNMLVFPGTDMLQK 1004
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 145/349 (41%), Gaps = 65/349 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C + P GDW+C C + GV S
Sbjct: 600 CCDG--CPSTF--HQSCLDILIPPPGDWHCPNC------------------TCKYCGVAS 637
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-LKKHK 669
++ +C+G + S S I C CE++FH C L+
Sbjct: 638 ID---------------------ICQGDNTSVS-----EISTCILCEKKFHESCNLEMDT 671
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 729
L FC C + LQ L + E F + I++ T D DV
Sbjct: 672 PVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------TSEDSDV 722
Query: 730 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 787
R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N + G
Sbjct: 723 SVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYSG 779
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
Y AIL ++SA +R G ++AE+P + T I +G + LF IE L +V+
Sbjct: 780 FYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKVE 839
Query: 848 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E W FGF ++P L R ++ F GT MLQK
Sbjct: 840 KLIIPAIAELMHTWNVIFGFSPLEPSLKQ--EMRLMNMLVFPGTDMLQK 886
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 112 bits (281), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 150/353 (42%), Gaps = 72/353 (20%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ P+ + H C IP G WYC C+
Sbjct: 1015 CCDN--CPASY--HVACLPSQEIPDGSWYCSSCR-------------------------- 1044
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
C +C SK P C QCER++H+ C+ +
Sbjct: 1045 ---------------------CDVCGEVVSSKEPRTPLHAFECSQCERQYHIKCISGKVL 1083
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
+ P G WFC C +I + L++ + +P+ HL+ ++ L D VR
Sbjct: 1084 CNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPD-HLD--DGFSCTILHNNGDQKVR 1134
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEFGGM 788
+ + A E + L A++I +CF PI D +G D++P ++Y N ++ G
Sbjct: 1135 --MAADIALLAECNMKLIIALSILEECFLPIFDPRTGMDIMPLILYNWRSNFVHLDYKGF 1192
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y +L + S++S +R+ G VAE+PL+AT N +G + + IE++L L+V+
Sbjct: 1193 YTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMCRRIVDYIEQMLKSLKVEM 1252
Query: 849 IVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLVTFKGTSMLQKRV 898
++L A WT FGF+ I D + LS R L + GT +L+K +
Sbjct: 1253 LLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSKIR-----LASVPGTVLLKKDL 1300
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 131/268 (48%), Gaps = 35/268 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
++ C+QC+R FH+ CLK+ D + WFC C+R+ S L+NLL
Sbjct: 315 LMACEQCQRRFHLTCLKE----DSCIVSSRGWFCSSQCNRVFSALENLL----------- 359
Query: 710 LNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSI 765
G+ + +D D+ W L+ G+ E L AV I H F+P D
Sbjct: 360 --------GSKIAVGNDGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTNDVF 411
Query: 766 SGRDLIPSMVYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 824
SGRDL+ ++Y ++ G G G Y ++ + ++ +RV ++V E+PLVAT
Sbjct: 412 SGRDLVEELIYRKDRTG--VGRGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSY 468
Query: 825 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCS 883
G ++L +EK +S + V +VLPAA+E + WT++FGF ++ E L + +
Sbjct: 469 RRSGMCRVLMDELEKQMSQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVK---H 525
Query: 884 QLVTFKGTSMLQKRVPACRIGSSSTDST 911
++ F GT M K + R + S + +
Sbjct: 526 GMLDFVGTIMCHKFLQKERAENDSAEES 553
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/415 (27%), Positives = 174/415 (41%), Gaps = 93/415 (22%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 598
+G GI+C+CC +S S F AHA S PQ FL+ +
Sbjct: 882 DGMVTWEGIVCNCCKKNLSVSDFMAHAG----RSHPQS-----------SLGLFLESGKS 926
Query: 599 A----VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 654
VEA S E +++R + +EA C C G G +L CD
Sbjct: 927 YTLCLVEAW------SAESMSRRSNAWGRKVEAIDESDDTCGFC-----GDGGE-LLCCD 974
Query: 655 QCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQNLLVQE 701
C +H CL +ELP+G W+C C C S +++ + +
Sbjct: 975 NCPSTYHQACLSA------KELPEGSWYCHNCTCQVCGGPFSEKEVSTFSAIFKCFQCGD 1028
Query: 702 A--------EKLP-EFHLN--------------AIKKYAGNSLETVSDIDVRWRLL---- 734
A EKLP E ++ ++ + G + + D D+ W +L
Sbjct: 1029 AYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGT--DNILDSDLSWSILRCNN 1086
Query: 735 SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 784
G+ A E + L+ A+ + +CF +VD +G D+IP ++Y G N +
Sbjct: 1087 DGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVD 1146
Query: 785 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 844
+ G Y IL ++ +RV G + AELP +ATS +G ++L + IEK+L
Sbjct: 1147 YQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDFRRQGMCRILMSIIEKMLCSF 1206
Query: 845 RVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRV 898
VK +VL A E S W FGFK I D E ++ L+ F GTS+L KR+
Sbjct: 1207 NVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHN---VNLMLFPGTSLLTKRL 1258
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 150/372 (40%), Gaps = 95/372 (25%)
Query: 548 ICHCCNS----EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 603
+CH C + PS F H C L S PQGDW+C C
Sbjct: 416 VCHDCGELLMCDRCPSMF--HHACVGLESTPQGDWFCPACT------------------- 454
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF-------SKSGFGP-RTILLCDQ 655
C +C D + GF R ++ C+Q
Sbjct: 455 ----------------------------CAICGSSDLDDPPATTTTQGFSSDRMVISCEQ 486
Query: 656 CEREFHVGCLKK------HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
C RE+HVGC+++ + AD +G W C CS+I L+ L V +A
Sbjct: 487 CRREYHVGCMRERDNGLWYPEAD----GEGPWLCSEACSKIYLRLEELAVVQAP------ 536
Query: 710 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 769
+G SL V R + + E L A+ + +CF +++ + D
Sbjct: 537 --CRSVASGLSL-------VVLRRGAARDGEEEEHAKLCMALDVLRECFVTLIEPRTQTD 587
Query: 770 LIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 827
L +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T +
Sbjct: 588 LTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFARRRQ 647
Query: 828 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRKRCSQ-- 884
G +LL I+KLL + V+ +VLPA E + WT FGF+ E+ R+ +
Sbjct: 648 GMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFR----EMGQADRQDVAHHA 703
Query: 885 LVTFKGTSMLQK 896
++ F+GT M K
Sbjct: 704 ILRFQGTIMCHK 715
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 66/366 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C + P GDW+C YC F G VSG S
Sbjct: 579 CCDG--CPSTF--HQGCLDIKKFPSGDWHCIYCCCKF--------------CGSVSG--S 618
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH-- 668
Q IV L L C CE ++H C++ +
Sbjct: 619 SNQRDDNDELIVSKL-------------------------LTCQLCEEKYHRSCIEANDA 653
Query: 669 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 728
D R++ +FC C ++ L+ LL + E + I++ D
Sbjct: 654 NTDDSRDV----FFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRRS-----------D 698
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFG 786
V + K E L+ AV+I +CF P +D SG +LI S++Y R N +
Sbjct: 699 VGFDASQIKPQMVECNSKLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGSNFNRLNYS 758
Query: 787 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 846
G AIL ++SA +R+ G ++AE+P + T + +G + L +E L L V
Sbjct: 759 GFVTAILERGDEIISAASIRIRGNQLAEMPFIGTRYMYRRQGMCRRLLNAVEWGLGSLNV 818
Query: 847 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 906
+ +V+PA E WT FGF+ ++ I + L+ F MLQK++ ++
Sbjct: 819 ELLVIPAISELRETWTSVFGFESLESTSKQILHNK--NLLVFPHVDMLQKKISKHKLAGQ 876
Query: 907 STDSTE 912
+ + +E
Sbjct: 877 NLNPSE 882
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 146/353 (41%), Gaps = 71/353 (20%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C L ++P W C C F H+ ++ +A + VDS
Sbjct: 1020 CCDG--CPSTF--HMSCLGLEALPTDYWCCSNCSCKF------CHEHSSDDAEDTADVDS 1069
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK-HK 669
++ C QCE + C
Sbjct: 1070 --------------------------------------SLHTCSQCEEQCTEACSPDIDS 1091
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 729
+A G FC C + LQNLL + + PE+ +++ E V +D
Sbjct: 1092 IATNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDK 1151
Query: 730 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 787
R E ++ A+++ +CF PIVD +G +LI ++VY G N +F G
Sbjct: 1152 RV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRG 1201
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
Y IL +++A +R+ G ++AE+P + T + +G + L IE +LS L ++
Sbjct: 1202 FYIIILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNIE 1261
Query: 848 SIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT KFGF +D E+ S+ ++ F GT +LQK
Sbjct: 1262 KLIIPAITELVDTWTSKFGFSPLDDSEKQEVKSV------SMLVFPGTGLLQK 1308
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 124/257 (48%), Gaps = 23/257 (8%)
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 710
L C QCER++H C+ + + P WFC C +I L++ + + F
Sbjct: 1107 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1165
Query: 711 NAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 770
++ + T +DI A E + L A++I +CF PI+D+ +G D+
Sbjct: 1166 TVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGIDI 1215
Query: 771 IPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT N
Sbjct: 1216 IPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQ 1273
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--Q 884
+G + L IE++L L+V+ ++L A WT FGF ID + RK S +
Sbjct: 1274 QGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRLR 1329
Query: 885 LVTFKGTSMLQKRVPAC 901
LV+ GT +L++ + C
Sbjct: 1330 LVSVPGTVLLKRNLYEC 1346
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 113/417 (27%), Positives = 171/417 (41%), Gaps = 99/417 (23%)
Query: 542 KNGL----GIICHCCNSEVSPSQFEAHAECASLSS-----IPQGDWYCKYCQNMFERKRF 592
K+GL G++C+CC +S S F AHA + S + G Y CQ F
Sbjct: 905 KDGLVTWEGVVCNCCKKTLSVSGFMAHAGFSHPQSSLGLFLESGKSY-TLCQVEAWSAEF 963
Query: 593 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 652
+ +NA +V +D + C G LLC
Sbjct: 964 MSRRSNAW-GRKVEAIDESDDTCGFCGD---------GGELLC----------------- 996
Query: 653 CDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQNLLV 699
CD C +H CL +ELP+G W+C C C S +++ +
Sbjct: 997 CDNCPSTYHPACLSA------KELPEGSWYCHNCTCQICGGPVSEKEVSTFSAIFKCFQC 1050
Query: 700 QEA--------EKLP-EFHLN--------------AIKKYAGNSLETVSDIDVRWRLL-- 734
+A EKLP E ++ ++ + G E + D ++ W +L
Sbjct: 1051 GDAYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGT--ENILDSELSWSILRC 1108
Query: 735 --SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 782
G+ A E + L+ A+ + +CF +VD +G D+IP ++Y G N
Sbjct: 1109 NNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFAR 1168
Query: 783 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 842
++ G Y IL ++ +RV G + AELP +ATS +G ++L IEK+L
Sbjct: 1169 VDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRILMNIIEKMLC 1228
Query: 843 FLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRV 898
VK +VL A E S W FGFK I D E ++ L+ F GTS+L KR+
Sbjct: 1229 SFNVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHN---VNLMLFPGTSLLTKRL 1282
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 97/172 (56%), Gaps = 3/172 (1%)
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK P + EL TG+L+G V Y+ K + + LRG+I+ GILCSCS C G +V+ P
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIM-RKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYY 315
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA + S YI ENG +L +VLRAC + L ML+ ++ A+ P+E+ F C
Sbjct: 316 FEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKS 375
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
CK +F GK C+SC++SK + ++ + R + S K+ +S
Sbjct: 376 CKSSFSTLRSGKF--ALFCDSCLESKGAKNNISSSKVGRSQTSSAKVYKSAS 425
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 57/81 (70%), Gaps = 2/81 (2%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
+K +++ASP + S +T KD+ +HK+VF LP+GT+VGYY G++LL+GY
Sbjct: 418 AKVYKSASP--GAKSSSVGRLTRKDKGMHKVVFMSGILPEGTDVGYYVGGKRLLDGYIKE 475
Query: 545 LGIICHCCNSEVSPSQFEAHA 565
LGI CHCC++ VSPSQFE HA
Sbjct: 476 LGIYCHCCSTVVSPSQFEGHA 496
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/414 (26%), Positives = 170/414 (41%), Gaps = 84/414 (20%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHD 596
+LEG+ GI C CC+ ++ S+FE HA S P + Y + ++ + Q +
Sbjct: 9 MLEGWVTRDGIHCGCCSKILTVSKFEIHA--GSKLRQPFQNIYLESGASLLD----CQIE 62
Query: 597 A-NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
A N E + G +V+ V + C LC G G ++ CD
Sbjct: 63 AWNRQEPVKRLGFQAVD---------VDGNDPNDDTCGLC--------GDGG-DLICCDG 104
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFC----CMDCS--------RINSVLQNLL----- 698
C FH CL D++ LP G W C C C R ++ + LL
Sbjct: 105 CPSTFHQSCL------DIKMLPPGDWHCPNCSCKFCGVASDKNFQRDDTTVSKLLTCSLC 158
Query: 699 --------VQEAEKLPEFHLNAIKKYAGNSLETVSD-------------IDVRWRLLSGK 737
+QE L N++ + G + + W L+
Sbjct: 159 VKKYHKSCMQEINTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIHRT 218
Query: 738 AATPETRLL-----------LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 784
A +T L L+ ++++ +CF PIVD SG +LI +++Y G N
Sbjct: 219 DADSDTSLQGLPQRVECNSKLAVSLSVMDECFLPIVDRRSGINLIQNVLYNCGSNFNRLN 278
Query: 785 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 844
FGG Y IL ++SA +R G +AE+P + T + +G + LF IE L L
Sbjct: 279 FGGFYALILERGDEIISAASIRFHGTRLAEMPFIGTRHMYRRQGMCRRLFYAIESTLCSL 338
Query: 845 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
+V+ +++PA E WT+ FGF +D L + ++ F G MLQK++
Sbjct: 339 KVEKLIIPAISELMHTWTEVFGFTTLDESLKQELKSM--NMLVFPGIDMLQKQL 390
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 144/367 (39%), Gaps = 91/367 (24%)
Query: 544 GLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 603
G I+C C PS F H C L +P+GDW+C C + L AN V+
Sbjct: 909 GTLILCDQC-----PSSF--HQSCLGLKDVPEGDWFCPSCCCGICGQNKLSEHANIVD-- 959
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
GP L C QCE ++HV
Sbjct: 960 ------------------------------------------GP--FLTCYQCECKYHVQ 975
Query: 664 CLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 723
CL+ K K WFC C +I LQ LL G S+
Sbjct: 976 CLRGTK--KFGSCSKPHWFCNKHCKQIYWGLQKLL-------------------GKSIPV 1014
Query: 724 VSDIDVRWRLLSGKAATP------------ETRLLLSQAVAIFHDCFDPIVDSISGRDLI 771
D ++ W LL ++ E + L+ A+ + H+CF+P+ + + RD++
Sbjct: 1015 GGD-NLTWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHECFEPVREQHTRRDIV 1073
Query: 772 PSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 829
+++ R L+ F G Y +L N +++ +RV+G++VAE+PLV T G
Sbjct: 1074 EDVIFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGTRFQYRRLGM 1133
Query: 830 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 889
+L +E+ L L V+ +VLPA WT FGF K+ S + + F+
Sbjct: 1134 CHILMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFLNY--TFLNFQ 1191
Query: 890 GTSMLQK 896
T M QK
Sbjct: 1192 ETVMCQK 1198
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 124/257 (48%), Gaps = 23/257 (8%)
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 710
L C QCER++H C+ + + P WFC C +I L++ + + F
Sbjct: 1056 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1114
Query: 711 NAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 770
++ + T +DI A E + L A++I +CF PI+D+ +G D+
Sbjct: 1115 TVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGIDI 1164
Query: 771 IPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT N
Sbjct: 1165 IPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQ 1222
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--Q 884
+G + L IE++L L+V+ ++L A WT FGF ID + RK S +
Sbjct: 1223 QGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRLR 1278
Query: 885 LVTFKGTSMLQKRVPAC 901
LV+ GT +L++ + C
Sbjct: 1279 LVSVPGTVLLKRNLYEC 1295
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 139/353 (39%), Gaps = 68/353 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF---ERKRFLQHDANAVEAGRVSG 607
CC+ PS F H C + +P GDW+C C F + F+Q D V
Sbjct: 775 CCDG--CPSTF--HQSCLDIMMLPPGDWHCPNCTCKFCGIASEDFVQEDGTNVSE----- 825
Query: 608 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 667
+L C C +++H CL+
Sbjct: 826 ------------------------------------------LLTCSLCAKKYHKSCLQD 843
Query: 668 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 727
+ FC C + LQ L + E F + + + DI
Sbjct: 844 VDAPCIDFNNSTPCFCGKTCRELFEQLQKYLGIKHELESGFSWSLVHRM---------DI 894
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 785
D+ L G E L+ A+++ +CF PIVD SG ++I +++Y G N +
Sbjct: 895 DLDMSL-QGLPQRVECNSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNRLNY 953
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G Y AIL ++SA +R G ++AE+P + T + +G + LF+ IE L L+
Sbjct: 954 SGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRRLFSAIESALCSLK 1013
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
V+ +++PA E WT FGF + L + ++ F G MLQK++
Sbjct: 1014 VQKLIIPAISELTHTWTGVFGFTTLSDSLKQELKSM--NMLVFPGIDMLQKQL 1064
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/350 (25%), Positives = 137/350 (39%), Gaps = 97/350 (27%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWY-----CKYCQNMFERKRFLQH 595
+ G I+C C PS F HA C L +P GDW+ C C F +
Sbjct: 710 HYGGKLILCDGC-----PSAF--HANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT---- 758
Query: 596 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
NA E +S C Q
Sbjct: 759 STNAKEEKFIS----------------------------------------------CKQ 772
Query: 656 CEREFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 714
CE ++H CL+ D L ++ KWFC DC I +L +L+ + E
Sbjct: 773 CELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE----------- 821
Query: 715 KYAGNSLETVSDIDVRWRLL------------SGKAATPETRLLLSQAVAIFHDCFDPIV 762
VS + WRL+ S A E +LS A+ + H+ F+P+
Sbjct: 822 ---------VSVEKLTWRLVQSLEPNMYGDDASKIEAAAENHCILSVALDVMHELFEPVK 872
Query: 763 DSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 820
GRDL +++ R + F G Y +L N+ +VS +R+ G++VAE+P + T
Sbjct: 873 RPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 932
Query: 821 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
+ +G ++L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 933 RFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 982
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 163/404 (40%), Gaps = 87/404 (21%)
Query: 546 GIICHCCNSEVSPSQFEAHAEC----ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
GI+C+CC +S S F+AHA C +SL Q CQ VE
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLGLFLQSGKSYTLCQ---------------VE 1011
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
A S E +++R + +EA C C G G +L CD C +H
Sbjct: 1012 AW------SAELMSRRSDAYGRKVEAVDENDDTCGFC-----GDGGE-LLCCDNCPSTYH 1059
Query: 662 VGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQNLLVQEAEK---- 704
CL +ELP+G W+C C C S + +L+ L +A
Sbjct: 1060 EACLSS------QELPEGSWYCHNCTCRSCGNPVNEKEVSSFSDILKCLQCGDAYHNTCI 1113
Query: 705 ----LPEFHLNAIKKYAGN-------------SLETVSDIDVRW----------RLLSGK 737
LP + + G +E V + D+ W RL S +
Sbjct: 1114 DRVMLPSDGKRSDTWFCGRYCKEIFMGLHSQVGVENVINNDLSWTILRCNSDGQRLHSAQ 1173
Query: 738 --AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAIL 793
E L+ A+ + +CF +VD +G D+IP ++Y G N ++ G Y IL
Sbjct: 1174 KIGLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARLDYKGFYTVIL 1233
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
++ +R+ G + AELP +ATS +G + L IEK+L V+ +VL A
Sbjct: 1234 EKGDEILCVASIRLHGTKAAELPFIATSVDYRRQGMCRRLLDIIEKMLRSFHVEMLVLSA 1293
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 897
E + W FGFK I+ + + R L+ F G S+L KR
Sbjct: 1294 IPELVNTWVSGFGFKPIEDD--EKKQLRNVNLMLFPGASLLTKR 1335
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 140/355 (39%), Gaps = 76/355 (21%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S IP GDWYC C
Sbjct: 812 CCDN--CPSTF--HLACLSTQEIPDGDWYCTNCT-------------------------- 841
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
C +C K L C QCE ++H CL+
Sbjct: 842 ---------------------CRICGNLVIDKDTLDAHDSLQCSQCEHKYHEKCLEDRDK 880
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 727
+ L WFC C + S LQ+ L+ Q A+ + L I + + +
Sbjct: 881 QEGAILDT--WFCGQSCQEVYSGLQSQVGLVNQVADGISWTLLRCI--HDDQKVHSA--- 933
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 785
+W L T L+ A+ I +CF + D +G LIP ++Y G F
Sbjct: 934 --QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHLIPQVLYNWGSEFARLNF 986
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G Y +L + ++S +RV G VAE+PL+AT +G +LL IE++L +
Sbjct: 987 QGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVTAIEQVLISFK 1046
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 898
V+ +V+ A + WT FGF +D I R+R ++ L+ F GT +L K +
Sbjct: 1047 VEKLVISAIPDLVETWTKGFGFIPVD----DIERQRLNKINLMVFPGTVLLVKSL 1097
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 108 bits (271), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 128/273 (46%), Gaps = 46/273 (16%)
Query: 639 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNL 697
D + GF IL CDQC R+FHV C + + L R+ WFC C + S LQ+L
Sbjct: 40 DVQQDGF----ILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHL 95
Query: 698 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LS 748
L G S+ +D ++ W LL K P+ L L
Sbjct: 96 L-------------------GKSVPVGTD-NLTWTLL--KRVEPDCFDLEVLSANNSKLK 133
Query: 749 QAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 806
A+ + H+CF+P D+ +G+DL+ +++ G NL F G Y +L N+ + + +R
Sbjct: 134 LALEVMHECFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVR 193
Query: 807 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 866
VFG +VAE+P VAT G ++L +E+ L L V+ +VLPAA W FG
Sbjct: 194 VFGDKVAEVPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFG 253
Query: 867 FKKI---DPELLSIYRKRCSQLVTFKGTSMLQK 896
F + D + S Y ++ F+GT + QK
Sbjct: 254 FSVMTYSDKKAHSDY-----PILFFQGTVLCQK 281
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/446 (24%), Positives = 192/446 (43%), Gaps = 95/446 (21%)
Query: 512 LHKLVFDESGLPDGTEVGYY-ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----- 565
+ + D L +G V Y ++ G + GI+C CCN S + F+ HA
Sbjct: 3 IFSWLIDGEILSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVH 62
Query: 566 ECASLSSIPQGDWYCKYCQNMFE-RKRFLQHDANAVEAGRVSGVDSVEQITKRCIR---- 620
A+L ++ G +++ E +K+ L+ A +G +V++ + +
Sbjct: 63 RTAALLTLEDG-------RSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKES 115
Query: 621 --IVKNLEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
+V ++E + + C +C G G + ++ CD C FH+ CL+ L +
Sbjct: 116 ELVVDDVEMDENDDTCAVC--------GDGGQ-LVCCDHCPSTFHLKCLR------LENV 160
Query: 677 PKGKWFC----CMDCSR--INSVLQN-LLVQEAEKLPEFHLN------------------ 711
P+G WFC C C R + +Q +L + +P +
Sbjct: 161 PEGDWFCPRCCCASCGRSLYDPTIQTEILYYHSNCVPGCAMKYESSDNQFCSRKCFKIFR 220
Query: 712 AIKKYAGNSLETVSDIDVRWRLLSGK-------------AATPETRLLLSQAVAIFHDCF 758
++K G + V D+ W LL + A TRL L A+ + +CF
Sbjct: 221 GLRKLVGR-VNKVDDM-YSWTLLRSEHYDQSAENSKLESVADLNTRLAL--ALTVIQECF 276
Query: 759 DPIVDSISGRDLIPSMVYGRNLRGQE----FGGMYCAILTVNSSVVSAGILRVFGQEVAE 814
P++D S D++ ++Y R RG++ F G Y +L ++S +RV G AE
Sbjct: 277 RPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAE 334
Query: 815 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP-- 872
+P + T +G + L I+++L L V+++VLPA E WT FGF+K+
Sbjct: 335 IPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQ 394
Query: 873 --ELLSIYRKRCSQLVTFKGTSMLQK 896
+L+ + +VTF G+S+LQK
Sbjct: 395 GIQLMEL------NIVTFPGSSVLQK 414
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 147/367 (40%), Gaps = 68/367 (18%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERK--RFLQ 594
LLEG+ + GI C CC+ + S+FE HA G C+ QN+ LQ
Sbjct: 208 LLEGWISRDGIRCGCCSEIFTISKFEIHA----------GMKLCEPSQNIILETGISLLQ 257
Query: 595 ---HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 651
N E SG V+ V + C +C G G ++
Sbjct: 258 CQLDSWNKQEESERSGFHLVD---------VGADDPNDDTCGIC--------GDG-GDLI 299
Query: 652 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 711
CD C FH CL L+ L K + LVQ E + LN
Sbjct: 300 CCDGCPSTFHQSCLDIQLFEQLQMLLGVK-------HELEDGFSWTLVQRTEVGFDISLN 352
Query: 712 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 771
I + E L+ A++I +CF PIVD SG +LI
Sbjct: 353 GIPQKV------------------------ECNSKLAVALSIMDECFLPIVDQRSGINLI 388
Query: 772 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 829
+++Y G N + G + AIL ++SA +R+ G ++AE+P + T I +G
Sbjct: 389 HNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGM 448
Query: 830 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 889
+ L IE L L V+ +V+PA E WT FGFK + E+ S R ++ F
Sbjct: 449 CRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPL--EVSSRKEMRNMNMLVFH 506
Query: 890 GTSMLQK 896
GT MLQK
Sbjct: 507 GTDMLQK 513
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 112/456 (24%), Positives = 175/456 (38%), Gaps = 93/456 (20%)
Query: 536 KLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQ-----------GDWYCKYCQ 584
K++ G G+ C CCN VS FEAHA Q G+ Q
Sbjct: 387 KVVSGAVTRTGVHCGCCNVVVSLPAFEAHAGRGDPGHGQQQRSWEKLLLVSGNSLLNRMQ 446
Query: 585 NMFERKR----FLQHDANAV-------------------EAGRVSGVDSVEQITKRCIRI 621
+E++R +Q A A + G V GV + +I +
Sbjct: 447 EAWEKERVKIFLVQEKARAALEQEQEKSAQAKRRLLAKQKKGAVEGVITSPRIRTKLRSG 506
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
K+ + G G +L CD C FH CL ++P+G W
Sbjct: 507 EKDSSDDACGVCADGG-----------ELLCCDSCPSTFHPACLAM-------KVPEGLW 548
Query: 682 FC----CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL---ETVSDIDVR---- 730
C C+ C + + K E ++ GN ET + +
Sbjct: 549 ACHYCRCVLCMANDDQGLSRCQHCTLKYHEICRPSLSNGRGNGAYCSETCKKVSAQLSDM 608
Query: 731 ------------WRLLSGKAATP----------ETRLLLSQAVAIFHDCFDPIVDSISGR 768
W LL + P E + L+ A+ + ++CF+P+ D +
Sbjct: 609 IGITNHTEDGFSWALLKIQKDEPVSSQNSPDVLECNVKLAVALGVLNECFNPVKDRRTKI 668
Query: 769 DLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 826
D++ VY G + + G Y +L N ++SA +LR+ G +VAE+P T
Sbjct: 669 DMLHQAVYSLGSEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRK 728
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 886
+G + L +E++L+ ++V+ +V+PA W F FK +DPEL R+R LV
Sbjct: 729 QGMMRRLVNAVEQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPELKEEIRRR--SLV 786
Query: 887 TFKGTSMLQKRVPACRIGSSS----TDSTECVSGVE 918
GT++LQK V A SS T++ SG E
Sbjct: 787 VITGTTLLQKPVVAAPPSPSSLHKQTEAAAAKSGAE 822
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 141/356 (39%), Gaps = 78/356 (21%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S IP GDWYC C
Sbjct: 812 CCDN--CPSTF--HLACLSTQEIPDGDWYCTNCT-------------------------- 841
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
C +C K L C QCE ++H CL+
Sbjct: 842 ---------------------CRICGNLVIDKDTSDAHDSLQCSQCEHKYHEKCLEDR-- 878
Query: 671 ADLRELP-KGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLETVSD 726
D +E+ WFC C + S LQ L+ Q A+ + L I + + +
Sbjct: 879 -DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISWTLLRCI--HDDQKVHSA-- 933
Query: 727 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 784
+W L T L+ A+ I +CF + D +G +IP ++Y G
Sbjct: 934 ---QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHMIPQVLYNWGSEFARLN 985
Query: 785 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 844
F G Y +L ++S +RV G VAE+PL+AT +G +LL + IE++L
Sbjct: 986 FQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVSAIEQMLISF 1045
Query: 845 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 898
+V+ +V+ A + WT FGF +D I R+R ++ L+ F GT +L K +
Sbjct: 1046 KVEKLVVSAIPDLVETWTKGFGFITVD----DIERQRLNKINLMVFPGTVLLVKSL 1097
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/425 (26%), Positives = 179/425 (42%), Gaps = 91/425 (21%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNM 586
+V Y + +LEG+ GI C CC+ ++ S+FE HA S +PQ QN+
Sbjct: 637 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG----SKLPQP------YQNI 686
Query: 587 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG--CLLCRGCDFSKSG 644
+ L+ + ++ ++ + E K C V + + C +C G
Sbjct: 687 Y-----LESGVSLLQC-QIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGIC--------G 732
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC--SRINS------ 692
G ++ CD C FH CL D++ LP G+W C C C + NS
Sbjct: 733 DGG-DLICCDGCPSTFHQSCL------DIQMLPLGEWHCPNCTCKFCGIASGNSEKDDAS 785
Query: 693 --VLQ--NLL--------VQEAEKLPEFHLNA----------------IKKYAGNSLETV 724
VLQ NL +E + LP ++N +KKY G E
Sbjct: 786 VYVLQICNLCEKKYHDSCTKEMDNLPN-NINTSSLSFCGKECKELSEHLKKYLGTKHEL- 843
Query: 725 SDIDVRWRLLS-----------GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 773
+ W L+ G + E L+ A+ + +CF P++D SG +LI +
Sbjct: 844 -EAGFSWSLIHRIDEDSEAACRGISQRVECNSKLAIALTVMDECFLPVIDRRSGINLIRN 902
Query: 774 MVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQ 831
++Y G N + G Y A L ++++ +R G ++AE+P + T + +G +
Sbjct: 903 VLYNSGSNFSRLNYSGFYTATLERGDEIIASASIRFHGTQIAEMPFIGTRHMYRRQGMCR 962
Query: 832 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGT 891
LF+ IE L L+V+ +V+PA E + WT FGF +D L + ++ F G
Sbjct: 963 RLFSAIESTLCSLKVEKLVIPAIAELTNTWTTVFGFTHLDESLRQ--EMKSLNMMVFPGI 1020
Query: 892 SMLQK 896
ML K
Sbjct: 1021 DMLMK 1025
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 158/375 (42%), Gaps = 76/375 (20%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S+ +P+G+WYC C
Sbjct: 959 CCDN--CPSTF--HHSCLSIQELPEGNWYCLNC--------------------------- 987
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
T R + N E E+S L C QCE+++H CLK+ +
Sbjct: 988 ----TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQRDI 1027
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVSDIDV 729
E WFC C +I + LQ+ L ++A G S + I
Sbjct: 1028 NSGVE--SHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWTLLRCIHY 1073
Query: 730 RWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEF 785
++LS A E L A+ I +CF +VD +G D+IP +VY + +F
Sbjct: 1074 DQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDF 1133
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G Y IL + ++ +RV G E+AE+PL+AT +G + L IE++L +
Sbjct: 1134 HGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFK 1193
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGS 905
VK +V+ A WT+ FGF ++ E K L+ F GT +L+K A +
Sbjct: 1194 VKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHKF--NLMVFPGTVLLKK---ALYVSG 1248
Query: 906 SSTDSTECV-SGVEV 919
+T++T + SGV++
Sbjct: 1249 QTTETTVGIHSGVQL 1263
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 142/348 (40%), Gaps = 61/348 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+S PS F H C + P G W C YC F ++G D+
Sbjct: 101 CCDS--CPSTF--HQSCLEIKKFPSGVWNCTYCSCKF---------------CGMAGGDT 141
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ D + + P +L C CE ++H C+
Sbjct: 142 CQM-------------------------DENDTAAQP-ALLACCLCEEKYHHSCILAENT 175
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
+ + FC C + LQ LL + E F ++++ + SDI
Sbjct: 176 VN--DGYSSVSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRF-----DVGSDIS-- 226
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
LSG E ++ A+ I +CF P+ D SG +LI ++VY G N + G
Sbjct: 227 ---LSGMHRKVECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNYSGF 283
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
AIL ++SA +R+ G +AE+P + T + +G + L + IE L L V+
Sbjct: 284 LTAILERGDEIISAASIRIHGNHLAEMPFIGTRHMYRRQGMCRRLLSAIETALCSLNVEK 343
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+V+PA E WT FGFK ++ S + R ++V F G MLQK
Sbjct: 344 LVIPAISELRETWTSVFGFKPLEGS--SKQKMRNMKMVAFPGIDMLQK 389
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 160/379 (42%), Gaps = 84/379 (22%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC++ PS F H C S+ +P+G+WYC C
Sbjct: 959 CCDN--CPSTF--HHSCLSIQELPEGNWYCLNC--------------------------- 987
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
T R + N E E+S L C QCE+++H CLK+
Sbjct: 988 ----TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQ--- 1024
Query: 671 ADLRELPKGK----WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVS 725
R++ G WFC C +I + LQ+ L ++A G S +
Sbjct: 1025 ---RDIDSGVESHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWTLLR 1069
Query: 726 DIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLR 781
I ++LS A E L A+ I +CF +VD +G D+IP +VY +
Sbjct: 1070 CIHYDQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFP 1129
Query: 782 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 841
+F G Y IL + ++ +RV G E+AE+PL+AT +G + L IE++L
Sbjct: 1130 RLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEML 1189
Query: 842 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 901
+VK +V+ A WT+ FGF ++ E K L+ F GT +L+K A
Sbjct: 1190 MSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHKF--NLMVFPGTVLLKK---AL 1244
Query: 902 RIGSSSTDSTECV-SGVEV 919
+ +T++T + SGV++
Sbjct: 1245 YVSGQTTETTVGIHSGVQL 1263
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 140/359 (38%), Gaps = 93/359 (25%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+ G I+C C PS F H C + +P GDW+C C
Sbjct: 706 HYGGELILCDHC-----PSSF--HKRCLGMKDVPDGDWFCPSC----------------- 741
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
++ G + +++ TK I V N C QCE ++
Sbjct: 742 -CCKICGQNKLKKDTKDFIDGVLN----------------------------CTQCEHQY 772
Query: 661 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNS 720
H+ CL ++ PK FC C + + + KL F +
Sbjct: 773 HIMCLSNSWTDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDAFDDETL------- 818
Query: 721 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 778
+ET S + + A+ + H+CF+PI + +GRDL+ +++ G
Sbjct: 819 VETYSKLKI--------------------ALDVVHECFEPIEEPRTGRDLMKDVIFSNGS 858
Query: 779 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 838
L F G Y +L N +VS +R+ G +VAE+PLV T G ++L +E
Sbjct: 859 ELNRLNFQGFYTILLEKNDELVSVATVRIHGDKVAEIPLVGTRFQFRQLGMCRILMDVLE 918
Query: 839 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
K L L V+ +VLPA + WT FGF K+ D E L + F+ T M QK
Sbjct: 919 KKLMELGVQRLVLPAVPGVLNTWTGSFGFSKMTDSERLQFVD---YTFLDFQDTVMCQK 974
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 168/378 (44%), Gaps = 74/378 (19%)
Query: 546 GIICHCCNSEVSPSQFEAHAECA----SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
I+C CC+ + + FE+HA C S S + + CQ + + + V
Sbjct: 37 AIVCDCCHVTFTITGFESHAGCTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVVN 96
Query: 602 AGRVSGVDSVEQITKRCIRIVK-NLEAELSG-CLLCRGCDFSKSGFGPRTILLCDQCERE 659
+ V++ K+ +VK N EA+ C +C GFG + LCD+C
Sbjct: 97 ENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSIC--------GFGG-DLALCDRCPSA 147
Query: 660 FHVGCLKKHKMADLRELPKGKWFCCMDCSRI---------------NSVLQNLLVQEAEK 704
FH+GCL L +P G+WFC C +I N++L + VQ +K
Sbjct: 148 FHLGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQECKDHKDNNIL--VCVQCEQK 199
Query: 705 LPEFHLNAIKKYA--GNSLET-----------------------VSDIDVRWRLLSGKAA 739
+H +K N +E V+D ++ W L+ ++
Sbjct: 200 ---YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGKPIKVAD-NLTWTLVKNVSS 255
Query: 740 TPETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ--EFGGMYCAI 792
+ L+ A+ + ++ F+P D++SGR+LI +V+ R F G Y I
Sbjct: 256 VDDKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVI 315
Query: 793 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 852
L V+S +R++GQ+VAE+ VAT + +G LL IEK L+ L V+ ++L
Sbjct: 316 LEKMGEVISVATVRIYGQKVAEVVFVATKEQYRRQGICHLLMDEIEKQLTRLGVEKLLLH 375
Query: 853 AAEEAESIWTDKFGFKKI 870
++E+A +IWT FGF ++
Sbjct: 376 SSEDAMNIWTKSFGFARM 393
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 143/375 (38%), Gaps = 101/375 (26%)
Query: 543 NGLGIIC--HCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+G ++C HC S F H +C + +P GDWYC+ C
Sbjct: 742 DGGDLVCCDHCA------STF--HLDCLGIK-LPSGDWYCRSC----------------- 775
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCER 658
LCR C F K P +L C QC R
Sbjct: 776 ---------------------------------LCRFCGFPQEKPSSSPELLLSCLQCSR 802
Query: 659 EFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKY 716
++H C +P FC C +I L LL IK +
Sbjct: 803 KYHQTCSSGTGTDFDCTIPGTSIDCFCSPGCRKIYKRLNKLL-------------GIKNH 849
Query: 717 AGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQ-------AVAIFHDCFDPIVDSI 765
+ W L+ + +A P+ + ++Q A + +CF P +D
Sbjct: 850 M--------EAGFSWSLVHCFPNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHIDER 901
Query: 766 SGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
SG ++I ++ Y G + +F G Y IL V+SA +R+ G ++AE+P + T +
Sbjct: 902 SGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGM 961
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 883
+G + L IE L L V+ +V+ A E E+ WT FGFK + P R +
Sbjct: 962 YRHQGMLRRLLNGIESALCSLNVQKLVVSAVTEMENTWTTVFGFKPVQPS--KKQRIKSL 1019
Query: 884 QLVTFKGTSMLQKRV 898
L+ GT +L+KR+
Sbjct: 1020 NLLIMNGTGLLEKRL 1034
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 105 bits (263), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 164/394 (41%), Gaps = 51/394 (12%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAECA----SLSSIPQGDWYCKYCQNMFERKRF 592
L +G+ GI+C CC+ +S S F+AHA + SL+ Q CQ +
Sbjct: 577 LKDGWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSLNLCLQSGKSFTLCQIEAWNAEY 636
Query: 593 LQHDANA----VEA--------------GRVSGVDSVEQITKRCIRIVKNLEAELSGC-- 632
+ +NA VEA G + D+ + VK L + C
Sbjct: 637 MDRRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHN 696
Query: 633 LLCR--GCDFSK---SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
+CR GC ++ S F I+ C QC H C++ A E+ +WFC C
Sbjct: 697 CICRICGCPVTEKEISSFS--AIIKCLQCGAAHHDTCVEMGATA-FEEMDSDEWFCGTHC 753
Query: 688 SRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLL 746
I L + E+ ++ G + +V I A E
Sbjct: 754 KEIYLGLHGCVGVESSLGDGLSWTILRCNSGGQKMHSVQKI----------AHAIECNSK 803
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGI 804
L+ A+ + +CF +VD+ +G ++IP ++Y G + G Y IL ++ A
Sbjct: 804 LAVALTLMEECFAQMVDTRTGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEEILCAAS 863
Query: 805 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 864
+RV G + AELP +AT + + KG + L IE++L VK +VL A E S W
Sbjct: 864 IRVHGMKAAELPFIATCREHRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELVSTWVSG 923
Query: 865 FGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQK 896
FGFK I+ RK+ L+ F GTS+L K
Sbjct: 924 FGFKPIE----EYERKQLDTINLMLFPGTSLLIK 953
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 138/348 (39%), Gaps = 63/348 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C ++ +P GDW+C C F G G ++
Sbjct: 513 CCDG--CPSTF--HQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADGSNA 554
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ T +EL C LC E+++H C++
Sbjct: 555 EDDTTV----------SELVTCSLC---------------------EKKYHTSCIQGVDA 583
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + LQ + + E F + I + + SD VR
Sbjct: 584 VLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDTSVR 638
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGM 788
G E+ L+ A+ + +CF IVD S +LI +++Y R N + G
Sbjct: 639 -----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGF 693
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y AIL ++ A +R+ G ++AE+P + T I +G + LF IE L L+V+
Sbjct: 694 YTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEM 753
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT FGF + E R ++ F GT MLQK
Sbjct: 754 LIIPAISELMHTWTVGFGFNPL--EESHKQELRSLNMLVFPGTDMLQK 799
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 138/348 (39%), Gaps = 63/348 (18%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C ++ +P GDW+C C F G G ++
Sbjct: 681 CCDG--CPSTF--HQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADGSNA 722
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ T +EL C LC E+++H C++
Sbjct: 723 EDDTTV----------SELVTCSLC---------------------EKKYHTSCIQGVDA 751
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
FC C + LQ + + E F + I + + SD VR
Sbjct: 752 VLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDTSVR 806
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGM 788
G E+ L+ A+ + +CF IVD S +LI +++Y R N + G
Sbjct: 807 -----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGF 861
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y AIL ++ A +R+ G ++AE+P + T I +G + LF IE L L+V+
Sbjct: 862 YTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEM 921
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+++PA E WT FGF + E R ++ F GT MLQK
Sbjct: 922 LIIPAISELMHTWTVGFGFNPL--EESHKQELRSLNMLVFPGTDMLQK 967
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 167/376 (44%), Gaps = 70/376 (18%)
Query: 546 GIICHCCNSEVSPSQFEAHAECA----SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
I+C CC+ + + FE+HA C S S + + CQ + + + V
Sbjct: 37 AIVCDCCHVTFTITGFESHAGCTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVVN 96
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
+ V++ K+ +VK ++E + +C C GFG + LCD+C FH
Sbjct: 97 ENQKKNHSIVKENRKKNHCVVKE-KSEANNDNVCSIC-----GFGG-DLALCDRCPSAFH 149
Query: 662 VGCLKKHKMADLRELPKGKWFCCMDCSRI---------------NSVLQNLLVQEAEKLP 706
+GCL L +P G+WFC C +I N++L + VQ +K
Sbjct: 150 LGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQECKDHKDNNIL--VCVQCEQK-- 199
Query: 707 EFHLNAIKKYA--GNSLET-----------------------VSDIDVRWRLLSGKAATP 741
+H +K N +E V+D ++ W L+ ++
Sbjct: 200 -YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGKPIKVAD-NLTWTLVKNVSSVD 257
Query: 742 ETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ--EFGGMYCAILT 794
+ L+ A+ + ++ F+P D++SGR+LI +V+ R F G Y IL
Sbjct: 258 DKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILE 317
Query: 795 VNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAA 854
V+S +R++GQ+VAE+ VAT + +G LL IEK L+ L V+ ++L ++
Sbjct: 318 KMGEVISVATVRIYGQKVAEVVFVATKEQYRRQGMCHLLMDEIEKQLTRLGVEKLLLHSS 377
Query: 855 EEAESIWTDKFGFKKI 870
E+A + WT FGF ++
Sbjct: 378 EDAMNTWTRSFGFARM 393
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 153/360 (42%), Gaps = 79/360 (21%)
Query: 544 GLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 603
G I+C C P+ F H C + IP G+WYC C
Sbjct: 365 GELILCDLC-----PAAF--HGSCLGIKGIPSGNWYCPSC-------------------- 397
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
K C ++ + + ++S F S + C QCE+ H+G
Sbjct: 398 ----------CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIG 433
Query: 664 CLKKHKMADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 721
C+K ++ + + + + WFC C I+ LQNLL ++ +P G++
Sbjct: 434 CVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDAR 480
Query: 722 ETVSDIDVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN 779
E ++ W L+ + R L++A+ + H F P+ D I+ DLI + +
Sbjct: 481 ENLT-----WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKR 535
Query: 780 LRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
+ F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L +
Sbjct: 536 SESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLNEL 595
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
E L + VK + LPA EA + WT FGF K+ D + L + + + F+ T QK
Sbjct: 596 EHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTVRCQK 652
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 129/269 (47%), Gaps = 37/269 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHK-MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ C+QC+R FH+ CLK+ + R WFC C+R++S L+NL+
Sbjct: 313 LMACEQCQRRFHLKCLKEEPGIVSCR-----GWFCSSQCNRVSSALENLI---------- 357
Query: 709 HLNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDS 764
G + ++ D+ W L+ G+ E L AV I H F+P D
Sbjct: 358 ---------GCKIAVGNNGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTKDV 408
Query: 765 ISGRDLIPSMVYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
SGRDL+ +++ ++ G G G Y ++ ++ +RV ++V E+PLVAT
Sbjct: 409 FSGRDLVEELIFRKDRTG--VGRGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSN 465
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRC 882
G ++L +EK +S + V +VLPAA+E S WT +FGF ++ E L + +
Sbjct: 466 YRRSGMCRVLVDELEKQMSQMGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVK--- 522
Query: 883 SQLVTFKGTSMLQKRVPACRIGSSSTDST 911
++ F GT M K + R + S + +
Sbjct: 523 HGMLDFVGTVMCHKFLVKERAENDSAEES 551
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 153/360 (42%), Gaps = 79/360 (21%)
Query: 544 GLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 603
G I+C C P+ F H C + IP G+WYC C
Sbjct: 342 GELILCDLC-----PAAF--HGSCLGIKGIPSGNWYCPSC-------------------- 374
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
K C ++ + + ++S F S + C QCE+ H+G
Sbjct: 375 ----------CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIG 410
Query: 664 CLKKHKMADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 721
C+K ++ + + + + WFC C I+ LQNLL ++ +P G++
Sbjct: 411 CVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDAR 457
Query: 722 ETVSDIDVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN 779
E ++ W L+ + R L++A+ + H F P+ D I+ DLI + +
Sbjct: 458 ENLT-----WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKR 512
Query: 780 LRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
+ F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L +
Sbjct: 513 SESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLNEL 572
Query: 838 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
E L + VK + LPA EA + WT FGF K+ D + L + + + F+ T QK
Sbjct: 573 EHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTVRCQK 629
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 111/221 (50%), Gaps = 16/221 (7%)
Query: 679 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKA 738
G FC C + LQNLL + + PE+ +++ + E V +D R
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLALDKRV------- 76
Query: 739 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVN 796
E ++ A+++ +CF PI+D +G +LI ++VY G N +F G Y IL
Sbjct: 77 ---ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERG 133
Query: 797 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 856
+++A +R+ G ++AE+P + T + +G + L IE +LS L V+ +++PA E
Sbjct: 134 DEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITE 193
Query: 857 AESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
WT KFGF + D E + K S LV F GT +LQK
Sbjct: 194 LVDTWTSKFGFSPLEDSEKQEV--KSISMLV-FPGTGLLQK 231
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 93/171 (54%), Gaps = 12/171 (7%)
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 789
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 40 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 97
Query: 790 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 849
CA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V +
Sbjct: 98 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 157
Query: 850 VLPAAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 896
P E IW++K GF + E +L + LV FK ++QK
Sbjct: 158 TAPVDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 203
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 160/385 (41%), Gaps = 48/385 (12%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYC----------------KYCQNMFER 589
GI C CC+ ++ ++FE HA S P + + K QN E+
Sbjct: 780 GICCDCCSKILTVAKFELHA--GSKEQQPYANIFLEDGGLSLFQCLLNAWDKQAQN--EK 835
Query: 590 KRFLQHDANAVEAGRVSGV--DSVEQIT-KRCIRI--VKNLEAEL-SGCLLCRGC--DFS 641
K F + D G+ D + + RC V L E+ SG CR C F
Sbjct: 836 KGFYKIDPADDPDDDTCGICGDGGDLLCCDRCTSTFHVACLGIEMPSGDWFCRNCICKFC 895
Query: 642 KSG----FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQ 695
S P +L C QC R++H C + + + P FC C++I L+
Sbjct: 896 GSAEERTSSPAELLSCLQCSRKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTKIYKRLK 955
Query: 696 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 755
LL + + F + ++ +A D KA + A ++
Sbjct: 956 RLLGLKNDLEAGFSWSLVRCFA----------DTEATSTKKKAQLVHCNSKTALAFSVLD 1005
Query: 756 DCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 813
+CF P +D SG ++I ++VY G + F G Y IL V+SA +R+ G + A
Sbjct: 1006 ECFLPRIDERSGINIIHNVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVRIHGTDFA 1065
Query: 814 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE 873
E+P + T + +G L IE L L V+ +V+PA E ++ W+ FGFK + P
Sbjct: 1066 EMPFIGTRGMYRHQGMCHRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFGFKPVGP- 1124
Query: 874 LLSIYRKRCSQLVTFKGTSMLQKRV 898
+ + L+ GT +L+KR+
Sbjct: 1125 -TKKQKIKSVNLLIIHGTGLLEKRL 1148
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 115/429 (26%), Positives = 171/429 (39%), Gaps = 94/429 (21%)
Query: 546 GIICHCCNSEVSPSQFEAHAECA----SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
GI+C+CC S S F+ H C+ SL Q CQ VE
Sbjct: 956 GILCNCCTKTFSISDFKVHGGCSLPKSSLGLFLQSGKSYTLCQ---------------VE 1000
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
A S E ++++C + +EA C C G G +L CD C +H
Sbjct: 1001 AW------SAEFLSRKCDASGRKVEAMDENDDTCGFC-----GDGGE-LLCCDNCPSTYH 1048
Query: 662 VGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQNLLVQEA------ 702
CL +ELP+G W+C C C S +++L+ L ++
Sbjct: 1049 QTCLSD------QELPEGSWYCHNCTCRSCGNPLSEKEVSTFSAILKCLQCGDSYHDTCI 1102
Query: 703 --EKLP----------------EFHLNAIKKYAGNSLETVSDIDVRWRLL---------- 734
E LP E + + + G +E D ++ W +L
Sbjct: 1103 DQEMLPCGDKQSNIWFCGRYCKEIFI-GLHNHVG--IENFLDNELSWSILKCNTDGQKLH 1159
Query: 735 SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 792
S K A E L+ A+ I +CF +VD +G D+IP ++ N ++ G Y I
Sbjct: 1160 SSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDYQGFYTVI 1217
Query: 793 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 852
L ++ +RV G + AELP +ATS +G + L IE +L V+++VL
Sbjct: 1218 LEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLS 1277
Query: 853 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTD-ST 911
A E + W FGFK I+ + R L+ F GTS+L KR+ S D
Sbjct: 1278 AIPELVNTWVSGFGFKPIEDN--EKKQLRNVNLMLFPGTSLLTKRLDGITAAKSEEDKDA 1335
Query: 912 ECVSGVEVG 920
VSG+ G
Sbjct: 1336 YNVSGLPNG 1344
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 130/276 (47%), Gaps = 42/276 (15%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPE 707
++ C+ C FH+ C+ L E+PK WFC C+ C L+ ++ E+
Sbjct: 281 LVCCETCPLTFHMECVS------LLEVPKDAWFCFRCL-CCHCGEPLRTQPCEQCERC-- 331
Query: 708 FH--------------------LNAIKKYAGNSLETVSDI---DVRWRLLSGKAATPETR 744
FH N ++ A + TV+ + ++ W LL +
Sbjct: 332 FHPGCCDDAILAGDFFFCSSGCWNIFQRLA-EMVATVNPLGRSELSWSLLRRGRCDDK-- 388
Query: 745 LLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ--EFGGMYCAILTVNSSVVSA 802
LL++A+ + FDP++D + D + +MV+ R+ +F G Y A+L + VV
Sbjct: 389 -LLAEALQVISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGV 447
Query: 803 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 862
+LR+ G +AE+P +AT G+G + LF +E++L+ L V+ +VL AA++ E +W
Sbjct: 448 AVLRIHGAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVEMMVLLAAKDTEKMWK 507
Query: 863 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
+ F F +D +L + R LV G LQK V
Sbjct: 508 NSFEFHAMDRKLKA--RTVALGLVALNGAGFLQKSV 541
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 131/299 (43%), Gaps = 59/299 (19%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-------------------CMDCSRI 690
++LCD+C FH+GCL +L+++P WFC C+ C+R
Sbjct: 155 LILCDKCPSTFHLGCL------ELKDVPLENWFCPSCCCELCGKGDSSTSTNACLQCARA 208
Query: 691 NSVLQNLLVQEAEKLPE------------FHLNA-IKKYAGNSLETVSDIDVRWRLLSGK 737
V + L ++ LP + L A + + G S T D + W L
Sbjct: 209 YHV--HCLTKDGCLLPTDYPSENFCSKSCYELCAQLHQLLGISNPTSVD-GLTWTLTRSS 265
Query: 738 AAT--------PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 787
T + Q + + H+CF + + + +D++ ++Y G + F G
Sbjct: 266 KDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDMVTDLIYNSGSKFKRLNFHG 325
Query: 788 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 847
Y +L +VS LR+ G + AE+PLVAT +G +LL + KLL+ RV+
Sbjct: 326 FYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQGMCRLLMQEVLKLLNKFRVE 385
Query: 848 SIVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
++LPA + +W FGF ++ + + LS Y V F+GT MLQ + + RI
Sbjct: 386 RLILPAIPQLRKMWEASFGFSEMPLSERQQLSGY-----SFVGFQGTMMLQNVLTSSRI 439
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 102 bits (254), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 145/349 (41%), Gaps = 75/349 (21%)
Query: 563 AHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIV 622
+H +C L IPQGDW C YC + K C +
Sbjct: 232 SHLKCMGLERIPQGDWICPYC------------------------------VCKHCNKND 261
Query: 623 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
K+L+ C QC++++H CL +K DL G+
Sbjct: 262 KDLQT-------------------------CVQCDKKYHCQCLVSNKELDLN--ASGETL 294
Query: 683 CC-MDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 741
C C + LQ+L+ + E F +++ ++L+ D+ +
Sbjct: 295 ACDSHCGEVYEKLQSLVGVKHELEGGFCWTLLQRMEPDNLD-FKDLHL----------IT 343
Query: 742 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSV 799
E ++ A + +CF I+D + +++ S+ Y R NL F G Y AIL N +
Sbjct: 344 ECNSKIALAWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRINFRGFYTAILEKNDDI 403
Query: 800 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 859
+SA +RV G ++AE+P + T + G ++L +E + S + V+ +++P+ +E
Sbjct: 404 ISAATIRVHGTDLAEMPFIGTRHLYRQNGMSRMLLVTLESIFSVMGVEHLIIPSVQELTE 463
Query: 860 IWTDKFGFKKIDPELLSIYRKRCS-QLVTFKGTSMLQKRVPACRIGSSS 907
+W K GF I+ ++ +K + +TF LQK + + SSS
Sbjct: 464 MWEGKCGFSPIED---AVSQKITNWNTLTFPSAVRLQKALLSTPASSSS 509
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 102 bits (254), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 101/405 (24%), Positives = 168/405 (41%), Gaps = 94/405 (23%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 605
G++C CCN VS S+F+ HA + P C N+F + ++
Sbjct: 657 GVVCTCCNKTVSLSEFKNHA--GFNQNCP--------CLNLF------MGSGKPFASCQL 700
Query: 606 SGVDSVEQITKRCIRIVK--NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
+ + + R+ K + + C +C G G ++ CD C FH
Sbjct: 701 EAWSAEYKARRNGWRLEKASDDDPNDDSCGVC--------GDGGE-LICCDNCPSTFHQA 751
Query: 664 CLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPEFHLNA-IKKYAG 718
CL ++ LP+G W+C C CS L+ AE+ +F + KY G
Sbjct: 752 CLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNAERSQDFKCSQCAHKYHG 798
Query: 719 NSLETVS--------------------------------DID-VRWRLL----------S 735
L+ +S + D + W +L S
Sbjct: 799 TCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWSILKCFQEDGMVHS 858
Query: 736 GK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCA 791
+ A E L+ A++I + F +VD +G D+IP ++Y G +F G Y
Sbjct: 859 ARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSTFARLDFDGFYTV 918
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
++ + ++S +RV G +AE+PLVAT +G ++L A IE++L L+V+ +V+
Sbjct: 919 VVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVV 978
Query: 852 PAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
A WT+ FGFK +D E ++ L+ F GT++L+K
Sbjct: 979 AALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTTLLKK 1021
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/407 (24%), Positives = 169/407 (41%), Gaps = 94/407 (23%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 605
G++C CCN VS S+F+ HA + P C N+F + ++
Sbjct: 613 GVVCTCCNKTVSLSEFKNHA--GFNQNCP--------CLNLF------MGSGKPFASCQL 656
Query: 606 SGVDSVEQITKRCIRIVK--NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
+ + + R+ K + + C +C G G ++ CD C FH
Sbjct: 657 EAWSAEYKARRNGWRLEKASDDDPNDDSCGVC--------GDGGE-LICCDNCPSTFHQA 707
Query: 664 CLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPEFHLNA-IKKYAG 718
CL ++ LP+G W+C C CS L+ AE+ +F + KY G
Sbjct: 708 CLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNAERSQDFKCSQCAHKYHG 754
Query: 719 NSLETVS--------------------------------DID-VRWRLL----------S 735
L+ +S + D + W +L S
Sbjct: 755 TCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWSILKCFQEDGMVHS 814
Query: 736 GK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCA 791
+ A E L+ A++I + F +VD +G D+IP ++Y G +F G Y
Sbjct: 815 ARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSTFARLDFDGFYTV 874
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
++ + ++S +RV G +AE+PLVAT +G ++L A IE++L L+V+ +V+
Sbjct: 875 VVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVV 934
Query: 852 PAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
A WT+ FGFK +D E ++ L+ F GT++L+K +
Sbjct: 935 AALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTTLLKKTL 979
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 129/338 (38%), Gaps = 99/338 (29%)
Query: 541 YKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWY-----CKYCQNMFERKRFLQH 595
+ G I+C C PS F HA C L +P GDW+ C C F +
Sbjct: 701 HYGGKLILCDGC-----PSAF--HANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT---- 749
Query: 596 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
NA E +S C Q
Sbjct: 750 STNAKEEKFIS----------------------------------------------CKQ 763
Query: 656 CEREFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 714
CE ++H CL+ D L ++ KWFC DC S+ N+ +A K+
Sbjct: 764 CELKYHPSCLRYDGACDSLDKILGEKWFCSKDCE--ESLEPNMYGDDASKI--------- 812
Query: 715 KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 774
A E +LS A+ + H+ F+P+ GRDL +
Sbjct: 813 -----------------------EAAAENHCILSVALDVMHELFEPVKRPHGGRDLAEDV 849
Query: 775 VYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 832
++ R + F G Y +L N+ +VS +R+ G++VAE+P + T + +G ++
Sbjct: 850 IFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRV 909
Query: 833 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 910 LINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 947
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 119/465 (25%), Positives = 185/465 (39%), Gaps = 95/465 (20%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQF 561
K + + D + DG V Y G +K++ G G+ C CC+ V F
Sbjct: 367 KKHTILTWLIDGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVHCSCCDGVVPLPVF 426
Query: 562 EAHAECASLSSIP------------QGDWYCKYCQNMFERKRFLQHDANA-VEAGRVSGV 608
EAHA P G+ + Q +E ++ A A V A
Sbjct: 427 EAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFHAQAKVRAALEQEE 486
Query: 609 DSVEQITKRCI-----------RI-------VKNLEAELS--GCLLCRGCDFSKSGFGPR 648
D Q +R + RI +K E + S C +C + G
Sbjct: 487 DKCSQAKRRLLAKHLKKGVVVERIMSPRMEKIKAGEKDSSDDACGVC-----ADGG---- 537
Query: 649 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLV--QEA 702
+L CD C FH CL ++P+G W C C+ C N LQ L Q A
Sbjct: 538 ELLCCDSCTSTFHPECLAI-------KVPEGSWSCHYCRCVLCMS-NDDLQGLSTCQQCA 589
Query: 703 EKLPE----FHLNA--IKKYAGNS-------LETVSDI------DVRWRLLSGKAATP-- 741
K E N I Y G + L V+ + W LL + P
Sbjct: 590 RKYHESCRPLPGNGCDIGTYCGETCKKLFSQLAQVTGVTNPTGDGFWWALLRIQKDEPAS 649
Query: 742 --------ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCA 791
E + L+ A+ +F++CF+P+ D + D++ VY G + + G Y
Sbjct: 650 SEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGSQFKRLSYEGFYTM 709
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
+L + +VSA +LR+ G +VAE+P T +G + L + +E++L+ ++V+ +V+
Sbjct: 710 VLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRRLVSAVEQVLASVQVEKLVI 769
Query: 852 PAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
PA + W F F+ +DP+L KR S LV GT++L K
Sbjct: 770 PAIDSLVDTWKRSFFFRPVDPQLREEL-KRLS-LVVITGTTLLHK 812
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 168/402 (41%), Gaps = 88/402 (21%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF--ERKRFL--QHDANAVE 601
G++C CCN VS S+F+ HA + P C N+F K F Q +A + E
Sbjct: 645 GVVCTCCNRTVSLSEFKNHA--GFNQNCP--------CLNLFMGSGKPFASCQLEAWSAE 694
Query: 602 -AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
R +G S E + + C +C G G ++ CD C F
Sbjct: 695 YKARRNGWRSEE---------ASDDDPNDDSCGVC--------GDGGE-LICCDNCPSTF 736
Query: 661 HVGCLKKHKMADLRELPKGKWFCC--------------------MDCSRINSVLQNLLVQ 700
H CL ++ LP+G W+C CS+ + +Q
Sbjct: 737 HQACLS------MQVLPEGSWYCSSCSCQICSELVSDNGERSQDFKCSQCAHKYHGICLQ 790
Query: 701 EAEK----LPEFHL---NAIKKYAGNSLETVSDID-----VRWRLL-----SGK------ 737
K PE + N K Y G S V I+ + W +L GK
Sbjct: 791 GISKRRKLFPETYFCGKNCEKVYTGLS-SRVGVINPNADGLSWSILKCFQEDGKVHSARR 849
Query: 738 -AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILT 794
A E L+ A++I + F +VD +G D+IP ++Y G N +F G Y +L
Sbjct: 850 LALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSNFARLDFDGFYTMVLE 909
Query: 795 VNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAA 854
+ ++S +RV G VAE+PLVAT +G ++L A IE++L L+V+ +V+ A
Sbjct: 910 KDDVMISVASIRVHGVTVAEMPLVATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVVAAL 969
Query: 855 EEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
WT+ FGFK +D E ++ L+ F GT +L K
Sbjct: 970 PSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTILLMK 1009
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 129/270 (47%), Gaps = 33/270 (12%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM------------DCSRINSVLQNL 697
+LLCD+C FH C+ L+ P+G W C + D + + +
Sbjct: 576 LLLCDKCPSAFHHACVG------LQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 698 LVQEAEKLPE----FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 753
+ EA +P L+ +++ + TV+ I RW+ AA L A+ +
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRR--RYMSTVTRI-TRWQHEEEDAADHGQ---LCAALDV 683
Query: 754 FHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 811
H+CFD +V+ + DL +V+ + L F G Y L +++ G LRVFG +
Sbjct: 684 LHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVFGNQ 743
Query: 812 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 871
VAELPLV T + +G +LL +EK+L + V+ +VLPA E +WT GF +
Sbjct: 744 VAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFHAMT 803
Query: 872 -PELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
+++ + + +++FKGT+M QK + A
Sbjct: 804 RSDVMEMAVEHA--ILSFKGTTMCQKTLLA 831
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 148/357 (41%), Gaps = 83/357 (23%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+S PS F H C ++ +PQG W C YC
Sbjct: 467 CCDS--CPSTF--HPACLAMK-VPQGWWACHYC--------------------------- 494
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
RC+ + N + LS C C ++H C ++ +
Sbjct: 495 ------RCVLCMANDDQGLS---------------------TCQHCSLKYHEVC-RRPSL 526
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSDID 728
++ R + +C C ++++ L +++ E + L I+K E VS D
Sbjct: 527 SNGRGIGA---YCSETCKKVSARLSDMVGVTNHTEDGFSWALLKIQKD-----EAVSSQD 578
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 786
AA E + L+ A+ + ++CF+P D + D++ VY G + +
Sbjct: 579 T--------AAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAVYSLGSEFKRVSYE 630
Query: 787 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 846
G Y +L + ++A +LR+ G +VAE+P AT +G + L +E++L+ ++V
Sbjct: 631 GFYTMVLDKDGETIAAALLRIHGTKVAEMPFAATLPAYRKQGMMRRLVNAVEQVLASVQV 690
Query: 847 KSIVLPAAEEAESIWTDKFGFKK-IDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
+V+PA WT F F+ +DPE R+R LV GT++L K V A R
Sbjct: 691 DKLVIPAIAALVDTWTRSFSFRPLLDPESREEIRRR--SLVVIAGTTLLHKPVAAAR 745
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 100/414 (24%), Positives = 164/414 (39%), Gaps = 86/414 (20%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 597
LEG GI C CC+ ++ ++FE HA G + +N+F L+
Sbjct: 654 LEGRITRDGIHCGCCSKILTVAKFELHA----------GSKEQQPYENIF-----LEDGG 698
Query: 598 NAVEAGRVSGVDSVEQITKRCI-RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 656
+ V Q K+ ++ + + C +C G G +L CD C
Sbjct: 699 ATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGIC--------GDGG-DLLCCDNC 749
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL----------- 705
FH+ CL ++P G W C R Q + AE L
Sbjct: 750 PSTFHLACLG-------IKMPSGDWHCSSCICRFCGSTQEITTSSAELLSCLQCSRKYHQ 802
Query: 706 ---PEFHLNAIKKYAGNS---------------------LETVSDIDVRWRLL----SGK 737
P +++K + +S ++ + W L+
Sbjct: 803 VCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKL 862
Query: 738 AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
AA P+ + L + A ++ +CF P +D SG ++I +++Y G + F
Sbjct: 863 AAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKF 922
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y IL V+SA +R+ G ++AE+P + T I +G L IE LS L V+
Sbjct: 923 YTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRR 982
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPA 900
+V+PA E ++ WT FGFK ++P R++ L + GT +L+KR+ A
Sbjct: 983 LVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLA 1032
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 100/414 (24%), Positives = 164/414 (39%), Gaps = 86/414 (20%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 597
LEG GI C CC+ ++ ++FE HA G + +N+F L+
Sbjct: 655 LEGRITRDGIHCGCCSKILTVAKFELHA----------GSKEQQPYENIF-----LEDGG 699
Query: 598 NAVEAGRVSGVDSVEQITKRCI-RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 656
+ V Q K+ ++ + + C +C G G +L CD C
Sbjct: 700 ATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGIC--------GDGG-DLLCCDNC 750
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL----------- 705
FH+ CL ++P G W C R Q + AE L
Sbjct: 751 PSTFHLACLG-------IKMPSGDWHCSSCICRFCGSTQEITTSSAELLSCLQCSRKYHQ 803
Query: 706 ---PEFHLNAIKKYAGNS---------------------LETVSDIDVRWRLL----SGK 737
P +++K + +S ++ + W L+
Sbjct: 804 VCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKL 863
Query: 738 AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
AA P+ + L + A ++ +CF P +D SG ++I +++Y G + F
Sbjct: 864 AAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKF 923
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y IL V+SA +R+ G ++AE+P + T I +G L IE LS L V+
Sbjct: 924 YTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRR 983
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPA 900
+V+PA E ++ WT FGFK ++P R++ L + GT +L+KR+ A
Sbjct: 984 LVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLA 1033
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 126/308 (40%), Gaps = 56/308 (18%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQN 696
+L CD C +H CL +ELP+G W+C C C S +++L+
Sbjct: 17 LLCCDNCPSTYHQTCLSD------QELPEGSWYCHNCTCRSCGNPLSEKEVSTFSAILKC 70
Query: 697 LLVQEA--------EKLPEFHLNAIKKYAGN-------------SLETVSDIDVRWRLLS 735
L ++ E LP + + G +E D ++ W +L
Sbjct: 71 LQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILK 130
Query: 736 GK------------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 781
A E L+ A+ I +CF +VD +G D+IP ++Y G N
Sbjct: 131 CNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVLYNKGSNFA 190
Query: 782 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 841
++ G Y IL ++ +RV G + AELP +ATS +G + L IE +L
Sbjct: 191 RLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMML 250
Query: 842 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 901
V+++VL A E + W FGFK I+ R L+ F GTS+L KR+
Sbjct: 251 RSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGI 308
Query: 902 RIGSSSTD 909
S D
Sbjct: 309 TAAKSEED 316
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 143/361 (39%), Gaps = 72/361 (19%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVSGVD 609
CC+ PS F H C + P G WYC C F E+ +H+ + + +
Sbjct: 664 CCDG--CPSTF--HQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS------- 712
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
LS C LC E ++H C+ +
Sbjct: 713 -------------------LSSCRLC---------------------EEKYHQACINQDG 732
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDID 728
FC C + LQ L + LPE F + ++++ S V+D D
Sbjct: 733 TVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVADCD 787
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 786
+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N +F
Sbjct: 788 ISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDFS 839
Query: 787 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 846
A+L +++ +R+ G ++AE+P + T + +G + L IE L L+V
Sbjct: 840 SFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGSLKV 899
Query: 847 KSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGS 905
+V+PA E WT FGF + D E +I + L+ F G ML K + +I
Sbjct: 900 DKLVIPAVPELIDTWTSGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKEKITD 956
Query: 906 S 906
S
Sbjct: 957 S 957
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/414 (24%), Positives = 162/414 (39%), Gaps = 86/414 (20%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 597
LEG GI C CC+ ++ ++FE HA Q +E FL+
Sbjct: 232 LEGRITRDGIHCGCCSKILTVAKFELHAGSKE--------------QQPYENI-FLEDGG 276
Query: 598 NAVEAGRVSGVDSVEQITKRCI-RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 656
+ V Q K+ ++ + + C +C G G +L CD C
Sbjct: 277 ATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGIC--------GDGG-DLLCCDNC 327
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL----------- 705
FH+ CL ++P G W C R Q + AE L
Sbjct: 328 PSTFHLACLGI-------KMPSGDWHCRSCICRFCGSTQEITTSSAELLSCLQCSRKYHQ 380
Query: 706 ---PEFHLNAIKKYAGNS---------------------LETVSDIDVRWRLL----SGK 737
P +++K + +S ++ + W L+
Sbjct: 381 VCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVKNAIEAGFSWSLVRCFPDKL 440
Query: 738 AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
AA P+ + L + A ++ +CF P +D SG ++I +++Y G + F
Sbjct: 441 AAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKF 500
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
Y IL V+SA +R+ G ++AE+P + T I +G L IE LS L V+
Sbjct: 501 YTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRR 560
Query: 849 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPA 900
+V+PA E ++ WT FGFK ++P R++ L + GT +L+KR+ A
Sbjct: 561 LVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLA 610
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 154/385 (40%), Gaps = 83/385 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 598
EG GI C CC + S FE HA S C+ N+F D
Sbjct: 628 EGKLTREGIKCSCCRRIFTISGFEVHANGGS----------CRAAANIF------LDDGR 671
Query: 599 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 658
++ +V ++ ++ I +K + E ++C C + ++LCD C
Sbjct: 672 SLLECQVEAYETRKKAQPPDILKMKLRQGEND--VICSVCHYGGK------LILCDGCPS 723
Query: 659 EFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLP------EF 708
FH CL L E+P G WFC C C + + +E + + ++
Sbjct: 724 AFHANCLG------LEEVPDGDWFCESCCCGACGQFFLKATSKYAKEEKFISCKQCELKY 777
Query: 709 HLNAIK-KYAGNSLET----------------------------VSDIDVRWRLLSG--- 736
H + ++ AG+SL+T V + WRL+
Sbjct: 778 HPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELIGKPREVGVEKLTWRLVQSFEP 837
Query: 737 ---------KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 785
A E LS A+ + H+ F+P+ GRDL +++ R + F
Sbjct: 838 NMYGDDAYKIEAVAENHCKLSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFKRLNF 897
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
G Y +L N +V+ +R+ G++VAE+P + T + +G ++L +EK+L L
Sbjct: 898 SGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELEKVLIDLG 957
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKI 870
V+ +VLPA + W + FGF K+
Sbjct: 958 VERLVLPAVPCVLNTWINSFGFTKM 982
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 181/467 (38%), Gaps = 96/467 (20%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG-------QKLLEGYKNGLGIICHCCNSEVSPSQ 560
K + + D L DG V YY G +K++ G G+ C+CC++ V
Sbjct: 277 KKHTILTWLIDGGFLSDGETV-YYVPGDSGGAGKEKIVSGAVTRAGVHCNCCDAVVPLPV 335
Query: 561 FEAHAECASLSSIPQ------------GDWYCKYCQNMFERKRFLQHDANA-VEAG---- 603
FE HA + Q GD + Q ++ ++ A A V A
Sbjct: 336 FEVHAGRVPGTGQQQQQVAWEKLLLVSGDSLLQSMQEAWQNEKVRTFHAQAKVRAALEQE 395
Query: 604 ---------------RVSGVDSVEQITKRCIRIVKNLEAELS--GCLLCRGCDFSKSGFG 646
+ GV VE+I + +K E + S C +C D +
Sbjct: 396 EEKNSQAKRRLLAKHQKKGV-VVERIMSPRMEKIKAGEKDSSDDACGVC--ADGGE---- 448
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC------------------CMDCS 688
+L CD C FH CL E+P G W C C +C+
Sbjct: 449 ---LLCCDFCTSTFHPECLAI-------EVPDGSWSCHYCRCTLCMSNDDQDLSTCQECA 498
Query: 689 RINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-----VRWRLL--------- 734
L+ + + KK + E + ++ W LL
Sbjct: 499 CKYHESCRPLLGNGRDIGAYCGEICKKLSAKLSEVIGVMNSTEDGFSWSLLRIHEDEPAS 558
Query: 735 -SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCA 791
G A E + L+ A+ + + CF+P+ D + D++ VY G + + G Y
Sbjct: 559 SQGMPAVLERNVKLAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGSQFKRLSYEGFYTM 618
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
IL + +VS +LR+ G++VAE+P T +G + + +E++L+ ++V+++++
Sbjct: 619 ILEKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAVEQVLASVQVETLII 678
Query: 852 PAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
PA W F F+ +DP+L KR S LV GT+ML K V
Sbjct: 679 PAIASMVDTWKRSFSFRPVDPQLREEL-KRLS-LVVITGTTMLHKPV 723
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/460 (23%), Positives = 185/460 (40%), Gaps = 97/460 (21%)
Query: 516 VFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC---- 567
+ D L D +V Y +K++ G GI C CCN+ V + FE HA C
Sbjct: 378 LIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETHARCERPG 437
Query: 568 -----------ASLSSIPQGDWYCKYCQNMFERKRFL----QHDANAVEAGR-------- 604
LS Q W + M R++ + Q + +A R
Sbjct: 438 QPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEKSSQAKRKLAKTKKM 497
Query: 605 --VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 662
+ GV V + R ++ KN + C C + G +L CD C FH
Sbjct: 498 QLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG----QLLCCDTCPSTFHP 550
Query: 663 GCL------KKHKMADLRELP----KGKW----------------------------FCC 684
CL K + D ++L + W +C
Sbjct: 551 DCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRPLQSPGFEIGAYCS 610
Query: 685 MDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C +++S L +++ + E + L I+K + L T D+ V +L E
Sbjct: 611 ETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSEDMPV---IL-------E 657
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVV 800
+ + L+ A+ + ++CF+P+ D + D++ VY G + + G Y +L + ++
Sbjct: 658 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 717
Query: 801 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 860
S +LR G+++AE+P T +G + L +EK+L+ L+V+++V+PA +
Sbjct: 718 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 777
Query: 861 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
W F F+ + E+ +K LV GT++LQK + A
Sbjct: 778 WKRSFSFRPMQAEVRDEAKKLS--LVAITGTTLLQKPISA 815
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/460 (23%), Positives = 185/460 (40%), Gaps = 97/460 (21%)
Query: 516 VFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC---- 567
+ D L D +V Y +K++ G GI C CCN+ V + FE HA C
Sbjct: 380 LIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETHARCERPG 439
Query: 568 -----------ASLSSIPQGDWYCKYCQNMFERKRFL----QHDANAVEAGR-------- 604
LS Q W + M R++ + Q + +A R
Sbjct: 440 QPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEKSSQAKRKLAKTKKM 499
Query: 605 --VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 662
+ GV V + R ++ KN + C C + G +L CD C FH
Sbjct: 500 QLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG----QLLCCDTCPSTFHP 552
Query: 663 GCL------KKHKMADLRELP----KGKW----------------------------FCC 684
CL K + D ++L + W +C
Sbjct: 553 DCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRPLQSPGFEIGAYCS 612
Query: 685 MDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPE 742
C +++S L +++ + E + L I+K + L T D+ V +L E
Sbjct: 613 ETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSEDMPV---IL-------E 659
Query: 743 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVV 800
+ + L+ A+ + ++CF+P+ D + D++ VY G + + G Y +L + ++
Sbjct: 660 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 719
Query: 801 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 860
S +LR G+++AE+P T +G + L +EK+L+ L+V+++V+PA +
Sbjct: 720 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 779
Query: 861 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 900
W F F+ + E+ +K LV GT++LQK + A
Sbjct: 780 WKRSFSFRPMQAEVRDEAKKL--SLVAITGTTLLQKPISA 817
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 147/362 (40%), Gaps = 64/362 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVSGVD 609
CC+ PS F H C + P G WYC C F E+ +H+ + + +
Sbjct: 664 CCDG--CPSTF--HQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS------- 712
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
LS C LC C S P T+ C + G +
Sbjct: 713 -------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTVPGE 747
Query: 669 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDI 727
+ D FC C + LQ L + LPE F + ++++ S V+D
Sbjct: 748 RSTD--------SFCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVADC 796
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 785
D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N +F
Sbjct: 797 DISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDF 848
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
A+L +++ +R+ G ++AE+P + T + +G + L IE L L+
Sbjct: 849 SSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGSLK 908
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIG 904
V +V+PA E WT FGF + D E +I + L+ F G ML K + +I
Sbjct: 909 VDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKEKIT 965
Query: 905 SS 906
S
Sbjct: 966 DS 967
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/386 (23%), Positives = 165/386 (42%), Gaps = 74/386 (19%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFL 593
G+++L+G G GI C CCN+ ++ S+F+ HA GD + Q +F + L
Sbjct: 179 GEQILQGVLTGDGIWCSCCNTVITVSEFQLHA----------GDEPNRPYQRIFISETGL 228
Query: 594 QHDANAVEAGRVSGV-------------DSVEQITKRCIRIVK----------------- 623
EA G+ D ++ C+
Sbjct: 229 SLLTCQAEAWNQQGIPELQGYHLIEPREDVSDKYDDACVVCADGGNLICCDKCPSTYHIS 288
Query: 624 --NLEAELSG---CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK--MADLR-E 675
+E E G C C C F + ++ C QC++++H C ++++ + DL +
Sbjct: 289 CLQMEDEPQGEWRCPAC-ACKFCHTHAFDISVFTCSQCDKKYHWECFRENEGMLIDLNMD 347
Query: 676 LPKGKW-FCCMDCSRINSVLQNLL---VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRW 731
P FC CS+I L+ L+ + E L L + AG LE D
Sbjct: 348 GPSTSTPFCSSICSQIYEKLERLVGVRNELDEGLTWTLLRRMDPEAGVYLEESYD----- 402
Query: 732 RLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMY 789
R L ++ AVA+ +CF+P++D + +++ S++Y G N F G Y
Sbjct: 403 RTLCNSK--------IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFY 454
Query: 790 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 849
AIL +S +R+ G ++AE+P +AT + ++ L C + L++ V+ +
Sbjct: 455 TAILEKGDETISVASMRIHGNKLAEMPFIAT------RPSYRRLGMCHKLLVAIESVQYL 508
Query: 850 VLPAAEEAESIWTDKFGFKKIDPELL 875
V+P+ E+ W + +GF+ I+ +++
Sbjct: 509 VIPSIEQRVRRWEESYGFQAIENKVM 534
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 168/413 (40%), Gaps = 95/413 (23%)
Query: 542 KNGL----GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFER--KRFLQH 595
K+GL GI+C CCN +S + F+ HA G + C N+F + K F
Sbjct: 788 KDGLIKKEGIMCKCCNMVLSVTNFKNHA----------GFKQSRPCLNVFMKSGKPFTLC 837
Query: 596 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLCRGCDFSKSGFGPRTILLC 653
A A + R I++V+ + + + C LC G G ++ C
Sbjct: 838 QLQAWSA-------EYKTRKSRTIKVVRTADDDENDDSCGLC--------GDGGE-LICC 881
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPE-- 707
D C FH CL ELP+G W+C C C + + +++ A K +
Sbjct: 882 DNCPSTFHQACLST------EELPEGSWYCPNCTCWICGELVNDKEDINSSNAFKCSQCE 935
Query: 708 --FHLNAIKK------------YAGNSLETV--------------SDIDVRWRLL----- 734
+H + K + G S + V +D V W LL
Sbjct: 936 HKYHDSCWKNKTIGKGGASDTWFCGGSCQAVYFGLQSRVGIINHIAD-GVCWTLLKCIHE 994
Query: 735 -------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 783
A E L+ A+ I +CF +VD +G D+IP ++Y N R +
Sbjct: 995 DQKVHSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHVLY--NWRSEFARL 1052
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
F G Y +L + ++S +R+ G VAE+PL+AT +G + L IE++L
Sbjct: 1053 NFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLIATCSNYRRQGMCRRLMTAIEEMLIS 1112
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
+V+ +V+ A + WT+ FGF + + K L+ F GT +L+K
Sbjct: 1113 FKVEKLVVSAIPDLVETWTEGFGFTPMSNDEKQSLNK--INLMVFPGTILLKK 1163
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 151/380 (39%), Gaps = 67/380 (17%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVSGVD 609
CC+ PS F H C + P G WYC C F E+ HD +A+ +
Sbjct: 761 CCDG--CPSTF--HQSCLDIKKFPSGAWYCCNCSCKFCEKVEAAIHDTSALHS------- 809
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
LS C LC C S P T+ C + G +
Sbjct: 810 -------------------LSSCRLCEEKC----SNHYPHTLADHQACINQ--DGTVPGE 844
Query: 669 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDI 727
+ D FC C + LQ LL+ LPE F + ++++ S V+D
Sbjct: 845 RSTDS--------FCGKYCQELFEELQ-LLIGVKHPLPEGFSWSFLRRFELPS--EVADC 893
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEF 785
D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY N F
Sbjct: 894 DISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFWSNFHRLNF 945
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
A+L +++ +R+ G ++AE+P + T + +G + L IE L L+
Sbjct: 946 SSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGSLK 1005
Query: 846 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK-----RVPA 900
V +V+PA E WT FGF ++ + L+ F G ML K ++
Sbjct: 1006 VAKLVIPAVPELIDTWTSGFGFTPVNESEKKTIKNL--NLLVFPGVDMLGKSLVKEQITD 1063
Query: 901 CRIGSSSTDSTECVSGVEVG 920
+ SS+ DS + VE G
Sbjct: 1064 SIVSSSNVDSCLKLRNVEEG 1083
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 131/318 (41%), Gaps = 57/318 (17%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQN 696
+L CD C +H CL +ELP+G W+C C C S +++L+
Sbjct: 17 LLCCDNCPSTYHQTCLSD------QELPEGSWYCHNCTCRSCGNPLSEKEVSTFSAILKC 70
Query: 697 LLVQEA--------EKLPEFHLNAIKKYAGN-------------SLETVSDIDVRWRLL- 734
L ++ E LP + + G +E D ++ W +L
Sbjct: 71 LQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILK 130
Query: 735 ---SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 783
G+ A E L+ A+ I +CF +VD +G D+IP ++ N
Sbjct: 131 CNTDGRKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARL 188
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
++ G Y IL ++ +RV G + AELP +ATS +G + L IE +L
Sbjct: 189 DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRS 248
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
V+++VL A E + W FGFK I+ R L+ F GTS+L KR+
Sbjct: 249 FHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGITA 306
Query: 904 GSSSTD-STECVSGVEVG 920
S D VSG+ G
Sbjct: 307 AKSEEDKDAYNVSGLPNG 324
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 131/318 (41%), Gaps = 57/318 (17%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQN 696
+L CD C +H CL +ELP+G W+C C C S +++L+
Sbjct: 17 LLCCDNCPSTYHQTCLSD------QELPEGSWYCHNCTCRSCGNPLSEKEVSTFSAILKC 70
Query: 697 LLVQEA--------EKLPEFHLNAIKKYAGN-------------SLETVSDIDVRWRLL- 734
L ++ E LP + + G +E D ++ W +L
Sbjct: 71 LQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILK 130
Query: 735 ---------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 783
S K A E L+ A+ I +CF +VD +G D+IP ++ N
Sbjct: 131 CNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARL 188
Query: 784 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
++ G Y IL ++ +RV G + AELP +ATS +G + L IE +L
Sbjct: 189 DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRS 248
Query: 844 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 903
V+++VL A E + W FGFK I+ R L+ F GTS+L KR+
Sbjct: 249 FHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGITA 306
Query: 904 GSSSTD-STECVSGVEVG 920
S D VSG+ G
Sbjct: 307 AKSEEDKDAYNVSGLPNG 324
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 109/224 (48%), Gaps = 31/224 (13%)
Query: 653 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 712
C+QCER FH GC +A G +F C +S NL + AE +
Sbjct: 164 CEQCERCFHPGCCDDAILA-------GDFFFC------SSGCWNLFQRLAEMVA------ 204
Query: 713 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 772
++ + ++ W LL + LL++A+ + FDP++D + D +
Sbjct: 205 -------TVNPLGRSELSWSLLRRGRCDDK---LLAEALQLISSRFDPVLDCWTQLDYLD 254
Query: 773 SMVYGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 830
+MV+ R+ +F G Y A+L + VV +LR+ +AE+P +AT G+G
Sbjct: 255 AMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGIC 314
Query: 831 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 874
+ LF +E++L+ L V+++ L AA++ E +W + F F +D +L
Sbjct: 315 RSLFTAVEEMLARLGVETMALLAAKDTEKMWKNSFEFHAVDRKL 358
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 3/136 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
S + + KK +LK KK+ N P V L TG+LDGV V Y I + LRG+I+
Sbjct: 259 SGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKY---IAWSREELRGVIKG 315
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LC C CN +VI +FE HA + + + +I FENGK++ +++ RS+P ML
Sbjct: 316 SGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLF 375
Query: 322 ATLQSALSSLPEEKSF 337
+Q+ S +KSF
Sbjct: 376 EVIQTITGSPINQKSF 391
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 111/426 (26%), Positives = 171/426 (40%), Gaps = 88/426 (20%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFER--KRFLQHDANAVEA- 602
GI+C CCN +S ++F++HA G + C N+F K F A A
Sbjct: 150 GIMCKCCNMVLSVTKFKSHA----------GFKLNRPCSNLFMESGKPFTLCQLQAWSAE 199
Query: 603 --GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
R SG V R KN ++ C LC G G ++ CD C F
Sbjct: 200 YKSRKSGTQVV-----RADEDDKNDDS----CGLC--------GDGGE-LICCDNCPSTF 241
Query: 661 HVGCLKKHKMADLRELPKGKWFC----CMDCSRI-------NSV-----LQ-----NLLV 699
H CL +LP+G W+C C C + +SV LQ +
Sbjct: 242 HQACLCT------EDLPEGSWYCPNCTCWICGDLVNDKEASSSVGAYKCLQCEHKYHGAC 295
Query: 700 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVR------------WRLL------------S 735
Q+ ++ E ++ +G+ E S + R W LL
Sbjct: 296 QQGKQTHEGLVSDAWFCSGSCQEVYSGLHSRVGINNPIADGFCWTLLRCIHEDQKVLSAQ 355
Query: 736 GKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAIL 793
A E L+ A+ I +CF +VD +G D+IP +Y G + F G Y +L
Sbjct: 356 RLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHALYNWGSDFARLNFFGFYTVVL 415
Query: 794 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
+ +VSA +RV G VAE+PL+AT +G + L IE++L +V+ +V+ A
Sbjct: 416 EKDDVLVSAASVRVHGVTVAEMPLIATCSNYRRQGMCRHLMTAIEEMLISYKVEKLVISA 475
Query: 854 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTEC 913
+ WT FGF + + K + F GT +L+K++ + + +D +
Sbjct: 476 IPDLVETWTKGFGFIPVSKDEKQSLNK--INFMVFPGTILLKKQLYKTKEADTQSDWGDA 533
Query: 914 VSGVEV 919
EV
Sbjct: 534 APLTEV 539
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 3/147 (2%)
Query: 191 SLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+LI I + T KK E K+SKK+ N P V L TG+LDGV V Y I +
Sbjct: 429 ALISTAQITASGSETVSKKKEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKY---IAW 485
Query: 251 QASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR 310
LRGII+ G LC C CN +VI +FE HA + + + +I FENGK++ +++
Sbjct: 486 SREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQ 545
Query: 311 ACRSVPLPMLKATLQSALSSLPEEKSF 337
+S P L +Q+ S +KSF
Sbjct: 546 ELKSTPQNSLFDVIQTITGSPINQKSF 572
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y I LRGII+ G LC
Sbjct: 306 KNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---ISLSRKELRGIIKGSGYLCG 362
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +ICFENGK++ ++++ R+ P ML +Q+
Sbjct: 363 CQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTV 422
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 423 FGAPINQKSF 432
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 116/258 (44%), Gaps = 43/258 (16%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
++ CD C +H CL +LR SR+ + + F
Sbjct: 1056 LICCDNCPASYHQDCLPCQIYMNLR-------------SRVGIPIHTI--------DGFS 1094
Query: 710 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 769
++ + T +DI A E + L A++I +CF PI+D+ +G D
Sbjct: 1095 CTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGID 1144
Query: 770 LIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 825
+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT N
Sbjct: 1145 IIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1202
Query: 826 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS-- 883
+G + L IE++L L+V+ ++L A WT FGF ID + RK S
Sbjct: 1203 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRL 1258
Query: 884 QLVTFKGTSMLQKRVPAC 901
+LV+ GT +L++ + C
Sbjct: 1259 RLVSVPGTVLLKRNLYEC 1276
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 92.4 bits (228), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 78/136 (57%), Gaps = 3/136 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
+ + S KN ELK SKKI N P V L TGLLDGV+V Y+ + + LRG I+
Sbjct: 401 TTIDSASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKT--LRGTIKG 458
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LCSC +C G +V+ +FE HA + + + +I FENGK++ V++ ++ P ML
Sbjct: 459 TGYLCSCKVC-GNKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLF 517
Query: 322 ATLQSALSSLPEEKSF 337
+++ S +K+F
Sbjct: 518 NAIETVTGSAINQKNF 533
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +KK + N P V L TG+LDGV V Y + LRG
Sbjct: 302 VAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKY---VSVSREELRG 358
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 359 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 418
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 419 SLLFDTIQTVFGAPINQKAF 438
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 4/152 (2%)
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGI 804
L A+++ H+CF+P+ +S+S RDL+ +++ R L F G Y +L N ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 805 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 864
+RV+G++VAE+PLV T G +L +EK L L V+ +VLPA WT
Sbjct: 1039 VRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRS 1098
Query: 865 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
FGF K+ S + + F+G M QK
Sbjct: 1099 FGFAKMTNLERSQFLD--YTFLDFQGAIMCQK 1128
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 51/144 (35%), Gaps = 56/144 (38%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
I+C C PS F H C L IP GDW+C C +R + D
Sbjct: 654 ILCDKC-----PSSF--HKTCLGLEDIPNGDWFCPSCCCGICGQRKIDRD---------- 696
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 666
D VEQ+ L C QCE ++HV CL
Sbjct: 697 --DEVEQL------------------------------------LPCIQCEHKYHVRCL- 717
Query: 667 KHKMADLRELPKGKWFCCMDCSRI 690
++ AD+ G WFC DC ++
Sbjct: 718 ENGAADISTRYLGNWFCGKDCEKL 741
Score = 39.7 bits (91), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 53/140 (37%), Gaps = 33/140 (23%)
Query: 546 GIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG-- 603
GI C+CC S FE HA S + P + + +++ + + HD +E
Sbjct: 572 GIKCNCCMGIYSFVGFENHASGNS-TCRPSASIFLEDGRSLLDCLIKMMHDHKTMETSGK 630
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
SG+ VE +C C + ++LCD+C FH
Sbjct: 631 SFSGLSLVEND------------------YICSVCHYGGE------LILCDKCPSSFHKT 666
Query: 664 CLKKHKMADLRELPKGKWFC 683
CL L ++P G WFC
Sbjct: 667 CL------GLEDIPNGDWFC 680
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 86/172 (50%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYGQETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVTY---VSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +K + N P V L TG+LDGV V Y I LRG
Sbjct: 341 VAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKY---ISVSREELRG 397
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 458 SLLFDTIQTVFGAPIHQKAF 477
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y + LRGII+ G LC
Sbjct: 243 KNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---VSLSREELRGIIKGSGYLCG 299
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P ML +Q+
Sbjct: 300 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTV 359
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 360 FGAPINQKSF 369
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 82/159 (51%), Gaps = 3/159 (1%)
Query: 179 VTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLD 238
V Q E +++ + +A+ + + E+K +K + N P V L TG+LD
Sbjct: 251 VQQKEFDASDAHATASNTRVAKSKTESVSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLD 310
Query: 239 GVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYIC 298
GV V Y I LRG+I+ G LCSC CN +V+ +FE HA + + + +I
Sbjct: 311 GVPVKY---IALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIY 367
Query: 299 FENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
FENGK++ ++++ RS P ML +Q+ + +KSF
Sbjct: 368 FENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSF 406
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K E KMSKK + N P V L TG+LDGV V Y + L GII+ G LC
Sbjct: 265 KNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGYLCG 321
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +Q+
Sbjct: 322 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 381
Query: 328 LSSLPEEKSF 337
S +KSF
Sbjct: 382 TGSPINQKSF 391
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENG+++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 72/133 (54%), Gaps = 3/133 (2%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
++ K E KMSKK + N P V L TG+LDGV V Y + L GII+ G
Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGY 388
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +
Sbjct: 389 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAI 448
Query: 325 QSALSSLPEEKSF 337
Q+ S +KSF
Sbjct: 449 QTVTGSPINQKSF 461
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 428 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 485
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 546 GSTINQKNF 554
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/386 (23%), Positives = 151/386 (39%), Gaps = 67/386 (17%)
Query: 546 GIICHCCNSEVSPSQFEAHA---ECASLSSIP----------QGDWYCKYCQNMFERKRF 592
GI C CCN + ++FEAHA C +I Q D + K + + F
Sbjct: 800 GIQCDCCNKTFTSAEFEAHAGGKSCQPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFHF 859
Query: 593 LQHDANAV---------EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL--LCRGCDFS 641
+ D + G + DS + ++ + L C+ LC+ C
Sbjct: 860 IDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKFCGMV 919
Query: 642 KSGFGPR---------TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 692
R ++ C CE ++H C ++ + + P FC +C +
Sbjct: 920 GGNTCQRDGNMAAVSHALVTCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQELYE 977
Query: 693 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 752
LQ L + E F ++++ + SDI V SG + + ++ A+
Sbjct: 978 RLQMLFGVKQELEAGFSWTFVRRF-----DVSSDISV-----SGMSWKVDCNSKVAVALQ 1027
Query: 753 IFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 810
I +CF P+VD SG +LI ++VY G N + G + A+L +++A +R F
Sbjct: 1028 IMDECFVPMVDHKSGVNLIRNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF-- 1085
Query: 811 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
+P+ S ++ G L L V +V+PA E WT FGFK +
Sbjct: 1086 --YSMPVSFHSSLSMG--------------LCSLNVGKLVIPAISELTGTWTSVFGFKHL 1129
Query: 871 DPELLSIYRKRCSQLVTFKGTSMLQK 896
+ I R ++ F G MLQK
Sbjct: 1130 EGSDKQIMRNM--NMMVFPGVDMLQK 1153
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 72/128 (56%), Gaps = 20/128 (15%)
Query: 564 HAEC-ASLSSIPQGDWYCKYCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKR 617
H +C + S G+++C CQ +R+R + N G + S +++T R
Sbjct: 1002 HQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRRSM---VNRRSKGAAKTLLSKDRVTGR 1058
Query: 618 CIRIVKNLEAE-LSGCLLCRGCDFSKSGFGPRTILLCDQ----------CEREFHVGCLK 666
C R+++ EA L GC+ C+ DF+K+GFGP+T LLCDQ CERE+HVGCLK
Sbjct: 1059 CTRLLQVPEAVVLGGCVFCKSGDFAKTGFGPKTTLLCDQVSVGDMKVKGCEREYHVGCLK 1118
Query: 667 KHKMADLR 674
KH + DL+
Sbjct: 1119 KHGLEDLK 1126
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 828 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 887
G+ + L IE+LL LRV+ + LPAAE AE IW ++FGF+++ E + + + +V
Sbjct: 1477 GHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMV- 1535
Query: 888 FKGTSMLQKRVPACRI 903
F G+SML+K +P I
Sbjct: 1536 FTGSSMLEKDIPPLEI 1551
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 78/220 (35%), Gaps = 65/220 (29%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D EVGYY GQK L G K G GI+
Sbjct: 840 EAASPPVPSRESSGAN-------LHKALFLPGGLEDDIEVGYYVKGQKFLAGLKRGAGIL 892
Query: 549 CHCCNS------------------------EVSPSQFEAHAECASLSSIPQGDWYCKYCQ 584
C CC +S S FE HA S + Y
Sbjct: 893 CSCCQQVARNGMSSILMDSVVCGLTYEGERMISCSLFEQHAGWGSRRN--------PYTS 944
Query: 585 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 644
R L A ++ VEQ K+ +E L C+ C
Sbjct: 945 IYLADGRSLHDAAQSL---------VVEQTVKQEGNTPAKIE-HLDQCVEC--------- 985
Query: 645 FGPRTIL-LCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
G R L LC +C +H CL K + G++FC
Sbjct: 986 -GDRGDLQLCTRCPNAYHQDCLGK-----VDSYSSGEFFC 1019
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 673 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----- 727
+ELP+G+WFC DC I+S+L L+ E L + ++ + K LE D
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 728 -DVRWRLLSGKAATPETRLLLSQAVAIF 754
W+LL G+ P L++ V IF
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKALAEVVQIF 1289
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 56/146 (38%), Gaps = 53/146 (36%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
M K++ L + P + L ++GLLDG V Y+G + L GII++GG+LC CS C G
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLG--RGGHIMLTGIIQEGGVLCDCSSCKGV 634
Query: 275 ---------------------------------------------------RVIPPSKFE 283
+V+ S FE
Sbjct: 635 QVTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFE 694
Query: 284 IHACKQYRRASQYICFENGKSLLEVL 309
HA R S +I ENGK L ++L
Sbjct: 695 KHAGSSARHPSDFIFLENGKCLKDIL 720
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 6/146 (4%)
Query: 754 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 811
+CF PI+D +G +LI ++VY G N +F G Y IL +++A +R+ G +
Sbjct: 1 MDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTK 60
Query: 812 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI- 870
+AE+P + T + +G + L IE +LS L V+ +++PA E WT KFGF +
Sbjct: 61 LAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLE 120
Query: 871 DPELLSIYRKRCSQLVTFKGTSMLQK 896
D E + K S LV F GT +LQK
Sbjct: 121 DSEKQEV--KSISMLV-FPGTGLLQK 143
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 74/140 (52%), Gaps = 7/140 (5%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM-------GGIKFQASGLRG 257
++ K E KMSKK + N P V L TG+LDGV V Y+ G I L G
Sbjct: 443 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHG 502
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 503 IIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 562
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L +Q+ S +KSF
Sbjct: 563 SLLFBAIQTVTGSPINQKSF 582
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 417 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 474
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 475 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 534
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 535 QEMLFDAIQNVTGSPINQKNF 555
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 82/150 (54%), Gaps = 6/150 (4%)
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 807
AV I H+CF I++ + D+ +V+ R LR F G Y +L +VS G R+
Sbjct: 11 AVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGTFRI 70
Query: 808 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
GQ+ AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 71 CGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGF 130
Query: 868 KKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
+ + E L + + +++F+GT+M QK
Sbjct: 131 TVMSNSERLELAG---NSILSFQGTTMCQK 157
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 396 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 453
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 454 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 513
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 514 QEMLFDAIQNVTGSPINQKNF 534
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/396 (21%), Positives = 143/396 (36%), Gaps = 108/396 (27%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 610
CC+ PS F H C + P GDW+C YC F R+ G S
Sbjct: 931 CCDG--CPSTF--HKSCLDIKKFPSGDWHCAYCCCKF---------------CRLVGGSS 971
Query: 611 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 670
+ + F +L C CE +FH+ C++ +
Sbjct: 972 NQSVVN--------------------------DEFTMPALLTCHLCEEKFHISCVEAN-- 1003
Query: 671 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 730
+ K FC C ++ L+ LL + E F + I++ SD+
Sbjct: 1004 GGKTDDSKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWSFIRR---------SDVGCD 1054
Query: 731 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGM 788
L + + ++L + A++I ++CF P +D SG +L+ S++Y G N + ++ G
Sbjct: 1055 LSLTNPQLVECNSKLAV--ALSIMNECFMPYIDHRSGTNLLRSILYNCGSNFKRLDYSGF 1112
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE---------- 838
IL ++ +RV G +AE+P + T + +G + L IE
Sbjct: 1113 ITVILERGDEIICVASIRVHGNRLAEMPYIGTRYMYRRQGMCRRLLNAIESEAVYGLSVN 1172
Query: 839 ------------------------------------KLLSFLRVKSIVLPAAEEAESIWT 862
K LS L V+ +V+PA E WT
Sbjct: 1173 LGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDWGGPKALSSLDVELLVIPAISELRETWT 1232
Query: 863 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 898
FGF+ + I L+ F +LQK++
Sbjct: 1233 SVFGFEPLKQTSKQITNNM--NLLVFPHVDLLQKKI 1266
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 72/134 (53%), Gaps = 2/134 (1%)
Query: 204 LTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGG 263
+ S KN ELK SKK+ N P V L TGLLDGV V Y+ + + L GII+ G
Sbjct: 432 IDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT--LEGIIKGTG 489
Query: 264 ILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKAT 323
LC C C + + +FE HA + + + +I FENGK++ V++ ++ P +L
Sbjct: 490 YLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNA 549
Query: 324 LQSALSSLPEEKSF 337
+Q+ S +K+F
Sbjct: 550 IQTVTGSHINQKNF 563
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 86.3 bits (212), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 119/269 (44%), Gaps = 36/269 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKL 705
+L+CD+C FH C+ L P+G WFC C C + L
Sbjct: 423 LLMCDRCPSMFHHACVG------LESTPQGDWFCPACTCAICG-------------SSDL 463
Query: 706 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 765
+ + + +S R G+ E L A+ + +CF +++
Sbjct: 464 DDPPATTTTQGFSSDRMVISCEQCRRESRDGEE---EEHAKLCMALDVLRECFVTLIEPR 520
Query: 766 SGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 823
+ DL +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 521 TQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFA 580
Query: 824 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRKRC 882
+G +LL I+KLL + V+ +VLPA E + WT FG + E+ R+
Sbjct: 581 RRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIR----EMGQADRQDV 636
Query: 883 SQ--LVTFKGTSMLQKRVPA-CRIGSSST 908
+ ++ F+GT M K++P ++G ++T
Sbjct: 637 AHHAILRFQGTIMCHKQLPPQPQLGHTTT 665
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V+Y+ +A L+G+I I C C CNG + +
Sbjct: 403 LTKHPGNIRELLNTGLLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGSKAVSAY 461
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + S C+
Sbjct: 462 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCL 521
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYT 376
C P+ ++ LC C++SK+PQ +T +
Sbjct: 522 NCNE--PVLPSSQSE-NVLCQVCLESKQPQDPLTAS 554
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 64/124 (51%), Gaps = 21/124 (16%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNSEVSP----SQF--EAHAE-------------CASLSS 572
+ G++ ++GY I C+ CN VS S F AH C L
Sbjct: 583 SAGKRKVDGYIKDQRIYCNHCNRVVSLFSHLSYFFRLAHQHLKLMRVRDQGANPCVGLRK 642
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
+P +WYC C N+ ++++ L + NA AGR +GVDS+EQI KR IRIV + +L GC
Sbjct: 643 VP-SEWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGC 700
Query: 633 LLCR 636
LC+
Sbjct: 701 ALCK 704
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 429 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 486
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ ML +Q+
Sbjct: 487 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVT 546
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 547 GSTINQKNF 555
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 83/161 (51%), Gaps = 8/161 (4%)
Query: 182 SEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGL 236
SE G + + A S +T K ++ LK + + K+ P V L TG+
Sbjct: 279 SEADGVPELESSSFDVPASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGM 338
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDGV V Y+ + + LRGII+ G LC C CN +++ +FE HA + + + +
Sbjct: 339 LDGVPVKYVSVTREE---LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNH 395
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
I FENGK++ ++++ RS P +L T+Q+ + +KSF
Sbjct: 396 IYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPINQKSF 436
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 199 AEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGLLDGVSVVYMGGIKFQAS 253
A S +T K ++ LK + + K+ P V L TG+LDGV V Y+ + +
Sbjct: 302 ASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE-- 359
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
LRGII+ G LC C CN +++ +FE HA + + + +I FENGK++ ++++ R
Sbjct: 360 -LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELR 418
Query: 314 SVPLPMLKATLQSALSSLPEEKSF 337
S P +L T+Q+ + +KSF
Sbjct: 419 STPESLLFDTIQTIFGAPINQKSF 442
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 84.0 bits (206), Expect = 3e-13, Method: Composition-based stats.
Identities = 42/73 (57%), Positives = 51/73 (69%), Gaps = 1/73 (1%)
Query: 827 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 886
+GYFQ LF CIE+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y K +
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTT 61
Query: 887 TFKGTSMLQKRVP 899
F+GTS L K VP
Sbjct: 62 VFQGTSTLHKLVP 74
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK S N P V L TG+ DG+ V Y + L+G+I+ G LCSC
Sbjct: 386 KNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYC--TWSREKNLQGVIKGTGYLCSC 443
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
+C G + + +FE HA + + + +I FENGKS+ V++ ++ P ML +Q+
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 329 SSLPEEKSF 337
+ +++F
Sbjct: 504 GATINQRNF 512
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 117/478 (24%), Positives = 198/478 (41%), Gaps = 82/478 (17%)
Query: 410 KKTRKLLEADLVSKSSSKSVSL------RNLLKTRSPW--ELTRNSSRPGLIANSTPV-- 459
+KT++ +E D+ S+ S S S+ RN L ++ + +R+SSR +++P+
Sbjct: 423 RKTKRKIEKDMKSEEQSASDSVGKATFARNFLAIKNEVGNDDSRDSSRGTTSKSASPLHH 482
Query: 460 ---TSVHKSSQSQRQRKITKKSKKTVLISKPF--ENASPPLSFPNKSRWNITPKDQRLHK 514
S SS K +K + T+ + +P +N+ P+ + Q +
Sbjct: 483 QTEKSTGSSSHHVDGGKSSKHVRSTLSVRRPVRGDNSEGDGFVPSSEK-------QTILA 535
Query: 515 LVFDESGLPDGTEVGYYAC--GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSS 572
+ D L +V Y +LEG+ GI C CC+ +S S+FE HA S
Sbjct: 536 WLIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHA--GSKLR 593
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS-GVDSVEQITKRCIRIVKNLEAELSG 631
P + + ++F+ Q DA + G + G SV+ I +
Sbjct: 594 QPFQNIFLNTGVSLFQ----CQIDAWDKQKGAGNIGFCSVDVIAD---------DPNDDA 640
Query: 632 CLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCS 688
C +C G D ++ CD C FH CL + H M D L + + C +
Sbjct: 641 CGICGDGGD----------LVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVI 689
Query: 689 RINSVLQNLLVQEAEKL---PEFHLNA----------IKKYAGNSLETVSDIDVRWRLLS 735
I ++ V + K+ P + A +KKY G E + W L+
Sbjct: 690 GIAPIVHANSVGQLLKMLLRPRMQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVH 747
Query: 736 GKAATPETRL-----------LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 782
+ A + L L+ A+ + +CF PIVD SG +++ +++Y G N
Sbjct: 748 RECADSDLFLGEHPHIVENNSKLALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNR 807
Query: 783 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 840
FGG Y A+L VV++ +R G +AE+P + T + +G + LF+ IE +
Sbjct: 808 LNFGGFYTALLERGDEVVASASIRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESV 865
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN + + SKK++ N P V L TG+LDGV V Y+ + L+G+I+ G LC C
Sbjct: 339 KNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSR--EKNLKGLIKGAGYLCGC 396
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 397 QECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTVT 456
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 457 GSPINQKNF 465
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 82.4 bits (202), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%)
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGI 804
L A + H+CF +V+ + DL +V+ R LR F G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 805 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 864
LRV+G++VAELPLV T + +G LL +E LL V+ +VLPA E WT
Sbjct: 69 LRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQTWTGS 128
Query: 865 FGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 896
FGF+ + + L I + ++ F+GT+M K
Sbjct: 129 FGFQVMTQSQKLDIAQH---TIMCFQGTTMCHK 158
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 81.3 bits (199), Expect = 2e-12, Method: Composition-based stats.
Identities = 52/133 (39%), Positives = 74/133 (55%), Gaps = 10/133 (7%)
Query: 769 DLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 827
D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT K
Sbjct: 104 DDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKK 163
Query: 828 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CS 883
GYF LL + IE L V+ + P E IW++K G+ +LS +K
Sbjct: 164 GYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAH 218
Query: 884 QLVTFKGTSMLQK 896
LV F S++QK
Sbjct: 219 PLVMFANLSLVQK 231
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 1/134 (0%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
K+ + K +KK S N P V L TG+ DGV+V Y + + L+GII+ G LC C
Sbjct: 367 KSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR-EVRNLKGIIKGTGYLCGC 425
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN RV+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+
Sbjct: 426 GNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVT 485
Query: 329 SSLPEEKSFACVRC 342
S K+F +
Sbjct: 486 GSDINHKNFNTWKA 499
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ++K ++K S N P V L TG+LDGV V Y+ + + LRGII+ LC
Sbjct: 293 KNKQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE---LRGIIKGTTYLCG 349
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN + + +FE HA + + + +I FENGK++ ++++ RS P L T+Q+
Sbjct: 350 CQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTI 409
Query: 328 LSSLPEEKSF 337
+ +K+F
Sbjct: 410 FGAPINQKAF 419
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 63/250 (25%)
Query: 694 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 753
LQNLL + + PE+ +++ E V +D R E ++ A+++
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDKR----------VECNSKIAVALSL 1291
Query: 754 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR----- 806
+CF PIVD +G +LI ++VY G N +F G Y IL +++A +R
Sbjct: 1292 MDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLKEKN 1351
Query: 807 ------------------------------------VFGQEVAELPLVATSKINHGKGYF 830
+ G ++AE+P + T + +G
Sbjct: 1352 ILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQGMC 1411
Query: 831 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLV 886
+ L IE +LS L ++ +++PA E WT KFGF +D E+ S+ ++
Sbjct: 1412 RRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKSV------SML 1465
Query: 887 TFKGTSMLQK 896
F GT +LQK
Sbjct: 1466 VFPGTGLLQK 1475
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 103/254 (40%), Gaps = 38/254 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQC-EREFHVGCLKKHKMADLR-ELPKGK--WFCCMDC 687
C C G D G R ++LC C H GC + L E+ G +FC +C
Sbjct: 16 CTHCGGGDVEPEGR--RVLVLCSACFAAGTHTGCHEDVTGEPLSSEITHGDGLYFCGKEC 73
Query: 688 SRINSVLQNLLVQEAE--KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRL 745
R L+ + + PE + + Y + S ++ R+
Sbjct: 74 QRSYEALEAATGRRSRIRDEPEQYTFELVHYKQDDRTVRSAVETAMRM------------ 121
Query: 746 LLSQAVAIFHDCFDPIVDSISGRDLI---------PSMVYGRNLRGQEFGGMYCAILTVN 796
F F P++ +GRDL+ P G F AIL +
Sbjct: 122 --------FRTSFAPLIME-NGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFRLAILRMG 172
Query: 797 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 856
++++A LRVFG + AE+P V+T + + G+ + L +E LL V +V+P+ E
Sbjct: 173 GTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCLVIPSINE 232
Query: 857 AESIWTDKFGFKKI 870
+WT+KFGF KI
Sbjct: 233 LLPMWTNKFGFAKI 246
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/377 (21%), Positives = 140/377 (37%), Gaps = 90/377 (23%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVSGVD 609
CC+ PS F H C + P G WYC C F E+ +H+ + + +
Sbjct: 511 CCDG--CPSTF--HQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS------- 559
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
LS C LC E ++H C+ +
Sbjct: 560 -------------------LSSCRLC---------------------EEKYHQACINQDG 579
Query: 670 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDID 728
FC C + LQ L + LPE F + ++++ S V+D D
Sbjct: 580 TVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVADCD 634
Query: 729 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 786
+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N +F
Sbjct: 635 ISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDFS 686
Query: 787 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 846
A+L +++ +R+ G ++AE+P + T + +G + L IE +++
Sbjct: 687 SFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESFVAYF-- 744
Query: 847 KSIVLPAAEEAESIWT----------------DKFGFKKI-DPELLSIYRKRCSQLVTFK 889
+ L +E +W FGF + D E +I + L+ F
Sbjct: 745 SQMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTI---KNLNLLVFP 801
Query: 890 GTSMLQKRVPACRIGSS 906
G ML K + +I S
Sbjct: 802 GVDMLGKSLVKEKITDS 818
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + Q + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQRN-LKGMIKGTGYLCGCGNC 444
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 445 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 504
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 505 INHKNFNTWKA 515
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 74.3 bits (181), Expect = 3e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 46/71 (64%)
Query: 566 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 625
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 626 EAELSGCLLCR 636
E GC LC+
Sbjct: 64 ETGFGGCALCK 74
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 66/131 (50%), Gaps = 2/131 (1%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSR--ERNLKGMIKGTGYLCGCGNC 443
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 444 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 503
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 504 INHKNFNTWKA 514
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/378 (22%), Positives = 144/378 (38%), Gaps = 82/378 (21%)
Query: 551 CCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVSGVD 609
CC+ PS F H C + P G WYC C F E+ +H+ + + +
Sbjct: 664 CCDG--CPSTF--HQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS------- 712
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
LS C LC C S P T+ C + G +
Sbjct: 713 -------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTVPGE 747
Query: 669 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDI 727
+ D FC C + LQ L + LPE F + ++++ S V+D
Sbjct: 748 RSTD--------SFCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVADC 796
Query: 728 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 785
D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N +F
Sbjct: 797 DISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRLDF 848
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
A+L +++ +R+ G ++AE+P + T + +G + L IE +++
Sbjct: 849 SSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESFVAYF- 907
Query: 846 VKSIVLPAAEEAESIWT----------------DKFGFKKI-DPELLSIYRKRCSQLVTF 888
+ L +E +W FGF + D E +I + L+ F
Sbjct: 908 -SQMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTI---KNLNLLVF 963
Query: 889 KGTSMLQKRVPACRIGSS 906
G ML K + +I S
Sbjct: 964 PGVDMLGKSLVKEKITDS 981
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 52/82 (63%), Gaps = 4/82 (4%)
Query: 555 EVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 614
++ P F H C LS +P +WYC C+N+ ++++ L + NA AGR +GVDS+EQI
Sbjct: 31 KICPRSF--HPACVGLSKVPS-EWYCDNCRNLVQKEKALAENKNAKAAGRQAGVDSIEQI 87
Query: 615 TKRCIRIVKNLEAELSGCLLCR 636
KR IRIV + +L GC LC+
Sbjct: 88 MKRAIRIVP-ISDDLGGCALCK 108
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/60 (55%), Positives = 40/60 (66%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC 567
+D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E++ S FE HA C
Sbjct: 509 RDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHARC 568
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 12/157 (7%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL TGLL+G V + L GI +D G++C+C +C G +V+ S
Sbjct: 308 LTKPPRNAKELMATGLLEGHYVH----CSCRGEQLTGIFQDMGVVCNCRICKGTQVVSIS 363
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLPE-EKS 336
FE H+ S I ENGK+L ++L A + +L+A LQ A+ + K
Sbjct: 364 AFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRA-LQHAIGEIQGISKE 422
Query: 337 FACVRC---KGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
CV+C +G I+C G CV K P
Sbjct: 423 MTCVKCGKHEGGEFISCKGAKCSAAYHAECVGVKSPH 459
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 121/506 (23%), Positives = 194/506 (38%), Gaps = 113/506 (22%)
Query: 410 KKTRKLLEADL------VSKSSSKSVSLRNLLKTRSPW--ELTRNSSRPGLIANSTPVT- 460
+KT+ +E D+ S S K+ RN L ++ + +R+SS+ + +P+
Sbjct: 437 RKTKSKIEKDMKRELHSASDSDGKATFARNFLAIKNEVGNDDSRDSSQGTTSKSESPLHH 496
Query: 461 SVHKSSQSQRQR----KITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 516
KS+ S R K +K + T+L+ + N P ++ L
Sbjct: 497 QTEKSTGSSSHRVDGGKSSKHGRSTLLVRRSVRG-------DNSESDGFVPSSEKRTVLA 549
Query: 517 F-DESGLPDGTEVGYYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHAECASLSS 572
+ +SG +E Y ++ +LEG+ GI C CC+ ++ S+FE HA
Sbjct: 550 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHA------- 602
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
G + QN+F N+ AG + G SV+ I + C
Sbjct: 603 ---GSKLRQPFQNIF---------LNSGGAGNI-GFCSVDVIAD---------DPNDDAC 640
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRI 690
+C G G ++ CD C FH CL + H M D L + + C + I
Sbjct: 641 GIC--------GDGG-DLVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGI 690
Query: 691 ------NSVLQ------NLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDIDVRWRL---- 733
NSV Q L VQ K +L+ +KKY G E + W L
Sbjct: 691 APIVHANSVRQLLKMLLRLWVQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRE 748
Query: 734 -------LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG-------RDLIPSMVYG-- 777
LSG E L+ A+ + +CF PI+D SG R+ + +G
Sbjct: 749 CTNSDLSLSGHPHIVENNSKLALALTVMDECFLPIIDRRSGHCKKFCLRNFTTVIFFGIS 808
Query: 778 -------------RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 824
N FGG Y A+L +V++ +R G +AE+P + T +
Sbjct: 809 LCWFVCLYIAFRRSNFNRLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVY 868
Query: 825 HGKGYFQLLFACIEKLLSFLRVKSIV 850
+G + LF+ +E + S V +
Sbjct: 869 RHQGMCRRLFSVVESVSSTADVAKLT 894
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 448 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 505
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F +
Sbjct: 506 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQTWKA 563
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 61/113 (53%), Gaps = 2/113 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 473 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 530
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F
Sbjct: 531 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNF 583
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/307 (21%), Positives = 112/307 (36%), Gaps = 77/307 (25%)
Query: 543 NGLGIIC--HCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 600
+G ++C HC ++ H +C + +P GDWYC+ C
Sbjct: 762 DGGDLVCCDHCAST--------FHLDCLGIK-LPSGDWYCRSC----------------- 795
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCER 658
LCR C F K P +L C QC R
Sbjct: 796 ---------------------------------LCRFCGFPQEKPSSSPELLLSCLQCSR 822
Query: 659 EFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKY 716
++H C +P FC C +I L LL + F + + +
Sbjct: 823 KYHQTCSSGTGTDSGCTMPGTSIDCFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLVHCF 882
Query: 717 AGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 776
A + + + K A ++ L A + +CF P +D SG ++I ++ Y
Sbjct: 883 ANDQAMPNKNKE--------KLAQCNSKTAL--AFTVLDECFQPHIDDRSGINMIHNVAY 932
Query: 777 --GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 834
G + +F G Y IL V++A +R+ G ++AE+P + T + +G + L
Sbjct: 933 NCGSDFSRLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIGTRGMYRHQGMCRRLL 992
Query: 835 ACIEKLL 841
IE ++
Sbjct: 993 NGIESVI 999
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 65/133 (48%), Gaps = 10/133 (7%)
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR---------GQEFGGMYCAILTVNS 797
+ QA+ +F F P++ +GRDL+ + G G F G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMD-NGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRG 279
Query: 798 SVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEA 857
+VV+A LRVFG+ AELP VAT + G + L +E LL V +V+P+ +
Sbjct: 280 AVVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPL 339
Query: 858 ESIWTDKFGFKKI 870
+W KFGF +
Sbjct: 340 LPMWAAKFGFTPL 352
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 68/115 (59%), Gaps = 4/115 (3%)
Query: 742 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG--QEFGGMYCAILTVNSSV 799
E R+ LS A+ + H C++P+ DS +G D++P ++ G L G ++ GM+ A+L +
Sbjct: 420 ELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGPAA 479
Query: 800 VSAGILRVFGQEVAELPLVAT-SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 853
V+ + R FG ++AE+P++A ++ G +LL A +E+LL K+I PA
Sbjct: 480 VAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPA 533
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 107/279 (38%), Gaps = 81/279 (29%)
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPEFH 709
D+ ER+ H+ ++ P G W C C C + E+ +
Sbjct: 132 DKVERDCHL---------HIQMFPHGDWHCPNCTCKFCRAV-----------VEECSQTL 171
Query: 710 LNAIKKYAG--NSLE------------TVSDIDVRWRLLSGKAATPETRLLLSQA----V 751
+KKY G + LE T SD +RW TP QA +
Sbjct: 172 FEGVKKYVGVKHELEARFSWSLVHRECTDSDFILRW--------TPSYCGKQFQAGHSSL 223
Query: 752 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 811
+ +CF PI+D SG G YC + L+ G
Sbjct: 224 TVMDECFLPIIDRRSG-------------------GKYC----------TKCPLQFHGNR 254
Query: 812 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 871
+AE+ + T + +G + LF+ +E L L+V+ +V+PA + +W KFGFK ++
Sbjct: 255 LAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATADLSHVWISKFGFKYVE 314
Query: 872 PELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 910
L R L+ F G +LQK + A R S+ D+
Sbjct: 315 DSLKK--ELRSMNLLAFPGIDVLQKELLAPRHAKSAADT 351
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 807
A+ + H+ F I++ + RDL +V+ R LR F G Y + V
Sbjct: 11 ALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL--------------V 56
Query: 808 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
G++ AELPL+ T +G +LL +EKLLS L V+ ++LP + WT FGF
Sbjct: 57 CGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGSFGF 116
Query: 868 KKIDPELLSIYRKRCSQLVTFKGTSMLQK 896
++ ++ + +++F+GT+M QK
Sbjct: 117 TEMSYS--DRFQYAANIILSFQGTTMCQK 143
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
+LDGV V Y+ + + LRGII+ G LC C CN +VI +FE HA + + +
Sbjct: 1 MLDGVPVKYIAWSREKE--LRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNN 58
Query: 296 YICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
+I FENGK++ +++ +S P L +Q+ S +KSF
Sbjct: 59 HIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSF 100
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 806 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 865
R+ G ++AE+P + T I +G L IE LS L V+ +V+PA E ++ WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 866 GFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPA 900
GFK ++P R++ L + GT +L+KR+ A
Sbjct: 61 GFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLA 93
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 16/126 (12%)
Query: 784 EFGGMYCAILTVNSSVVSAGI------------LRVFGQEVAELPLVATSKINHGKGYFQ 831
+F G Y L + V A LR+ G +VAE+PLVAT+ +G Q
Sbjct: 62 DFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYRRQGMCQ 121
Query: 832 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKG 890
+L +EK+LS L V+ +VLPA E +W FGF ++ E L + R + F+G
Sbjct: 122 VLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLR---FPFLGFQG 178
Query: 891 TSMLQK 896
T+M QK
Sbjct: 179 TTMFQK 184
>gi|357510881|ref|XP_003625729.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
gi|355500744|gb|AES81947.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
Length = 290
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 184 GFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKK-PMTVTELFETGLLD 238
G G E+++ ++ E A S AL + +ELK SKKI+++KK P T+ ELF TGLLD
Sbjct: 203 GSGEETVTKLDQEGAAVESEIDGALAVRRNKMELKTSKKIAVDKKRPTTMKELFRTGLLD 262
Query: 239 GVSVVYMGGIKFQAS 253
GVSVVY+ GIK + S
Sbjct: 263 GVSVVYVSGIKKEES 277
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 71/148 (47%), Gaps = 31/148 (20%)
Query: 750 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFG 809
AV I H+CF I++ + D+ +V+ R + GQ+F
Sbjct: 774 AVDILHECFVTIIEPRTQSDISEDIVFNREICGQKF------------------------ 809
Query: 810 QEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK 869
AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 810 ---AELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGFTV 866
Query: 870 I-DPELLSIYRKRCSQLVTFKGTSMLQK 896
+ + E L + + +++F+GT+M QK
Sbjct: 867 MSNSERLEL---AGNSILSFQGTTMCQK 891
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 106/265 (40%), Gaps = 70/265 (26%)
Query: 673 LRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG--NSLE------ 722
++ P G W C C C +V++++ K +KKY G + LE
Sbjct: 136 IKMFPHGDWHCPNCT-CKFCRAVVEDVSQTVGAKCL---FEGVKKYVGVKHELEARFSWS 191
Query: 723 ------TVSDIDVRWRLLSGKAATPETRLLLSQA----VAIFHDCFDPIVDSISGRDLIP 772
T SD +RW TP QA + + +CF PI+D SG
Sbjct: 192 LVHRECTDSDFILRW--------TPSYCGKQFQAGHSSLTVMDECFLPIIDRRSG----- 238
Query: 773 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVF-GQEVAELPLVATSKINHGKGYFQ 831
G YC + L++F G +AE+ + T + +G +
Sbjct: 239 --------------GKYC----------TKCPLQLFHGNRLAEMQFIGTRHVYRHQGMCR 274
Query: 832 LLFACIE------KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL 885
LF+ +E K L L+V+ +V+PA + +W KFGFK ++ L R L
Sbjct: 275 RLFSVVESMSFDVKTLQNLKVELLVIPATADLSHVWISKFGFKYVEDSLKK--ELRSMNL 332
Query: 886 VTFKGTSMLQKRVPACRIGSSSTDS 910
+ F G +LQK + A R S+ D+
Sbjct: 333 LAFPGIDVLQKELLAPRHAKSAADT 357
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
L GII GG LC CS CN +V+ +FE HA + R + +I ENGK + +++ ++
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 315 VPLPMLKATLQSALSSLPEEKSF 337
PL M+ ++ S E+ F
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFF 89
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGI 804
++ AVA+ +CF+P++D + +++ S++Y G N F G Y AIL +S
Sbjct: 22 IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVAS 81
Query: 805 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 843
+R+ G ++AE+P +AT +G L IE + S
Sbjct: 82 MRIHGNKLAEMPFIATRPSYRRQGMCHKLLVAIESVSSL 120
>gi|154309635|ref|XP_001554151.1| hypothetical protein BC1G_07288 [Botryotinia fuckeliana B05.10]
Length = 765
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 80/198 (40%), Gaps = 58/198 (29%)
Query: 546 GIICHCCN--SEVSPSQ--------FEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQH 595
G +C C V+P++ H +C S+ IP+GDW+CK CQ +
Sbjct: 291 GPVCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARA 350
Query: 596 DANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 654
N GV DS ++I+ C +C+G D K P I+LC+
Sbjct: 351 AEN-------DGVTDSDDEIS----------------CAVCQGLDSEK----PNEIILCE 383
Query: 655 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP-------- 706
C+ H C ++ + P+G+W C C ++ V +LL +E + P
Sbjct: 384 NCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLGPISNEVPSI 434
Query: 707 ---EFHLNAIKKYAGNSL 721
E HL +++ N L
Sbjct: 435 EGFETHLKTMQRVLLNRL 452
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 123/322 (38%), Gaps = 89/322 (27%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 598
+G GI+C+CC +S S F AHA S PQ FL+ +
Sbjct: 882 DGMVTWEGIVCNCCKKNLSVSDFMAHAG----RSHPQS-----------SLGLFLESGKS 926
Query: 599 A----VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 654
VEA S E +++R + +EA C C G G +L CD
Sbjct: 927 YTLCLVEAW------SAESMSRRSNAWGRKVEAIDESDDTCGFC-----GDGGE-LLCCD 974
Query: 655 QCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQNLLVQE 701
C +H CL +ELP+G W+C C C S +++ + +
Sbjct: 975 NCPSTYHQACLSA------KELPEGSWYCHNCTCQVCGGPFSEKEVSTFSAIFKCFQCGD 1028
Query: 702 A--------EKLP-EFHLN--------------AIKKYAGNSLETVSDIDVRWRLL---- 734
A EKLP E ++ ++ + G + + D D+ W +L
Sbjct: 1029 AYHDTCIEQEKLPLEDQISQTWFCGKYCKEIFIGLRSHVGT--DNILDSDLSWSILRCNN 1086
Query: 735 SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 784
G+ A E + L+ A+ + +CF +VD +G D+IP ++Y G N +
Sbjct: 1087 DGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVD 1146
Query: 785 FGGMYCAILTVNSSVVSAGILR 806
+ G Y IL ++ +R
Sbjct: 1147 YQGFYTVILEKGDEILCVASIR 1168
>gi|347838360|emb|CCD52932.1| hypothetical protein [Botryotinia fuckeliana]
Length = 886
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 80/198 (40%), Gaps = 58/198 (29%)
Query: 546 GIICHCCN--SEVSPSQ--------FEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQH 595
G +C C V+P++ H +C S+ IP+GDW+CK CQ +
Sbjct: 291 GPVCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARA 350
Query: 596 DANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 654
N GV DS ++I+ C +C+G D K P I+LC+
Sbjct: 351 AEN-------DGVTDSDDEIS----------------CAVCQGLDSEK----PNEIILCE 383
Query: 655 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP-------- 706
C+ H C ++ + P+G+W C C ++ V +LL +E + P
Sbjct: 384 NCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLGPISNEVPSI 434
Query: 707 ---EFHLNAIKKYAGNSL 721
E HL +++ N L
Sbjct: 435 EGFETHLKTMQRVLLNRL 452
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 53/110 (48%), Gaps = 7/110 (6%)
Query: 575 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG-CL 633
+G C YCQ+ ER RF QH+ + + G + +V+ + ++ S C+
Sbjct: 186 KGILLCDYCQSTVERNRFGQHEELLI--CKDCGNKAHPSCLSYSAELVEQIRSDGSWQCI 243
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ C + P T+L CD C++ +H+ C + L ++P GKW C
Sbjct: 244 DCKACIICEGTGDPDTLLFCDACDKGYHMNCHE----PKLTQMPSGKWAC 289
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 64/137 (46%), Gaps = 13/137 (9%)
Query: 728 DVRWRLLSGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR- 778
++ W LL A+ E LS A+ + H+CF P+ + + D + +++ +
Sbjct: 29 NLTWTLLKSNHSSDHKPDASDIENYSKLSIALHVMHECFQPVEEPRTKGDFLKDVIFRKR 88
Query: 779 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 837
L F G Y +L + ++ +RV+G++VAE+PLV T G +L +
Sbjct: 89 SELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVAEIPLVGTRVQYRRLGMCGILMNVL 148
Query: 838 EKLL---SFLRVKSIVL 851
EK L SFL + V+
Sbjct: 149 EKNLKDYSFLDFQDTVM 165
>gi|224090647|ref|XP_002309044.1| predicted protein [Populus trichocarpa]
gi|222855020|gb|EEE92567.1| predicted protein [Populus trichocarpa]
Length = 467
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
KK ++K +KK+ N P V L TG+LDGV V Y+ Q LRG+I+ G LC
Sbjct: 378 KKKDDIKTAKKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWS--QEKELRGVIKGSGYLCG 435
Query: 268 CSLCNGCRVI 277
C CN +VI
Sbjct: 436 CQTCNFSKVI 445
>gi|302832357|ref|XP_002947743.1| hypothetical protein VOLCADRAFT_87924 [Volvox carteri f.
nagariensis]
gi|300267091|gb|EFJ51276.1| hypothetical protein VOLCADRAFT_87924 [Volvox carteri f.
nagariensis]
Length = 305
Score = 53.9 bits (128), Expect = 4e-04, Method: Composition-based stats.
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 11/118 (9%)
Query: 789 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 848
+ A+LT + VSA I+ V+G + AEL LVAT + +GY L + L + V+
Sbjct: 190 FAALLTEGTPAVSAAIVDVYGADAAELYLVATRTVLQRQGYGTSLVRQLSAELGKIGVRR 249
Query: 849 IVLPA---AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTF-----KGTSMLQKRV 898
+++ EE + +W DKFGFK + +Y CS TF KGT L +++
Sbjct: 250 LLVSVDDDDEENQRLWRDKFGFKSLSAS--ELYELGCS-FGTFSAPATKGTVFLVRKL 304
>gi|341902027|gb|EGT57962.1| hypothetical protein CAEBREN_23443 [Caenorhabditis brenneri]
Length = 642
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ R +R+VK+ E C+ CR C +I+ CD C+R FH C A
Sbjct: 556 EMPDRMVRVVKSYEW---NCIECRTCSICHKKDNEDSIVSCDWCDRAFHYLC------AG 606
Query: 673 LRELPKGKWFCCMDCS 688
LR +P+G W C + CS
Sbjct: 607 LRAMPRGMWMCQVYCS 622
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 541 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 595
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 596 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 638
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 514 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 568
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 569 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 611
>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
Length = 2062
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ +R + V+N + SGC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1767 MPQRMVGRVRNYNWQCSGCKCCIKC---RSNQRPGKMLFCEQCDRGYHIYCLG------L 1817
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1818 RTVPDGRWSCERCCVCMRC 1836
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 77/194 (39%), Gaps = 41/194 (21%)
Query: 549 CHCCNSEVSPSQF--------EAHAEC--ASLSSIPQGDWYCKYCQ--NMFERKRFLQHD 596
C C + +P Q H C LS IPQGDW+C C + RKR
Sbjct: 978 CRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQLSPRKR----T 1033
Query: 597 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 656
VE D+ E++ + + + C +C P ++LCD C
Sbjct: 1034 KAPVEVSSEEEDDN-EKVDEDGDEDEEEEDLNQEVCNICE---------SPGELILCDFC 1083
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 714
+ FH+ C+ DL+ LP+G W C C+ + N L K+ N I+
Sbjct: 1084 PKSFHLDCI------DLKRLPRGTWKCPPCVLGKKKNKRGSPPLT----KVKVRSRNNIR 1133
Query: 715 KYAGNSLETVSDID 728
KY L TV+D+D
Sbjct: 1134 KY---DLATVTDVD 1144
>gi|302855516|ref|XP_002959250.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
gi|300255380|gb|EFJ39692.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
Length = 358
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 108/249 (43%), Gaps = 17/249 (6%)
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 710
L CD C H+GC+ ++A+ ++P WF C C L+ AE+ P H
Sbjct: 123 LRCDTCGCWVHLGCVGV-EVAE--QVPPRPWFHCRACRSTYLRLE----AAAERNP--HP 173
Query: 711 NAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 770
++ + T SD + + TP P++ D
Sbjct: 174 SSASPTHALYVLTPSDHQAAMAVALNRGLTPAVAAGGGAGGRGPVGAVLPLLAQGFNEDA 233
Query: 771 IPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 830
I +G+ R + GG Y A+L V+A VFG + A+L L+AT+ + KG
Sbjct: 234 IRG--FGQPAREYD-GGKYSAVLLNRGQPVAAATFNVFGAD-AQLCLLATAVQHRLKGNG 289
Query: 831 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP-ELLSIYRKRCSQLVTFK 889
L A +E LL+ + V +++ + A +W + G++ + P E L ++R + + +
Sbjct: 290 SALVADLEALLADVGVSRLLVQSRGVALPLWLGRLGYRLVPPQEALQLHR---TLPIAYY 346
Query: 890 GTSMLQKRV 898
+++QK++
Sbjct: 347 DCALMQKQL 355
>gi|255079372|ref|XP_002503266.1| JmjN/JmjC protein [Micromonas sp. RCC299]
gi|226518532|gb|ACO64524.1| JmjN/JmjC protein [Micromonas sp. RCC299]
Length = 2663
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
+ + LE + +GC+ C G +S ++LCD C+R +H+ CL L ELP+G
Sbjct: 246 VAQQLEEQPAGCVNCGGTSHEES------MILCDGCDRGYHMYCLSP----PLDELPQGD 295
Query: 681 WFCCMDC 687
WFC DC
Sbjct: 296 WFCP-DC 301
>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
Length = 2002
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1731 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1781
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1782 RTVPDGRWSCERCCVCMRC 1800
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 52/121 (42%), Gaps = 34/121 (28%)
Query: 564 HAECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIV 622
H +C S+ IP GDW+CK CQ N + ++ +A+ DS ++I
Sbjct: 316 HQKCYSVPKIPDGDWFCKKCQRNRVAAEAARANENDALN-------DSDDEIK------- 361
Query: 623 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
C +CRG D K P I+LC+ C+ H C D+ + P+ +W
Sbjct: 362 ---------CAVCRGLDSKK----PNEIILCENCDYAVHQTC------GDIPKKPREEWL 402
Query: 683 C 683
C
Sbjct: 403 C 403
>gi|429328961|gb|AFZ80720.1| hypothetical protein BEWA_001270 [Babesia equi]
Length = 238
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 44/93 (47%), Gaps = 12/93 (12%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C +C GCD S T+L+CD C+R FH+ C K+ E+PKG WF C DCS
Sbjct: 75 CKICAGCD-DNSSKRDHTMLICDACDRSFHMECTKEK----YSEVPKGAWF-CDDCSICQ 128
Query: 692 SVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 724
L +E+ + L GN L TV
Sbjct: 129 ICDIKLTERESNNPTNYSLE------GNKLCTV 155
>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
Length = 2012
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1736 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1786
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1787 RTVPDGRWSCERCCFCMRC 1805
>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
Full=Protein enhancer of yellow 3
gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
Length = 2006
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1730 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1780
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1781 RTVPDGRWSCERCCFCMRC 1799
>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
Length = 2011
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1735 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1785
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1786 RTVPDGRWSCERCCFCMRC 1804
>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
Length = 1843
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1567 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1617
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C CM C
Sbjct: 1618 RTVPDGRWSCERCCFCMRC 1636
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
>gi|312380117|gb|EFR26201.1| hypothetical protein AND_07843 [Anopheles darlingi]
Length = 2310
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 20/95 (21%)
Query: 620 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 679
R V+ + + S C LC C+ + + ++ CDQC+R +H+ C LR LP G
Sbjct: 2030 RRVQQYKWQCSECKLCMKCNRKPAAIDSK-MVYCDQCDRGYHLAC------KGLRNLPDG 2082
Query: 680 KWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNA 712
+W C C CS+ + + PE H NA
Sbjct: 2083 RWHCSLCTICSQCGA-----------QTPEGHPNA 2106
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 53/142 (37%), Gaps = 43/142 (30%)
Query: 544 GLGIICHCCNSEVSPSQFEAHAECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
G I+CH C P + H EC LS + Q W+C C V
Sbjct: 1434 GQLILCHDC-----PIVY--HCECLDPPLSKLTQDHWFCPLC----------------VM 1470
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
+G DS E++ + E+ +C C ++ CD C + FH
Sbjct: 1471 DRTTNGADSEEEMGSN--------DGEIEHEDVCSRCRHGGE------LICCDTCPKAFH 1516
Query: 662 VGCLKKHKMADLRELPKGKWFC 683
+ C K LR++PKG W C
Sbjct: 1517 MECCK----PVLRKVPKGHWEC 1534
>gi|443689527|gb|ELT91900.1| hypothetical protein CAPTEDRAFT_216422 [Capitella teleta]
Length = 1564
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 24/111 (21%)
Query: 603 GRVSGVDSVEQIT------------KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 650
GRV V++V+ T + C++ K+ AE + C +CR K G + +
Sbjct: 1140 GRVRWVEAVKSCTTWSRLHLLMSVMESCMKWEKS--AENAKCKICR-----KKGEEEK-V 1191
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 701
LLCD C + FH+ CL+ L E+PKG+WFC R V N+ +E
Sbjct: 1192 LLCDDCNQPFHLYCLR----PALYEVPKGEWFCAACAPRTRRVKTNVNYRE 1238
>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
Length = 1982
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1718 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCL------GL 1768
Query: 674 RELPKGKWFC-----CMDC 687
R +P G+W C C+ C
Sbjct: 1769 RTVPDGRWSCERCCVCIRC 1787
>gi|312371268|gb|EFR19500.1| hypothetical protein AND_22323 [Anopheles darlingi]
Length = 1628
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 11/61 (18%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 686
AE C++CR P LLCD+C R H+ CLK L+E+P G WF CM
Sbjct: 1168 AERISCMICR------RKVDPDLTLLCDECNRACHIYCLK----PKLKEVPAGDWF-CMK 1216
Query: 687 C 687
C
Sbjct: 1217 C 1217
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 44/95 (46%), Gaps = 16/95 (16%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 767 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 822
Query: 679 GKWFC-----------CMDCSR-INSVLQNLLVQE 701
G W C C C R +++V QNL +E
Sbjct: 823 GGWKCKCYREEDLILQCRQCDRWMHAVCQNLSTEE 857
>gi|310792252|gb|EFQ27779.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 897
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 18/88 (20%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL+C D SK+G I+ CD C++ +H C K K +P+G W+C C+D +
Sbjct: 384 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYKVPK------VPRGDWYCNECLDQKQ 433
Query: 690 INSVLQNLLVQEAEKLPEF--HLNAIKK 715
+ + EA K+P F HL+ +K+
Sbjct: 434 SRAAAAD----EAVKIPNFQQHLSKLKR 457
>gi|321460287|gb|EFX71331.1| hypothetical protein DAPPUDRAFT_308933 [Daphnia pulex]
Length = 501
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 45/108 (41%), Gaps = 11/108 (10%)
Query: 579 YCKYCQNMFERKRFLQHDANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C + H V + GR SG S Q T I VK + C+ C
Sbjct: 392 YCDFCLGDATENKKSGHPEELVSCADCGR-SGHPSCLQFTANMIISVKQYRWQ---CIEC 447
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CLK L E P+G W C
Sbjct: 448 KCCSLCGNSDNDEQLLFCDDCDRGYHMYCLK----PPLSEPPEGSWSC 491
>gi|301614223|ref|XP_002936596.1| PREDICTED: hypothetical protein LOC100485119 [Xenopus (Silurana)
tropicalis]
Length = 1043
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 43/88 (48%), Gaps = 19/88 (21%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV-- 699
P ILLCD C+ +H CL+ M +P G+WFC C C +++ LQNL V
Sbjct: 834 PELILLCDSCDSGYHTACLRPPLML----IPDGEWFCPPCQHKLLCEKLDEQLQNLDVVL 889
Query: 700 ---QEAEKLPEFHLNAIKKYAGNSLETV 724
+ AE+ E + Y G SLE +
Sbjct: 890 KKKERAERRKERLV-----YVGISLENI 912
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 10/97 (10%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQC 656
EA VS + + C+ + + + + G CL C C+ +G +LLCD+C
Sbjct: 454 EASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNGGDEEKLLLCDEC 513
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRIN 691
+ +HV C+K L +P G W C C C R N
Sbjct: 514 DVSYHVYCMK----PPLESVPSGPWRCHWCSRCRRCN 546
>gi|44917000|dbj|BAD12142.1| unichrom [Hemicentrotus pulcherrimus]
Length = 1637
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1022 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1054
>gi|115696716|ref|XP_783470.2| PREDICTED: uncharacterized protein LOC578189 isoform 2
[Strongylocentrotus purpuratus]
gi|390342402|ref|XP_003725656.1| PREDICTED: uncharacterized protein LOC578189 isoform 1
[Strongylocentrotus purpuratus]
Length = 1640
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1025 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1057
>gi|348532584|ref|XP_003453786.1| PREDICTED: remodeling and spacing factor 1-like [Oreochromis
niloticus]
Length = 1596
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 12/68 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C S P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 1055 CKHCGLSNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKQLCDK 1107
Query: 690 INSVLQNL 697
+ LQNL
Sbjct: 1108 LEEQLQNL 1115
>gi|308488788|ref|XP_003106588.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
gi|308253938|gb|EFO97890.1| hypothetical protein CRE_15947 [Caenorhabditis remanei]
Length = 452
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 10/79 (12%)
Query: 617 RCIRIVKNLEAELS----GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
RCI + + A + C+ CR C I+ CD+C+R FH C+
Sbjct: 365 RCIEMPDRMAALVRTYEWSCVDCRVCSICNKPEKENEIVFCDKCDRGFHTFCVG------ 418
Query: 673 LRELPKGKWFCCMDCSRIN 691
L+ LP+G W C CS N
Sbjct: 419 LKSLPRGTWICDTYCSETN 437
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 64/162 (39%), Gaps = 28/162 (17%)
Query: 542 KNGLGIICHCCNSEVSPSQFEAHAEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 599
+ G I+C C P + H C L P+G W C +C+ K +Q +A
Sbjct: 386 QGGEIILCDTC-----PRAY--HLVCLEPELDKAPEGKWSCPHCE-----KEGIQWEAKD 433
Query: 600 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQCER 658
+ I++ +R+ E E + CR C K G +L CD C
Sbjct: 434 EDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVC---KDG---GELLCCDTCTS 487
Query: 659 EFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 698
+H+ CL L E+P G+W C C C I +Q +L
Sbjct: 488 SYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 524
>gi|71028052|ref|XP_763669.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350623|gb|EAN31386.1| hypothetical protein TP04_0034 [Theileria parva]
Length = 250
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV + C +C C+ S+SG RT+L+CD C+R FH+ C + E+P G
Sbjct: 83 IVIRYPWHCNSCKICVKCNDSESGVS-RTLLICDSCDRAFHMECTRNK----YTEVPSGN 137
Query: 681 WFC 683
WFC
Sbjct: 138 WFC 140
>gi|310792256|gb|EFQ27783.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 312
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 18/88 (20%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL+C D SK+G I+ CD C++ +H C K + ++P+G W+C C+D +
Sbjct: 186 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYK------VPKVPRGDWYCNECLDQKQ 235
Query: 690 INSVLQNLLVQEAEKLPEF--HLNAIKK 715
+ + EA K+P F HL+ +K+
Sbjct: 236 SRAAAAD----EAVKIPNFQQHLSKLKR 259
>gi|340381804|ref|XP_003389411.1| PREDICTED: hypothetical protein LOC100638610 [Amphimedon
queenslandica]
Length = 2366
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 580 CKYCQNMFERKRFLQHDA--NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 637
C +C E R Q++ + E G SG S + +K +V+ + AE CL C+
Sbjct: 198 CSFCLGTEENNRDKQYEQLLSCHECGN-SGHPSCLKYSKE---LVEFITAEPWLCLECKK 253
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C + + +L+CD C++ FH+ CL + LP+G+W C
Sbjct: 254 CIYCNASANADDLLICDACDKGFHMVCLD----PPISSLPEGRWVC 295
>gi|321456775|gb|EFX67875.1| hypothetical protein DAPPUDRAFT_229111 [Daphnia pulex]
Length = 1870
Score = 47.0 bits (110), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 10/56 (17%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+++GC +CR D + T+LLCD+C++ +H+ CL L E+P WFC
Sbjct: 137 DITGCQVCRNGDREE------TMLLCDRCDQGYHMDCLD----PPLDEVPIEDWFC 182
>gi|405964745|gb|EKC30194.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 387
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 6/66 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C+ C+ C + P T+L CD C++ +H+ C H+ A + + P+GKW C C D
Sbjct: 195 CIDCKTCTVCQDSGDPDTMLFCDACDKGYHMTC---HEPA-IEDKPQGKWECQTCQDTEP 250
Query: 690 INSVLQ 695
+++ ++
Sbjct: 251 VDASME 256
>gi|403224362|dbj|BAM42492.1| uncharacterized protein TOT_040000860 [Theileria orientalis strain
Shintoku]
Length = 254
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 36/71 (50%), Gaps = 9/71 (12%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C +C CD +K G T+++CD C+R FH+ C +L E+P G W+ C DC
Sbjct: 98 CKVCVNCDENKDG----TLIICDACDRGFHINCTDD----NLEEIPSGSWY-CHDCQYCK 148
Query: 692 SVLQNLLVQEA 702
+ L EA
Sbjct: 149 LCYKKLSDSEA 159
>gi|320163082|gb|EFW39981.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 660
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 8/74 (10%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C C+ C S +LLCD C+R +H+ C+ DL+ +P G+W C C C+
Sbjct: 226 CSNCKSCAMCNSTENETEMLLCDVCDRGYHIQCI------DLKTMPLGRWVCSLCNACTN 279
Query: 690 INSVLQNLLVQEAE 703
+S L ++ E
Sbjct: 280 CHSKLAGAILPRKE 293
>gi|347966735|ref|XP_001689318.2| AGAP001877-PA [Anopheles gambiae str. PEST]
gi|333469922|gb|EDO63223.2| AGAP001877-PA [Anopheles gambiae str. PEST]
Length = 2382
Score = 46.6 bits (109), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 16/96 (16%)
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
V+ + + S C LC C+ + + ++ CDQC+R +H+ C LR LP+G+W
Sbjct: 2197 VQQYKWQCSECKLCMKCNRQPAAIDSK-MVYCDQCDRGYHLAC------KGLRNLPEGRW 2249
Query: 682 FC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK 715
C C C + Q E P HL+A ++
Sbjct: 2250 HCNICTICGLCGA-------QTPEGHPNPHLSAQQR 2278
>gi|224120882|ref|XP_002318442.1| SET domain protein [Populus trichocarpa]
gi|222859115|gb|EEE96662.1| SET domain protein [Populus trichocarpa]
Length = 319
Score = 46.6 bits (109), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 27/197 (13%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
C+ SG P +LLCD+C++ FH+ CL+ +A +PKG WF C CS+
Sbjct: 7 CEKCGSGESPGELLLCDKCDKGFHLFCLRPILVA----VPKGSWF-CPSCSKQKMPNSFP 61
Query: 698 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 757
LVQ K+ +F I++ + + DI + + S + + R LL
Sbjct: 62 LVQ--TKIIDFF--RIQRSTESIQKLSQDIQKKRKRSSSLVVSKKRRKLLP--------- 108
Query: 758 FDPIVDSISGRDLIPSMVYGRNLRGQEFG-------GMYCAILTVNSSVVSAGILRVFGQ 810
F P D + + S+ G EF GM A +VN + G ++V +
Sbjct: 109 FSPSEDPEKRLEQMRSLATALTASGTEFSNELTYRPGM--APRSVNQPALEKGGMQVLSK 166
Query: 811 EVAELPLVATSKINHGK 827
E AE + +N G+
Sbjct: 167 EDAETLNLCKRMMNRGE 183
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 46.6 bits (109), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 56/143 (39%), Gaps = 27/143 (18%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
++C CN F + +LS +P+GDW C C+ R+ D + G S
Sbjct: 1105 LLCDECNQP-----FHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDYAELNGG--S 1157
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 666
D ++ + E+E +C CD + ++ C +C +H C
Sbjct: 1158 DSDEYDE--------TDSDESEAEHDEMCCMCD------DDQELVYCSRCPAAYHRECHD 1203
Query: 667 KHKMADLRELPKGKWFC--CMDC 687
LR P+GKW C C +C
Sbjct: 1204 ----PPLRNFPRGKWVCSACTNC 1222
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 10/57 (17%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
AE + C +CR K G + +LLCD+C + FH+ CL+ L +PKG W C
Sbjct: 1087 AENAKCKICR-----KKGDEDK-LLLCDECNQPFHLYCLR----PALSYVPKGDWMC 1133
>gi|390179037|ref|XP_001359546.3| GA15524 [Drosophila pseudoobscura pseudoobscura]
gi|388859687|gb|EAL28693.3| GA15524 [Drosophila pseudoobscura pseudoobscura]
Length = 2503
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 12/68 (17%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--C 684
A+++ C +CR + +LLCD C + +H+ CL L E+P G W+C C
Sbjct: 281 ADVTNCEICRSPERED------VMLLCDSCNQGYHMDCLD----PPLDEIPAGSWYCDEC 330
Query: 685 MDCSRINS 692
+D + NS
Sbjct: 331 IDSNDENS 338
>gi|255646689|gb|ACU23818.1| unknown [Glycine max]
Length = 296
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 6/86 (6%)
Query: 815 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 874
+PL+AT +G +LL + IE++L +V+ +V+ A + WT FGF +D
Sbjct: 1 MPLIATCSQYRRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDD-- 58
Query: 875 LSIYRKRCSQ--LVTFKGTSMLQKRV 898
I R+R ++ L+ F GT +L K +
Sbjct: 59 --IERQRLNKINLMVFPGTVLLVKSL 82
>gi|195174305|ref|XP_002027919.1| GL27102 [Drosophila persimilis]
gi|194115608|gb|EDW37651.1| GL27102 [Drosophila persimilis]
Length = 2142
Score = 46.2 bits (108), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ +R + V+N + +GC C C KS P +L C+QC+R +H+ CL L
Sbjct: 1854 MPQRMVGRVRNYNWQCAGCKCCIKC---KSSQRPGKMLYCEQCDRGYHIYCL------GL 1904
Query: 674 RELPKGKWFC--CMDCSRINSVLQNLLVQEAE 703
+ +P +W C C C R +V L Q +E
Sbjct: 1905 KTVPDERWSCERCCICMRCGAVKPEGLPQVSE 1936
>gi|345483315|ref|XP_003424790.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase UHRF1
[Nasonia vitripennis]
Length = 721
Score = 46.2 bits (108), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 14/90 (15%)
Query: 596 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLCRGCDFSKSGFGPRTILLC 653
D N ++ +S + + + C + L+ + GC +C G D P I+LC
Sbjct: 254 DRNENDSNFISNIRRI--VAPECSKCFDKLDVKCKECGCKVCGGKD------NPDQIILC 305
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFC 683
D+CE E+H+GCL L ++P+ W+C
Sbjct: 306 DECEDEYHIGCL----TPALPKVPEEDWYC 331
>gi|198471111|ref|XP_002133666.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
gi|198145773|gb|EDY72293.1| GA22685 [Drosophila pseudoobscura pseudoobscura]
Length = 2132
Score = 46.2 bits (108), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ +R + V+N + +GC C C KS P +L C+QC+R +H+ CL L
Sbjct: 1844 MPQRMVGRVRNYNWQCAGCKCCIKC---KSSQRPGKMLYCEQCDRGYHIYCL------GL 1894
Query: 674 RELPKGKWFC--CMDCSRINSVLQNLLVQEAE 703
+ +P +W C C C R +V L Q +E
Sbjct: 1895 KTVPDERWSCERCCICMRCGAVKPEGLPQVSE 1926
>gi|195567753|ref|XP_002107423.1| GD15571 [Drosophila simulans]
gi|194204830|gb|EDX18406.1| GD15571 [Drosophila simulans]
Length = 770
Score = 46.2 bits (108), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 43/91 (47%), Gaps = 11/91 (12%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 495 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLFCEQCDRGYHIYCL------GL 545
Query: 674 RELPKGKWFC--CMDCSRINSVLQNLLVQEA 702
+ +P G+W C C C R + L Q A
Sbjct: 546 KTVPDGRWSCERCCFCVRCGATKPEGLPQVA 576
>gi|196002948|ref|XP_002111341.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
gi|190585240|gb|EDV25308.1| hypothetical protein TRIADDRAFT_55230 [Trichoplax adhaerens]
Length = 879
Score = 46.2 bits (108), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
+ ++ E C+ C+ C + ++ P +L CD C++ FH+ CL L +P G+
Sbjct: 228 LTSRVKTEPWQCIECKTCSYCQNAGNPDNLLFCDACDKGFHMECLS----PPLTGMPSGR 283
Query: 681 WFC 683
W C
Sbjct: 284 WVC 286
>gi|367017276|ref|XP_003683136.1| hypothetical protein TDEL_0H00660 [Torulaspora delbrueckii]
gi|359750800|emb|CCE93925.1| hypothetical protein TDEL_0H00660 [Torulaspora delbrueckii]
Length = 770
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E+E GCL+C + P I+LCD C++ FH+ CL L +PKG+W C
Sbjct: 237 ESEDEGCLVCDRSN------KPTKIILCDACDKPFHLYCLS----PPLTSIPKGEWIC 284
>gi|195345773|ref|XP_002039443.1| GM22724 [Drosophila sechellia]
gi|194134669|gb|EDW56185.1| GM22724 [Drosophila sechellia]
Length = 1889
Score = 46.2 bits (108), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 36/70 (51%), Gaps = 9/70 (12%)
Query: 614 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1603 MPARMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLFCEQCDRGYHIYCLG------L 1653
Query: 674 RELPKGKWFC 683
+ +P G+W C
Sbjct: 1654 KTVPDGRWSC 1663
>gi|157138655|ref|XP_001657329.1| zinc finger protein [Aedes aegypti]
gi|108869464|gb|EAT33689.1| AAEL014034-PA [Aedes aegypti]
Length = 1526
Score = 46.2 bits (108), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 10/60 (16%)
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
N AE C++CR + G P LLCD C R H+ CLK L+++P+G W+C
Sbjct: 1102 NRSAERIACMICR-----RKGI-PEQTLLCDDCNRACHMYCLK----PKLKQVPEGDWYC 1151
>gi|195153054|ref|XP_002017445.1| GL21523 [Drosophila persimilis]
gi|194112502|gb|EDW34545.1| GL21523 [Drosophila persimilis]
Length = 2392
Score = 46.2 bits (108), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 35/68 (51%), Gaps = 12/68 (17%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--C 684
A+++ C +CR + +LLCD C + +H+ CL L E+P G W+C C
Sbjct: 258 ADVTNCEICRSPERED------VMLLCDSCNQGYHMDCLD----PPLDEIPAGSWYCDEC 307
Query: 685 MDCSRINS 692
+D + NS
Sbjct: 308 IDSNDENS 315
>gi|384251456|gb|EIE24934.1| hypothetical protein COCSUDRAFT_65601 [Coccomyxa subellipsoidea
C-169]
Length = 1754
Score = 46.2 bits (108), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 36/79 (45%), Gaps = 11/79 (13%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 690
GC C D R ILLCD C+ EFH C+ + L ++P G+WFC C R
Sbjct: 991 GCAACWSNDDH------RNILLCDGCDLEFHHYCV----VPPLPDIPSGEWFCPA-CVRA 1039
Query: 691 NSVLQNLLVQEAEKLPEFH 709
+S AE +P H
Sbjct: 1040 SSEAGQKPGSAAEGVPPPH 1058
>gi|157103793|ref|XP_001648133.1| hypothetical protein AaeL_AAEL003955 [Aedes aegypti]
gi|108880488|gb|EAT44713.1| AAEL003955-PA [Aedes aegypti]
Length = 1823
Score = 46.2 bits (108), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 12/62 (19%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
+ EL+ C +CR D S +LLCD C +H+ CL L E+P G W+C
Sbjct: 136 DHELTNCEVCRRPDREDS------MLLCDSCNLGYHMDCLNPP----LTEIPSGSWYC-- 183
Query: 686 DC 687
DC
Sbjct: 184 DC 185
>gi|391331997|ref|XP_003740425.1| PREDICTED: uncharacterized protein LOC100907650 [Metaseiulus
occidentalis]
Length = 1922
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSV 693
C F SG ++LLCD C++ +H C K + +P+G WFC C++ +N V
Sbjct: 1538 CQFCGSGENEESLLLCDACDKGYHTYCFKPEMV-----VPQGDWFCFECINSVNLNKV 1590
>gi|336370765|gb|EGN99105.1| hypothetical protein SERLA73DRAFT_160636 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1506
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
CL C+ C+ + IL CD C+R +H+ CL+ L+E P GKW C
Sbjct: 11 CLECKNCEVCREKGDDERILFCDFCDRGWHMDCLQ----PPLQESPPGKWHC 58
>gi|392897209|ref|NP_499819.3| Protein SET-16 [Caenorhabditis elegans]
gi|316891988|emb|CAB03348.3| Protein SET-16 [Caenorhabditis elegans]
Length = 2475
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 618 CIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
C+ + L + + G CL C C+ +G +LLCD+C+ +H+ C+K L
Sbjct: 456 CVTLHDKLNSAVVGRGWRCLDCTVCEGCGTGGDEANLLLCDECDVSYHIYCMKPL----L 511
Query: 674 RELPKGKWFC--CMDCSRIN 691
++P+G W C C C R N
Sbjct: 512 DKIPQGPWRCQWCSRCRRCN 531
>gi|24644293|ref|NP_649554.1| CG2926, isoform A [Drosophila melanogaster]
gi|7296722|gb|AAF52001.1| CG2926, isoform A [Drosophila melanogaster]
Length = 2296
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 12/84 (14%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C D +LLCD C + +H+ CL L E+P G W+C
Sbjct: 241 EEEVTNCEICESPDRED------VMLLCDSCNQGYHMDCLD----PPLYEIPAGSWYCDN 290
Query: 684 CMDCSRINSVLQNLLVQEAEKLPE 707
C+D + Q L + ++L E
Sbjct: 291 CIDSDDEDDNEQLELADDLDQLYE 314
>gi|429862948|gb|ELA37533.1| origin recognition complex subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 830
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 24/91 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC---- 687
C++C D SK+G IL CD C+R +H C ++ ++PKG W+C DC
Sbjct: 337 CVICSKPD-SKAG---NKILFCDGCDRCYHQKC------HNVPKVPKGDWYCD-DCVQQK 385
Query: 688 -SRINSVLQNLLVQEAEKLPEF--HLNAIKK 715
SR+ L E K+P F HLN++K+
Sbjct: 386 ESRV------LAAGEVAKIPNFAQHLNSMKR 410
>gi|353237075|emb|CCA69056.1| related to regulator Ustilago maydis 1 protein (Rum1)
[Piriformospora indica DSM 11827]
Length = 1735
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 6/53 (11%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 688
C+ K+ P +LLCD C+ FH+ CL L +PKG+WFC C+D S
Sbjct: 436 CEICKASEKPSEMLLCDGCDGGFHIFCLDPR----LPTVPKGQWFCHSCLDGS 484
>gi|345489239|ref|XP_001604683.2| PREDICTED: hypothetical protein LOC100121098 [Nasonia vitripennis]
Length = 4322
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 55/106 (51%), Gaps = 16/106 (15%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
EA+ C+ C SK+ P ILLCD+C++ +H CL+ M +P+G W+C
Sbjct: 2975 EADAVDEYACQKC--SKADH-PEWILLCDECDKGWHCSCLRPALML----IPEGDWYCPP 3027
Query: 684 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KK---YAGNSLETV 724
C S ++ + ++L +A L + H N + KK Y G SL+ V
Sbjct: 3028 CQHNSLVSKLQESLKTLDA--LTKKHENEVLRKKRLAYVGISLDNV 3071
>gi|443894534|dbj|GAC71882.1| DNA-binding protein jumonji/RBP2/SMCY [Pseudozyma antarctica T-34]
Length = 2474
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 12/88 (13%)
Query: 597 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC-RGCDFSKSGFGPRTILLCDQ 655
A + + R GV + NL AE C +C RG D GP +LLCD+
Sbjct: 503 ATPLASRRRKGVSPLVDTDLHARAATGNLGAEEQMCEICLRGED------GP-NMLLCDE 555
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFC 683
C R +H+ CL+ L +PK +WFC
Sbjct: 556 CNRGYHMYCLQP----PLTSIPKSQWFC 579
>gi|298708138|emb|CBJ30479.1| myst-related protein [Ectocarpus siliculosus]
Length = 1620
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 45/128 (35%), Gaps = 52/128 (40%)
Query: 567 CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLE 626
A +S +P+G WYC C+ E+KR
Sbjct: 533 AAEISKVPKGQWYCPDCR---EKKRVF--------------------------------- 556
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCD--QCEREFHVGCLKKHKMADLRELPKGKWFCC 684
GC C D P IL CD C E+H GCL L ++P KW+ C
Sbjct: 557 ----GCNGCMQNDR------PTEILQCDGPMCGLEYHYGCLDPP----LDKVPSSKWWYC 602
Query: 685 MDCSRINS 692
DC R ++
Sbjct: 603 PDCVRTDN 610
>gi|390334110|ref|XP_003723851.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
isoform 3 [Strongylocentrotus purpuratus]
Length = 2266
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 14/68 (20%)
Query: 617 RCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
RCI K++ + C +CR CD +K +LLCD C+R +H+ CLK + E
Sbjct: 2020 RCIAWDKSIMK--ASCKVCRRSCDEAK-------LLLCDWCDRGYHMYCLKPK----ITE 2066
Query: 676 LPKGKWFC 683
+P+G W+C
Sbjct: 2067 VPEGDWYC 2074
>gi|327276821|ref|XP_003223166.1| PREDICTED: histone acetyltransferase MYST4-like [Anolis
carolinensis]
Length = 2024
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C+ + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 249 KFCLELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 304
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 305 RMPKGMWIC 313
>gi|390334106|ref|XP_783177.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
isoform 4 [Strongylocentrotus purpuratus]
Length = 2272
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 14/68 (20%)
Query: 617 RCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
RCI K++ + C +CR CD +K +LLCD C+R +H+ CLK + E
Sbjct: 2026 RCIAWDKSIMK--ASCKVCRRSCDEAK-------LLLCDWCDRGYHMYCLKPK----ITE 2072
Query: 676 LPKGKWFC 683
+P+G W+C
Sbjct: 2073 VPEGDWYC 2080
>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus
siliculosus]
Length = 3661
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 69/152 (45%), Gaps = 27/152 (17%)
Query: 634 LCRGCDFSKSGFGPR----TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 689
CR C GP +L+CD C+ E+H+ CL+ L +PKG+W C + C+
Sbjct: 537 WCRKC------VGPERKNNKMLVCDTCDAEYHLKCLR------LSSVPKGQWLCPI-CTV 583
Query: 690 INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID--VRWRLLSGKAATPETRLLL 747
+ Q L + E + + ++E V ++ ++W LS + T ETR L
Sbjct: 584 MLRKGQTLFSHQT----EVEKAKLSQMPQPTVEVVDELKYLIKWSGLSYQFCTWETREEL 639
Query: 748 SQAVAI--FHDCFD--PIVDSISGRDLIPSMV 775
+ AI FH D P+ +S +L+ ++
Sbjct: 640 NNDGAIDRFHKLNDHPPLSPPMSEEELMRTLA 671
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 13/84 (15%)
Query: 542 KNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 601
KN ++C C++E H +C LSS+P+G W C C M + + L VE
Sbjct: 547 KNNKMLVCDTCDAEY-------HLKCLRLSSVPKGQWLCPICTVMLRKGQTLFSHQTEVE 599
Query: 602 AGRVSGVDSVEQITKRCIRIVKNL 625
++S Q+ + + +V L
Sbjct: 600 KAKLS------QMPQPTVEVVDEL 617
>gi|170048389|ref|XP_001852258.1| ATP-dependent chromatin assembly factor large subunit [Culex
quinquefasciatus]
gi|167870529|gb|EDS33912.1| ATP-dependent chromatin assembly factor large subunit [Culex
quinquefasciatus]
Length = 1487
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 31/57 (54%), Gaps = 10/57 (17%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
AE C++CR + G P LLCD C R H+ CLK L+++P+G WFC
Sbjct: 1041 AEKIACMICR-----RKG-DPSQTLLCDDCNRACHMYCLK----PKLKQVPEGDWFC 1087
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 34/149 (22%), Positives = 54/149 (36%), Gaps = 60/149 (40%)
Query: 540 GYKNGLGIICHCCNSEVSPSQFEAHAEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 597
G K ++C C+ H C + +IP GDW+C C+N +R
Sbjct: 1617 GDKEDQLLLCDGCDKGY-------HTYCFRPPMDNIPDGDWFCYECRNKATGQR------ 1663
Query: 598 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI-LLCDQC 656
C++C G +TI +LCDQC
Sbjct: 1664 ---------------------------------NCIVC-------GKPGNKTISVLCDQC 1683
Query: 657 EREFHVGCLKKHKMADLRELPKGKWFCCM 685
+ +H+ CL+ L ++P+GKW C +
Sbjct: 1684 PKAYHIECLQ----PPLAKVPRGKWLCVL 1708
>gi|40215869|gb|AAR82791.1| LD09942p [Drosophila melanogaster]
gi|51092193|gb|AAT94510.1| LD08594p [Drosophila melanogaster]
Length = 1489
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 12/84 (14%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C D +LLCD C + +H+ CL L E+P G W+C
Sbjct: 241 EEEVTNCEICESPDRED------VMLLCDSCNQGYHMDCLD----PPLYEIPAGSWYCDN 290
Query: 684 CMDCSRINSVLQNLLVQEAEKLPE 707
C+D + Q L + ++L E
Sbjct: 291 CIDSDDEDDNEQLELADDLDQLYE 314
>gi|410915068|ref|XP_003971009.1| PREDICTED: LOW QUALITY PROTEIN: remodeling and spacing factor 1-like
[Takifugu rubripes]
Length = 1495
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 29/56 (51%), Gaps = 9/56 (16%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNL 697
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQNL
Sbjct: 1017 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKQLCDKLQEQLQNL 1068
>gi|390334108|ref|XP_003723850.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
isoform 2 [Strongylocentrotus purpuratus]
Length = 2303
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 14/68 (20%)
Query: 617 RCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
RCI K++ + C +CR CD +K +LLCD C+R +H+ CLK + E
Sbjct: 2057 RCIAWDKSIMK--ASCKVCRRSCDEAK-------LLLCDWCDRGYHMYCLKPK----ITE 2103
Query: 676 LPKGKWFC 683
+P+G W+C
Sbjct: 2104 VPEGDWYC 2111
>gi|358336343|dbj|GAA54879.1| tyrosine-protein kinase BAZ1B, partial [Clonorchis sinensis]
Length = 1921
Score = 45.4 bits (106), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 38/74 (51%), Gaps = 14/74 (18%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
CIR K++E + C +CR KS +LLCD C R FH+ CL+ LR +P
Sbjct: 1414 CIRWEKSVED--ARCRICR----HKSDDD--NLLLCDGCNRAFHLYCLR----PPLRRVP 1461
Query: 678 KGKWFC--CMDCSR 689
G W+C C SR
Sbjct: 1462 AGDWYCPSCRPASR 1475
>gi|159468546|ref|XP_001692435.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278148|gb|EDP03913.1| predicted protein [Chlamydomonas reinhardtii]
Length = 271
Score = 45.4 bits (106), Expect = 0.16, Method: Composition-based stats.
Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 2/89 (2%)
Query: 786 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 845
GG Y A+L V+A V+G+E A+L ++AT+ KG+ L A +E LL L
Sbjct: 161 GGRYSAVLVAGRQPVAAASFNVWGRE-AQLCMLATATAARRKGHGAALVADVEGLLKRLG 219
Query: 846 VKSIVLPAAEEAESIWTD-KFGFKKIDPE 873
V+ +++ A A +W +FG+ + PE
Sbjct: 220 VQRMLVQARRVALPMWLGPRFGYSLVAPE 248
>gi|390350878|ref|XP_788653.3| PREDICTED: zinc finger protein DPF3-like [Strongylocentrotus
purpuratus]
Length = 418
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 49/113 (43%), Gaps = 11/113 (9%)
Query: 574 PQGDWYCKYCQNMFERKRFLQHDANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELS 630
P+ + YC +C + Q + + + GR SG + Q T I+ VK +
Sbjct: 309 PETNNYCDFCLGDATENKKTQTPEDLISCSDCGR-SGHPTCLQFTDTMIQKVKGYRWQ-- 365
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ C+ C + +L CD C+R +H+ CL ++ P+G W C
Sbjct: 366 -CIECKSCGLCGTSDNDDQLLFCDDCDRGYHMYCLN----PPMQAPPEGSWIC 413
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 66/180 (36%), Gaps = 38/180 (21%)
Query: 525 GTEVGYYACGQKLLEGYKNGLGII-CHCCNSEVSPSQFEAHAEC--ASLSSIPQGDWYCK 581
G GY Q E + G II C C P + H C + P+G W C
Sbjct: 351 GAADGYETDHQDYCEVCQQGGEIILCDTC-----PRAY--HMVCLDPDMDKAPEGKWSCP 403
Query: 582 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 641
+C+ K +Q +A + +I V + E E CR C
Sbjct: 404 HCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC--- 445
Query: 642 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 699
K G +L CD C +H+ CL L E+P G+W C C C + +Q +L
Sbjct: 446 KDGG---ELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKILT 497
>gi|312071355|ref|XP_003138570.1| F/Y-rich family protein [Loa loa]
Length = 1597
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 10/104 (9%)
Query: 600 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQ 655
VE V+ V + C+ + L + L CL C C+ G ++LCD+
Sbjct: 284 VEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDGHDESNLILCDE 343
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 697
C+ +H+ CL+ L +P G W C C C R ++ + N+
Sbjct: 344 CDISYHIYCLE----PPLERIPHGPWRCKWCSACRRCSNQISNI 383
>gi|426199317|gb|EKV49242.1| hypothetical protein AGABI2DRAFT_177299 [Agaricus bisporus var.
bisporus H97]
Length = 1474
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ C+ C+ IL CD C+R +H C++ + ELP+G+W+C
Sbjct: 107 CIECKNCELCGDKGDDERILFCDGCDRGWHFDCMQ----PPINELPEGEWYC 154
>gi|409078326|gb|EKM78689.1| hypothetical protein AGABI1DRAFT_107193 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1494
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ C+ C+ IL CD C+R +H C++ + ELP+G+W+C
Sbjct: 107 CIECKNCELCGDKGDDERILFCDGCDRGWHFDCMQ----PPINELPEGEWYC 154
>gi|412986170|emb|CCO17370.1| transcriptional regulatory protein RCO1 [Bathycoccus prasinos]
Length = 1762
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 7/129 (5%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 690
GC C G D + G G +LLCD C REFH CL + D E WFC + ++
Sbjct: 1170 GCQSC-GMDMHEGGKG--EVLLCDGCPREFHKSCL--NPPVDENEKLPDTWFCPICVAKK 1224
Query: 691 NSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQA 750
N++ + E F A G + + + +L K+ + E R+LLSQ
Sbjct: 1225 NAIETSKTSCEMITPTSFTRVAEGSQLGEAAKKGTMFTN--EMLKSKSLSKEARILLSQK 1282
Query: 751 VAIFHDCFD 759
+ F + FD
Sbjct: 1283 LKRFAERFD 1291
>gi|145207283|gb|AAH46124.2| RSF1 protein [Homo sapiens]
Length = 805
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 695 CKKCGLPN---HPELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 747
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 748 LEEQLQDLDV 757
>gi|297847406|ref|XP_002891584.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337426|gb|EFH67843.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 611
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+S + C+ C + + T+L+CD CE+ +H+ CL+ H +++ +PK +W C
Sbjct: 311 MSKAMTCQMCQGTINEV--ETVLICDACEKGYHLKCLQAH---NIKGVPKSEWHC 360
>gi|442617590|ref|NP_001262289.1| CG2926, isoform B [Drosophila melanogaster]
gi|440217099|gb|AGB95672.1| CG2926, isoform B [Drosophila melanogaster]
Length = 874
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 12/84 (14%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C D +LLCD C + +H+ CL L E+P G W+C
Sbjct: 241 EEEVTNCEICESPDRED------VMLLCDSCNQGYHMDCLDPP----LYEIPAGSWYCDN 290
Query: 684 CMDCSRINSVLQNLLVQEAEKLPE 707
C+D + Q L + ++L E
Sbjct: 291 CIDSDDEDDNEQLELADDLDQLYE 314
>gi|195132645|ref|XP_002010753.1| GI21713 [Drosophila mojavensis]
gi|193907541|gb|EDW06408.1| GI21713 [Drosophila mojavensis]
Length = 2287
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 14/80 (17%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ +R + V+N + + C C C +S P +L C+QC+R +H+ CL
Sbjct: 1991 EMPQRMVGRVRNYNWQCAECKCCIKC---RSSQQPGKMLYCEQCDRGYHIYCL------G 2041
Query: 673 LRELPKGKWFC-----CMDC 687
++ +P+G+W C CM C
Sbjct: 2042 IKTVPEGRWSCERCSICMRC 2061
>gi|395814781|ref|XP_003780919.1| PREDICTED: remodeling and spacing factor 1 [Otolemur garnettii]
Length = 1186
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 648 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 701
>gi|327282592|ref|XP_003226026.1| PREDICTED: LOW QUALITY PROTEIN: remodeling and spacing factor
1-like [Anolis carolinensis]
Length = 1436
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 19/88 (21%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV-- 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 901 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKMEEQLQDLDVVL 956
Query: 700 ---QEAEKLPEFHLNAIKKYAGNSLETV 724
+ AE+ E + Y G SLE +
Sbjct: 957 KKKERAERRKERLV-----YVGISLENI 979
>gi|10803733|gb|AAF61709.2|AF227948_1 HBV pX associated protein-8 [Homo sapiens]
Length = 1189
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 651 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 704
>gi|426369892|ref|XP_004051915.1| PREDICTED: remodeling and spacing factor 1 [Gorilla gorilla
gorilla]
Length = 1189
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 651 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 704
>gi|356568947|ref|XP_003552669.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Glycine
max]
Length = 334
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ G P +LLCD+C+R +H+ CL+ L +PKG WFC
Sbjct: 25 CEECGGGHSPSKLLLCDKCDRGYHLFCLR----PILPSVPKGSWFC 66
>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
Length = 2288
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 10/104 (9%)
Query: 600 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQ 655
VE V+ V + C+ + L + L CL C C+ G ++LCD+
Sbjct: 307 VEGTMVTCVTCSQSYHTYCVGLHDKLNSTLIKRGWRCLDCTVCEGCGDGHDESNLILCDE 366
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 697
C+ +H+ CL+ L +P G W C C C R ++ + N+
Sbjct: 367 CDISYHIYCLE----PPLERIPHGPWRCKWCSACRRCSNQISNI 406
>gi|349602979|gb|AEP98951.1| Remodeling and spacing factor 1-like protein, partial [Equus
caballus]
Length = 827
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 662 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 715
>gi|296088520|emb|CBI37511.3| unnamed protein product [Vitis vinifera]
Length = 156
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 747 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGI 804
LSQA+ + H+CF+PI++ + RDLI +VY + +F G Y L + V A
Sbjct: 95 LSQALNLMHECFEPIIEHHTKRDLIVDIVYNSVSKFKRLDFRGFYIMALQKDDEFVCAAT 154
Query: 805 LR 806
+R
Sbjct: 155 VR 156
>gi|255077476|ref|XP_002502377.1| ring PHD BROMO family protein [Micromonas sp. RCC299]
gi|226517642|gb|ACO63635.1| ring PHD BROMO family protein [Micromonas sp. RCC299]
Length = 415
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 46/90 (51%), Gaps = 10/90 (11%)
Query: 596 DANAVEAGRVSGVDSV-EQITKRCIRIVKNLEAELSGCLLCRG-CDFSKSGFGPRTILLC 653
DA VEA V G+D +T R ++ ++A+L C +C G D ++ G LLC
Sbjct: 200 DAPRVEADDVVGMDEYGNAVTAR--EYIECIDAKL--CEVCEGGADETQPGMSDAETLLC 255
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFC 683
D C+ +H+ CL+ L +P+G W C
Sbjct: 256 DGCDSAWHMACLRP----PLTTIPEGDWMC 281
>gi|432891342|ref|XP_004075552.1| PREDICTED: remodeling and spacing factor 1-like [Oryzias latipes]
Length = 1421
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 48/118 (40%), Gaps = 17/118 (14%)
Query: 585 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 644
N+ ++R L D GR G E + + K+ E C C C
Sbjct: 775 NIRTKRRKLNDDEAMASRGRKDGG---EGRSDSEVESCKSTEIPSEDC--CSHCGLPNH- 828
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNL 697
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQNL
Sbjct: 829 --PELILLCDSCDSGYHTACLRPPVML----IPDGEWFCPPCQHKLLCEKLEEQLQNL 880
>gi|15237940|ref|NP_197821.1| histone-lysine N-methyltransferase ATXR6 [Arabidopsis thaliana]
gi|75262758|sp|Q9FNE9.1|ATXR6_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR6; AltName:
Full=Protein SET DOMAIN GROUP 34; AltName:
Full=Trithorax-related protein 6; Short=TRX-related
protein 6
gi|10177093|dbj|BAB10399.1| unnamed protein product [Arabidopsis thaliana]
gi|45773812|gb|AAS76710.1| At5g24330 [Arabidopsis thaliana]
gi|46402470|gb|AAS92337.1| At5g24330 [Arabidopsis thaliana]
gi|332005911|gb|AED93294.1| histone-lysine N-methyltransferase ATXR6 [Arabidopsis thaliana]
Length = 349
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ SG P +LLCD+C++ FH+ CL+ L +PKG WFC
Sbjct: 35 CEECSSGKQPAKLLLCDKCDKGFHLFCLR----PILVSVPKGSWFC 76
>gi|119595439|gb|EAW75033.1| hepatitis B virus x associated protein, isoform CRA_a [Homo
sapiens]
Length = 1008
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 654 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 707
>gi|312376807|gb|EFR23792.1| hypothetical protein AND_12238 [Anopheles darlingi]
Length = 3049
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2425 QNCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2468
>gi|449484819|ref|XP_002189691.2| PREDICTED: remodeling and spacing factor 1 [Taeniopygia guttata]
Length = 1351
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 812 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 865
>gi|307169034|gb|EFN61879.1| Bromodomain adjacent to zinc finger domain protein 2B [Camponotus
floridanus]
Length = 2352
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 613 QITKRCIRI-VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 671
++ RC+ + N +L C F SG +LLCD C+R +H C +
Sbjct: 2043 KLRNRCVSLKATNQYKQLLTTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----P 2098
Query: 672 DLRELPKGKWFC--CMD 686
+ +P G W+C CM+
Sbjct: 2099 KMENIPDGDWYCHECMN 2115
>gi|410972585|ref|XP_003992739.1| PREDICTED: remodeling and spacing factor 1 [Felis catus]
Length = 1435
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 901 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 954
>gi|195048475|ref|XP_001992534.1| GH24152 [Drosophila grimshawi]
gi|193893375|gb|EDV92241.1| GH24152 [Drosophila grimshawi]
Length = 2464
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 14/80 (17%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ +R + V+N + + C C C +S P +L C+QC+R +H+ CL
Sbjct: 2100 EMPQRMVGRVRNYNWQCAECKCCIKC---RSSQQPGKMLYCEQCDRGYHIYCL------G 2150
Query: 673 LRELPKGKWFC-----CMDC 687
++ +P+G+W C CM C
Sbjct: 2151 VKTVPEGRWSCERCSICMRC 2170
>gi|297812613|ref|XP_002874190.1| hypothetical protein ARALYDRAFT_489297 [Arabidopsis lyrata subsp.
lyrata]
gi|297320027|gb|EFH50449.1| hypothetical protein ARALYDRAFT_489297 [Arabidopsis lyrata subsp.
lyrata]
Length = 349
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ SG P +LLCD+C++ FH+ CL+ L +PKG WFC
Sbjct: 35 CEECSSGKQPAKLLLCDKCDKGFHLFCLR----PILVSVPKGSWFC 76
>gi|281209959|gb|EFA84127.1| hypothetical protein PPL_03200 [Polysphondylium pallidum PN500]
Length = 1630
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
++ C CER FH CL L +LPKG WFC DC
Sbjct: 493 LITCQSCERVFHTECLD----TPLTQLPKGDWFCSNDC 526
>gi|412987717|emb|CCO20552.1| predicted protein [Bathycoccus prasinos]
Length = 2492
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 7/64 (10%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPR-TILLCDQCEREFHVGCLKKHKMADLRELPKG 679
I + LE E C+ C +SG T L+CD C++ FH CL L+++PKG
Sbjct: 309 IAEQLEIERMTAQSCKHC--GQSGHDDEETFLVCDGCDQGFHTYCLS----PPLQKVPKG 362
Query: 680 KWFC 683
KWFC
Sbjct: 363 KWFC 366
>gi|432090050|gb|ELK23650.1| Remodeling and spacing factor 1 [Myotis davidii]
Length = 1345
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 813 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 866
>gi|340376023|ref|XP_003386533.1| PREDICTED: zinc finger protein ubi-d4-like [Amphimedon
queenslandica]
Length = 402
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
C+ C+ C +L CD C+R +H+ CLK ++E P+G W C M
Sbjct: 352 CIECKSCSLCGKSDNDDQLLFCDDCDRGYHMYCLK----PPMKEAPEGSWSCGM 401
>gi|397565065|gb|EJK44459.1| hypothetical protein THAOC_36994, partial [Thalassiosira oceanica]
Length = 735
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 644 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 697
GFG +++CD CE H+ CL L+ +P GKWFC C+ C S + L
Sbjct: 303 GFGDADMIVCDGCESLLHLSCLD----PPLKRVPAGKWFCDDCLKCDSETSSMVEL 354
>gi|294658300|ref|XP_460630.2| DEHA2F06204p [Debaryomyces hansenii CBS767]
gi|202953027|emb|CAG88958.2| DEHA2F06204p [Debaryomyces hansenii CBS767]
Length = 846
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + CL+C D P LLCD C+ FH+ CL+ L +P G W+C
Sbjct: 189 EDDYDNCLICGKHD------NPSQTLLCDNCDNPFHLSCLE----PSLETVPSGSWYC 236
>gi|363729420|ref|XP_424124.3| PREDICTED: remodeling and spacing factor 1 [Gallus gallus]
Length = 1424
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 878 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 930
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 931 LEEQLQDLDV 940
>gi|218664475|ref|NP_001136306.1| remodeling and spacing factor 1 [Sus scrofa]
gi|213688843|gb|ACJ53937.1| remodeling and spacing factor 1 [Sus scrofa]
Length = 1431
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 894 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 947
>gi|14211816|gb|AAK57515.1|AF380176_1 HBV pX associated protein 8 large isoform [Homo sapiens]
Length = 1431
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 893 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 946
>gi|427788397|gb|JAA59650.1| Putative bromodomain adjacent to zinc finger domain protein 1a
[Rhipicephalus pulchellus]
Length = 1568
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 605 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 664
VSG SV Q+ + +++ + S +L C + P +LLCD C+R H+ C
Sbjct: 1129 VSGCTSVAQLFLHLSSLERSVAWDRS--VLKAYCRICRRRRDPERMLLCDGCDRGHHLYC 1186
Query: 665 LKKHKMADLRELPKGKWFC 683
LK L E+PKG W+C
Sbjct: 1187 LK----PPLEEIPKGDWYC 1201
>gi|402894781|ref|XP_003910524.1| PREDICTED: remodeling and spacing factor 1 [Papio anubis]
Length = 1403
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 865 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 918
>gi|367027844|ref|XP_003663206.1| hypothetical protein MYCTH_2304831 [Myceliophthora thermophila ATCC
42464]
gi|347010475|gb|AEO57961.1| hypothetical protein MYCTH_2304831 [Myceliophthora thermophila ATCC
42464]
Length = 637
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 7/55 (12%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
L R C S +LLCD C+ +H C+ DL E+P+G WF CM+C
Sbjct: 141 LPGRHCPVCNSAGDEEVLLLCDGCDAAYHTHCI------DLDEVPRGPWF-CMEC 188
>gi|427788395|gb|JAA59649.1| Putative atp-dependent chromatin assembly factor large subunit
[Rhipicephalus pulchellus]
Length = 1568
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 605 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 664
VSG SV Q+ + +++ + S +L C + P +LLCD C+R H+ C
Sbjct: 1129 VSGCTSVAQLFLHLSSLERSVAWDRS--VLKAYCRICRRRRDPERMLLCDGCDRGHHLYC 1186
Query: 665 LKKHKMADLRELPKGKWFC 683
LK L E+PKG W+C
Sbjct: 1187 LK----PPLEEIPKGDWYC 1201
>gi|344293713|ref|XP_003418565.1| PREDICTED: remodeling and spacing factor 1 [Loxodonta africana]
Length = 1439
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 902 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 955
>gi|334327753|ref|XP_001377733.2| PREDICTED: remodeling and spacing factor 1-like [Monodelphis
domestica]
Length = 1659
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 1115 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 1167
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 1168 LEEQLQDLDV 1177
>gi|162287269|ref|NP_001104639.1| zinc finger protein DPF3 [Danio rerio]
gi|215275221|sp|A9LMC0.1|DPF3_DANRE RecName: Full=Zinc finger protein DPF3
gi|159906528|gb|ABX10892.1| D4 zinc and double PHD fingers family 3 protein [Danio rerio]
Length = 391
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 73/189 (38%), Gaps = 39/189 (20%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVS------------PSQFEAHAECASLSSIPQG---- 576
CG++ YKN G+ H ++ ++ P H E P G
Sbjct: 217 CGKR----YKNRPGLSYHYAHTHLAEEEGEEERETEIPQSPPVHHENHKPQKAPDGSIIP 272
Query: 577 DWYCKYCQNMFERKRFLQHDANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 633
+ YC +C R V + GR SG S Q T ++ V+ + + C+
Sbjct: 273 NDYCDFCLGDSGSNRKTGQAEELVSCSDCGR-SGHPSCLQFTDNMMQAVRTYQWQ---CI 328
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 693
C+ C + +L CD C+R +H+ CLK + + P+G W C +
Sbjct: 329 ECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLK----PPMTQPPEGSWSC--------HL 376
Query: 694 LQNLLVQEA 702
QNLL +A
Sbjct: 377 CQNLLKDKA 385
>gi|149719282|ref|XP_001493263.1| PREDICTED: remodeling and spacing factor 1 [Equus caballus]
Length = 1436
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 900 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 953
>gi|119595440|gb|EAW75034.1| hepatitis B virus x associated protein, isoform CRA_b [Homo
sapiens]
Length = 1440
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 902 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 955
>gi|109108037|ref|XP_001091378.1| PREDICTED: remodeling and spacing factor 1 isoform 2 [Macaca
mulatta]
Length = 1441
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|38788333|ref|NP_057662.3| remodeling and spacing factor 1 [Homo sapiens]
gi|251757329|sp|Q96T23.2|RSF1_HUMAN RecName: Full=Remodeling and spacing factor 1; Short=Rsf-1;
AltName: Full=HBV pX-associated protein 8; AltName:
Full=Hepatitis B virus X-associated protein; AltName:
Full=p325 subunit of RSF chromatin-remodeling complex
gi|151554975|gb|AAI48327.1| Remodeling and spacing factor 1 [synthetic construct]
gi|157169674|gb|AAI53211.1| Remodeling and spacing factor 1 [synthetic construct]
gi|261858180|dbj|BAI45612.1| remodeling and spacing factor 1 [synthetic construct]
Length = 1441
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|7022417|dbj|BAA91591.1| unnamed protein product [Homo sapiens]
Length = 572
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 407 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 460
>gi|301761792|ref|XP_002916323.1| PREDICTED: remodeling and spacing factor 1-like [Ailuropoda
melanoleuca]
Length = 1454
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 920 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 973
>gi|426252240|ref|XP_004019822.1| PREDICTED: remodeling and spacing factor 1 [Ovis aries]
Length = 1458
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 922 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 975
>gi|380796877|gb|AFE70314.1| remodeling and spacing factor 1, partial [Macaca mulatta]
Length = 1437
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 899 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 952
>gi|355566895|gb|EHH23274.1| hypothetical protein EGK_06710 [Macaca mulatta]
Length = 1441
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|124486905|ref|NP_001074736.1| remodeling and spacing factor 1 [Mus musculus]
Length = 1441
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 906 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 959
>gi|417413831|gb|JAA53227.1| Putative microtubule-associated protein futsch, partial [Desmodus
rotundus]
Length = 1421
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 898 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 951
>gi|410352577|gb|JAA42892.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1437
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 900 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 953
>gi|403287731|ref|XP_003935087.1| PREDICTED: remodeling and spacing factor 1 [Saimiri boliviensis
boliviensis]
Length = 1439
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|281341060|gb|EFB16644.1| hypothetical protein PANDA_004375 [Ailuropoda melanoleuca]
Length = 1433
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 899 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 952
>gi|417413842|gb|JAA53231.1| Putative microtubule-associated protein futsch, partial [Desmodus
rotundus]
Length = 1433
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 898 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 951
>gi|351698363|gb|EHB01282.1| Remodeling and spacing factor 1 [Heterocephalus glaber]
Length = 1433
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 897 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 950
>gi|332837299|ref|XP_001175061.2| PREDICTED: remodeling and spacing factor 1 isoform 2 [Pan
troglodytes]
gi|410297798|gb|JAA27499.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1440
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|76663102|ref|XP_869632.1| PREDICTED: remodeling and spacing factor 1 isoform 2 [Bos taurus]
gi|297491749|ref|XP_002699109.1| PREDICTED: remodeling and spacing factor 1 [Bos taurus]
gi|296471934|tpg|DAA14049.1| TPA: remodeling and spacing factor 1 [Bos taurus]
Length = 1429
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 893 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 946
>gi|449456407|ref|XP_004145941.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Cucumis
sativus]
gi|449526858|ref|XP_004170430.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Cucumis
sativus]
Length = 342
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 7/51 (13%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C SG P +LLCD+C+R +H+ CL L +PKG WFC
Sbjct: 28 IFCQKC---GSGDSPADLLLCDKCDRGYHLFCL----TPILPSVPKGTWFC 71
>gi|296216988|ref|XP_002754838.1| PREDICTED: remodeling and spacing factor 1 [Callithrix jacchus]
Length = 1438
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 902 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 955
>gi|148684364|gb|EDL16311.1| mCG124268, isoform CRA_a [Mus musculus]
Length = 1396
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 861 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 914
>gi|119595441|gb|EAW75035.1| hepatitis B virus x associated protein, isoform CRA_c [Homo
sapiens]
Length = 1428
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 890 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 943
>gi|443693752|gb|ELT95039.1| hypothetical protein CAPTEDRAFT_19718 [Capitella teleta]
Length = 421
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 49/108 (45%), Gaps = 11/108 (10%)
Query: 579 YCKYC--QNMFERKRFLQHD-ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C ++ +K ++ + + + GR SG S Q T I VK + C+ C
Sbjct: 305 YCDFCLGDDIENKKTGMKEELVSCADCGR-SGHPSCLQFTANMIISVKKYPWQ---CIEC 360
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL L E P+G W C
Sbjct: 361 KSCGLCGTSDNDDQLLFCDDCDRGYHMYCLN----PPLSEPPEGSWSC 404
>gi|410352579|gb|JAA42893.1| remodeling and spacing factor 1 [Pan troglodytes]
Length = 1440
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|397473367|ref|XP_003808185.1| PREDICTED: remodeling and spacing factor 1 [Pan paniscus]
Length = 1440
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|355717434|gb|AES05931.1| remodeling and spacing factor 1 [Mustela putorius furo]
Length = 1419
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 885 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 938
>gi|332211223|ref|XP_003254719.1| PREDICTED: remodeling and spacing factor 1 [Nomascus leucogenys]
Length = 1440
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|297689801|ref|XP_002822324.1| PREDICTED: remodeling and spacing factor 1 [Pongo abelii]
Length = 1441
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 903 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 956
>gi|431838462|gb|ELK00394.1| Remodeling and spacing factor 1 [Pteropus alecto]
Length = 1401
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 864 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 917
>gi|74183063|dbj|BAE20484.1| unnamed protein product [Mus musculus]
Length = 980
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 882 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 934
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 935 LEEQLQDLDV 944
>gi|73987867|ref|XP_534002.2| PREDICTED: remodeling and spacing factor 1 isoform 1 [Canis lupus
familiaris]
Length = 1434
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 900 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 953
>gi|150864953|ref|XP_001383979.2| conserved hypothetical protein DNA-binding protein
jumonji/RBP2/SMCY, contains JmjC domain [Scheffersomyces
stipitis CBS 6054]
gi|149386209|gb|ABN65950.2| conserved hypothetical protein DNA-binding protein
jumonji/RBP2/SMCY, contains JmjC domain [Scheffersomyces
stipitis CBS 6054]
Length = 844
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E E CL+C D P LLCD C+ +H+ CL L ++P G W+C
Sbjct: 185 EDEFDNCLICGKHD------RPSRTLLCDNCDNPYHMDCLP----TPLDDIPNGNWYC 232
>gi|74144745|dbj|BAE27351.1| unnamed protein product [Mus musculus]
Length = 998
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 900 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 952
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 953 LEEQLQDLDV 962
>gi|348565583|ref|XP_003468582.1| PREDICTED: remodeling and spacing factor 1-like [Cavia porcellus]
Length = 1433
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 899 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 952
>gi|344244350|gb|EGW00454.1| Remodeling and spacing factor 1 [Cricetulus griseus]
Length = 1308
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 827 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 880
>gi|9454550|gb|AAF87873.1|AC012561_6 Hypothetical protein [Arabidopsis thaliana]
Length = 620
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 684
+S + C+ C + + T+L+CD CE+ +H+ CL H +++ +PK +W C
Sbjct: 309 MSKAMTCQICQGTINEI--ETVLICDACEKGYHLKCLHAH---NIKGVPKSEWHCS 359
>gi|291384198|ref|XP_002708720.1| PREDICTED: remodeling and spacing factor 1 [Oryctolagus cuniculus]
Length = 1438
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 902 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 955
>gi|255556257|ref|XP_002519163.1| Histone-lysine N-methyltransferase ATXR6, putative [Ricinus
communis]
gi|223541826|gb|EEF43374.1| Histone-lysine N-methyltransferase ATXR6, putative [Ricinus
communis]
Length = 344
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Query: 624 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
NLE + + C+ C SG P +LLCD+C+R +H+ CL+ L +PKG WFC
Sbjct: 22 NLEDD-DDYVWCQEC---GSGQSPAELLLCDKCDRGYHLFCLR----PILVSVPKGSWFC 73
>gi|149068918|gb|EDM18470.1| rCG40672, isoform CRA_b [Rattus norvegicus]
Length = 1418
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 877 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 930
>gi|392337823|ref|XP_001064125.3| PREDICTED: remodeling and spacing factor 1 [Rattus norvegicus]
gi|392344459|ref|XP_218939.4| PREDICTED: remodeling and spacing factor 1 [Rattus norvegicus]
Length = 1448
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 907 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 960
>gi|47211882|emb|CAF91178.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1357
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 9/65 (13%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINS 692
C + G ILLCD C+ +H CL+ M +P G+WFC C C ++
Sbjct: 843 CSDADQLLGSLQILLCDWCDSGYHTACLRPPLMV----IPDGEWFCPPCQHKQLCDKLQE 898
Query: 693 VLQNL 697
LQNL
Sbjct: 899 QLQNL 903
>gi|395521278|ref|XP_003764745.1| PREDICTED: remodeling and spacing factor 1 [Sarcophilus harrisii]
Length = 1410
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 871 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 924
>gi|148684366|gb|EDL16313.1| mCG124268, isoform CRA_c [Mus musculus]
Length = 1010
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 852 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 904
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 905 LEEQLQDLDV 914
>gi|354496667|ref|XP_003510447.1| PREDICTED: remodeling and spacing factor 1-like [Cricetulus
griseus]
Length = 1373
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 9/58 (15%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 837 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDV 890
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 61/157 (38%), Gaps = 38/157 (24%)
Query: 547 IICHCCNSEVSPSQFEAHAEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 604
I+C C P + H C + P+G W C +C+ K +Q +A R
Sbjct: 379 ILCDTC-----PRAY--HMVCLDPDMEKAPEGTWSCPHCE-----KEGIQWEA------R 420
Query: 605 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 664
G D E ++E + CR C K G +L CD C +H+ C
Sbjct: 421 EDGSDGEEDNGD-----AGDMEEDDHHMEFCRVC---KDG---GELLCCDSCPSSYHIHC 469
Query: 665 LKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 699
L L E+P G+W C CM C + +Q +L
Sbjct: 470 LN----PPLPEIPNGEWICPRCM-CPPMKGKVQKILT 501
>gi|260949643|ref|XP_002619118.1| hypothetical protein CLUG_00277 [Clavispora lusitaniae ATCC 42720]
gi|238846690|gb|EEQ36154.1| hypothetical protein CLUG_00277 [Clavispora lusitaniae ATCC 42720]
Length = 825
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + CL+CR P +LLCD C +H+ CL L E+P+G W+C
Sbjct: 183 EDDTESCLVCR------KNHSPTQMLLCDHCNNPYHLKCLS----PPLTEVPEGTWYC 230
>gi|54020946|ref|NP_001005717.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
gi|49522323|gb|AAH75306.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 428
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 579 YCKYC-QNMFERKRFLQHD--ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C + E K+ + + + GR SG S Q + I VK + C+ C
Sbjct: 311 YCDFCLGDELENKKTGSKEEMVSCADCGR-SGHPSCLQFSPNMIISVKKYPWQ---CIEC 366
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CLK L E P+G W C
Sbjct: 367 KSCGLCGTSDNDDQLLFCDDCDRGYHMYCLK----PPLSEPPEGSWSC 410
>gi|74219112|dbj|BAE26697.1| unnamed protein product [Mus musculus]
Length = 391
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C + ++ R + + + GR SG S Q T + VK + C+ C
Sbjct: 272 YCDFCLGDSKINKKTRQPEELVSCSDCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIEC 327
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 328 KCCNLCGTSENDDQLLFCDDCDRGYHMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|168047222|ref|XP_001776070.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672580|gb|EDQ59115.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2557
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 4/35 (11%)
Query: 649 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ILLCD C+ E+H+ CL + L ++PKG WFC
Sbjct: 1951 SILLCDGCDAEYHIYCL----VPPLEKVPKGNWFC 1981
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F +SG +LLCD C+R +H+ C K + +P G W+C
Sbjct: 1679 IMKANCQFCQSGDNEDKLLLCDGCDRGYHMYCFK----PKMESIPDGDWYC 1725
>gi|15223606|ref|NP_175477.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|12322340|gb|AAG51196.1|AC079279_17 hypothetical protein [Arabidopsis thaliana]
gi|332194450|gb|AEE32571.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 629
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 684
+S + C+ C + + T+L+CD CE+ +H+ CL H +++ +PK +W C
Sbjct: 318 MSKAMTCQICQGTINEI--ETVLICDACEKGYHLKCLHAH---NIKGVPKSEWHCS 368
>gi|397483740|ref|XP_003813055.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Pan paniscus]
Length = 1783
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|195399353|ref|XP_002058285.1| GJ15575 [Drosophila virilis]
gi|194150709|gb|EDW66393.1| GJ15575 [Drosophila virilis]
Length = 2162
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 40/80 (50%), Gaps = 14/80 (17%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ R + V+N + + C C C S+ P +L C+QC+R +H+ CL
Sbjct: 1930 EMPHRMVGRVRNYNWQCADCKCCIKCGSSQQ---PGKMLYCEQCDRGYHIYCL------G 1980
Query: 673 LRELPKGKWFC-----CMDC 687
++ +P+G+W C CM C
Sbjct: 1981 VKTVPEGRWSCERCSICMRC 2000
>gi|402880392|ref|XP_003903787.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Papio anubis]
Length = 1887
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
Length = 2144
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 6/68 (8%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL C C+ G ++LCD+C+ +H+ CL+ L +P G W C C C R
Sbjct: 213 CLDCTVCEGCGDGHDESNLILCDECDISYHIYCLE----PPLERIPHGPWRCKWCSACRR 268
Query: 690 INSVLQNL 697
+ + N+
Sbjct: 269 CGNQISNI 276
>gi|158297171|ref|XP_317442.4| AGAP008017-PA [Anopheles gambiae str. PEST]
gi|157015066|gb|EAA12387.4| AGAP008017-PA [Anopheles gambiae str. PEST]
Length = 2930
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2382 IMKANCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2428
>gi|374349207|ref|NP_001243398.1| histone acetyltransferase KAT6B isoform 3 [Homo sapiens]
gi|119574943|gb|EAW54558.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_b [Homo sapiens]
Length = 1781
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|6002686|gb|AAF00095.1| histone acetyltransferase MORF [Homo sapiens]
gi|20521021|dbj|BAA20837.2| KIAA0383 [Homo sapiens]
gi|152012887|gb|AAI50271.1| MYST4 protein [Homo sapiens]
gi|168267336|dbj|BAG09724.1| MYST histone acetyltransferase 4 [synthetic construct]
Length = 1781
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|348678698|gb|EGZ18515.1| hypothetical protein PHYSODRAFT_332293 [Phytophthora sojae]
Length = 375
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ KS R I+LCD C+ E+HV CL L ++P+G W+C
Sbjct: 172 CEVCKSSERERDIVLCDDCDAEYHVFCLSPP----LPKVPEGTWYC 213
>gi|443695479|gb|ELT96377.1| hypothetical protein CAPTEDRAFT_147496 [Capitella teleta]
Length = 1490
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ G G ++LLCD C+ FH+ CL + L E+PKG W C
Sbjct: 283 CNLCGRGDGEESMLLCDSCDDAFHMHCL----IPPLHEVPKGDWRC 324
>gi|25396154|pir||A88925 protein F33E11.3 [imported] - Caenorhabditis elegans
Length = 1192
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 9/76 (11%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ +R +VK E C+ CR C I+ CD+C+R FH C+
Sbjct: 453 EMPERMAALVKTYEW---SCVDCRLCSICNKPEKEDEIVFCDRCDRGFHTYCV------G 503
Query: 673 LRELPKGKWFCCMDCS 688
L++LP+G W C C+
Sbjct: 504 LKKLPQGTWICDTYCA 519
>gi|297301093|ref|XP_002805721.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Macaca
mulatta]
Length = 1784
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|224132698|ref|XP_002321387.1| SET domain protein [Populus trichocarpa]
gi|222868383|gb|EEF05514.1| SET domain protein [Populus trichocarpa]
Length = 325
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ SG P +LLCD+C++ FH+ CL+ L +PKG WFC
Sbjct: 7 CEECGSGESPGELLLCDKCDKGFHLFCLR----PILVSVPKGSWFC 48
>gi|338710071|ref|XP_001916345.2| PREDICTED: zinc finger protein neuro-d4-like [Equus caballus]
Length = 205
Score = 44.3 bits (103), Expect = 0.34, Method: Composition-based stats.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 87 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 140
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 141 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 187
>gi|157125686|ref|XP_001660731.1| hypothetical protein AaeL_AAEL002037 [Aedes aegypti]
gi|108882584|gb|EAT46809.1| AAEL002037-PA [Aedes aegypti]
Length = 2884
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 8/88 (9%)
Query: 600 VEAGRVSGVDSVEQ--ITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
+ G VS D+VE+ T + + LE+ ++ C F +SG +LLCD
Sbjct: 2293 IPKGLVSWRDAVERSVTTAQLSMALYVLESCVAWDKSIMKANCQFCQSGEQEDKLLLCDG 2352
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFC 683
C+R +H C K + ++P G W+C
Sbjct: 2353 CDRGYHTYCFKPR----MDKIPDGDWYC 2376
>gi|348664628|gb|EGZ04474.1| hypothetical protein PHYSODRAFT_343250 [Phytophthora sojae]
Length = 388
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ KS R I+LCD C+ E+HV CL L ++P+G W+C
Sbjct: 169 CEVCKSSERERDIVLCDDCDAEYHVFCLSP----PLPKVPEGTWYC 210
>gi|6002694|gb|AAF00099.1|AF119230_1 histone acetyltransferase MORF alpha [Homo sapiens]
Length = 1890
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|397483742|ref|XP_003813056.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Pan paniscus]
Length = 1892
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|374349205|ref|NP_001243397.1| histone acetyltransferase KAT6B isoform 2 [Homo sapiens]
gi|119574942|gb|EAW54557.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_a [Homo sapiens]
Length = 1890
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|402880390|ref|XP_003903786.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Papio anubis]
Length = 1778
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|332244078|ref|XP_003271198.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Nomascus leucogenys]
Length = 2055
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 231 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 286
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 287 RMPKGMWIC 295
>gi|297301091|ref|XP_002805720.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Macaca
mulatta]
Length = 1893
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|356554670|ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max]
Length = 832
Score = 44.3 bits (103), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 13/88 (14%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
I+LCD C+ +HV C+K + + +PKGKWFC + I ++ Q E+ K
Sbjct: 721 IVLCDACDHAYHVYCMKPPQNS----IPKGKWFCIKCEAGIQAIRQARKAYESNK----- 771
Query: 710 LNAIKKYAGNSLETVSDIDVRWRLLSGK 737
K N + DID +W G+
Sbjct: 772 ----GKVGQNDSKPNEDIDKKWNKKRGR 795
>gi|68565919|sp|Q8WML3.1|KAT6B_MACFA RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4
gi|17025966|dbj|BAB72094.1| histone acetyltransferase MORF [Macaca fascicularis]
Length = 1784
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|395820448|ref|XP_003783578.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Otolemur
garnettii]
Length = 1771
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|395820450|ref|XP_003783579.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Otolemur
garnettii]
Length = 1880
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|395741628|ref|XP_002820847.2| PREDICTED: histone acetyltransferase KAT6B, partial [Pongo abelii]
Length = 1870
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 43 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 98
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 99 RMPKGMWIC 107
>gi|149068917|gb|EDM18469.1| rCG40672, isoform CRA_a [Rattus norvegicus]
Length = 979
Score = 43.9 bits (102), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 12/70 (17%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 689
C+ C P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 868 CKKCGLPNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEK 920
Query: 690 INSVLQNLLV 699
+ LQ+L V
Sbjct: 921 LEEQLQDLDV 930
>gi|426365193|ref|XP_004049670.1| PREDICTED: histone acetyltransferase KAT6B [Gorilla gorilla
gorilla]
Length = 2072
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|426255810|ref|XP_004021541.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Ovis aries]
Length = 1869
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|355733048|gb|AES10897.1| MYST histone acetyltransferase monocytic leukemia 4 [Mustela
putorius furo]
Length = 1534
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 12 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 67
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 68 RMPKGMWIC 76
>gi|326522755|dbj|BAJ88423.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 562
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 14/74 (18%)
Query: 617 RCIRIV-------KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
RC+R KNLE LCRGC +K I LCD C+ +HV C+ +
Sbjct: 459 RCLRTSQIALEKQKNLECWYCPSCLCRGCFKNKDD---EEITLCDGCDEAYHVYCMTPKR 515
Query: 670 MADLRELPKGKWFC 683
+PKG W+C
Sbjct: 516 TC----VPKGHWYC 525
>gi|18032212|gb|AAL56647.1|AF217500_1 histone acetyltransferase MOZ2 [Homo sapiens]
Length = 2072
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|332834457|ref|XP_003312688.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Pan troglodytes]
Length = 2070
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|30685722|ref|NP_566656.3| PHD finger protein [Arabidopsis thaliana]
gi|75251245|sp|Q5PNS0.1|Y3228_ARATH RecName: Full=PHD finger protein At3g20280
gi|56382027|gb|AAV85732.1| At3g20280 [Arabidopsis thaliana]
gi|332642837|gb|AEE76358.1| PHD finger protein [Arabidopsis thaliana]
Length = 482
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+S + C+ C+ + + T+L+CD CE+ +H+ CL+ + M + +PK +W C
Sbjct: 42 MSRAMACQICEVTINEM--DTLLICDACEKAYHLKCLQGNNM---KGVPKSEWHC 91
>gi|6002696|gb|AAF00100.1|AF119231_1 histone acetyltransferase MORF beta [Homo sapiens]
Length = 2073
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|332022570|gb|EGI62872.1| Bromodomain adjacent to zinc finger domain protein 2B [Acromyrmex
echinatior]
Length = 2202
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 625 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1904 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 1959
Query: 683 C--CMD 686
C CM+
Sbjct: 1960 CHECMN 1965
>gi|449495879|ref|XP_004159972.1| PREDICTED: LOW QUALITY PROTEIN: lysine-specific demethylase 5C-like
[Cucumis sativus]
Length = 1845
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 8/66 (12%)
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
V L+ E + +C C KSG +LLCD+C++ +H CL L+++P G W
Sbjct: 234 VSKLKEEENNDQICEQC---KSGLHGEVMLLCDRCDKGWHTYCLS----PPLKQVPPGNW 286
Query: 682 FCCMDC 687
+ C+DC
Sbjct: 287 Y-CLDC 291
>gi|390472131|ref|XP_002807481.2| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Callithrix jacchus]
Length = 2066
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|449452470|ref|XP_004143982.1| PREDICTED: uncharacterized protein LOC101216734 [Cucumis sativus]
Length = 1843
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 8/66 (12%)
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
V L+ E + +C C KSG +LLCD+C++ +H CL L+++P G W
Sbjct: 235 VSKLKEEENNDQICEQC---KSGLHGEVMLLCDRCDKGWHTYCLS----PPLKQVPPGNW 287
Query: 682 FCCMDC 687
+ C+DC
Sbjct: 288 Y-CLDC 292
>gi|417406752|gb|JAA50020.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1807
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|302811223|ref|XP_002987301.1| hypothetical protein SELMODRAFT_956 [Selaginella moellendorffii]
gi|300144936|gb|EFJ11616.1| hypothetical protein SELMODRAFT_956 [Selaginella moellendorffii]
Length = 553
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 596 DANAVEAGRVSGV--DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 653
DA V+ GR + D V + R EA + L + C+ +SG + +L C
Sbjct: 188 DAVDVKIGRKRSLAEDEVSASVEPPKRTKSEDEASMQQRLGDQSCEQCRSGAHEQLMLNC 247
Query: 654 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
D C+R +H+ CL L E+P G+W+ C+DC
Sbjct: 248 DLCDRAWHMYCLS----PPLSEMPSGRWY-CLDC 276
>gi|303277527|ref|XP_003058057.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460714|gb|EEH58008.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 280
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
CD K T++LCD C+R FH+ CL L ELP G W C
Sbjct: 16 CDVCKQSTDDDTMILCDSCDRGFHMACLNPV----LNELPIGDWLC 57
>gi|187957110|gb|AAI50619.1| MYST histone acetyltransferase (monocytic leukemia) 4 [Homo
sapiens]
Length = 2073
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|397483738|ref|XP_003813054.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Pan paniscus]
Length = 2075
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 625 LEAELS--GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1974 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 2029
Query: 683 C--CMD 686
C CM+
Sbjct: 2030 CHECMN 2035
>gi|308497100|ref|XP_003110737.1| CRE-SET-16 protein [Caenorhabditis remanei]
gi|308242617|gb|EFO86569.1| CRE-SET-16 protein [Caenorhabditis remanei]
Length = 2509
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 6/62 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL C C+ +G +LLC++C+ +H+ C+K L +PKG W C C C R
Sbjct: 503 CLDCTFCEGCGAGGDEEKLLLCEECDVSYHMYCIK----PPLEAIPKGPWRCQWCSRCRR 558
Query: 690 IN 691
N
Sbjct: 559 CN 560
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 32/68 (47%), Gaps = 6/68 (8%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL C C+ P +LLCD C+ +H CL L+ +PKG W C C+ C+
Sbjct: 1115 CLECTVCEACGKATDPGRLLLCDDCDISYHTFCLD----PPLQTVPKGGWKCKWCVSCTN 1170
Query: 690 INSVLQNL 697
++ L
Sbjct: 1171 CKAITPGL 1178
>gi|100816397|ref|NP_036462.2| histone acetyltransferase KAT6B isoform 1 [Homo sapiens]
gi|143811424|sp|Q8WYB5.3|KAT6B_HUMAN RecName: Full=Histone acetyltransferase KAT6B; AltName:
Full=Histone acetyltransferase MOZ2; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Monocytic leukemia zinc finger
protein-related factor
gi|119574944|gb|EAW54559.1| MYST histone acetyltransferase (monocytic leukemia) 4, isoform
CRA_c [Homo sapiens]
Length = 2073
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|363735536|ref|XP_421609.3| PREDICTED: histone acetyltransferase KAT6B [Gallus gallus]
Length = 2025
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|326923554|ref|XP_003208000.1| PREDICTED: histone acetyltransferase MYST4-like [Meleagris
gallopavo]
Length = 2028
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 253 KFCPELTSNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 308
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 309 RMPKGMWIC 317
>gi|403298004|ref|XP_003939830.1| PREDICTED: histone acetyltransferase KAT6B [Saimiri boliviensis
boliviensis]
Length = 2051
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|348576164|ref|XP_003473857.1| PREDICTED: histone acetyltransferase MYST4-like isoform 2 [Cavia
porcellus]
Length = 1762
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|449505049|ref|XP_002192975.2| PREDICTED: histone acetyltransferase KAT6B [Taeniopygia guttata]
Length = 1842
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|426255808|ref|XP_004021540.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Ovis aries]
Length = 1760
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|417406735|gb|JAA50012.1| Putative histone acetyltransferase myst family [Desmodus rotundus]
Length = 1778
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|402880388|ref|XP_003903785.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Papio anubis]
Length = 2070
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|345305893|ref|XP_001506182.2| PREDICTED: histone acetyltransferase MYST4 isoform 1
[Ornithorhynchus anatinus]
Length = 2066
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNAENMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|338716908|ref|XP_003363543.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Equus
caballus]
Length = 1769
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|170053718|ref|XP_001862805.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874114|gb|EDS37497.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 3017
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2469 QNCQFCQSGEQEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2512
>gi|403293045|ref|XP_003937534.1| PREDICTED: zinc finger protein neuro-d4 [Saimiri boliviensis
boliviensis]
Length = 293
Score = 43.9 bits (102), Expect = 0.42, Method: Composition-based stats.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 175 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 228
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 229 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 275
>gi|355782819|gb|EHH64740.1| hypothetical protein EGM_18047 [Macaca fascicularis]
Length = 2069
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|355562477|gb|EHH19071.1| hypothetical protein EGK_19714 [Macaca mulatta]
Length = 2077
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|291404131|ref|XP_002718449.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 2 [Oryctolagus cuniculus]
Length = 1774
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|195501992|ref|XP_002098034.1| GE24161 [Drosophila yakuba]
gi|194184135|gb|EDW97746.1| GE24161 [Drosophila yakuba]
Length = 2286
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 12/64 (18%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C + +LLCD C + +H+ CL L E+P G W+C
Sbjct: 237 EEEVTNCEICESPERED------VMLLCDSCNQGYHMDCLDPQ----LHEIPAGSWYCDN 286
Query: 684 CMDC 687
C+D
Sbjct: 287 CIDS 290
>gi|354468677|ref|XP_003496778.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Cricetulus
griseus]
Length = 1756
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|395820446|ref|XP_003783577.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Otolemur
garnettii]
Length = 2062
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 13/90 (14%)
Query: 598 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 657
N +++ ++ +E+I + ++ V+ + E C+ C G G + +LLCD C
Sbjct: 787 NTIQSQQLQPSLIIEEIYRDQVKKVQIQDGENIWEEQCKVC-----GQGGK-VLLCDTCP 840
Query: 658 REFHVGCLKKHKMADLRELPKGKWFCCMDC 687
R FH CLK L+E+PKGKW CM C
Sbjct: 841 RVFHPRCLK------LKEIPKGKW-SCMIC 863
>gi|431904096|gb|ELK09518.1| Histone acetyltransferase MYST4 [Pteropus alecto]
Length = 1926
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|338716911|ref|XP_003363544.1| PREDICTED: histone acetyltransferase MYST4 isoform 3 [Equus
caballus]
Length = 1878
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|426255806|ref|XP_004021539.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Ovis aries]
Length = 2052
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|115533182|ref|NP_001041113.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
gi|351062484|emb|CCD70456.1| Protein PHF-10, isoform b [Caenorhabditis elegans]
Length = 447
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 9/76 (11%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ +R +VK E C+ CR C I+ CD+C+R FH C+
Sbjct: 361 EMPERMAALVKTYEW---SCVDCRLCSICNKPEKEDEIVFCDRCDRGFHTYCV------G 411
Query: 673 LRELPKGKWFCCMDCS 688
L++LP+G W C C+
Sbjct: 412 LKKLPQGTWICDTYCA 427
>gi|320582838|gb|EFW97055.1| JmjC domain family histone demethylase [Ogataea parapolymorpha
DL-1]
Length = 797
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 50/110 (45%), Gaps = 22/110 (20%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CM-----------DCSRINSV 693
P TIL+CD C R FH+ CL L+E+P WFC C+ D I ++
Sbjct: 192 PDTILICDDCNRNFHMRCLN----PILKEVPDTDWFCDDCLKGSSAEYGFEEDFESIYTI 247
Query: 694 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-VRWRLLSGKAATPE 742
+ +E + L + N + + GN +V I+ V WRL+ + E
Sbjct: 248 RE--FARECDDLKRRYCNEM--FDGNMNPSVDAIESVFWRLVDTQDEEEE 293
>gi|410975403|ref|XP_003994122.1| PREDICTED: LOW QUALITY PROTEIN: histone acetyltransferase KAT6B
[Felis catus]
Length = 2078
Score = 43.9 bits (102), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|348506563|ref|XP_003440828.1| PREDICTED: zinc finger protein DPF3-like [Oreochromis niloticus]
Length = 391
Score = 43.9 bits (102), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 53/113 (46%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C Q+ + + + + GR SG + Q T+ ++ V+ + + C+ C
Sbjct: 275 YCDFCLGDQDSNRKTGQAEELVSCSDCGR-SGHPTCLQFTENMMQAVRTYQWQ---CIEC 330
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C + +L CD C+R +H+ CLK + + P+G W C C+D
Sbjct: 331 KSCSLCGTSENDDQLLFCDDCDRGYHMYCLK----PPMTQPPEGSWSCHLCLD 379
>gi|432100457|gb|ELK29089.1| Histone acetyltransferase MYST4 [Myotis davidii]
Length = 2022
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|356499679|ref|XP_003518664.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Glycine
max]
Length = 334
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ G P ++LCD+C+R +H+ CL+ L +PKG WFC
Sbjct: 25 CEECGGGHSPSKLILCDKCDRGYHLFCLR----PILPSVPKGSWFC 66
>gi|301773210|ref|XP_002922022.1| PREDICTED: histone acetyltransferase MYST4-like [Ailuropoda
melanoleuca]
gi|281342250|gb|EFB17834.1| hypothetical protein PANDA_010953 [Ailuropoda melanoleuca]
Length = 2063
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|84996013|ref|XP_952728.1| zinc-finger protein [Theileria annulata strain Ankara]
gi|65303725|emb|CAI76102.1| zinc-finger protein, putative [Theileria annulata]
Length = 237
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV + C +C C+ +++G T+L+CD C+R FH+ C + + E+P G
Sbjct: 69 IVIRYPWHCNSCKICVNCNEAENGVSS-TLLICDSCDRAFHMECTR----SKYTEIPSGN 123
Query: 681 WFC 683
W+C
Sbjct: 124 WYC 126
>gi|194042830|ref|XP_001928984.1| PREDICTED: histone acetyltransferase MYST4 [Sus scrofa]
Length = 2065
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|73953062|ref|XP_536397.2| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Canis lupus
familiaris]
Length = 2090
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|348576162|ref|XP_003473856.1| PREDICTED: histone acetyltransferase MYST4-like isoform 1 [Cavia
porcellus]
Length = 2053
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|149689991|ref|XP_001504001.1| PREDICTED: histone acetyltransferase MYST4 isoform 1 [Equus
caballus]
Length = 2061
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|297467918|ref|XP_872746.3| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Bos taurus]
gi|297491533|ref|XP_002698931.1| PREDICTED: histone acetyltransferase KAT6B [Bos taurus]
gi|296472060|tpg|DAA14175.1| TPA: MYST histone acetyltransferase (monocytic leukemia) 4 [Bos
taurus]
Length = 2054
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|298707919|emb|CBJ30305.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1534
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
CR C + K P LLC+ C+ E+H CL L E+PKGKW+C
Sbjct: 352 CRKC-YGKQD--PELALLCEHCDDEYHTYCLDP----PLTEVPKGKWYC 393
>gi|242036737|ref|XP_002465763.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
gi|241919617|gb|EER92761.1| hypothetical protein SORBIDRAFT_01g045403 [Sorghum bicolor]
Length = 158
Score = 43.9 bits (102), Expect = 0.47, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 261 DGGILCSCSLCNGC----RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
DGG C+C GC +V+ +FE HA + + +I NGKSL ++ R VP
Sbjct: 2 DGGYACACPASAGCGYHGKVLSARQFEKHAGAESNNPNGHILLRNGKSLYQLFHDLRHVP 61
Query: 317 LPMLKA 322
L A
Sbjct: 62 AEALAA 67
>gi|392333207|ref|XP_003752828.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1855
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|358344833|ref|XP_003636491.1| hypothetical protein MTR_042s0033 [Medicago truncatula]
gi|355502426|gb|AES83629.1| hypothetical protein MTR_042s0033 [Medicago truncatula]
Length = 355
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 10/111 (9%)
Query: 758 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE-VAELP 816
F P ++S ++ MVY N +G E GG Y I+ S + R + E+P
Sbjct: 254 FGPGIES-----MVEQMVY--NKKG-EHGGFYTCIIQKEGKCASVAVFRAHARNGFVEIP 305
Query: 817 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 867
+VAT + +G + L IE + L + ++LP+ ++W + FGF
Sbjct: 306 IVATMESFRSQGLCKRLIHSIENCVKRLGIHHLILPSLRTRLAMW-EHFGF 355
>gi|351714578|gb|EHB17497.1| Histone acetyltransferase MYST4 [Heterocephalus glaber]
Length = 2108
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 252 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 307
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 308 RMPKGMWIC 316
>gi|126272817|ref|XP_001366112.1| PREDICTED: histone acetyltransferase MYST4 [Monodelphis domestica]
Length = 2045
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|6716789|gb|AAF26744.1|AF222800_1 histone acetyltransferase querkopf [Mus musculus]
Length = 1763
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|74184716|dbj|BAE27963.1| unnamed protein product [Mus musculus]
Length = 1763
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|354468679|ref|XP_003496779.1| PREDICTED: histone acetyltransferase MYST4 isoform 2 [Cricetulus
griseus]
Length = 2047
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|169146772|emb|CAQ13473.1| novel protein similar to human D4, zinc and double PHD fingers
family 1 (DPF1) [Danio rerio]
Length = 127
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 49/116 (42%), Gaps = 10/116 (8%)
Query: 568 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 627
A S IP G YC +C ++ + + + GR SG S Q T V+
Sbjct: 4 ADGSVIPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRW 60
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 61 Q---CIECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLSP----PMSEPPEGSWSC 109
>gi|110556652|ref|NP_059507.2| histone acetyltransferase KAT6B [Mus musculus]
gi|327365366|ref|NP_001192170.1| histone acetyltransferase KAT6B [Mus musculus]
gi|148669523|gb|EDL01470.1| mCG123147, isoform CRA_a [Mus musculus]
Length = 1763
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|440893247|gb|ELR46092.1| Histone acetyltransferase MYST4 [Bos grunniens mutus]
Length = 2054
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|341942234|sp|Q8BRB7.3|KAT6B_MOUSE RecName: Full=Histone acetyltransferase KAT6B; AltName: Full=MOZ,
YBF2/SAS3, SAS2 and TIP60 protein 4; Short=MYST-4;
AltName: Full=Protein querkopf
Length = 1872
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|302143935|emb|CBI23040.3| unnamed protein product [Vitis vinifera]
Length = 400
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 77/182 (42%), Gaps = 23/182 (12%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
C+ SG +LLCD+C+R FH+ CL+ ++ +PKG WFC CS +
Sbjct: 88 CEECGSGDAADELLLCDKCDRGFHLFCLRPIIVS----VPKGPWFCP-SCSSQKKLKYFP 142
Query: 698 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 757
LVQ K+ +F I++ ++ + D R + S + + R LL
Sbjct: 143 LVQ--TKIVDFF--RIQRSTESTQKLNYDSQKRRKRSSSLVVSKKKRKLLP--------- 189
Query: 758 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-----TVNSSVVSAGILRVFGQEV 812
F+P D + + + S+ EF I + N +V+ G ++V +E
Sbjct: 190 FNPTEDPLRRMEQMASLATALTATRTEFSNELTYIPGMAPKSANRAVLEHGGMQVLSKED 249
Query: 813 AE 814
E
Sbjct: 250 TE 251
>gi|405965654|gb|EKC31016.1| Histone acetyltransferase MYST4 [Crassostrea gigas]
Length = 2037
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 8/99 (8%)
Query: 606 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 665
SG S + + VK L + C+ C+ C F + +L CD C+R FH+ C
Sbjct: 236 SGHPSCLKFSPELTETVKKLRWQ---CIDCKTCSFCQKSGREDNMLFCDLCDRGFHMECC 292
Query: 666 KKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLVQEAE 703
L + PKGKW C + D R N ++ L A+
Sbjct: 293 D----PPLSKAPKGKWKCNICDPDRGNGKGRHFLEMAAK 327
>gi|395501556|ref|XP_003755159.1| PREDICTED: histone acetyltransferase KAT6B isoform 1 [Sarcophilus
harrisii]
Length = 1753
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|350407087|ref|XP_003487980.1| PREDICTED: hypothetical protein LOC100749908 [Bombus impatiens]
Length = 2303
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2019 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2066
>gi|291404129|ref|XP_002718448.1| PREDICTED: MYST histone acetyltransferase (monocytic leukemia) 4
isoform 1 [Oryctolagus cuniculus]
Length = 2065
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|168029787|ref|XP_001767406.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681302|gb|EDQ67730.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2476
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVL 694
CR C + +I+LCD+C+ E+H CL L +P+G WFC +C ++ V
Sbjct: 1089 CRVCGVDEDY---ESIMLCDKCDAEYHTYCLN----PPLERVPEGTWFCP-ECVALDKVF 1140
Query: 695 QNLLVQEAE 703
+ L ++ E
Sbjct: 1141 PDRLRKDGE 1149
>gi|195505552|ref|XP_002099554.1| GE10967 [Drosophila yakuba]
gi|194185655|gb|EDW99266.1| GE10967 [Drosophila yakuba]
Length = 1475
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P+G W+C DC
Sbjct: 1067 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPQGNWYCN-DC 1112
>gi|392353369|ref|XP_003751480.1| PREDICTED: histone acetyltransferase KAT6B-like [Rattus norvegicus]
Length = 1640
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|344274631|ref|XP_003409118.1| PREDICTED: histone acetyltransferase MYST4 [Loxodonta africana]
Length = 1878
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|357615613|gb|EHJ69753.1| putative hepatitis B virus X associated protein, hbxa [Danaus
plexippus]
Length = 2087
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 11/62 (17%)
Query: 643 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 702
SG P ILLCD+C+ +H CLK L +P+G WFC +CS + +L+ E
Sbjct: 867 SGEQPEWILLCDRCDGGYHASCLK----PVLLLVPEGDWFCP-ECSHM------MLITEL 915
Query: 703 EK 704
EK
Sbjct: 916 EK 917
>gi|328909281|gb|AEB61308.1| remodeling and spacing factor 1-like protein, partial [Equus
caballus]
Length = 324
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 19/88 (21%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV-- 699
P ILLCD C+ +H CL+ M +P G+WFC C C ++ LQ+L V
Sbjct: 161 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCEKLEEQLQDLDVAL 216
Query: 700 ---QEAEKLPEFHLNAIKKYAGNSLETV 724
+ AE+ E + Y G S+E +
Sbjct: 217 KKKERAERRKERLV-----YVGISIENI 239
>gi|328792710|ref|XP_623473.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis mellifera]
Length = 2293
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2009 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2056
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
++ C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1932 IMKANCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1983
>gi|380485054|emb|CCF39608.1| origin recognition complex subunit 4 [Colletotrichum higginsianum]
Length = 863
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL+C D SK G IL CD C++ +H C K K +P+G W+C C+ +
Sbjct: 348 CLVCSKPD-SKPG---NRILFCDGCDKAYHQKCYKVPK------VPRGDWYCNECVQQKQ 397
Query: 690 INSVLQNLLVQEAEKLP--EFHLNAIKK 715
+ EA K+P E HLN++K+
Sbjct: 398 SRAAAAG----EAVKIPNVEQHLNSLKR 421
>gi|395501560|ref|XP_003755161.1| PREDICTED: histone acetyltransferase KAT6B isoform 3 [Sarcophilus
harrisii]
Length = 1862
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|348505986|ref|XP_003440541.1| PREDICTED: PHD and RING finger domain-containing protein 1-like
[Oreochromis niloticus]
Length = 1068
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 14/107 (13%)
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
G+V + +V++ K NL+ E + C +C G D +LLCD C+ +H
Sbjct: 158 GGKVKKMITVQKPVKEGQEETVNLDLEQTNCEVCGGSDREDR------LLLCDGCDAGYH 211
Query: 662 VGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLP 706
+ CL L +P +WFC C +R +S + L +A+ LP
Sbjct: 212 MECL----TPPLDSVPVEEWFCPECEANNRNSSSAEEL--SDADSLP 252
>gi|159490503|ref|XP_001703214.1| hypothetical protein CHLREDRAFT_195012 [Chlamydomonas reinhardtii]
gi|158270673|gb|EDO96510.1| predicted protein [Chlamydomonas reinhardtii]
Length = 862
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 56/113 (49%), Gaps = 4/113 (3%)
Query: 792 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 851
+L + V+A +R G E+P VAT + G+GY + L +E++ L +K ++L
Sbjct: 155 VLQSHGKPVTAATIRS-GPGYLEIPFVATHENKRGRGYCRCLVEAVEEIARTLGLKRLML 213
Query: 852 PAAEEA--ESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 902
+ +A +S W GF E + + + +V + T+ +QK VPA R
Sbjct: 214 CSTNDASVQSTW-KHLGFHYCPDEQMEEWDILHTDMVYLQNTTQMQKDVPAPR 265
>gi|340709835|ref|XP_003393506.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Bombus terrestris]
Length = 2263
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 1979 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2026
>gi|395501558|ref|XP_003755160.1| PREDICTED: histone acetyltransferase KAT6B isoform 2 [Sarcophilus
harrisii]
Length = 2045
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 9/76 (11%)
Query: 620 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 679
RI K + + C+ C C+ P +LLCD C+ +H CL L +PKG
Sbjct: 842 RITKVMLLKGWRCVECIVCEVCGKATDPSRLLLCDDCDISYHTYCLD----PPLHTVPKG 897
Query: 680 KWFC-----CMDCSRI 690
W C CM C +
Sbjct: 898 GWKCRWCVSCMQCGAV 913
>gi|327261925|ref|XP_003215777.1| PREDICTED: lysine-specific demethylase 5C-like [Anolis
carolinensis]
Length = 1551
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 7/74 (9%)
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
S E TK +R+ +N + +CR C G +LLCD C+ +H+ CL
Sbjct: 370 SPEPCTKMTMRLRRNHNTQFIESYVCRIC---ARGDEDDKLLLCDGCDDNYHIFCL---- 422
Query: 670 MADLRELPKGKWFC 683
+ L E+PKG W C
Sbjct: 423 LPPLPEIPKGVWRC 436
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 9/61 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW-----FCCMD 686
CL C C+ P +LLCD C+ +H CL+ L +PKG W CC+
Sbjct: 233 CLECIVCEMCGKASDPSRLLLCDDCDVSYHTYCLE----PPLHNVPKGGWKCKWCVCCVQ 288
Query: 687 C 687
C
Sbjct: 289 C 289
>gi|326671393|ref|XP_684139.4| PREDICTED: hypothetical protein LOC327618 [Danio rerio]
Length = 1899
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 9/56 (16%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNL 697
P ILLCD C+ +H CL+ M +P G+WFC C C R+ L NL
Sbjct: 1380 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKLLCDRLEEQLLNL 1431
>gi|225455318|ref|XP_002275757.1| PREDICTED: histone-lysine N-methyltransferase ATXR6-like [Vitis
vinifera]
Length = 354
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 82/195 (42%), Gaps = 23/195 (11%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 697
C+ SG +LLCD+C+R FH+ CL+ ++ +PKG WF C CS +
Sbjct: 42 CEECGSGDAADELLLCDKCDRGFHLFCLRPIIVS----VPKGPWF-CPSCSSQKKLKYFP 96
Query: 698 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 757
LVQ K+ +F I++ ++ + D R + S + + R LL
Sbjct: 97 LVQ--TKIVDFF--RIQRSTESTQKLNYDSQKRRKRSSSLVVSKKKRKLLP--------- 143
Query: 758 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-----TVNSSVVSAGILRVFGQEV 812
F+P D + + + S+ EF I + N +V+ G ++V +E
Sbjct: 144 FNPTEDPLRRMEQMASLATALTATRTEFSNELTYIPGMAPKSANRAVLEHGGMQVLSKED 203
Query: 813 AELPLVATSKINHGK 827
E + S + G+
Sbjct: 204 TETLNLCKSMMGRGE 218
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1790 NCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1837
>gi|348670550|gb|EGZ10371.1| hypothetical protein PHYSODRAFT_563445 [Phytophthora sojae]
Length = 3176
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 7/49 (14%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
CR C ++ + +LLCD+C+ +H CLK + L +PKG+WFC
Sbjct: 976 CRNC---QTIYAKGRLLLCDRCDAPYHTFCLK----SPLSVIPKGEWFC 1017
>gi|242072516|ref|XP_002446194.1| hypothetical protein SORBIDRAFT_06g003260 [Sorghum bicolor]
gi|241937377|gb|EES10522.1| hypothetical protein SORBIDRAFT_06g003260 [Sorghum bicolor]
Length = 159
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Query: 840 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 899
+L+ ++V+ +V+PA W F FK +DPEL R+R LV GT++LQK +
Sbjct: 2 VLASVQVEKLVIPAIAALVDTWKRFFSFKLLDPELKEEIRRR--SLVVITGTTLLQKPIV 59
Query: 900 ACRIGSSSTDSTECVSG 916
A S TE +
Sbjct: 60 AAPPSPSQHKQTEAAAA 76
>gi|126338798|ref|XP_001365864.1| PREDICTED: zinc finger protein ubi-d4-like [Monodelphis domestica]
Length = 395
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 9/87 (10%)
Query: 602 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
+G SG S Q T + VK + C+ C+ C+ + +L CD C+R +H
Sbjct: 301 SGTDSGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGYH 357
Query: 662 VGCLKKHKMADLRELPKGKWFC--CMD 686
+ CL + E P+G W C C+D
Sbjct: 358 MYCL----TPSMSEPPEGSWSCHLCLD 380
>gi|195396284|ref|XP_002056762.1| GJ11111 [Drosophila virilis]
gi|194143471|gb|EDW59874.1| GJ11111 [Drosophila virilis]
Length = 2347
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 12/70 (17%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV + +++ C +C D + +LLCD C + +H+ CL L E+P G
Sbjct: 285 IVAIDDDDITNCEICNRPDREE------IMLLCDSCNQGYHMDCLD----PPLYEIPAGS 334
Query: 681 WFC--CMDCS 688
W+C C+D S
Sbjct: 335 WYCDNCIDSS 344
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 43.5 bits (101), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 72/188 (38%), Gaps = 28/188 (14%)
Query: 517 FDESGLPDGTE-VGYYACGQKLLEGYKNGLGII-CHCCNSEVSPSQFEAHAEC--ASLSS 572
FD + +P E GY Q E + G II C C P + H C L
Sbjct: 255 FDLAPVPGDEEGDGYETDHQDYCEVCQQGGEIILCDTC-----PRAY--HLVCLEPELDK 307
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
P+G W C +C+ K +Q +A E I++ + + E E
Sbjct: 308 APEGKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPMGAEEEDDDH 362
Query: 633 L-LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRI 690
+ CR C K G +L CD C +H+ CL L E+P G+W C C I
Sbjct: 363 MEFCRVC---KDGG---ELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPI 412
Query: 691 NSVLQNLL 698
+Q +L
Sbjct: 413 KGRVQRIL 420
>gi|284080633|gb|ADB77890.1| lysin (K)-specific demethylase 5D, partial [Monodelphis domestica]
Length = 1479
Score = 43.5 bits (101), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 41/82 (50%), Gaps = 8/82 (9%)
Query: 603 GRVSGVDSVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 661
G+ + ++E TK +R+ +N A+ +CR C G +LLCD C+ +H
Sbjct: 255 GKEELIHNLEPCTKMTMRLRRNHSNAQFIDSYVCRIC---TRGDEDDKLLLCDGCDDNYH 311
Query: 662 VGCLKKHKMADLRELPKGKWFC 683
+ CL + L E+PKG W C
Sbjct: 312 IFCL----LPPLPEIPKGSWRC 329
>gi|410916367|ref|XP_003971658.1| PREDICTED: zinc finger protein DPF3-like [Takifugu rubripes]
Length = 391
Score = 43.5 bits (101), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 52/113 (46%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C Q+ + + + + GR SG + Q T ++ V+ + + C+ C
Sbjct: 275 YCDFCLGDQDSNRKTGQAEELVSCSDCGR-SGHPTCLQFTDNMMQAVRTYQWQ---CIEC 330
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C + +L CD C+R +H+ CLK + + P+G W C C+D
Sbjct: 331 KSCSICGTSENDDQLLFCDDCDRGYHMYCLK----PPMTQPPEGSWSCHLCLD 379
>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
Length = 3915
Score = 43.5 bits (101), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 52 IKITKVVLSKGWRCLECTVCEACGQASDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 107
Query: 679 GKWFC--CMDCSR 689
G W C C+ C+
Sbjct: 108 GSWKCKWCVLCTH 120
>gi|195449467|ref|XP_002072084.1| GK22656 [Drosophila willistoni]
gi|194168169|gb|EDW83070.1| GK22656 [Drosophila willistoni]
Length = 2325
Score = 43.5 bits (101), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 12/64 (18%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
+ E++ C +C+ + + +LLCD C +H+ CL L E+P+G W+C
Sbjct: 248 DVEVTNCEICQNPEREE------VMLLCDSCNHGYHMDCLD----PPLHEIPEGSWYCDN 297
Query: 684 CMDC 687
C+D
Sbjct: 298 CVDS 301
>gi|432891023|ref|XP_004075510.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Oryzias
latipes]
Length = 381
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 67/165 (40%), Gaps = 26/165 (15%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASL----------SSIPQG----DW 578
CG++ YKN G+ H ++ ++ + E +E +L P G +
Sbjct: 211 CGKR----YKNRPGLSYHYTHTHLADEEGEEDSERHTLPFQRKNNHKPKKAPDGSVIANG 266
Query: 579 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 638
YC +C ++ + + + GR SG S Q T V+ + C+ C+ C
Sbjct: 267 YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---CIECKSC 322
Query: 639 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +L CD C+R +H+ CL + E P+G W C
Sbjct: 323 SLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMSEPPEGSWSC 363
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 72/186 (38%), Gaps = 33/186 (17%)
Query: 518 DESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC--ASLSSIPQ 575
DE G DG E + + +G G I+C C P + H C L P+
Sbjct: 380 DEEG--DGYETDHQDYCEVCQQG---GEIILCDTC-----PRAY--HLVCLEPELDKAPE 427
Query: 576 GDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-L 634
G W C +C+ K +Q +A E I++ + + E E +
Sbjct: 428 GKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPTGAEEEDDDHMEF 482
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINS 692
CR C K G +L CD C +H+ CL L E+P G+W C C C I
Sbjct: 483 CRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKG 531
Query: 693 VLQNLL 698
+Q +L
Sbjct: 532 RVQKIL 537
>gi|341878859|gb|EGT34794.1| CBN-SET-16 protein [Caenorhabditis brenneri]
Length = 2498
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 9/62 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C +C GC G + ++LCD+C+ +H CLK + ++P G W C C C R
Sbjct: 482 CTICEGC---GKGGDDKNLMLCDECDVPYHTYCLK----PPIEKVPTGSWRCQWCSRCRR 534
Query: 690 IN 691
N
Sbjct: 535 CN 536
>gi|320166419|gb|EFW43318.1| mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]
Length = 1858
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 9/58 (15%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDC 687
C++C C+ S F T+++CD C H+GC+ L E+P G+W C C+ C
Sbjct: 634 CIVCTTCN---SSFPEETLVVCDNCAVGRHLGCMD----IPLAEVPSGRWLCSQCVKC 684
>gi|30685716|ref|NP_851007.1| PHD finger protein [Arabidopsis thaliana]
gi|332642836|gb|AEE76357.1| PHD finger protein [Arabidopsis thaliana]
Length = 743
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+S + C+ C+ + + T+L+CD CE+ +H+ CL+ + M + +PK +W C
Sbjct: 303 MSRAMACQICEVTINEM--DTLLICDACEKAYHLKCLQGNNM---KGVPKSEWHC 352
>gi|168048866|ref|XP_001776886.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671742|gb|EDQ58289.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 329
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 37/81 (45%), Gaps = 8/81 (9%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
+ N A LC C K G +LLCDQC+R FH+ CL L +P G
Sbjct: 3 VWTNGPASAYSLTLCEEC---KGGDSAEQMLLCDQCDRGFHMFCLS----PILVSIPPGD 55
Query: 681 WFCCMDCSRINSVLQNLLVQE 701
W C CS+ + L+VQ+
Sbjct: 56 WI-CPHCSKSTIAHEFLMVQK 75
>gi|194898789|ref|XP_001978950.1| GG10942 [Drosophila erecta]
gi|190650653|gb|EDV47908.1| GG10942 [Drosophila erecta]
Length = 2287
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 31/64 (48%), Gaps = 12/64 (18%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C + +LLCD C + +H+ CL L E+P G W+C
Sbjct: 237 EEEVTNCEICESPERED------VMLLCDSCNQGYHMDCLD----PPLHEIPAGSWYCDN 286
Query: 684 CMDC 687
C+D
Sbjct: 287 CIDS 290
>gi|301609145|ref|XP_002934140.1| PREDICTED: PHD finger protein 21A-like [Xenopus (Silurana)
tropicalis]
Length = 681
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 494 CSICR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPKCQD 537
>gi|357481617|ref|XP_003611094.1| Lysine-specific demethylase 5D [Medicago truncatula]
gi|355512429|gb|AES94052.1| Lysine-specific demethylase 5D [Medicago truncatula]
Length = 1586
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C+ KSG +LLCD+C++ +H+ CL L+++P G W+C C+ R
Sbjct: 245 CEQCKSGLHGEVMLLCDRCDKGWHIYCLS----PPLKQIPLGNWYCFNCLSSDR 294
>gi|221120366|ref|XP_002164134.1| PREDICTED: histone acetyltransferase KAT6B-like [Hydra
magnipapillata]
Length = 832
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 8/70 (11%)
Query: 618 CIRIVKNLEA----ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
C++ K L A E C+ C+ C+ K +L CD C++ +H+ CL L
Sbjct: 228 CMQYSKELTARVRQEPWQCMECKKCNICKDQGEAANLLFCDACDKGYHMACLD----PPL 283
Query: 674 RELPKGKWFC 683
++P G W C
Sbjct: 284 DDMPIGTWIC 293
>gi|195476347|ref|XP_002086095.1| GE11247 [Drosophila yakuba]
gi|194185954|gb|EDW99565.1| GE11247 [Drosophila yakuba]
Length = 497
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 381 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 438
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL M P+G W C CM+
Sbjct: 439 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLMTP----PEGSWSCKLCME 491
>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
Length = 2891
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2705 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 2758
Query: 678 KGKWFC-----CMDCS 688
+G+W C C +CS
Sbjct: 2759 QGRWHCQECAVCANCS 2774
>gi|255552612|ref|XP_002517349.1| DNA binding protein, putative [Ricinus communis]
gi|223543360|gb|EEF44891.1| DNA binding protein, putative [Ricinus communis]
Length = 2145
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 15/86 (17%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 693
+C+ C F K ++LLCD C+ E+H CL L +P+G W+C S
Sbjct: 1222 VCKVCGFDKDD---DSVLLCDTCDAEYHTYCLN----PPLARIPEGNWYC-------PSC 1267
Query: 694 LQNLLVQEAEKLPE-FHLNAIKKYAG 718
+ +VQEA + N+ KKY G
Sbjct: 1268 VSVRMVQEASVSTQVIGQNSCKKYQG 1293
>gi|357481615|ref|XP_003611093.1| Lysine-specific demethylase 5D [Medicago truncatula]
gi|355512428|gb|AES94051.1| Lysine-specific demethylase 5D [Medicago truncatula]
Length = 1832
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C+ KSG +LLCD+C++ +H+ CL L+++P G W+C C+ R
Sbjct: 245 CEQCKSGLHGEVMLLCDRCDKGWHIYCLS----PPLKQIPLGNWYCFNCLSSDR 294
>gi|357481613|ref|XP_003611092.1| Lysine-specific demethylase 5D [Medicago truncatula]
gi|355512427|gb|AES94050.1| Lysine-specific demethylase 5D [Medicago truncatula]
Length = 1836
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C+ KSG +LLCD+C++ +H+ CL L+++P G W+C C+ R
Sbjct: 245 CEQCKSGLHGEVMLLCDRCDKGWHIYCLS----PPLKQIPLGNWYCFNCLSSDR 294
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 7/70 (10%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
CL C C+ P +LLCD C+ +H CL L+ +PKG W C ++
Sbjct: 824 CLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPKGGWKCKW---WMH 876
Query: 692 SVLQNLLVQE 701
++ QNL +E
Sbjct: 877 AICQNLNTEE 886
>gi|194864252|ref|XP_001970846.1| GG10866 [Drosophila erecta]
gi|190662713|gb|EDV59905.1| GG10866 [Drosophila erecta]
Length = 497
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 381 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 438
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL M P+G W C CM+
Sbjct: 439 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLSPPLMTP----PEGSWSCKLCME 491
>gi|410053854|ref|XP_001162157.3| PREDICTED: zinc finger protein neuro-d4 [Pan troglodytes]
Length = 519
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 401 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 454
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 455 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 501
>gi|358054197|dbj|GAA99733.1| hypothetical protein E5Q_06436 [Mixia osmundae IAM 14324]
Length = 1169
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 25/38 (65%), Gaps = 5/38 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
+LLCD+C+R +H+ CL L+ +PKG+W C DC
Sbjct: 507 MLLCDECDRGYHIHCL----TPPLKSIPKGRWI-CKDC 539
>gi|432891025|ref|XP_004075511.1| PREDICTED: zinc finger protein neuro-d4-like isoform 2 [Oryzias
latipes]
Length = 371
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 67/165 (40%), Gaps = 26/165 (15%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASL----------SSIPQG----DW 578
CG++ YKN G+ H ++ ++ + E +E +L P G +
Sbjct: 201 CGKR----YKNRPGLSYHYTHTHLADEEGEEDSERHTLPFQRKNNHKPKKAPDGSVIANG 256
Query: 579 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 638
YC +C ++ + + + GR SG S Q T V+ + C+ C+ C
Sbjct: 257 YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---CIECKSC 312
Query: 639 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +L CD C+R +H+ CL + E P+G W C
Sbjct: 313 SLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMSEPPEGSWSC 353
>gi|427797535|gb|JAA64219.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2019
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS- 688
S C +C C K G +L C+ C+ FH+ CLK L + PKG W C CS
Sbjct: 285 SQCRMCSRCGNKKEG---EHLLCCEVCDSHFHLRCLK----PPLLKAPKGSWKCT-SCSS 336
Query: 689 -RINSV--LQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 724
R+ S+ + NL + +K FH ++K +L+ V
Sbjct: 337 RRLKSINFVNNLASKIKQKNKRFHNGMMRKQKITTLKDV 375
>gi|45382857|ref|NP_989970.1| zinc finger protein DPF3 [Gallus gallus]
gi|18202301|sp|P58270.1|DPF3_CHICK RecName: Full=Zinc finger protein DPF3; AltName: Full=Zinc finger
protein cer-d4
gi|14010362|gb|AAK51968.1|AF362754_1 cer-d4 [Gallus gallus]
Length = 427
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 310 YCDFCLGGSNMNKKSGRPEELVSCSDCGR-SGHPTCLQFTTNMTEAVKTYQWQ---CIEC 365
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 366 KSCSLCGTSENDDQLLFCDDCDRGYHMYCLN----PPVFEPPEGSWSC 409
>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
Length = 3312
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 3126 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 3179
Query: 678 KGKWFC-----CMDCS 688
+G+W C C +CS
Sbjct: 3180 QGRWHCQECAVCANCS 3195
>gi|342184650|emb|CCC94132.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 610
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 30/116 (25%), Positives = 54/116 (46%), Gaps = 16/116 (13%)
Query: 760 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA------ 813
PI+ + SGR L P + LRG + C +++ +++ + FG ++A
Sbjct: 180 PILATPSGRCLFPCSLSDEGLRGIQEQSPLCEVISTDTASFLDNSVACFGHKIAVHRGHV 239
Query: 814 --ELPLVATSKINHGKGYFQLLFACIEKLLSFL------RVKSIVLPAAEEAESIW 861
ELPL+ ++ GKG+ L +C +KLL + R ++ + P S+W
Sbjct: 240 ANELPLIG--RVKPGKGFQNLCMSCSDKLLRWQCLGIQGRRRTRLFPERIHLSSVW 293
>gi|195568470|ref|XP_002102239.1| GD19604 [Drosophila simulans]
gi|194198166|gb|EDX11742.1| GD19604 [Drosophila simulans]
Length = 2135
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 12/84 (14%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-- 683
E E++ C +C D +LLCD C + +H+ CL L E+P G W+C
Sbjct: 241 EEEVTNCEICECPDRED------VMLLCDSCNQGYHMDCLD----PPLYEIPAGSWYCDN 290
Query: 684 CMDCSRINSVLQNLLVQEAEKLPE 707
C+D + Q L + +L E
Sbjct: 291 CIDSDDEDDNEQLELADDLNQLYE 314
>gi|270015458|gb|EFA11906.1| hypothetical protein TcasGA2_TC004063 [Tribolium castaneum]
Length = 2306
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 16/114 (14%)
Query: 617 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
R R ++E E+ C C+ C S P ILLCD C+ +H CL+ + +
Sbjct: 1436 RTARKESDVE-EVDDCP-CQKCGKSDH---PEWILLCDSCDNGWHCSCLRPPLLV----I 1486
Query: 677 PKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK----YAGNSLETV 724
P+G WFC C S I S LQ L + +KL + + +K Y G SL +V
Sbjct: 1487 PEGDWFCPPCQHASLI-SKLQEKLKEYDKKLNKKEIEDRRKQRLAYVGISLNSV 1539
>gi|427797307|gb|JAA64105.1| Putative histone acetyltransferase myst family, partial
[Rhipicephalus pulchellus]
Length = 2011
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 47/99 (47%), Gaps = 12/99 (12%)
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS- 688
S C +C C K G +L C+ C+ FH+ CLK L + PKG W C CS
Sbjct: 285 SQCRMCSRCGNKKEG---EHLLCCEVCDSHFHLRCLK----PPLLKAPKGSWKCT-SCSS 336
Query: 689 -RINSV--LQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 724
R+ S+ + NL + +K FH ++K +L+ V
Sbjct: 337 RRLKSINFVNNLASKIKQKNKRFHNGMMRKQKITTLKDV 375
>gi|328719118|ref|XP_003246668.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Acyrthosiphon pisum]
Length = 1312
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+LCR S +LLCD C + H+ CLK L+++P+G WFC
Sbjct: 1005 NCVLCR------SNKDEDVMLLCDNCNKGHHIYCLK----PKLKKIPQGDWFC 1047
>gi|147801796|emb|CAN72320.1| hypothetical protein VITISV_000998 [Vitis vinifera]
Length = 256
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ SG +LLCD+C+R FH+ CL+ + +PKG WFC
Sbjct: 42 CEECGSGXAADELLLCDKCDRGFHLFCLR----PIIVSVPKGPWFC 83
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1023 IKITKVVLSKGWRCLECTVCEACGQATDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 1078
Query: 679 GKWFC--CMDCSRINSVLQNL 697
W C C+ C++ + L
Sbjct: 1079 DSWKCKWCVSCTQCGATTPGL 1099
>gi|260818595|ref|XP_002604468.1| hypothetical protein BRAFLDRAFT_122277 [Branchiostoma floridae]
gi|229289795|gb|EEN60479.1| hypothetical protein BRAFLDRAFT_122277 [Branchiostoma floridae]
Length = 514
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 9/80 (11%)
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
RVSG S ++ ++++K + C+ C+ C ++ CD C+R FH
Sbjct: 419 RVSGHPSCLDMSDELVQVIKTYPWQ---CMECKTCVICGDPSEEDKMMFCDVCDRGFHTF 475
Query: 664 CLKKHKMADLRELPKGKWFC 683
C+ L ELP G+W C
Sbjct: 476 CV------GLEELPIGQWVC 489
>gi|307199377|gb|EFN80002.1| Supporter of activation of yellow protein [Harpegnathos saltator]
Length = 1532
Score = 43.1 bits (100), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P+
Sbjct: 1346 LDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVPQ 1399
Query: 679 GKWFC-----CMDC 687
G+W C C +C
Sbjct: 1400 GRWHCQECAVCANC 1413
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 43.1 bits (100), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 923 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 978
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 979 GGWKCKWCVWC 989
>gi|328719116|ref|XP_001943234.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Acyrthosiphon pisum]
Length = 1273
Score = 43.1 bits (100), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+LCR S +LLCD C + H+ CLK L+++P+G WFC
Sbjct: 1005 NCVLCR------SNKDEDVMLLCDNCNKGHHIYCLK----PKLKKIPQGDWFC 1047
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 5/38 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
+LLCD+C+R H+ CLK L+ +PKG WFC DC
Sbjct: 1144 MLLCDKCDRGHHMYCLKPR----LKHVPKGDWFCP-DC 1176
>gi|225898018|dbj|BAH30341.1| hypothetical protein [Arabidopsis thaliana]
Length = 629
Score = 43.1 bits (100), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 16/56 (28%), Positives = 32/56 (57%), Gaps = 5/56 (8%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 684
+S + C+ C + + T+++CD CE+ +H+ CL H +++ +PK +W C
Sbjct: 318 MSKAMTCQICQGTINEI--ETVVICDACEKGYHLKCLHAH---NIKGVPKSEWHCS 368
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 43.1 bits (100), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 597 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 652
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 653 GGWKCKWCVWC 663
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 985 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1040
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1041 GGWKCKWCVWC 1051
>gi|429328203|gb|AFZ79963.1| hypothetical protein BEWA_028120 [Babesia equi]
Length = 721
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 13/76 (17%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CM 685
+++GC +C G D + +LLCDQC+ FH CL L +P+G W+C C
Sbjct: 363 DVNGCDIC-GLDNNWD-----QLLLCDQCDHGFHTYCLN----PPLDSVPEGDWYCTSCT 412
Query: 686 DCSRINSVLQNLLVQE 701
+ RI+S + N++ E
Sbjct: 413 NV-RISSGIVNVVEPE 427
>gi|449502401|ref|XP_004174505.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein DPF3
[Taeniopygia guttata]
Length = 392
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 275 YCDFCLGGSNMNKKSGRPEELVSCSDCGR-SGHPTCLQFTTNMTEAVKTYQWQ---CIEC 330
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 331 KSCSLCGTSENDDQLLFCDDCDRGYHMYCLN----PPVFEPPEGSWSC 374
>gi|195354526|ref|XP_002043748.1| GM16453 [Drosophila sechellia]
gi|194128948|gb|EDW50991.1| GM16453 [Drosophila sechellia]
Length = 1430
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1065 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPPGNWYCN-DC 1110
>gi|189241962|ref|XP_968788.2| PREDICTED: similar to CG8677 CG8677-PA [Tribolium castaneum]
Length = 2314
Score = 42.7 bits (99), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 16/114 (14%)
Query: 617 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
R R ++E E+ C C+ C S P ILLCD C+ +H CL+ + +
Sbjct: 1444 RTARKESDVE-EVDDCP-CQKCGKSDH---PEWILLCDSCDNGWHCSCLRPPLLV----I 1494
Query: 677 PKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK----YAGNSLETV 724
P+G WFC C S I S LQ L + +KL + + +K Y G SL +V
Sbjct: 1495 PEGDWFCPPCQHASLI-SKLQEKLKEYDKKLNKKEIEDRRKQRLAYVGISLNSV 1547
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 907 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 962
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 963 GGWKCKWCVWC 973
>gi|145345983|ref|XP_001417477.1| hypothetical protein, contains no bromo domain [Ostreococcus
lucimarinus CCE9901]
gi|144577704|gb|ABO95770.1| hypothetical protein, contains no bromo domain [Ostreococcus
lucimarinus CCE9901]
Length = 1474
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
GC +C G D T+LLCD C+ E+H CL +A+ P+G+WFC
Sbjct: 927 GCSVC-GLDVMAG-----TVLLCDSCDGEYHAKCLDPPLLAE----PEGEWFC 969
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 938 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 993
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 994 GGWKCKWCVWC 1004
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 9/61 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW-----FCCMD 686
CL C C+ P +LLCD C+ +H CL L +PKG W CC+
Sbjct: 733 CLECIVCEMCGKASDPSRLLLCDDCDVSYHTYCLD----PPLHNVPKGGWKCKWCVCCVQ 788
Query: 687 C 687
C
Sbjct: 789 C 789
>gi|402592532|gb|EJW86460.1| hypothetical protein WUBG_02629, partial [Wuchereria bancrofti]
Length = 2207
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 6/68 (8%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL C C+ G ++LCD+C+ +H+ CL+ L +P G W C C C R
Sbjct: 315 CLDCTVCEGCGDGHDESNLILCDECDISYHIYCLE----PPLERIPHGPWRCKWCSACRR 370
Query: 690 INSVLQNL 697
+ + N+
Sbjct: 371 CGNQIFNV 378
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1028 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1083
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1084 GGWKCKWCVWC 1094
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 974 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1029
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1030 GGWKCKWCVWC 1040
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 991 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1046
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1047 GGWKCKWCVWC 1057
>gi|412986673|emb|CCO15099.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 26/46 (56%), Gaps = 6/46 (13%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +SG R +LLCD C+ +HV C+ L+ +P+G+W C
Sbjct: 134 CTVCQSGDDERNLLLCDGCDEGYHVSCVG------LQRVPRGRWHC 173
>gi|322707184|gb|EFY98763.1| PHD and RING finger domain protein, putative [Metarhizium
anisopliae ARSEF 23]
Length = 651
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 30/125 (24%), Positives = 54/125 (43%), Gaps = 25/125 (20%)
Query: 580 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE--QITKRCIR------IVKNLEAELSG 631
C C+ F R N V+ +S D ++ Q+ + ++ IV E E +
Sbjct: 84 CPICRTPFHCVRVY----NGVDGTAISTYDVIDKKQVAEFDVQAWLGENIVDQEEEECNP 139
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C +C + +LLCD C+ +H C+ L +P+G W+ CM+C+ +
Sbjct: 140 CPICNSAERED------ILLLCDSCDAAYHTHCIG------LDHIPEGDWY-CMECAHLF 186
Query: 692 SVLQN 696
+ Q+
Sbjct: 187 QLTQD 191
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 991 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1046
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1047 GGWKCKWCVWC 1057
>gi|198455671|ref|XP_001357517.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
gi|198135345|gb|EAL24641.2| GA15428 [Drosophila pseudoobscura pseudoobscura]
Length = 507
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 391 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 448
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 449 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 501
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 991 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1046
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1047 GGWKCKWCVWC 1057
>gi|443694019|gb|ELT95254.1| hypothetical protein CAPTEDRAFT_227914 [Capitella teleta]
Length = 675
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 6/54 (11%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
C+ C+ C ++ CD C+R FH CL LR +P G+W C M
Sbjct: 611 CMDCKMCAKCNDAGDEEKMMFCDHCDRGFHTFCLG------LRVIPTGRWVCPM 658
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 52 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 107
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 108 GGWKCKWCVWC 118
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 949 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1004
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1005 GGWKCKWCVWC 1015
>gi|322796544|gb|EFZ19018.1| hypothetical protein SINV_05175 [Solenopsis invicta]
Length = 4168
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 7/67 (10%)
Query: 617 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 676
R + +N E C+ C P ILLCD C++ +H CLK M +
Sbjct: 2900 RTAQKAQNEVEEAEDEYACQKC---GKADHPEWILLCDSCDKGWHCSCLKPALML----I 2952
Query: 677 PKGKWFC 683
P+G WFC
Sbjct: 2953 PEGDWFC 2959
>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
Length = 3066
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 11/75 (14%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2879 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2932
Query: 678 KGKWFC-----CMDC 687
+G+W C C +C
Sbjct: 2933 QGRWHCQECAVCANC 2947
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 436 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 491
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 492 GGWKCKWCVWC 502
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 991 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1046
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1047 GGWKCKWCVWC 1057
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 52 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 107
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 108 GGWKCKWCVWC 118
>gi|60678057|gb|AAX33535.1| LD32807p [Drosophila melanogaster]
Length = 1151
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1064 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPPGNWYCN-DC 1109
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 991 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1046
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1047 GGWKCKWCVWC 1057
>gi|391343322|ref|XP_003745961.1| PREDICTED: uncharacterized protein LOC100901645 [Metaseiulus
occidentalis]
Length = 1253
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 10/52 (19%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
CL+C D T+LLCD+C+ +H CL LR +P G+WFC
Sbjct: 546 CLVCHRSDQED------TLLLCDECDDAYHCSCL----TPPLRSVPVGQWFC 587
>gi|351697354|gb|EHB00273.1| PHD finger protein 21A, partial [Heterocephalus glaber]
Length = 664
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 475 CSICR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 518
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 52 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 107
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 108 GGWKCKWCVWC 118
>gi|195391586|ref|XP_002054441.1| GJ24457 [Drosophila virilis]
gi|194152527|gb|EDW67961.1| GJ24457 [Drosophila virilis]
Length = 1501
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LC+ C + G P +LLCDQC H+ C+K +R +P+G W+C
Sbjct: 1081 LCKVC---RRGTDPEKMLLCDQCNGGTHMFCMK----PKMRTVPEGNWYC 1123
>gi|156356298|ref|XP_001623864.1| predicted protein [Nematostella vectensis]
gi|156210600|gb|EDO31764.1| predicted protein [Nematostella vectensis]
Length = 78
Score = 42.7 bits (99), Expect = 0.88, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
I+K+ + + C C CD +G +LLCD C+R +H+ CL L + PKG
Sbjct: 20 IIKHYPWQCNDCKTCHLCD---TGEVQNELLLCDNCDRGYHMSCLD----PKLTKAPKGA 72
Query: 681 WFCCM 685
W C +
Sbjct: 73 WHCVL 77
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 436 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 491
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 492 GGWKCKWCVWC 502
>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
Length = 2182
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 12/76 (15%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 688
GC +C D S+ +LLCD+C+ E H+ C + ELP+G+WFC C S
Sbjct: 235 GCCVCGNNDHSEE------LLLCDRCDGEVHMQC----ATPKVTELPEGEWFCGYCRARS 284
Query: 689 RINSVLQNLLVQEAEK 704
+ + ++++ + AEK
Sbjct: 285 KFIAKVEDVKQEIAEK 300
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 42.7 bits (99), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 562 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 617
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 618 GGWKCKWCVWC 628
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 937 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 992
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 993 GGWKCKWCVWC 1003
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 42.7 bits (99), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 942 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 997
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 998 GGWKCKWCVWC 1008
>gi|229594235|ref|XP_001024908.3| PHD-finger family protein [Tetrahymena thermophila]
gi|225566987|gb|EAS04663.3| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 425
Score = 42.7 bits (99), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV + + C LC C+ +L+CD C+R FH+ CL+ + ++P+G+
Sbjct: 302 IVSKYDWFCTECKLCIACN---KNTKENELLMCDCCDRPFHMSCLEPART----DIPEGR 354
Query: 681 WFCCMDCSR 689
WF C DC +
Sbjct: 355 WF-CKDCEK 362
>gi|62645406|ref|XP_342468.2| PREDICTED: PHD finger protein 21A isoform 2 [Rattus norvegicus]
gi|109470259|ref|XP_001070000.1| PREDICTED: PHD finger protein 21A isoform 1 [Rattus norvegicus]
Length = 694
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 506 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 549
>gi|14010360|gb|AAK51967.1|AF362753_1 cer-d4 [Gallus gallus]
Length = 378
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 261 YCDFCLGGSNMNKKSGRPEELVSCSDCGR-SGHPTCLQFTTNMTEAVKTYQWQ---CIEC 316
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 317 KSCSLCGTSENDDQLLFCDDCDRGYHMYCLN----PPVFEPPEGSWSC 360
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1010 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1065
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1066 GGWKCKWCVWC 1076
>gi|412985443|emb|CCO18889.1| unnamed protein product [Bathycoccus prasinos]
Length = 1938
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 10/52 (19%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C +C+G D +LLCD C+ FH+ CLK L+++P G WFC
Sbjct: 1573 CEMCQGGDREDE------VLLCDGCDCGFHIFCLK----PPLKKIPDGDWFC 1614
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1042 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1097
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1098 GGWKCKWCVWC 1108
>gi|393907985|gb|EJD74857.1| CBR-NURF-1 protein, variant [Loa loa]
Length = 2096
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 13/63 (20%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV+N E +L+ L CR C P +LLCD CE +H+ C L ++P GK
Sbjct: 295 IVRN-EGKLTADLHCRECG------KPGDVLLCDGCEACYHLSC------TSLADVPDGK 341
Query: 681 WFC 683
W C
Sbjct: 342 WLC 344
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 985 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1040
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1041 GGWKCKWCVWC 1051
>gi|327259541|ref|XP_003214595.1| PREDICTED: zinc finger protein DPF3-like [Anolis carolinensis]
Length = 398
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 281 YCDFCLGGSNMNKKSGRPEELVSCSDCGR-SGHPTCLQFTANMTEAVKTYQWQ---CIEC 336
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 337 KSCSLCGTSENDDQLLFCDDCDRGYHMYCLN----PPVSEPPEGSWSC 380
>gi|194904679|ref|XP_001981042.1| GG11833 [Drosophila erecta]
gi|190655680|gb|EDV52912.1| GG11833 [Drosophila erecta]
Length = 1481
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1070 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPTGNWYCN-DC 1115
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 503 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 558
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 559 GGWKCKWCVWC 569
>gi|449274633|gb|EMC83711.1| PHD finger protein 21A, partial [Columba livia]
Length = 651
Score = 42.7 bits (99), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 463 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 506
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 42.7 bits (99), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 986 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1041
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1042 GGWKCKWCVWC 1052
>gi|391334814|ref|XP_003741795.1| PREDICTED: uncharacterized protein LOC100898311 [Metaseiulus
occidentalis]
Length = 1347
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 35/71 (49%), Gaps = 9/71 (12%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINS 692
CR C+ S P ILLCD C+ +H+ CLK M +P G W+C C + I +
Sbjct: 806 CRKCNKSDH---PEFILLCDSCDAGYHMSCLKPALMV----IPLGNWYCPPCEHDALIEA 858
Query: 693 VLQNLLVQEAE 703
+ L EAE
Sbjct: 859 LNGKLSFIEAE 869
>gi|326920420|ref|XP_003206472.1| PREDICTED: PHD finger protein 21A-like [Meleagris gallopavo]
Length = 679
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 534
>gi|298204724|emb|CBI25222.3| unnamed protein product [Vitis vinifera]
Length = 248
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 2/94 (2%)
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HA +R +YI ENG SL V+ A ++ L L ++ A+ ++S C+ CKG
Sbjct: 143 HADSSNKRPPEYIYLENGTSLRGVMNAWKNAALYSLDEAIRVAIGCSMIKRSTFCLNCKG 202
Query: 345 TFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTG 378
I G LC SC++ K+ Q + TG
Sbjct: 203 R--IFGAGIGNSKVLCLSCLQLKESQAGPSQVTG 234
>gi|255073299|ref|XP_002500324.1| predicted protein [Micromonas sp. RCC299]
gi|226515586|gb|ACO61582.1| predicted protein [Micromonas sp. RCC299]
Length = 1219
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 6/33 (18%)
Query: 651 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LLCD C+ FH+ CLK+ + +PKG WFC
Sbjct: 1154 LLCDGCDGTFHLKCLKR------KTIPKGDWFC 1180
>gi|313851038|ref|NP_001186576.1| PHD finger protein 21A [Gallus gallus]
Length = 679
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 534
>gi|449502288|ref|XP_002200166.2| PREDICTED: PHD finger protein 21A isoform 1 [Taeniopygia guttata]
Length = 679
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 534
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 944 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 999
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1000 GGWKCKWCVWC 1010
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 984 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1039
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1040 GGWKCKWCVWC 1050
>gi|393907986|gb|EJD74858.1| CBR-NURF-1 protein [Loa loa]
Length = 2137
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 13/63 (20%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV+N E +L+ L CR C P +LLCD CE +H+ C L ++P GK
Sbjct: 295 IVRN-EGKLTADLHCRECG------KPGDVLLCDGCEACYHLSC------TSLADVPDGK 341
Query: 681 WFC 683
W C
Sbjct: 342 WLC 344
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 985 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1040
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1041 GGWKCKWCVWC 1051
>gi|410899461|ref|XP_003963215.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Takifugu
rubripes]
Length = 3715
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 9/61 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW-----FCCMD 686
CL C C+ P +LLCD C+ +H CL L +PKG W CC+
Sbjct: 463 CLECIVCEMCGKASDPSRLLLCDDCDVSYHTYCLD----PPLHNVPKGGWKCKWCVCCVQ 518
Query: 687 C 687
C
Sbjct: 519 C 519
>gi|399217986|emb|CCF74873.1| unnamed protein product [Babesia microti strain RI]
Length = 540
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 13/90 (14%)
Query: 599 AVEAGRVSGVDS---VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 655
+EAG++S + + V + + + ++ + + +GC +C G D +LLCD+
Sbjct: 189 TIEAGKISFIQNSGNVVLVEDKSLLSPRDQQIQDTGCEIC-GHDNDWD-----MMLLCDE 242
Query: 656 CEREFHVGCLKKHKMADLRELPKGKWFCCM 685
C+ FH+ CL L +P G WFC +
Sbjct: 243 CDNGFHIYCLN----PPLTHIPPGLWFCTV 268
>gi|157841176|ref|NP_001103160.1| PHD finger protein 21A isoform 2 [Mus musculus]
Length = 688
Score = 42.7 bits (99), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 500 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 543
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 42.7 bits (99), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 52 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 107
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 108 GGWKCKWCVWC 118
>gi|25777805|gb|AAN75611.1| RUM1 [Cryptococcus neoformans var. neoformans]
Length = 1863
Score = 42.7 bits (99), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ K + ILLCD C+R FH+ CL L +P +WFC
Sbjct: 487 CEICKGEYDADKILLCDSCDRGFHIYCLD----PPLASVPNNEWFC 528
>gi|312083536|ref|XP_003143902.1| PHD-finger family protein [Loa loa]
Length = 1483
Score = 42.7 bits (99), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 13/63 (20%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
IV+N E +L+ L CR C P +LLCD CE +H+ C L ++P GK
Sbjct: 243 IVRN-EGKLTADLHCRECG------KPGDVLLCDGCEACYHLSC------TSLADVPDGK 289
Query: 681 WFC 683
W C
Sbjct: 290 WLC 292
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 989 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1044
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1045 GGWKCKWCVWC 1055
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 66/180 (36%), Gaps = 38/180 (21%)
Query: 525 GTEVGYYACGQKLLEGYKNGLGII-CHCCNSEVSPSQFEAHAEC--ASLSSIPQGDWYCK 581
G GY Q E + G II C C P + H C + P+G W C
Sbjct: 350 GAADGYETDHQDYCEVCQQGGEIILCDTC-----PRAY--HMVCLDPDMDKAPEGKWSCP 402
Query: 582 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 641
+C+ K +Q +A + +I V + E E CR C
Sbjct: 403 HCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC--- 444
Query: 642 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 699
K G +L CD C +H+ CL L E+P G+W C C C + +Q +L
Sbjct: 445 KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKILT 496
>gi|449502292|ref|XP_004174497.1| PREDICTED: PHD finger protein 21A isoform 3 [Taeniopygia guttata]
Length = 678
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 490 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 533
>gi|156546894|ref|NP_001095272.1| PHD finger protein 21A isoform a [Homo sapiens]
gi|332836258|ref|XP_001161944.2| PREDICTED: PHD finger protein 21A isoform 14 [Pan troglodytes]
gi|426368107|ref|XP_004051054.1| PREDICTED: PHD finger protein 21A isoform 3 [Gorilla gorilla
gorilla]
gi|74731224|sp|Q96BD5.1|PF21A_HUMAN RecName: Full=PHD finger protein 21A; AltName: Full=BHC80a;
AltName: Full=BRAF35-HDAC complex protein BHC80
gi|16041692|gb|AAH15714.1| PHF21A protein [Homo sapiens]
gi|167773799|gb|ABZ92334.1| PHD finger protein 21A [synthetic construct]
Length = 680
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|307193181|gb|EFN76086.1| Remodeling and spacing factor 1 [Harpegnathos saltator]
Length = 4001
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2754 ACQKCNKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2796
>gi|296217998|ref|XP_002755258.1| PREDICTED: PHD finger protein 21A [Callithrix jacchus]
Length = 680
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|119588424|gb|EAW68018.1| PHD finger protein 21A, isoform CRA_d [Homo sapiens]
gi|410216394|gb|JAA05416.1| PHD finger protein 21A [Pan troglodytes]
gi|410306382|gb|JAA31791.1| PHD finger protein 21A [Pan troglodytes]
gi|410335365|gb|JAA36629.1| PHD finger protein 21A [Pan troglodytes]
Length = 681
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 492 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 535
>gi|449502284|ref|XP_004174496.1| PREDICTED: PHD finger protein 21A isoform 2 [Taeniopygia guttata]
Length = 686
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 498 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 541
>gi|5059246|gb|AAD38952.1|AF148962_1 ATP-dependent chromatin assembly factor large subunit [Drosophila
melanogaster]
Length = 1476
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1064 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPPGNWYCN-DC 1109
>gi|380813474|gb|AFE78611.1| PHD finger protein 21A isoform a [Macaca mulatta]
gi|383418943|gb|AFH32685.1| PHD finger protein 21A isoform a [Macaca mulatta]
gi|384947504|gb|AFI37357.1| PHD finger protein 21A isoform a [Macaca mulatta]
Length = 680
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|355566574|gb|EHH22953.1| hypothetical protein EGK_06314 [Macaca mulatta]
gi|355752187|gb|EHH56307.1| hypothetical protein EGM_05684 [Macaca fascicularis]
Length = 681
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 492 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 535
>gi|149236904|ref|XP_001524329.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451864|gb|EDK46120.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 837
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 622 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
++N E + CL+C D P LLCD C+ FH+ CL L ++P W
Sbjct: 182 IENDEQDSDYCLICGDND------NPEETLLCDHCDGAFHMACLN----PPLTQVPATNW 231
Query: 682 FC 683
+C
Sbjct: 232 YC 233
>gi|449502298|ref|XP_004174499.1| PREDICTED: PHD finger protein 21A isoform 5 [Taeniopygia guttata]
Length = 554
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 366 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 409
>gi|19921648|ref|NP_610163.1| d4, isoform A [Drosophila melanogaster]
gi|16417832|gb|AAL18868.1|AF427473_1 dd4 protein [Drosophila melanogaster]
gi|16198077|gb|AAL13829.1| LD29238p [Drosophila melanogaster]
gi|21626860|gb|AAF57340.2| d4, isoform A [Drosophila melanogaster]
gi|220942560|gb|ACL83823.1| d4-PA [synthetic construct]
gi|220952536|gb|ACL88811.1| d4-PA [synthetic construct]
Length = 497
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 381 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 438
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 439 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 491
>gi|380792697|gb|AFE68224.1| histone acetyltransferase KAT6B, partial [Macaca mulatta]
Length = 1077
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
Length = 2805
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2619 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2672
Query: 678 KGKWFC-----CMDC 687
+G+W C C++C
Sbjct: 2673 QGRWHCQECAVCVNC 2687
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 994 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1049
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1050 GGWKCKWCVWC 1060
>gi|24651761|ref|NP_536734.2| ATP-dependent chromatin assembly factor large subunit [Drosophila
melanogaster]
gi|7302099|gb|AAF57200.1| ATP-dependent chromatin assembly factor large subunit [Drosophila
melanogaster]
Length = 1476
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1064 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPPGNWYCN-DC 1109
>gi|332259882|ref|XP_003279013.1| PREDICTED: PHD finger protein 21A [Nomascus leucogenys]
Length = 680
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|7635502|emb|CAB88669.1| chromatin accessibility complex (CHRAC) [Drosophila melanogaster]
Length = 1476
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C DC
Sbjct: 1064 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLK----PKLRSVPPGNWYCN-DC 1109
>gi|50949620|emb|CAH10542.1| hypothetical protein [Homo sapiens]
Length = 560
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 371 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 414
>gi|410909614|ref|XP_003968285.1| PREDICTED: uncharacterized protein LOC101062098 [Takifugu rubripes]
Length = 1314
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 9/56 (16%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNL 697
P ILLCD C+ +H CL+ M +P G+WFC C C ++ L NL
Sbjct: 761 PELILLCDSCDSGYHTACLRPPLML----IPDGEWFCPPCQHKLLCEKLEEQLHNL 812
>gi|297298207|ref|XP_002805197.1| PREDICTED: hypothetical protein LOC694878 [Macaca mulatta]
Length = 472
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 355 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 410
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 411 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 454
>gi|303283986|ref|XP_003061284.1| JmjN/JmjC protein [Micromonas pusilla CCMP1545]
gi|226457635|gb|EEH54934.1| JmjN/JmjC protein [Micromonas pusilla CCMP1545]
Length = 2683
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 15/68 (22%)
Query: 621 IVKNLEAELSG-----CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
+ + L+ E G CL C G +S ++LCD C++ +H+ CL + E
Sbjct: 248 VAQQLDEEAKGPLIHACLNCGGSSHEES------MILCDGCDQGYHMYCLSP----PMEE 297
Query: 676 LPKGKWFC 683
LP+G WFC
Sbjct: 298 LPRGDWFC 305
>gi|242011986|ref|XP_002426724.1| hepatitis B virus X associated protein, hbxa, putative [Pediculus
humanus corporis]
gi|212510895|gb|EEB13986.1| hepatitis B virus X associated protein, hbxa, putative [Pediculus
humanus corporis]
Length = 2351
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
P ILLCD+C++ +H CL+ M +P+G WFC
Sbjct: 1278 PEWILLCDKCDKGWHASCLRPTLMI----IPEGDWFC 1310
>gi|449543136|gb|EMD34113.1| hypothetical protein CERSUDRAFT_141605, partial [Ceriporiopsis
subvermispora B]
Length = 260
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C+ C+ C+ S ++ CD C+R +H+ CL L E P GKW C
Sbjct: 32 CMECKNCEICHSKEDDNRMMFCDFCDRGWHMDCLD----PPLSEAPPGKWHC 79
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 930 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 985
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 986 GGWKCKWCVWC 996
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 877 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 932
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 933 GGWKCKWCVWC 943
>gi|449444070|ref|XP_004139798.1| PREDICTED: uncharacterized protein LOC101205573 [Cucumis sativus]
Length = 574
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LCR C ++ I+LCD C+ FH+ C++ L +PKGKWFC
Sbjct: 459 LCRACLINQD---DDKIVLCDGCDHGFHIYCMR----PPLAAIPKGKWFC 501
>gi|301780976|ref|XP_002925892.1| PREDICTED: zinc finger protein neuro-d4-like isoform 1 [Ailuropoda
melanoleuca]
gi|410983100|ref|XP_003997881.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Felis catus]
Length = 414
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 296 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 349
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 350 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 396
>gi|205830430|ref|NP_001128627.1| zinc finger protein neuro-d4 isoform a [Homo sapiens]
gi|297704629|ref|XP_002829197.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Pongo abelii]
gi|395847017|ref|XP_003796183.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Otolemur
garnettii]
gi|402905395|ref|XP_003915505.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Papio anubis]
gi|387540168|gb|AFJ70711.1| zinc finger protein neuro-d4 isoform a [Macaca mulatta]
Length = 414
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 296 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 349
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 350 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 396
>gi|198452824|ref|XP_001358956.2| GA15159 [Drosophila pseudoobscura pseudoobscura]
gi|198132094|gb|EAL28099.2| GA15159 [Drosophila pseudoobscura pseudoobscura]
Length = 1515
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LC+ C + G P +LLCD+C H+ C+K +R +PKG W+C
Sbjct: 1080 LCKVC---RRGSDPDKMLLCDECNGGTHMFCMK----PKMRSVPKGHWYC 1122
>gi|9294545|dbj|BAB02808.1| unnamed protein product [Arabidopsis thaliana]
Length = 1424
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/55 (30%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 629 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+S + C+ C+ + + T+L+CD CE+ +H+ CL+ + M + +PK +W C
Sbjct: 984 MSRAMACQICEVTINEMD--TLLICDACEKAYHLKCLQGNNM---KGVPKSEWHC 1033
>gi|406603646|emb|CCH44847.1| hypothetical protein BN7_4416 [Wickerhamomyces ciferrii]
Length = 722
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + CL+C K+ P+ LLCD C++ FH CL L ++P+ W+C
Sbjct: 214 ETDEDACLIC------KTNSHPQDTLLCDSCDKPFHRYCLS----PPLSKIPQDNWYC 261
>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
Length = 2857
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 11/61 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-----CMD 686
C C+ C +L CD C+R +H+ C+ LR +P+G+W C C++
Sbjct: 2745 CTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVPQGRWHCQECAVCVN 2798
Query: 687 C 687
C
Sbjct: 2799 C 2799
>gi|90167365|sp|Q6ZPK0.2|PF21A_MOUSE RecName: Full=PHD finger protein 21A; AltName: Full=BHC80a;
AltName: Full=BRAF35-HDAC complex protein BHC80;
Short=mBHC80
Length = 659
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 471 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 514
>gi|4808460|dbj|BAA77573.1| Requiem protein [Xenopus laevis]
Length = 290
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C N ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 173 YCDFCLGDSNTNKKSNQPEELVSCSDCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIEC 228
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 229 KCCNICGTSENDDQLLFCDDCDRGYHMYCLS----PPVAEPPEGSWSCHLCLD 277
>gi|37360512|dbj|BAC98234.1| mKIAA1696 protein [Mus musculus]
Length = 660
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 472 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 515
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 956 IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCL----APPLQTVPK 1011
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1012 GGWKCKWCVWC 1022
>gi|281348769|gb|EFB24353.1| hypothetical protein PANDA_011303 [Ailuropoda melanoleuca]
Length = 629
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 440 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 483
>gi|195148883|ref|XP_002015392.1| GL11042 [Drosophila persimilis]
gi|194109239|gb|EDW31282.1| GL11042 [Drosophila persimilis]
Length = 567
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 451 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 508
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 509 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 561
>gi|195144516|ref|XP_002013242.1| GL24024 [Drosophila persimilis]
gi|194102185|gb|EDW24228.1| GL24024 [Drosophila persimilis]
Length = 1343
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 34/66 (51%), Gaps = 12/66 (18%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
CI+ +++ L C +CR G P +LLCD+C H+ C+K +R +P
Sbjct: 1069 CIQWIRSTNKSL--CKVCR------RGSDPDKMLLCDECNGGTHMFCMK----PKMRSVP 1116
Query: 678 KGKWFC 683
KG W+C
Sbjct: 1117 KGHWYC 1122
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1003 IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCL----APPLQTVPK 1058
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1059 GGWKCKWCVWC 1069
>gi|255717396|ref|XP_002554979.1| KLTH0F18260p [Lachancea thermotolerans]
gi|238936362|emb|CAR24542.1| KLTH0F18260p [Lachancea thermotolerans CBS 6340]
Length = 799
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
GC LC+ ++ +TIL CD C++ FH+ CL L+E+PKGKW C
Sbjct: 261 GCTLCQ-----RTNKRTKTIL-CDSCDKPFHIFCLD----PPLKEVPKGKWVC 303
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 955 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1010
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1011 GGWKCKWCVWC 1021
>gi|380798715|gb|AFE71233.1| zinc finger protein neuro-d4 isoform a, partial [Macaca mulatta]
Length = 407
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 289 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 342
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 343 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 389
>gi|356503415|ref|XP_003520505.1| PREDICTED: uncharacterized protein LOC100806087 [Glycine max]
Length = 649
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 54/115 (46%), Gaps = 16/115 (13%)
Query: 771 IPSMVYGRNLRGQE-------FGGMYCAILTVNSSVVSAG--ILRVFGQ------EVAEL 815
+P M Y N Q G Y +L ++SV ++G I + Q E AE+
Sbjct: 96 LPGMNYAANTGKQSKFLERCVTNGKYRTLLIKSTSVGNSGKVIAAITYQIIPADTEYAEI 155
Query: 816 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
PL A + I KG+ L+F + K L + V+SI+ +E+E W K GF +I
Sbjct: 156 PLAAVNAIYQRKGFGHLMFLELRKRLQSVGVRSILCWGDKESEGFWL-KQGFVEI 209
>gi|344241713|gb|EGV97816.1| Histone acetyltransferase MYST4 [Cricetulus griseus]
Length = 709
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|427795587|gb|JAA63245.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1435
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1166 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPK----MDKIPDGDWYC 1207
>gi|198428271|ref|XP_002124973.1| PREDICTED: similar to RING and PHD-finger domain-containing protein
KIAA1542 [Ciona intestinalis]
Length = 1966
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 11/80 (13%)
Query: 604 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 663
R GV E + ++ I +V+ E + + C +C SG T+LLCD C+ +H
Sbjct: 118 RFGGVFVHELVVEKKINVVETHEND-TNCQVC------GSGENEETLLLCDGCDLGYHCA 170
Query: 664 CLKKHKMADLRELPKGKWFC 683
CL L ++P +WFC
Sbjct: 171 CLN----PPLEQVPSDEWFC 186
>gi|319411664|emb|CBQ73708.1| related to histone acetyltransferase 3 (myst) [Sporisorium
reilianum SRZ2]
Length = 1223
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 5/75 (6%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C+ C+ C+ + ++ CD+C+R +H+ CL L + PKG+W C C +
Sbjct: 140 CIECKKCEICRDKGDDAQLMFCDRCDRGWHLYCLSP----PLLKPPKGQWHCPT-CQAAD 194
Query: 692 SVLQNLLVQEAEKLP 706
Q A +P
Sbjct: 195 QNQHQRWPQSAHAMP 209
>gi|302773900|ref|XP_002970367.1| hypothetical protein SELMODRAFT_171548 [Selaginella moellendorffii]
gi|300161883|gb|EFJ28497.1| hypothetical protein SELMODRAFT_171548 [Selaginella moellendorffii]
Length = 431
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 44/102 (43%), Gaps = 16/102 (15%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVL 694
C CD + P+ LLCD C+R +H+ CL +A +PKG WF C CS+
Sbjct: 122 CGNCDRAND---PQRFLLCDGCDRGYHMYCLSPILVA----VPKGDWF-CPHCSKDR--- 170
Query: 695 QNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSG 736
Q+ + P I + +E +VR R SG
Sbjct: 171 -----QQVKVFPMVQRKLIDFFGIEKVEEEPTKEVRRRRHSG 207
>gi|354469852|ref|XP_003497336.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 21A-like
[Cricetulus griseus]
Length = 690
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 502 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 545
>gi|328716148|ref|XP_003245848.1| PREDICTED: hypothetical protein LOC100162709 isoform 3 [Acyrthosiphon
pisum]
Length = 1397
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 8/59 (13%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 688
C C+ C K +L CD C+R +H+ C+ LR +P+G+W C C CS
Sbjct: 1332 CNECKSCAQCKEVADEDKMLFCDLCDRGYHIYCV------GLRRVPEGRWHCQECAMCS 1384
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 859 IKITKVVLSKGWRCLECTVCEACGKASDPGRLLLCDDCDISYHTYCL----APPLQTVPK 914
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 915 GGWKCKWCVWC 925
>gi|449523423|ref|XP_004168723.1| PREDICTED: uncharacterized LOC101205573, partial [Cucumis sativus]
Length = 576
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LCR C ++ I+LCD C+ FH+ C++ L +PKGKWFC
Sbjct: 471 LCRACLINQD---DDKIVLCDGCDHGFHIYCMR----PPLAAIPKGKWFC 513
>gi|29387208|gb|AAH48199.1| MYST4 protein, partial [Homo sapiens]
gi|33874216|gb|AAH14143.1| MYST4 protein, partial [Homo sapiens]
Length = 325
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 250 KFCPELTTNVKALRWQCIECKTCSACRVQGRNADNMLFCDSCDRGFHMECCD----PPLS 305
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 306 RMPKGMWIC 314
>gi|403418283|emb|CCM04983.1| predicted protein [Fibroporia radiculosa]
Length = 1278
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C C+ C+ + G +++CD C+R +H+ CL+ L E+P G+W C +
Sbjct: 167 CSECKNCEICRKKEGENRMIMCDFCDRGWHMDCLQ----PPLVEMPPGRWHCPICPPIPP 222
Query: 692 SVLQNLLVQEA 702
+ Q +L +A
Sbjct: 223 ELPQTVLPTQA 233
>gi|356508991|ref|XP_003523236.1| PREDICTED: histone-lysine N-methyltransferase ATXR5-like [Glycine
max]
Length = 375
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 9/77 (11%)
Query: 625 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC- 683
LE E LLC C SG P +LLCD+C++ FH+ C++ + +P G W C
Sbjct: 53 LERETYSDLLCDQC---GSGELPEELLLCDKCDKGFHMKCVRPIVV----RVPIGSWLCP 105
Query: 684 -CMDCSRINSVLQNLLV 699
C R+ + Q ++
Sbjct: 106 NCQGGKRVRTFSQKKII 122
>gi|323347831|gb|EGA82093.1| Jhd2p [Saccharomyces cerevisiae Lalvin QA23]
Length = 728
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|395544822|ref|XP_003774305.1| PREDICTED: zinc finger protein ubi-d4 [Sarcophilus harrisii]
Length = 423
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 329 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 384
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 385 HMYCL----TPSMSEPPEGSWSCHLCLD 408
>gi|417403866|gb|JAA48716.1| Putative helicase [Desmodus rotundus]
Length = 679
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 490 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 533
>gi|449502294|ref|XP_004174498.1| PREDICTED: PHD finger protein 21A isoform 4 [Taeniopygia guttata]
Length = 632
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 487
>gi|351701967|gb|EHB04886.1| Zinc finger protein ubi-d4 [Heterocephalus glaber]
Length = 601
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 507 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 562
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 563 HMYCL----TPSMSEPPEGSWSCHLCLD 586
>gi|332838463|ref|XP_508960.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 isoform 3 [Pan troglodytes]
Length = 1825
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 699
I+LCD C R +H+ CL D+ + P+GKW C C + +Q +L+
Sbjct: 375 IILCDTCPRAYHMVCLD----PDMEKAPEGKWSCPHCCPALKGKVQKILI 420
>gi|427795843|gb|JAA63373.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1933
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1664 CQFCHSGDNEQMLLLCDGCDKGYHTYCFK----PKMDKIPDGDWYC 1705
>gi|47214709|emb|CAG01062.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1036
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 61/142 (42%), Gaps = 28/142 (19%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQN---MFERKRFLQ- 594
+G + + + C CN V H EC + IP+G W C++C + +++R L+
Sbjct: 226 DGADSNVILFCDSCNIAV-------HQECYGVPYIPEGQWLCRHCLQVRLLPQQRRSLKK 278
Query: 595 -------HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 647
H A A+ V D+V +R + +L+ C LCR + G G
Sbjct: 279 TDDGRWGHVACALWVPEVGFSDTVFIEPIDGVRNIPPARWKLT-CYLCR-----EKGAG- 331
Query: 648 RTILLCDQ--CEREFHVGCLKK 667
+ CD+ C FHV C +K
Sbjct: 332 -ACIQCDKVNCYTAFHVSCAQK 352
>gi|410973629|ref|XP_003993250.1| PREDICTED: PHD finger protein 21A isoform 1 [Felis catus]
Length = 679
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 490 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 533
>gi|356551207|ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786712 [Glycine max]
Length = 525
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 13/88 (14%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 709
I+LCD C+ +H+ C+K + +PKGKWFC + I ++ Q E++K
Sbjct: 414 IVLCDGCDHAYHIYCMKPPQ----NSIPKGKWFCIKCEAGIQAIRQARKAYESKK----- 464
Query: 710 LNAIKKYAGNSLETVSDIDVRWRLLSGK 737
K N + DID +W G+
Sbjct: 465 ----GKVGQNDSKPNEDIDKKWNKKRGR 488
>gi|116008472|ref|NP_724405.2| d4, isoform B [Drosophila melanogaster]
gi|113194576|gb|AAM68376.2| d4, isoform B [Drosophila melanogaster]
Length = 339
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 223 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 280
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 281 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 333
>gi|327276253|ref|XP_003222884.1| PREDICTED: zinc finger protein neuro-d4-like [Anolis carolinensis]
Length = 388
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 270 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 323
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 324 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 370
>gi|344228994|gb|EGV60880.1| JmjC-domain-containing protein [Candida tenuis ATCC 10573]
Length = 749
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 27/58 (46%), Gaps = 12/58 (20%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E EL CLLC P LLC+ C+ +H+ CL L ++P G W+C
Sbjct: 177 EDELENCLLC------GQNHSPDETLLCENCDNPYHIRCL------GLEKVPSGTWYC 222
>gi|151945184|gb|EDN63435.1| jmjc domain-containing histone demethylase [Saccharomyces
cerevisiae YJM789]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|116003927|ref|NP_001070323.1| zinc finger protein neuro-d4 [Bos taurus]
gi|115305354|gb|AAI23606.1| D4, zinc and double PHD fingers family 1 [Bos taurus]
Length = 387
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 269 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 322
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 323 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 369
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 9/61 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW-----FCCMD 686
CL C C+ P +LLCD C+ +H CL L +PKG W CC+
Sbjct: 873 CLECIVCEVCGEASDPSRLLLCDDCDVSYHTYCLD----PPLHTVPKGGWKCKWCVCCVQ 928
Query: 687 C 687
C
Sbjct: 929 C 929
>gi|358341576|dbj|GAA49216.1| histone acetyltransferase MYST4 [Clonorchis sinensis]
Length = 2752
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 23/36 (63%), Gaps = 4/36 (11%)
Query: 648 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ +LLCD C+R FH+ CL+ + ELP+G W C
Sbjct: 191 KDLLLCDSCDRGFHMTCLEP----AVSELPEGSWIC 222
>gi|344280768|ref|XP_003412154.1| PREDICTED: PHD finger protein 21A [Loxodonta africana]
Length = 680
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|323308507|gb|EGA61752.1| Jhd2p [Saccharomyces cerevisiae FostersO]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|255537475|ref|XP_002509804.1| transcription factor, putative [Ricinus communis]
gi|223549703|gb|EEF51191.1| transcription factor, putative [Ricinus communis]
Length = 1509
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 49/99 (49%), Gaps = 13/99 (13%)
Query: 590 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-----LCRGCDFSKSG 644
KR ++HD + G + V +KR R V+ + E+ + L + C+ +SG
Sbjct: 203 KRGMRHDKKCDDGGHRAEVSC----SKRRRRNVEGEKVEVCNKVEKVEELDQICEQCRSG 258
Query: 645 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+LLCD+C + +H+ CL L+++P G W+C
Sbjct: 259 LHGEVMLLCDRCNKGWHIYCLS----PPLKQIPPGNWYC 293
>gi|148669525|gb|EDL01472.1| mCG123147, isoform CRA_c [Mus musculus]
Length = 938
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 261 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 316
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 317 RMPKGMWIC 325
>gi|431915740|gb|ELK16073.1| PHD finger protein 21A [Pteropus alecto]
Length = 666
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 477 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 520
>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
Length = 591
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 44/108 (40%), Gaps = 11/108 (10%)
Query: 579 YCKYCQNMFERKRFLQHDANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C E + V + GR SG + Q T I VK + C+ C
Sbjct: 475 YCDFCLGDSEENKKSNQPEELVSCSDCGR-SGHPTCLQFTANMIISVKKYPWQ---CIEC 530
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL L E P+G W C
Sbjct: 531 KSCGLCGTSDNDDQLLFCDDCDRGYHMYCLN----PPLSEPPEGNWSC 574
>gi|259147582|emb|CAY80833.1| Jhd2p [Saccharomyces cerevisiae EC1118]
gi|349579302|dbj|GAA24465.1| K7_Jhd2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|190409591|gb|EDV12856.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|442622301|ref|NP_001260707.1| d4, isoform D [Drosophila melanogaster]
gi|440214084|gb|AGB93242.1| d4, isoform D [Drosophila melanogaster]
Length = 496
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 380 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 437
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 438 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 490
>gi|6322579|ref|NP_012653.1| Jhd2p [Saccharomyces cerevisiae S288c]
gi|1352920|sp|P47156.1|JHD2_YEAST RecName: Full=Histone demethylase JHD2; AltName: Full=Jumonji/ARID
domain-containing protein 2
gi|1015841|emb|CAA89649.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285813006|tpg|DAA08904.1| TPA: Jhd2p [Saccharomyces cerevisiae S288c]
gi|392298547|gb|EIW09644.1| Jhd2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|395815595|ref|XP_003781311.1| PREDICTED: PHD finger protein 21A [Otolemur garnettii]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKAIPKGMWICPRCQD 488
>gi|348562787|ref|XP_003467190.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein neuro-d4-like
[Cavia porcellus]
Length = 358
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 240 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 293
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 294 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 340
>gi|328716146|ref|XP_003245847.1| PREDICTED: hypothetical protein LOC100162709 isoform 2 [Acyrthosiphon
pisum]
Length = 1426
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 8/59 (13%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 688
C C+ C K +L CD C+R +H+ C+ LR +P+G+W C C CS
Sbjct: 1332 CNECKSCAQCKEVADEDKMLFCDLCDRGYHIYCV------GLRRVPEGRWHCQECAMCS 1384
>gi|298705309|emb|CBJ48999.1| trithorax 3 [Ectocarpus siliculosus]
Length = 2249
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 10/57 (17%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
A GC +C+ D ILLCD CE E+H C+ LR++P +WFC
Sbjct: 367 AAKGGCFVCKKNDKQD------LILLCDGCEGEYHTFCVD----PPLRKIPDDEWFC 413
>gi|296420656|ref|XP_002839885.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636091|emb|CAZ84076.1| unnamed protein product [Tuber melanosporum]
Length = 990
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 646 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
GP L C++C R FH CL L E+P+G WFC
Sbjct: 599 GPGRFLCCERCPRSFHFTCLNP----PLEEVPEGMWFC 632
>gi|207343778|gb|EDZ71134.1| YJR119Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 728
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|432090381|gb|ELK23807.1| PHD finger protein 21A [Myotis davidii]
Length = 554
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 365 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 408
>gi|403362853|gb|EJY81162.1| PHD zinc finger-containing protein [Oxytricha trifallax]
Length = 473
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 34/69 (49%), Gaps = 9/69 (13%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 680
+VK + E S C C+ C+ I++CD C++ H+ CL L ++P
Sbjct: 297 LVKRFQWECSDCKTCKNCN---QNNDEDKIIICDMCDKAVHIHCLN----PPLFQIPSHN 349
Query: 681 WFC--CMDC 687
WFC C++C
Sbjct: 350 WFCKDCVNC 358
>gi|403254657|ref|XP_003920077.1| PREDICTED: PHD finger protein 21A [Saimiri boliviensis boliviensis]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|383863051|ref|XP_003706996.1| PREDICTED: uncharacterized protein LOC100875915 [Megachile rotundata]
Length = 3343
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 618 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 677
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 3157 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 3210
Query: 678 KGKWFC-----CMDCS 688
+G+W C C +C+
Sbjct: 3211 QGRWHCQECAVCANCA 3226
>gi|328716144|ref|XP_001947369.2| PREDICTED: hypothetical protein LOC100162709 isoform 1 [Acyrthosiphon
pisum]
Length = 1495
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 29/59 (49%), Gaps = 8/59 (13%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 688
C C+ C K +L CD C+R +H+ C+ LR +P+G+W C C CS
Sbjct: 1332 CNECKSCAQCKEVADEDKMLFCDLCDRGYHIYCV------GLRRVPEGRWHCQECAMCS 1384
>gi|24585823|ref|NP_724404.1| d4, isoform C [Drosophila melanogaster]
gi|7302246|gb|AAF57339.1| d4, isoform C [Drosophila melanogaster]
Length = 495
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
I Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 379 IAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTANMIISVKRYRWQ- 436
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 437 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 489
>gi|74197305|dbj|BAC32253.2| unnamed protein product [Mus musculus]
Length = 933
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|348564966|ref|XP_003468275.1| PREDICTED: zinc finger protein ubi-d4-like [Cavia porcellus]
Length = 391
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|347800720|ref|NP_001099199.2| zinc finger protein neuro-d4 [Rattus norvegicus]
Length = 387
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 269 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 322
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 323 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 369
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 44/106 (41%), Gaps = 10/106 (9%)
Query: 597 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILL 652
N +E VS + C+ + L + + CL C C+ G +LL
Sbjct: 384 GNDIEGTMVSCATCAQSYHTFCVGLHDKLNSTVVKRGWRCLDCTVCEGCGDGRDESNLLL 443
Query: 653 CDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 696
CD+C+ +H+ CL L +P G W C C C R ++ + N
Sbjct: 444 CDECDISYHIYCLD----PPLECIPHGSWRCKWCATCRRCSAQIPN 485
>gi|297688829|ref|XP_002821872.1| PREDICTED: PHD finger protein 21A isoform 2 [Pongo abelii]
gi|297688831|ref|XP_002821873.1| PREDICTED: PHD finger protein 21A isoform 3 [Pongo abelii]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|119588422|gb|EAW68016.1| PHD finger protein 21A, isoform CRA_c [Homo sapiens]
Length = 649
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 492 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 535
>gi|388454500|ref|NP_001252601.1| PHD finger protein 21A [Macaca mulatta]
gi|387539684|gb|AFJ70469.1| PHD finger protein 21A isoform b [Macaca mulatta]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|10435111|dbj|BAB14492.1| unnamed protein product [Homo sapiens]
gi|20135652|gb|AAM09095.1| BRAF35/HDAC2 complex 80 kDa protein [Homo sapiens]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|18203563|sp|Q9W636.2|REQUB_XENLA RecName: Full=Zinc finger protein ubi-d4 B; AltName: Full=Apoptosis
response zinc finger protein B; AltName: Full=Protein
requiem B; Short=xReq B
Length = 366
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C N ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 249 YCDFCLGDSNTNKKSNQPEELVSCSDCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIEC 304
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 305 KCCNICGTSENDDQLLFCDDCDRGYHMYCLS----PPVAEPPEGSWSCHLCLD 353
>gi|12697937|dbj|BAB21787.1| KIAA1696 protein [Homo sapiens]
Length = 635
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 446 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 489
>gi|410973631|ref|XP_003993251.1| PREDICTED: PHD finger protein 21A isoform 2 [Felis catus]
gi|410973633|ref|XP_003993252.1| PREDICTED: PHD finger protein 21A isoform 3 [Felis catus]
Length = 633
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|395857461|ref|XP_003801110.1| PREDICTED: zinc finger protein DPF3 [Otolemur garnettii]
Length = 489
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 372 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 427
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 428 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 471
>gi|301773852|ref|XP_002922354.1| PREDICTED: PHD finger protein 21A-like, partial [Ailuropoda
melanoleuca]
Length = 582
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 393 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 436
>gi|221507890|gb|EEE33477.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 546
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C C+ C+ + +L+CD C+R +H+ C++ + E+P G WF C DC R
Sbjct: 373 CADCKRCECCQLNTNEEQMLICDACDRAYHMDCMEP----PVEEVPDGTWF-CADCGRCA 427
Query: 692 SVLQNL 697
+ L
Sbjct: 428 CCDRRL 433
>gi|156546910|ref|NP_057705.3| PHD finger protein 21A isoform b [Homo sapiens]
gi|397473591|ref|XP_003808291.1| PREDICTED: PHD finger protein 21A [Pan paniscus]
gi|426368103|ref|XP_004051052.1| PREDICTED: PHD finger protein 21A isoform 1 [Gorilla gorilla
gorilla]
gi|426368105|ref|XP_004051053.1| PREDICTED: PHD finger protein 21A isoform 2 [Gorilla gorilla
gorilla]
gi|119588418|gb|EAW68012.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588419|gb|EAW68013.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588421|gb|EAW68015.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588423|gb|EAW68017.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|168270650|dbj|BAG10118.1| PHD finger protein 21A [synthetic construct]
Length = 634
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|380813476|gb|AFE78612.1| PHD finger protein 21A isoform b [Macaca mulatta]
Length = 633
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|187469691|gb|AAI66788.1| Dpf2 protein [Rattus norvegicus]
Length = 390
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 296 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 351
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 352 HMYCL----TPSMSEPPEGSWSCHLCLD 375
>gi|73982505|ref|XP_861897.1| PREDICTED: PHD finger protein 21A isoform 5 [Canis lupus
familiaris]
Length = 633
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|365764765|gb|EHN06286.1| Jhd2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 661
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|367049578|ref|XP_003655168.1| hypothetical protein THITE_2118555 [Thielavia terrestris NRRL 8126]
gi|347002432|gb|AEO68832.1| hypothetical protein THITE_2118555 [Thielavia terrestris NRRL 8126]
Length = 644
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 15/64 (23%)
Query: 626 EAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E+EL G C +C SG +LLCD C+ +H C+ DL +P G WF
Sbjct: 147 ESELPGSRCPVC------NSGGDEEVLLLCDGCDAAYHTHCI------DLDSVPVGPWF- 193
Query: 684 CMDC 687
CM+C
Sbjct: 194 CMEC 197
>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
Length = 1472
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P+
Sbjct: 1287 LDMVPHIQSYAWQCTDCKTCVQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVPQ 1340
Query: 679 GKWFC-----CMDC 687
G+W C C +C
Sbjct: 1341 GRWHCQECAVCANC 1354
>gi|335289596|ref|XP_003355927.1| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Sus scrofa]
gi|115527907|gb|AAI25154.1| DPF1 protein [Homo sapiens]
gi|119577167|gb|EAW56763.1| D4, zinc and double PHD fingers family 1, isoform CRA_d [Homo
sapiens]
gi|208966118|dbj|BAG73073.1| D4, zinc and double PHD fingers family 1 [synthetic construct]
Length = 387
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 269 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 322
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 323 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 369
>gi|405973225|gb|EKC37949.1| Remodeling and spacing factor 1 [Crassostrea gigas]
Length = 2475
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 6/53 (11%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 697
P +LLCD+C+ FH CL+ M +P G W+C C S ++ + +NL
Sbjct: 1169 PEWLLLCDKCDAAFHTACLRPPLMI----IPDGDWYCPPCEHKSLVSRLQENL 1217
>gi|354505054|ref|XP_003514587.1| PREDICTED: zinc finger protein ubi-d4-like [Cricetulus griseus]
gi|344258641|gb|EGW14745.1| Zinc finger protein ubi-d4 [Cricetulus griseus]
Length = 391
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|347964247|ref|XP_311194.5| AGAP000668-PA [Anopheles gambiae str. PEST]
gi|333467442|gb|EAA06826.6| AGAP000668-PA [Anopheles gambiae str. PEST]
Length = 1733
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 28/52 (53%), Gaps = 10/52 (19%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C++CR + G P LLCD+C R H CLK L+E+P G W+C
Sbjct: 1216 CMICR-----RKG-DPGLTLLCDECNRACHTYCLKPK----LKEVPAGDWYC 1257
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 727 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 782
Query: 679 GKWFC 683
G W C
Sbjct: 783 GGWKC 787
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like
[Oreochromis niloticus]
Length = 4907
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 9/73 (12%)
Query: 620 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 679
+I K + + CL C C+ P +LLCD C+ +H CL L +PKG
Sbjct: 880 KITKTMLRKGWRCLECIVCEVCGKASDPSRLLLCDDCDVSYHTYCLD----PPLHTVPKG 935
Query: 680 KW-----FCCMDC 687
W CC+ C
Sbjct: 936 GWKCKWCVCCVQC 948
>gi|297682047|ref|XP_002818745.1| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Pongo
abelii]
Length = 1215
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 653 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 708
Query: 679 GKWFC 683
G W C
Sbjct: 709 GGWKC 713
>gi|237839305|ref|XP_002368950.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211966614|gb|EEB01810.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 551
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C C+ C+ + +L+CD C+R +H+ C++ + E+P G WF C DC R
Sbjct: 378 CADCKRCECCQLNTNEEQMLICDACDRAYHMDCMEP----PVEEVPDGTWF-CADCGRCA 432
Query: 692 SVLQNL 697
+ L
Sbjct: 433 CCDRRL 438
>gi|63146269|gb|AAH95974.1| Myst4 protein, partial [Mus musculus]
Length = 828
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 251 KFCPELTANVKALRWQCIECKTCSACRVQGKNADNMLFCDSCDRGFHMECCD----PPLS 306
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 307 RMPKGMWIC 315
>gi|13431818|sp|Q9QX66.2|DPF1_MOUSE RecName: Full=Zinc finger protein neuro-d4; AltName:
Full=BRG1-associated factor 45B; Short=BAF45B; AltName:
Full=D4, zinc and double PHD fingers family 1
Length = 387
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 269 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 322
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 323 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 369
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 42.0 bits (97), Expect = 1.4, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 42 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 97
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 98 GGWKCKWCVWC 108
>gi|355711240|gb|AES03946.1| PHD finger protein 21A [Mustela putorius furo]
Length = 679
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|242011982|ref|XP_002426722.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
gi|212510893|gb|EEB13984.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
Length = 1196
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C + G +LLCD C+R FH+ CLK L +P G WFC
Sbjct: 1077 CKVCRRGGDGENMLLCDSCDRGFHLYCLK----PKLSSVPLGDWFC 1118
>gi|195382643|ref|XP_002050039.1| GJ20409 [Drosophila virilis]
gi|194144836|gb|EDW61232.1| GJ20409 [Drosophila virilis]
Length = 490
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 374 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTPNMIISVKRYRWQ- 431
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 432 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 484
>gi|126010798|gb|AAI33649.1| DPF1 protein [Bos taurus]
Length = 388
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 270 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 323
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 324 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 370
>gi|157817959|ref|NP_001101986.1| zinc finger protein ubi-d4 [Rattus norvegicus]
gi|149062118|gb|EDM12541.1| D4, zinc and double PHD fingers family 2 (predicted) [Rattus
norvegicus]
Length = 391
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|432948522|ref|XP_004084087.1| PREDICTED: uncharacterized protein LOC101155159, partial [Oryzias
latipes]
Length = 1229
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 9/56 (16%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNL 697
P ILLCD C+ +H CL+ M +P G+WFC C C ++ L NL
Sbjct: 876 PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKQLCDKLEEQLLNL 927
>gi|431909715|gb|ELK12873.1| Zinc finger protein neuro-d4 [Pteropus alecto]
Length = 425
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 307 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 360
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 361 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 407
>gi|390332557|ref|XP_786050.3| PREDICTED: uncharacterized protein LOC580929 isoform 3
[Strongylocentrotus purpuratus]
Length = 2843
Score = 42.0 bits (97), Expect = 1.4, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C+ C G G + +L+CD C + +H CLK L+ +PKG W C
Sbjct: 260 CAECKTCAICSQG-GTKELLVCDACNQGYHASCLK----PPLKRIPKGCWRC 306
>gi|328713195|ref|XP_001950157.2| PREDICTED: hypothetical protein LOC100158992 [Acyrthosiphon pisum]
Length = 1118
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 31/68 (45%), Gaps = 7/68 (10%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 675
KR K E E +C+ C+ S P ILLCD C + +H CL+ M
Sbjct: 874 KRARTAHKKDEGEQLKDDMCQKCNKSDQ---PEWILLCDTCNQGWHASCLRPPLMV---- 926
Query: 676 LPKGKWFC 683
+P G W+C
Sbjct: 927 IPDGDWYC 934
>gi|326529635|dbj|BAK04764.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1413
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 71/153 (46%), Gaps = 18/153 (11%)
Query: 623 KNLEAELS-GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 681
+ L+AE+S +C C+ SG +LLCD+C++ +H+ CL L +P G W
Sbjct: 303 RKLDAEVSQDDQVCEQCE---SGLHGDAMLLCDRCDKGWHMYCLS----PPLESVPPGNW 355
Query: 682 FCCMDCSRINS----VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-VRWRLLSG 736
+C DC +S ++ + + L F +K+ G + ++ + W ++ G
Sbjct: 356 YCS-DCMNSDSDCFGFVERRKTCQLDALQRFDEKVRRKWFGQRPPSRVQVEKLFWEIVEG 414
Query: 737 KAATPE----TRLLLSQAVAIFHDCFDPIVDSI 765
KA E + + S+ + F DP++ S+
Sbjct: 415 KAGELEVMYGSDMDTSKYGSGFPRLGDPVLPSV 447
>gi|291384911|ref|XP_002708905.1| PREDICTED: BRAF35/HDAC2 complex [Oryctolagus cuniculus]
Length = 682
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
>gi|221483410|gb|EEE21729.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
Length = 556
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C C+ C+ + +L+CD C+R +H+ C++ + E+P G WF C DC R
Sbjct: 383 CADCKRCECCQLNTNEEQMLICDACDRAYHMDCMEP----PVEEVPDGTWF-CADCGRCA 437
Query: 692 SVLQNL 697
+ L
Sbjct: 438 CCDRRL 443
>gi|395543775|ref|XP_003773788.1| PREDICTED: PHD finger protein 21A [Sarcophilus harrisii]
Length = 714
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 499 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 542
>gi|390332561|ref|XP_003723529.1| PREDICTED: uncharacterized protein LOC580929 isoform 2
[Strongylocentrotus purpuratus]
Length = 3278
Score = 42.0 bits (97), Expect = 1.4, Method: Composition-based stats.
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C+ C G G + +L+CD C + +H CLK L+ +PKG W C
Sbjct: 260 CAECKTCAICSQG-GTKELLVCDACNQGYHASCLK----PPLKRIPKGCWRC 306
>gi|293346151|ref|XP_002726280.1| PREDICTED: PHD finger protein 21A isoform 2 [Rattus norvegicus]
gi|293357934|ref|XP_002729244.1| PREDICTED: PHD finger protein 21A isoform 1 [Rattus norvegicus]
gi|149022666|gb|EDL79560.1| similar to PHF21A protein (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 610
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 422 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 465
>gi|197098008|ref|NP_001127678.1| zinc finger protein ubi-d4 [Pongo abelii]
gi|332250195|ref|XP_003274239.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Nomascus
leucogenys]
gi|397516926|ref|XP_003828672.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Pan paniscus]
gi|402892869|ref|XP_003909629.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Papio anubis]
gi|403293484|ref|XP_003937746.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Saimiri
boliviensis boliviensis]
gi|426369137|ref|XP_004051553.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Gorilla gorilla
gorilla]
gi|56403615|emb|CAI29608.1| hypothetical protein [Pongo abelii]
Length = 405
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 311 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 366
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 367 HMYCL----TPSMSEPPEGSWSCHLCLD 390
>gi|6755314|ref|NP_035392.1| zinc finger protein ubi-d4 [Mus musculus]
gi|2500148|sp|Q61103.1|REQU_MOUSE RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|1167972|gb|AAC52783.1| ubi-d4 [Mus musculus]
gi|12836275|dbj|BAB23583.1| unnamed protein product [Mus musculus]
gi|15215228|gb|AAH12709.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
gi|74184334|dbj|BAE25702.1| unnamed protein product [Mus musculus]
gi|74201274|dbj|BAE26098.1| unnamed protein product [Mus musculus]
gi|74201435|dbj|BAE26153.1| unnamed protein product [Mus musculus]
gi|74206142|dbj|BAE23543.1| unnamed protein product [Mus musculus]
gi|148701237|gb|EDL33184.1| D4, zinc and double PHD fingers family 2 [Mus musculus]
Length = 391
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|426245369|ref|XP_004016484.1| PREDICTED: PHD finger protein 21A isoform 3 [Ovis aries]
Length = 682
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 490 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 533
>gi|312374253|gb|EFR21843.1| hypothetical protein AND_16267 [Anopheles darlingi]
Length = 2451
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 18/79 (22%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
E EL+ C +CR + T+LLCD C +H+ CL L E+P G W+C
Sbjct: 222 EDELTYCEVCRLAHSEE------TMLLCDACNLGYHMECLNPP----LLEIPTGSWYC-- 269
Query: 686 DC------SRINSVLQNLL 698
DC ++ LQ+LL
Sbjct: 270 DCCFASGTDEDDNELQDLL 288
>gi|329664170|ref|NP_001193126.1| PHD finger protein 21A [Bos taurus]
gi|296479679|tpg|DAA21794.1| TPA: PHD finger protein 21A isoform 2 [Bos taurus]
Length = 681
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 489 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 532
>gi|440895555|gb|ELR47708.1| PHD finger protein 21A, partial [Bos grunniens mutus]
Length = 666
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 474 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 517
>gi|410900220|ref|XP_003963594.1| PREDICTED: lysine-specific demethylase 5C-like [Takifugu rubripes]
Length = 1592
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 36/76 (47%), Gaps = 8/76 (10%)
Query: 609 DSVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 667
DS +K +R+ +N+ + +CR C G +LLCD CE +H CL
Sbjct: 351 DSDRPFSKMTMRLRRNISNPQCVDSFVCRMCG---RGEDDEKLLLCDGCEDNYHTYCL-- 405
Query: 668 HKMADLRELPKGKWFC 683
+ L +LPKG W C
Sbjct: 406 --LPALTDLPKGNWRC 419
>gi|355755784|gb|EHH59531.1| hypothetical protein EGM_09668 [Macaca fascicularis]
Length = 340
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 222 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 275
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 276 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 322
>gi|348558812|ref|XP_003465210.1| PREDICTED: PHD finger protein 21A isoform 3 [Cavia porcellus]
Length = 682
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 492 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 535
>gi|343961817|dbj|BAK62496.1| jumonji/ARID domain-containing protein 1C [Pan troglodytes]
Length = 384
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 610 SVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
S E TK +R+ +N A+ +CR C G +LLCD C+ +H+ CL
Sbjct: 301 SPEPCTKMTMRLRRNHSNAQFIESYVCRMCS---RGDEDDKLLLCDGCDDNYHIFCL--- 354
Query: 669 KMADLRELPKGKWFC 683
+ L E+PKG W C
Sbjct: 355 -LPPLPEIPKGVWRC 368
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 942 IKISKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 997
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 998 GGWKCKWCVWC 1008
>gi|147904561|ref|NP_001081346.1| zinc finger protein ubi-d4 B [Xenopus laevis]
gi|47682308|gb|AAH70839.1| LOC397786 protein [Xenopus laevis]
gi|52078456|gb|AAH82478.1| LOC397786 protein [Xenopus laevis]
Length = 387
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C N ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 270 YCDFCLGDSNTNKKSNQPEELVSCSDCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIEC 325
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 326 KCCNICGTSENDDQLLFCDDCDRGYHMYCLS----PPVAEPPEGSWSCHLCLD 374
>gi|390470774|ref|XP_003734353.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Callithrix
jacchus]
Length = 405
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 311 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 366
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 367 HMYCL----TPSMSEPPEGSWSCHLCLD 390
>gi|149725409|ref|XP_001492666.1| PREDICTED: zinc finger protein ubi-d4 [Equus caballus]
Length = 391
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|22326695|ref|NP_196541.2| histone-lysine N-methyltransferase ATXR5 [Arabidopsis thaliana]
gi|75248529|sp|Q8VZJ1.1|ATXR5_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR5; AltName:
Full=Protein SET DOMAIN GROUP 15; AltName:
Full=Trithorax-related protein 5; Short=TRX-related
protein 5
gi|17381240|gb|AAL36039.1| AT5g09790/F17I14_20 [Arabidopsis thaliana]
gi|21464579|gb|AAM52244.1| AT5g09790/F17I14_20 [Arabidopsis thaliana]
gi|332004063|gb|AED91446.1| histone-lysine N-methyltransferase ATXR5 [Arabidopsis thaliana]
Length = 352
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C+ SG G +LLCD+C+R FH+ CL+ + +P G W C+DCS
Sbjct: 67 CEKCGSGEGDDELLLCDKCDRGFHMKCLR----PIVVRVPIGTWL-CVDCS 112
>gi|116207774|ref|XP_001229696.1| hypothetical protein CHGG_03180 [Chaetomium globosum CBS 148.51]
gi|88183777|gb|EAQ91245.1| hypothetical protein CHGG_03180 [Chaetomium globosum CBS 148.51]
Length = 650
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
R C S +LLCD C+ +H C+ DL ++P+G WF CM+C
Sbjct: 145 RRCPVCNSAGDEEVLLLCDGCDAAYHTQCV------DLDDVPRGPWF-CMEC 189
>gi|19112966|ref|NP_596174.1| Lid2 complex subunit, predicted histone demethylase Lid2
[Schizosaccharomyces pombe 972h-]
gi|74698143|sp|Q9HDV4.1|LID2_SCHPO RecName: Full=Lid2 complex component lid2; Short=Lid2C component
lid2
gi|12044483|emb|CAC19756.1| Lid2 complex subunit, predicted histone demethylase Lid2
[Schizosaccharomyces pombe]
Length = 1513
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 24/49 (48%), Gaps = 7/49 (14%)
Query: 635 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C K+ P TILLCD CE +H CL L +PK W+C
Sbjct: 271 CEYCGLDKN---PETILLCDGCEAAYHTSCLD----PPLTSIPKEDWYC 312
>gi|218198934|gb|EEC81361.1| hypothetical protein OsI_24556 [Oryza sativa Indica Group]
Length = 1699
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/136 (23%), Positives = 58/136 (42%), Gaps = 12/136 (8%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR-INSVL 694
C+ SG +LLCD+C++ +H+ CL L +P G W+C CM+ R +
Sbjct: 259 CEQCNSGLHGEVMLLCDRCDKGWHLYCLS----PPLESVPPGNWYCSECMNSDRNCFGFV 314
Query: 695 QNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR-WRLLSGKAATPETRLLLSQAVAI 753
+ E +F K++ G+ + ++ + W ++ GK E +I
Sbjct: 315 HRRKTCQVETFRKFEERVRKRWFGHKNPSRVQVEKQFWEIVEGKVGELEVMYGSDLDTSI 374
Query: 754 FHDCF----DPIVDSI 765
+ F DP+ S+
Sbjct: 375 YGSGFPRLCDPVPSSV 390
>gi|145334355|ref|NP_001078559.1| histone-lysine N-methyltransferase ATXR5 [Arabidopsis thaliana]
gi|7671410|emb|CAB89351.1| putative protein [Arabidopsis thaliana]
gi|332004064|gb|AED91447.1| histone-lysine N-methyltransferase ATXR5 [Arabidopsis thaliana]
Length = 379
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C+ SG G +LLCD+C+R FH+ CL+ + +P G W C+DCS
Sbjct: 67 CEKCGSGEGDDELLLCDKCDRGFHMKCLR----PIVVRVPIGTWL-CVDCS 112
>gi|449666731|ref|XP_002165158.2| PREDICTED: uncharacterized protein LOC100198869 [Hydra
magnipapillata]
Length = 1600
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%), Gaps = 5/38 (13%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
+LLCD C+R FH+ CL L+++P G WFC DC
Sbjct: 1332 MLLCDGCDRGFHMNCLN----PPLKKVPTGNWFCS-DC 1364
>gi|13938144|gb|AAH07188.1| Dpf2 protein, partial [Mus musculus]
Length = 351
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 257 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 312
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 313 HMYCL----TPSMSEPPEGSWSCHLCLD 336
>gi|5454004|ref|NP_006259.1| zinc finger protein ubi-d4 [Homo sapiens]
gi|350534556|ref|NP_001233580.1| zinc finger protein ubi-d4 [Pan troglodytes]
gi|73983120|ref|XP_866588.1| PREDICTED: zinc finger protein ubi-d4 isoform 2 [Canis lupus
familiaris]
gi|332250193|ref|XP_003274238.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Nomascus
leucogenys]
gi|397516924|ref|XP_003828671.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Pan paniscus]
gi|402892867|ref|XP_003909628.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Papio anubis]
gi|403293482|ref|XP_003937745.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Saimiri
boliviensis boliviensis]
gi|410974412|ref|XP_003993641.1| PREDICTED: zinc finger protein ubi-d4 [Felis catus]
gi|426369135|ref|XP_004051552.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Gorilla gorilla
gorilla]
gi|2842711|sp|Q92785.2|REQU_HUMAN RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName:
Full=BRG1-associated factor 45D; Short=BAF45D; AltName:
Full=D4, zinc and double PHD fingers family 2; AltName:
Full=Protein requiem
gi|2121234|gb|AAB58307.1| requiem homolog [Homo sapiens]
gi|2529705|gb|AAB81203.1| requiem [Homo sapiens]
gi|15928853|gb|AAH14889.1| D4, zinc and double PHD fingers family 2 [Homo sapiens]
gi|28144169|gb|AAO26041.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|30582275|gb|AAP35364.1| requiem, apoptosis response zinc finger gene [Homo sapiens]
gi|61361059|gb|AAX41982.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|61361064|gb|AAX41983.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
gi|119594781|gb|EAW74375.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|119594782|gb|EAW74376.1| D4, zinc and double PHD fingers family 2, isoform CRA_a [Homo
sapiens]
gi|123983164|gb|ABM83323.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|123997873|gb|ABM86538.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|158257320|dbj|BAF84633.1| unnamed protein product [Homo sapiens]
gi|208967739|dbj|BAG72515.1| D4, zinc and double PHD fingers family 2 [synthetic construct]
gi|343962233|dbj|BAK62704.1| zinc-finger protein ubi-d4 [Pan troglodytes]
gi|355566318|gb|EHH22697.1| Protein requiem [Macaca mulatta]
gi|355751970|gb|EHH56090.1| Protein requiem [Macaca fascicularis]
gi|380815318|gb|AFE79533.1| zinc finger protein ubi-d4 [Macaca mulatta]
gi|410218232|gb|JAA06335.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410249592|gb|JAA12763.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410288496|gb|JAA22848.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
gi|410336195|gb|JAA37044.1| D4, zinc and double PHD fingers family 2 [Pan troglodytes]
Length = 391
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|348518782|ref|XP_003446910.1| PREDICTED: zinc finger protein ubi-d4-like [Oreochromis niloticus]
Length = 399
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 11/99 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 303 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNMCGTSENDDQLLFCDDCDRGY 358
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMDCSRIN-SVLQN 696
H+ CL + E P+G W C C+D + S+ QN
Sbjct: 359 HMYCLN----PPMSEPPEGSWSCHLCLDLLKDKASIYQN 393
>gi|158289753|ref|XP_311414.4| AGAP010697-PA [Anopheles gambiae str. PEST]
gi|157018478|gb|EAA06992.4| AGAP010697-PA [Anopheles gambiae str. PEST]
Length = 2062
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 37/77 (48%), Gaps = 18/77 (23%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
EL+ C +CR + T+LLCD C +H+ CL L E+P G W+C DC
Sbjct: 114 ELTYCEVCRQAHSEE------TMLLCDSCNLGYHMECLNPP----LLEIPSGSWYC--DC 161
Query: 688 ------SRINSVLQNLL 698
++ LQ+LL
Sbjct: 162 CFASGSDEDDNELQDLL 178
>gi|432091134|gb|ELK24346.1| Zinc finger protein ubi-d4 [Myotis davidii]
Length = 405
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 311 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 366
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 367 HMYCL----TPSMSEPPEGSWSCHLCLD 390
>gi|431910278|gb|ELK13351.1| Zinc finger protein ubi-d4 [Pteropus alecto]
Length = 391
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|30584805|gb|AAP36655.1| Homo sapiens requiem, apoptosis response zinc finger gene
[synthetic construct]
gi|61370771|gb|AAX43549.1| D4 zinc and double PHD fingers family 2 [synthetic construct]
Length = 392
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|2131106|emb|CAA89651.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 368
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
E + C++CR + P+ +LCD C++ FH+ CL L +P G W C
Sbjct: 232 EDDDDACIVCRKTN------DPKRTILCDSCDKPFHIYCLS----PPLERVPSGDWIC 279
>gi|9759010|dbj|BAB09537.1| unnamed protein product [Arabidopsis thaliana]
Length = 378
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C+ SG G +LLCD+C+R FH+ CL+ + +P G W C+DCS
Sbjct: 67 CEKCGSGEGDDELLLCDKCDRGFHMKCLR----PIVVRVPIGTWL-CVDCS 112
>gi|7305309|ref|NP_038902.1| zinc finger protein neuro-d4 [Mus musculus]
gi|6649546|gb|AAF21455.1|U48238_1 zinc finger protein neuro-d4 [Mus musculus]
gi|30481687|gb|AAH52348.1| D4, zinc and double PHD fingers family 1 [Mus musculus]
gi|148692120|gb|EDL24067.1| neuronal d4 domain family member [Mus musculus]
Length = 388
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 270 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 323
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 324 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 370
>gi|311247329|ref|XP_003122585.1| PREDICTED: zinc finger protein ubi-d4 [Sus scrofa]
Length = 391
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|71388893|gb|AAZ31374.1| ATXR5 [Arabidopsis thaliana]
Length = 379
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C+ SG G +LLCD+C+R FH+ CL+ + +P G W C+DCS
Sbjct: 67 CEKCGSGEGDDELLLCDKCDRGFHMKCLR----PIVVRVPIGTWL-CVDCS 112
>gi|6648956|gb|AAF21306.1|AF108134_1 ubi-d4/requiem [Mus musculus]
Length = 380
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 286 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 341
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 342 HMYCL----TPSMSEPPEGSWSCHLCLD 365
>gi|297267436|ref|XP_002808108.1| PREDICTED: LOW QUALITY PROTEIN: zinc finger protein ubi-d4-like
[Macaca mulatta]
Length = 391
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|149056396|gb|EDM07827.1| neuronal d4 domain family member, isoform CRA_b [Rattus norvegicus]
gi|166796914|gb|AAI59416.1| D4, zinc and double PHD fingers family 1 [Rattus norvegicus]
Length = 332
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 214 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 267
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 268 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 314
>gi|338712006|ref|XP_003362635.1| PREDICTED: PHD finger protein 21A [Equus caballus]
Length = 675
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 486 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 529
>gi|149022665|gb|EDL79559.1| similar to PHF21A protein (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 563
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 375 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 418
>gi|391344898|ref|XP_003746731.1| PREDICTED: zinc finger protein DPF3-like [Metaseiulus occidentalis]
Length = 470
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 46/108 (42%), Gaps = 11/108 (10%)
Query: 579 YCKYCQNM-FERKRFLQHD--ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C E K+ Q + + GR SG + Q T VK + C+ C
Sbjct: 349 YCDFCLGTESENKKTKQPEEMVTCADCGR-SGHPTCLQFTDVMTNNVKKYRWQ---CIEC 404
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL L+E P+G W C
Sbjct: 405 KTCTLCGTSENDDQMLFCDDCDRGYHMYCLS----PPLKEPPEGSWSC 448
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 967 IKISKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 1022
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 1023 GGWKCKWCVWC 1033
>gi|348523632|ref|XP_003449327.1| PREDICTED: zinc finger protein neuro-d4-like [Oreochromis
niloticus]
Length = 381
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 67/167 (40%), Gaps = 30/167 (17%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAECASL----------------SSIPQG 576
CG++ YKN G+ H ++ ++ + E +E +L S I G
Sbjct: 211 CGKR----YKNRPGLSYHYTHTHLADEEGEEDSERHTLPFQRKNNHKPKKAADGSVIANG 266
Query: 577 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 636
YC +C ++ + + + GR SG S Q T V+ + C+ C+
Sbjct: 267 --YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---CIECK 320
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 321 SCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMSEPPEGSWSC 363
>gi|293346153|ref|XP_002726281.1| PREDICTED: PHD finger protein 21A isoform 3 [Rattus norvegicus]
gi|392346534|ref|XP_003749579.1| PREDICTED: PHD finger protein 21A [Rattus norvegicus]
Length = 562
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 374 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 417
>gi|195027395|ref|XP_001986568.1| GH21439 [Drosophila grimshawi]
gi|193902568|gb|EDW01435.1| GH21439 [Drosophila grimshawi]
Length = 504
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 388 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTPNMIISVKRYRWQ- 445
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 446 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 498
>gi|440894955|gb|ELR47273.1| Zinc finger protein neuro-d4, partial [Bos grunniens mutus]
Length = 383
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 265 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 318
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 319 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 365
>gi|353229269|emb|CCD75440.1| putative bromodomain containing protein [Schistosoma mansoni]
Length = 2486
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 15/65 (23%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC-SRINSVLQNLLVQEAEKLPEF 708
+LLCD C+R +H C + L +P G WF C DC S+ S P +
Sbjct: 2211 LLLCDGCDRGYHTYCFRPQ----LSNIPSGDWF-CYDCVSKATS---------KHLKPTY 2256
Query: 709 HLNAI 713
HLNA+
Sbjct: 2257 HLNAL 2261
>gi|256087592|ref|XP_002579950.1| bromodomain containing protein [Schistosoma mansoni]
Length = 2487
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 15/65 (23%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC-SRINSVLQNLLVQEAEKLPEF 708
+LLCD C+R +H C + L +P G WF C DC S+ S P +
Sbjct: 2211 LLLCDGCDRGYHTYCFRPQ----LSNIPSGDWF-CYDCVSKATS---------KHLKPTY 2256
Query: 709 HLNAI 713
HLNA+
Sbjct: 2257 HLNAL 2261
>gi|148695645|gb|EDL27592.1| PHD finger protein 21A, isoform CRA_b [Mus musculus]
Length = 605
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 417 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 460
>gi|157841178|ref|NP_001103161.1| PHD finger protein 21A isoform 3 [Mus musculus]
gi|26332010|dbj|BAC29735.1| unnamed protein product [Mus musculus]
Length = 604
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 416 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 459
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 6/71 (8%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + + CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 943 IKITKVVLHKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 998
Query: 679 GKWFC--CMDC 687
G W C C+ C
Sbjct: 999 GGWKCKWCVWC 1009
>gi|195446065|ref|XP_002070611.1| GK12157 [Drosophila willistoni]
gi|194166696|gb|EDW81597.1| GK12157 [Drosophila willistoni]
Length = 1497
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LC+ C + G P +LLCD+C H+ C+K +R +P+G W+C
Sbjct: 1084 LCKVC---RRGTDPEKMLLCDECNAGTHMFCMK----PKMRTVPEGNWYC 1126
>gi|154757359|gb|AAI51762.1| MYST4 protein [Bos taurus]
Length = 349
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 5/69 (7%)
Query: 616 KRCIRIVKNLEAELSGCLLCRGCDFSK-SGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
K C + N++A C+ C+ C + G +L CD C+R FH+ C L
Sbjct: 249 KFCPELTTNVKALRWQCIECKTCSACRIQGKNADNMLFCDSCDRGFHMECCD----PPLS 304
Query: 675 ELPKGKWFC 683
+PKG W C
Sbjct: 305 RMPKGMWIC 313
>gi|344295884|ref|XP_003419640.1| PREDICTED: zinc finger protein ubi-d4 [Loxodonta africana]
Length = 391
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2875 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2918
>gi|205830434|ref|NP_001128628.1| zinc finger protein neuro-d4 isoform c [Homo sapiens]
gi|395751088|ref|XP_002829195.2| PREDICTED: zinc finger protein neuro-d4 isoform 1 [Pongo abelii]
gi|395751090|ref|XP_003779218.1| PREDICTED: zinc finger protein neuro-d4 [Pongo abelii]
gi|395847019|ref|XP_003796184.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Otolemur
garnettii]
gi|395847023|ref|XP_003796186.1| PREDICTED: zinc finger protein neuro-d4 isoform 4 [Otolemur
garnettii]
gi|402905397|ref|XP_003915506.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Papio anubis]
gi|402905399|ref|XP_003915507.1| PREDICTED: zinc finger protein neuro-d4 isoform 3 [Papio anubis]
gi|410983102|ref|XP_003997882.1| PREDICTED: zinc finger protein neuro-d4 isoform 2 [Felis catus]
gi|193787694|dbj|BAG52900.1| unnamed protein product [Homo sapiens]
Length = 332
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 214 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 267
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 268 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 314
>gi|350580137|ref|XP_003353943.2| PREDICTED: PHD finger protein 21A, partial [Sus scrofa]
Length = 517
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 325 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 368
>gi|224071864|ref|XP_002303585.1| jumonji domain protein [Populus trichocarpa]
gi|222841017|gb|EEE78564.1| jumonji domain protein [Populus trichocarpa]
Length = 1503
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 7/65 (10%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
+++ +E E +C C +SG +LLCD+C + +H+ CL L+++P
Sbjct: 241 VKVCNKVEEEEEHDQICEQC---RSGLHGEVMLLCDRCNKGWHIYCLS----PPLKQVPP 293
Query: 679 GKWFC 683
G W+C
Sbjct: 294 GNWYC 298
>gi|222639770|gb|EEE67902.1| hypothetical protein OsJ_25746 [Oryza sativa Japonica Group]
Length = 904
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LCRGC K I++CD C+ +H+ C++ + +PKGKW+C
Sbjct: 476 LCRGCFQDKDD---DQIVMCDGCDEGYHIYCMRPAR----NTIPKGKWYC 518
>gi|119577164|gb|EAW56760.1| D4, zinc and double PHD fingers family 1, isoform CRA_a [Homo
sapiens]
Length = 425
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 307 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 360
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 361 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 407
>gi|390332559|ref|XP_003723528.1| PREDICTED: uncharacterized protein LOC580929 isoform 1
[Strongylocentrotus purpuratus]
Length = 3300
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C+ C G G + +L+CD C + +H CLK L+ +PKG W C
Sbjct: 260 CAECKTCAICSQG-GTKELLVCDACNQGYHASCLK----PPLKRIPKGCWRC 306
>gi|320168272|gb|EFW45171.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1143
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 30/62 (48%), Gaps = 11/62 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
E + GC +C + + +L+CD CE H+ CLK L+ +P G WFC
Sbjct: 986 EPAVDGCSVCHSNEQEEK------LLICDGCEAMMHMFCLKPA----LKRVPAGDWFCD- 1034
Query: 686 DC 687
DC
Sbjct: 1035 DC 1036
>gi|256053108|ref|XP_002570049.1| hepatitis B virus X associated protein hbxa [Schistosoma mansoni]
gi|353229228|emb|CCD75399.1| putative hepatitis B virus X associated protein, hbxa [Schistosoma
mansoni]
Length = 1273
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 32/59 (54%), Gaps = 6/59 (10%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAE 703
P ILLCD+C+ H CL L +P+G WFC C + I+++ + + V EAE
Sbjct: 623 PDWILLCDRCDLGHHAMCLS----PPLHIIPEGDWFCPRCQHATLISALKETITVLEAE 677
>gi|149635024|ref|XP_001507820.1| PREDICTED: PHD finger protein 21A [Ornithorhynchus anatinus]
Length = 638
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 448 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 491
>gi|45383492|ref|NP_989662.1| zinc finger protein ubi-d4 [Gallus gallus]
gi|18202299|sp|P58268.1|REQU_CHICK RecName: Full=Zinc finger protein ubi-d4; AltName: Full=Apoptosis
response zinc finger protein; AltName: Full=Protein
requiem
gi|14010356|gb|AAK51965.1|AF362751_1 requiem [Gallus gallus]
Length = 405
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 311 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 366
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 367 HMYCL----TPPMSEPPEGSWSCHLCLD 390
>gi|1083466|pir||A55302 probable transcription factor requiem - mouse
gi|606661|gb|AAA64637.1| Requiem [Mus musculus]
Length = 371
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 277 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 332
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 333 HMYCL----TPSMSEPPEGSWSCHLCLD 356
>gi|154152087|ref|NP_001093826.1| zinc finger protein ubi-d4 [Bos taurus]
gi|296218741|ref|XP_002755572.1| PREDICTED: zinc finger protein ubi-d4 isoform 1 [Callithrix
jacchus]
gi|426252022|ref|XP_004019718.1| PREDICTED: zinc finger protein ubi-d4 [Ovis aries]
gi|118582243|gb|ABL07500.1| zinc-finger protein ubi-d4 [Capra hircus]
gi|151557067|gb|AAI49970.1| DPF2 protein [Bos taurus]
gi|152941218|gb|ABS45046.1| D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|296471617|tpg|DAA13732.1| TPA: D4, zinc and double PHD fingers family 2 [Bos taurus]
gi|417400089|gb|JAA47013.1| Putative transcription factor requiem/neuro-d4 [Desmodus rotundus]
Length = 391
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|296479678|tpg|DAA21793.1| TPA: PHD finger protein 21A isoform 1 [Bos taurus]
Length = 635
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 443 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 486
>gi|312383274|gb|EFR28426.1| hypothetical protein AND_03646 [Anopheles darlingi]
Length = 2544
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 26/53 (49%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C+ D P ILLCD C++ +H CLK L +P+G WFC
Sbjct: 954 ACQECKKMDQ------PEWILLCDSCDKGYHCACLKPV----LFTIPEGDWFC 996
>gi|426245365|ref|XP_004016482.1| PREDICTED: PHD finger protein 21A isoform 1 [Ovis aries]
gi|426245367|ref|XP_004016483.1| PREDICTED: PHD finger protein 21A isoform 2 [Ovis aries]
Length = 636
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|301092341|ref|XP_002997028.1| histone deacetylase, putative [Phytophthora infestans T30-4]
gi|262112154|gb|EEY70206.1| histone deacetylase, putative [Phytophthora infestans T30-4]
Length = 1512
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 39/64 (60%), Gaps = 12/64 (18%)
Query: 627 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--C 684
A+ +GC++C G D ++G +ILLCD C+ E+H+ CL + L E+P G ++C C
Sbjct: 25 AQSNGCVVC-GLD--QNG---DSILLCDGCDGEYHMFCL----VPPLTEIPAGDFYCKKC 74
Query: 685 MDCS 688
D +
Sbjct: 75 TDAN 78
>gi|308802794|ref|XP_003078710.1| PHD finger family protein / methyl-CpG binding domain-containing
protein (ISS) [Ostreococcus tauri]
gi|116057163|emb|CAL51590.1| PHD finger family protein / methyl-CpG binding domain-containing
protein (ISS) [Ostreococcus tauri]
Length = 1445
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
GC++C G D +LLCD C+ E+H CL A+ P+G+WFC
Sbjct: 904 GCIVC-GLDVMAG-----VVLLCDSCDAEYHTKCLDPPLSAE----PEGEWFC 946
>gi|444724505|gb|ELW65108.1| Zinc finger protein ubi-d4 [Tupaia chinensis]
Length = 412
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 318 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 373
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 374 HMYCL----TPSMSEPPEGSWSCHLCLD 397
>gi|307105064|gb|EFN53315.1| hypothetical protein CHLNCDRAFT_53872 [Chlorella variabilis]
Length = 455
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 7/69 (10%)
Query: 621 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC--EREFHVGCLKKHKMADLRE--- 675
++ + A C++C G D+ + G+GPRT++ CD C H+ C H L E
Sbjct: 5 LLPDYWAAKEDCVVCGGTDY-QPGYGPRTLVFCDCCLGGAATHIECWHAHSGELLTEDRL 63
Query: 676 -LPKGKWFC 683
P +WFC
Sbjct: 64 ASPAFQWFC 72
>gi|195122592|ref|XP_002005795.1| GI20660 [Drosophila mojavensis]
gi|193910863|gb|EDW09730.1| GI20660 [Drosophila mojavensis]
Length = 492
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 376 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTPNMIISVKRYRWQ- 433
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 434 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 486
>gi|395852334|ref|XP_003798694.1| PREDICTED: zinc finger protein ubi-d4 [Otolemur garnettii]
Length = 391
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWSCHLCLD 376
>gi|348558808|ref|XP_003465208.1| PREDICTED: PHD finger protein 21A isoform 1 [Cavia porcellus]
Length = 635
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|291414421|ref|XP_002723450.1| PREDICTED: D4, zinc and double PHD fingers family 2-like
[Oryctolagus cuniculus]
Length = 388
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 294 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 349
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 350 HMYCL----TPSMSEPPEGSWSCHLCLD 373
>gi|156394527|ref|XP_001636877.1| predicted protein [Nematostella vectensis]
gi|156223984|gb|EDO44814.1| predicted protein [Nematostella vectensis]
Length = 317
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 5/50 (10%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
C + G +LLCD C+R +H C M L +P+G W+ CMDC
Sbjct: 102 CQMCRKGDNEELLLLCDGCDRGYHTYCC----MPKLTTIPEGDWY-CMDC 146
>gi|195575390|ref|XP_002105662.1| GD21606 [Drosophila simulans]
gi|194201589|gb|EDX15165.1| GD21606 [Drosophila simulans]
Length = 205
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
LC+ C + G P +LLCD+C H+ CLK LR +P G W+C
Sbjct: 30 LCKVC---RRGSDPEKMLLCDECNAGTHMFCLKPK----LRSVPPGNWYC 72
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2602 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2645
>gi|347965797|ref|XP_001689355.2| AGAP001386-PA [Anopheles gambiae str. PEST]
gi|333470348|gb|EDO63260.2| AGAP001386-PA [Anopheles gambiae str. PEST]
Length = 2528
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 10/54 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
C C+ D P ILLCD C++ +H CLK L +P+G WFC +
Sbjct: 1039 CQECKKTDQ------PEWILLCDSCDKGYHCACLKPV----LFTIPEGDWFCPV 1082
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2648 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2691
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2588 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2631
>gi|332842752|ref|XP_001140541.2| PREDICTED: zinc finger protein DPF3 [Pan troglodytes]
Length = 367
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 250 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 305
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 306 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 349
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2871 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2914
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2639 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2682
>gi|9937325|gb|AAG02418.1|AF286030_1 regulator Ustilago maydis 1 protein [Ustilago maydis]
Length = 2289
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 12/61 (19%)
Query: 624 NLEAELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
N E C +C RG D GP +LLCD+C R +H+ CL+ L +PK +WF
Sbjct: 534 NQAQEEQMCEICLRGED------GP-NMLLCDECNRGYHMYCLQPA----LTSIPKSQWF 582
Query: 683 C 683
C
Sbjct: 583 C 583
>gi|347969322|ref|XP_003436402.1| AGAP003137-PB [Anopheles gambiae str. PEST]
gi|333468471|gb|EGK96959.1| AGAP003137-PB [Anopheles gambiae str. PEST]
Length = 1358
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 10/58 (17%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
E + C++CR + G T LLCD+C R H+ CLK L+++P+G WFC M
Sbjct: 1123 ERAVCMVCR-----RKGDANLT-LLCDECNRACHMYCLK----PKLKKVPEGDWFCKM 1170
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2566 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2609
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C +C GC KS R +LLCD C+ +H CL L+ +PKG W C C+ C+
Sbjct: 835 CTVCEGC--GKSSDEAR-LLLCDDCDISYHTYCLD----PPLQTVPKGGWKCKWCVCCTH 887
Query: 690 INSV 693
SV
Sbjct: 888 CGSV 891
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2634 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2677
>gi|440907399|gb|ELR57553.1| Zinc finger protein ubi-d4, partial [Bos grunniens mutus]
Length = 380
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 286 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 341
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 342 HMYCL----TPSMSEPPEGSWSCHLCLD 365
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C +C GC KS R +LLCD C+ +H CL L+ +PKG W C C+ C+
Sbjct: 815 CTVCEGC--GKSSDEAR-LLLCDDCDISYHTYCLD----PPLQTVPKGGWKCKWCVCCTH 867
Query: 690 INSV 693
SV
Sbjct: 868 CGSV 871
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 997 IKINKVVLSKGWRCLECTVCEACGQATDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 1052
Query: 679 GKWFC--CMDCSRINSVLQNL 697
W C C+ C++ + L
Sbjct: 1053 DSWKCKWCVSCTQCGATTPGL 1073
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 33/64 (51%), Gaps = 9/64 (14%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C +C GC KS R +LLCD C+ +H CL L+ +PKG W C C+ C+
Sbjct: 832 CTVCEGC--GKSSDEAR-LLLCDDCDISYHTYCLD----PPLQTVPKGGWKCKWCVCCTH 884
Query: 690 INSV 693
SV
Sbjct: 885 CGSV 888
>gi|427792405|gb|JAA61654.1| Putative remodeling and spacing factor 1, partial [Rhipicephalus
pulchellus]
Length = 2760
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 644 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
G P ILLCD C+ +H CLK M +P G WFC
Sbjct: 1865 GDHPEWILLCDVCDAGYHTSCLKPALMI----IPDGDWFC 1900
>gi|297811079|ref|XP_002873423.1| hypothetical protein ARALYDRAFT_487800 [Arabidopsis lyrata subsp.
lyrata]
gi|297319260|gb|EFH49682.1| hypothetical protein ARALYDRAFT_487800 [Arabidopsis lyrata subsp.
lyrata]
Length = 379
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 5/51 (9%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C+ SG G +LLCD+C+R FH+ CL+ + +P G W C+DCS
Sbjct: 67 CEKCGSGEGDDELLLCDKCDRGFHMKCLR----PIVVRVPIGPWL-CVDCS 112
>gi|297459745|ref|XP_001254780.2| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|297479887|ref|XP_002691098.1| PREDICTED: zinc finger protein DPF3 [Bos taurus]
gi|296483064|tpg|DAA25179.1| TPA: Zinc finger protein DPF3-like [Bos taurus]
Length = 474
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 357 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 412
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 413 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 456
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 72/186 (38%), Gaps = 43/186 (23%)
Query: 518 DESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC--ASLSSIPQ 575
DE G DG E + + +G G I+C C P + H C + P+
Sbjct: 342 DEDG--DGYETDHQDYCEVCQQG---GEIILCDTC-----PRAY--HMVCLDPDMEKAPE 389
Query: 576 GDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
G W C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 390 GTWSCPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC 436
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSV 693
K G +L CD C +H+ CL L E+P G+W C C C +
Sbjct: 437 ------KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGK 482
Query: 694 LQNLLV 699
+Q +L
Sbjct: 483 VQKILT 488
>gi|28971718|dbj|BAC65327.1| PFTF1 [Mus musculus]
Length = 556
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|71014560|ref|XP_758729.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
gi|46098519|gb|EAK83752.1| hypothetical protein UM02582.1 [Ustilago maydis 521]
Length = 2289
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 12/61 (19%)
Query: 624 NLEAELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 682
N E C +C RG D GP +LLCD+C R +H+ CL+ L +PK +WF
Sbjct: 534 NQAQEEQMCEICLRGED------GP-NMLLCDECNRGYHMYCLQPA----LTSIPKSQWF 582
Query: 683 C 683
C
Sbjct: 583 C 583
>gi|17512445|gb|AAH19181.1| PHD finger protein 21A [Mus musculus]
Length = 556
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|149725049|ref|XP_001489490.1| PREDICTED: PHD finger protein 21A isoform 1 [Equus caballus]
gi|338712004|ref|XP_003362634.1| PREDICTED: PHD finger protein 21A [Equus caballus]
Length = 629
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 440 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 483
>gi|148695644|gb|EDL27591.1| PHD finger protein 21A, isoform CRA_a [Mus musculus]
Length = 557
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 369 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 412
>gi|157841174|ref|NP_620094.2| PHD finger protein 21A isoform 1 [Mus musculus]
Length = 556
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|413935108|gb|AFW69659.1| hypothetical protein ZEAMMB73_060614 [Zea mays]
Length = 527
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 56/132 (42%), Gaps = 8/132 (6%)
Query: 638 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR-INSVL 694
C+ SG +LLCD+C++ +H+ CL L +P G W+C C++ R +
Sbjct: 312 CEQCSSGLHGDVMLLCDRCDKGWHLYCLS----PPLERVPPGNWYCSDCLNSDRDCFGFI 367
Query: 695 QNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR-WRLLSGKAATPETRLLLSQAVAI 753
Q E F K++ G + I+ + W ++ GKA E +I
Sbjct: 368 QRRKSCLLETFRRFDDRVRKRWFGQRNPSRIQIEKQFWEIVEGKAGELEVMYGSDLDTSI 427
Query: 754 FHDCFDPIVDSI 765
+ F + D++
Sbjct: 428 YGSGFPRLGDTV 439
>gi|410915434|ref|XP_003971192.1| PREDICTED: zinc finger protein ubi-d4-like [Takifugu rubripes]
Length = 398
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 11/99 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 302 DCGR-SGHPSCLQFTPIMMAAVKTYRWQ---CIECKCCNMCGTSENDDQLLFCDDCDRGY 357
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMDCSRIN-SVLQN 696
H+ CL + E P+G W C C+D + S+ QN
Sbjct: 358 HMYCLN----PPMSEPPEGSWSCHLCLDLLKDKASIYQN 392
>gi|406865311|gb|EKD18353.1| phd and ring finger domain protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 624
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 13/57 (22%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 688
C +CR D + +LLCD C+ +H C+ L +P G WF CM+C+
Sbjct: 141 CPICRSADQEE------VLLLCDSCDAPYHTHCI------GLDRVPNGHWF-CMECA 184
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 6/58 (10%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDC 687
CL C C+ P +LLCD C+ +H CL L+ +PKG W C C+ C
Sbjct: 917 CLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPKGGWKCKWCVWC 970
>gi|327259735|ref|XP_003214691.1| PREDICTED: PHD finger protein 21A-like, partial [Anolis
carolinensis]
Length = 567
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 378 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 421
>gi|92110221|gb|AAI15733.1| Dpf2 protein [Mus musculus]
Length = 209
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 115 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNLCGTSENDDQLLFCDDCDRGY 170
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 171 HMYCL----TPSMSEPPEGSWSCHLCLD 194
>gi|33869836|gb|AAH21191.1| DPF1 protein, partial [Homo sapiens]
Length = 414
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T V+ + C
Sbjct: 296 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFTVNMTAAVRTYRWQ---C 349
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 350 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 396
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 29/63 (46%), Gaps = 6/63 (9%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
CL C C+ P +LLCD C+ +H CL L +PKG W C C+ C +
Sbjct: 196 CLECTVCEACGEASDPGRLLLCDDCDISYHTYCLD----PPLHTVPKGAWKCKWCVRCVQ 251
Query: 690 INS 692
S
Sbjct: 252 CGS 254
>gi|194740838|ref|XP_001952897.1| GF17489 [Drosophila ananassae]
gi|190625956|gb|EDV41480.1| GF17489 [Drosophila ananassae]
Length = 1510
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
LC+ C + G P +LLCD+C H+ C+K +R +P+G W+ C DC
Sbjct: 1088 LCKVC---RRGSDPEKMLLCDECNGGTHMFCMK----PKMRTVPEGNWY-CKDC 1133
>gi|37362278|gb|AAQ91267.1| requiem, apoptosis response zinc finger gene [Danio rerio]
Length = 368
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C +M ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 247 YCDFCLGDSSMNQKTGQSEELVSCSDCGR-SGHPSCLQFTAVMMAAVKTYRWQ---CIEC 302
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ + +L CD C+R +H+ CL + + P+G W C
Sbjct: 303 KCCNVCGTSENDDQLLFCDDCDRGYHMYCLS----PPMSDPPEGSWSC 346
>gi|426224931|ref|XP_004006622.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2A
isoform 1 [Ovis aries]
Length = 1897
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
CL+CR D + +LLCD C+R H+ CL+ + +P+G WFC +
Sbjct: 1672 CLVCRKGDNDE------FLLLCDGCDRGCHIYCLR----PKMEAVPEGDWFCAV 1715
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|444707577|gb|ELW48842.1| PHD finger protein 21A [Tupaia chinensis]
Length = 497
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 274 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 317
>gi|328927020|ref|NP_001180159.2| bromodomain adjacent to zinc finger domain protein 2A [Bos taurus]
Length = 1898
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
CL+CR D + +LLCD C+R H+ CL+ + +P+G WFC +
Sbjct: 1673 CLVCRKGDNDE------FLLLCDGCDRGCHIYCLR----PKMEAVPEGDWFCAV 1716
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 703 IKINKVVLSKGWRCLECTVCEACGQATDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 758
Query: 679 GKWFC--CMDCSRINSVLQNL 697
W C C+ C++ + L
Sbjct: 759 DSWKCKWCVTCTQCGATTPGL 779
>gi|410899989|ref|XP_003963479.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2A-like
[Takifugu rubripes]
Length = 2283
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 12/64 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 689
C +CR D +LLCD C+R H+ CLK + ++P+G WFC C+D +
Sbjct: 2061 CQVCRKGDNDD------CLLLCDGCDRGCHMYCLK----PKITQVPEGDWFCPTCVDQTE 2110
Query: 690 INSV 693
SV
Sbjct: 2111 NTSV 2114
>gi|341888296|gb|EGT44231.1| hypothetical protein CAEBREN_09890 [Caenorhabditis brenneri]
gi|341902393|gb|EGT58328.1| hypothetical protein CAEBREN_07005 [Caenorhabditis brenneri]
Length = 454
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 9/76 (11%)
Query: 613 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 672
++ +R IVK E C+ CR C ++ CD+C+R FH C+
Sbjct: 364 EMPERMAMIVKTYEW---SCVDCRVCSVCHKPGEENEVVFCDRCDRGFHNSCV------G 414
Query: 673 LRELPKGKWFCCMDCS 688
L+ P G W C + CS
Sbjct: 415 LKSTPIGSWICEIFCS 430
>gi|357139082|ref|XP_003571114.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Brachypodium distachyon]
Length = 924
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 13/82 (15%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDC---S 688
+C+ C + ++LLCD+C+ E+H CL L +P+G W+C CM S
Sbjct: 60 VCKVCGIDRDD---DSVLLCDKCDSEYHTYCLN----PPLARIPQGNWYCPSCMSGQKKS 112
Query: 689 RINSVLQNLLVQEAEKL-PEFH 709
++ +++L Q+ + + EFH
Sbjct: 113 HLDQGVKDLKRQQKKHVGEEFH 134
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2715 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2761
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|307179212|gb|EFN67624.1| Remodeling and spacing factor 1 [Camponotus floridanus]
Length = 3956
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2714 ACQKCGKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2756
>gi|397611137|gb|EJK61191.1| hypothetical protein THAOC_18363, partial [Thalassiosira oceanica]
Length = 584
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 8/59 (13%)
Query: 634 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 692
+C+ C GFG +++CD CE H+ CL L+ +P G+WF C DC + +S
Sbjct: 221 VCKTC---GCGFGDVEMIVCDGCESLLHLSCLD----PPLKRVPAGRWF-CNDCLKRDS 271
>gi|347969324|ref|XP_001237499.3| AGAP003137-PA [Anopheles gambiae str. PEST]
gi|333468470|gb|EAU77009.3| AGAP003137-PA [Anopheles gambiae str. PEST]
Length = 1307
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 10/58 (17%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
E + C++CR + G T LLCD+C R H+ CLK L+++P+G WFC M
Sbjct: 1072 ERAVCMVCR-----RKGDANLT-LLCDECNRACHMYCLK----PKLKKVPEGDWFCKM 1119
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2652 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2698
>gi|440899481|gb|ELR50778.1| Bromodomain adjacent to zinc finger domain protein 2A, partial [Bos
grunniens mutus]
Length = 1898
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
CL+CR D + +LLCD C+R H+ CL+ + +P+G WFC +
Sbjct: 1673 CLVCRKGDNDE------FLLLCDGCDRGCHIYCLR----PKMEAVPEGDWFCAV 1716
>gi|62177137|ref|NP_997861.2| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|62026699|gb|AAH92130.1| D4, zinc and double PHD fingers family 2, like [Danio rerio]
gi|182892074|gb|AAI65789.1| Dpf2l protein [Danio rerio]
Length = 405
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C +M ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 284 YCDFCLGDSSMNQKTGQSEELVSCSDCGR-SGHPSCLQFTAVMMAAVKTYRWQ---CIEC 339
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ + +L CD C+R +H+ CL + + P+G W C
Sbjct: 340 KCCNVCGTSENDDQLLFCDDCDRGYHMYCLS----PPMSDPPEGSWSC 383
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2536 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2583
>gi|449529471|ref|XP_004171723.1| PREDICTED: uncharacterized LOC101209162 [Cucumis sativus]
Length = 383
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 16/115 (13%)
Query: 771 IPSMVYGRNLRGQEF-------GGMYCAILT-----VNSSVVSAGILRVF---GQEVAEL 815
+P+M Y N Q G YC +L VN +V A I + AE+
Sbjct: 71 LPTMAYAANTGKQSTFMEKCVSNGKYCTLLLESKSEVNPGLVIAAITYQIVPADTQYAEI 130
Query: 816 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 870
PL A S KG+ +L+ + K L + +++I +E+E W+ K GF I
Sbjct: 131 PLAAVSLAYQHKGFGHILYMELRKRLQSVGIRTIFCWGDKESEGFWS-KQGFLSI 184
>gi|291190717|ref|NP_001167047.1| Zinc finger protein ubi-d4 [Salmo salar]
gi|223647844|gb|ACN10680.1| Zinc finger protein ubi-d4 [Salmo salar]
Length = 402
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 47/112 (41%), Gaps = 11/112 (9%)
Query: 579 YCKYC--QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 636
YC +C + +K + + SG S Q T + VK + C+ C+
Sbjct: 281 YCDFCLGDSALNQKTGQSEELQSCSDCGRSGHPSCLQFTPVMMAAVKTYRWQ---CIECK 337
Query: 637 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 338 CCNICGTSENDDQLLFCDDCDRGYHMYCLSPA----MAEPPEGSWSCHLCLD 385
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2448 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2495
>gi|397594639|gb|EJK56298.1| hypothetical protein THAOC_23853 [Thalassiosira oceanica]
Length = 462
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 20/84 (23%)
Query: 628 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 687
E+ GC +C G D +L+C+ C E+H CL L E+P+G WFC
Sbjct: 71 EVEGCRIC-GKDDDHG-----NLLICEFCGDEYHTYCLS----PPLDEIPEGDWFCGE-- 118
Query: 688 SRINSVLQNLLVQEAEKLPEFHLN 711
S I ++ N++ FHLN
Sbjct: 119 SNIQGLVDNII--------PFHLN 134
>gi|332019010|gb|EGI59549.1| Remodeling and spacing factor 1 [Acromyrmex echinatior]
Length = 4018
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2771 ACQKCGKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2813
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2503 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2550
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 6/58 (10%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDC 687
CL C C+ P +LLCD C+ +H CL L+ +PKG W C C+ C
Sbjct: 851 CLECTVCEACGKATDPGRLLLCDDCDISYHTYCL----XPPLQTVPKGGWKCKWCVWC 904
>gi|312377713|gb|EFR24474.1| hypothetical protein AND_10892 [Anopheles darlingi]
Length = 539
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 48/108 (44%), Gaps = 11/108 (10%)
Query: 579 YCKYC-QNMFERKRFLQHD--ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C + E K+ L+ + + + GR SG S Q T I V+ + C+ C
Sbjct: 429 YCDFCLGDARENKKTLEPEELVSCSDCGR-SGHPSCLQFTANMIISVRKYRWQ---CIEC 484
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL L P+G W C
Sbjct: 485 KYCTICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVSPPEGSWSC 528
>gi|296487472|tpg|DAA29585.1| TPA: bromodomain adjacent to zinc finger domain, 2A [Bos taurus]
Length = 2013
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 685
CL+CR D + +LLCD C+R H+ CL+ + +P+G WFC +
Sbjct: 1788 CLVCRKGDNDE------FLLLCDGCDRGCHIYCLR----PKMEAVPEGDWFCAV 1831
>gi|383859786|ref|XP_003705373.1| PREDICTED: uncharacterized protein LOC100874631 [Megachile rotundata]
Length = 3823
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2576 ACQKCGKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2618
>gi|350412090|ref|XP_003489540.1| PREDICTED: hypothetical protein LOC100746698 [Bombus impatiens]
Length = 3834
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2587 ACQKCGKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2629
>gi|324512595|gb|ADY45214.1| Zinc finger protein [Ascaris suum]
Length = 382
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/70 (27%), Positives = 33/70 (47%), Gaps = 8/70 (11%)
Query: 618 CIRIVKNLEAELS----GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 673
C++ +N+ + C+ C+ C + +L CD C+R FH+ CL+ L
Sbjct: 298 CLKFTENMITSTNKYGWQCIECKSCAICGTSDNDDQLLFCDDCDRGFHLYCLRPR----L 353
Query: 674 RELPKGKWFC 683
P+G+W C
Sbjct: 354 ATAPEGEWSC 363
>gi|291234837|ref|XP_002737353.1| PREDICTED: rCG40672-like, partial [Saccoglossus kowalevskii]
Length = 887
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 22/37 (59%), Gaps = 4/37 (10%)
Query: 647 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
P ILLCD+C+ +H CL+ M +P G+WFC
Sbjct: 337 PEWILLCDKCDAGYHTACLRPPLMM----IPDGEWFC 369
>gi|354547685|emb|CCE44420.1| hypothetical protein CPAR2_402210 [Candida parapsilosis]
Length = 777
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 626 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++E CL+C D P+ LLCD C+ FH+ CL L ++P W+C
Sbjct: 179 DSESENCLICGDND------SPQETLLCDNCDHAFHMKCLS----PPLTQIPATNWYC 226
>gi|328780804|ref|XP_001121484.2| PREDICTED: hypothetical protein LOC725662 [Apis mellifera]
Length = 3827
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 10/53 (18%)
Query: 631 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
C C D P ILLCD C++ +H CL+ M +P+G WFC
Sbjct: 2588 ACQKCGKADH------PEWILLCDSCDKGWHCSCLRPALML----IPEGDWFC 2630
>gi|6648954|gb|AAF21305.1|AF108133_1 neuro-d4 [Mus musculus]
Length = 323
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 47/111 (42%), Gaps = 10/111 (9%)
Query: 573 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 632
IP G YC +C ++ + + + GR SG S Q T + + + C
Sbjct: 205 IPNG--YCDFCLGGSKKTGCPEDLISCADCGR-SGHPSCLQFT---VNMTAPVRTYRWQC 258
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 259 IECKSCSLCGTSENDDQLLFCDDCDRGYHMYCLS----PPMAEPPEGSWSC 305
>gi|301769749|ref|XP_002920299.1| PREDICTED: zinc finger protein DPF3-like [Ailuropoda melanoleuca]
Length = 633
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 516 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 571
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 572 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 615
>gi|308235954|ref|NP_001184101.1| D4, zinc and double PHD fingers family 2 [Xenopus (Silurana)
tropicalis]
Length = 388
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 11/99 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 296 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 351
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMDCSRIN-SVLQN 696
H+ CL + E P+G W C C+D + S+ QN
Sbjct: 352 HMYCLS----PPMAEPPEGSWSCHLCLDLLKDKASIYQN 386
>gi|443725765|gb|ELU13216.1| hypothetical protein CAPTEDRAFT_167868 [Capitella teleta]
Length = 236
Score = 41.6 bits (96), Expect = 2.3, Method: Composition-based stats.
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 4/61 (6%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C+ C+ C + P +L CD C++ +H+ C H + + P GKW C C I
Sbjct: 66 CIDCKTCCLCEDAGDPDAMLFCDACDKGYHMSC---HSPV-IEDKPTGKWVCSRCCQEIE 121
Query: 692 S 692
+
Sbjct: 122 A 122
>gi|443690042|gb|ELT92280.1| hypothetical protein CAPTEDRAFT_224752 [Capitella teleta]
Length = 1892
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 5/78 (6%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 691
C+ C+ C +L CD C+R H+ C + L P+GKW CC+ C
Sbjct: 261 CIECKKCSLCGETGKEDNMLFCDACDRGIHMECC----IPPLTSAPEGKWVCCL-CDEDT 315
Query: 692 SVLQNLLVQEAEKLPEFH 709
L Q KL + H
Sbjct: 316 IEAFKALNQPTIKLSDLH 333
>gi|350409455|ref|XP_003488744.1| PREDICTED: hypothetical protein LOC100749759 [Bombus impatiens]
Length = 2491
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 10/79 (12%)
Query: 617 RCI--RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 674
RCI + + E +L C C S +LLCD C+R +H+ CL L
Sbjct: 203 RCIPVEVASSEEEKLDDLTFCEVCHQSNRE---DRMLLCDGCDRGYHLECL----TPPLD 255
Query: 675 ELPKGKWFCCMDCSRINSV 693
E+P +WFC +CS+ N
Sbjct: 256 EVPIEEWFCP-ECSQNNQT 273
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 6/77 (7%)
Query: 619 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 678
I+I + + + CL C C+ P +LLCD C+ +H CL L +PK
Sbjct: 1053 IKITRVILTKGWRCLECTVCEACGDASDPGRLLLCDDCDISYHTYCLD----PPLHTVPK 1108
Query: 679 GKWFC--CMDCSRINSV 693
G W C C+ C + S
Sbjct: 1109 GAWKCKWCVWCVQCGST 1125
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 72/186 (38%), Gaps = 43/186 (23%)
Query: 518 DESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAEC--ASLSSIPQ 575
DE G DG E + + +G G I+C C P + H C + P+
Sbjct: 342 DEDG--DGYETDHQDYCEVCQQG---GEIILCDTC-----PRAY--HMVCLDPDMEKAPE 389
Query: 576 GDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
G W C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 390 GTWSCPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC 436
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSV 693
K G +L CD C +H+ CL L E+P G+W C C C +
Sbjct: 437 ------KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGK 482
Query: 694 LQNLLV 699
+Q +L
Sbjct: 483 VQKILT 488
>gi|7108675|gb|AAF36510.1|AF132069_1 SMCX [Sus scrofa]
Length = 316
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 610 SVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
S E TK +R+ +N A+ +CR C G +LLCD C+ +H+ CL
Sbjct: 238 SPEPCTKMTMRLRRNHSNAQFIESYVCRMCS---RGDEDDKLLLCDGCDDNYHIFCL--- 291
Query: 669 KMADLRELPKGKWFC 683
+ L E+PKG W C
Sbjct: 292 -LPPLPEIPKGVWRC 305
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 633 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2615 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2661
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 650 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 708
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|226088571|dbj|BAH37030.1| AT rich interactive domain 1C protein [Tokudaia osimensis]
Length = 453
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 610 SVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
S E TK +R+ +N A+ +CR C G +LLCD C+ +H+ CL
Sbjct: 96 SPEPCTKMTMRLRRNHSNAQFIESYICRMC---SRGDEDDKLLLCDGCDDNYHIFCL--- 149
Query: 669 KMADLRELPKGKWFC 683
+ L E+PKG W C
Sbjct: 150 -LPPLPEIPKGVWRC 163
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 64/156 (41%), Gaps = 34/156 (21%)
Query: 547 IICHCCNSEVSPSQFEAHAECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 606
I+C C P + + + P+G W C +C+ K +Q +A +
Sbjct: 357 ILCDTC-----PRAYHMVCLDSDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS---E 403
Query: 607 GVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 665
G + +E++ +LE E + CR C K G +L CD C +H+ CL
Sbjct: 404 GEEILEEVGG-------DLEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCL 450
Query: 666 KKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 699
L E+P G+W C C C + +Q +L+
Sbjct: 451 N----PPLPEIPNGEWLCPRCT-CPALKGKVQKILI 481
>gi|164665416|ref|NP_001106929.1| lysine-specific demethylase 5D [Canis lupus familiaris]
gi|122138462|sp|Q30DN6.1|KDM5D_CANFA RecName: Full=Lysine-specific demethylase 5D; AltName: Full=Histone
demethylase JARID1D; AltName: Full=Jumonji/ARID
domain-containing protein 1D; AltName: Full=Protein SmcY
gi|76162896|gb|ABA40834.1| JARID1D [Canis lupus familiaris]
Length = 1545
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 610 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 669
SVE TK +++ + A+ +CR C G +LLCD C+ +H+ CL
Sbjct: 295 SVEPCTKMTMQLRTHSSAQFIDLYVCRIC---SRGDEDDKLLLCDGCDDTYHIFCL---- 347
Query: 670 MADLRELPKGKWFC 683
+ L E+P+G W C
Sbjct: 348 IPPLPEIPRGVWRC 361
>gi|148235385|ref|NP_001079117.1| zinc finger protein ubi-d4 A [Xenopus laevis]
gi|18203564|sp|Q9W638.1|REQUA_XENLA RecName: Full=Zinc finger protein ubi-d4 A; AltName: Full=Apoptosis
response zinc finger protein A; AltName: Full=Protein
requiem A; Short=xReq A
gi|4808462|dbj|BAA77574.1| Requiem protein [Xenopus laevis]
Length = 388
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 11/99 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 296 DCGR-SGHPSCLQFTAVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDDCDRGY 351
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMDCSRIN-SVLQN 696
H+ CL + + E P+G W C C+D + S+ QN
Sbjct: 352 HMYCL----VPPVAEPPEGSWSCHLCLDLLKDKASIYQN 386
>gi|348558814|ref|XP_003465211.1| PREDICTED: PHD finger protein 21A isoform 4 [Cavia porcellus]
Length = 598
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 408 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 451
>gi|195425644|ref|XP_002061104.1| GK10624 [Drosophila willistoni]
gi|194157189|gb|EDW72090.1| GK10624 [Drosophila willistoni]
Length = 515
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 50/119 (42%), Gaps = 13/119 (10%)
Query: 573 IPQGDWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 629
+ Q YC +C Q ++ + + + GR SG S Q T I VK +
Sbjct: 399 VAQPSPYCDFCLGDQRENKKTNMPEELVSCSDCGR-SGHPSCLQFTPNMIISVKRYRWQ- 456
Query: 630 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
C+ C+ C + +L CD C+R +H+ CL L P+G W C CM+
Sbjct: 457 --CIECKYCSICGTSDNDDQLLFCDDCDRGYHMYCLS----PPLVTPPEGSWSCKLCME 509
>gi|410056522|ref|XP_003317520.2| PREDICTED: lysine-specific demethylase 5C, partial [Pan
troglodytes]
Length = 909
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 610 SVEQITKRCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 668
S E TK +R+ +N A+ +CR C G +LLCD C+ +H+ CL
Sbjct: 292 SPEPCTKMTMRLRRNHSNAQFIESYVCRMCS---RGDEDDKLLLCDGCDDNYHIFCL--- 345
Query: 669 KMADLRELPKGKWFC 683
+ L E+PKG W C
Sbjct: 346 -LPPLPEIPKGVWRC 359
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 9/65 (13%)
Query: 632 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW-----FCCMD 686
C+ C C+ P +LLCD C+ +H CL L +PKG W CC+
Sbjct: 1017 CVECIVCEVCGKASDPSRLLLCDDCDISYHTYCLD----PPLNTVPKGGWKCKWCVCCVQ 1072
Query: 687 CSRIN 691
C ++
Sbjct: 1073 CGAVS 1077
>gi|148670787|gb|EDL02734.1| D4, zinc and double PHD fingers, family 3, isoform CRA_a [Mus
musculus]
Length = 381
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 259 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 314
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 315 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 358
>gi|426234251|ref|XP_004011110.1| PREDICTED: zinc finger protein DPF3 [Ovis aries]
Length = 408
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C NM ++ + + + GR SG + Q T VK + + C+ C
Sbjct: 291 YCDFCLGGSNMNKKSGRPEELVSCADCGR-SGHPTCLQFTLNMTEAVKTYKWQ---CIEC 346
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 683
+ C + +L CD C+R +H+ CL + E P+G W C
Sbjct: 347 KSCILCGTSENDDQLLFCDDCDRGYHMYCLN----PPVAEPPEGSWSC 390
>gi|4808456|dbj|BAA77571.1| Requiem protein [Xenopus laevis]
Length = 198
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 13/113 (11%)
Query: 579 YCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 635
YC +C N ++ + + + GR SG S Q T + VK + C+ C
Sbjct: 81 YCDFCLGDSNTNKKSNQPEELVSCSDCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIEC 136
Query: 636 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 686
+ C+ + +L CD C+R +H+ CL + E P+G W C C+D
Sbjct: 137 KCCNICGTSENDDQLLFCDDCDRGYHMYCLS----PPVAEPPEGSWSCHLCLD 185
>gi|339522307|gb|AEJ84318.1| D4 zinc and double PHD fingers family 2 [Capra hircus]
Length = 391
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 601 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 660
+ GR SG S Q T + VK + C+ C+ C+ + +L CD C+R +
Sbjct: 297 DCGR-SGHPSCLQFTPVMMAAVKTYRWQ---CIECKCCNICGTSENDDQLLFCDACDRGY 352
Query: 661 HVGCLKKHKMADLRELPKGKWFC--CMD 686
H+ CL + E P+G W C C+D
Sbjct: 353 HMYCL----TPSMSEPPEGSWRCHLCLD 376
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.392
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,518,951,648
Number of Sequences: 23463169
Number of extensions: 552977564
Number of successful extensions: 1454250
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 273
Number of HSP's successfully gapped in prelim test: 1464
Number of HSP's that attempted gapping in prelim test: 1448370
Number of HSP's gapped (non-prelim): 5654
length of query: 920
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 768
effective length of database: 8,792,793,679
effective search space: 6752865545472
effective search space used: 6752865545472
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)