BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002860
(873 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/924 (54%), Positives = 623/924 (67%), Gaps = 136/924 (14%)
Query: 1 MKRELDYELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIVYSRVKR 60
MKRE+ G E+ +Q QA D + N CKRFKV VNGF VYSR+++
Sbjct: 1 MKREVGAFDGGIQFESESQ-------QAQD----SNNNNNCKRFKV--VNGFFVYSRLRK 47
Query: 61 SRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQMTVENV 120
++ S+ + H+ + K + I+++V E E V++V
Sbjct: 48 NKPSSRE---------------CHDDKDRKCQQ-------IIQTVSEVETVNKDPQVKDV 85
Query: 121 IEE-TVKGKKAPICKEEPISK-VECFPRKEGGSE-------------VSNGLNKKCLKRP 165
E ++ + PICK E S+ EGG+E SN L + L R
Sbjct: 86 SRELSLCNVQLPICKIESFSEPSRVILANEGGTEDTERKLAHVGTEGKSNKLRQ--LTR- 142
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S KVEPVEV V E +E +S ++VE IAEGSALT PKKNLELKMSKKI+L+ P
Sbjct: 143 SNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKIALDNIP 202
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
MTV ELFETGLL+GV VVYMGG K A LRG I+D GILC CS C GCRVIPPS+FEIH
Sbjct: 203 MTVKELFETGLLEGVPVVYMGGKK--AFCLRGTIKDVGILCYCSFCKGCRVIPPSQFEIH 260
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
A KQYRRA+QYICFENGKSLL+VL ACR+ PL L+AT+QSA+S LP+EK+F C RCKGT
Sbjct: 261 AIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKTFTCKRCKGT 320
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQ 405
+P VGK GP LC+SCV+SK+ G+ T I+ SS+P +
Sbjct: 321 YPTILVGKVGP--LCSSCVESKESNGSPACETNIKSRSSKPATV---------------- 362
Query: 406 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 465
SK +A +S NK +W IT KDQRLHKLVF++ GLPDGTEV
Sbjct: 363 ---------------SKSLNSALEGVSSENKCQWKITTKDQRLHKLVFEDGGLPDGTEVA 407
Query: 466 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------------ 501
YYA GQKLL GYK G GI+C CCN EVSPS FEAHA
Sbjct: 408 YYARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVSLHELA 467
Query: 502 ----------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
DGG+L+ CDGCPRAFHK CASLSSIP+G W+C++CQNM
Sbjct: 468 ISLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQFCQNM 527
Query: 540 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFG 599
F+R++F++H+ANAV AGR+SGVD +EQIT+RCIRIVKN+EAEL+GC+LCRG DFS+SGFG
Sbjct: 528 FQREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYDFSRSGFG 587
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP 659
PRTI+LCDQC +EFHVGCL+ HK+A+L+ELPKGKWFCC DC RI+S L+ LL +EAE +P
Sbjct: 588 PRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLAREAEIIP 647
Query: 660 EFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 718
L + KK LETV++IDVRW+LL+GK+A+PET+LLLSQA+AIF +CFDPIVD+
Sbjct: 648 NKLLEVVMKKNEEKGLETVNNIDVRWKLLTGKSASPETKLLLSQALAIFQECFDPIVDT- 706
Query: 719 SGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 778
+GRDLIP MVYG+N +GQ++GGMYCA+L VNS VVSA I+R+FGQEVAELPLVATS NH
Sbjct: 707 TGRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLVATSNGNH 766
Query: 779 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL 838
GKGYFQLLF+ IEKLL++L+V SIVLPAAEEAESIWTDKFGF+KI P+ LS YRK C Q+
Sbjct: 767 GKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSKYRKSCCQI 826
Query: 839 VTFKGTSMLQKRVPACRIGSSSTD 862
+TFKGTSMLQK VP CRI + +T+
Sbjct: 827 LTFKGTSMLQKAVPPCRIVNQNTE 850
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 894 bits (2310), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/960 (52%), Positives = 627/960 (65%), Gaps = 132/960 (13%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RISSSRPGLI------ 389
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+ + P
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSA 392
Query: 390 -----------------ANSTPVTSVHKSSQSQRQR---------KITKKSKKTVLISKP 423
+ S+ TS+ +S +S R K+ KKS K L+ K
Sbjct: 393 SVSISSRYKRKWVTKAKSKSSEYTSISRSPRSAPMRIPSKNKSALKMRKKSLKPALMLKS 452
Query: 424 FENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGI 483
++AS S K++W IT KDQRLHKLVF+E GLPDGTEV Y+A GQKLL+GYK G GI
Sbjct: 453 SQSASKCSSSLAKNQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGI 512
Query: 484 ICHCCNSEVSPSQFEAHA------------------------------------------ 501
+C CCN VSPSQFE HA
Sbjct: 513 LCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLC 572
Query: 502 ----DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 557
DGGNLL CDGCPRAFHKECASLSS P+GDWYCK+CQNMF+R++F++H+ NAV AGR
Sbjct: 573 IICLDGGNLLLCDGCPRAFHKECASLSSTPRGDWYCKFCQNMFQREKFVEHNVNAVAAGR 632
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
V GVD +EQITKRCIRIV+N+E +LSGC+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGC
Sbjct: 633 VHGVDPIEQITKRCIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGC 692
Query: 618 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 677
LK HKMA L+ELP+GKWFC + C+RI+S LQ LL++ EKLP L A+ + G + +
Sbjct: 693 LKDHKMAFLKELPRGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDI 752
Query: 678 S-DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 736
D+DV WRL+SGK A+PETRLLLS+A+AIFHD FDPIVD SGRDLIP+MVYGR++ GQ
Sbjct: 753 QVDVDVSWRLISGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQ 812
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
EFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGKGYFQ LF+CIE+LL+F
Sbjct: 813 EFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTLFSCIERLLAF 872
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
L+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 873 LKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 932
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 887 bits (2293), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 501/969 (51%), Positives = 628/969 (64%), Gaps = 141/969 (14%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RISSSRPGLI------ 389
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+ + P
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEPTTSKSSGSA 392
Query: 390 -----------------ANSTPVTSVHKSSQSQRQR---------KITKKSKKTVLISKP 423
+ S+ TS+ +S +S R K+ KKS K L+ K
Sbjct: 393 SVSISSRYKRKWVTKAKSKSSEYTSISRSPRSAPMRIPSKNKSALKMRKKSLKPALMLKS 452
Query: 424 FENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGI 483
++AS S K++W IT KDQRLHKLVF+E GLPDGTEV Y+A GQKLL+GYK G GI
Sbjct: 453 SQSASKCSSSLAKNQWKITTKDQRLHKLVFEEDGLPDGTEVAYFARGQKLLQGYKKGSGI 512
Query: 484 ICHCCNSEVSPSQFEAHA------------------------------------------ 501
+C CCN VSPSQFE HA
Sbjct: 513 LCCCCNCVVSPSQFEVHAGWSSRKKPYAYIYTSNGVSLHELAISLSKGRKYSAKDNDDLC 572
Query: 502 ----DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 557
DGGNLL CDGCPRAFHKECASLSSIP+GDWYCK+CQNMF+R++F++H+ NAV AGR
Sbjct: 573 IICLDGGNLLLCDGCPRAFHKECASLSSIPRGDWYCKFCQNMFQREKFVEHNVNAVAAGR 632
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
V GVD +EQITKRCIRIV+N+E +LSGC+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGC
Sbjct: 633 VHGVDPIEQITKRCIRIVRNIETDLSGCVLCRGSDFSKSGFGPRTIILCDQCEKEFHVGC 692
Query: 618 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 677
LK HKMA L+ELP+GKWFC + C+RI+S LQ LL++ EKLP L A+ + G + +
Sbjct: 693 LKDHKMAFLKELPRGKWFCSIVCTRIHSALQKLLIRGPEKLPNSLLGAVNRKLGENCSDI 752
Query: 678 S-DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 736
D+DV WRL+SGK A+PETRLLLS+A+AIFHD FDPIVD SGRDLIP+MVYGR++ GQ
Sbjct: 753 QVDVDVSWRLISGKIASPETRLLLSEAIAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQ 812
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQ---------EVAELPLVATSKINHGKGYFQLLF 787
EFGGMYCAIL VNS VVSA +LRVFGQ ++AELPLVATS NHGKGYFQ LF
Sbjct: 813 EFGGMYCAILIVNSFVVSAAMLRVFGQYCRAAIGCXDIAELPLVATSNGNHGKGYFQTLF 872
Query: 788 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 847
+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSML
Sbjct: 873 SCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSML 932
Query: 848 QKRVPACRI 856
QK VP+CR+
Sbjct: 933 QKTVPSCRV 941
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 857 bits (2213), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/859 (54%), Positives = 577/859 (67%), Gaps = 135/859 (15%)
Query: 45 KVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILES 104
KV+ VNG+IVY+R KRS S NG E
Sbjct: 21 KVSVVNGYIVYTRAKRSLDSC---------------------------------NGFSEH 47
Query: 105 VVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLKR 164
++N V++ EN EC K +EV K+ R
Sbjct: 48 AELKDNAEVEVKTENG---------------------ECEKLKNESTEVVARTRKR--SR 84
Query: 165 PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
SA++ KVE + +V +E+ ++ AL +P+ +ELKMSKKI +N+K
Sbjct: 85 RSALEAKVECCDQMVV------SETEQVVANGGSGINGALGAPRNKMELKMSKKIVVNRK 138
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
PMTV +LF+TG LDGVSVVYMGGIK +ASGLRG+IRDGGILCSC LCNG RVIPPS+FEI
Sbjct: 139 PMTVKKLFDTGFLDGVSVVYMGGIK-KASGLRGVIRDGGILCSCCLCNGRRVIPPSQFEI 197
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HACKQYRRA+QYIC ENGKSLL++LRACR L L+ T+Q+ + S EE+ F C RCKG
Sbjct: 198 HACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYFTCKRCKG 257
Query: 345 TFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQS 404
FP + V + GP +C SCV+S+K + + G R+ S RP +++N + + + SSQ
Sbjct: 258 CFPSSFVERVGP--ICRSCVESRKSEESSNNVVGKRVRSPRPVVLSNPSSTSELSVSSQV 315
Query: 405 QRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEV 464
+R RK K+T L+ F + S L DQRLHKLVF+E+GLPDGTEV
Sbjct: 316 KRHRK-----KRTKLV---FISISSVL-------------DQRLHKLVFEENGLPDGTEV 354
Query: 465 GYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------- 501
YYA GQKLLEG+K G GI+C CCN+E+SPSQFE HA
Sbjct: 355 AYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVSLHEL 414
Query: 502 -----------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 538
DGGNLL CDGCPRAFHKECA+LSSIP+GDWYC++CQN
Sbjct: 415 AISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSIPRGDWYCQFCQN 474
Query: 539 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 598
MF+R++F+ H+ANAV AGRV GVD +EQI RCIRIVK++EA+LS C LCRG DFS+SGF
Sbjct: 475 MFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDFSRSGF 534
Query: 599 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 658
GPRTI+LCDQCE+E+HVGCL+ HKMA L+ELP+G W CC DC+RI+S L+NLLV+ AE+L
Sbjct: 535 GPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVKGAERL 594
Query: 659 PEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 717
PE L IKK LE + IDVRWRLL+GK A+PETR LL +AV+IFH+CF+PIVD+
Sbjct: 595 PESLLGVIKKKQEEKGLEPI--IDVRWRLLNGKIASPETRPLLLEAVSIFHECFNPIVDA 652
Query: 718 ISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 777
SGRDLIP+MVYGRN+RGQEFGGMYCA+L VNSSVVSAG+LR+FG +VAELPLVATS N
Sbjct: 653 ASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVATSNGN 712
Query: 778 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ 837
HGKGYFQ LF+CIE+LL+FL VK++VLPAAEEAESIWTDKFGF K++P+ L+ YRK C Q
Sbjct: 713 HGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSKMNPDELTNYRKNCHQ 772
Query: 838 LVTFKGTSMLQKRVPACRI 856
+V+FKGT+ML K VP+CR+
Sbjct: 773 MVSFKGTNMLHKMVPSCRV 791
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 853 bits (2203), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/737 (59%), Positives = 524/737 (71%), Gaps = 91/737 (12%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI--RISSSRPGLIANS 392
++F C RCKG FP CVGK GP LCN C +SK+ T+T + I R + P LI S
Sbjct: 121 RTFTCKRCKGIFPSICVGKIGP--LCNLCAESKESHPTLTIGSSIISRYCQNLPSLILIS 178
Query: 393 TPV-------------------------------------TSVHKSSQSQRQRKITKKSK 415
+ S+ SQ RK +K +
Sbjct: 179 WIINLKTITSGQFLLMLAHCSFRLSFLSPEQVLALEYFKPASLSTFSQDNTLRKKKRKPE 238
Query: 416 KTVLISKPFENASPPLSFPNKSRWN-ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLL 474
+ LI+KP + AS LS P K ++ I+P+DQRLH+LVF+E GLPDGTE+ YYA GQKLL
Sbjct: 239 EPDLIAKPSKVASVHLS-PRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKLL 297
Query: 475 EGYKNGLGIICHCCNSEV-SPSQFEAHA-------------------------------- 501
GYK G GI+CHCCN EV SPS FEAHA
Sbjct: 298 GGYKRGFGILCHCCNCEVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHDLAISLSKSRK 357
Query: 502 --------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 547
DGG+LL CDGCPRAFHK CASLS++P GDWYC++CQN F+R++F++
Sbjct: 358 YSSQDNDDLCIICADGGDLLLCDGCPRAFHKGCASLSTVPSGDWYCQHCQNTFQREKFVE 417
Query: 548 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 607
H+ANA AGRVS +DS+EQITKRC RIVKN+EAEL+GC LCRG DF +SGFGPRTI+LCD
Sbjct: 418 HNANAFAAGRVSEIDSIEQITKRCFRIVKNVEAELTGCALCRGYDFMRSGFGPRTIILCD 477
Query: 608 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI- 666
QCE+EFHVGCL+ HKMA+L+ELPKG WFCCMDCSRI+S LQ LL++ AEKLP+ LN I
Sbjct: 478 QCEKEFHVGCLRSHKMANLKELPKGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIK 537
Query: 667 KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 726
KK+ L + IDVRW LLSGK A+PE +LLLS+A++IF +CFDPIVDS GRDLIP
Sbjct: 538 KKHEEKGLNISNSIDVRWTLLSGKIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPL 597
Query: 727 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
MVYG+N +GQ++GGMYCA+L VNS +VSAGILRVFG+EVAELPLVAT +HGKGYFQLL
Sbjct: 598 MVYGKNSKGQDYGGMYCAVLIVNSCIVSAGILRVFGEEVAELPLVATRNGDHGKGYFQLL 657
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 846
F+CIEKLL+FL V+++VLPAAEEAESIW +KFGF+KI PE LS YRK C Q+V F+GTSM
Sbjct: 658 FSCIEKLLAFLNVQNLVLPAAEEAESIWIEKFGFQKIKPEQLSKYRKNCCQMVRFEGTSM 717
Query: 847 LQKRVPACRIGSSSTDS 863
LQK VP C+I + S +S
Sbjct: 718 LQKAVPTCKIVNQSIES 734
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 54/118 (45%), Gaps = 11/118 (9%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQA 252
E + IA+ S + S + + KKIS + + E GL DG + Y G K
Sbjct: 239 EPDLIAKPSKVASVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKL-- 296
Query: 253 SGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF--ENGKSLLEV 308
L G R GILC C CN C V+ PS FE HA R+ Y C NG SL ++
Sbjct: 297 --LGGYKRGFGILCHC--CN-CEVVSPSTFEAHAGWATRK-KPYACIYTSNGVSLHDL 348
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 809 bits (2089), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/892 (50%), Positives = 560/892 (62%), Gaps = 149/892 (16%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1589 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1628
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKEGGS 151
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+ +
Sbjct: 1629 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1684
Query: 152 ----------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1685 DGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1743
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1744 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1800
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1801 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1860
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K P L NSC+K
Sbjct: 1861 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKMDP--LGNSCIKR------- 1902
Query: 374 TYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPP 430
N++P TS+H++S+ R K +TK S + N+S
Sbjct: 1903 -----------------NNSPATSIHRTSERARLLKPIPVTKSSGSALY------NSSE- 1938
Query: 431 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 490
NKS IT KDQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+
Sbjct: 1939 ----NKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHC 1994
Query: 491 EVSPSQFEAHA----------------------------------------------DGG 504
EVS SQFEAHA DGG
Sbjct: 1995 EVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGG 2054
Query: 505 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 564
NLL CDGCPRAFH+ CASL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +
Sbjct: 2055 NLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPI 2114
Query: 565 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 624
EQITKRCIRIV N EAE+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM
Sbjct: 2115 EQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQ 2173
Query: 625 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVR 683
DL+ELP GKWFCC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VR
Sbjct: 2174 DLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVR 2233
Query: 684 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYC 743
WRLLSGK A+PETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YC
Sbjct: 2234 WRLLSGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYC 2293
Query: 744 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 803
A++TVNS VVSAGILRVFGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S V
Sbjct: 2294 AVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFV 2353
Query: 804 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
LPAAEEAE IWT KFGFKKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2354 LPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2405
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 792 bits (2045), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/687 (56%), Positives = 490/687 (71%), Gaps = 80/687 (11%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I ++KKP TV ELF+TGLLDGV VVY+G K + LRG I+DGGILCSCSLCNG RV
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
IPPS+FEIHAC Y+RA+QYIC ENGKS+LE++RACR+ PL L+AT+Q+ ++S PEEK
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVT 396
F C C+G FP + V + G LC SCV+S+K + + + G RI SS+ + + P+T
Sbjct: 207 FTCKNCRGCFPSSNVERVGL--LCLSCVESRKSEKSSIHAVGKRIRSSKLSVKLKTAPIT 264
Query: 397 SVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDES 456
S S Q NKS+W I+ + QRLHKL+F+E
Sbjct: 265 SKCLSPQ-------------------------------NKSQWRISKRYQRLHKLIFEED 293
Query: 457 GLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--------------- 501
GLP+G EV YYA GQKLLEG K GI+C CCN+E+SPSQFE HA
Sbjct: 294 GLPNGAEVAYYARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYTS 353
Query: 502 -------------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGD 530
DGGNLL CDGCPRAFHKECAS+SSIP+G+
Sbjct: 354 NGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGE 413
Query: 531 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 590
WYC+ CQ+ F R+R + H+A+AV AGRV GVD +EQI KRCIRIVK++ AE+ GC+LCR
Sbjct: 414 WYCQICQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCRS 473
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 650
DFS+SGFGPRTI++CDQCE+E+HVGCL+ HKMA L+ELP+G WFCC DC+RI+S L+NL
Sbjct: 474 SDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLENL 533
Query: 651 LVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 709
L++ AE+LPE L+ IKK G LE +++IDVRW+LL+GK A+PETR LL +AV++FH+
Sbjct: 534 LIRVAERLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGKIASPETRPLLLEAVSMFHE 593
Query: 710 CFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 769
CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L VNSSVVSAG++R+FG+++AELP
Sbjct: 594 CFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAELP 653
Query: 770 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 829
LVAT N GKGYFQ LFACIE+LL+FL VK++VLPAAEEA SIWT+KFGF K+ P L+
Sbjct: 654 LVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKPNQLT 713
Query: 830 IYRKRCSQLVTFKGTSMLQKRVPACRI 856
YR C Q++ FKGT+ML K VP CR+
Sbjct: 714 NYRMNCHQIMAFKGTNMLHKTVPQCRV 740
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/881 (50%), Positives = 556/881 (63%), Gaps = 147/881 (16%)
Query: 47 TKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVV 106
+ +G I YSR KR + LE+ D+R + + E ES
Sbjct: 68 NRWDGVIQYSRNKRLK------RLEESKNDER--------------RTIAEEPKDDESTT 107
Query: 107 EEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKEGGS----------- 151
+EE Q T EN V+E+ G PIC+EEP S+ + K+ +
Sbjct: 108 DEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESNDGSLKLQTAGL 163
Query: 152 -----EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSAL 204
E+ + +K KR SA+K K + VE L + F N ++ + +L
Sbjct: 164 IDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGVDEKTNGAVRSL 222
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
TSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G + + L+G I+ GI
Sbjct: 223 TSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDG--RKKGYRLQGTIKGNGI 279
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+ PL L+AT+
Sbjct: 280 LCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATI 339
Query: 325 QSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSS 384
QSA+ S P ++S P K P L NSC+K
Sbjct: 340 QSAIGSFPVKRSL---------PADEAAKMDP--LGNSCIKR------------------ 370
Query: 385 RPGLIANSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNI 441
N++P TS+H++S+ R K +TK S + S NKS I
Sbjct: 371 ------NNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSE-----------NKSLGKI 413
Query: 442 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 501
T KDQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHA
Sbjct: 414 TKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHA 473
Query: 502 ----------------------------------------------DGGNLLPCDGCPRA 515
DGGNLL CDGCPRA
Sbjct: 474 GWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRA 533
Query: 516 FHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIV 575
FH+ CASL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV
Sbjct: 534 FHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV 593
Query: 576 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 635
N EAE+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWF
Sbjct: 594 -NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWF 652
Query: 636 CCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATP 694
CC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+P
Sbjct: 653 CCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASP 712
Query: 695 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVS 754
ETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVS
Sbjct: 713 ETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVS 772
Query: 755 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIW 814
AGILRVFGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPAAEEAE IW
Sbjct: 773 AGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIW 832
Query: 815 TDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
T KFGFKKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 833 TKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 873
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 767 bits (1981), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/712 (56%), Positives = 490/712 (68%), Gaps = 82/712 (11%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L PMTV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCV------KSKKPQGTMTYTTGIRISSSRPGL 388
++F C RCK + + P L S K +KP+ +
Sbjct: 121 RTFTCKRCKEQ--VLALEYFKPASLSTSSQDNTPRKKKRKPEEQDS-------------- 164
Query: 389 IANSTPVTSVHKSSQSQRQRKITKK-----------SKKTVLISKPFENASPPLSFPNKS 437
I + SV+ SS+ ++ +KI+ + +L PF F K
Sbjct: 165 ITKPSKSASVYLSSRKRKYKKISPRLVCFFYPIDILFGLVMLSPFPFLWLVKIFVFIRKY 224
Query: 438 RWNITP---------KDQRLHKLVFDESGLPDGTEVGYYACGQ----------------- 471
+ ++P +DQRLH+LVF+E GLPDGTE+ YYA GQ
Sbjct: 225 AY-LSPFCPFSGYQSQDQRLHRLVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIV 283
Query: 472 --------KLLEGYK---NGLGIICHCCNSEVSPSQFEAH----------ADGGNLLPCD 510
KLL GY G+ H +S S+ + ADGGNLL CD
Sbjct: 284 NKINSSSQKLLGGYAYIYTSNGVSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCD 343
Query: 511 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 570
GCPRAFHK CAS+ ++P GDWYC+YCQN FER++ ++H+ANA AGR SG+DS+EQITKR
Sbjct: 344 GCPRAFHKGCASIPTVPSGDWYCQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKR 403
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
C RIVKN+EAEL+GC LCRG DF +SGFGPRTI+LCDQCE+EFHVGCL+ HKM +L+ELP
Sbjct: 404 CFRIVKNIEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELP 463
Query: 631 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 689
KG WFCCMDCSRI+S LQ LL++ AEKLP+ LN I KK+ L ++IDVRW LLSG
Sbjct: 464 KGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSG 523
Query: 690 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVN 749
K A+PE +LLLS+A++IF +CFDPIVDS GRDLIP MVYG+N +GQ++GGMYCA+LT+N
Sbjct: 524 KIASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTIN 583
Query: 750 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 809
SS+VSAGILRVFG+EVAELPLVAT HGKGYFQLLF+CIEKLL+FL V+++VLPAAEE
Sbjct: 584 SSIVSAGILRVFGEEVAELPLVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEE 643
Query: 810 AESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 861
AESIWT+KFGF+KI PE L+ YRK C Q+V F+GTSMLQK VP CRI + T
Sbjct: 644 AESIWTEKFGFQKIKPEQLNKYRKSCCQMVRFEGTSMLQKAVPTCRIVNQRT 695
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/730 (54%), Positives = 506/730 (69%), Gaps = 97/730 (13%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTS 397
C RCKG FP +C+ + GP +C+SCV+S K + + RI S RP L++ S+ +
Sbjct: 118 TCKRCKGRFPSSCMERVGP--ICSSCVESSKSEESSKNVVSKRIRSPRPVLVSKSSCASE 175
Query: 398 VHKSSQSQR---------------------------QRKITKKSKKTVLISKPFENASPP 430
+ S + +R +RK+T K+KK L K ++
Sbjct: 176 MSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILPRRKVTPKTKKKSLSVKLKTTSNSN 235
Query: 431 LSFPN-KSRWNITPK----------DQRLHKLVFDESGLPDGTEVGYYACGQ------KL 473
P KS W IT K D RLHKLVF+E+GLPDG+E+ YYA GQ KL
Sbjct: 236 CLSPQIKSEWKITKKLVPYSFPTCGDNRLHKLVFEENGLPDGSELAYYAGGQLYSDRQKL 295
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHA-------------------------------- 501
LEG+K G GI+C CCN+E+SPSQFE HA
Sbjct: 296 LEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVSLHELSISLSKDRK 355
Query: 502 --------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 547
DGGNLL CDGCPRAFHKECASLSSIP+GDWYC++CQNMF+R++F+
Sbjct: 356 YSANDNDDLCVVCWDGGNLLLCDGCPRAFHKECASLSSIPRGDWYCQFCQNMFQREKFVA 415
Query: 548 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 607
++ NA AGRV GVD +EQITKRCIRIVK+++AELS C LCRG DFSKSGFGPRTI+LCD
Sbjct: 416 YNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAELSACALCRGVDFSKSGFGPRTIILCD 475
Query: 608 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 667
QCE+E+HVGCL+ HKM L+ELPKG W CC DC+RI+S L+N+LV+ AE+LP+ L IK
Sbjct: 476 QCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCTRIHSTLENVLVRGAERLPKSLLAVIK 535
Query: 668 K-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 726
K L+ ++DI+VRWRLLSGK A+PETR LL +AV+IFH+CFDPIVD++SGRDLI +
Sbjct: 536 KKQEEKGLDPINDINVRWRLLSGKKASPETRPLLLEAVSIFHECFDPIVDAVSGRDLIRA 595
Query: 727 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
MVYG+++RGQEFGGMYCA+L VNSSVVSAG+LR+FG ++AELPLVATS HGKGYFQ L
Sbjct: 596 MVYGKSVRGQEFGGMYCALLIVNSSVVSAGMLRIFGTDIAELPLVATSNSQHGKGYFQAL 655
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 846
F+CIE+LL+F++VK++VLPAAEEA+SIWTDKFGF KI P+ L+ YR+ C+Q VTF+GT+M
Sbjct: 656 FSCIERLLAFMKVKNLVLPAAEEAQSIWTDKFGFSKIKPDELANYRRNCNQFVTFQGTNM 715
Query: 847 LQKRVPACRI 856
L K VP CR+
Sbjct: 716 LHKMVPPCRV 725
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/700 (54%), Positives = 483/700 (69%), Gaps = 81/700 (11%)
Query: 208 KKNLELKMSKKI-SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILC 266
K LK +KKI ++KKP+TV ELF+TGLLDGV VVY+G K + LRG I+DGGILC
Sbjct: 85 KTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILC 144
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
SC LCNG RVIPPS+FEIHAC Y+RA+QYIC ENGKSLL+++RACR+ PL L+AT+Q+
Sbjct: 145 SCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQN 204
Query: 327 ALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRP 386
++S PEEK F C C+G P G Y + I +
Sbjct: 205 FINSPPEEKYFTCKSCRG------------------------PLGQ--YYSPIHVHVV-- 236
Query: 387 GLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQ 446
+ S+ K S RQ + L + P S LS NKS+W I+ + Q
Sbjct: 237 ---LLNLNSVSLLKLRNSGRQEQSWSSKLSVKLKTVPI--TSKCLSPQNKSQWRISKRYQ 291
Query: 447 RLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----- 501
RLHKL+F+E GLP+G EV YYA GQKLLEG K GI+C CCN+EVSPSQFE HA
Sbjct: 292 RLHKLIFEEDGLPNGAEVAYYARGQKLLEGIKTRCGIVCRCCNTEVSPSQFEVHAGWASR 351
Query: 502 -----------------------------------------DGGNLLPCDGCPRAFHKEC 520
DGGNLL CDGCPRAFHKEC
Sbjct: 352 RKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKEC 411
Query: 521 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 580
AS+SSIP+G+WYC+ CQ+ F R+R + ++A+AV AGRV GVD +EQI KRCIRIVK++ A
Sbjct: 412 ASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGA 471
Query: 581 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
E+ GC+LCR DFS+SGFGPRTI++CDQCE+E+HVGCL+ HK A L+ELP+G WFCC DC
Sbjct: 472 EMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKAYLKELPEGDWFCCNDC 531
Query: 641 SRINSVLQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLL 699
+ I+S L+NLL++ AE+LPE L+ IKK LE +++IDVRW+LL+GK A+PETR L
Sbjct: 532 TIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVRWKLLNGKIASPETRPL 591
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILR 759
L +AV++FH+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L VNSSVVSAG++R
Sbjct: 592 LLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVR 651
Query: 760 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 819
+FG+++AELPLVAT N GKGYFQ LFACIE+LL+FL VK++VLPAAEEAESIWT+KFG
Sbjct: 652 IFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAESIWTEKFG 711
Query: 820 FKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 859
F K+ + L+ YR C Q++ FKGT+ML K VP CR+ +S
Sbjct: 712 FSKMKLDQLTNYRMNCHQIMAFKGTNMLHKTVPRCRVTNS 751
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 55/118 (46%), Gaps = 14/118 (11%)
Query: 193 IEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQ 251
++++ + S SP+ + ++SK+ K + E GL +G V Y G K
Sbjct: 264 VKLKTVPITSKCLSPQNKSQWRISKRYQRLHKLI----FEEDGLPNGAEVAYYARGQKL- 318
Query: 252 ASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRAS-QYICFENGKSLLEV 308
L GI GI+C C CN + PS+FE+HA RR YI NG SL E+
Sbjct: 319 ---LEGIKTRCGIVCRC--CNT--EVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHEL 369
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/957 (44%), Positives = 560/957 (58%), Gaps = 161/957 (16%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDD-------LLEDDVIDKRINSK----IHEGRI- 88
CKR K T+VNGFIVY+R ++++F+ + LLE+ + + SK + G I
Sbjct: 42 CKRIKTTQVNGFIVYTRTRKTKFTKLHEQGDENAGLLENRMSNHLEESKPTIGVTNGSIG 101
Query: 89 ------NKVVKNVLNENGILESVVEEE----------------NQLVQMTVENVIEETVK 126
N +KN E+ + VEE + LV + ++++ +
Sbjct: 102 ETNVSGNSCIKNTFVESPAGKIAVEERLVTGSLAESPAVETDSSSLVDVVIDDINFVELL 161
Query: 127 GKKAPI------CKEEPISKVECFPRKEGGS-EVSNGLNKKCLKRPSAMKPKV------- 172
++ P+ + + ++ R G S VS KR + + +
Sbjct: 162 HEEIPVEILSEGSLDFEVKRLGTKVRTMGKSYSVSEKKRHGSFKRTAQIYKSILRMKKVN 221
Query: 173 --EPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTE 230
P V V FG E L ++ L K I + ++P TV E
Sbjct: 222 NLVPENVEVLSEPDFGRE--------------GLDEQSHSVSLA-DKSILIRRRPETVRE 266
Query: 231 LFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQY 290
LFETG+LDG+SVVYMG +K QA GLRGII+DGGILCSCS C+ VI SKFEIHACKQY
Sbjct: 267 LFETGILDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHVISTSKFEIHACKQY 326
Query: 291 RRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITC 350
RRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK F C RCKG FP +
Sbjct: 327 RRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKCFTCKRCKGAFPFSS 386
Query: 351 VGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIAN--------------STPVT 396
+G G LC SC + + Q + + S+S P IA+ S ++
Sbjct: 387 LGHRGF--LCMSCSEVETSQAS---PAAMWTSTSSPACIASPVKSRLKITRKPSESMSIS 441
Query: 397 SVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSR---------WNITPK--- 444
V S R IT+K+ + L+ K + +AS +S NK R +++TPK
Sbjct: 442 PVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSKFKKMLTQYSVTPKAVK 501
Query: 445 ------------------DQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICH 486
DQ LHKLVF+ GLP+GTE+GYYA GQKLL GYK G GI C+
Sbjct: 502 SVSLSVSSKKRSYRLTRKDQGLHKLVFERGGLPEGTELGYYARGQKLLGGYKMGAGIYCY 561
Query: 487 CCNSEVSPSQFEAHA--------------------------------------------- 501
CC SEVSPS FEAHA
Sbjct: 562 CCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSANDNNDLCVIC 621
Query: 502 -DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNLL CD CPRAFH EC SL SIP+G+W+CKYC+N F + +++ N+ G++ G
Sbjct: 622 ADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEG 681
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VD V+Q RCIR+VKN+EAE +GC+LC G DF +SGFGPRTI++CDQCE+E+H+GCL
Sbjct: 682 VDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSS 741
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSD 679
+ DL+ELPKG WFC MDC+RINS LQ LL+ AE L + L I +K + ++SD
Sbjct: 742 QNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQERTDVYSISD 801
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 739
+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SGR+LIP MVYG+ ++GQ++G
Sbjct: 802 LDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRMVYGKTMQGQDYG 861
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT + KGYFQLLF+CIEKLLS L V
Sbjct: 862 GICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNV 921
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C Q+V FKG SMLQK V A +I
Sbjct: 922 ESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVHAHQI 978
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/889 (47%), Positives = 528/889 (59%), Gaps = 192/889 (21%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1654 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1693
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKEGGS 151
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+ +
Sbjct: 1694 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1749
Query: 152 ----------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1750 DGSLKLQTAXLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1808
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1809 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1865
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1866 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1925
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K P L NSC+K
Sbjct: 1926 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKMDP--LGNSCIKR------- 1967
Query: 374 TYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSF 433
N++P TS+H++S+ R
Sbjct: 1968 -----------------NNSPATSIHRTSERAR--------------------------- 1983
Query: 434 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 493
DQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS
Sbjct: 1984 -----------DQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVS 2032
Query: 494 PSQFEAHA----------------------------------------------DGGNLL 507
SQFEAHA DGGNLL
Sbjct: 2033 ASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLL 2092
Query: 508 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 567
CDGCPRAFH+ CASL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQI
Sbjct: 2093 LCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQI 2152
Query: 568 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 627
TKRCIRIV N EAE+S C+LCRG DFSKSGFGPRTI+LCDQ
Sbjct: 2153 TKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQ------------------- 2192
Query: 628 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRL 686
ELP GKWFCC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRL
Sbjct: 2193 ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRL 2252
Query: 687 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 746
LSGK A+PETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++
Sbjct: 2253 LSGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVI 2312
Query: 747 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 806
TVNS VVSAGILRVFGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPA
Sbjct: 2313 TVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPA 2372
Query: 807 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
AEEAE IWT KFGFKKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2373 AEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2421
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 743 bits (1917), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/731 (52%), Positives = 481/731 (65%), Gaps = 96/731 (13%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLI------- 389
F C RCKG FP + +G G LC SC + + Q ++ T R S+S P I
Sbjct: 373 FTCKRCKGPFPFSSLGHRGF--LCKSCSEVETSQASLAAT---RTSTSAPACITSPVKSR 427
Query: 390 -------ANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSR---- 438
+ ST ++ V SS RKIT+K+ + L+ K + +AS +S K R
Sbjct: 428 LKITRKPSESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFK 487
Query: 439 -----WNITPK---------------------DQRLHKLVFDESGLPDGTEVGYYACGQK 472
++TPK DQ LHKLVFD GLP+GTE+GYYA GQK
Sbjct: 488 KMLTQHSVTPKALKSVSLSVSSKKRSYRLARKDQGLHKLVFDRGGLPEGTELGYYARGQK 547
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------------------- 501
LL GYK G GI C+CC EVSPS FEAHA
Sbjct: 548 LLGGYKMGAGIYCYCCKCEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSHGR 607
Query: 502 ---------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 546
DGGNLL CD CPRAFH EC SL SIP+G+W+CKYC+N F +
Sbjct: 608 KYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAG 667
Query: 547 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 606
+++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +GC+LC G DF +SGFGPRTI++C
Sbjct: 668 EYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIIC 727
Query: 607 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI 666
DQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AEKL + L I
Sbjct: 728 DQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGII 787
Query: 667 K-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 725
+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SG +LIP
Sbjct: 788 QTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIP 847
Query: 726 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 785
MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT + KGYFQL
Sbjct: 848 RMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQL 907
Query: 786 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTS 845
LF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C Q+V FKG S
Sbjct: 908 LFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGAS 967
Query: 846 MLQKRVPACRI 856
MLQK V + +I
Sbjct: 968 MLQKPVDSHQI 978
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGK 128
S V E N V+N + E+ GK
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGK 122
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/773 (47%), Positives = 484/773 (62%), Gaps = 88/773 (11%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
+ +QSA+ +++ C+ CKG P + G LC SC+ SKKPQ
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIAML--LCCSCMDSKKPQAI----- 453
Query: 378 GIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKS 437
L++ S H + + K VL SK + + +S K
Sbjct: 454 ---------DLLSLS------HYYMKEFWADHLIITPKPNVL-SKSSDTITKSVSTRGKI 497
Query: 438 RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQF 497
IT KD RLHKLVF+E LPDGTEV YYA GQKLL GYK G GI C CCNSEVSPSQF
Sbjct: 498 HGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQKLLVGYKKGSGIFCSCCNSEVSPSQF 557
Query: 498 EAHA----------------------------------------------DGGNLLPCDG 511
EAHA DGG+LL CDG
Sbjct: 558 EAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICADGGDLLCCDG 617
Query: 512 CPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC 571
CPR+FH++C L IP G WYCKYCQN+F++++F++H+ANAV AGRV+GVD +EQIT RC
Sbjct: 618 CPRSFHRDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRC 677
Query: 572 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 631
IRIVK +E E+ GC LCR DFSKSGFGPRT++LCDQCE+EFHVGCLK++ M DL+ELP+
Sbjct: 678 IRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQ 737
Query: 632 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGK 690
GKWFCC +C+RI+S L+ L+V EKLPE L ++ KK +++D+++RWR+L+ K
Sbjct: 738 GKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDVEIRWRVLNWK 797
Query: 691 A-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVN 749
++ ETR LLS+AV+IFHDCFDPIVDS SGRD IPSM+YGRN+RGQEFGG+YCA+LTVN
Sbjct: 798 MLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVN 857
Query: 750 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 809
SVVS GI R+FG EVAELPLVAT G+GYFQ L+ACIE+ L FL VK++VLPAA+E
Sbjct: 858 ESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADE 917
Query: 810 AESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTD 862
AES+W +KFGF K+ PE + + KR Q++ F+GTSMLQK VP R+ +S+ +
Sbjct: 918 AESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGTSMLQKEVPKYRVINSAAN 969
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/773 (46%), Positives = 485/773 (62%), Gaps = 88/773 (11%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
+ +QSA+ +++ C+ CKG P + G LC SC+ S+KPQ + + +
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIAML--LCCSCMDSRKPQVSSSPSP 458
Query: 378 GIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKS 437
S + + TP K ++SK + + +S K
Sbjct: 459 SPSPSPTPIVFSKDRTP---------------------KPNVLSKSSDTITKSVSTRGKI 497
Query: 438 RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQF 497
IT KD RLHKLVF+E LPDGTEV YYA GQKLL GYK G GI C CCNSEVSPSQF
Sbjct: 498 HGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQKLLVGYKKGSGIFCSCCNSEVSPSQF 557
Query: 498 EAHA----------------------------------------------DGGNLLPCDG 511
EAHA DGG+LL CDG
Sbjct: 558 EAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICADGGDLLCCDG 617
Query: 512 CPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC 571
CPR+FH++C L IP G WYCKYCQN+F++++F++H+ANAV AGRV+GVD +EQIT RC
Sbjct: 618 CPRSFHRDCVPLPCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGVDPIEQITTRC 677
Query: 572 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 631
IRIVK +E E+ GC LCR DFSKSGFGPRT++LCDQCE+EFHVGCLK++ M DL+ELP+
Sbjct: 678 IRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENNMEDLKELPQ 737
Query: 632 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGK 690
GKWFCC +C+RI+S L+ L+V EKLPE L ++ KK +++D+++RWR+L+ K
Sbjct: 738 GKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDVEIRWRVLNWK 797
Query: 691 A-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVN 749
++ ETR LLS+AV+IFHDCFDPIVDS SGRD IPSM+YGRN+RGQEFGG+YCA+LTVN
Sbjct: 798 MLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFGGIYCAVLTVN 857
Query: 750 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 809
SVVS GI R+FG EVAELPLVAT G+GYFQ L+ACIE+ L FL VK++VLPAA+E
Sbjct: 858 ESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNVKNLVLPAADE 917
Query: 810 AESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTD 862
AES+W +KFGF K+ PE + + KR Q++ F+GTSMLQK VP R+ +S+ +
Sbjct: 918 AESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGTSMLQKEVPKYRVINSAAN 969
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/723 (48%), Positives = 445/723 (61%), Gaps = 129/723 (17%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVT 396
F C RCK F CV + RP + ST ++
Sbjct: 373 FTCKRCKEGF----------------CVSHAR----------------RP---SESTSIS 397
Query: 397 SVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSR------------------ 438
V SS RKIT+K+ + L+ K + +AS +S K R
Sbjct: 398 PVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFKKMLVFLLHNDVLML 457
Query: 439 --------------------WNITP---------KDQRLHKLVFDESGLPDGTEVGYYAC 469
+ P KDQ LHKLVFD GLP+GTE+GYYA
Sbjct: 458 AEPTLMIKSLILLHLVSLYVLKVDPTFCDPQGFEKDQGLHKLVFDRGGLPEGTELGYYAR 517
Query: 470 GQKLL-------EGYKNGL------------------GIICHCCNSEVSPS-QFEAH--- 500
GQ + E K L G+ H + S ++ A+
Sbjct: 518 GQTYITVDRNCSEATKWALEYIVTVASASYFYIYTSNGVSLHEWATTFSHGRKYSANDNN 577
Query: 501 ------ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 554
ADGGNLL CD CPRAFH EC SL SIP+G+W+CKYC+N F + +++ N+
Sbjct: 578 DLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSA 637
Query: 555 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 614
G++ GVD V+Q+ RCIR+VKN+EAE +G SGFGPRTI++CDQCE+E+H
Sbjct: 638 VGQLEGVDPVDQLAGRCIRVVKNMEAETNG-----------SGFGPRTIIICDQCEKEYH 686
Query: 615 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNS 673
+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AEKL + L I+ K N
Sbjct: 687 IGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQERND 746
Query: 674 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNL 733
+ ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SG +LIP MVYG+ +
Sbjct: 747 VYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTM 806
Query: 734 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 793
+GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT + KGYFQLLF+CIEKL
Sbjct: 807 QGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKL 866
Query: 794 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
LS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C Q+V FKG SMLQK V +
Sbjct: 867 LSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDS 926
Query: 854 CRI 856
+I
Sbjct: 927 HQI 929
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 340/708 (48%), Positives = 439/708 (62%), Gaps = 65/708 (9%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLR 256
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y+ K + GL+
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQ 404
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
G+I+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 405 GVIQGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAP 464
Query: 317 LPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYT 376
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T +
Sbjct: 465 ADALEAAIRNATGAGDARKSTFCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTTPSQV 522
Query: 377 TGIRISSSRPG-----LIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL 431
+ SR G + N ++K + S +VL K E S
Sbjct: 523 GDA--NCSRDGQLEFIFLLNYYWADDLYKLGLPDLRGLQWSPSSNSVL--KSTERMSSGT 578
Query: 432 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 491
P+K +T KD R+HKLVF+ LPDGT + YY G+KLLEGYK G I C+CC SE
Sbjct: 579 CPPSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGKKLLEGYKKGGAIFCYCCQSE 638
Query: 492 VSPSQFEAHA----------------------------------------------DGGN 505
VSPSQFEAHA DGG+
Sbjct: 639 VSPSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGD 698
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE 565
LL CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D++E
Sbjct: 699 LLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALE 757
Query: 566 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 625
QITK IRIV L AE+ C+LCR DFS SGFGP+T+++CDQCE+E+HV CL++H M D
Sbjct: 758 QITKCSIRIVDTLHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDD 817
Query: 626 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRW 684
L+ELPK KWFCC +C+ I+ LQ L+ + LP+ + I +K +LE S DV+W
Sbjct: 818 LKELPKDKWFCCKECNSIHYALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSINDVKW 877
Query: 685 RLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR-DLIPSMVYGRNLRGQEFGGMYC 743
RLLSGK +T ETR+ LS AV+IFHD FDPI DS + R DLIP+MVYGRN + Q+FGGM C
Sbjct: 878 RLLSGKNSTEETRVWLSGAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLC 937
Query: 744 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 803
AIL VNS VVSAG++R+FG+EVAELPLVATS GKGYFQ LF IE LL L VK +V
Sbjct: 938 AILMVNSLVVSAGVIRIFGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLV 997
Query: 804 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
LPAAEEAESIWT KFGF+ I PE L Y+ QL+ F+GT+MLQK+V
Sbjct: 998 LPAAEEAESIWTKKFGFQHITPEELKHYKDN-YQLMIFQGTAMLQKQV 1044
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 338/752 (44%), Positives = 451/752 (59%), Gaps = 108/752 (14%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+EP+++ + S+G + V AI T+P K + L K P
Sbjct: 244 SALKPKIEPLDI--SSSDGVKVDDTGSSSVAAIT-----TTPTKMFAID-----GLKKFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++++ GILC C C G V+ P+ F
Sbjct: 292 TKLKDLLDSGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIF 351
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA +R +YI ENG +L +V+ AC++ L +L ++ ++ P +KS C+ C
Sbjct: 352 ELHAGSANKRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSC 411
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSS 402
+G+ IT G LC+ C++ K Q + T + + RP + S+ S
Sbjct: 412 RGS--ITGAGTRKSKVLCSQCLELKDSQAILAPETDTKERTPRPSPVPESSSALLKSSPS 469
Query: 403 QSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGT 462
+S Q ++TKK D R+HKLVF+E LPDGT
Sbjct: 470 RSNSQGRLTKK-------------------------------DIRMHKLVFEEEVLPDGT 498
Query: 463 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--------------------- 501
EVGYY+ G+KLL GYK G GI C CCN+EVSPSQFEAHA
Sbjct: 499 EVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLH 558
Query: 502 -------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
DGG LL CD CPRAFH+EC SL SIP+G WYCKYC
Sbjct: 559 ELAISLSKCRRHSTKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPKGKWYCKYC 618
Query: 537 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 596
N FE+++F++ +ANA+ AGRV+G D +EQIT+RCIRIVK EAE+ GC+ CRG DF ++
Sbjct: 619 LNTFEKEKFVERNANAIAAGRVAGTDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT 678
Query: 597 GFGPRTILLCDQCEREFHVGCLKKHKMADLR---ELPKGKWFCCMDCSRINSVLQNLLVQ 653
FGPRT+++CDQCE+EFHVGCLK+H+M DL+ ELP GKWFCC C RI+S LQ L+++
Sbjct: 679 -FGPRTVIICDQCEKEFHVGCLKEHQMQDLKAICELPTGKWFCCTGCERIHSALQKLVIR 737
Query: 654 EAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPE-TRLLLSQAVAIFHDCF 711
EKLP+ LN IKK + ++ E+ D+RWRLLS K + T LLS+AVAIFH+ F
Sbjct: 738 GEEKLPDSSLNFIKKKHEESASESGGGDDIRWRLLSKKTDPSDVTESLLSEAVAIFHERF 797
Query: 712 DPIVDSISGR-----DLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 766
PI S R D IPSMV G +++GQ+ GGMYCA+L VN VVSA ++R+FGQE+A
Sbjct: 798 APITVDKSKRKRDDHDFIPSMVKGGDMKGQDLGGMYCAVLLVNHEVVSAAVMRIFGQELA 857
Query: 767 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI--D 824
ELP+VATS + G+GYFQ LF CIEKLL FL VK++VLPAAEE ESIWT+KFGF I D
Sbjct: 858 ELPIVATSSKSQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVESIWTNKFGFSTITQD 917
Query: 825 PELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
L YRK Q++ F+G+ MLQK VP CR+
Sbjct: 918 EVRLMEYRKS-YQIMEFQGSLMLQKPVPKCRV 948
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 342/746 (45%), Positives = 443/746 (59%), Gaps = 100/746 (13%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+E V++ + S+G ++V+ SA + N KM K P
Sbjct: 266 SALKPKIETVDI--SSSDG--------VKVDDRGSSSAAAATTTNTPTKMFSIDGSKKFP 315
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++R+ GILC C C G V+ P+ F
Sbjct: 316 TKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVTPAIF 375
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
+HA +R +YIC ENG +L +V+ AC++ L L ++ + P +KS C C
Sbjct: 376 VLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNFCWNC 435
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSS 402
+G+ IT G LC+ C K Q T + +++P + S+ S
Sbjct: 436 RGS--ITGAGSRKSKVLCSQCFGLKDFQAGSAPKTAKKERTAKPHSVPESSCNLLKSSLS 493
Query: 403 QSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGT 462
S+ Q ++TKK D R HKLVF+E LPDGT
Sbjct: 494 GSKSQGRVTKK-------------------------------DIRTHKLVFEEEVLPDGT 522
Query: 463 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--------------------- 501
EVGYY G+KLL GYK G GI C CCNSEVSPSQFEAHA
Sbjct: 523 EVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIYTSNGVSLH 582
Query: 502 -------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
DGG LL CD CPRAFH+EC SL SIP+G WYCKYC
Sbjct: 583 ELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPRGKWYCKYC 642
Query: 537 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 596
N FE+++F++ +ANA+ AGRV+GVD +EQIT+RCIRIVK EAE+ GC+ CRG DF ++
Sbjct: 643 LNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT 702
Query: 597 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 656
FGPRT+++CDQCE+EFHVGCLK+HKM DL+ELPKGKWFCC C RI+S LQ L+++ E
Sbjct: 703 -FGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCCTGCERIHSALQKLVIRGEE 761
Query: 657 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPE-TRLLLSQAVAIFHDCFDPIV 715
KLP+ LN IKK+ ++ E+ DVRWRLLS K + + T LLS AVAIFH+CFDPI
Sbjct: 762 KLPDSSLNFIKKHEESASESGCSDDVRWRLLSKKTDSSDVTEALLSDAVAIFHECFDPIT 821
Query: 716 DSISGR-----DLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 770
S R D IPSMV G N++GQ+ GGMYCA+L VN VVS ++R+FGQE+AELP+
Sbjct: 822 VDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLVNHVVVSVAVVRIFGQELAELPI 881
Query: 771 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 830
VATS G+GYFQ LF CIEKLL FL VK++VLPAAEE SIW +KFGF I + L
Sbjct: 882 VATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVGSIWKNKFGFGAITQDELME 941
Query: 831 YRKRCSQLVTFKGTSMLQKRVPACRI 856
YR+R Q++ F+G MLQK VP CRI
Sbjct: 942 YRRR-YQIMVFQGALMLQKPVPKCRI 966
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 330/748 (44%), Positives = 441/748 (58%), Gaps = 90/748 (12%)
Query: 166 SAMKPKVE-PVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
S +KPK+E E V S +++ GS + LK+ K + K
Sbjct: 321 SLLKPKMEIGQEYAVKDSSSAADDA-----------GSPSAASNSGTMLKVWKNDTSKKF 369
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVIPPSK 281
P + +L ++G+L+G V YM G K + +G L+G+I ILC C C G V+ PS
Sbjct: 370 PTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSCRGNEVVTPSI 429
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI ENG +L +V+ AC++ L L L + + S C++
Sbjct: 430 FEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCSSLKNSTFCLK 489
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKS 401
C+G G++ LC+ C+ K Q ++ TT + + A +T
Sbjct: 490 CRGKLAEASTGRSMT--LCSQCMVLKDSQASIPATTDTDKGYAESDVCAYRIVLTP---- 543
Query: 402 SQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDG 461
+ ++K S + S + +KS+ +T KD R+HKLVF+E LPDG
Sbjct: 544 ----KSHPVSKSSDSVLKCS----------TSRSKSQGRLTVKDLRMHKLVFEEDVLPDG 589
Query: 462 TEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------------------- 501
TEV YY+ GQKLL GYK G GI C CCN+EVSPSQFEAHA
Sbjct: 590 TEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSNGVSL 649
Query: 502 --------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 535
DGG+LL CD CPRA+HK+C +L IP G WYCK+
Sbjct: 650 HELAISLSKSRKFSTHQNDDLCQICRDGGDLLCCDVCPRAYHKDCLALPEIPTGRWYCKF 709
Query: 536 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 595
C N F++++F++H+ANA+ AGRV+GVD ++QIT+RCIRIVK ++A+ GC+ CRG DF K
Sbjct: 710 CLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVKTMDADFGGCVFCRGHDFDK 769
Query: 596 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 655
FGPRT+LLCDQCE+EFHVGCLK H M DL+ELPKG WFCC DC RI+S L+ L+++
Sbjct: 770 I-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCCSDCCRIHSALEKLVLRGE 828
Query: 656 EKLPEFHLNAIKKYAGNSLETV--SDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFD 712
E+L + LN I K + S+IDVRWRLL+ K +T LLS+A+AI H+ F+
Sbjct: 829 ERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAGDTAALLSEALAILHEQFN 888
Query: 713 PIV----DSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 768
PI+ S + RDLI SMV+G NL+GQEFGGMYCA+L +N +VVS I+R FG E+AEL
Sbjct: 889 PILVAGTSSKADRDLITSMVFGDNLKGQEFGGMYCAVLMINQAVVSCAIIRFFGLELAEL 948
Query: 769 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 828
PLVATS GKGYFQ LF CIEKLL FL +K++VLPAAEEAESIW +KFGF+K+ E
Sbjct: 949 PLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAEEAESIWINKFGFRKLTHEEF 1008
Query: 829 SIYRKRCSQLVTFKGTSMLQKRVPACRI 856
+RK Q++ F+GTSML K VP RI
Sbjct: 1009 LKFRKD-YQMMVFQGTSMLHKPVPKIRI 1035
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 321/730 (43%), Positives = 443/730 (60%), Gaps = 83/730 (11%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
E EA AE S L +P + + S+ L K P + +L TG+L+G+ V+YM G K +
Sbjct: 286 ETEASAEASLLMTPPSSAKFSNSR---LKKFPSKLKDLLATGILEGLPVMYMKGAKVLFA 342
Query: 254 G---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLE 307
G L+G+I+D G+LC C +CNG V+ P+ FE+HA +R +YI +G K+L +
Sbjct: 343 GEKGLQGVIQDSGVLCFCKICNGVEVVTPTVFELHAGSANKRPPEYIYIHDGNCGKTLRD 402
Query: 308 VLRACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVK 365
V+ AC PL + +Q L +KS C+ C+G C G + +C+ C+
Sbjct: 403 VMNACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSKL--VCDLCLA 456
Query: 366 SKKPQGTMT----YTTGIRISSSRPGLIANS-----TPVTSVHKSSQSQRQRKITKKSKK 416
S PQ M + ++ S P +I S P + ++ + ++ S
Sbjct: 457 SP-PQTAMASRKVISQPVQPRSPEPVVIQKSLDNEVQPNSLDNEVPPNSLDNEVQPNSLD 515
Query: 417 TVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEG 476
T + K F N + KS+ +T KD RLHKLVF+ LPDGTE+ YYA GQKLL G
Sbjct: 516 TGVQPKSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVG 575
Query: 477 YKNGLGIICHCCNSEVSPSQFEAHA----------------------------------- 501
YK G GI C CCN +VS SQFEAHA
Sbjct: 576 YKKGCGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFS 635
Query: 502 ------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 549
DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++ R QH+
Sbjct: 636 NNDNDDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHE 695
Query: 550 ANAVEA-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 608
NA+ A GR++G D +E + KRCIR+VK +E + GC LC +FSKS FGPRT+++CDQ
Sbjct: 696 VNALAAAGRIAGPDILELMNKRCIRVVKTVEVDHGGCALCSRPNFSKS-FGPRTVIICDQ 754
Query: 609 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK 668
CE+E+HVGCLK+H M +L +LP+G WFC +CS I++ L +L+ + + +P+ L+ IKK
Sbjct: 755 CEKEYHVGCLKEHNMENLEKLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKK 814
Query: 669 -YAGNSLETVSDIDVRWRLLSGKAATP-----ETRLLLSQAVAIFHDCFDPIVDSISGRD 722
+ SLE + +DV+WR+++ K + ETR LLS+AVAIFH+ FDPIVDS SGRD
Sbjct: 815 KHEEKSLEIGAGLDVKWRVMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRD 874
Query: 723 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 782
IP+M++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G+GY
Sbjct: 875 FIPTMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGY 934
Query: 783 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 842
FQ LF+CIE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y K+ +++ F+
Sbjct: 935 FQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKY-KKFYRMMIFQ 993
Query: 843 GTSMLQKRVP 852
GTS+LQK VP
Sbjct: 994 GTSVLQKPVP 1003
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 350/852 (41%), Positives = 487/852 (57%), Gaps = 114/852 (13%)
Query: 98 ENGILESVVEEENQLVQMTV---ENVIEETVKGKKAPICKEEPISKVECFPRKEGG---S 151
E +L+ V+ EE +V T+ E ++ ET+K + E+P+ E + G +
Sbjct: 142 EPKVLDDVINEEEAIVAETLKEQEPIVPETLKEEVVDEMAEQPLCIEESEEKDSNGVALA 201
Query: 152 EVSNGLN-----KKCLKRP--------SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAI 198
V++G KK L+RP SA+K K E E G +S
Sbjct: 202 LVNDGAKGKKSMKKRLERPQSERRFTRSALKVKSEET----NDGEHVGVAGISDGVKRET 257
Query: 199 AEGSAL--TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF---QAS 253
G++L T+P +K S + L K P + +L TG+L+G+ V+YM G+K
Sbjct: 258 EAGASLVMTTPSS---VKFSNRGKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEK 314
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLEVLR 310
GL+G+I+D G+LC C +C G V+ P+ FE+HA +R +YI +G K+L +V+
Sbjct: 315 GLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMN 374
Query: 311 ACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKK 368
AC PL + +Q L +KS C+ C+G C G + +C+SC+ S
Sbjct: 375 ACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSRL--VCDSCLVSPA 428
Query: 369 PQGTMTYTTGI----RISSSRPGLIA----NSTPVTSVHKSSQSQR-QRKITKKSKKTVL 419
Q + GI + S P +I N S+H Q + + S +
Sbjct: 429 -QTAVASNKGISQPVQPRSPEPVVIQKSLDNEVQPNSLHNEVQPNKLDTGMQPNSLDNGM 487
Query: 420 ISKPFENASPPLSFPN---------KSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 470
N+ P SF N KS+ +T KD RLHKLVF+ LPDGTE+ YYA G
Sbjct: 488 EPDSLNNSMKPKSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHG 547
Query: 471 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------------- 501
QKLL GYK G GI C CCN +VS SQFEAHA
Sbjct: 548 QKLLVGYKKGYGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSK 607
Query: 502 ------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++
Sbjct: 608 DHRRFSNNDNDDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKD 667
Query: 544 RFLQHDANAVEA-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 602
R QH+ NA+ A GR++G D +E + KRCIR+V+ LE + GC LC +FSKS FGP+T
Sbjct: 668 RHGQHEVNALAAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSRPNFSKS-FGPQT 726
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 662
+++CDQCE+E+HVGCLK H M +L ELP G WFC +CS+I++ L +L+ + + +P+
Sbjct: 727 VIICDQCEKEYHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDLVASKEKDVPDPL 786
Query: 663 LNAIKK-YAGNSLETVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSIS 719
LN IKK + SL+ + +DV+WR+++ K + + ETR LLS+AVAIFH+ FDPIVDS S
Sbjct: 787 LNLIKKKHEEKSLDIGAGLDVKWRVINWKLDSDSVETRKLLSKAVAIFHERFDPIVDSTS 846
Query: 720 GRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 779
GRD IP+M++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G
Sbjct: 847 GRDFIPAMLFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQG 906
Query: 780 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 839
+GYFQ LF+CIE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y K+ +++
Sbjct: 907 QGYFQCLFSCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKY-KKFYRMM 965
Query: 840 TFKGTSMLQKRV 851
F+GTS+LQK V
Sbjct: 966 IFQGTSVLQKPV 977
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 328/768 (42%), Positives = 450/768 (58%), Gaps = 73/768 (9%)
Query: 154 SNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGN-------ESMSLIEVEAIA---EGSA 203
S G++KK + + KP LV Q N E L++V+ A E
Sbjct: 297 SQGVDKKAVN-DTVDKPLRRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDG 355
Query: 204 LTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGII 259
SP K + K L P + ++F+ G+L+G+ V Y+ G K + +G L+G+I
Sbjct: 356 FQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVI 415
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ G+LC CS C G +V+ P+ FE+HA +R +YI E+G +L +V+ AC+ PL
Sbjct: 416 KGSGVLCFCSACIGIQVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLAT 475
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTT 377
L+ L+ + + + KS C+ C+G C T +C SC++SK+P+ + +
Sbjct: 476 LEEKLRVVVGPILK-KSSLCLSCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKAN 532
Query: 378 GIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKS 437
SSRP + S S QS R+ + T+KS + ++ + S S + S
Sbjct: 533 DALNGSSRPSVDPKSILRRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNS 592
Query: 438 RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQF 497
+T KD RLHKLVF++ LPDGTEVGY+ G+K+L GYK G GI C CCN VSPS F
Sbjct: 593 HGKLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTF 652
Query: 498 EAHA----------------------------------------------DGGNLLPCDG 511
EAHA DGG L+ CD
Sbjct: 653 EAHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDT 712
Query: 512 CPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC 571
CPR++HK CASL S+P W CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RC
Sbjct: 713 CPRSYHKVCASLPSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRC 772
Query: 572 IRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
IRIV + EL S C+LCRG F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP
Sbjct: 773 IRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELP 832
Query: 631 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRL 686
+ KWFC + C IN+ L NL+V+ EKL LN ++K + E D D+RWR+
Sbjct: 833 EEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRV 892
Query: 687 LSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 745
LSGK ++ +T++LL++A++I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +
Sbjct: 893 LSGKLTSSDDTKILLAKALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTM 952
Query: 746 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 805
L V+ +VS GI RVFG E+AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLP
Sbjct: 953 LAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLP 1012
Query: 806 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
AA+EA+SIWTDKFGF K+ E + YRK S ++ F GTSML+K VPA
Sbjct: 1013 AADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI-FHGTSMLRKSVPA 1059
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 556 bits (1434), Expect = e-155, Method: Compositional matrix adjust.
Identities = 342/863 (39%), Positives = 474/863 (54%), Gaps = 137/863 (15%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 65 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 119
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 120 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 173
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 174 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 211
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 330
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHK 400
CKG+ I+ VG + LC SCV K+ + TG +S R ST T+V K
Sbjct: 331 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG---TSDR------STKTTTVSK 379
Query: 401 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 460
S S +K+ +T KD LHKL F E+ LP+
Sbjct: 380 CSSSG-----------------------------SKNYGRVTKKDVGLHKLAFGENDLPE 410
Query: 461 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------- 501
G+EV YY G++LL G+K G I+C CCNSEVSPSQFEAH+
Sbjct: 411 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 470
Query: 502 ---------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 534
DGG LL CDGCPR FHKEC SL +IP+G W+CK
Sbjct: 471 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 530
Query: 535 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 594
+C N ++ +F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS
Sbjct: 531 FCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFS 590
Query: 595 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
SGFGP T+++CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++
Sbjct: 591 TSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQ 650
Query: 655 AEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFD 712
E+LP+ L IKK G S + D++WRLL G +A+ E LLSQA+++FH+ F+
Sbjct: 651 EEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFHEQFN 710
Query: 713 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 772
PI D+ GRDL+ MV+ + EFGGMYCAILTV VVSA RV G+EVAELPLVA
Sbjct: 711 PIADA-EGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVA 769
Query: 773 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 832
T G+GYFQ L+ CIE+LL FL+V S+VLPAAE AES+W +KF F K++ E L+ +
Sbjct: 770 TRSDCQGQGYFQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HL 828
Query: 833 KRCSQLVTFKGTSMLQKRVPACR 855
R Q++TF+GTSMLQK VP R
Sbjct: 829 CRDFQMMTFQGTSMLQKPVPEYR 851
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 317/718 (44%), Positives = 419/718 (58%), Gaps = 122/718 (16%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
V+ A ++ L L ++ A+ +KS C+ CKG +G + LC SC++ K
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNSKV--LCLSCLQLK 352
Query: 368 KPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENA 427
+ Q + + TG SS S +S K IS+ E+
Sbjct: 353 ESQASPSQVTG----------------------SSDSHL------RSPKPSTISRSAESV 384
Query: 428 SPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ---KLLEGYKNGLGII 484
S S +KS +T KD LHKLVF E+GLP+GTEVGYY GQ +LL GYK G GI
Sbjct: 385 SKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQVVTQLLVGYKRGSGIX 444
Query: 485 CHCCNSEVSPSQFEAHA------------------------------------------- 501
C CCNSEVSPSQFEAHA
Sbjct: 445 CTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCS 504
Query: 502 ---DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 558
DGGNLL CDGCPR FHKEC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV
Sbjct: 505 ICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRV 564
Query: 559 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 618
+GVD +EQITKRCIRIV E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL
Sbjct: 565 AGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCL 624
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETV 677
++H M DL+E+PKGKWFCC DC RINS LQ L+V E+LP L IK KY N
Sbjct: 625 REHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACS 684
Query: 678 SDIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 736
D D++WRL+ G +A++ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G
Sbjct: 685 KDPDIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHG------ 737
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
S VVSA R+FG+EVAELPLVAT G+GYFQ LF+C+E LL
Sbjct: 738 -------------SQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGV 784
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPE--LLSIYRKRCSQLVTFKGTSMLQKRVP 852
L V+S+VLPAAE AESIWT+KFGF K+ E ++ ++ Q QKR+P
Sbjct: 785 LEVRSLVLPAAEGAESIWTNKFGFNKVTQEQYIMDLFGIAAEQ---------FQKRLP 833
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 262/460 (56%), Positives = 323/460 (70%), Gaps = 50/460 (10%)
Query: 441 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 500
+T KD LHKLVF E+GLP+GTEVGYY GQ+LL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 366 VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQLLVGYKRGSGIFCTCCNSEVSPSQFEAH 425
Query: 501 A----------------------------------------------DGGNLLPCDGCPR 514
A DGGNLL CDGCPR
Sbjct: 426 AGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLLCCDGCPR 485
Query: 515 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 574
FHKEC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV+GVD +EQITKRCIRI
Sbjct: 486 VFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRI 545
Query: 575 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 634
V E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL++H M DL+E+PKGKW
Sbjct: 546 VNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKW 605
Query: 635 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSG-KAA 692
FCC DC RINS LQ L+V E+LP L IK KY N D D++WRL+ G +A+
Sbjct: 606 FCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGRRAS 665
Query: 693 TPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSV 752
+ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G++ R +FGGMYCAILT++S V
Sbjct: 666 SIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHGKSTREWDFGGMYCAILTISSQV 724
Query: 753 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 812
VSA R+FG+EVAELPLVAT G+GYFQ LF+C+E LL L V+S+VLPAAE AES
Sbjct: 725 VSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAEGAES 784
Query: 813 IWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
IWT+KFGF K+ E + +R R Q+VTF+GT MLQK VP
Sbjct: 785 IWTNKFGFNKVTQEQRNNFR-RDYQMVTFQGTLMLQKLVP 823
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 91/167 (54%), Gaps = 10/167 (5%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKT 354
V+ A ++ L L ++ A+ +KS C+ CKG +G +
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS 341
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 286/626 (45%), Positives = 384/626 (61%), Gaps = 58/626 (9%)
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI E+G +L +V+ AC+ PL L+ L+ + + + KS C+
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILK-KSSLCLS 60
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIRISSSRPGLIANSTPVTSVH 399
C+G C K+ +C SC++SK+P+ + + SSRP + S S
Sbjct: 61 CQGPMIEPCDTKSLV--VCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSILRRSKS 118
Query: 400 KSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP 459
QS R+ + T+KS + ++ + S S + S +T KD RLHKLVF++ LP
Sbjct: 119 SPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILP 178
Query: 460 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------ 501
DGTEVGY+ G+K+L GYK G GI C CCN VSPS FEAHA
Sbjct: 179 DGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGV 238
Query: 502 ----------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 533
DGG L+ CD CPR++HK CASL S+P W C
Sbjct: 239 SLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSC 298
Query: 534 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCD 592
KYC NM ER++F+ + NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG
Sbjct: 299 KYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHS 358
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 652
F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL+V
Sbjct: 359 FCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIV 418
Query: 653 QEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIF 707
+ EKL LN ++K + E D D+RWR+LSGK ++ +T++LL++A++I
Sbjct: 419 RGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALSIL 478
Query: 708 HDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 767
H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+AE
Sbjct: 479 HERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAE 538
Query: 768 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 827
LPLVATSK G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+ E
Sbjct: 539 LPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEE 598
Query: 828 LSIYRKRCSQLVTFKGTSMLQKRVPA 853
+ YRK S ++ F GTSML+K VPA
Sbjct: 599 VKEYRKDYSVMI-FHGTSMLRKSVPA 623
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 297/688 (43%), Positives = 410/688 (59%), Gaps = 68/688 (9%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVI 277
L P + E+F G+L+G++V Y+ G K + +G L+G+I+ G+LC C C G +V+
Sbjct: 363 LRNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVV 422
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPE-EKS 336
+ +E+HA +R +YI E+G +L +V+ AC+ P L+ L+ + P +KS
Sbjct: 423 STAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVG--PNLKKS 480
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIRISSSRPGLIANSTP 394
C+ C+G C T +C SC++SK+P+ + + G SSRP + S
Sbjct: 481 SLCLNCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSIL 538
Query: 395 VTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFD 454
S QS RQ + T+KS + ++ + S S + S+ +T KD RLHKLVF+
Sbjct: 539 SRSKSSPRQSNRQEQPTRKSTEPGVVPGTILSESKSSSIKSNSQGKLTRKDVRLHKLVFE 598
Query: 455 ESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------- 501
+ LPDGTEVGY+ G EVSPS FEAHA
Sbjct: 599 DDILPDGTEVGYFVAG--------------------EVSPSSFEAHAGCASRRKPFQHIY 638
Query: 502 --DGGNLLPC----------------DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
+G +L D C CASLSS+P W CKYC NM ER+
Sbjct: 639 TTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGVCASLSSLPSERWSCKYCVNMVERE 698
Query: 544 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRT 602
+F+ + NA+ AGRV GVD++ +IT RCIR+V + EL S C+LCRG F + GF RT
Sbjct: 699 KFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPSVCVLCRGHSFCRLGFNSRT 758
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 662
+++CDQCE+EFHVGCLK+H +ADL+ELP+ KWFC +DC +IN+ L NL+++ EKL
Sbjct: 759 VIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEKINTTLGNLIIRGEEKLTNNI 818
Query: 663 LNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDS 717
LN I+ + E+ D D+RWR+LSGK ++ ET++LL++AV+I H+ FDPI ++
Sbjct: 819 LNFIRTKEKPNEESCPDDNTTPDIRWRVLSGKLTSSDETKILLAKAVSILHERFDPISET 878
Query: 718 ISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 777
+ DLIP+MVYGR +GQ+F GMYC +L V+ +VS GI RVFG E+AELPLVATSK
Sbjct: 879 GTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDC 938
Query: 778 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ 837
G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+ E + YRK S
Sbjct: 939 QGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSV 998
Query: 838 LVTFKGTSMLQKRVPACRIGSSSTDSTE 865
++ F GTSML+K VPA S + S E
Sbjct: 999 MI-FHGTSMLRKSVPAPSAPSKTEGSKE 1025
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 288/694 (41%), Positives = 382/694 (55%), Gaps = 127/694 (18%)
Query: 220 SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRV 276
+L P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V
Sbjct: 208 TLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNV 267
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
+ P+ FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +
Sbjct: 268 VTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKST 327
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVT 396
F C CKG+ I+ VG + LC SCV K+ + TG
Sbjct: 328 F-CFNCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG------------------ 366
Query: 397 SVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDES 456
Q+QR + + V I D LHKL F E+
Sbjct: 367 -------QQKQRLCPSAAHQEV---------------------RIMGGDVGLHKLAFGEN 398
Query: 457 GLPDGTEVGYYACGQ-------KLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------- 501
LP+G+EV YY G+ +LL G+K G I+C CCNSEVSPSQFEAH+
Sbjct: 399 DLPEGSEVSYYVRGEVGTMRSKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKP 458
Query: 502 --------------------------------------DGGNLLPCDGCPRAFHKECASL 523
DGG LL CDGCPR FHKEC SL
Sbjct: 459 YLHIYTSNGVSLHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSL 518
Query: 524 SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS 583
+IP+G W+CK+C N ++ +F++ +ANAV AGR+ GVD +EQI KRCIRIVK+ E
Sbjct: 519 ENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKSQTDEAG 578
Query: 584 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 643
GC LCR +FS SGFGP T+++CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC I
Sbjct: 579 GCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKGKWFCCRDCKDI 638
Query: 644 NSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSGKAATP-ETRLLLS 701
NS L+ ++V+ E+LP+ L IKK G S + D++WRLL G+ A+ E LLS
Sbjct: 639 NSSLRKIVVRREEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRXASATEAGSLLS 698
Query: 702 QAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVF 761
QA+++FH+ F+PI D+ GRDL+ MV+ + EFGGMYCAILTV VVSA RV
Sbjct: 699 QALSLFHEQFNPIADA-EGRDLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVL 757
Query: 762 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 821
G+EVAELPLVAT G+ V S+VLPAAE AES+W +KF F
Sbjct: 758 GKEVAELPLVATRSDCQGQ------------------VNSLVLPAAEGAESLWINKFKFH 799
Query: 822 KIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 800 KMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 832
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 281/695 (40%), Positives = 383/695 (55%), Gaps = 93/695 (13%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 184 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 241
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 242 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 301
Query: 330 SLPEEKSFACVRCKGTFP----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSR 385
S + F K +F +T + L N PQ +++ + +
Sbjct: 302 SSVNMECFKA--WKASFHQNNGVTEADENYHAQLLN------HPQSIVSFP----VQAVE 349
Query: 386 PGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKD 445
+ P+ + ++RK K + + ++ S I +D
Sbjct: 350 DSFTGSRLPLKQKELMKEMTQERKHAAKKPSSYIYGSGLQHK-------KSSEGAIKKRD 402
Query: 446 QRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---- 501
LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+SEVSPSQFEAHA
Sbjct: 403 NDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAA 462
Query: 502 ------------------------------------------DGGNLLPCDGCPRAFHKE 519
DGG+L+ CDGCPRAFH
Sbjct: 463 RRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRAFHPA 522
Query: 520 CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLE 579
C L +P+GDW C C F R + I + R VK E
Sbjct: 523 CLELQCLPEGDWRCPCCVENFCPDRKV-----------------ARPIRIQLTRAVKAPE 565
Query: 580 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 639
+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ + DL+ELPK KWFCC D
Sbjct: 566 SEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDD 625
Query: 640 CSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETR 697
CSR++ LQNL + E +P + I K ++ +D D++W +LSGK+ E
Sbjct: 626 CSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DIQWCILSGKSCYKEHL 684
Query: 698 LLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGI 757
LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMYC +L S+VVSAG+
Sbjct: 685 PLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMYCVVLLAKSTVVSAGL 743
Query: 758 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 817
+RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++VLPAAEEAE+IWT+K
Sbjct: 744 IRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNK 803
Query: 818 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 804 LGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 837
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 281/695 (40%), Positives = 383/695 (55%), Gaps = 93/695 (13%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 239 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 296
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 297 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 356
Query: 330 SLPEEKSFACVRCKGTFP----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSR 385
S + F K +F +T + L N PQ +++ + +
Sbjct: 357 SSVNMECFKA--WKASFHQNNGVTEADENYHAQLLN------HPQSIVSFP----VQAVE 404
Query: 386 PGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKD 445
+ P+ + ++RK K + + ++ S I +D
Sbjct: 405 DSFTGSRLPLKQKELMKEMTQERKHAAKKPSSYIYGSGLQHK-------KSSEGAIKKRD 457
Query: 446 QRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---- 501
LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+SEVSPSQFEAHA
Sbjct: 458 NDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHAGWAA 517
Query: 502 ------------------------------------------DGGNLLPCDGCPRAFHKE 519
DGG+L+ CDGCPRAFH
Sbjct: 518 RRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRAFHPA 577
Query: 520 CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLE 579
C L +P+GDW C C F R + I + R VK E
Sbjct: 578 CLELQCLPEGDWRCPCCVENFCPDRKV-----------------ARPIRIQLTRAVKAPE 620
Query: 580 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 639
+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ + DL+ELPK KWFCC D
Sbjct: 621 SEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWFCCDD 680
Query: 640 CSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETR 697
CSR++ LQNL + E +P + I K ++ +D D++W +LSGK+ E
Sbjct: 681 CSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DIQWCILSGKSCYKEHL 739
Query: 698 LLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGI 757
LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMYC +L S+VVSAG+
Sbjct: 740 PLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMYCVVLLAKSTVVSAGL 798
Query: 758 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 817
+RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++VLPAAEEAE+IWT+K
Sbjct: 799 IRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEAIWTNK 858
Query: 818 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 859 LGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 892
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 283/695 (40%), Positives = 381/695 (54%), Gaps = 126/695 (18%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 227 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 281
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 282 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIK 341
Query: 326 SALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK-KPQGTMTYTTGIRISSS 384
+ S E+SF + S ++S K Q +Y+T +
Sbjct: 342 NVAGSSVNEESFQAWK-------------------ESLLQSNGKVQAHKSYSTKLV---- 378
Query: 385 RPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK 444
P T++ SS + + K+S +
Sbjct: 379 -------GMPHTNIRPSSYTSNSGVLQKRSADGC----------------------TKRR 409
Query: 445 DQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--- 501
D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 410 DNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMA 469
Query: 502 -------------------------------------------DGGNLLPCDGCPRAFHK 518
DGG+L+ C+GCPRAFH
Sbjct: 470 ARRQPYRHIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHA 529
Query: 519 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
C L +P W C C + + N GR S + V I R R+ K
Sbjct: 530 ACLGLQCVPDSGWQCLNC---------IDNAGN----GRESSI--VRPIMIRLTRVDKTP 574
Query: 579 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 638
E E+ GC++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC
Sbjct: 575 EVEMGGCVVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCD 634
Query: 639 DCSRINSVLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPET 696
DC+RI + LQN + AE +P F I+K+ L T + D++WR+LSGK+ PE
Sbjct: 635 DCNRIYAALQNSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEH 694
Query: 697 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 756
LLS+A AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VN VVSAG
Sbjct: 695 LPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAG 753
Query: 757 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 816
+LR+FG+ VAELPLVATS+ + GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT
Sbjct: 754 LLRIFGRNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTK 813
Query: 817 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
K GF+K+ + LS + + QL F TSML+K V
Sbjct: 814 KLGFRKMSEDQLSKHLREV-QLTLFNKTSMLEKTV 847
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 286/692 (41%), Positives = 373/692 (53%), Gaps = 89/692 (12%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ N P V +L TG+LDG V Y+ + L GII GG LC C
Sbjct: 201 NMELKMSKKVLPNTFPSNVKKLLSTGILDGARVKYISPQR----ELYGIIDGGGYLCGCP 256
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN RV+ +FE+HA + R + +I ENGK + +++ ++ PL + ++ A
Sbjct: 257 SCNFSRVLTAYEFELHAGAKTRHPNNHIYLENGKPICSIIQELKAAPLGAVDEVIKDAAG 316
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLI 389
S E+ F + C G G C S + P +Y++ S P
Sbjct: 317 SSINEEFFQVWKASLH---QCNGIIGADEKCYSMLP-YSPHSLGSYSSQGLEESGCP--- 369
Query: 390 ANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK-DQRL 448
P +S S+ +RQ+ + + +P LS P K+ T + D L
Sbjct: 370 ----PCSSFVHSNPFRRQKYMDSSEEHKRAFRRP-----SSLSHPKKTNEGGTRRRDNDL 420
Query: 449 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------- 501
H+L+F +GLPDG E+ YY GQK+L GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 421 HRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCDREISPSQFEAHAGMAARRQ 480
Query: 502 ---------------------------------------DGGNLLPCDGCPRAFHKECAS 522
DGG+L+ C+ CPRAFH C
Sbjct: 481 PYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLIFCESCPRAFHLVCLG 540
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 582
L +P W+C C F + I R R+VK E E+
Sbjct: 541 LKYVPSDVWHCPNCNKFGHGGNFSR------------------SIVIRLTRVVKTPEYEV 582
Query: 583 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 642
GC+ CR DFS F RT++LCDQCEREFHVGCL+ + + DL+E+PK WFC DC+R
Sbjct: 583 GGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKEIPKDNWFCSNDCNR 642
Query: 643 INSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI-DVRWRLLSGKAATPETRLLL 700
I LQN + + +P LN I K+A L D +WR+L GK+ E LL
Sbjct: 643 IYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWRILMGKSRYQEDLSLL 702
Query: 701 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 760
S A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V + VVSAG+LR+
Sbjct: 703 SAAAAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVLLLVKNVVVSAGLLRI 761
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
FG++VAELPLVATS+ + GKGYFQ LF+CIE+LL L V +VLPAAEEAESIWT +FGF
Sbjct: 762 FGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEEAESIWTRRFGF 821
Query: 821 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+K+ E LS Y + QL FKGTSML+K VP
Sbjct: 822 RKMTEEQLSQYTREL-QLTIFKGTSMLEKEVP 852
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 284/695 (40%), Positives = 381/695 (54%), Gaps = 126/695 (18%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 230 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 284
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 285 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIK 344
Query: 326 SALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK-KPQGTMTYTTGIRISSS 384
+ S E+SF + S ++S K Q +Y+T +
Sbjct: 345 NVAGSSVNEESFQAWK-------------------ESLLQSNGKVQAHKSYSTKLV---- 381
Query: 385 RPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK 444
P T++ SS + + K+S +
Sbjct: 382 -------GMPHTNIRPSSYTSNTGVLQKRSADGC----------------------TKRR 412
Query: 445 DQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--- 501
D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 413 DNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMA 472
Query: 502 -------------------------------------------DGGNLLPCDGCPRAFHK 518
DGG+L+ C+GCPRAFH
Sbjct: 473 ARRQPYRHIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHA 532
Query: 519 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
C L +P W C C++ NA GR S + V I R R+ K
Sbjct: 533 ACLGLQCVPDSGWQCLNCRD------------NAGN-GRESSI--VRPIMIRLTRVDKTP 577
Query: 579 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 638
E E+ GC++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC
Sbjct: 578 EFEMGGCVVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCD 637
Query: 639 DCSRINSVLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPET 696
DC+RI LQN + AE +P I+K+ L T + D++WR+LSGK+ PE
Sbjct: 638 DCNRIYVALQNSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEH 697
Query: 697 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 756
LLS+A AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSAG
Sbjct: 698 LPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAG 756
Query: 757 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 816
+LR+FG+ VAELPLVATS+ + GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT
Sbjct: 757 LLRIFGRNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTK 816
Query: 817 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
K GF+K+ + LS + + QL F TSML+K V
Sbjct: 817 KLGFRKMSEDQLSKHLREV-QLTLFNKTSMLEKTV 850
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 285/718 (39%), Positives = 385/718 (53%), Gaps = 141/718 (19%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 202
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 203 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 262
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIA 390
P++++F C CK +F GK LC+SC++SK Q
Sbjct: 263 APKKRTFRCQTCKSSFATLRTGKFAL--LCDSCLESKGSQ-------------------- 300
Query: 391 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 448
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 301 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 358
Query: 449 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------- 501
HKL F LP+GT+VGYY G++LL+GY GI CHCCN+ VSPSQFEAHA
Sbjct: 359 HKLAFMSGVLPEGTDVGYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRK 418
Query: 502 ---------------------------------------DGGNLLPCDGCPRAFHKECAS 522
DGG LL CD CPRAFH+EC
Sbjct: 419 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 478
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 582
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 479 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 538
Query: 583 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 615
GC LC DFSK F RT+LLCDQ
Sbjct: 539 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 592
Query: 616 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 674
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 593 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 639
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 734
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 640 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 694
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL
Sbjct: 695 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLL 739
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 740 ASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLK-GGRTTVFQGTSTLHKLVP 796
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 298/750 (39%), Positives = 411/750 (54%), Gaps = 101/750 (13%)
Query: 154 SNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGN-------ESMSLIEVEAIA---EGSA 203
S G++KK + + KP LV Q N E L++V+ A E
Sbjct: 297 SQGVDKKAVN-DTVDKPLRRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDG 355
Query: 204 LTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDG 262
SP K + K L P + ++F+ G+L+G+ V Y+
Sbjct: 356 FQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEGLIVYYV----------------- 398
Query: 263 GILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKA 322
G +V+ P+ FE+HA +R +YI E+G +L +V+ AC+ PL L+
Sbjct: 399 ---------RGAKVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEE 449
Query: 323 TLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIR 380
L+ + + + KS C+ C+G C T +C SC++SK+P+ + +
Sbjct: 450 KLRVVVGPILK-KSSLCLSCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKANDAL 506
Query: 381 ISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN 440
SSRP + S S QS R+ + T+KS + ++ + S S + S
Sbjct: 507 NGSSRPSVDPKSILRRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGK 566
Query: 441 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 500
+T KD RLHKLVF++ LPDGTEVGY+ G EVSPS FEAH
Sbjct: 567 LTRKDLRLHKLVFEDDILPDGTEVGYFVAG--------------------EVSPSTFEAH 606
Query: 501 A---------------DGGNLLPC----------------DGCPRAFHKECASLSSIPQG 529
A +G +L D C CASL S+P
Sbjct: 607 AGCASRRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGVCASLPSLPSE 666
Query: 530 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLC 588
W CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RCIRIV + EL S C+LC
Sbjct: 667 RWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLC 726
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQ 648
RG F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L
Sbjct: 727 RGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLG 786
Query: 649 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQA 703
NL+V+ EKL LN ++K + E D D+RWR+LSGK ++ +T++LL++A
Sbjct: 787 NLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKA 846
Query: 704 VAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 763
++I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG
Sbjct: 847 LSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGS 906
Query: 764 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
E+AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+
Sbjct: 907 ELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKM 966
Query: 824 DPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
E + YRK S ++ F GTSML+K VPA
Sbjct: 967 TDEEVKEYRKDYSVMI-FHGTSMLRKSVPA 995
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 286/702 (40%), Positives = 383/702 (54%), Gaps = 104/702 (14%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILC 266
N+ELKMSKK+ N P V +L TG+LDG +V Y+ G ++ L GII DGG LC
Sbjct: 240 NMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVE-----LDGIIGDGGYLC 294
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR----ACRSVPLPMLKA 322
CS+C+ RV+ +FE HA + R + +I ENGK + ++ A S P ++K
Sbjct: 295 GCSMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKN 354
Query: 323 TLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI-RI 381
S+++ +G+F + L S K + T +TGI
Sbjct: 355 VAGSSIN-------------EGSFQVW------KESLLQSNKKVPTQKKYSTKSTGIPHT 395
Query: 382 SSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN- 440
+S+ A+S V + Q T K V + KP + P K +
Sbjct: 396 YNSQSIESASSFSSLRVRNHFEQQMYVNQTADEWKRV-VKKP-STYTYYSGIPQKRSADG 453
Query: 441 -ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEA 499
+D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEA
Sbjct: 454 CTKKRDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEA 513
Query: 500 HA----------------------------------------------DGGNLLPCDGCP 513
HA DGG+L+ C+GCP
Sbjct: 514 HAGMAARRQPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCP 573
Query: 514 RAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIR 573
RAFH C L S+P+ W+C C++ +R I R R
Sbjct: 574 RAFHAACLGLHSVPESGWHCLNCEDNTGDER------------------GARPIMIRLTR 615
Query: 574 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 633
+ K E E+ GC++CR DFS F RT+++CDQCE+E+HVGCL+ + +L ELPK K
Sbjct: 616 VDKEPEYEVGGCVVCRANDFSVDKFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDK 675
Query: 634 WFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDI-DVRWRLLSGKA 691
WFCC DC+RI LQN + A+ +P I+K+ L T D+ D++WR+LSGK+
Sbjct: 676 WFCCDDCNRIYVALQNSVSAGADTIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKS 735
Query: 692 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSS 751
E LLS+A AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS
Sbjct: 736 RYAEHLPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSI 794
Query: 752 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 811
VVSAG+LR+FG+ +AELPLVATS+ + GKGYFQ LF+CIE+LLS L V+ +VLPAA +AE
Sbjct: 795 VVSAGLLRIFGRNIAELPLVATSREHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAE 854
Query: 812 SIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
SIWT K GF K+ + L+ + K QL F TS+L+K V A
Sbjct: 855 SIWTKKLGFHKMSEDQLTKHLKEV-QLTLFNKTSVLEKMVQA 895
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 280/710 (39%), Positives = 386/710 (54%), Gaps = 82/710 (11%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY 375
P +L+ ++ S E+ F KG+F K N + Q ++Y
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQAW--KGSFQ---QDKNMTEDDSNH-IMDHSFQSLVSY 325
Query: 376 T-TGIRISSSRPGLIANSTPV---TSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL 431
+G + S+ +STP + + + + K K L S F
Sbjct: 326 PGSGWSLDESQ-----SSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHK- 379
Query: 432 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 491
+W +D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+++
Sbjct: 380 KVSGGGKWK---RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTK 436
Query: 492 VSPSQFEAHA-----------------------------------------------DGG 504
+SPSQFEAHA +GG
Sbjct: 437 ISPSQFEAHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGG 496
Query: 505 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 564
+LL C GCP+AFH C S+P+G WYC C +D + ++
Sbjct: 497 DLLLCAGCPQAFHTACLKFQSMPEGTWYCSSC-----------NDGPTSCKIATASDPNL 545
Query: 565 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 624
+ I R R+VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++++
Sbjct: 546 KPIVIRLTRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELC 605
Query: 625 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVR 683
DL+ +P+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY + + V
Sbjct: 606 DLKGIPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVE 665
Query: 684 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYC 743
WR+LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC
Sbjct: 666 WRMLSGKSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYC 724
Query: 744 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 803
+L VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++
Sbjct: 725 LVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLL 784
Query: 804 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
LPAAEEAESIWT+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 785 LPAAEEAESIWTNKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 833
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 277/685 (40%), Positives = 371/685 (54%), Gaps = 90/685 (13%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYM-GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P + EL TGLL+G+ V Y+ K Q + L+G+I I C C CNG + +
Sbjct: 369 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCS 428
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C
Sbjct: 429 YFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNC 488
Query: 340 VRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVH 399
+ C + G LC C++ K+ Q + + + +SS LI +S
Sbjct: 489 LNCNEHLSSSQTEIFG-SFLCQRCLEPKQHQDPPSPSYACKSNSS---LIPSS------- 537
Query: 400 KSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP 459
K L+ K PL+ S +T KD LHKLVF L
Sbjct: 538 ----------------KDFLLKKT------PLNTKGGSAGKVTTKDTGLHKLVF--KVLL 573
Query: 460 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------ 501
DGTEV YY GQ+ ++GY I C+ CN VSPS FEAHA
Sbjct: 574 DGTEVAYYVDGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGV 633
Query: 502 ----------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 533
GG++ PC CPR+FH C LS +P +WYC
Sbjct: 634 SLHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYC 692
Query: 534 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 593
C N+ ++++ L + NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF
Sbjct: 693 DNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDF 751
Query: 594 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 653
+ S F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++
Sbjct: 752 NNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISD 811
Query: 654 EAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 712
A L E ++ I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FD
Sbjct: 812 GALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFD 871
Query: 713 PIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 768
PI++ SGRDLIP MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAEL
Sbjct: 872 PIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAEL 931
Query: 769 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 828
PLVATSK G GYFQ LF+CIE++L L++K +LPAA+EAE IW +KFGF KI E
Sbjct: 932 PLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQS 991
Query: 829 SIYRKRCSQLVTFKGTSMLQKRVPA 853
Y + L F GTS L K +P+
Sbjct: 992 DAYLN-GAHLTIFHGTSNLYKAIPS 1015
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 282/705 (40%), Positives = 389/705 (55%), Gaps = 95/705 (13%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 160 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 216
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 217 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 276
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLI 389
+L EE A K +F +S T +S PGL
Sbjct: 277 ALNEEGLRAW---KASFQ-----------------QSNSMSDRNYITDHSTVSYLGPGLD 316
Query: 390 ANS--TPVT-SVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQ 446
+ TP + H S+ + + K+ + K + S S + +D
Sbjct: 317 ESQSLTPCSVENHYFSEKTYAKDTLDEPKR--IAKKLTSHVSGTGCHKKVSEGSNRKRDN 374
Query: 447 RLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----- 501
LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 375 DLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGMAAR 434
Query: 502 -----------------------------------------DGGNLLPCDGCPRAFHKEC 520
DGG+LL C GCP+AFH C
Sbjct: 435 RQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTAC 494
Query: 521 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRIVKN 577
S+P+G WYC C + + + + + D + I R R+VK
Sbjct: 495 LKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRVVKA 542
Query: 578 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 637
E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KWFCC
Sbjct: 543 PESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCC 602
Query: 638 MDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPET 696
+CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+ PE
Sbjct: 603 SNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEH 662
Query: 697 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 756
LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA
Sbjct: 663 LPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAA 721
Query: 757 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 816
+LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESIWT
Sbjct: 722 LLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTK 781
Query: 817 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 861
KFGF K+ + L Y+K QL FKGTSML+K+VP G S +
Sbjct: 782 KFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSES 825
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 274/696 (39%), Positives = 364/696 (52%), Gaps = 87/696 (12%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 202 ERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 258
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS C+ +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 259 GCSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 318
Query: 327 ALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRP 386
S E+ F + VG KK + +S +
Sbjct: 319 VAGSSINEEFFRVWKASLNQSNALVGA------------DKKSYSELPCLPHSHVSYASQ 366
Query: 387 GLIANSTPVTS---VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITP 443
L + P++S + + SQ+ T K F + +
Sbjct: 367 ALKESFCPISSSFLYNNNFVSQQTNMETSGVNKQTSKRPSFYVPGSATKQKKTAESGVRK 426
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 501
+D LH+L+F +GLPDGTE+ YY GQK+L GYK G GI+C CC E+SPSQFE+HA
Sbjct: 427 RDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGM 486
Query: 502 --------------------------------------------DGGNLLPCDGCPRAFH 517
DGG+L+ C CPRAFH
Sbjct: 487 SARRQPYRHIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFH 546
Query: 518 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 577
C L P+G W+C C L H N I R R+VK
Sbjct: 547 AACLDLHDTPEGAWHCPNCNK-------LGHGGNFARP-----------IVIRLTRVVKT 588
Query: 578 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 637
E ++ GC +CR DFS F RT++LCDQCE+EFHVGCL++ + DL+E+PK WFCC
Sbjct: 589 PEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCC 648
Query: 638 MDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPE 695
DC+ I L+N + + +P LN I K L + DV+W++L GK+ E
Sbjct: 649 QDCNNIYVALRNSVSTGVQTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILMGKSRNRE 708
Query: 696 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 755
LLS A AIF +CFDPIV + +GRDLIP MVYGRN+ GQEFGGMYC +LTV VVSA
Sbjct: 709 DLSLLSGAAAIFRECFDPIV-AKTGRDLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSA 767
Query: 756 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 815
G+LR+FG+EVAELPLVAT++ + GKGYFQ LF+CIE+LL L V+ +VLPAAEEAESIWT
Sbjct: 768 GLLRIFGREVAELPLVATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWT 827
Query: 816 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
+FGF+K+ L Y R QL FKGTSML+K V
Sbjct: 828 RRFGFRKMSEGQLLKY-TREFQLTIFKGTSMLEKEV 862
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 255/580 (43%), Positives = 343/580 (59%), Gaps = 53/580 (9%)
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
+V+ P FE+HA + S YI ENG +L +VLRAC +V L ML++ ++ A+ P++
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTP 394
++F C CK +F C GK C+SC++SK + + G+ G++ T
Sbjct: 406 RTFRCKGCKSSFSTLCSGKF--ALFCDSCLESKGAKNN-SRDKGMHKVVFMSGVLQEGTD 462
Query: 395 VTSV--HKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 452
V K S SQ + + +++ KP+ N LH+L
Sbjct: 463 VGYYVGGKVSPSQFEAHAGRAARR-----KPYHNI-------------YMSNGVSLHELS 504
Query: 453 FDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGC 512
GQK+ + L IC +DGG LL CD C
Sbjct: 505 IS------------LLKGQKMSNRQSDDLCSIC---------------SDGGQLLLCDTC 537
Query: 513 PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCI 572
PRAFH+EC SLSS P+G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R I
Sbjct: 538 PRAFHRECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSI 597
Query: 573 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 632
RI E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G
Sbjct: 598 RIATTPETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEG 657
Query: 633 KWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKA 691
W+C DC RIN LQ+LL E +P L+ IKK +D+DVRWR+L K+
Sbjct: 658 AWYCSTDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDKS 717
Query: 692 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSS 751
+ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYCA+LTVN++
Sbjct: 718 SD-DSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNT 776
Query: 752 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 811
VVSAG+ R+ G E+AELPLVATS+ + G GYFQ LF+CIE+LL+ L VK VLPAAEEAE
Sbjct: 777 VVSAGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEEAE 836
Query: 812 SIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
SIWT++FGF KI + L Y K + F+GTS L K V
Sbjct: 837 SIWTERFGFTKISQDELREYLK-GGRTTVFQGTSNLHKLV 875
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 276/684 (40%), Positives = 370/684 (54%), Gaps = 90/684 (13%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V Y+ +A L+G+I I C C CNG + +
Sbjct: 380 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKKAV-LKGVITGCNIRCFCLSCNGSKDVCSY 438
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C+
Sbjct: 439 FFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCL 498
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHK 400
C + G LC C++ K+ Q + + + +SS LI +S
Sbjct: 499 NCNEHLSSSQTEIFG-SFLCQRCLEPKQHQDPPSPSYACKSNSS---LIPSS-------- 546
Query: 401 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 460
K L+ K PL+ S +T KD LHKLVF L D
Sbjct: 547 ---------------KDFLLKKT------PLNTKGGSAGKVTTKDTGLHKLVF--KVLLD 583
Query: 461 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------- 501
GTEV YY GQ+ ++GY I C+ CN VSPS FEAHA
Sbjct: 584 GTEVAYYVDGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVS 643
Query: 502 ---------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 534
GG++ PC CPR+FH C LS +P +WYC
Sbjct: 644 LHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCD 702
Query: 535 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 594
C N+ ++++ L + NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+
Sbjct: 703 NCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFN 761
Query: 595 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
S F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++
Sbjct: 762 NSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDG 821
Query: 655 AEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 713
A L E ++ I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FDP
Sbjct: 822 ALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDP 881
Query: 714 IVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 769
I++ SGRDLIP MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELP
Sbjct: 882 IIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELP 941
Query: 770 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 829
LVATSK G GYFQ LF+CIE++L L++K +LPAA+EAE IW +KFGF KI E
Sbjct: 942 LVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSD 1001
Query: 830 IYRKRCSQLVTFKGTSMLQKRVPA 853
Y + L F GTS L K +P+
Sbjct: 1002 AYLN-GAHLTIFHGTSNLYKAIPS 1024
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 271/683 (39%), Positives = 370/683 (54%), Gaps = 93/683 (13%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TG+L+G+ V+Y+ +A L+G+I I C C CNG + I
Sbjct: 207 LTKHPGNIRELLNTGMLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGAKAISAY 265
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + C+
Sbjct: 266 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIRSSIDPVVKRSRINCL 325
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHK 400
C + + LC C++SK+PQ +T + +S+ +
Sbjct: 326 NCNELV----LPSSHENVLCQVCLESKQPQDPLTASY-------------TCNGSSSLSR 368
Query: 401 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 460
SS+ R I+ K S +T KD RLHKLVF+ L D
Sbjct: 369 SSKEALLRNISSGKK-------------------GGSAGKVTNKDNRLHKLVFNV--LLD 407
Query: 461 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------- 501
GTEV YY GQ+ ++GY I C+ CN VSPS FEAHA
Sbjct: 408 GTEVAYYVDGQRKVDGYIKDHRIYCNHCNRVVSPSAFEAHAGEGSRRKPYDNIFTSNGVS 467
Query: 502 ---------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 534
GG++ PC CPR+FH C LS +P +WYC
Sbjct: 468 LHELAMKISKDMELSERETDDLCRECGQGGDIFPCKICPRSFHPACVGLSKVP-AEWYCD 526
Query: 535 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 594
C+N+ ++++ L + NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+
Sbjct: 527 SCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFN 585
Query: 595 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
+ F RT++LCDQCE+E+HVGCL+ +L+ELP+ +WFCC CS S L ++
Sbjct: 586 NAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEEEWFCCSSCSETRSSLDKIISDG 645
Query: 655 AEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 713
A+ L + L IKK + L + D++W+LLSGK AT + +LLS AV IFH FDP
Sbjct: 646 AQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSGKRATEDGSILLSAAVPIFHQSFDP 705
Query: 714 IVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 769
I ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA +LRV G +VAELP
Sbjct: 706 IREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVMGGDVAELP 765
Query: 770 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 829
LVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW +KFGF +I PE L
Sbjct: 766 LVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMNKFGFSRISPEELE 825
Query: 830 IYRKRCSQLVTFKGTSMLQKRVP 852
Y + L F GTS + K VP
Sbjct: 826 AYLNG-AHLTIFHGTSYMYKAVP 847
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 229/481 (47%), Positives = 303/481 (62%), Gaps = 53/481 (11%)
Query: 423 PFENASPPLSFP----NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYK 478
P +A P +F + S +T KD LHKLVF LP+GT+VGYY G++LL+GY
Sbjct: 259 PTSSARVPKNFSPGAKSTSAGRLTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYI 318
Query: 479 NGLGIICHCCNSEVSPSQFEAHA------------------------------------- 501
GI CHCCN+ VSPSQFE HA
Sbjct: 319 KEPGIHCHCCNTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQ 378
Query: 502 ---------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 552
DGG LL CD CPRAFH+EC L+++P+G W C+YC+ +R+ L ++ NA
Sbjct: 379 SDDLCSICSDGGELLLCDTCPRAFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNA 438
Query: 553 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 612
+ AGR+ G+DS+EQI R IRI E GC LC+ DF K F RT+LLCDQC RE
Sbjct: 439 IAAGRIDGIDSMEQIFTRSIRIATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGRE 498
Query: 613 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAG 671
+HVGCLK+H MADL LP+G W+C DC RI+ +++LL AE +P + I KK
Sbjct: 499 YHVGCLKEHSMADLTALPEGAWYCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKRED 558
Query: 672 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 731
L D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ + +GRDLIP+MVYGR
Sbjct: 559 KGLNEDGDLDVRWRVLRDKSSE-DSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGR 617
Query: 732 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 791
++R Q++ GMYCA+LTV ++VVSAG+ R+ G+E AELPLVATS+ N G GYFQ LF CIE
Sbjct: 618 SVRDQDYTGMYCAVLTVGNTVVSAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIE 677
Query: 792 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
+LL+ L+VK VLPAA+EA SIWT +FGF KI + L + + ++ F+GTS L K +
Sbjct: 678 RLLASLKVKYFVLPAADEAVSIWTQRFGFSKISRDEL-LEHLKGARTTVFQGTSTLHKLI 736
Query: 852 P 852
P
Sbjct: 737 P 737
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 282/742 (38%), Positives = 390/742 (52%), Gaps = 109/742 (14%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEV-EAIAEGSALTSPK-KNLELKMSKKIS-LN 222
S +KPKVE + SL+ V E + + T P +E+KMSKK++ L+
Sbjct: 472 SLLKPKVE------------APPASSLVVVPEEPVDSTPETPPSVTKMEMKMSKKVAFLS 519
Query: 223 KKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
K P +L TGLL+G+ V+Y+ + L+G+I I C C C+G + I F
Sbjct: 520 KHPGNTRDLLSTGLLEGMPVMYIIP-NSKKPVLKGVIAGCNIRCFCVKCDGSKTITTYFF 578
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA + ++YI NG L +VLRAC S PL L T+QS + + C+ C
Sbjct: 579 ELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCLNC 638
Query: 343 KGTFPITCVGKTGPGPLCNSC-VKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKS 401
G P +T LC+ C +S +PQ + L + + + S
Sbjct: 639 NGELP----SQTEEQFLCHDCCPESNQPQDPTS------------PLACSKSSSSLTPSS 682
Query: 402 SQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDG 461
+S +R K T +T KD LHKLVF L DG
Sbjct: 683 KESLLKRMSASKGAST---------------------GKVTTKDTGLHKLVF--KVLLDG 719
Query: 462 TEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAD------------------- 502
TEV YY GQK ++GY I C+ CN VSPS FEAHA
Sbjct: 720 TEVNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSL 779
Query: 503 ---------------------------GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 535
GG++ PC CPR+FH C L +P +W+C
Sbjct: 780 HELSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDN 839
Query: 536 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 595
C + ++++ L + NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+
Sbjct: 840 CTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKDFNN 898
Query: 596 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 655
+ F RT++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ + A
Sbjct: 899 AVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISEGA 958
Query: 656 EKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 714
L E ++ I KK+ L ++ ++RW+L++G++AT + LLS AV + H FDPI
Sbjct: 959 HPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPI 1018
Query: 715 VDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 770
+++ +GRDLIP MV+GR + GQ++ GMYCA+LTV S+VVSA +LRV G +VAELPL
Sbjct: 1019 IEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPL 1078
Query: 771 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 830
VATS G GYFQ+LF+CIE+LL L+VK +LPAA EAE+IW KFGF KI + +
Sbjct: 1079 VATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEA 1138
Query: 831 YRKRCSQLVTFKGTSMLQKRVP 852
Y L F GT L K +P
Sbjct: 1139 YLNG-GHLTVFHGTLNLYKAIP 1159
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 268/712 (37%), Positives = 377/712 (52%), Gaps = 88/712 (12%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EGS L +KM KI P V +L TG+LDG V Y+ A L
Sbjct: 147 EHTWEGSPSNVASSTLGVKMLDKIDSTNFPSNVKKLLATGILDGARVKYLS--ISPAREL 204
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C++C+ +V+ +FE HA + + + +I ENG+ + +++ R
Sbjct: 205 QGIIHSGGYLCGCTVCDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVA 264
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY 375
P +L+ ++ S E+ F + + M Y
Sbjct: 265 PPDVLEEVIRKVAGSALSEEGFQAWK--------------------ESFQQDDSNHIMDY 304
Query: 376 TTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSF-- 433
+ +S G + + ++ + + ++KI+ K + K + S
Sbjct: 305 SFQSLVSYPGSGWSIDESQSSTPYFPENNYFRKKISTKDTRHEHKPKAKKVTSHMFGMGC 364
Query: 434 ----PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCN 489
+W +D LH+L+F +GLPDGTE+ Y+ QKLL+GYK G GI+C CC+
Sbjct: 365 HKKAAGGGKWK---RDNDLHRLLFLPNGLPDGTELAYFVKSQKLLQGYKQGSGIVCSCCD 421
Query: 490 SEVSPSQFEAHA-----------------------------------------------D 502
+E+SPSQFEAHA D
Sbjct: 422 TEISPSQFEAHAGMAGRRQPYRHIHISSGLSLHDIAMSLADGGHVITTGDSDDMCSICGD 481
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+LL C GCP+AFH C S+P+G WYC C + + A A + S V
Sbjct: 482 GGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCNDGPTSCK----TATATDPNLKSIVG 537
Query: 563 SVEQITKRC-IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
S+ + IR++ + A + R DFS F RT++LCDQCE+E+HVGCL+++
Sbjct: 538 SIAIFSLSAHIRVLHS--AYCFSPISDRSLDFSIGKFDDRTVILCDQCEKEYHVGCLREN 595
Query: 622 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDI 680
+ DL+ +P+ KWFCC DCSRI++ LQ+ + +P L+ I +KY + +
Sbjct: 596 DLCDLKGIPQDKWFCCSDCSRIHTALQSSASCGPQTIPTVLLDTISRKYREKGICIDNGD 655
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 740
+V WR+LSGK+ E LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGG
Sbjct: 656 NVEWRMLSGKSRYAEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGG 714
Query: 741 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 800
MYC +L VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V+
Sbjct: 715 MYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVE 774
Query: 801 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+++LPAAEEAESIWT KFGF K+ L Y++ QL FKGTSML+K+VP
Sbjct: 775 NLLLPAAEEAESIWTKKFGFTKMTEHQLQKYQREV-QLTIFKGTSMLEKKVP 825
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 277/718 (38%), Positives = 378/718 (52%), Gaps = 100/718 (13%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY 375
P +L+ ++ S E+ F KG+F K N + Q ++Y
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQA--WKGSFQ---QDKNMTEDDSNH-IMDHSFQSLVSY 325
Query: 376 T-TGIRISSSRPGLIANSTPV---TSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL 431
+G + S+ +STP + + + + K K L S F
Sbjct: 326 PGSGWSLDESQ-----SSTPCFPEDNYFREKICTKDTRHAHKPKAKKLTSHMFGMGCHK- 379
Query: 432 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 491
+W +D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+++
Sbjct: 380 KVSGGGKWK---RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTK 436
Query: 492 VSPSQFEAHA-----------------------------------------------DGG 504
+SPSQFEAHA +GG
Sbjct: 437 ISPSQFEAHAGMAGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGG 496
Query: 505 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 564
+LL C GCP+AFH C S+P+G WYC C + G S
Sbjct: 497 DLLLCAGCPQAFHTACLKFQSMPEGTWYCSSCND---------------------GPTSC 535
Query: 565 EQITKRCIRIVKNLEAEL----SGCLLC----RGCDFSKSGFGPRTILLCDQCEREFHVG 616
+ T + NL A + S L R DFS F RT++LCDQCE+E+HVG
Sbjct: 536 KIATASWLYTYFNLNANILVLHSAYSLSPISDRSHDFSIGKFDDRTVILCDQCEKEYHVG 595
Query: 617 CLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLE 675
CL+++++ DL+ +P+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY +
Sbjct: 596 CLRENELCDLKGIPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIY 655
Query: 676 TVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG 735
+ V WR+LSGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ G
Sbjct: 656 IDNGNTVEWRMLSGKSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISG 714
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
QEFGGMYC +L VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS
Sbjct: 715 QEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLS 774
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
L V++++LPAAEEAESIWT+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 775 SLNVENLLLPAAEEAESIWTNKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 831
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 444 bits (1142), Expect = e-121, Method: Compositional matrix adjust.
Identities = 274/718 (38%), Positives = 371/718 (51%), Gaps = 161/718 (22%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 201
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 202 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 261
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIA 390
P++++F C CK +F GK LC+SC++SK Q
Sbjct: 262 APKKRTFRCQTCKSSFATLRTGKFAL--LCDSCLESKGSQ-------------------- 299
Query: 391 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 448
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 300 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 357
Query: 449 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------- 501
HKL F LP+GT+VGYY G+ VSPSQFEAHA
Sbjct: 358 HKLAFMSGVLPEGTDVGYYVGGK--------------------VSPSQFEAHAGRAARRK 397
Query: 502 ---------------------------------------DGGNLLPCDGCPRAFHKECAS 522
DGG LL CD CPRAFH+EC
Sbjct: 398 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 457
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 582
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 458 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 517
Query: 583 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 615
GC LC DFSK F RT+LLCDQ
Sbjct: 518 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 571
Query: 616 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 674
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 572 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 618
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 734
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 619 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 673
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL
Sbjct: 674 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLL 718
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 719 ASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLK-GGRTTVFQGTSTLHKLVP 775
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 223/468 (47%), Positives = 289/468 (61%), Gaps = 64/468 (13%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 501
+D LH+L+F +GLPDGTE+ YY QKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 402 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYKQGSGIVCSCCSREISPSQFEAHAGM 461
Query: 502 --------------------------------------------DGGNLLPCDGCPRAFH 517
DGG+LL C GCP+AFH
Sbjct: 462 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 521
Query: 518 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 574
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 522 TACLKFQSVPEGTWYCSSCND------------GPISSKKATATDPSGNARPIVIRLSRV 569
Query: 575 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 634
VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ + DL+E+P+ KW
Sbjct: 570 VKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGLCDLKEIPQEKW 629
Query: 635 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 693
FCC DCSRI++ +QN + + +P L+ I +K + T + V WR+LSGK+
Sbjct: 630 FCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNGDIVEWRILSGKSRY 689
Query: 694 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 753
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 690 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 748
Query: 754 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 813
SA +LR+FGQ+VAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 749 SAALLRIFGQQVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 808
Query: 814 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 861
WT KFGF K+ + L Y+K QL FKGTSML+K+VP S ST
Sbjct: 809 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKTTSLSEST 855
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 72/123 (58%), Gaps = 3/123 (2%)
Query: 213 LKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN 272
+KM KKI P V +L ETG+L+G V Y+ + L+GII GG LC C+ C+
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPVRE--LQGIIHSGGYLCGCTTCS 216
Query: 273 GCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALSSL 331
+V+ +FE+HA + R + +I ENG+++ +++ ++ P +L+ +++ A S+L
Sbjct: 217 FSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAGSAL 276
Query: 332 PEE 334
EE
Sbjct: 277 NEE 279
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/470 (47%), Positives = 289/470 (61%), Gaps = 64/470 (13%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 501
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 77 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 136
Query: 502 --------------------------------------------DGGNLLPCDGCPRAFH 517
DGG+LL C GCP+AFH
Sbjct: 137 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 196
Query: 518 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 574
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 197 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 244
Query: 575 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 634
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 245 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 304
Query: 635 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 693
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 305 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 364
Query: 694 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 753
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 365 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 423
Query: 754 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 813
SA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 424 SAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 483
Query: 814 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 863
WT KFGF K+ + L Y+K QL FKGTSML+K+VP G S + +
Sbjct: 484 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSESTT 532
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 224/485 (46%), Positives = 298/485 (61%), Gaps = 64/485 (13%)
Query: 420 ISKPFENASPPLS-----FPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLL 474
+S P E +P S + +D LH+L+F +GLPDG E+ Y+ GQ++L
Sbjct: 364 LSHPVERPNPNFSNAVLQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRIL 423
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHA--------------------------------- 501
G+K G GI+C CN E+SPSQFEAHA
Sbjct: 424 GGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLASGQKL 483
Query: 502 -------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH 548
+GG+L+ CD CPRA+H C L ++P+G W C C++ +
Sbjct: 484 TTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDK------VGS 537
Query: 549 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 608
++ A+ G +S + I R R+VK E E+ GC++CR DFS + F RT+LLCDQ
Sbjct: 538 NSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQ 594
Query: 609 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN-AIK 667
CEREFHVGCL+ + DL+ELPK KWFCC +CS I+ LQN ++ A+ +P+ + I+
Sbjct: 595 CEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIR 654
Query: 668 KYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 726
K+ G L + DVRW++LSGK+ PE LS+A AIF +CFDPIV + SGRDLIP
Sbjct: 655 KHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECFDPIV-AKSGRDLIPV 713
Query: 727 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+EVAELP+VATS+ + GKGYFQ+L
Sbjct: 714 MVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREHQGKGYFQVL 773
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 846
F+CIE+LLS L V+++VLPAAE+AESIWT K GF+K+ E L Y + QL F GTSM
Sbjct: 774 FSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREV-QLTIFNGTSM 832
Query: 847 LQKRV 851
L+K V
Sbjct: 833 LEKVV 837
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E AE S + +ELKMSKK+ N P V +L TG+LDG V Y+ L
Sbjct: 199 EGSAESSRYSLGPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK--L 256
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG +C CS CN ++ +FE HA + R + +I ENG+ + V++ +S
Sbjct: 257 QGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSA 316
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
PL +L + S SF
Sbjct: 317 PLSILDEVIMEVAGSSVNMNSF 338
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 229/507 (45%), Positives = 306/507 (60%), Gaps = 66/507 (13%)
Query: 400 KSSQSQRQRKITKKSKKTVL--ISKPFENASPPLS-----FPNKSRWNITPKDQRLHKLV 452
K+S Q I ++ L +S P E +P S + +D LH+L+
Sbjct: 281 KASFHQDSANIVVENHDVKLPKLSHPVERPNPNFSNAVLQHKKTAEKGTKRRDNDLHRLL 340
Query: 453 FDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------- 501
F +GLPDG E+ Y+ GQ++L G+K G GI+C CN E+SPSQFEAHA
Sbjct: 341 FMPNGLPDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRH 400
Query: 502 -----------------------------------DGGNLLPCDGCPRAFHKECASLSSI 526
+GG+L+ CD CPRA+H C L ++
Sbjct: 401 IYTTNGLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNV 460
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C C++ + ++ A+ G +S + I R R+VK E E+ GC+
Sbjct: 461 PEGVWSCPNCRDK------VGSNSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCV 511
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 646
+CR DFS + F RT+LLCDQCEREFHVGCL+ + DL+ELPK KWFCC +CS I+
Sbjct: 512 VCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVA 571
Query: 647 LQNLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAV 704
LQN ++ A+ +P+ + I+K+ G L + DVRW++LSGK+ PE LS+A
Sbjct: 572 LQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRAT 631
Query: 705 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 764
AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+E
Sbjct: 632 AIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGRE 690
Query: 765 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 824
VAELP+VATS+ + GKGYFQ+LF+CIE+LLS L V+++VLPAAE+AESIWT K GF+K+
Sbjct: 691 VAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMS 750
Query: 825 PELLSIYRKRCSQLVTFKGTSMLQKRV 851
E L Y + QL F GTSML+K V
Sbjct: 751 EEQLIKYMREV-QLTIFNGTSMLEKVV 776
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 258/716 (36%), Positives = 369/716 (51%), Gaps = 135/716 (18%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 127 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 181
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 182 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 235
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 236 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 273
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 274 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 333
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 334 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 392
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHK 400
CKG+ I+ VG + LC SCV K+ + TG +S R ST T+V K
Sbjct: 393 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG---TSDR------STKTTTVSK 441
Query: 401 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 460
S S +K+ +T KD LHKL F E+ LP+
Sbjct: 442 CSSSG-----------------------------SKNYGRVTKKDVGLHKLAFGENDLPE 472
Query: 461 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------- 501
G+EV YY G++LL G+K G I+C CCNSEVSPSQFEAH+
Sbjct: 473 GSEVSYYVRGERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVS 532
Query: 502 ---------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 534
DGG LL CDGCPR FHKEC SL +IP+G W+CK
Sbjct: 533 LHELSLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCK 592
Query: 535 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 594
+C N ++ +F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS
Sbjct: 593 FCLNTLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFS 652
Query: 595 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
SGFGP T+++CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++
Sbjct: 653 TSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQ 712
Query: 655 AEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFH 708
E+LP+ L IKK G S + D++WRLL G +A+ E LLSQA+++FH
Sbjct: 713 EEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFH 768
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 211/472 (44%), Positives = 280/472 (59%), Gaps = 56/472 (11%)
Query: 437 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 496
S +T KD LHKLVF L DGTEV YY GQ+ ++GY I C+ C+ VSPS
Sbjct: 723 SSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCSRVVSPSA 780
Query: 497 FEAHAD----------------------------------------------GGNLLPCD 510
FEAHA GG++ PC
Sbjct: 781 FEAHAGEGSRRKPYDNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGLGGDIFPCK 840
Query: 511 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 570
CPR+FH C LS P +W+C C N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 841 MCPRSFHPACVRLSEFPS-EWFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKR 899
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP
Sbjct: 900 AIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELP 958
Query: 631 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 689
G+WFCC CS I S L ++ A+ L L I KK+ L +DID+RW+LL+G
Sbjct: 959 DGEWFCCSSCSEIRSCLDKMISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAG 1018
Query: 690 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 745
++AT + LLLS AV I H FDPI+++ +GRDLIP MV GR + GQ++ GMYCA+
Sbjct: 1019 RSATEDGSLLLSSAVPIIHQSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAV 1078
Query: 746 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 805
+T+ S+VVSA +LR+ G +VAELPLVATS G GYFQ+LF+C+E++L L++K +LP
Sbjct: 1079 ITLGSTVVSAALLRIMGGDVAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLP 1138
Query: 806 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIG 857
AA+EAE+IW KFGF +I E L Y + L F GTS L K VP+ G
Sbjct: 1139 AAQEAEAIWMKKFGFSRIPQEQLEAYLNG-AHLTVFHGTSNLYKAVPSPSPG 1189
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 77/150 (51%), Gaps = 5/150 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
K P V EL +TGLL+G+ V+Y+ +A ++G+I I C C CNG R +
Sbjct: 544 FTKHPGNVKELLQTGLLEGMPVMYIIPNSKKAV-VKGVITGCNIRCFCIKCNGSRALSTY 602
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA + +++I NG SL +VLRAC L L+ T +S++ + C+
Sbjct: 603 FFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCL 662
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C G P + LC+ C+ SK+PQ
Sbjct: 663 NCGGHLPSSETEHF----LCHCCLDSKQPQ 688
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 208/468 (44%), Positives = 273/468 (58%), Gaps = 84/468 (17%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 501
+D LH+L+F +GLPDGTE+ YY +++SPSQFEAHA
Sbjct: 388 RDNDLHRLLFMPNGLPDGTELAYYV--------------------KTQISPSQFEAHAGM 427
Query: 502 --------------------------------------------DGGNLLPCDGCPRAFH 517
DGG+LL C GCP+AFH
Sbjct: 428 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 487
Query: 518 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 574
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 488 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 535
Query: 575 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 634
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 536 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 595
Query: 635 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 693
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 596 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 655
Query: 694 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 753
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 656 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 714
Query: 754 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 813
SA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 715 SAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 774
Query: 814 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 861
WT KFGF K+ + L Y+K QL FKGTSML+K+VP G S +
Sbjct: 775 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSES 821
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 151 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 207
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 208 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 267
Query: 330 SLPEE 334
+L EE
Sbjct: 268 ALNEE 272
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 368 bits (945), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 182/336 (54%), Positives = 233/336 (69%), Gaps = 4/336 (1%)
Query: 519 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 579 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 638
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 64 ETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCST 123
Query: 639 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKY--AGNSLETVSDIDVRWRLLSGKAATPET 696
DC RIN LQ+LL E + L+ IKK + +D+DVRWR+L K+ + ++
Sbjct: 124 DCVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDKS-SDDS 182
Query: 697 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 756
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYC +LTVN+ VVSAG
Sbjct: 183 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSAG 242
Query: 757 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 816
+ R+ G E+AELPLVATS+ G GYFQ LF+CIE+LLS L VK VLPAAEEAESIWT+
Sbjct: 243 LFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWTE 302
Query: 817 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+FGF KI + L Y K + F+GTS L K V
Sbjct: 303 RFGFAKISQDELREYLKG-GRTTVFQGTSNLHKLVA 337
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 223/601 (37%), Positives = 309/601 (51%), Gaps = 117/601 (19%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
LRG+I+ GILCSCS C G V+ P FE+HA + S YI ENG +L ++LRAC
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 315 VPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
L ML++ +Q+A+ P++++F C CK +F GK LC+SC++SK Q + T
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-T 191
Query: 375 YTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFP 434
T+ I P +S +S K+ +S K +L
Sbjct: 192 RTSKI-----------GRNPTSSARRSKNESPGSKLALRSLKPMLA-----------VLL 229
Query: 435 NKSRWNITPKDQRLHKLVFDE--SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEV 492
+ +R N++ + D+ S DG E+ ++C C
Sbjct: 230 DANRRNMSNRQS-------DDLCSICSDGGEL------------------LLCDSC---- 260
Query: 493 SPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 552
P F G +P +G W C+YC+N +R+ L ++ NA
Sbjct: 261 -PRAFHRECVGFTTIP-------------------RGTWCCRYCENRQQRESSLAYNHNA 300
Query: 553 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 612
+ AGR+ G+D +EQI R IRI GC LCR DFSK F RT+LLCDQ
Sbjct: 301 IAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQA--- 357
Query: 613 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAG 671
LP+G W+C DC RI+ L++LL + AE + + IK KY
Sbjct: 358 ----------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQ 401
Query: 672 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 731
+L D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 402 KALNKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG- 459
Query: 732 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 791
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE
Sbjct: 460 ------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIE 501
Query: 792 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K V
Sbjct: 502 RLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTTVFQGTSTLHKLV 560
Query: 852 P 852
P
Sbjct: 561 P 561
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 352 bits (904), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 190/412 (46%), Positives = 245/412 (59%), Gaps = 61/412 (14%)
Query: 471 QKLLEGYKNGLGIICHC--CN-SEVSPSQFEAHA-------------------------- 501
+ +L+G G I C C CN S+VSPS FEAHA
Sbjct: 326 KAVLKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMK 385
Query: 502 --------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
GG++ PC CPR+FH C LS +P +WYC C N+ +
Sbjct: 386 ISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQ 444
Query: 542 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 601
+++ L + NA AGR +GVDS+EQI KR IRIV NL EL DF+ S F R
Sbjct: 445 KEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQK------DFNNSVFDER 498
Query: 602 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
T++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ A L E
Sbjct: 499 TVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAES 558
Query: 662 HLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 720
++ I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SG
Sbjct: 559 DIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSG 618
Query: 721 RDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 776
RDLIP MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK
Sbjct: 619 RDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKD 678
Query: 777 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 828
G GYFQ LF+CIE++L L++K +LPAA+EAE IW +KFGF KI E L
Sbjct: 679 LQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQL 730
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 49/119 (41%), Gaps = 22/119 (18%)
Query: 248 IKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF-ENGKSLL 306
+K Q + L+G+I I C C CNG +V PS FE HA + RR F NG SL
Sbjct: 322 LKLQKAVLKGVIAGCNIRCFCLSCNGSKV-SPSAFEAHAGEGTRRKPYDNIFTSNGVSLH 380
Query: 307 EVLRACRSVPLPMLKATLQSALSSLPEE----------KSFACVRCKGTFPITCVGKTG 355
E+ +K + LS + F C C +F CVG +G
Sbjct: 381 EL----------SMKISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSG 429
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 164/271 (60%), Positives = 205/271 (75%), Gaps = 2/271 (0%)
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 651
DFSKSGF RT+++CDQCERE+HVGCLK+HKMADL+ELP+G+WFC DC RI+S LQ LL
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 652 VQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 710
++ A+ LP ++ I+K ++ D+RW+LLSGK A E+RLLLS+AVAIFH+
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAESRLLLSKAVAIFHES 122
Query: 711 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 770
FDPIVD+ +GRDLIP+MVYGR +R Q++GG+YCA+LTV SSVVSAGILRV G E+AELPL
Sbjct: 123 FDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAELPL 182
Query: 771 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 830
VATS+ + G+GYFQ LF+CIE+LL L+VK VLPAA+EAESIWT KFGF KI + L
Sbjct: 183 VATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDELHK 242
Query: 831 YRKRCSQLVTFKGTSMLQKRVPACRIGSSST 861
Y V F+GTS L K V + S T
Sbjct: 243 YLNGARTTV-FQGTSTLHKPVTVPHVSSRET 272
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 338 bits (868), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 235/355 (66%), Gaps = 7/355 (1%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG++ PC CPR+FH C L +P +W+C C + ++++ L + NA AGR +GVD
Sbjct: 32 GGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVD 91
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
S+EQI KR IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+
Sbjct: 92 SIEQILKRAIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEW 150
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDID 681
DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+ L ++ +
Sbjct: 151 QVDLKELPEGEWFCCDSCSEIRSSLDKMISGGAHPLSESDVDIIRKKHESKGLVMDANTE 210
Query: 682 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQE 737
+RW+L++G++AT + LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ+
Sbjct: 211 IRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQD 270
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ GMYCA+LTV S+VVSA +LRV G +VAELPLVATS G GYFQ+LF+CIE+LL L
Sbjct: 271 YSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSL 330
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+VK +LPAA EAE+IW KFGF KI + + Y L F GT L K +P
Sbjct: 331 KVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYLNG-GHLTVFHGTLNLYKAIP 384
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 329 bits (843), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 194/486 (39%), Positives = 268/486 (55%), Gaps = 66/486 (13%)
Query: 441 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN--GLGIICHCC---------- 488
I +DQ LHKLVF E+ L DG VGY+ +K L+G N GI+C CC
Sbjct: 83 INYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQSGILCDCCKEVILEFFFC 142
Query: 489 -----NSEVSPSQFEAHA---------------DG------------------------- 503
+ +VSPS+FEAHA DG
Sbjct: 143 QIKKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAINHRISISNSDEHCSKCK 202
Query: 504 --GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNLL CDGC RAFH C + S P+ WYC+YC+N ++ + ++H N V ++
Sbjct: 203 QRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIES 262
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
D EQI K C VK+ E E S C LC F+ F P T+++CDQCE+++HVGCLK H
Sbjct: 263 DPSEQIAKICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLKDH 322
Query: 622 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI 680
MA+L+++PK WFC +DC I+ L+N + + L + L+ IK K LET +
Sbjct: 323 NMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGL 382
Query: 681 DVRWRLLSGKAATPE--TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-E 737
D++W++ + + + T LLS V IFH+ FD IV + + DLIP+MV GR ++ +
Sbjct: 383 DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKDKYY 442
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
FGGMYCA+L VN VVSAGI RVFG+EVAEL L+AT +G+F+ L +CIE +L L
Sbjct: 443 FGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIENVLKEL 502
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKG---TSMLQKRVPAC 854
+V+ +VLPAA EAES+W DKFGF + + L Y +R KG + L ++ C
Sbjct: 503 KVERLVLPAAHEAESMWIDKFGFTEPNQGLGRRYYRRSWSFHLNKGVEQNTHLTGKLEFC 562
Query: 855 RIGSSS 860
+ +SS
Sbjct: 563 QKCASS 568
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 182/442 (41%), Positives = 237/442 (53%), Gaps = 91/442 (20%)
Query: 432 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 491
++ +KS IT KD+ LHKLVF E+ L DG VGY+ G+
Sbjct: 174 TYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGK-------------------- 213
Query: 492 VSPSQFEAHA---------------DG---------------------------GNLLPC 509
VSPS+FEAHA DG GNLL C
Sbjct: 214 VSPSKFEAHAGRASRRKPYSYIRTADGVSLHELANNRRISMSDSDERCSHCEQVGNLLWC 273
Query: 510 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 569
D C R+FH EC L S P+ YC+YC+N F + + ++H N V GR++ D EQIT+
Sbjct: 274 DRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITE 333
Query: 570 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 629
C K E + C LC DF+ + GPRT+++C QCE+EFHV CLK H MA+L EL
Sbjct: 334 VCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVEL 393
Query: 630 PKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSG 689
PK KWFC +DC I+ LQ L+ + +L +D++WRLL+
Sbjct: 394 PKDKWFCGIDCDDIHMKLQKLMARGEAEL--------------------GLDIKWRLLNT 433
Query: 690 KAATPETRL--LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAIL 746
K P+ + L+S+A AIFH+ F I D + DLI +M+YG + GQ F GMYCA+L
Sbjct: 434 KLNNPKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVL 493
Query: 747 TVNSSVVSAGILRVFGQEVAELPLVATSK------INHGKGYFQLLFACIEKLLSFLRVK 800
+ AGI RV GQEVAELPLVAT+ I GYF+ LF+CIE +L L+VK
Sbjct: 494 YFKKVIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGYFRSLFSCIENMLRHLKVK 553
Query: 801 SIVLPAAEEAESIWTDKFGFKK 822
++VLPAA EAES+W DKFGF K
Sbjct: 554 TLVLPAAHEAESMWIDKFGFTK 575
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 154/358 (43%), Positives = 196/358 (54%), Gaps = 67/358 (18%)
Query: 471 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------------- 501
QK+L GYK G GI+C CC E+SPSQFE+HA
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 502 -----------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 544
DGG+L+ C CPRAFH C L P+G W+C C +
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNKLGHGGN 136
Query: 545 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
F I R R+VK E ++ GC +CR DFS F RT++
Sbjct: 137 F------------------ARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVI 178
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 664
LCDQCE+EFHVGCL++ + DL+E+PK WFCC DC+ I L+N + + +P LN
Sbjct: 179 LCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLN 238
Query: 665 AI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 722
I K L + DV+W++L GK+ E LLS A AIF +CFDPIV + +GRD
Sbjct: 239 IINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDPIV-AKTGRD 297
Query: 723 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 780
LIP MVYGRN+ GQEFGGMYC +LTV VVSAG+LR+FG+EVAELPLVAT++ + GK
Sbjct: 298 LIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGK 355
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 285 bits (729), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 208/684 (30%), Positives = 314/684 (45%), Gaps = 121/684 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C G ++
Sbjct: 286 LLKAPRNAKELMATRLMEGHHVRCSCRGIQ-----LTGMLKDMGVQCDCRNCRGSVIVSI 340
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRA------CRSVPLPMLKATLQSALSSLPE 333
S FE H+ S I ENGK+L ++L A C L LK + + + +
Sbjct: 341 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAI-GDVQGVEK 399
Query: 334 EKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ-GTMTYTTGIRISSSRPGLIANS 392
KS C K P+ G + Y G R S ++A+S
Sbjct: 400 SKS-------------------------KCAKCGNPEEGDLIYCKGARCS-----VVAHS 429
Query: 393 --TPVTSVHKSSQSQRQRKITKKSKKTVLISKPF----ENASPPLSFPNKSRWNITPKDQ 446
+ + H + + TKK +V + +P E + + + +D
Sbjct: 430 GCVEIANPHLGDWFCGKCEKTKKPHASVKVKRPISSGAEKEDSRVREKDATVSARLSRDA 489
Query: 447 RLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS----QFEAHA- 501
LHK +F GL +GTE+GYY Q L+G K G GI C CCN E S +FE HA
Sbjct: 490 HLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQHAG 549
Query: 502 ---------------DG--------------------------------------GNLLP 508
DG G L
Sbjct: 550 CEARRNPYGNILVLVDGRSLKDVCKDLTHKNKLGEQQNCEPLARDVNCCYECSSSGELKT 609
Query: 509 CDGCPRAFHKECASLSSI-PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 567
C+GC A+ C + WYC+ C+N + + + V S+ +I
Sbjct: 610 CNGCEEAWCDNCTKGEEVDSDSKWYCRMCRNDTLK---VAQNGQKVSGKHQEESSSITEI 666
Query: 568 TKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMA 624
+R R +++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK M
Sbjct: 667 DERG-RCIRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGMD 725
Query: 625 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDID 681
+L ELP+G+WFC C I+ +L L+ E L ++ + ++ +E I
Sbjct: 726 NLNELPEGEWFCQKGCKVIDEILTQLVAIGPESLSHSIISELPENRQQKSGVIEKAESIS 785
Query: 682 --VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 739
W++L GK ++P L++AV IF +C DPI D+ +G++LIP MV R + +F
Sbjct: 786 PSFEWQILCGKGSSPANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDYDFE 845
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G++C +L +N VVSA +L++FG+E AE+PLVATS + G+G+ + L IE+LL L V
Sbjct: 846 GVFCVVLKLNGKVVSAALLQIFGREFAEVPLVATSLPHQGQGFCKALMTTIERLLGVLSV 905
Query: 800 KSIVLPAAEEAESIWTDKFGFKKI 823
+ +VLP A++ ES+W +KFGF ++
Sbjct: 906 ERLVLPTAKDTESLWVNKFGFSRV 929
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 212/686 (30%), Positives = 315/686 (45%), Gaps = 142/686 (20%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRA--- 311
L GI++D G++C+C +C G +V+ S FE H+ S I ENGK+L ++L A
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 312 ---CRSVPLPMLKATLQSALSSLPEEKSFAC----------------VRCKGTFPITCVG 352
C L LK + + +P+ K AC +C + CVG
Sbjct: 630 SADCGGDILGALKHAI-GEIQGIPK-KEGACGKCGKREGGDFVSCKEPKCSAVYHAECVG 687
Query: 353 KTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITK 412
P + C K +K Q M T + RP + + R+ T+
Sbjct: 688 LPSPHRVDWFCAKCEKAQVKMPKTV---LKMKRPPAVTD----------------REDTR 728
Query: 413 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQK 472
+K + + + +D LHK +F GL DGTE+GYYA Q
Sbjct: 729 LKEKELTVR--------------------SARDAHLHKALFLPGGLADGTELGYYARNQC 768
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHAD-------GGNLLPCDGCPRAFHKECASLS- 524
+L+G K G GI C CCN E+S S FE HA G++L DG R+ C L+
Sbjct: 769 ILKGVKQGGGICCKCCNQEISCSAFEQHAGCESRRNPYGSILLADG--RSLKDMCKELAY 826
Query: 525 ----------------------SIPQG----------DW------------------YCK 534
S QG W CK
Sbjct: 827 QSKLGDRAHQVARTGDVKSSSGSEEQGVLASSQRCESTWCINFGTRFSCQEADSGHPLCK 886
Query: 535 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI--TKRCIRIVKNLEAELSGCLLCRGCD 592
CQ E A+ RV ++ T R +R+ + ++ SGC +C+
Sbjct: 887 ICQKNVE-------GAHKTSKKRVDATANIPATDDTGRNVRLFQAPDSS-SGCAICKKWT 938
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 652
K GF T+L+CDQC RE+HVGCL++ + D ELP+ +W+C +C I VL L+
Sbjct: 939 LKKCGFD-MTMLVCDQCGREYHVGCLRESGILD--ELPEAEWYCQPNCQHIVQVLSQLVA 995
Query: 653 QEAEKLPEFHLNAI---KKYAGNSLETV--SDIDVRWRLLSGKAATPETRLLLSQAVAIF 707
E L + +N + +++ +E S W++L G P L+QAV IF
Sbjct: 996 NGPELLSDNIVNDLLESRQHQQGIVEMAESSSPVFGWQILHGAGENPVNGRTLAQAVEIF 1055
Query: 708 HDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 767
+C DPI D+ SG+++IP MVY R + +F G+YC +LT+N VVS +L++FG+EVAE
Sbjct: 1056 TECSDPIKDAPSGQNMIPIMVYSRRFKDYDFDGIYCVVLTLNEKVVSTALLQIFGREVAE 1115
Query: 768 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 827
+PL+ATS + +G+ + L IE+LL L V+ +VLPA++ AE +W ++FGF +++
Sbjct: 1116 VPLIATSVDHQDQGFCKALMTTIERLLGVLNVERLVLPASKNAEFVWVNRFGFSRMEDAQ 1175
Query: 828 LSIYRKRCSQLVTFKGTSMLQKRVPA 853
L R LV F GT+ML K + A
Sbjct: 1176 LKHIRSMMGLLV-FTGTTMLVKHIDA 1200
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 145/230 (63%), Positives = 172/230 (74%), Gaps = 4/230 (1%)
Query: 150 GSEVSNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKK 209
G SN K+ K S +K K++ VEV V E E++S ++VE IAEGSALT PKK
Sbjct: 10 GEPNSNNRPKRVTK--SKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKK 67
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
NLELKMSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+
Sbjct: 68 NLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCA 127
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CNG RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S
Sbjct: 128 FCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAIS 187
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
LP E++F C RCKG FP CVGK GPLCN C +SK+ T+T + I
Sbjct: 188 GLPVERTFTCKRCKGIFPSICVGKI--GPLCNLCAESKESHPTLTIGSSI 235
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/271 (52%), Positives = 180/271 (66%), Gaps = 6/271 (2%)
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 646
LCR DF+ + F RT++LCDQCE+E+HVGCLK +L+ELP+G+WFCC CS S
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 647 LQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 705
L ++ A+ L E L IKK + L + D++W+LLSGK T + +LLS AV
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRTTEDGSILLSAAVP 131
Query: 706 IFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGILRVF 761
IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA +LRV
Sbjct: 132 IFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVM 191
Query: 762 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 821
G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KFGF
Sbjct: 192 GGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFGFS 251
Query: 822 KIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 852
+ PE L Y + L F GTS L K VP
Sbjct: 252 RTTPEELEAYLNG-AHLTIFHGTSYLYKAVP 281
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 142/277 (51%), Positives = 183/277 (66%), Gaps = 12/277 (4%)
Query: 584 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 643
G L R DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP+G+WFCC CS
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 644 NSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 702
S L ++ A+ L E L I KK+ L + D++W+LLSGK AT E +LLS
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKRATEEGSILLSA 174
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGIL 758
AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA ++
Sbjct: 175 AVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALM 234
Query: 759 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 818
RV G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KF
Sbjct: 235 RVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMKKF 294
Query: 819 GFKKIDPELLSIYRKRC---SQLVTFKGTSMLQKRVP 852
GF +I PE L + C + L F GTS L K VP
Sbjct: 295 GFSRIPPEEL----EACLNGAHLTIFHGTSYLYKAVP 327
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 123/196 (62%), Positives = 157/196 (80%)
Query: 666 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 725
+KK LE + +IDVRWRLL+G+ A+PET+ LL +AV++FH+CFDPIVD +GRDLIP
Sbjct: 1 MKKQEERCLEPLREIDVRWRLLNGRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIP 60
Query: 726 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 785
+MV GRNLR Q+FGGMYCA+L VNSSV SA +LR+FG ++AELPL+AT N GKGYFQ
Sbjct: 61 AMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQT 120
Query: 786 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTS 845
LF+CIE+LLSFL VK++VLPAAEEAESIW KFGF K++P+ L+ YRK Q++ FKGT
Sbjct: 121 LFSCIERLLSFLSVKNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTV 180
Query: 846 MLQKRVPACRIGSSST 861
ML K VP CR+ ++ +
Sbjct: 181 MLHKTVPQCRVTNTQS 196
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 266 bits (679), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 204/690 (29%), Positives = 310/690 (44%), Gaps = 121/690 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C ++
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVRCDCRNCKSSVIVSI 441
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL---PMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 442 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 500
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIR---ISSSRPGLIAN 391
C +C + +G + Y G R I+ SR I
Sbjct: 501 WKVTCAKCWNS-----------------------DEGDLIYCKGARCSIIAHSR--CIGI 535
Query: 392 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 451
S P + ++ +K K IS E + + + +D LHK
Sbjct: 536 SNPRLGDWFCDKCEKMKKPHATVKVKRSISSGTEKDDGRVREKDATESTRLNRDAHLHKA 595
Query: 452 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 501
+F GL DGTE+GYY Q L+G K G E++ +FE HA
Sbjct: 596 LFLPGGLEDGTELGYYTKSQLKLKGVKRGEAF--KKVVVEINCYKFEQHAGCEARRNPYG 653
Query: 502 ------DG--------------------------------------GNLLPCDGCPRAFH 517
DG G L C GC +
Sbjct: 654 NILLVADGRSLKDVCKELAHKNKLGEKEKRVARAGKVNSCYECGTRGELKNCHGCVETWC 713
Query: 518 KECA-SLSSIPQGDWYCKYCQN------MFERKRFLQHDANAVEAGRVSGVDSVEQITKR 570
C L + G WYC+ C+ E+KR +H + G+ ++ + +R
Sbjct: 714 NSCTKGLETDSDGKWYCRMCRQDTLNVAQIEQKRSNKH---------IEGMSNIAETDER 764
Query: 571 CIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLR 627
R V++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK + DL
Sbjct: 765 -DRCVRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLN 823
Query: 628 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-----KKYAGNSLETVSDIDV 682
ELP+G+WFC DC I+ +L L+ E L + ++ + ++ S
Sbjct: 824 ELPEGEWFCQKDCKVIDEILTQLVANGPELLTDSIISELLESRQQQTGAKDKAESSCPSF 883
Query: 683 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMY 742
W++L GK+ L++A+ IF +C DPI D+ +G++LIP MV R + +F G++
Sbjct: 884 AWQILCGKSGNTANTQTLAEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVF 943
Query: 743 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 802
C +L +N VVSA +L++FG E+AE+PLVATS + G+G+ + L IE+LL L V+ +
Sbjct: 944 CIVLKLNEKVVSAALLQIFGGEIAEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERL 1003
Query: 803 VLPAAEEAESIWTDKFGFKKIDPELLSIYR 832
VLP A+ ESIW +KFGF ++ + S R
Sbjct: 1004 VLPTAKNTESIWINKFGFSRVPDDEGSFLR 1033
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 167/446 (37%), Positives = 226/446 (50%), Gaps = 86/446 (19%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGII 259
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y G GL+G+I
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYARG--RPEKGLQGVI 402
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 403 QGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADA 462
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T +
Sbjct: 463 LEAAIRNATGAGDARKSTVCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTTPSQVGD- 519
Query: 380 RISSSRPG-----LIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFP 434
+ SR G + N ++K + S +VL K E S P
Sbjct: 520 -ANCSRDGQLEFIFLLNYYWADDLYKLGLPDLRGLQWSPSSNSVL--KSTERMSSGTCPP 576
Query: 435 NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSP 494
+K +T KD R+HKLVF+ LPDGT + YY G+ VSP
Sbjct: 577 SKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGK--------------------VSP 616
Query: 495 SQFEAH---------------------------------------------------ADG 503
SQFEAH ADG
Sbjct: 617 SQFEAHAGCASRRKPGWYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADG 676
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
G+LL CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D+
Sbjct: 677 GDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDA 735
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCR 589
+EQITKR IRIV L AE+ C+LCR
Sbjct: 736 LEQITKRSIRIVDTLHAEVGVCVLCR 761
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 148/476 (31%), Positives = 224/476 (47%), Gaps = 124/476 (26%)
Query: 501 ADGGNLLPCDGCPRAFHK-------------------------EC-ASLSSIPQGDWYCK 534
D G+L C GCP A+H+ +C S S G+++C
Sbjct: 1005 GDSGDLQLCTGCPNAYHQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCP 1064
Query: 535 YCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-ELSGCLLC 588
CQ +R+R + + G + + +++ RC R+++ EA L GC+ C
Sbjct: 1065 DCQEQRFGGTKDRRRSMTKRRSK---GAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCVFC 1121
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVG------------------------------CL 618
+ DF+K+GFGP+T LLCDQCERE+HVG C+
Sbjct: 1122 KSGDFAKTGFGPKTTLLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCV 1181
Query: 619 KKHKMAD------------------------LRELPKGKWFCCMDCSRINSVLQNLLVQE 654
++ D +ELP+G+WFC DC I+S+L L+
Sbjct: 1182 ATVRLRDCDRGCESQGIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNG 1241
Query: 655 AEKLPEFHLNAIKKYAGNSLETVSDI------DVRWRLLSGKAATPETRLLLSQAVAIFH 708
E L + ++ + + LE D W+LL G+ P L++AV IF
Sbjct: 1242 PEPLADSIISKVLRTNQARLERSEDATESSCSGFEWQLLHGRGGDPSNGKALAEAVQIFS 1301
Query: 709 ----------------------------DCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 740
+CFDPI D +SG DLIP MVY R+LR Q+FGG
Sbjct: 1302 VRNLSDPGFPVRTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGG 1361
Query: 741 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 800
+YC +L ++ VVS ++RVFG+++AELPL+AT+ + G+G+ + L IE+LL LRV+
Sbjct: 1362 IYCVVLKYDNRVVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVE 1421
Query: 801 SIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
+ LPAAE AE IW +KFGF+++ + + + +V F G+ ML+K +P I
Sbjct: 1422 RLALPAAEGAEGIWLNKFGFRRMAEGQVKQFHSDLNMMV-FTGSFMLEKEIPPLEI 1476
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 126/498 (25%), Positives = 202/498 (40%), Gaps = 110/498 (22%)
Query: 58 VKRSRFSNSDDLLEDDVIDKR----------INSKIHEGRINKVVKNVL-------NENG 100
V SR + + L DD +D R N + E + ++ + V+ NG
Sbjct: 386 VAGSRNDATSNPLSDDGLDGREQDGNGGKGLTNGTVEEDKCTELFEPVVLLPEHSGGTNG 445
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKK 160
+ES V+ E+ E+ + G A SK FP+ G + + ++++
Sbjct: 446 FVESAVDSEDLSSDPAREDSLGNQELGGSAGAAVAS--SKDLEFPQHSGSEKRGDSMDRQ 503
Query: 161 ------------CLKRPSAMKPKVEPVEVLVTQSE---------GFGNESMSLIEVEAIA 199
R +K V P+++ + +G + S ++++
Sbjct: 504 DSDVATPSAATSAQSRSPGLKVGVVPLKIWLWAVGWLGGWAWLVSYGADGNSRLKLKGQG 563
Query: 200 EGSALTSPKKNLELK-MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGI 258
+ S SP N + K++ L + P + L ++GLLDG V YMG + L GI
Sbjct: 564 DVSMRQSPTLNGARGFVVKEVLLKEAPASAKLLLQSGLLDGHHVRYMG--RGGHIMLTGI 621
Query: 259 IRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGK--------------- 303
I++GG+LC CS C G +V+ S FE HA R S +I ENGK
Sbjct: 622 IQEGGVLCDCSSCKGVQVVNVSAFEKHAGSSARHPSDFIFLENGKCLKDILEIGWNANKQ 681
Query: 304 --SLLEVLRAC-----------RSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITC 350
++++VL++ S+ P++ +Q AL LP+ KS + K P+
Sbjct: 682 KMNIMDVLKSAIGEVGGVKVQISSLEHPLI--AIQPALKKLPQPKSL--LDTKPRVPVD- 736
Query: 351 VGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKI 410
VK + PQG ++P + +++ + ++ R
Sbjct: 737 -------------VKPRTPQG-----------DTKPKMPSDTKARFTPEVKARGSDARAS 772
Query: 411 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 470
+ +T + E ASPP+ S N LHK +F GL D TEVGYY G
Sbjct: 773 MPRLDRT---PREKETASPPVLSRESSGAN-------LHKALFLPGGLEDDTEVGYYVKG 822
Query: 471 QKLLEGYKNGLGIICHCC 488
QK L G K G GI+C CC
Sbjct: 823 QKSLAGVKKGAGILCSCC 840
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 232 bits (592), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 198/333 (59%), Gaps = 15/333 (4%)
Query: 528 QGDWYCKYC-QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
+G WYC+ C Q+ + + Q + + G + ++ EQ RCIR + E+ GC
Sbjct: 983 EGRWYCRMCRQDSLKVAQNGQKGSEKIMEGMSNIAETNEQ--GRCIRHLDGPR-EVGGCA 1039
Query: 587 LCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 645
+C+ + SK+GF TIL+CDQC RE+HV CLK + DL ELP G+WFC DC I+
Sbjct: 1040 ICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDCKVIDE 1099
Query: 646 VLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDIDVR--WRLLSGKAATPETRLLL 700
+L L+ E L ++ + ++ ++ E + + R W++L GK ++P L
Sbjct: 1100 ILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSPADVQTL 1159
Query: 701 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 760
++A IF DC DPI D +G++LIP MV R + +F G++C +L +N VVSA +L++
Sbjct: 1160 AEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVVSAALLQI 1219
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
FG+E+AE+PL+ATS +HG+ + + L IE+LL L V+ +VLP A+ ES+W +KFGF
Sbjct: 1220 FGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESVWINKFGF 1279
Query: 821 KKIDPELLSIYRKRCS--QLVTFKGTSMLQKRV 851
++ + L + C+ +L TF GTSM+ K +
Sbjct: 1280 SRVQEDQL---KSICTTIRLTTFTGTSMVVKAI 1309
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 130/310 (41%), Gaps = 53/310 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L + P EL T L++G V GI+ L G+++D G+ C+C C G ++
Sbjct: 563 LVRAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVQCNCRNCKGSVIVSI 617
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL---PMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 618 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 676
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIR---ISSSRPGLIAN 391
+ C +C+ + +G + Y G R ++ SR IAN
Sbjct: 677 RKVTCAKCESS-----------------------QEGDLIYCKGARCSVVAHSRCVGIAN 713
Query: 392 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFEN------------ASPPLSFPNKSRW 439
+ ++ K K+++ E+ ASP S +
Sbjct: 714 PQLGDWFCGKCEKTKKHHAAAKVKRSISGGADPEDGKVRLLRDLQGYASPSASKDKAATA 773
Query: 440 NIT-PKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFE 498
++ +D LHK +F GL DGTE+GYY Q L+G K G GI C CCN E +
Sbjct: 774 SVRLNRDAHLHKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNEET--RHVQ 831
Query: 499 AHADGGNLLP 508
+ GG + P
Sbjct: 832 GGSRGGLMFP 841
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 219 bits (558), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 208/787 (26%), Positives = 328/787 (41%), Gaps = 184/787 (23%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL T L++G + + L G+++D G+ C+C C G ++ S
Sbjct: 515 LLKAPRNAKELMATKLMEG----HFVRCSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSIS 570
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL---PMLKATLQSALSSLP--EEK 335
FE H+ S I ENGK+L +VL A + +L+A L+ A+ + E+
Sbjct: 571 AFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADCGDNILRA-LKMAIGDIQGVEKS 629
Query: 336 SFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIR---ISSSRPGLIANS 392
C C G+ +G + Y G R +S SR + ++
Sbjct: 630 KVTCAECGGS-----------------------EEGDLIYCKGARCSVVSHSR--CVGSA 664
Query: 393 TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKS-RWNITPKDQRLHKL 451
P + ++ +K +K IS E++ S R N +D L K
Sbjct: 665 NPQLGDWFCGKCEKTKKRHAAAKVKRSISAGTEDSEVRDKATTASARLN---RDAHLRKA 721
Query: 452 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 501
+F GL DGTE+GYY Q L+G K G GI C CCN E+S +FE HA
Sbjct: 722 LFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYEFEQHAGCEARRNPYG 781
Query: 502 ------DG-------------------------------------GNLLPCDGCPRAFHK 518
DG G L C C A+
Sbjct: 782 NILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVCCYECSNSGELKRCHSCEEAWCD 841
Query: 519 ECA-SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 577
+C + + +G WYC+ C+ + + Q+ + + G+ ++ + ++ R V++
Sbjct: 842 KCTKGMETDSEGRWYCRMCRQ--DSLKVAQNGHKGTDK-IIEGMSNIAETDEKG-RCVRH 897
Query: 578 LEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHV----------------GCL 618
LE E+ GC +C+ + SK+GF TIL+CDQ + L
Sbjct: 898 LEGPREVGGCAICKKWNLSKTGFVDGMTILVCDQVRSLNQMLPGTRITWKVNGFTDPNIL 957
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
+ +ELP+G+WFC DC I+ +L L+ PE N+I S + S
Sbjct: 958 MQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANG----PELLSNSIISELLESRQQQS 1013
Query: 679 DIDVR---------WRLLSGKAATPETRLLLSQAVAIF---------------------- 707
+ V+ W++L G+ + L++A IF
Sbjct: 1014 SVKVKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFTSIDDINLPYLWLVVGNYSPST 1073
Query: 708 -------------HDC-----FDPIVDSISGRDL----IPSMVYGRNLRGQEFGGMYCAI 745
C FDP S D+ + +V R + +F G++C +
Sbjct: 1074 HTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNVALVVCSRRAKDHDFEGVFCVV 1133
Query: 746 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 805
L +N VVSA +L++FG+E+AE+PL+ATS + G+G+ + L IE+LL L V+ +VLP
Sbjct: 1134 LKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCKALMTTIERLLGVLSVERLVLP 1193
Query: 806 AAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPAC---RIGSSS 860
A+ ESIW +KFGF ++ + L ++ C+ +L+TF GT ML K + RI S
Sbjct: 1194 TAKNTESIWINKFGFSRVPEDQL---KRICTTIRLMTFTGTRMLGKAITPMTLNRIQRQS 1250
Query: 861 TDSTECV 867
D C+
Sbjct: 1251 RDGCVCI 1257
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 99/133 (74%), Positives = 116/133 (87%)
Query: 727 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
MVYGRN+RGQEFGGMYCA+L VNSSVVSA +LR+FG +VAELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 846
F+CIE+LL+F+ VKS+VLPAAEEAESIWTDKFGF ++ P+ LS YRK +Q+VTFKGT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 847 LQKRVPACRIGSS 859
L K VP CRI S+
Sbjct: 121 LHKLVPPCRIISN 133
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/130 (75%), Positives = 116/130 (89%)
Query: 727 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 846
F+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 847 LQKRVPACRI 856
LQK VP+CR+
Sbjct: 121 LQKTVPSCRV 130
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 160/538 (29%), Positives = 244/538 (45%), Gaps = 65/538 (12%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 204 ERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 260
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS CN +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 261 GCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 320
Query: 327 ALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNS---CVKSKKPQGTMTYTTGIRISS 383
S E+ F + VG C+S C+ P ++Y + S
Sbjct: 321 VAGSSINEEFFRVWKASLNQSNALVGADKK---CHSELPCL----PHSHVSYASQALKES 373
Query: 384 SRP---GLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKS--- 437
P + N+ V+ S ++ +K+ P L FP +
Sbjct: 374 FCPISSSFLYNNNFVSQQMYMETSGVNKQTSKR---------------PSLYFPGSATKQ 418
Query: 438 ----RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 493
+ +D LH+L+F +GLPDGTE+ YY GQK+L GYK G GI+C CC E+S
Sbjct: 419 KKTAESGVRKRDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEVEIS 478
Query: 494 PSQFEAHADGG-------NLLPCDGCPRAFHKECASLSSIPQ-----GDWYCKYC---QN 538
PSQFE+HA ++ +G H SL++ GD C C +
Sbjct: 479 PSQFESHAGMSARRQPYRHIYTSNGL--TLHDIAISLANGQNITTGIGDDMCAECGDGGD 536
Query: 539 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR------GCD 592
+ + + ++ G + R R+ K E + C++CR D
Sbjct: 537 LMWHVWIYRILLKVLGIVQIDGGNFARPTVIRLTRVGKIPEYNVGDCVVCRLNLLKFLID 596
Query: 593 FSKSGFGP-RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 651
F+ + ++L ++EFHVGCL++ + DL E+P+ WFCC DC+ I L+N +
Sbjct: 597 FTLANSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFCCQDCNNIYVALRNSV 656
Query: 652 VQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIF 707
+K+P LN I K L + DV+W++L GK+ E LLS A AIF
Sbjct: 657 STGVQKIPASLLNIINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIF 714
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/70 (54%), Positives = 47/70 (67%), Gaps = 4/70 (5%)
Query: 783 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 842
F LF+CIE+LL L V+ +VLPA AE+IWT +FGF+K+ L Y R QL FK
Sbjct: 736 FNPLFSCIERLLCSLNVEQLVLPA---AETIWTRRFGFRKMSEGQLLKY-TREFQLTIFK 791
Query: 843 GTSMLQKRVP 852
GTSML+K VP
Sbjct: 792 GTSMLEKEVP 801
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 196/677 (28%), Positives = 285/677 (42%), Gaps = 122/677 (18%)
Query: 226 MTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRV--IPPS 280
+T+ E+ + G L G V + G + S I +G I C C C + + S
Sbjct: 228 ITLREVLKGGALRGQPVFFQSRHGDLLLNGS----ITEEGQIACPCKQCRAKKTPGVSCS 283
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
+FE HA + RR + I N S+ L A + S S AC
Sbjct: 284 EFEEHAGSRERRPGESIYLTN-----------LSISLKEFCALVNDEGRSADRHGS-ACG 331
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRP---------GLIAN 391
C + C C+ C P Y G+ G
Sbjct: 332 LCMDGGDLLC---------CDGC-----PTAVHAYCAGLEEVPEGDWFCDACVARGAQLK 377
Query: 392 STPVTSVHKSSQSQRQRKITKKSKK-------------TVLISKPFENASPPLSFPNKSR 438
+ P+ S K S+ + R+ +K+ K + A+P L + +R
Sbjct: 378 AKPLPSPQKPSKPAKWRQPKQKAPKEKKHGGSGAAKKAAHAGAPRVHMAAPALRVVSGAR 437
Query: 439 WNITPKDQRLHKLVF--DE-SGLPDGTEVGYYAC-GQKLLEGY-------KNGLGIICHC 487
++ HK +F DE GL DG V Y G++LL+G GI+C C
Sbjct: 438 RE---RNSNKHKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCAC 494
Query: 488 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 547
CN +S SQFEAHA G+ RA + + + + C E + +
Sbjct: 495 CNGVISCSQFEAHAGRGSR-------RAPYDNIFTAAGVSLRKLAC--LMPASEAESPIS 545
Query: 548 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 607
H A+ A E +T V A GC+LC+ DF + GFG RT+++CD
Sbjct: 546 HRPAALCAVADRRALEPELVT------VSGEAALHGGCVLCKVPDFLRGGFGERTMIICD 599
Query: 608 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA-- 665
QCERE+H+GCL +H A L ELP+GK + L ++L+ HLN
Sbjct: 600 QCEREYHIGCLAEHGRAHLTELPEGK-----------ASLYDILLT-------LHLNGEW 641
Query: 666 -----IKKYAGNSLETVSDIDV------RWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 714
K A E VS + V W++L GK T T L A I + FDPI
Sbjct: 642 HCSPECKGIATRMRERVSSVPVPLQGEYSWQVLRGKDGTHATTWALKAAQEILTESFDPI 701
Query: 715 VDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 774
+D ++G DL+ +MVY + L ++ GMY A+L V + + RVFG+++AE+PLVAT
Sbjct: 702 LDLVTGADLMMAMVYAQELGDWDYTGMYTAVLRRRGKAVCSAVFRVFGRQLAEVPLVATR 761
Query: 775 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR 834
+G+ ++L A E L V+S+ LPAA+ W FGF I PE +
Sbjct: 762 LGARRQGHARVLMAAFEDYFRSLGVQSLCLPAAQSTVETWIHGFGFAAITPEEQAAT--- 818
Query: 835 CSQL--VTFKGTSMLQK 849
CS+L + F GT +LQK
Sbjct: 819 CSELRVLIFPGTELLQK 835
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 190/678 (28%), Positives = 286/678 (42%), Gaps = 81/678 (11%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPP 279
L+ V L TGLL+G V YM K + + G I G C CS CN ++
Sbjct: 108 LDSDLRDVRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNA 163
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
+FE H + + +I + G SL V+ A + L ML ++ + P +
Sbjct: 164 CEFEEHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN- 222
Query: 340 VRCKGTF-----------PITCVGKTGPGPLCNSCVKSKKPQ---------GTMTYTTGI 379
+ K +F C+ ++ G + S + + ++
Sbjct: 223 -KWKASFQKRKDYLDAVASDGCLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNWSASK 281
Query: 380 RISSSR--PGLIANSTPVTSVHKS------SQSQRQRKITKKSKKTVLISKPFENASPPL 431
R S R G STP S S ++K T+++ + L S P + PL
Sbjct: 282 RRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHRLSL-SSPVKITQRPL 340
Query: 432 ---SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHC 487
S +KS+ + T +D LH L+F E GL D T + Y G+ L +GYK G IIC+C
Sbjct: 341 RNCSIDSKSKESKT-RDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNC 399
Query: 488 CNSEVSPSQFEAHADGG-------NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 540
CN E SPS FE HA G N+ +G + HK L + + N
Sbjct: 400 CNQEFSPSHFEEHAGMGRRRQPYHNIYTLEGL--SLHKLALQLQDHLNPNGF----DNAS 453
Query: 541 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 600
H+ + GR S + + R ++ E C C + P
Sbjct: 454 VSSVSDYHNLTSSGCGREPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINP 511
Query: 601 RTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
TI+ C+QCER H+ C KK + L+E + CC +C + + L+
Sbjct: 512 DTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLEE----- 566
Query: 655 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 714
++K G + ++ WRLLSG A+ + +L + Q + IF D F
Sbjct: 567 ----------GLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDS 616
Query: 715 VDSISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 773
D S D+I MV G+N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 617 TDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIAT 674
Query: 774 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 833
KGYF LL IE L V + P E IW++K GF + E +
Sbjct: 675 RSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESMLE 734
Query: 834 RCSQLVTFKGTSMLQKRV 851
LV FK ++QK +
Sbjct: 735 S-HPLVMFKNLVLVQKSL 751
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 156/513 (30%), Positives = 223/513 (43%), Gaps = 77/513 (15%)
Query: 370 QGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASP 429
QG + I SS G+ S + K ++ R + T S P +A
Sbjct: 63 QGDTEISAPALIGSSDKGISGLSAGTSKEKKGTEETRSAQNTGDLGLISSSSSPVTSAQR 122
Query: 430 PL---SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIIC 485
PL S + SR + +D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C
Sbjct: 123 PLPSSSVGSNSRES-KKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVC 181
Query: 486 HCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 545
+CCN EVSPS+FE HA G P + Y + +
Sbjct: 182 NCCNQEVSPSEFEKHAGMGK------------------RRQPYQNIYTSQGLTLHDVALQ 223
Query: 546 LQH---DANAVEAGRVSGVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRGC 591
L H ++N VS +T + + + L+ + C C
Sbjct: 224 LHHLNLNSNGFSNASVSSFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYFC 283
Query: 592 DFSKSGFG---PRTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSR 642
+ + G P I+ C+QCER HV C KK + L++ + CC +C
Sbjct: 284 GYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQS 343
Query: 643 INSVLQNLLVQEAEKLPEF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLS 701
+ L+ + EK E L I+ ++ WRLLS A+ + +L LS
Sbjct: 344 LRVRLEGM-----EKCEEIAFLGRIRS------------NICWRLLSSADASRDVKLYLS 386
Query: 702 QAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRV 760
Q + IF D F ++S I MVYG+N G+ +F GMYC +LT ++ VVSA IL+V
Sbjct: 387 QVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKV 443
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
+ AEL L+AT KGYF+LL IE L V ++ P E IW+DK GF
Sbjct: 444 RVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLGF 503
Query: 821 KKIDPELLSIYRKR----CSQLVTFKGTSMLQK 849
+LS K+ LV FK ++QK
Sbjct: 504 T-----ILSADEKKSMLESHPLVMFKNLVLVQK 531
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 186 bits (472), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 138/436 (31%), Positives = 196/436 (44%), Gaps = 73/436 (16%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAD 502
+D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE HA
Sbjct: 170 RDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAG 229
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH---DANAVEAGRVS 559
G P + Y + + L H ++N VS
Sbjct: 230 MGK------------------RRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVS 271
Query: 560 GVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PRTILL 605
+T + + + L+ + C C + + G P I+
Sbjct: 272 SFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHTELGKIDPNMIVF 331
Query: 606 CDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP 659
C+QCER HV C KK + L++ + CC +C + L+ + EK
Sbjct: 332 CNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGM-----EKCE 386
Query: 660 EF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 718
E L I+ ++ WRLLS A+ + +L LSQ + IF D F ++S
Sbjct: 387 EIAFLGRIRS------------NICWRLLSSADASRDVKLYLSQVIDIFKDAF---LEST 431
Query: 719 SGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 777
I MVYG+N G+ +F GMYC +LT ++ VVSA IL+V + AEL L+AT
Sbjct: 432 DAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIATRSEC 491
Query: 778 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR--- 834
KGYF+LL IE L V ++ P E IW+DK GF +LS K+
Sbjct: 492 RKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLGFT-----ILSADEKKSML 546
Query: 835 -CSQLVTFKGTSMLQK 849
LV FK ++QK
Sbjct: 547 ESHPLVMFKNLVLVQK 562
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 160/533 (30%), Positives = 233/533 (43%), Gaps = 65/533 (12%)
Query: 344 GTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANS---TPVTSVHK 400
GT T G G G +S S+K T++ S G+ ++S T VT+ H
Sbjct: 294 GTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDGVKSDSPLPTAVTTNHS 353
Query: 401 SSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRWNITPKDQRLHKLVFDESG 457
S + + +S P + PL S +KS+ + T +D LH L+F E G
Sbjct: 354 KHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKSKESKT-RDTTLHPLIFKEDG 403
Query: 458 LPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGG-------NLLPC 509
L D T + Y G+ L +GYK G IIC+CCN E SPS FE HA G N+
Sbjct: 404 LADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNIYTL 463
Query: 510 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 569
+G + HK L + + N H+ + GR S +
Sbjct: 464 EGL--SLHKLALQLQDHLNPNGF----DNASVSSVSDYHNLTSSGCGREPSTTSGPIVPL 517
Query: 570 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKM 623
+ R ++ E C C + P TI+ C+QCER H+ C KK +
Sbjct: 518 K--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPL 575
Query: 624 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 683
L+E + CC +C + + L+ ++K G + ++
Sbjct: 576 EILKEYMCFHFLCCQECQSLRARLEE---------------GLEKCVGITFLRRIRSNIC 620
Query: 684 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 742
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 621 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 678
Query: 743 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 802
CA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V +
Sbjct: 679 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 738
Query: 803 VLPAAEEAESIWTDKFGFKKIDPELLSIYRK----RCSQLVTFKGTSMLQKRV 851
P E IW++K GF +LS K LV FK ++QK +
Sbjct: 739 TAPVDPEMAQIWSEKLGFT-----ILSAEEKESMLESHPLVMFKNLVLVQKSL 786
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLP 332
+ + +I + G SL V+ A + L ML ++ + P
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPP 216
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 152/280 (54%), Gaps = 40/280 (14%)
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK P + EL TG+L+G V Y+ K + + LRG+I+ GILCSCS C G +V+ P
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIM-RKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYY 315
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA + S YI ENG +L +VLRAC + L ML+ ++ A+ P+E+ F C
Sbjct: 316 FEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKS 375
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKS 401
CK +F GK C+SC++SK + ISSS+ G S+
Sbjct: 376 CKSSFSTLRSGKF--ALFCDSCLESKGAKNN--------ISSSKVGRSQTSS-------- 417
Query: 402 SQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDG 461
+K +++ASP + S +T KD+ +HK+VF LP+G
Sbjct: 418 -------------------AKVYKSASP--GAKSSSVGRLTRKDKGMHKVVFMSGILPEG 456
Query: 462 TEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 501
T+VGYY G++LL+GY LGI CHCC++ VSPSQFE HA
Sbjct: 457 TDVGYYVGGKRLLDGYIKELGIYCHCCSTVVSPSQFEGHA 496
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 95/280 (33%), Positives = 145/280 (51%), Gaps = 30/280 (10%)
Query: 583 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 642
+ C+LC DF + GF +T+L+CDQCE+E+H+GCL++HKM D++ +P+G+WFC +C R
Sbjct: 1510 AACVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVR 1569
Query: 643 INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 702
I L + +P GN RW++L GK +T LS
Sbjct: 1570 IRDALGEDVAAGEVLMP-----------GNPA-------YRWQILRGKNGRQQTWHALST 1611
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-------TVNSSVVSA 755
+ I + FDPI+D+ SG DL+P+MV +F GMY +L V A
Sbjct: 1612 VLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVLA 1671
Query: 756 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 815
+ RV G +AE+PLVAT +G+ + L + L L V++IVLPA +A+ W
Sbjct: 1672 ALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAWR 1731
Query: 816 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---RVP 852
+ F+ +D + R ++V F T++L + RVP
Sbjct: 1732 -QLQFQDLDEPSTRVARSE-HRMVIFPHTTVLARPLIRVP 1769
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 138/438 (31%), Positives = 201/438 (45%), Gaps = 41/438 (9%)
Query: 429 PPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHC 487
P S KS+ + T +D LH L+F E GL D T + Y G+ L +GYK G GIIC+C
Sbjct: 91 PNYSIGTKSKESKT-RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNC 149
Query: 488 CNSEVSPSQFEAHADGG-------NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 540
C+ E +PS FE HA G N+ +G HK L + + + F
Sbjct: 150 CSQEFAPSHFEEHAGMGRRRQPYHNIYTPEG--STLHKLALQLQDHLNSNGFDNASVSSF 207
Query: 541 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 600
L +A GR S + + R ++ E C C + P
Sbjct: 208 SDYPNL---TSASGCGRQPSTTSGPIVPLK--RTLQGRVVETESCYFCGYGHTTIGNIDP 262
Query: 601 RTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
I+ C+QCER HV C KK + L+E ++ CC +C +L++ L +
Sbjct: 263 DMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQECQ----LLRDRLEEG 318
Query: 655 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 714
EK E +++ N + WRLLSG A+ + +L + Q + IF D F
Sbjct: 319 LEKCEEIAF--LRRIRSN---------ICWRLLSGMDASRDVKLFMPQVIDIFKDAFVES 367
Query: 715 VDSISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 773
D S D+ MV +N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 368 TDEHS--DIFSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRMEQIAELVLIAT 425
Query: 774 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 833
+ KGYF LL IE L V + P E IW++K GF + E +
Sbjct: 426 RRECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESVLE 485
Query: 834 RCSQLVTFKGTSMLQKRV 851
LV FK ++QK +
Sbjct: 486 S-HPLVMFKNLVLVQKSL 502
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 162 bits (409), Expect = 9e-37, Method: Composition-based stats.
Identities = 101/303 (33%), Positives = 159/303 (52%), Gaps = 31/303 (10%)
Query: 562 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
DSV + + + L A+L S C+LC +F + GF +T+L+CDQCE+E+H+GCL+K
Sbjct: 1762 DSVTLTGRDEEHVSEALAADLASSCVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRK 1821
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
HKM D++ +P+G+WFC +C RI +L L +E E +GN
Sbjct: 1822 HKMVDMQAVPEGEWFCSDECVRIRELLTKSL-EEGE----------TTMSGNPA------ 1864
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 740
RW+ + G+ T T L + I + FDPI+D+ SG DL+P MV+ + +F G
Sbjct: 1865 -YRWQFIRGRDGTKATARALKTVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQG 1923
Query: 741 MYCAILTVNSS-------VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 793
MY +L + V AG++RV G +AE+PLVAT +G+ + L +
Sbjct: 1924 MYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHR 1983
Query: 794 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---R 850
L L V+++VLPA +A W + F +D + + R +++ F TS++ + R
Sbjct: 1984 LIALGVRAMVLPATADALPAWR-QLAFMDLDEGSVRVARGE-HRMIIFPHTSVVVRQLIR 2041
Query: 851 VPA 853
VP
Sbjct: 2042 VPG 2044
Score = 41.2 bits (95), Expect = 2.7, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 458 LPDGTEVGYYACGQKLLEGY------KNGLGIICHCCNSEVSPSQFEAHA 501
L DG V Y GQ+LL G G GI+C CC+ +S S FE+HA
Sbjct: 1535 LQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHA 1584
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 183/677 (27%), Positives = 276/677 (40%), Gaps = 106/677 (15%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
V L TGLL+G V Y K G I G C CS C ++ +FE H+
Sbjct: 246 VRGLLSTGLLEGFRVTY----KKNEVERIGRINGQGYSCGCSECGYRNIMNACEFEQHSG 301
Query: 288 KQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFP 347
+ + +I ++G SL V++ + L ML + +S P + + K +F
Sbjct: 302 ESSNNQNNHIFLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYE--KWKASFQ 359
Query: 348 I--------------TCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRP---GLIA 390
+ T + L +S S ++ + R S R G
Sbjct: 360 LEKDYFDDAPSDPCSTQSSQESNIALTDSLKDSTSNASSILNWSSFRRRSDRQFKRGGTE 419
Query: 391 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISK--PFENAS--------------PPLSFP 434
STP+ S +++I+ S T + S+ P EN + P +
Sbjct: 420 TSTPILS------RSPEKEISDLSTSTSMKSEETPSENTAGLLTTDVTVIQDPPPDHNVD 473
Query: 435 NKSRWNITPK--DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSE 491
+ S+ PK D LH ++F E GLPD T + Y G+ L +GYK G GIIC CC+ E
Sbjct: 474 SNSKDLGQPKVRDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIE 533
Query: 492 V--SPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 549
V +PSQFE H G + SI D + + + + L +
Sbjct: 534 VQYTPSQFEKHVGMG-------------RRRQPYRSIYTSDGLTLH-ELALKLQDGLSSN 579
Query: 550 ANAVEAGRV-SGVDSVEQITKRCI-----RIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 603
N E + SG T R I R ++ + C +CR I
Sbjct: 580 VNIDELPTLTSGSGKEYSTTSRPIIVPLKRTLQERVLTVESCYMCRKPHTVLGVISVDMI 639
Query: 604 LLCDQCEREFHVGC----LKKHK--MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 657
+ C+QCER HV C L+K K + L E + + CC C + + L
Sbjct: 640 VFCNQCERALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLHE-------- 691
Query: 658 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 717
+KK + ++ W+LL+G + + Q + IF D F +
Sbjct: 692 -------GLKKREDIAFLRRIRYNICWQLLNGTNMRSDVQ---HQVIEIFKDAFAET--A 739
Query: 718 ISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 776
D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT
Sbjct: 740 PQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNE 799
Query: 777 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR-- 834
KGYF LL + IE L V+ + P E IW++K G+ +LS +K
Sbjct: 800 CRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSM 854
Query: 835 --CSQLVTFKGTSMLQK 849
LV F S++QK
Sbjct: 855 LMAHPLVMFANLSLVQK 871
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/370 (29%), Positives = 159/370 (42%), Gaps = 74/370 (20%)
Query: 484 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
IC CN +GG +L CD CP +FH C L S P+G WYC C+
Sbjct: 811 ICSICN------------EGGEILLCDNCPSSFHHACVGLESTPEGSWYCPSCR------ 852
Query: 544 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS--KSGFGPR 601
C +C D+ + F +
Sbjct: 853 -----------------------------------------CSICDSSDYDPDTNKFTEK 871
Query: 602 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
TI+ CDQCERE+HVGC++ +K L P+G WFC CS I LQ L+ + E
Sbjct: 872 TIMYCDQCEREYHVGCMR-NKGDQLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPTPVEG 930
Query: 662 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 721
I ++ + D E L A+ + H+CF I++ + R
Sbjct: 931 LSCTILRFDRENASQHGDF--------YNEIIAEQYGKLCIALDVLHECFVTIIEPSTRR 982
Query: 722 DLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 779
DL +V+ R LR F G Y IL + ++S G RV G++ AELPL+ T
Sbjct: 983 DLSEDIVFNRESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAELPLIGTRVQYRR 1042
Query: 780 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 839
+G +LL +EKLLS L V+ +VLPA + WT FGF+ + + S ++
Sbjct: 1043 QGMCRLLMNELEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAM--SFSDRFELAESSIL 1100
Query: 840 TFKGTSMLQK 849
+F+GT++ QK
Sbjct: 1101 SFQGTTICQK 1110
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 101/139 (72%), Gaps = 4/139 (2%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGP 356
C RCKG FP +C+ + GP
Sbjct: 118 TCKRCKGRFPSSCMERVGP 136
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 159/358 (44%), Gaps = 71/358 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP +H C L +P+G+W+C C+
Sbjct: 103 GDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR----------------------- 139
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCL 618
C +C G +++ G F T+LLCDQCERE+HV CL
Sbjct: 140 ------------------------CAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCL 175
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
MA + P WFC C +I L+ L V + + E + + + L + S
Sbjct: 176 YSRGMAKMTSCPDDSWFCGDHCDKIFEGLRKL-VGISNTIGEGLSWTLLRSGEDDLPSAS 234
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRG 735
++ + E R L+ A+ + +CF P+VD + DL+ ++Y R +
Sbjct: 235 SMN--------REQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 286
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
F G Y +L + V+S +R+ G +AE+PL+ T + +G + L IE LL
Sbjct: 287 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQ 346
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 849
L ++S VLPA E W + FGF+++ P EL+ + +V+F G ++LQK
Sbjct: 347 RLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLELVKL------SVVSFPGVTLLQK 398
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 159/358 (44%), Gaps = 71/358 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP +H C L +P+G+W+C C+
Sbjct: 128 GDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR----------------------- 164
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCL 618
C +C G +++ G F T+LLCDQCERE+HV CL
Sbjct: 165 ------------------------CAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCL 200
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
MA + P WFC C +I L+ L V + + E + + + L + +
Sbjct: 201 YSRGMAKMTSCPDDSWFCGDHCDKIFQGLRKL-VGISNNIGEGLSWTLLRSGEDDLPSAN 259
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRG 735
++ + E R L+ A+ + +CF P+VD + DL+ ++Y R +
Sbjct: 260 SMN--------REQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 311
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
F G Y +L + V+S +R+ G +AE+PL+ T + +G + L IE LL
Sbjct: 312 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQ 371
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 849
L ++S VLPA E W + FGF+++ P EL+ + +V+F G ++LQK
Sbjct: 372 RLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLELVKL------SVVSFPGVTLLQK 423
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 156/353 (44%), Gaps = 64/353 (18%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
DGG+LL CD CP ++H +C L +IP+G+WYC C+
Sbjct: 686 DGGDLLLCDNCPSSYHHDCVGLEAIPEGNWYCPSCR------------------------ 721
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS--KSGFGPRTILLCDQCEREFHVGCLK 619
C +C D+ S F +TI+ CDQCERE+HVGC +
Sbjct: 722 -----------------------CSICNLSDYDPDTSQFTEKTIVYCDQCEREYHVGCTR 758
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
L P+G WFC CS + LQ L+ + E I K+ + D
Sbjct: 759 NSD-NQLICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEGVSWTILKFCSGNGSDHGD 817
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQE 737
D + L AV I H+CF I++ + D+ +V+ R LR
Sbjct: 818 YD--------DEIMADHYGKLCVAVGILHECFVTIIEPRTQSDISEDIVFNRESELRRLN 869
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
F G Y +L +S G R+ GQ+ AELPL+ TS +G +LL +EKLL L
Sbjct: 870 FRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTSSPYRRQGMCRLLINELEKLLLDL 929
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ ++LPA E WT FGF + + + L + + +++F+GT+M QK
Sbjct: 930 GVERLILPAVPELLETWTCSFGFTIMSNSDRLEL---AGNSILSFQGTTMCQK 979
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 173/394 (43%), Gaps = 85/394 (21%)
Query: 488 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 547
C+ E S D G LL CD CP AFH C L + P+GDW C C+
Sbjct: 435 CSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------- 484
Query: 548 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTIL 604
C +C G D + GF +TI+
Sbjct: 485 -------------------------------------CGVCGGSDLDDDTAEGFTDKTII 507
Query: 605 LCDQCEREFHVGCLKK-----HKMADL-RELPKGK--------WFCCMDCSRINSVLQNL 650
C+QCERE+HVGC+++ A+ R L + + W C +C + LQ L
Sbjct: 508 YCEQCEREYHVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQAL 567
Query: 651 LV-QEAEKLPEFHLNAI-------KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 702
+ A +P + A ++Y + TV+ I RW+ AA L
Sbjct: 568 VASSRARSIPHYSRGAYHSAPCGRRRY----MSTVTRI-TRWQHEEEDAADHGQ---LCA 619
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRV 760
A+ + H+CFD +V+ + DL +V+ + LR F G Y L +++ G LRV
Sbjct: 620 ALDVLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRV 679
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
FG +VAELPLV T + +G +LL +EK+L + V+ +VLPA E +WT GF
Sbjct: 680 FGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELMPMWTASLGF 739
Query: 821 KKID-PELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
+ +++ + + +++FKGT+M QK + A
Sbjct: 740 HAMTRSDVMEMAVEHA--ILSFKGTTMCQKTLLA 771
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 58/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 810 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 863
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H GC +
Sbjct: 864 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 885
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 886 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 945
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 946 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 995
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 996 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLN 1055
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1056 VEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1104
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 58/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 965 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 1018
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H GC +
Sbjct: 1019 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 1040
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 1041 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 1100
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1101 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 1150
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 1151 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLN 1210
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1211 VEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1259
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 58/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 418 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 471
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H GC +
Sbjct: 472 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 493
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 494 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 553
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 554 DER----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 603
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 604 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLN 663
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 664 VEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 712
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 58/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 946 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 999
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H GC +
Sbjct: 1000 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 1021
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 1022 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 1081
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1082 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 1131
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 1132 RGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLN 1191
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 1192 VEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 1240
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 154/351 (43%), Gaps = 59/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L +P DW C C + L HDA
Sbjct: 1036 GDGGNLICCDGCPSTFHMSCLELEELPSDDWRCTNCSCKLCHE-HLNHDA---------- 1084
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D+ E P + C QCE+++H C +
Sbjct: 1085 PDNAE--------------------------------IDP--LHSCSQCEKKYHPSCSPE 1110
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ G FC C + LQNLL E + PE+ I+ ++ ETV D+
Sbjct: 1111 TEKLSSVSSQAGNHFCQQSCRLLFEELQNLLAVEKDLGPEYACRIIQCIHEDAPETVLDL 1170
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1171 DGRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFLRLDF 1220
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y IL +VSA +R+ G ++AE+P + T + +G + L IE +LS L+
Sbjct: 1221 RGFYIFILERGDEIVSAASVRIHGTKLAEMPFIGTRHMYRRQGMCRRLLDGIEMILSSLK 1280
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF + E+ + ++ F GT +LQK
Sbjct: 1281 VEKLIIPAINELVDTWTSKFGFSPL--EVSDKQEVKSINMLVFPGTGLLQK 1329
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 68/93 (73%), Positives = 83/93 (89%)
Query: 764 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
++AELPLVATS NHGKGYFQ LF+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 824 DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 228
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 58/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 281 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 334
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H GC +
Sbjct: 335 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 356
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 357 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 416
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 417 DER----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 466
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 467 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLN 526
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF +D + + + ++ F GT +LQK
Sbjct: 527 VEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQEVKSTSMLVFPGTGLLQK 575
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 174/656 (26%), Positives = 267/656 (40%), Gaps = 109/656 (16%)
Query: 258 IIRDGGILCSCSLCNGCRV-------IPPSKFEIHACKQYRRASQYICFENGKSLLEVLR 310
I++D L S L G RV + +FE H+ + + +I ++G SL V++
Sbjct: 364 ILKDVRGLLSTGLLEGFRVTYKKNEIMNACEFEQHSGESSNNQNNHIFLDSGISLYMVIQ 423
Query: 311 ACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPI--------------TCVGKTGP 356
+ L ML + +S P + + K +F + T +
Sbjct: 424 GLKYTKLDMLGDVIGKVISLPPNMIQYE--KWKASFQLEKDYFDDAPSDPCSTQSSQESN 481
Query: 357 GPLCNSCVKSKKPQGTMTYTTGIRISSSRP---GLIANSTPVTSVHKSSQSQRQRKITKK 413
L +S S ++ + R S R G STP+ S +++I+
Sbjct: 482 IALTDSLKDSTSNASSILNWSSFRRRSDRQFKRGGTETSTPILS------RSPEKEISDL 535
Query: 414 SKKTVLISK--PFENAS--------------PPLSFPNKSRWNITPK--DQRLHKLVFDE 455
S T + S+ P EN + P + + S+ PK D LH ++F E
Sbjct: 536 STSTSMKSEETPSENTAGLLTTDVTVIQDPPPDHNVDSNSKDLGQPKVRDNTLHPMLFKE 595
Query: 456 SGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHADGGNLLPCDGC 512
GLPD T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H G
Sbjct: 596 GGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMG-------- 647
Query: 513 PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV-SGVDSVEQITKRC 571
+ SI D + + + + L + N E + SG T R
Sbjct: 648 -----RRRQPYRSIYTSDGLTLH-ELALKLQDGLSSNVNIDELPTLTSGSGKEYSTTSRP 701
Query: 572 I-----RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC----LKKHK 622
I R ++ + C +CR I+ C+QCER HV C L+K K
Sbjct: 702 IIVPLKRTLQERVLTVESCYMCRKPHTVLGVISVDMIVFCNQCERALHVKCYNNGLQKPK 761
Query: 623 --MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ L E + + CC C + + L +KK +
Sbjct: 762 APLKVLGEYTQFNFMCCEKCQLLRASLHE---------------GLKKREDIAFLRRIRY 806
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFG 739
++ W+LL+G + + Q + IF D F D I +MV ++ G+ +F
Sbjct: 807 NICWQLLNGTNMRSDVQ---HQVIEIFKDAFAETAPQ--DIDDIRNMVNSKDTTGEKDFR 861
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G+YCA+LT ++ VVSA IL+V +EVAEL L+AT KGYF LL + IE L V
Sbjct: 862 GIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNV 921
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CSQLVTFKGTSMLQKRV 851
+ + P E IW++K G+ +LS +K LV F S++QK +
Sbjct: 922 RLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAHPLVMFANLSLVQKSL 972
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC----RVIPPSKFEIHACKQYR 291
LL GV V Y + + L G + GG C+C GC +V+ +FE HA +
Sbjct: 31 LLQGVPVTYR--FEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGVTSK 88
Query: 292 RASQYICFENGKSLLEVLRACRSVP 316
+ +I NG+SL E+ R VP
Sbjct: 89 NQNGHIFLRNGRSLYELFHKLREVP 113
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 155/376 (41%), Gaps = 77/376 (20%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
DGG+LL CD CP AFH C L + P+GDW+C C+
Sbjct: 512 DGGDLLLCDNCPSAFHHACVGLQATPEGDWFCPSCR------------------------ 547
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS-----GFGPRTILLCDQCEREFHVG 616
C +C G DF + GF +TI+ CDQCERE+HVG
Sbjct: 548 -----------------------CGVCGGSDFDATAAGGGGFTDKTIIYCDQCEREYHVG 584
Query: 617 CLKKHKMADL-----------RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 665
C+++ + E + W C +C + LQ L E+
Sbjct: 585 CVRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEVFRHLQGLAAVARERSIPIPTTV 644
Query: 666 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LSQAVAIFHDCFDPIVD 716
G SL + R + + + L A+ + H+CF +++
Sbjct: 645 PTTVEGVSLSILRRRRRRPISMVATGSGCQEEEEEEDAAEHGQLCSALDVLHECFVTLIE 704
Query: 717 SISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 774
+ DL +V+ R LR F G Y L +++ G LRV G EVAELPLV T
Sbjct: 705 PRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGELITVGTLRVLGTEVAELPLVGTR 764
Query: 775 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRK 833
+ +G LL +EK+L + V+ +VLPA E +WT GF + +++ I +
Sbjct: 765 FAHRRQGMCHLLVTELEKVLRQVGVRRLVLPAVPELLPMWTASLGFHPMTRSDVMEIAAE 824
Query: 834 RCSQLVTFKGTSMLQK 849
+++F+GT+M K
Sbjct: 825 H--AILSFQGTTMCHK 838
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 177/692 (25%), Positives = 274/692 (39%), Gaps = 146/692 (21%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
V L TGLL+G SV Y + G + +C +FE HA
Sbjct: 274 VRGLLSTGLLEGFSVTY---------------KKNGKMNAC------------EFEQHAG 306
Query: 288 KQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFP 347
+ + +I ++G SL ++++A + L +L ++ P + + K +F
Sbjct: 307 QSSNNQNDHIFLDSGISLYKLIQALKYKKLHLLADLIEEQTGLPPNLIEYG--KWKASFE 364
Query: 348 ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTPVTSVHKSSQSQRQ 407
+ L ++ Q + G+ IS S S + + Q +
Sbjct: 365 VQN------DDLEDAASDHCSTQSSQDSDAGVTISMKESTSNGISNLNWSAFRRPRWQYK 418
Query: 408 RKITKKSKKTVLIS--------------------KPFENASPPL--------------SF 433
R T S +T+ S P EN + PL S
Sbjct: 419 RGGTATSTQTLSRSPEKGISGLSTGTSMKINTEETPSENTAGPLHSEVTIVQEPPRGHSV 478
Query: 434 PNKSRWNITPK--DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNS 490
KS+ + T K D LH+LVF E G+P+ T + Y G+ L +GYK G I+C CC+
Sbjct: 479 GPKSKESRTSKVRDNSLHQLVFKEGGVPELTILTYKLKHGEVLKQGYKQGTCILCDCCSE 538
Query: 491 EV--SPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH 548
EV +PS FE HA G P + Y + E LQ
Sbjct: 539 EVQFTPSHFEEHAGMGK------------------RRQPYRNIYTPEGLTLHELALKLQG 580
Query: 549 --DANAVEAGRVSGVDSVEQITKRCI---------------RIVKNLEAELSGCLLCRGC 591
++N + G D ++ R ++ + + C LC
Sbjct: 581 GLNSNGNSSANFPGGDEPPNLSSGSSRESSTTYRPSIVPLKRTLQQIADKTESCRLCGDA 640
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGC----LKKHK--MADLRELPKGKWFCCMDCSRINS 645
+ I+ C+QCER HV C L+K K + L E + +FCC C + +
Sbjct: 641 CTTIGTISEDMIVFCNQCERPCHVKCYNNGLQKQKGPLNVLAEYMQFHFFCCQKCQLLRA 700
Query: 646 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 705
L +L + EK+ + + Y V W++L+G + + Q +
Sbjct: 701 SLHEVL-NKREKIRQ-----KRSY------------VFWQILNGMNPGINVQKYIHQVIE 742
Query: 706 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSS-VVSAGILRVFGQ 763
IF F S G +I MV +++ G+ +F GMYCA+LT +S VVSA +L+V +
Sbjct: 743 IFKVAFPKTAASDFG--VIQDMVNAKDVGGEKDFRGMYCAVLTTSSKLVVSAAVLKVRTE 800
Query: 764 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
EVAEL +VAT KGYF LL IE L + V+ + E ESIW+ K GF +
Sbjct: 801 EVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMNVRLLTALVDPEMESIWSKKLGFTIL 860
Query: 824 DPE----LLSIYRKRCSQLVTFKGTSMLQKRV 851
E LL + LV F+ +++QK +
Sbjct: 861 SGEEKETLLEAH-----PLVMFEDLTLMQKSL 887
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 160/368 (43%), Gaps = 64/368 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + P GDW+C C F
Sbjct: 692 GDGGDLVCCDGCPSTFHQRCLDIRMFPLGDWHCPNCTCKF-------------------- 731
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +++++ + G T C CE+++H C+ K
Sbjct: 732 ----------CKAVIEDVTQTV----------------GANT---CKMCEKKYHKSCMPK 762
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ FC C ++ ++ + + E F + + + NS D+
Sbjct: 763 ANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHRECTNS-----DL 817
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
LSG E L+ A+ + +CF PI+D SG +++ +++Y G N F
Sbjct: 818 S-----LSGHPHIVENNSKLALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNRLNF 872
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
GG Y A+L +V++ +R G +AE+P + T + +G + LF+ +E L L+
Sbjct: 873 GGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESALQHLK 932
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGS 858
VK +++PA + +W KFGF++++ L R L+TF G +LQK + A R
Sbjct: 933 VKLLIIPATADFSHVWISKFGFRQVEDSLKK--EMRSMNLLTFPGIDVLQKELLAPRHTE 990
Query: 859 SSTDSTEC 866
S+ D T+C
Sbjct: 991 SAVD-TDC 997
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 143 bits (361), Expect = 3e-31, Method: Composition-based stats.
Identities = 74/119 (62%), Positives = 89/119 (74%), Gaps = 1/119 (0%)
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+GYFQ L+ CIE+LL F
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
L+V S+VLPAAE AES+W +KF F K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 134
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 152/351 (43%), Gaps = 59/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L +P DW C C +F Q +N +A ++
Sbjct: 982 GDGGNLICCDGCPSTFHMSCLELEELPSDDWRCANCCC-----KFCQEHSND-DAPDIAE 1035
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS LC C QCE +H C +
Sbjct: 1036 VDS-----------------------LC----------------TCSQCEENYHPVCSPE 1056
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ G FC C + LQNLL + + PEF IK + ET +
Sbjct: 1057 TENPSSVPSQAGDLFCQQSCRLLFEELQNLLAVKKDLEPEFACRIIKCIHEDVPETALAL 1116
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PI+D +G +LI ++VY G N +F
Sbjct: 1117 DERV----------ECNSKIAVALSLMDECFLPIIDQRTGINLIRNVVYNCGSNFLRLDF 1166
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y IL +VSA +R+ G + AE+P + T + +G + L IE +LS L+
Sbjct: 1167 RGFYIFILERGDEIVSAASVRIHGTKCAEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLK 1226
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF + E+ + ++ F GT +LQK
Sbjct: 1227 VQKLIIPAISELVDTWTSKFGFSPL--EVSEKQEVKSISMLVFPGTGLLQK 1275
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 106/352 (30%), Positives = 154/352 (43%), Gaps = 59/352 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + P GDW+C YC F G SG
Sbjct: 256 GDGGDLICCDGCPSTFHQSCLDIQKFPSGDWHCIYCSCKF--------------CGMFSG 301
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK- 619
+ +Q+ NL+ S +L C CE ++H C +
Sbjct: 302 --NTDQMN-------YNLDVNDSA------------------LLTCQLCEEKYHHMCTQG 334
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
+ + D P FC C + LQ LL + E F +++ E D
Sbjct: 335 EDSILDDSSSPS---FCGKTCRELFEQLQMLLGVKHELEDGFSWTLVQR-----TEVGFD 386
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
I L+G E L+ A++I +CF PIVD SG +LI +++Y G N
Sbjct: 387 IS-----LNGIPQKVECNSKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLN 441
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G + AIL ++SA +R+ G ++AE+P + T I +G + L IE L L
Sbjct: 442 YSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESALHSL 501
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E WT FGFK + E+ S R ++ F GT MLQK
Sbjct: 502 NVEKLVIPAISELMQTWTSVFGFKPL--EVSSRKEMRNMNMLVFHGTDMLQK 551
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 155/373 (41%), Gaps = 61/373 (16%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+L+ CD CP FH+ C L +P+GDW+C C
Sbjct: 360 GGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSC-------------------------- 393
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
C RI C F + C QCE ++HVGCL+K +
Sbjct: 394 --------CCRI-------------CGENRFDEYSEEDNFKFSCHQCELQYHVGCLRKQR 432
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 682
L P G FC C +I LL + +P N ++ D+DV
Sbjct: 433 HVKLETYPDGTRFCSTQCEKI---FLGLLKLLGKPIPVGVDNLTWTLLKPTISEWFDMDV 489
Query: 683 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 740
A E L+ A+ + H+CF+PI + +GRDL+ +++ G +L+ F G
Sbjct: 490 -----PDNKALTEVYSKLNIALNVMHECFEPIKEPHTGRDLVEDVIFCRGSDLKRLNFRG 544
Query: 741 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 800
Y +L N ++S +RV G++VAE+PLV T G ++L IEK L L V+
Sbjct: 545 FYIVLLERNDELISVATIRVHGEKVAEVPLVGTRSQYRRLGMCRILINEIEKKLVELGVE 604
Query: 801 SIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 859
+ LPAA W FGF K+ D E L+ + F+ T M QK + S
Sbjct: 605 RLTLPAAPSVLDTWVTSFGFSKMTDSERLTFLD---YTFLDFQDTVMCQKLLMKIPSTKS 661
Query: 860 STDSTECVSGVEV 872
S + C + + +
Sbjct: 662 SQSTVNCAASLWI 674
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/113 (63%), Positives = 88/113 (77%)
Query: 744 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 803
I+ N++VV+AG+LRVFG+EVAELPLVAT + KGYFQLLF+CIEKLLS L V SIV
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIV 138
Query: 804 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
+PAAEE E +W +KFGF+K+ PE LS Y K C Q+V FKG SMLQK V + +I
Sbjct: 139 VPAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQI 191
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 28/35 (80%)
Query: 626 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 660
L+ELPKG WFC MDC+RINS LQ LL+ AEKL +
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSD 75
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 154/352 (43%), Gaps = 65/352 (18%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
DGG+L+ CD CP FHK C L IP+G+W+C C + +++ + R
Sbjct: 630 DGGDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGICGENKFKYNVQEPKDSR---- 685
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
+L CDQCER++H+GCL+
Sbjct: 686 -----------------------------------------LLSCDQCERKYHIGCLRNK 704
Query: 622 KMADL-RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ L R+ PK WFC C I LQ LL + P+ + K+ + D+
Sbjct: 705 GVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVGPDNLTWTLWKFMDSD---SCDV 761
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
+ +GK + L AV + H+CF+P ++ +GRD+ +++ R NL F
Sbjct: 762 EAP----TGKHSK------LDLAVEVIHECFEPATETYTGRDIAEDVIFSRECNLNRLNF 811
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L N +++ +RVFG +VAE+PLV T + G ++L +EK L L
Sbjct: 812 RGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGTRFLFRRLGMCKILMDELEKQLMNLG 871
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ ++LPA W + FGF K+ D E + + F GT QK
Sbjct: 872 VERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYLD---HTFLDFPGTIKCQK 920
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 155/351 (44%), Gaps = 56/351 (15%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CD CP FH+ C + P G W+C YC + +V G
Sbjct: 647 GDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------------------SCKVCG 688
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
Q+T + + EA + +LC+ CD CE ++H C++
Sbjct: 689 -----QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQM 725
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + + FC C ++ LQ LL + F I++ SD+
Sbjct: 726 NNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDV 774
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D L + A + L+ A+ + +CF P++D SG +LI +++Y G N F
Sbjct: 775 DSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNF 834
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL + V+ A LR+ G E+AE+P + T + +G + + IE +LS L
Sbjct: 835 SGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIESVLSSLN 894
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E W FGFK +D + R R L+ F G MLQK
Sbjct: 895 VEKLVIPAISEVRDTWISVFGFKPLDE--TTKQRMRKMSLLVFPGVEMLQK 943
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 102/350 (29%), Positives = 151/350 (43%), Gaps = 63/350 (18%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP +FHK C L IP GDW+C C +R ++ G D
Sbjct: 746 GGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR------------KIDGDD 793
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
V Q+ L C QCE ++HV CL+ +
Sbjct: 794 EVGQL------------------------------------LPCIQCEHKYHVRCLE-NG 816
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDID 681
AD+ G WFC DC +I L LL + + +K +S E S
Sbjct: 817 AADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWTLVKFINPDSCEHDS--- 873
Query: 682 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFG 739
S E+ L+ A+++ H+CF+P+ +S++ RDL+ +++ R L F
Sbjct: 874 ------SKSDLLAESYSKLNLAISVMHECFEPLKESLTNRDLVEDVIFSRWSELNRLNFQ 927
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y +L N ++S +RV+G++VAE+PLV T +G +L +EK L L V
Sbjct: 928 GFYTVLLERNEELISVATVRVYGKKVAEIPLVGTRLQYRRRGMCHILIEELEKKLKQLGV 987
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
+ +VLPA WT FGF K+ S + + F+G M QK
Sbjct: 988 ERLVLPAVPSVLETWTRSFGFAKMTNLERSQFLD--YTFLDFQGAIMCQK 1035
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 155/351 (44%), Gaps = 56/351 (15%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CD CP FH+ C + P G W+C YC + +V G
Sbjct: 674 GDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------------------SCKVCG 715
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
Q+T + + EA + +LC+ CD CE ++H C++
Sbjct: 716 -----QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQM 752
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + + FC C ++ LQ LL + F I++ SD+
Sbjct: 753 NNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDV 801
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D L + A + L+ A+ + +CF P++D SG +LI +++Y G N F
Sbjct: 802 DSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNF 861
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL + V+ A LR+ G E+AE+P + T + +G + + IE +LS L
Sbjct: 862 SGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTRYMYRRQGMCRRFLSAIESVLSSLN 921
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E W FGFK +D + R R L+ F G MLQK
Sbjct: 922 VEKLVIPAISEVRDTWISVFGFKPLDE--TTKQRMRKMSLLVFPGVEMLQK 970
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 152/352 (43%), Gaps = 60/352 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L +P W C C F +H + E +
Sbjct: 1009 GDGGNLICCDGCPSTFHMSCLGLEELPSDYWCCANCSCKF----CHEHSNDGAED--TAD 1062
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE ++H C +
Sbjct: 1063 VDS--------------------------------------SLHTCSQCEEQYHEACSPE 1084
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ G FC C + LQNLL + + PE+ +++ + E V +
Sbjct: 1085 NDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLPL 1144
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1145 DTRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYSCGSNFARLDF 1194
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y IL +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 1195 RGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLN 1254
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT +FGF + D E + K S LV F GT +LQK
Sbjct: 1255 VEKLIIPAITELVDTWTSRFGFSPLEDSEKEEV--KSISMLV-FPGTGLLQK 1303
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 149/564 (26%), Positives = 214/564 (37%), Gaps = 165/564 (29%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
M K++ L + P + L ++GLLDG V Y+G + L GII++GG+LC CS C G
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLG--RGGHIMLTGIIQEGGVLCDCSSCKGV 634
Query: 275 ---------------------------------------------------RVIPPSKFE 283
+V+ S FE
Sbjct: 635 QVTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFE 694
Query: 284 IHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKA--TLQSALSSLPEEKSFACVR 341
HA R S +I ENGK L ++L + + L+SA+ + EK
Sbjct: 695 KHAGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNVMDVLKSAIGEVGGEKVQIISL 754
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANS---TPVTSV 398
I K P P V KP+ + + +P ++ ++ P+ +
Sbjct: 755 DHPLIAIQPAEKKLPQP---RLVLDTKPRVPVDLKPRMPQVDMKPRVMLDTRSRMPLDTK 811
Query: 399 HKSSQSQRQRKITKKSKKTVL--ISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDES 456
KS+ + R ++ L ++ E ASPP+ S N LHK +F
Sbjct: 812 AKSTSDVKARGGDVRATIPRLDRTTREKEAASPPVPSRESSGAN-------LHKALFLPG 864
Query: 457 GLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS------------------------EV 492
GL D EVGYY GQK L G K G GI+C CC +
Sbjct: 865 GLEDDIEVGYYVKGQKFLAGLKRGAGILCSCCQQVARNGMSSILMDSVVCGLTYEGERMI 924
Query: 493 SPSQFEAHADGG-------NLLPCDGCPRAFHKECASL---------SSIP--------- 527
S S FE HA G ++ DG R+ H SL + P
Sbjct: 925 SCSLFEQHAGWGSRRNPYTSIYLADG--RSLHDAAQSLVVEQTVKQEGNTPAKIEHLDQC 982
Query: 528 -----QGDW-YCKYCQNMFERKRFLQHDA---------------------------NAVE 554
+GD C C N + + + D+ N
Sbjct: 983 VECGDRGDLQLCTRCPNAYHQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRRSMVNRRS 1042
Query: 555 AGRVSGVDSVEQITKRCIRIVKNLEA-ELSGCLLCRGCDFSKSGFGPRTILLCDQ----- 608
G + S +++T RC R+++ EA L GC+ C+ DF+K+GFGP+T LLCDQ
Sbjct: 1043 KGAAKTLLSKDRVTGRCTRLLQVPEAVVLGGCVFCKSGDFAKTGFGPKTTLLCDQVSVGD 1102
Query: 609 -----CEREFHVGCLKKHKMADLR 627
CERE+HVGCLKKH + DL+
Sbjct: 1103 MKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 781 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 840
G+ + L IE+LL LRV+ + LPAAE AE IW ++FGF+++ E + + + +V
Sbjct: 1477 GHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMV- 1535
Query: 841 FKGTSMLQKRVPACRI 856
F G+SML+K +P I
Sbjct: 1536 FTGSSMLEKDIPPLEI 1551
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 626 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----- 680
+ELP+G+WFC DC I+S+L L+ E L + ++ + K LE D
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 681 -DVRWRLLSGKAATPETRLLLSQAVAIF 707
W+LL G+ P L++ V IF
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKALAEVVQIF 1289
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 165/390 (42%), Gaps = 88/390 (22%)
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 532
L EG + IC CN GG L+ CD CP A+HK C +L IP GDW+
Sbjct: 779 LFEGENDN---ICSVCNY------------GGELILCDQCPSAYHKNCLNLEGIPDGDWF 823
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C C+ C +C
Sbjct: 824 CPSCR-----------------------------------------------CGICGQNK 836
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 652
++ G L C QCE ++HV CL+ + D R K WFC +C R+ + LQNLL
Sbjct: 837 IEETEDG--HFLTCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLG 893
Query: 653 QEAEKLPEFHLNAIKKYAGNSLETV----SDIDVRWRLLSGKAATPETRLLLSQAVAIFH 708
+ + + KY + V SD+ V E LS A+++ H
Sbjct: 894 KPVLVGADNLTWTLVKYVNSETCGVGGAESDLVV------------ENYSKLSVALSVMH 941
Query: 709 DCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 766
+CF+P+ + S RD++ +++ + L F G Y +L N ++S +R+FG+++A
Sbjct: 942 ECFEPLHNPFSSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIA 1001
Query: 767 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE 826
E+PLV T G ++L +EK L L V+ +VLPA WT+ FGF+++
Sbjct: 1002 EVPLVGTRFQYRRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNF 1061
Query: 827 LLSIYRKRCSQLVTFKGTSMLQK---RVPA 853
S + + F+GT M QK R P+
Sbjct: 1062 ERSQFLD--YSFLDFQGTVMCQKLLTRFPS 1089
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 152/365 (41%), Gaps = 66/365 (18%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+L+ CD CP +FHK C L GDW+C C
Sbjct: 675 GGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSC-------------------------- 708
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
C C +C F + C QCER++HVGCL+K
Sbjct: 709 --------C-------------CGICGENKFDGGSEQDNVVFSCYQCERQYHVGCLRKWG 747
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDID 681
L P G WFC C +I LQ LL + + +K LE ID
Sbjct: 748 HVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLE----ID 803
Query: 682 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 739
L A E L+ A+ + H+CF+P+ + + RD++ +++ G +L F
Sbjct: 804 -----LPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRLNFQ 858
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y +L N ++S +RV+G++VAE+PL+ T G +L +EK L L V
Sbjct: 859 GFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHILMNELEKKLMELGV 918
Query: 800 KSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK---RVPACR 855
+ +VLPA + WT FGF K+ D E L + F+ T M QK ++P +
Sbjct: 919 ERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFLD---YSFLDFQDTVMCQKLLMKIPLAK 975
Query: 856 IGSSS 860
S+
Sbjct: 976 SNQST 980
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 153/352 (43%), Gaps = 60/352 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L +P W C C F +H ++ E
Sbjct: 1008 GDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCCANCSCKF----CHEHSSDGAE------ 1057
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D+ + D+S + C QCE ++H C +
Sbjct: 1058 -DTAD-------------------------VDYS--------LHTCSQCEEQYHEACSPE 1083
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
G FC C + LQNLL + + PE+ +++ + E V +
Sbjct: 1084 TDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLAL 1143
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D R E ++ A+++ +CF PI+D +G +LI ++VY G N +F
Sbjct: 1144 DKRV----------ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDF 1193
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y IL +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 1194 RGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLN 1253
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT KFGF + D E + K S LV F GT +LQK
Sbjct: 1254 VEKLIIPAITELVDTWTSKFGFSPLEDSEKQEV--KSISMLV-FPGTGLLQK 1302
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 148/356 (41%), Gaps = 70/356 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H+ C S IP G+WYC C
Sbjct: 990 GDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCL----------------------- 1026
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C SK L C QCER++HV C+
Sbjct: 1027 ------------------------CDICGEVIDSKELVTSLPALDCSQCERQYHVKCVSA 1062
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
E G WFC C I ++ + + ++ + T +I
Sbjct: 1063 K--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI 1120
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---E 737
A E + L A +I +CF PI+D +G D+IPS++Y N R
Sbjct: 1121 ----------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILY--NWRSDLHFN 1168
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G Y +L + S+VS +R+ G +AE+PLVATSK N +G + L IE++L L
Sbjct: 1169 YKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSL 1228
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 851
+V+ ++L A WT FGF++ID +KR S+ L GT +L+K +
Sbjct: 1229 KVEMLLLSAIPHLAETWTSTFGFREIDES----DKKRLSKVRLAAVPGTVLLKKDL 1280
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 132/262 (50%), Gaps = 26/262 (9%)
Query: 573 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-------LKKHKMAD 625
R ++ + C CR D P TI C+QCER HV C +KK +
Sbjct: 63 RTLQERVVQTESCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 626 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWR 685
L+E ++ CC +C + + L+ V++ E++ L I+ ++ WR
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG--VEKGEEIA--FLRQIRS------------NICWR 166
Query: 686 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCA 744
LLS A+ + +L +SQA+ IF D F D+ S D+ MVYG+N G+ +F GMYC
Sbjct: 167 LLSKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGMYCV 224
Query: 745 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 804
+LT ++ VVSA IL+V ++ AEL L+AT KGYF+LL IE L V ++
Sbjct: 225 VLTASTHVVSAAILKVRVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSLLMA 284
Query: 805 PAAEEAESIWTDKFGFKKIDPE 826
P E IW++K GF + E
Sbjct: 285 PVDPEMAQIWSEKLGFTILSAE 306
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 150/359 (41%), Gaps = 77/359 (21%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 632 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------------------------ 667
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 616
C+ CD + + G ++L C CER +H
Sbjct: 668 --------------------------TCKFCDAAVASGGKDGNSISLLSCGMCERRYHQL 701
Query: 617 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 674
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 702 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 753
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 754 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 808
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ G Y AIL ++SA LR G ++AE+P + T I +G + LF IE
Sbjct: 809 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIES 868
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 849
+ L+V+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 869 AMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 923
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 150/359 (41%), Gaps = 77/359 (21%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 632 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------------------------ 667
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 616
C+ CD + + G ++L C CER +H
Sbjct: 668 --------------------------TCKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 701
Query: 617 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 674
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 702 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 753
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 754 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 808
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ G Y AIL ++SA LR G ++AE+P + T I +G + LF IE
Sbjct: 809 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIES 868
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 849
+ L+V+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 869 AMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 923
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 156/353 (44%), Gaps = 63/353 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 1038 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 1073
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1074 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 1110
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 1111 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 1158
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 1159 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1218
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++
Sbjct: 1219 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVK 1278
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
V+ IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1279 VEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1329
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 156/353 (44%), Gaps = 63/353 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 974 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 1009
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1010 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 1046
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 1047 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 1094
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 1095 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1154
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++
Sbjct: 1155 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVK 1214
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
V+ IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1215 VEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1265
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 156/353 (44%), Gaps = 63/353 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 877 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 912
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 913 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 949
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 950 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 997
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 998 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1057
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L + ++VS +RV G VAE+PL+AT + KG +LL IEK+L ++
Sbjct: 1058 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYEKFRSKGMCRLLMNAIEKMLKSVK 1117
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
V+ IV+ A WT FGFK ++ + + +K L+ F GT +L+K +
Sbjct: 1118 VEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK--INLMVFPGTILLKKSL 1168
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 152/369 (41%), Gaps = 68/369 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H+ C IP G+WYC C
Sbjct: 979 GDGGELICCDNCPASYHQACLPCQDIPDGNWYCSSCL----------------------- 1015
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C SK L C QCER++HV C+
Sbjct: 1016 ------------------------CNICGEVITSKELRTSLPALECSQCERQYHVKCVSA 1051
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
E G WFC C +I + ++ + + ++ + T +I
Sbjct: 1052 K--VSCNEDGPGTWFCGRKCQQIYMIFRSRVGVPDHVDNDLSCTILRNNGDKKVRTAGEI 1109
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG- 739
A E + L A++I +CF PI+D +G D+IPS++Y F
Sbjct: 1110 ----------ALMAECNMKLMIALSIMEECFLPILDPRTGIDIIPSILYNWRSDFIHFNH 1159
Query: 740 -GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L + S+VS +R+ G VAE+PLVATS N +G + L IE++L L+
Sbjct: 1160 KGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVATSTENRQQGMCRRLMDYIEEMLKSLK 1219
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPACRI 856
V+ ++L A WT FGF++ID +KR S +L GT +L+K + C
Sbjct: 1220 VEMLLLSAIPHLVETWTSTFGFREIDDS----DKKRLSMVRLAAVPGTVLLKKNLCECS- 1274
Query: 857 GSSSTDSTE 865
G TD E
Sbjct: 1275 GVEDTDVAE 1283
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 154/354 (43%), Gaps = 66/354 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H+ C S IP G+WYC C G V
Sbjct: 16 GDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCLCDI--------------CGEV-- 59
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+DS E +T +L A L C QCER++HV C+
Sbjct: 60 IDSKELVT--------SLPA-----------------------LDCSQCERQYHVKCVSA 88
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
E G WFC C I ++ + + ++ + T +I
Sbjct: 89 K--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI 146
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-RNLRGQEFG 739
A E + L A +I +CF PI+D +G D+IPS++Y R+ +
Sbjct: 147 ----------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYK 196
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y +L + S+VS +R+ G +AE+PLVATSK N +G + L IE++L L+V
Sbjct: 197 GFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIEEMLKSLKV 256
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 851
+ ++L A WT FGF++ID +KR S+ L GT +L+K +
Sbjct: 257 EMLLLSAIPHLAETWTSTFGFREIDES----DKKRLSKVRLAAVPGTVLLKKDL 306
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 151/353 (42%), Gaps = 65/353 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP +H+ C + +P GDW+C C F DA G+
Sbjct: 615 GDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPNCTCKF-------CDAAVASGGK--- 664
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
G L ++L C CER +H CL
Sbjct: 665 ----------------------DGNFL--------------SLLSCSMCERRYHQLCLSD 688
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
++ FC C + LQ L + E + + I + ++T SDI
Sbjct: 689 EAQK-VQSFGSASSFCGPKCLELFEKLQKYLGVKNEIEGGYSWSLIHR-----VDTDSDI 742
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
+ + LS + ++L + +AI +CF PIVD SG +LI +++Y G N +
Sbjct: 743 NSQ---LSAQRIENNSKLAV--GLAIMDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINY 797
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL ++SA LR G ++AE+P + T I +G + LF IE + L+
Sbjct: 798 TGFYTAILERGDEIISAASLRFHGTQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLK 857
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 849
V+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 858 VEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 906
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 149/330 (45%), Gaps = 62/330 (18%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY-CQNMFERKRFLQHDANAVEAGRVSGV 561
GG+L+ CD CP AFH C L +P GDW+C C + R + Q A+ E
Sbjct: 491 GGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTCCCKICYRPKCKQECADGNE------- 543
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
L+C QCE++FH GC+K
Sbjct: 544 ---------------------------------------NNFLVCVQCEQKFHFGCVKTT 564
Query: 622 KMADLR---ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
+ + K WFC + C + L+ LL + + + +K VS
Sbjct: 565 RFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINWTLLK--------NVS 616
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE- 737
D S + + + +L + A+ + ++ F+P +D++SGR+LI +V+ R+ +
Sbjct: 617 SDDDGGDFTSNEFSQEKHKL--NAALGVLYEGFNPTIDALSGRELIKDLVFSRDSEHKRL 674
Query: 738 -FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y IL V+S +R+FGQ+VAE+ VAT + + G+G +LL +E+ L+
Sbjct: 675 NFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMCRLLMDELEEQLTR 734
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPE 826
L V +VL ++E+A + WT FGF ++ E
Sbjct: 735 LGVGRLVLHSSEDAINTWTKSFGFARMTSE 764
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 147/352 (41%), Gaps = 59/352 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CD CP FH+ C + +P G W C YC F G G
Sbjct: 94 GDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYCSCKF--------------CGMAGG 139
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL-CDQCEREFHVGCLK 619
C ++ R LL C CE ++H C+
Sbjct: 140 ----------------------------DACQMDENDAAARPALLTCCLCEEKYHHSCIP 171
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
+ + FC C ++ LQ LL + E F ++++ + SD
Sbjct: 172 AEDT--INDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRF-----DVGSD 224
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
I LSG E ++ A+ I +CF P+ D SG +LI ++VY G N
Sbjct: 225 I-----TLSGMHRKVECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLN 279
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G AIL V+SA +R+ G ++AE+P + T + +G + L IE L L
Sbjct: 280 YCGFLTAILERGDEVISAASIRIHGNQLAEMPFIGTRHMYRRQGMCRRLLGAIETALCSL 339
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E WT FGFK++ E LS + R ++V F G MLQK
Sbjct: 340 NVEKLVIPAISELRETWTSVFGFKQL--EGLSKQKMRYMKMVAFPGVDMLQK 389
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 152/352 (43%), Gaps = 61/352 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + P GDW+C C + G
Sbjct: 593 GDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNC------------------TCKYCG 634
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-LK 619
V S++ +C+G + S S I C CE++FH C L+
Sbjct: 635 VASID---------------------ICQGDNTSVS-----EISTCILCEKKFHESCNLE 668
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
L FC C + LQ L + E F + I++ T D
Sbjct: 669 MDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------TSED 719
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
DV R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N
Sbjct: 720 SDVSVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLN 776
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G Y AIL ++SA +R G ++AE+P + T I +G + LF IE L
Sbjct: 777 YSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVF 836
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
+V+ +++PA E W FGF ++P L R ++ F GT MLQK
Sbjct: 837 KVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQ--EMRLMNMLVFPGTDMLQK 886
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 64/327 (19%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP +FHK C L +P GDW+C C
Sbjct: 439 GGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSC-------------------------- 472
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
C +I G L R D S G +L C QCER++HV CL +
Sbjct: 473 --------CCKIC--------GQCLKRDSDLSMEDDG---VLDCTQCERKYHVVCLGNKR 513
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP----EFHLNAIKKYAGNSLETVS 678
L PK WFC C +I L LL +K+P +K N S
Sbjct: 514 EECLEYFPKEHWFCSKRCQQIFLGLHELL---GKKIPVGLHNLTWTLLKSIQFNDQCEAS 570
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQ 736
DI+ A E +L+ A+ + H+ FDP+ + + RDL+ +++ + L
Sbjct: 571 DIE----------ALSENYSMLNIALDMMHEFFDPVEEPHTKRDLLKDVIFSKRSELNRL 620
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y +L + +S +RV+G++VAE+PLV T G +L +EK L
Sbjct: 621 NFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGTRFQYRRLGMCCILMNVLEKKLRE 680
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKI 823
L V+ ++LPA A + W FGF K+
Sbjct: 681 LGVQRLILPAVPSALNTWIGSFGFSKL 707
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 153/357 (42%), Gaps = 71/357 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC-----KYCQNMFERKRFLQHDANAVEA 555
DGG+L+ CDGCP FH+ C + P GDW+C KYC
Sbjct: 711 GDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKYC------------------- 751
Query: 556 GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 615
GV S++ +C+G + S S I C CE++FH
Sbjct: 752 ----GVASID---------------------ICQGDNTSVS-----EISTCILCEKKFHE 781
Query: 616 GC-LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 674
C L+ L FC C + LQ L + E F + I++
Sbjct: 782 SCNLEMDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR------ 833
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
T D DV R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N
Sbjct: 834 -TSEDSDVSVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSN 889
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ G Y AIL ++SA +R G ++AE+P + T I +G + LF IE
Sbjct: 890 FYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIES 949
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
L +V+ +++PA E W FGF ++P L R ++ F GT MLQK
Sbjct: 950 ALRVFKVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQ--EMRLMNMLVFPGTDMLQK 1004
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 156/366 (42%), Gaps = 96/366 (26%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH +C L ++P+GDW+C C
Sbjct: 135 GDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC------------------------ 170
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT---ILLCDQCEREFHVGC 617
C C +S + P IL CDQCERE+H C
Sbjct: 171 --------------------------CCASC--GRSLYDPTIQTEILYCDQCEREYHSNC 202
Query: 618 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKY----AG 671
+ M + FC C +I L+ L+ V + + + + L + Y
Sbjct: 203 VPGSAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSEEN 259
Query: 672 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 731
+ LE+V+D++ R L+ A+ + +CF P++D S D++ ++Y R
Sbjct: 260 SKLESVADLNTR----------------LALALTVIQECFRPMIDPRSNIDMVSHILYNR 303
Query: 732 NLRGQE----FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 787
RG++ F G Y +L ++S +RV G AE+P + T +G + L
Sbjct: 304 --RGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLI 361
Query: 788 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKG 843
I+++L L V+++VLPA E WT FGF+K+ +L+ + +VTF G
Sbjct: 362 NVIQQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQGIQLMEL------NIVTFPG 415
Query: 844 TSMLQK 849
+S+LQK
Sbjct: 416 SSVLQK 421
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 150/359 (41%), Gaps = 77/359 (21%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 114 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNCT----------------------- 150
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 616
C+ CD + + G ++L C CER +H
Sbjct: 151 ---------------------------CKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 183
Query: 617 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 674
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 184 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 235
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 236 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 290
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ G Y AIL ++SA LR G ++AE+P + T I +G + LF IE
Sbjct: 291 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIES 350
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 849
+ L+V+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 351 AMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 405
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 150/359 (41%), Gaps = 77/359 (21%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 94 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNCT----------------------- 130
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 616
C+ CD + + G ++L C CER +H
Sbjct: 131 ---------------------------CKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 163
Query: 617 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 674
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 164 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 215
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 216 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 270
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ G Y AIL ++SA LR G ++AE+P + T I +G + LF IE
Sbjct: 271 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIES 330
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 849
+ L+V+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 331 AMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 385
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 153/366 (41%), Gaps = 71/366 (19%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+L+ CD CP +FHK C L ++P+GDW+C C
Sbjct: 930 GGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSC-------------------------- 963
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
C C +C F + C QCER+ CL+K
Sbjct: 964 --------C-------------CGICGENKFDGGSEQDNVVFSCYQCERQC---CLRKWG 999
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLE-TVSDI 680
L P G WFC C +I LQ LL + + +K LE + DI
Sbjct: 1000 HVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLEIDLPDI 1059
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
+ A E L+ A+ + H+CF+P+ + + RD++ +++ G +L F
Sbjct: 1060 E----------ALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRLNF 1109
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L N ++S +RV+G++VAE+PL+ T G +L +EK L L
Sbjct: 1110 QGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYRRLGMCHILMNELEKKLMELG 1169
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK---RVPAC 854
V+ +VLPA + WT FGF K+ D E L + F+ T M QK ++P
Sbjct: 1170 VERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFLD---YSFLDFQDTVMCQKLLMKIPLA 1226
Query: 855 RIGSSS 860
+ S+
Sbjct: 1227 KSNQST 1232
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 148/353 (41%), Gaps = 58/353 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + +P GDW+C C F G
Sbjct: 94 GDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPNCSCKF------------------CG 135
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
V S + +F + +L C C +++H C+++
Sbjct: 136 VASDK--------------------------NFQRDDTTVSKLLTCSLCVKKYHKSCMQE 169
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ FC C + LQ L + E F + I + T +D
Sbjct: 170 INTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIHR-------TDADS 222
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
D L G E L+ ++++ +CF PIVD SG +LI +++Y G N F
Sbjct: 223 DTS---LQGLPQRVECNSKLAVSLSVMDECFLPIVDRRSGINLIQNVLYNCGSNFNRLNF 279
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
GG Y IL ++SA +R G +AE+P + T + +G + LF IE L L+
Sbjct: 280 GGFYALILERGDEIISAASIRFHGTRLAEMPFIGTRHMYRRQGMCRRLFYAIESTLCSLK 339
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
V+ +++PA E WT+ FGF +D L + ++ F G MLQK++
Sbjct: 340 VEKLIIPAISELMHTWTEVFGFTTLDESLKQELKSM--NMLVFPGIDMLQKQL 390
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 156/369 (42%), Gaps = 62/369 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + P GDW+C YC F G VSG
Sbjct: 572 GDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIYCCCKF--------------CGSVSG 617
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
S Q IV L L C CE ++H C++
Sbjct: 618 --SSNQRDDNDELIVSKL-------------------------LTCQLCEEKYHRSCIEA 650
Query: 621 H--KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
+ D R++ +FC C ++ L+ LL + E + I++
Sbjct: 651 NDANTDDSRDV----FFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRRS--------- 697
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQ 736
DV + K E L+ AV+I +CF P +D SG +LI S++Y R N
Sbjct: 698 --DVGFDASQIKPQMVECNSKLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGSNFNRL 755
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
+ G AIL ++SA +R+ G ++AE+P + T + +G + L +E L
Sbjct: 756 NYSGFVTAILERGDEIISAASIRIRGNQLAEMPFIGTRYMYRRQGMCRRLLNAVEWGLGS 815
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
L V+ +V+PA E WT FGF+ ++ I + L+ F MLQK++ ++
Sbjct: 816 LNVELLVIPAISELRETWTSVFGFESLESTSKQILHNK--NLLVFPHVDMLQKKISKHKL 873
Query: 857 GSSSTDSTE 865
+ + +E
Sbjct: 874 AGQNLNPSE 882
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 152/356 (42%), Gaps = 67/356 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P W C C F H+ ++ +A +
Sbjct: 1013 GDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNCSCKF------CHEHSSDDAEDTAD 1066
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
VDS ++ C QCE + C
Sbjct: 1067 VDS--------------------------------------SLHTCSQCEEQCTEACSPD 1088
Query: 621 -HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
+A G FC C + LQNLL + + PE+ +++ E V
Sbjct: 1089 IDSIATNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLA 1148
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
+D R E ++ A+++ +CF PIVD +G +LI ++VY G N +
Sbjct: 1149 LDKRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLD 1198
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
F G Y IL +++A +R+ G ++AE+P + T + +G + L IE +LS L
Sbjct: 1199 FRGFYIIILERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSL 1258
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 849
++ +++PA E WT KFGF +D E+ S+ ++ F GT +LQK
Sbjct: 1259 NIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKSV------SMLVFPGTGLLQK 1308
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 149/351 (42%), Gaps = 59/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + +P G+W C C F G SG
Sbjct: 717 GDGGDLICCDGCPSTFHQSCLDIQMLPPGEWRCMNCTCKF--------------CGIASG 762
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ + S C+L +C+ CE+++H C K+
Sbjct: 763 TSEKD---------------DASVCVL----------------HICNLCEKKYHDSCTKE 791
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC +C ++ L+ L + E F + I + T D
Sbjct: 792 MDTLPNNINSSSLSFCGKECKELSEHLKKYLGTKHELESGFSWSLIHR-------TDDDS 844
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
+ R +S + E L+ + + +CF P++D SG +LI +++Y G N +
Sbjct: 845 EAACRGISQRV---ECNSKLAITLTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLSY 901
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL +++A +R G ++AE+P + T I +G + LF+ IE L L+
Sbjct: 902 SGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHIYRRQGMCRRLFSAIESTLCSLK 961
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E + WT FGF +D L + ++ F G MLQK
Sbjct: 962 VEKLVIPAIAEVTNTWTTVFGFTHLDKSLRQ--EMKSLNMMVFPGIDMLQK 1010
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 146/356 (41%), Gaps = 64/356 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF---ERKRFLQHDANAVEAGR 557
DGG+L+ CDGCP FH+ C + +P GDW+C C F + F+Q D V
Sbjct: 768 GDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCPNCTCKFCGIASEDFVQEDGTNVSE-- 825
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
+L C C +++H C
Sbjct: 826 ---------------------------------------------LLTCSLCAKKYHKSC 840
Query: 618 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 677
L+ + FC C + LQ L + E F + + +
Sbjct: 841 LQDVDAPCIDFNNSTPCFCGKTCRELFEQLQKYLGIKHELESGFSWSLVHRM-------- 892
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 735
DID+ L G E L+ A+++ +CF PIVD SG ++I +++Y G N
Sbjct: 893 -DIDLDMSL-QGLPQRVECNSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNR 950
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
+ G Y AIL ++SA +R G ++AE+P + T + +G + LF+ IE L
Sbjct: 951 LNYSGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRRLFSAIESALC 1010
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
L+V+ +++PA E WT FGF + L + ++ F G MLQK++
Sbjct: 1011 SLKVQKLIIPAISELTHTWTGVFGFTTLSDSLKQELKSM--NMLVFPGIDMLQKQL 1064
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 154/356 (43%), Gaps = 68/356 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H C IP G WYC C+
Sbjct: 1008 GDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR----------------------- 1044
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C SK P C QCER++H+ C+
Sbjct: 1045 ------------------------CDVCGEVVSSKEPRTPLHAFECSQCERQYHIKCISG 1080
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + P G WFC C +I + L++ + +P+ HL+ ++ L D
Sbjct: 1081 KVLCNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPD-HLD--DGFSCTILHNNGDQ 1131
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEF 738
VR + + A E + L A++I +CF PI D +G D++P ++Y N ++
Sbjct: 1132 KVR--MAADIALLAECNMKLIIALSILEECFLPIFDPRTGMDIMPLILYNWRSNFVHLDY 1189
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y +L + S++S +R+ G VAE+PL+AT N +G + + IE++L L+
Sbjct: 1190 KGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMCRRIVDYIEQMLKSLK 1249
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLVTFKGTSMLQKRV 851
V+ ++L A WT FGF+ I D + LS R L + GT +L+K +
Sbjct: 1250 VEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSKIR-----LASVPGTVLLKKDL 1300
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 152/360 (42%), Gaps = 70/360 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H++C IP G WYC
Sbjct: 1051 GDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY-------------------------- 1084
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
RC+ C +C K L C QCER++H C+
Sbjct: 1085 ---------RCL------------CDICGEVINLKELRSSLPALECAQCERQYHAKCIYG 1123
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + P WFC C +I L++ + + F ++ + T +DI
Sbjct: 1124 KLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTVLRNNGDQRVSTAADI 1182
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 736
A E + L A++I +CF PI+D+ +G D+IP ++Y N R
Sbjct: 1183 ----------AILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILY--NWRSDFVHL 1230
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
++ G Y +L + ++S +R+ G VAE+PL+AT N +G + L IE++L
Sbjct: 1231 DYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKS 1290
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPAC 854
L+V+ ++L A WT FGF ID + RK S +LV+ GT +L++ + C
Sbjct: 1291 LKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRLRLVSVPGTVLLKRNLYEC 1346
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 129 bits (324), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 152/360 (42%), Gaps = 70/360 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP ++H++C IP G WYC
Sbjct: 1000 GDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY-------------------------- 1033
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
RC+ C +C K L C QCER++H C+
Sbjct: 1034 ---------RCL------------CDICGEVINLKELRSSLPALECAQCERQYHAKCIYG 1072
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + P WFC C +I L++ + + F ++ + T +DI
Sbjct: 1073 KLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTVLRNNGDQRVSTAADI 1131
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 736
A E + L A++I +CF PI+D+ +G D+IP ++Y N R
Sbjct: 1132 ----------AILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILY--NWRSDFVHL 1179
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
++ G Y +L + ++S +R+ G VAE+PL+AT N +G + L IE++L
Sbjct: 1180 DYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKS 1239
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPAC 854
L+V+ ++L A WT FGF ID + RK S +LV+ GT +L++ + C
Sbjct: 1240 LKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRLRLVSVPGTVLLKRNLYEC 1295
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 135/288 (46%), Gaps = 33/288 (11%)
Query: 573 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADL 626
R ++ E C C + P TI+ C+QCER H+ C KK + L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 627 RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRL 686
+E + CC +C + + L+ ++K G + ++ WRL
Sbjct: 130 KEYMCFHFLCCQECQSLRARLEE---------------GLEKCVGITFLRRIRSNICWRL 174
Query: 687 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAI 745
LSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMYCA+
Sbjct: 175 LSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCAL 232
Query: 746 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 805
LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V + P
Sbjct: 233 LTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAP 292
Query: 806 AAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 849
E IW++K GF + E +L + LV FK ++QK
Sbjct: 293 VDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 335
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 147/351 (41%), Gaps = 57/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CD CP FH+ C + P G W C YC F ++G
Sbjct: 94 GDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYCSCKF---------------CGMAG 138
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D+ + D + + P +L C CE ++H C+
Sbjct: 139 GDTCQM-------------------------DENDTAAQP-ALLACCLCEEKYHHSCILA 172
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + FC C + LQ LL + E F ++++ + SDI
Sbjct: 173 ENTVN--DGYSSVSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRF-----DVGSDI 225
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 738
LSG E ++ A+ I +CF P+ D SG +LI ++VY G N +
Sbjct: 226 S-----LSGMHRKVECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNY 280
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G AIL ++SA +R+ G +AE+P + T + +G + L + IE L L
Sbjct: 281 SGFLTAILERGDEIISAASIRIHGNHLAEMPFIGTRHMYRRQGMCRRLLSAIETALCSLN 340
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +V+PA E WT FGFK ++ S + R ++V F G MLQK
Sbjct: 341 VEKLVIPAISELRETWTSVFGFKPLEGS--SKQKMRNMKMVAFPGIDMLQK 389
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 146/366 (39%), Gaps = 87/366 (23%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 965 GDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT----------------------- 1001
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C G K I C QC +H C+++
Sbjct: 1002 ------------------------CQVCGGPFSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1037
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
K+ L + WFC C I ++ + G + + D
Sbjct: 1038 EKLP-LEDQISQTWFCGKYCKEI-------------------FIGLRSHVGT--DNILDS 1075
Query: 681 DVRWRLL----SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 728
D+ W +L G+ A E + L+ A+ + +CF +VD +G D+IP ++
Sbjct: 1076 DLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVL 1135
Query: 729 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 786
Y G N ++ G Y IL ++ +RV G + AELP +ATS +G ++L
Sbjct: 1136 YNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDFRRQGMCRIL 1195
Query: 787 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTS 845
+ IEK+L VK +VL A E S W FGFK I D E ++ L+ F GTS
Sbjct: 1196 MSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHN---VNLMLFPGTS 1252
Query: 846 MLQKRV 851
+L KR+
Sbjct: 1253 LLTKRL 1258
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 145/351 (41%), Gaps = 59/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C ++ +P GDW+C C F G G
Sbjct: 506 GDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADG 551
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
++ + T +EL C LC E+++H C++
Sbjct: 552 SNAEDDTTV----------SELVTCSLC---------------------EKKYHTSCIQG 580
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + LQ + + E F + I + + SD
Sbjct: 581 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDT 635
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
VR G E+ L+ A+ + +CF IVD S +LI +++Y R N +
Sbjct: 636 SVR-----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNY 690
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL ++ A +R+ G ++AE+P + T I +G + LF IE L L+
Sbjct: 691 SGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLK 750
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT FGF + E R ++ F GT MLQK
Sbjct: 751 VEMLIIPAISELMHTWTVGFGFNPL--EESHKQELRSLNMLVFPGTDMLQK 799
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 144/355 (40%), Gaps = 65/355 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H C S +P+G WYC C
Sbjct: 989 GDGGELLCCDNCPSTYHPACLSAKELPEGSWYCHNCT----------------------- 1025
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C G K I C QC +H C+++
Sbjct: 1026 ------------------------CQICGGPVSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1061
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 679
K+ L + WFC C I L++ + E E + ++ G L +V
Sbjct: 1062 EKLP-LEDQISQTWFCGKYCKEIFIGLRSHVGTENILDSELSWSILRCNNDGQKLHSVQK 1120
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
I A E + L+ A+ + +CF +VD +G D+IP ++Y G N +
Sbjct: 1121 I----------ACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVD 1170
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G Y IL ++ +RV G + AELP +ATS +G ++L IEK+L
Sbjct: 1171 YQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRILMNIIEKMLCSF 1230
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRV 851
VK +VL A E S W FGFK I D E ++ L+ F GTS+L KR+
Sbjct: 1231 NVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQLHN---VNLMLFPGTSLLTKRL 1282
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 145/351 (41%), Gaps = 59/351 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C ++ +P GDW+C C F G G
Sbjct: 674 GDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADG 719
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
++ + T +EL C LC E+++H C++
Sbjct: 720 SNAEDDTTV----------SELVTCSLC---------------------EKKYHTSCIQG 748
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
FC C + LQ + + E F + I + + SD
Sbjct: 749 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDT 803
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 738
VR G E+ L+ A+ + +CF IVD S +LI +++Y R N +
Sbjct: 804 SVR-----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNY 858
Query: 739 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 798
G Y AIL ++ A +R+ G ++AE+P + T I +G + LF IE L L+
Sbjct: 859 SGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLK 918
Query: 799 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
V+ +++PA E WT FGF + E R ++ F GT MLQK
Sbjct: 919 VEMLIIPAISELMHTWTVGFGFNPL--EESHKQELRSLNMLVFPGTDMLQK 967
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 138/341 (40%), Gaps = 90/341 (26%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFLQHDANAVEAGR 557
GG L+ CDGCP AFH C L +P GDW+ C C F + NA E
Sbjct: 712 GGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLK----TTSTNAKEEKF 767
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
+S C QCE ++H C
Sbjct: 768 IS----------------------------------------------CKQCELKYHPSC 781
Query: 618 LKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 676
L+ D L ++ KWFC DC I +L +L+ + E
Sbjct: 782 LRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE-------------------- 821
Query: 677 VSDIDVRWRLL------------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 724
VS + WRL+ S A E +LS A+ + H+ F+P+ GRDL
Sbjct: 822 VSVEKLTWRLVQSLEPNMYGDDASKIEAAAENHCILSVALDVMHELFEPVKRPHGGRDLA 881
Query: 725 PSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 782
+++ R + F G Y +L N+ +VS +R+ G++VAE+P + T + +G
Sbjct: 882 EDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGM 941
Query: 783 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
++L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 942 CRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 982
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 147/361 (40%), Gaps = 84/361 (23%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP +FH+ C L +P+GDW+C C + L AN V+
Sbjct: 908 GGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICGQNKLSEHANIVD-------- 959
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
GP L C QCE ++HV CL+ K
Sbjct: 960 ------------------------------------GP--FLTCYQCECKYHVQCLRGTK 981
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 682
K WFC C +I LQ LL G S+ D ++
Sbjct: 982 --KFGSCSKPHWFCNKHCKQIYWGLQKLL-------------------GKSIPVGGD-NL 1019
Query: 683 RWRLLSGKAATP------------ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 730
W LL ++ E + L+ A+ + H+CF+P+ + + RD++ +++
Sbjct: 1020 TWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHECFEPVREQHTRRDIVEDVIFS 1079
Query: 731 R--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 788
R L+ F G Y +L N +++ +RV+G++VAE+PLV T G +L
Sbjct: 1080 RRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGTRFQYRRLGMCHILMN 1139
Query: 789 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQ 848
+E+ L L V+ +VLPA WT FGF K+ S + + F+ T M Q
Sbjct: 1140 ELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFLNY--TFLNFQETVMCQ 1197
Query: 849 K 849
K
Sbjct: 1198 K 1198
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 166/402 (41%), Gaps = 75/402 (18%)
Query: 466 YYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 522
YY G L EG + GI C+CC S S F+AH G N+ C F S
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGNNI--CRPAENLFLGNGKS 106
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI-----TKRCIRIVKN 577
L S CQ RK+ ++ + V +G S ++ ++ C V +
Sbjct: 107 LVS----------CQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENCNDYVCS 156
Query: 578 LEAELSGCLLCRGCD-----------------------------------FSKSG--FGP 600
+ G L+C CD F K F
Sbjct: 157 I-CHYGGDLIC--CDRCPSSFHATCLNIERVPEGDWFCPCCCCGICGDSQFDKMAEQFAD 213
Query: 601 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 660
++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 214 DSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILVGHN 269
Query: 661 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 720
IK + + V D D+ AA E LS A+ + H+CFDP+ D +
Sbjct: 270 LTCTLIKPMQYQAEDRV-DYDL--------AAMAENYSKLSVALEVMHECFDPVKDPKTK 320
Query: 721 RDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 778
RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T +
Sbjct: 321 RDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRFQHR 380
Query: 779 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 381 RLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 126 bits (317), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 164/404 (40%), Gaps = 79/404 (19%)
Query: 466 YYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 522
YY G L EG + GI C+CC S + F+AH G N+ C F S
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGNNI--CRPAENLFLGNGKS 106
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI-------TKRCIRIV 575
L S CQ RK+ + D A R +G S + ++ C V
Sbjct: 107 LVS----------CQVELMRKKIMMFDQGP--AVRAAGTGSRSKFRSLAPLGSENCNDYV 154
Query: 576 KNLEAELSGCLLCRGCD-----------------------------------FSKSG--F 598
++ G L+C CD F K F
Sbjct: 155 CSI-CHYGGDLIC--CDRCPSSFHAACLNIESVPEGDWFCPCCCCGICGDSQFDKMAEQF 211
Query: 599 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 658
++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 212 ADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILVG 267
Query: 659 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 718
IK + E D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 268 QNLTCTLIKPMQYQA-EDREDYDL--------AAMAENYSKLSVALEVMHECFDPVKDPK 318
Query: 719 SGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 776
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 319 TKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRFQ 378
Query: 777 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
+ G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 379 HRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 126 bits (316), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 146/356 (41%), Gaps = 69/356 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FH+ C + +P G+W+C C F
Sbjct: 732 GDGGDLICCDGCPSTFHQSCLDIQMLPLGEWHCPNCTCKF-------------------- 771
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C + G + K + +C+ CE+++H C K
Sbjct: 772 ------------------------CGIASG-NSEKDDASVYVLQICNLCEKKYHDSCTK- 805
Query: 621 HKMADLRELPKG-----KWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLE 675
++ LP FC +C ++ L+ L + E F + I + +S
Sbjct: 806 ----EMDNLPNNINTSSLSFCGKECKELSEHLKKYLGTKHELEAGFSWSLIHRIDEDSEA 861
Query: 676 TVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNL 733
I R E L+ A+ + +CF P++D SG +LI +++Y G N
Sbjct: 862 ACRGISQR----------VECNSKLAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNF 911
Query: 734 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 793
+ G Y A L ++++ +R G ++AE+P + T + +G + LF+ IE
Sbjct: 912 SRLNYSGFYTATLERGDEIIASASIRFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIEST 971
Query: 794 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
L L+V+ +V+PA E + WT FGF +D L + ++ F G ML K
Sbjct: 972 LCSLKVEKLVIPAIAELTNTWTTVFGFTHLDESLRQ--EMKSLNMMVFPGIDMLMK 1025
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 161/378 (42%), Gaps = 72/378 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH C S+ +P+G+WYC C
Sbjct: 952 GDGGELICCDNCPSTFHHSCLSIQELPEGNWYCLNC------------------------ 987
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
T R + N E E+S L C QCE+++H CLK+
Sbjct: 988 -------TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQ 1024
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVSD 679
+ E WFC C +I + LQ+ L ++A G S +
Sbjct: 1025 RDINSGVE--SHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWTLLRC 1070
Query: 680 IDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRG 735
I ++LS A E L A+ I +CF +VD +G D+IP +VY +
Sbjct: 1071 IHYDQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPR 1130
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
+F G Y IL + ++ +RV G E+AE+PL+AT +G + L IE++L
Sbjct: 1131 LDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLM 1190
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
+VK +V+ A WT+ FGF ++ E K L+ F GT +L+K A
Sbjct: 1191 SFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHKF--NLMVFPGTVLLKK---ALY 1245
Query: 856 IGSSSTDSTECV-SGVEV 872
+ +T++T + SGV++
Sbjct: 1246 VSGQTTETTVGIHSGVQL 1263
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 163/382 (42%), Gaps = 80/382 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH C S+ +P+G+WYC C
Sbjct: 952 GDGGELICCDNCPSTFHHSCLSIQELPEGNWYCLNC------------------------ 987
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
T R + N E E+S L C QCE+++H CLK+
Sbjct: 988 -------TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQ 1024
Query: 621 HKMADLRELPKGK----WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLE 675
R++ G WFC C +I + LQ+ L ++A G S
Sbjct: 1025 ------RDIDSGVESHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWT 1066
Query: 676 TVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--R 731
+ I ++LS A E L A+ I +CF +VD +G D+IP +VY
Sbjct: 1067 LLRCIHYDQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKS 1126
Query: 732 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 791
+ +F G Y IL + ++ +RV G E+AE+PL+AT +G + L IE
Sbjct: 1127 SFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIE 1186
Query: 792 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
++L +VK +V+ A WT+ FGF ++ E K L+ F GT +L+K
Sbjct: 1187 EMLMSFKVKKLVIAAIPSLVETWTEGFGFVTVENEEKQSLHKF--NLMVFPGTVLLKK-- 1242
Query: 852 PACRIGSSSTDSTECV-SGVEV 872
A + +T++T + SGV++
Sbjct: 1243 -ALYVSGQTTETTVGIHSGVQL 1263
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 151/367 (41%), Gaps = 89/367 (24%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
D G LL CD CP FH C L S PQGDW+C C
Sbjct: 419 DCGELLMCDRCPSMFHHACVGLESTPQGDWFCPACT------------------------ 454
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDF-------SKSGFGP-RTILLCDQCEREF 613
C +C D + GF R ++ C+QC RE+
Sbjct: 455 -----------------------CAICGSSDLDDPPATTTTQGFSSDRMVISCEQCRREY 491
Query: 614 HVGCLKK------HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 667
HVGC+++ + AD +G W C CS+I L+ L V +A
Sbjct: 492 HVGCMRERDNGLWYPEAD----GEGPWLCSEACSKIYLRLEELAVVQAP--------CRS 539
Query: 668 KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 727
+G SL V R + + E L A+ + +CF +++ + DL +
Sbjct: 540 VASGLSL-------VVLRRGAARDGEEEEHAKLCMALDVLRECFVTLIEPRTQTDLTADI 592
Query: 728 VYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 785
V+ LR +F G Y L +++ LRV+G+EVAE+PLV T +G +L
Sbjct: 593 VFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFARRRQGMCRL 652
Query: 786 LFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRKRCSQ--LVTFK 842
L I+KLL + V+ +VLPA E + WT FGF+ E+ R+ + ++ F+
Sbjct: 653 LMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFR----EMGQADRQDVAHHAILRFQ 708
Query: 843 GTSMLQK 849
GT M K
Sbjct: 709 GTIMCHK 715
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 143/358 (39%), Gaps = 72/358 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
+GG L+ CD CP FH C S IP GDWYC C
Sbjct: 805 GEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----------------------- 841
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C K L C QCE ++H CL+
Sbjct: 842 ------------------------CRICGNLVIDKDTLDAHDSLQCSQCEHKYHEKCLED 877
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLETV 677
+ L WFC C + S LQ+ L+ Q A+ + L I + + +
Sbjct: 878 RDKQEGAILDT--WFCGQSCQEVYSGLQSQVGLVNQVADGISWTLLRCI--HDDQKVHSA 933
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 735
+W L T L+ A+ I +CF + D +G LIP ++Y G
Sbjct: 934 -----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHLIPQVLYNWGSEFAR 983
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
F G Y +L + ++S +RV G VAE+PL+AT +G +LL IE++L
Sbjct: 984 LNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVTAIEQVLI 1043
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 851
+V+ +V+ A + WT FGF +D I R+R ++ L+ F GT +L K +
Sbjct: 1044 SFKVEKLVISAIPDLVETWTKGFGFIPVD----DIERQRLNKINLMVFPGTVLLVKSL 1097
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 142/350 (40%), Gaps = 86/350 (24%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP +FHK C + +P GDW+C C ++ G +
Sbjct: 708 GGELILCDHCPSSFHKRCLGMKDVPDGDWFCPSC------------------CCKICGQN 749
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
+++ TK I V N C QCE ++H+ CL
Sbjct: 750 KLKKDTKDFIDGVLN----------------------------CTQCEHQYHIMCLSNSW 781
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 682
++ PK FC C + + + KL F + +ET S + +
Sbjct: 782 TDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDAFDDETL-------VETYSKLKI 827
Query: 683 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 740
A+ + H+CF+PI + +GRDL+ +++ G L F G
Sbjct: 828 --------------------ALDVVHECFEPIEEPRTGRDLMKDVIFSNGSELNRLNFQG 867
Query: 741 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVK 800
Y +L N +VS +R+ G +VAE+PLV T G ++L +EK L L V+
Sbjct: 868 FYTILLEKNDELVSVATVRIHGDKVAEIPLVGTRFQFRQLGMCRILMDVLEKKLMELGVQ 927
Query: 801 SIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
+VLPA + WT FGF K+ D E L + F+ T M QK
Sbjct: 928 RLVLPAVPGVLNTWTGSFGFSKMTDSERLQFVD---YTFLDFQDTVMCQK 974
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 144/359 (40%), Gaps = 74/359 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
+GG L+ CD CP FH C S IP GDWYC C
Sbjct: 805 GEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----------------------- 841
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C K L C QCE ++H CL+
Sbjct: 842 ------------------------CRICGNLVIDKDTSDAHDSLQCSQCEHKYHEKCLED 877
Query: 621 HKMADLRELP-KGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLET 676
D +E+ WFC C + S LQ L+ Q A+ + L I + + +
Sbjct: 878 R---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISWTLLRCI--HDDQKVHS 932
Query: 677 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 734
+W L T L+ A+ I +CF + D +G +IP ++Y G
Sbjct: 933 A-----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHMIPQVLYNWGSEFA 982
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
F G Y +L ++S +RV G VAE+PL+AT +G +LL + IE++L
Sbjct: 983 RLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQYRRQGMCRLLVSAIEQML 1042
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQKRV 851
+V+ +V+ A + WT FGF +D I R+R ++ L+ F GT +L K +
Sbjct: 1043 ISFKVEKLVVSAIPDLVETWTKGFGFITVD----DIERQRLNKINLMVFPGTVLLVKSL 1097
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 163/406 (40%), Gaps = 91/406 (22%)
Query: 472 KLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDW 531
KL G K+ C C ADGG LL CD CP FH C ++ +P+G W
Sbjct: 502 KLRSGEKDSSDDACGVC------------ADGGELLCCDSCPSTFHPACLAMK-VPEGLW 548
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C YC+ C+LC
Sbjct: 549 ACHYCR-----------------------------------------------CVLCMAN 561
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 651
D + + C C ++H C + +++ R +C C ++++ L +++
Sbjct: 562 DD-------QGLSRCQHCTLKYHEIC--RPSLSNGR---GNGAYCSETCKKVSAQLSDMI 609
Query: 652 VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 711
F +K + + + DV E + L+ A+ + ++CF
Sbjct: 610 GITNHTEDGFSWALLKIQKDEPVSSQNSPDVL-----------ECNVKLAVALGVLNECF 658
Query: 712 DPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 769
+P+ D + D++ VY G + + G Y +L N ++SA +LR+ G +VAE+P
Sbjct: 659 NPVKDRRTKIDMLHQAVYSLGSEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMP 718
Query: 770 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 829
T +G + L +E++L+ ++V+ +V+PA W F FK +DPEL
Sbjct: 719 FAGTLPAYRKQGMMRRLVNAVEQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPELKE 778
Query: 830 IYRKRCSQLVTFKGTSMLQKRVPACRIGSSS----TDSTECVSGVE 871
R+R LV GT++LQK V A SS T++ SG E
Sbjct: 779 EIRRR--SLVVITGTTLLQKPVVAAPPSPSSLHKQTEAAAAKSGAE 822
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 144/368 (39%), Gaps = 91/368 (24%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CD C FH +C + +P GDWYC+ C
Sbjct: 741 GDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSC------------------------ 775
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCEREFHVGCL 618
LCR C F K P +L C QC R++H C
Sbjct: 776 --------------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCS 809
Query: 619 KKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 676
+P FC C +I L LL IK +
Sbjct: 810 SGTGTDFDCTIPGTSIDCFCSPGCRKIYKRLNKLL-------------GIKNHM------ 850
Query: 677 VSDIDVRWRLL----SGKAATPETRLLLSQ-------AVAIFHDCFDPIVDSISGRDLIP 725
+ W L+ + +A P+ + ++Q A + +CF P +D SG ++I
Sbjct: 851 --EAGFSWSLVHCFPNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHIDERSGINMIH 908
Query: 726 SMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 783
++ Y G + +F G Y IL V+SA +R+ G ++AE+P + T + +G
Sbjct: 909 NVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGMYRHQGML 968
Query: 784 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKG 843
+ L IE L L V+ +V+ A E E+ WT FGFK + P R + L+ G
Sbjct: 969 RRLLNGIESALCSLNVQKLVVSAVTEMENTWTTVFGFKPVQPS--KKQRIKSLNLLIMNG 1026
Query: 844 TSMLQKRV 851
T +L+KR+
Sbjct: 1027 TGLLEKRL 1034
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 155/354 (43%), Gaps = 72/354 (20%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 341 GGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC-------------------------- 374
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
K C ++ + + ++S F S + C QCE+ H+GC+K +
Sbjct: 375 ----CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIGCVKSIQ 416
Query: 623 MADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + + + + WFC C I+ LQNLL ++ +P G++ E ++
Sbjct: 417 VLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDARENLT-- 461
Query: 681 DVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-- 736
W L+ + R L++A+ + H F P+ D I+ DLI + + +
Sbjct: 462 ---WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKRSESKRL 518
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L +E L
Sbjct: 519 NFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLNELEHQLIE 578
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
+ VK + LPA EA + WT FGF K+ D + L + + + F+ T QK
Sbjct: 579 MGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTVRCQK 629
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 155/354 (43%), Gaps = 72/354 (20%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 364 GGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC-------------------------- 397
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
K C ++ + + ++S F S + C QCE+ H+GC+K +
Sbjct: 398 ----CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIGCVKSIQ 439
Query: 623 MADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + + + + WFC C I+ LQNLL ++ +P G++ E ++
Sbjct: 440 VLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDARENLT-- 484
Query: 681 DVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-- 736
W L+ + R L++A+ + H F P+ D I+ DLI + + +
Sbjct: 485 ---WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKRSESKRL 541
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L +E L
Sbjct: 542 NFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLNELEHQLIE 601
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
+ VK + LPA EA + WT FGF K+ D + L + + + F+ T QK
Sbjct: 602 MGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTVRCQK 652
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 150/364 (41%), Gaps = 68/364 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 559
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 619
LS C LC E ++H C+
Sbjct: 713 ----------------------LSSCRLC---------------------EEKYHQACIN 729
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVS 678
+ FC C + LQ L + LPE F + ++++ S V+
Sbjct: 730 QDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVA 784
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 736
D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 785 DCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRL 836
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
+F A+L +++ +R+ G ++AE+P + T + +G + L IE L
Sbjct: 837 DFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALGS 896
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
L+V +V+PA E WT FGF + D E +I + L+ F G ML K + +
Sbjct: 897 LKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKEK 953
Query: 856 IGSS 859
I S
Sbjct: 954 ITDS 957
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 130/329 (39%), Gaps = 92/329 (27%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFLQHDANAVEAGR 557
GG L+ CDGCP AFH C L +P GDW+ C C F + NA E
Sbjct: 703 GGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT----STNAKEEKF 758
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
+S C QCE ++H C
Sbjct: 759 IS----------------------------------------------CKQCELKYHPSC 772
Query: 618 LKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 676
L+ D L ++ KWFC DC S+ N+ +A K+
Sbjct: 773 LRYDGACDSLDKILGEKWFCSKDCE--ESLEPNMYGDDASKI------------------ 812
Query: 677 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLR 734
A E +LS A+ + H+ F+P+ GRDL +++ R +
Sbjct: 813 --------------EAAAENHCILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFK 858
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
F G Y +L N+ +VS +R+ G++VAE+P + T + +G ++L +EK+L
Sbjct: 859 RLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLINELEKVL 918
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
L V+ +VLPA + W + FGF K+
Sbjct: 919 IDLGVERLVLPAVPCVLNTWINSFGFTKM 947
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 154/365 (42%), Gaps = 60/365 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 559
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCL 618
LS C LC C S P T+ C + G +
Sbjct: 713 ----------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTV 744
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 677
+ D FC C + LQ L + LPE F + ++++ S V
Sbjct: 745 PGERSTDS--------FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EV 793
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 735
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 794 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHR 845
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
+F A+L +++ +R+ G ++AE+P + T + +G + L IE L
Sbjct: 846 LDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALG 905
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 854
L+V +V+PA E WT FGF + D E +I + L+ F G ML K +
Sbjct: 906 SLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKE 962
Query: 855 RIGSS 859
+I S
Sbjct: 963 KITDS 967
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 151/359 (42%), Gaps = 79/359 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
DGG LL CD CP FH C ++ +PQG W C YC
Sbjct: 461 DGGELLCCDSCPSTFHPACLAMK-VPQGWWACHYC------------------------- 494
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
RC+ + N + LS C C ++H C ++
Sbjct: 495 --------RCVLCMANDDQGLS---------------------TCQHCSLKYHEVC-RRP 524
Query: 622 KMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSD 679
+++ R + +C C ++++ L +++ E + L I+K E VS
Sbjct: 525 SLSNGRGIGA---YCSETCKKVSARLSDMVGVTNHTEDGFSWALLKIQKD-----EAVSS 576
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
D AA E + L+ A+ + ++CF+P D + D++ VY G +
Sbjct: 577 QDT--------AAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAVYSLGSEFKRVS 628
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G Y +L + ++A +LR+ G +VAE+P AT +G + L +E++L+ +
Sbjct: 629 YEGFYTMVLDKDGETIAAALLRIHGTKVAEMPFAATLPAYRKQGMMRRLVNAVEQVLASV 688
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKK-IDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
+V +V+PA WT F F+ +DPE R+R LV GT++L K V A R
Sbjct: 689 QVDKLVIPAIAALVDTWTRSFSFRPLLDPESREEIRRR--SLVVIAGTTLLHKPVAAAR 745
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 145/594 (24%), Positives = 214/594 (36%), Gaps = 118/594 (19%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPP 279
L+ V L TGLL+G V YM K + + G I G C CS CN ++
Sbjct: 62 LDSDLRDVRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNA 117
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
+FE H + + +I + G SL V+ A + L ML ++ + P +
Sbjct: 118 CEFEEHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN- 176
Query: 340 VRCKGTFP-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANSTP 394
+ K +F + V G C + G M Y+ + S I+N
Sbjct: 177 -KWKASFQKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNW 231
Query: 395 VTSVHKSSQSQRQRKI-------------------TKKSKKTVLISKPFENASPPLSF-- 433
S +S + RQ T S+K EN PLS
Sbjct: 232 SASKRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDG 291
Query: 434 ------------PNKSRWNIT------------------------------PKDQRLHKL 451
N S+ + T +D LH L
Sbjct: 292 VKSDSPLPTAVTTNHSKHDSTNLGLSLSSPVKITQRPLRNCSIDSKSKESKTRDTTLHPL 351
Query: 452 VFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGG------ 504
+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS FE HA G
Sbjct: 352 IFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPY 411
Query: 505 -NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
N+ +G + HK L + + N H+ + GR S
Sbjct: 412 HNIYTLEGL--SLHKLALQLQDHLNPNGF----DNASVSSVSDYHNLTSSGCGREPSTTS 465
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL----- 618
+ + R ++ E C C + P TI+ C+QCER H+ C
Sbjct: 466 GPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVV 523
Query: 619 -KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 677
KK + L+E + CC +C + + L+ ++K G +
Sbjct: 524 KKKVPLEILKEYMCFHFLCCQECQSLRARLE---------------EGLEKCVGITFLRR 568
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 731
++ WRLLSG A+ + +L + Q + IF D F D S D+I MV G+
Sbjct: 569 IRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGK 620
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 129/288 (44%), Gaps = 36/288 (12%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL TGLL+G V + L GI +D G++C+C +C G +V+ S
Sbjct: 308 LTKPPRNAKELMATGLLEGHYVH----CSCRGEQLTGIFQDMGVVCNCRICKGTQVVSIS 363
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL---PMLKATLQSALSSLPE-EKS 336
FE H+ S I ENGK+L ++L A + +L+A LQ A+ + K
Sbjct: 364 AFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRA-LQHAIGEIQGISKE 422
Query: 337 FACVRC---KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSSRPGLIANST 393
CV+C +G I+C G CV K P + + +R
Sbjct: 423 MTCVKCGKHEGGEFISCKGAKCSAAYHAECVGVKSPHLEDWFCAKCEKTQAR-----KPQ 477
Query: 394 PVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVF 453
P+ V +S + K K+ +I++ + +D LHK +F
Sbjct: 478 PLLKVKRSPAGTDKEDARSKGKEQTMIAR-------------------SARDAHLHKALF 518
Query: 454 DESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 501
GL DGTE+GYYA Q +L+G K G GI C CCN E++ S FE HA
Sbjct: 519 LPGGLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHA 566
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 158/383 (41%), Gaps = 63/383 (16%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 559
DGG+L+ CDGCP FH+ C + P G WYC C F E+ HD +A+ +
Sbjct: 754 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEKVEAAIHDTSALHS---- 809
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCL 618
LS C LC C S P T+ C + G +
Sbjct: 810 ----------------------LSSCRLCEEKC----SNHYPHTLADHQACINQ--DGTV 841
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 677
+ D FC C + LQ LL+ LPE F + ++++ S V
Sbjct: 842 PGERSTDS--------FCGKYCQELFEELQ-LLIGVKHPLPEGFSWSFLRRFELPS--EV 890
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRG 735
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY N
Sbjct: 891 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFWSNFHR 942
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
F A+L +++ +R+ G ++AE+P + T + +G + L IE L
Sbjct: 943 LNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESALG 1002
Query: 796 FLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK-----R 850
L+V +V+PA E WT FGF ++ + L+ F G ML K +
Sbjct: 1003 SLKVAKLVIPAVPELIDTWTSGFGFTPVNESEKKTIKNL--NLLVFPGVDMLGKSLVKEQ 1060
Query: 851 VPACRIGSSSTDSTECVSGVEVG 873
+ + SS+ DS + VE G
Sbjct: 1061 ITDSIVSSSNVDSCLKLRNVEEG 1083
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 152/367 (41%), Gaps = 86/367 (23%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+LL CD CP FH C + +P GDW+C+ C F
Sbjct: 316 GDGGDLLCCDNCPSTFHLACLGIK-MPSGDWHCRSCICRF-------------------- 354
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
S ++IT AEL CL QC R++H C
Sbjct: 355 CGSTQEITTS--------SAELLSCL---------------------QCSRKYHQVCAPG 385
Query: 621 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
++ FC C +I L+ LL + NAI+ AG S V
Sbjct: 386 TMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------NAIE--AGFSWSLV- 433
Query: 679 DIDVRWRLLSGK-AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY- 729
R K AA P+ + L + A ++ +CF P +D SG ++I +++Y
Sbjct: 434 ------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYN 487
Query: 730 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 788
G + F Y IL V+SA +R+ G ++AE+P + T I +G L
Sbjct: 488 CGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLN 547
Query: 789 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSM 846
IE LS L V+ +V+PA E ++ WT FGFK ++P R++ L + GT +
Sbjct: 548 AIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGL 603
Query: 847 LQKRVPA 853
L+KR+ A
Sbjct: 604 LEKRLLA 610
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 142/355 (40%), Gaps = 66/355 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+LL CD C FH C + +P GDW+C+
Sbjct: 856 GDGGDLLCCDRCTSTFHVACLGIE-MPSGDWFCR-------------------------- 888
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
CI C C + S P +L C QC R++H C +
Sbjct: 889 ---------NCI------------CKFCGSAEERTSS--PAELLSCLQCSRKYHQVCAQG 925
Query: 621 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
+ + P FC C++I L+ LL + + F + ++ +A
Sbjct: 926 IEREFVSTTPSASIDCFCSPGCTKIYKRLKRLLGLKNDLEAGFSWSLVRCFA-------- 977
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 736
D KA + A ++ +CF P +D SG ++I ++VY G +
Sbjct: 978 --DTEATSTKKKAQLVHCNSKTALAFSVLDECFLPRIDERSGINIIHNVVYNCGSDFSRL 1035
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y IL V+SA +R+ G + AE+P + T + +G L IE L
Sbjct: 1036 NFSGFYTFILERGDEVISAATVRIHGTDFAEMPFIGTRGMYRHQGMCHRLLDAIESALCS 1095
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
L V+ +V+PA E ++ W+ FGFK + P + + L+ GT +L+KR+
Sbjct: 1096 LNVRRLVIPAIPELQNTWSTVFGFKPVGP--TKKQKIKSVNLLIIHGTGLLEKRL 1148
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 141/365 (38%), Gaps = 63/365 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G WY C N R N + VS
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GNPLSEKEVST 63
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ IL C QC +H C+ +
Sbjct: 64 FSA---------------------------------------ILKCLQCGDSYHDTCIDQ 84
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 679
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKK 143
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 737
I A E L+ A+ I +CF +VD +G D+IP ++Y G N +
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVLYNKGSNFARLD 193
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
+ G Y IL ++ +RV G + AELP +ATS +G + L IE +L
Sbjct: 194 YQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRSF 253
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIG 857
V+++VL A E + W FGFK I+ R L+ F GTS+L KR+
Sbjct: 254 HVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGITAA 311
Query: 858 SSSTD 862
S D
Sbjct: 312 KSEED 316
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 154/364 (42%), Gaps = 75/364 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
ADGG+L+ C+ C H +C L IPQGDW C YC
Sbjct: 217 ADGGDLICCEKCWSTSHLKCMGLERIPQGDWICPYC------------------------ 252
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ K C + K+L+ C QC++++H CL
Sbjct: 253 ------VCKHCNKNDKDLQT-------------------------CVQCDKKYHCQCLVS 281
Query: 621 HKMADLRELPKGKWFCC-MDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 679
+K DL G+ C C + LQ+L+ + E F +++ ++L+ D
Sbjct: 282 NKELDLN--ASGETLACDSHCGEVYEKLQSLVGVKHELEGGFCWTLLQRMEPDNLD-FKD 338
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQE 737
+ + E ++ A + +CF I+D + +++ S+ Y R NL
Sbjct: 339 LHL----------ITECNSKIALAWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRIN 388
Query: 738 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 797
F G Y AIL N ++SA +RV G ++AE+P + T + G ++L +E + S +
Sbjct: 389 FRGFYTAILEKNDDIISAATIRVHGTDLAEMPFIGTRHLYRQNGMSRMLLVTLESIFSVM 448
Query: 798 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS-QLVTFKGTSMLQKRVPACRI 856
V+ +++P+ +E +W K GF I+ ++ +K + +TF LQK + +
Sbjct: 449 GVEHLIIPSVQELTEMWEGKCGFSPIED---AVSQKITNWNTLTFPSAVRLQKALLSTPA 505
Query: 857 GSSS 860
SSS
Sbjct: 506 SSSS 509
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 142/359 (39%), Gaps = 73/359 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S+ +P WYC C
Sbjct: 662 GDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHNCI----------------------- 698
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK---SGFGPRTILLCDQCEREFHVGC 617
C +C GC ++ S F I+ C QC H C
Sbjct: 699 ------------------------CRIC-GCPVTEKEISSFS--AIIKCLQCGAAHHDTC 731
Query: 618 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLET 676
++ A E+ +WFC C I L + E+ ++ G + +
Sbjct: 732 VEMGATA-FEEMDSDEWFCGTHCKEIYLGLHGCVGVESSLGDGLSWTILRCNSGGQKMHS 790
Query: 677 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 734
V I A E L+ A+ + +CF +VD+ +G ++IP ++Y G
Sbjct: 791 VQKI----------AHAIECNSKLAVALTLMEECFAQMVDTRTGINMIPHVLYNQGSKYA 840
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
+ G Y IL ++ A +RV G + AELP +AT + + KG + L IE++L
Sbjct: 841 RLNYQGFYTVILEKGEEILCAASIRVHGMKAAELPFIATCREHRRKGMCRRLINTIEEML 900
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRV 851
VK +VL A E S W FGFK I+ RK+ L+ F GTS+L K +
Sbjct: 901 KSFHVKMLVLSAIPELVSTWVSGFGFKPIE----EYERKQLDTINLMLFPGTSLLIKSL 955
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 151/367 (41%), Gaps = 86/367 (23%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+LL CD CP FH C + +P GDW+C C F
Sbjct: 738 GDGGDLLCCDNCPSTFHLACLGI-KMPSGDWHCSSCICRF-------------------- 776
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
S ++IT AEL CL QC R++H C
Sbjct: 777 CGSTQEITTSS--------AELLSCL---------------------QCSRKYHQVCAPG 807
Query: 621 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
++ FC C +I L+ LL + NAI+ AG S V
Sbjct: 808 TMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------NAIE--AGFSWSLV- 855
Query: 679 DIDVRWRLLSGK-AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY- 729
R K AA P+ + L + A ++ +CF P +D SG ++I +++Y
Sbjct: 856 ------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYN 909
Query: 730 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 788
G + F Y IL V+SA +R+ G ++AE+P + T I +G L
Sbjct: 910 CGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLN 969
Query: 789 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSM 846
IE LS L V+ +V+PA E ++ WT FGFK ++P R++ L + GT +
Sbjct: 970 AIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGL 1025
Query: 847 LQKRVPA 853
L+KR+ A
Sbjct: 1026 LEKRLLA 1032
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 151/367 (41%), Gaps = 86/367 (23%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+LL CD CP FH C + +P GDW+C C F
Sbjct: 739 GDGGDLLCCDNCPSTFHLACLGI-KMPSGDWHCSSCICRF-------------------- 777
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
S ++IT AEL CL QC R++H C
Sbjct: 778 CGSTQEITTSS--------AELLSCL---------------------QCSRKYHQVCAPG 808
Query: 621 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
++ FC C +I L+ LL + NAI+ AG S V
Sbjct: 809 TMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------NAIE--AGFSWSLV- 856
Query: 679 DIDVRWRLLSGK-AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY- 729
R K AA P+ + L + A ++ +CF P +D SG ++I +++Y
Sbjct: 857 ------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYN 910
Query: 730 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 788
G + F Y IL V+SA +R+ G ++AE+P + T I +G L
Sbjct: 911 CGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLN 970
Query: 789 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL--VTFKGTSM 846
IE LS L V+ +V+PA E ++ WT FGFK ++P R++ L + GT +
Sbjct: 971 AIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPS----KRQKIKSLNILIIHGTGL 1026
Query: 847 LQKRVPA 853
L+KR+ A
Sbjct: 1027 LEKRLLA 1033
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 135/353 (38%), Gaps = 66/353 (18%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH+ C S +P+G WYC C
Sbjct: 873 GDGGELICCDNCPSTFHQACLSTEELPEGSWYCPNCT----------------------- 909
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C K C QCE ++H C K
Sbjct: 910 ------------------------CWICGELVNDKEDINSSNAFKCSQCEHKYHDSCWKN 945
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + WFC C + LQ+ + +N I +L
Sbjct: 946 KTIG--KGGASDTWFCGGSCQAVYFGLQSRVGI---------INHIADGVCWTLLKCIHE 994
Query: 681 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 736
D + A E L+ A+ I +CF +VD +G D+IP ++Y N R +
Sbjct: 995 DQKVHSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHVLY--NWRSEFARL 1052
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y +L + ++S +R+ G VAE+PL+AT +G + L IE++L
Sbjct: 1053 NFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLIATCSNYRRQGMCRRLMTAIEEMLIS 1112
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
+V+ +V+ A + WT+ FGF + + K L+ F GT +L+K
Sbjct: 1113 FKVEKLVVSAIPDLVETWTEGFGFTPMSNDEKQSLNK--INLMVFPGTILLKK 1163
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 114 bits (284), Expect = 3e-22, Method: Composition-based stats.
Identities = 52/70 (74%), Positives = 63/70 (90%), Gaps = 1/70 (1%)
Query: 539 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 598
MF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGF
Sbjct: 1 MFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGF 59
Query: 599 GPRTILLCDQ 608
GPR I++CDQ
Sbjct: 60 GPRMIIMCDQ 69
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 146/352 (41%), Gaps = 89/352 (25%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
+GG L+ C+ CP FH EC SL +P+ W+
Sbjct: 277 EGGELVCCETCPLTFHMECVSLLEVPKDAWF----------------------------- 307
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
C R CL C + P C+QCER FH GC
Sbjct: 308 ---------CFR-----------CLCCHCGE-------PLRTQPCEQCERCFHPGCCDDA 340
Query: 622 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 681
+A G +F C +S N+ + AE + ++ + +
Sbjct: 341 ILA-------GDFFFC------SSGCWNIFQRLAEMVA-------------TVNPLGRSE 374
Query: 682 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ--EFG 739
+ W LL + LL++A+ + FDP++D + D + +MV+ R+ +F
Sbjct: 375 LSWSLLRRGRCDDK---LLAEALQVISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRLDFS 431
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y A+L + VV +LR+ G +AE+P +AT G+G + LF +E++L+ L V
Sbjct: 432 GFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGV 491
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
+ +VL AA++ E +W + F F +D +L + R LV G LQK V
Sbjct: 492 EMMVLLAAKDTEKMWKNSFEFHAMDRKLKA--RTVALGLVALNGAGFLQKSV 541
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 131/268 (48%), Gaps = 35/268 (13%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 662
++ C+QC+R FH+ CLK+ D + WFC C+R+ S L+NLL
Sbjct: 315 LMACEQCQRRFHLTCLKE----DSCIVSSRGWFCSSQCNRVFSALENLL----------- 359
Query: 663 LNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSI 718
G+ + +D D+ W L+ G+ E L AV I H F+P D
Sbjct: 360 --------GSKIAVGNDGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTNDVF 411
Query: 719 SGRDLIPSMVYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 777
SGRDL+ ++Y ++ G G G Y ++ + ++ +RV ++V E+PLVAT
Sbjct: 412 SGRDLVEELIYRKDRTG--VGRGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSY 468
Query: 778 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCS 836
G ++L +EK +S + V +VLPAA+E + WT++FGF ++ E L + +
Sbjct: 469 RRSGMCRVLMDELEKQMSQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVK---H 525
Query: 837 QLVTFKGTSMLQKRVPACRIGSSSTDST 864
++ F GT M K + R + S + +
Sbjct: 526 GMLDFVGTIMCHKFLQKERAENDSAEES 553
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%), Gaps = 1/31 (3%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGD-WY 532
GG+LL CDGCP AFH C LSS+P+ D W+
Sbjct: 265 GGDLLLCDGCPSAFHHACLGLSSLPEEDLWF 295
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 155/369 (42%), Gaps = 87/369 (23%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+LL CDGCP AFH C LSS+P+ D + F + S V+
Sbjct: 263 GGDLLLCDGCPSAFHHTCLGLSSLPEEDLW------------FCPCCCCDICGSMESPVN 310
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
S +L C C+ R FH+ CLK+
Sbjct: 311 S-----------------KLMACEQCQ---------------------RRFHLKCLKEEP 332
Query: 623 -MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 681
+ R WFC C+R++S L+NL+ G + ++ D
Sbjct: 333 GIVSCR-----GWFCSSQCNRVSSALENLI-------------------GCKIAVGNNGD 368
Query: 682 VRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE 737
+ W L+ G+ E L AV I H F+P D SGRDL+ +++ ++ G
Sbjct: 369 LVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTKDVFSGRDLVEELIFRKDRTG-- 426
Query: 738 FG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
G G Y ++ ++ +RV ++V E+PLVAT G ++L +EK +S
Sbjct: 427 VGRGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMCRVLVDELEKQMSQ 485
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCSQLVTFKGTSMLQKRVPACR 855
+ V +VLPAA+E S WT +FGF ++ E L + + ++ F GT M K + R
Sbjct: 486 MGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVK---HGMLDFVGTVMCHKFLVKER 542
Query: 856 IGSSSTDST 864
+ S + +
Sbjct: 543 AENDSAEES 551
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 144/376 (38%), Gaps = 66/376 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG L+ CD CP FH+ C +P+G WYC C
Sbjct: 226 GDGGELICCDNCPSTFHQACLCTEDLPEGSWYCPNCT----------------------- 262
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C K C QCE ++H C +
Sbjct: 263 ------------------------CWICGDLVNDKEASSSVGAYKCLQCEHKYHGACQQG 298
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
+ + L WFC C + S L + + N I G + I
Sbjct: 299 KQTHE--GLVSDAWFCSGSCQEVYSGLHSRVGIN---------NPIAD--GFCWTLLRCI 345
Query: 681 DVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 736
++LS + A E L+ A+ I +CF +VD +G D+IP +Y G +
Sbjct: 346 HEDQKVLSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHALYNWGSDFARL 405
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
F G Y +L + +VSA +RV G VAE+PL+AT +G + L IE++L
Sbjct: 406 NFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSNYRRQGMCRHLMTAIEEMLIS 465
Query: 797 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 856
+V+ +V+ A + WT FGF + + K + F GT +L+K++ +
Sbjct: 466 YKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSLNK--INFMVFPGTILLKKQLYKTKE 523
Query: 857 GSSSTDSTECVSGVEV 872
+ +D + EV
Sbjct: 524 ADTQSDWGDAAPLTEV 539
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 151/401 (37%), Gaps = 108/401 (26%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CDGCP FHK C + P GDW+C YC F R+ G
Sbjct: 924 GDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYCCCKF---------------CRLVG 968
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
S + + F +L C CE +FH+ C++
Sbjct: 969 GSSNQSVVN--------------------------DEFTMPALLTCHLCEEKFHISCVEA 1002
Query: 621 H--KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 678
+ K D K FC C ++ L+ LL + E F + I++ S
Sbjct: 1003 NGGKTDD----SKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWSFIRR---------S 1049
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 736
D+ L + + ++L + A++I ++CF P +D SG +L+ S++Y G N +
Sbjct: 1050 DVGCDLSLTNPQLVECNSKLAV--ALSIMNECFMPYIDHRSGTNLLRSILYNCGSNFKRL 1107
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE----- 791
++ G IL ++ +RV G +AE+P + T + +G + L IE
Sbjct: 1108 DYSGFITVILERGDEIICVASIRVHGNRLAEMPYIGTRYMYRRQGMCRRLLNAIESEAVY 1167
Query: 792 -----------------------------------------KLLSFLRVKSIVLPAAEEA 810
K LS L V+ +V+PA E
Sbjct: 1168 GLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDWGGPKALSSLDVELLVIPAISEL 1227
Query: 811 ESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
WT FGF+ + I L+ F +LQK++
Sbjct: 1228 RETWTSVFGFEPLKQTSKQITNNM--NLLVFPHVDLLQKKI 1266
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 144/375 (38%), Gaps = 64/375 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G WY C N R N + VS
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GNPLSEKEVST 63
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ IL C QC +H C+ +
Sbjct: 64 FSA---------------------------------------ILKCLQCGDSYHDTCIDQ 84
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 679
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKK 143
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 739
I A E L+ A+ I +CF +VD +G D+IP ++ N ++
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDYQ 191
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y IL ++ +RV G + AELP +ATS +G + L IE +L V
Sbjct: 192 GFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRSFHV 251
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 859
+++VL A E + W FGFK I+ R L+ F GTS+L KR+ S
Sbjct: 252 ETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGITAAKS 309
Query: 860 STD-STECVSGVEVG 873
D VSG+ G
Sbjct: 310 EEDKDAYNVSGLPNG 324
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 113/444 (25%), Positives = 176/444 (39%), Gaps = 112/444 (25%)
Query: 463 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 522
+V Y + +LEG+ GI C CC+ ++ S+FE HA G+ LP
Sbjct: 621 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHA--GSKLP-------------- 664
Query: 523 LSSIPQGDWYCKYCQNMFERKRFLQHDA-NAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 581
P + Y + ++ + Q DA N E G SV+ + +
Sbjct: 665 ---QPYQNIYLESGVSLLQ----CQIDAWNRQEHAEKIGFHSVD---------IDGNDPN 708
Query: 582 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----- 636
C +C G G ++ CD C FH CL D++ LP G+W C
Sbjct: 709 DDTCGIC--------GDGG-DLICCDGCPSTFHQSCL------DIQMLPPGEWHCPNCTC 753
Query: 637 ----------------------CMDCSR----------------INSVLQNLLVQEAEKL 658
C+ C + INS + +E ++L
Sbjct: 754 KFCGIASETSDKDDASVNVLRTCILCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKEL 813
Query: 659 PEFHLNAIKKYAGNSLETVSDIDVRWRLLS-----------GKAATPETRLLLSQAVAIF 707
E+ +KKY G E + W L+ G E L+ A+ +
Sbjct: 814 SEY----LKKYLGTKHEL--EAGFSWCLIHRSDEDSEAACRGLTQRVECNSKLAIALTVM 867
Query: 708 HDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 765
+CF P++D SG +LI +++Y G N + G Y AIL +++A +R G ++
Sbjct: 868 DECFLPVIDRRSGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKI 927
Query: 766 AELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP 825
AE+P + T I +G + LF+ IE L L+V+ +V+PA E WT FGF +D
Sbjct: 928 AEMPFIGTRHIYRRQGMCRRLFSAIELALCSLKVEKLVIPAVAELTHTWTTVFGFTYLDE 987
Query: 826 ELLSIYRKRCSQLVTFKGTSMLQK 849
L + ++ F G MLQK
Sbjct: 988 SLRQ--EMKSLNMMVFPGIDMLQK 1009
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 140/367 (38%), Gaps = 92/367 (25%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG+L+ CD CP FH C L +P +W+C C
Sbjct: 152 GGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC-------------------------- 185
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
C C LC D S S C QC R +HV CL K
Sbjct: 186 --------C-------------CELCGKGDSSTSTNA------CLQCARAYHVHCLTKDG 218
Query: 623 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 682
+ P + FC C + + L LL G S T D +
Sbjct: 219 CLLPTDYP-SENFCSKSCYELCAQLHQLL-------------------GISNPTSVD-GL 257
Query: 683 RWRLLSGKAAT--------PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 732
W L T + Q + + H+CF + + + +D++ ++Y G
Sbjct: 258 TWTLTRSSKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDMVTDLIYNSGSK 317
Query: 733 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
+ F G Y +L +VS LR+ G + AE+PLVAT +G +LL + K
Sbjct: 318 FKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQGMCRLLMQEVLK 377
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLVTFKGTSMLQK 849
LL+ RV+ ++LPA + +W FGF ++ + + LS Y V F+GT MLQ
Sbjct: 378 LLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQLSGY-----SFVGFQGTMMLQN 432
Query: 850 RVPACRI 856
+ + RI
Sbjct: 433 VLTSSRI 439
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 142/375 (37%), Gaps = 64/375 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G W YC H+ G
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSW---YC-----------HNCTCRSCGNPLS 57
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
V + IL C QC +H C+ +
Sbjct: 58 EKEVSTFS---------------------------------AILKCLQCGDSYHDTCIDQ 84
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 679
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGRKLHSSKK 143
Query: 680 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 739
I A E L+ A+ I +CF +VD +G D+IP ++ N ++
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDYQ 191
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y IL ++ +RV G + AELP +ATS +G + L IE +L V
Sbjct: 192 GFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMMLRSFHV 251
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 859
+++VL A E + W FGFK I+ R L+ F GTS+L KR+ S
Sbjct: 252 ETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKRLDGITAAKS 309
Query: 860 STD-STECVSGVEVG 873
D VSG+ G
Sbjct: 310 EEDKDAYNVSGLPNG 324
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 128/273 (46%), Gaps = 46/273 (16%)
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNL 650
D + GF IL CDQC R+FHV C + + L R+ WFC C + S LQ+L
Sbjct: 40 DVQQDGF----ILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHL 95
Query: 651 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LS 701
L G S+ +D ++ W LL K P+ L L
Sbjct: 96 L-------------------GKSVPVGTD-NLTWTLL--KRVEPDCFDLEVLSANNSKLK 133
Query: 702 QAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 759
A+ + H+CF+P D+ +G+DL+ +++ G NL F G Y +L N+ + + +R
Sbjct: 134 LALEVMHECFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVR 193
Query: 760 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 819
VFG +VAE+P VAT G ++L +E+ L L V+ +VLPAA W FG
Sbjct: 194 VFGDKVAEVPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFG 253
Query: 820 FKKI---DPELLSIYRKRCSQLVTFKGTSMLQK 849
F + D + S Y ++ F+GT + QK
Sbjct: 254 FSVMTYSDKKAHSDY-----PILFFQGTVLCQK 281
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 147/380 (38%), Gaps = 74/380 (19%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G W YC H+ G
Sbjct: 1032 GDGGELLCCDNCPSTYHQTCLSDQELPEGSW---YC-----------HNCTCRSCGNPLS 1077
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
E E+S FS IL C QC +H C+
Sbjct: 1078 ------------------EKEVST--------FS-------AILKCLQCGDSYHDTCI-- 1102
Query: 621 HKMADLRELPKGK-----WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 674
D LP G WFC C I L N + E E + +K G L
Sbjct: 1103 ----DQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKL 1158
Query: 675 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 734
+ I A E L+ A+ I +CF +VD +G D+IP ++ N
Sbjct: 1159 HSSKKI----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFA 1206
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
++ G Y IL ++ +RV G + AELP +ATS +G + L IE +L
Sbjct: 1207 RLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTIEMML 1266
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 854
V+++VL A E + W FGFK I+ + R L+ F GTS+L KR+
Sbjct: 1267 RSFHVETLVLSAIPELVNTWVSGFGFKPIEDN--EKKQLRNVNLMLFPGTSLLTKRLDGI 1324
Query: 855 RIGSSSTD-STECVSGVEVG 873
S D VSG+ G
Sbjct: 1325 TAAKSEEDKDAYNVSGLPNG 1344
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 141/337 (41%), Gaps = 81/337 (24%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
ADGGNL+ CD CP +H C + PQG+W C C
Sbjct: 270 ADGGNLICCDKCPSTYHISCLQMEDEPQGEWRCPAC------------------------ 305
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C C F S + C QC++++H C ++
Sbjct: 306 -----------------------ACKFCHTHAFDIS------VFTCSQCDKKYHWECFRE 336
Query: 621 HK--MADLR-ELPKGKW-FCCMDCSRINSVLQNLL---VQEAEKLPEFHLNAIKKYAGNS 673
++ + DL + P FC CS+I L+ L+ + E L L + AG
Sbjct: 337 NEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDEGLTWTLLRRMDPEAGVY 396
Query: 674 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 731
LE D R L ++ AVA+ +CF+P++D + +++ S++Y G
Sbjct: 397 LEESYD-----RTLCNSK--------IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGA 443
Query: 732 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 791
N F G Y AIL +S +R+ G ++AE+P +AT + ++ L C +
Sbjct: 444 NFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIAT------RPSYRRLGMCHK 497
Query: 792 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 828
L++ V+ +V+P+ E+ W + +GF+ I+ +++
Sbjct: 498 LLVAIESVQYLVIPSIEQRVRRWEESYGFQAIENKVM 534
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 93/171 (54%), Gaps = 12/171 (7%)
Query: 684 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 742
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 40 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 97
Query: 743 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 802
CA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V +
Sbjct: 98 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 157
Query: 803 VLPAAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 849
P E IW++K GF + E +L + LV FK ++QK
Sbjct: 158 TAPVDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 203
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 111/221 (50%), Gaps = 16/221 (7%)
Query: 632 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKA 691
G FC C + LQNLL + + PE+ +++ + E V +D R
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLALDKRV------- 76
Query: 692 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVN 749
E ++ A+++ +CF PI+D +G +LI ++VY G N +F G Y IL
Sbjct: 77 ---ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERG 133
Query: 750 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 809
+++A +R+ G ++AE+P + T + +G + L IE +LS L V+ +++PA E
Sbjct: 134 DEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITE 193
Query: 810 AESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
WT KFGF + D E + K S LV F GT +LQK
Sbjct: 194 LVDTWTSKFGFSPLEDSEKQEV--KSISMLV-FPGTGLLQK 231
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 113/458 (24%), Positives = 195/458 (42%), Gaps = 102/458 (22%)
Query: 448 LHKLVFDESGLPDGTEVGYY-ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNL 506
+ + D L +G V Y ++ G + GI+C CCN S + F+ HA
Sbjct: 3 IFSWLIDGEILSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGD--- 59
Query: 507 LPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE-RKRFLQHDANAVEAGRVSGVDSVE 565
H+ A+L ++ G +++ E +K+ L+ A +G +V+
Sbjct: 60 --------EVHRT-AALLTLEDG-------RSVLECQKQALKKIEQAKCDEPANGQLTVD 103
Query: 566 QITKRCIR------IVKNLEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
+ + + +V ++E + + C +C G G + ++ CD C FH+ C
Sbjct: 104 ETALKAMELKESELVVDDVEMDENDDTCAVC--------GDGGQ-LVCCDHCPSTFHLKC 154
Query: 618 LKKHKMADLRELPKGKWFC----CMDCSR--INSVLQN-LLVQEAEKLP----------- 659
L+ L +P+G WFC C C R + +Q +L + +P
Sbjct: 155 LR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHSNCVPGCAMKYESSDN 208
Query: 660 EF-------HLNAIKKYAGNSLETVSDIDVRWRLLSGK-------------AATPETRLL 699
+F ++K G + V D+ W LL + A TRL
Sbjct: 209 QFCSRKCFKIFRGLRKLVGR-VNKVDDM-YSWTLLRSEHYDQSAENSKLESVADLNTRLA 266
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE----FGGMYCAILTVNSSVVSA 755
L A+ + +CF P++D S D++ ++Y R RG++ F G Y +L ++S
Sbjct: 267 L--ALTVIQECFRPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYTVVLEKEQELISV 322
Query: 756 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 815
+RV G AE+P + T +G + L I+++L L V+++VLPA E WT
Sbjct: 323 ASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETWT 382
Query: 816 DKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 849
FGF+K+ +L+ + +VTF G+S+LQK
Sbjct: 383 SAFGFQKLTAAQGIQLMEL------NIVTFPGSSVLQK 414
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 172/391 (43%), Gaps = 83/391 (21%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
I+C CC+ + + FE+HA GC R H+ S S + + CQ
Sbjct: 37 AIVCDCCHVTFTITGFESHA---------GCTR--HR--PSTSILLEDGRSLLDCQREAL 83
Query: 542 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK-NLEAELSG-CLLCRGCDFSKSGFG 599
+ + + V + V++ K+ +VK N EA+ C +C GFG
Sbjct: 84 SSSDHKGNHSVVNENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSIC--------GFG 135
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI---------------N 644
+ LCD+C FH+GCL L +P G+WFC C +I N
Sbjct: 136 G-DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQECKDHKDN 188
Query: 645 SVLQNLLVQEAEKLPEFHLNAIKKYA--GNSLET-----------------------VSD 679
++L + VQ +K +H +K N +E V+D
Sbjct: 189 NIL--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGKPIKVAD 243
Query: 680 IDVRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 734
++ W L+ ++ + L+ A+ + ++ F+P D++SGR+LI +V+ R
Sbjct: 244 -NLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESE 302
Query: 735 GQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEK 792
F G Y IL V+S +R++GQ+VAE+ VAT + +G LL IEK
Sbjct: 303 HNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVATKEQYRRQGICHLLMDEIEK 362
Query: 793 LLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
L+ L V+ ++L ++E+A +IWT FGF ++
Sbjct: 363 QLTRLGVEKLLLHSSEDAMNIWTKSFGFARM 393
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 117/461 (25%), Positives = 187/461 (40%), Gaps = 70/461 (15%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQF 497
K + + D + DG V Y G +K++ G G+ C CC+ V F
Sbjct: 367 KKHTILTWLIDGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVHCSCCDGVVPLPVF 426
Query: 498 EAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA-VEAG 556
EAHA P G + + K + G+ + Q +E ++ A A V A
Sbjct: 427 EAHAGARRRDPGPGQRQPWEKLL-----LVSGNSLLRCMQEAWEMEKVRTFHAQAKVRAA 481
Query: 557 RVSGVDSVEQITKRCIR-------IVKNLEAELSGCLLCRGCDFSKSGFG----PRTILL 605
D Q +R + +V+ + + + D S G +L
Sbjct: 482 LEQEEDKCSQAKRRLLAKHLKKGVVVERIMSPRMEKIKAGEKDSSDDACGVCADGGELLC 541
Query: 606 CDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLV--QEAEKLP 659
CD C FH CL ++P+G W C C+ C N LQ L Q A K
Sbjct: 542 CDSCTSTFHPECLAI-------KVPEGSWSCHYCRCVLCMS-NDDLQGLSTCQQCARKYH 593
Query: 660 E----FHLNA--IKKYAGNS-------LETVSDI------DVRWRLLSGKAATP------ 694
E N I Y G + L V+ + W LL + P
Sbjct: 594 ESCRPLPGNGCDIGTYCGETCKKLFSQLAQVTGVTNPTGDGFWWALLRIQKDEPASSEEM 653
Query: 695 ----ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTV 748
E + L+ A+ +F++CF+P+ D + D++ VY G + + G Y +L
Sbjct: 654 PAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGSQFKRLSYEGFYTMVLEK 713
Query: 749 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 808
+ +VSA +LR+ G +VAE+P T +G + L + +E++L+ ++V+ +V+PA +
Sbjct: 714 DGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRRLVSAVEQVLASVQVEKLVIPAID 773
Query: 809 EAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
W F F+ +DP+L KR S LV GT++L K
Sbjct: 774 SLVDTWKRSFFFRPVDPQLREEL-KRLS-LVVITGTTLLHK 812
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 146/379 (38%), Gaps = 75/379 (19%)
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 532
LLEG+ + GI C CC+ + S+FE HA + C+ + SL W
Sbjct: 208 LLEGWISRDGIRCGCCSEIFTISKFEIHA---GMKLCEPSQNIILETGISLLQCQLDSWN 264
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
Q ER F D G D T C +C
Sbjct: 265 K---QEESERSGFHLVDV---------GADDPNDDT----------------CGIC---- 292
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 652
G G ++ CD C FH CL L+ L K + LV
Sbjct: 293 ----GDG-GDLICCDGCPSTFHQSCLDIQLFEQLQMLLGVK-------HELEDGFSWTLV 340
Query: 653 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 712
Q E + LN I + E L+ A++I +CF
Sbjct: 341 QRTEVGFDISLNGIPQKV------------------------ECNSKLAVALSIMDECFL 376
Query: 713 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 770
PIVD SG +LI +++Y G N + G + AIL ++SA +R+ G ++AE+P
Sbjct: 377 PIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPF 436
Query: 771 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 830
+ T I +G + L IE L L V+ +V+PA E WT FGFK + E+ S
Sbjct: 437 IGTRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPL--EVSSR 494
Query: 831 YRKRCSQLVTFKGTSMLQK 849
R ++ F GT MLQK
Sbjct: 495 KEMRNMNMLVFHGTDMLQK 513
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 103 bits (256), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/422 (25%), Positives = 166/422 (39%), Gaps = 111/422 (26%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC--QNM 539
G++C CCN VS S+F+ HA PC F +S W +Y +N
Sbjct: 657 GVVCTCCNKTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEYKARRNG 713
Query: 540 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFG 599
+ ++ D N G V G G L+C
Sbjct: 714 WRLEKASDDDPNDDSCG-VCGD---------------------GGELIC----------- 740
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEA 655
CD C FH CL ++ LP+G W+C C CS L+ A
Sbjct: 741 ------CDNCPSTFHQACLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNA 781
Query: 656 EKLPEFHLNA-IKKYAGNSLETVS--------------------------------DID- 681
E+ +F + KY G L+ +S + D
Sbjct: 782 ERSQDFKCSQCAHKYHGTCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADG 841
Query: 682 VRWRLL----------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 729
+ W +L S + A E L+ A++I + F +VD +G D+IP ++Y
Sbjct: 842 LSWSILKCFQEDGMVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLY 901
Query: 730 --GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 787
G +F G Y ++ + ++S +RV G +AE+PLVAT +G ++L
Sbjct: 902 NWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILV 961
Query: 788 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 847
A IE++L L+V+ +V+ A WT+ FGFK +D E ++ L+ F GT++L
Sbjct: 962 AAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTTLL 1019
Query: 848 QK 849
+K
Sbjct: 1020 KK 1021
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/424 (25%), Positives = 167/424 (39%), Gaps = 111/424 (26%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC--QNM 539
G++C CCN VS S+F+ HA PC F +S W +Y +N
Sbjct: 613 GVVCTCCNKTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEYKARRNG 669
Query: 540 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFG 599
+ ++ D N G V G G L+C
Sbjct: 670 WRLEKASDDDPNDDSCG-VCGD---------------------GGELIC----------- 696
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEA 655
CD C FH CL ++ LP+G W+C C CS L+ A
Sbjct: 697 ------CDNCPSTFHQACLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNA 737
Query: 656 EKLPEFHLNA-IKKYAGNSLETVS--------------------------------DID- 681
E+ +F + KY G L+ +S + D
Sbjct: 738 ERSQDFKCSQCAHKYHGTCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADG 797
Query: 682 VRWRLL----------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 729
+ W +L S + A E L+ A++I + F +VD +G D+IP ++Y
Sbjct: 798 LSWSILKCFQEDGMVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLY 857
Query: 730 --GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 787
G +F G Y ++ + ++S +RV G +AE+PLVAT +G ++L
Sbjct: 858 NWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGMCRILV 917
Query: 788 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 847
A IE++L L+V+ +V+ A WT+ FGFK +D E ++ L+ F GT++L
Sbjct: 918 AAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTTLL 975
Query: 848 QKRV 851
+K +
Sbjct: 976 KKTL 979
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 164/417 (39%), Gaps = 96/417 (23%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
GI+C+CC +S S F+AHA GC +SL Q CQ
Sbjct: 967 GILCNCCTKTLSISDFKAHA---------GC----RLRLSSLGLFLQSGKSYTLCQ---- 1009
Query: 542 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 601
VEA S E +++R + +EA C C G G
Sbjct: 1010 -----------VEAW------SAELMSRRSDAYGRKVEAVDENDDTCGFC-----GDGGE 1047
Query: 602 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQ 648
+L CD C +H CL +ELP+G W+C C C S + +L+
Sbjct: 1048 -LLCCDNCPSTYHEACLSS------QELPEGSWYCHNCTCRSCGNPVNEKEVSSFSDILK 1100
Query: 649 NLLVQEAEK--------LPEFHLNAIKKYAGN-------------SLETVSDIDVRW--- 684
L +A LP + + G +E V + D+ W
Sbjct: 1101 CLQCGDAYHNTCIDRVMLPSDGKRSDTWFCGRYCKEIFMGLHSQVGVENVINNDLSWTIL 1160
Query: 685 -------RLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNL 733
RL S + E L+ A+ + +CF +VD +G D+IP ++Y G N
Sbjct: 1161 RCNSDGQRLHSAQKIGLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNF 1220
Query: 734 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 793
++ G Y IL ++ +R+ G + AELP +ATS +G + L IEK+
Sbjct: 1221 ARLDYKGFYTVILEKGDEILCVASIRLHGTKAAELPFIATSVDYRRQGMCRRLLDIIEKM 1280
Query: 794 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 850
L V+ +VL A E + W FGFK I+ + + R L+ F G S+L KR
Sbjct: 1281 LRSFHVEMLVLSAIPELVNTWVSGFGFKPIEDD--EKKQLRNVNLMLFPGASLLTKR 1335
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 171/389 (43%), Gaps = 79/389 (20%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
I+C CC+ + + FE+HA GC R H+ S S + + CQ
Sbjct: 37 AIVCDCCHVTFTITGFESHA---------GCTR--HR--PSTSILLEDGRSLLDCQREAL 83
Query: 542 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 601
+ + + V + V++ K+ +VK ++E + +C C GFG
Sbjct: 84 SSSDHKGNHSVVNENQKKNHSIVKENRKKNHCVVKE-KSEANNDNVCSIC-----GFGG- 136
Query: 602 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI---------------NSV 646
+ LCD+C FH+GCL L +P G+WFC C +I N++
Sbjct: 137 DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQECKDHKDNNI 190
Query: 647 LQNLLVQEAEKLPEFHLNAIKKYA--GNSLET-----------------------VSDID 681
L + VQ +K +H +K N +E V+D +
Sbjct: 191 L--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGKPIKVAD-N 244
Query: 682 VRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 736
+ W L+ ++ + L+ A+ + ++ F+P D++SGR+LI +V+ R
Sbjct: 245 LTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHN 304
Query: 737 --EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
F G Y IL V+S +R++GQ+VAE+ VAT + +G LL IEK L
Sbjct: 305 RLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVATKEQYRRQGMCHLLMDEIEKQL 364
Query: 795 SFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
+ L V+ ++L ++E+A + WT FGF ++
Sbjct: 365 TRLGVEKLLLHSSEDAMNTWTRSFGFARM 393
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 170/387 (43%), Gaps = 41/387 (10%)
Query: 482 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC--QNM 539
G++C CCN VS S+F+ HA PC F +S W +Y +N
Sbjct: 645 GVVCTCCNRTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEYKARRNG 701
Query: 540 FERKRFLQHDANAVEAGRVSGVDSVEQIT-KRCIRIVKNLEAELSGCLLCRGCDFSK--- 595
+ + D N G V G D E I C +A LS +L G +
Sbjct: 702 WRSEEASDDDPNDDSCG-VCG-DGGELICCDNCPSTFH--QACLSMQVLPEGSWYCSSCS 757
Query: 596 --------SGFGPRT-ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 646
S G R+ C QC ++H CL+ ++ R+L +FC +C ++ +
Sbjct: 758 CQICSELVSDNGERSQDFKCSQCAHKYHGICLQG--ISKRRKLFPETYFCGKNCEKVYTG 815
Query: 647 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK--AATPETRLLLSQAV 704
L + ++ + NA G S + ++ S + A E L+ A+
Sbjct: 816 L-------SSRVGVINPNA----DGLSWSILKCFQEDGKVHSARRLALKAECNSKLAVAL 864
Query: 705 AIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFG 762
+I + F +VD +G D+IP ++Y G N +F G Y +L + ++S +RV G
Sbjct: 865 SIMEESFLSMVDPRTGIDMIPHVLYNWGSNFARLDFDGFYTMVLEKDDVMISVASIRVHG 924
Query: 763 QEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK 822
VAE+PLVAT +G ++L A IE++L L+V+ +V+ A WT+ FGFK
Sbjct: 925 VTVAEMPLVATCSKYRRQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKP 984
Query: 823 IDPELLSIYRKRCSQLVTFKGTSMLQK 849
+D E ++ L+ F GT +L K
Sbjct: 985 MDDEERDALKR--INLMVFPGTILLMK 1009
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 129/270 (47%), Gaps = 33/270 (12%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM------------DCSRINSVLQNL 650
+LLCD+C FH C+ L+ P+G W C + D + + +
Sbjct: 576 LLLCDKCPSAFHHACVG------LQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 651 LVQEAEKLPE----FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 706
+ EA +P L+ +++ + TV+ I RW+ AA L A+ +
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRR--RYMSTVTRI-TRWQHEEEDAADHGQ---LCAALDV 683
Query: 707 FHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 764
H+CFD +V+ + DL +V+ + L F G Y L +++ G LRVFG +
Sbjct: 684 LHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVFGNQ 743
Query: 765 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 824
VAELPLV T + +G +LL +EK+L + V+ +VLPA E +WT GF +
Sbjct: 744 VAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFHAMT 803
Query: 825 -PELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
+++ + + +++FKGT+M QK + A
Sbjct: 804 RSDVMEMAVEHA--ILSFKGTTMCQKTLLA 831
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 27/50 (54%)
Query: 488 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
C+ E S DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 558 CSEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR 607
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 113/480 (23%), Positives = 187/480 (38%), Gaps = 105/480 (21%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYYACG-------QKLLEGYKNGLGIICHCCNSEVSPSQ 496
K + + D L DG V YY G +K++ G G+ C+CC++ V
Sbjct: 277 KKHTILTWLIDGGFLSDGETV-YYVPGDSGGAGKEKIVSGAVTRAGVHCNCCDAVVPLPV 335
Query: 497 FEAHA---------------------DGGNLLPC------DGCPRAFHKECASLSSIPQG 529
FE HA G +LL + R FH + +++ Q
Sbjct: 336 FEVHAGRVPGTGQQQQQVAWEKLLLVSGDSLLQSMQEAWQNEKVRTFHAQAKVRAALEQ- 394
Query: 530 DWYCKYCQNMFERKRFL-QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCL 586
+ +N ++R L +H V VE+I + +K E + S C
Sbjct: 395 ----EEEKNSQAKRRLLAKHQKKGV---------VVERIMSPRMEKIKAGEKDSSDDACG 441
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC---------- 636
+C D + +L CD C FH CL E+P G W C
Sbjct: 442 VC--ADGGE-------LLCCDFCTSTFHPECLAI-------EVPDGSWSCHYCRCTLCMS 485
Query: 637 --------CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-----VR 683
C +C+ L+ + + KK + E + ++
Sbjct: 486 NDDQDLSTCQECACKYHESCRPLLGNGRDIGAYCGEICKKLSAKLSEVIGVMNSTEDGFS 545
Query: 684 WRLL----------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 731
W LL G A E + L+ A+ + + CF+P+ D + D++ VY G
Sbjct: 546 WSLLRIHEDEPASSQGMPAVLERNVKLAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGS 605
Query: 732 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 791
+ + G Y IL + +VS +LR+ G++VAE+P T +G + + +E
Sbjct: 606 QFKRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAVE 665
Query: 792 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 851
++L+ ++V+++++PA W F F+ +DP+L KR S LV GT+ML K V
Sbjct: 666 QVLASVQVETLIIPAIASMVDTWKRSFSFRPVDPQLREEL-KRLS-LVVITGTTMLHKPV 723
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 147/380 (38%), Gaps = 86/380 (22%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 559
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 504 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 559
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 619
LS C LC E ++H C+
Sbjct: 560 ----------------------LSSCRLC---------------------EEKYHQACIN 576
Query: 620 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVS 678
+ FC C + LQ L + LPE F + ++++ S V+
Sbjct: 577 QDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVA 631
Query: 679 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 736
D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 632 DCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRL 683
Query: 737 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
+F A+L +++ +R+ G ++AE+P + T + +G + L IE +++
Sbjct: 684 DFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESFVAY 743
Query: 797 LRVKSIVLPAAEEAESIWT----------------DKFGFKKI-DPELLSIYRKRCSQLV 839
+ L +E +W FGF + D E +I + L+
Sbjct: 744 F--SQMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTI---KNLNLL 798
Query: 840 TFKGTSMLQKRVPACRIGSS 859
F G ML K + +I S
Sbjct: 799 VFPGVDMLGKSLVKEKITDS 818
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 103/221 (46%), Gaps = 11/221 (4%)
Query: 606 CDQCEREFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 664
C QCE ++H CL+ D L KWFC DC I L L+ K E +
Sbjct: 770 CKQCELKYHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELI----GKPREVGVE 825
Query: 665 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 724
+ S E D +++ A E LS A+ + H+ F+P+ GRDL
Sbjct: 826 KLTWRLVQSFEPNMYGDDAYKI----EAVAENHCKLSVALDVMHELFEPVKRPHGGRDLA 881
Query: 725 PSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 782
+++ R + F G Y +L N +V+ +R+ G++VAE+P + T + +G
Sbjct: 882 EDVIFSRWSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGM 941
Query: 783 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 823
++L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 942 CRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 982
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
GG L+ CDGCP AFH C L +P GDW+C+ C
Sbjct: 712 GGKLILCDGCPSAFHANCLGLEEVPDGDWFCESC 745
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 189/466 (40%), Gaps = 92/466 (19%)
Query: 452 VFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLL 507
+ D L D +V Y +K++ G GI C CCN+ V + FE HA
Sbjct: 378 LIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETHAR----- 432
Query: 508 PCDGCPRAFHK----ECASLSSIPQGDWYCKYCQNMFERKRFL----QHDANAVEAGR-- 557
C+ + + K LS Q W + M R++ + Q + +A R
Sbjct: 433 -CERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEKSSQAKRKL 491
Query: 558 --------VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 609
+ GV V + R ++ KN + C C + G +L CD C
Sbjct: 492 AKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG----QLLCCDTC 544
Query: 610 EREFHVGCL------KKHKMADLRELP----KGKW------------------------- 634
FH CL K + D ++L + W
Sbjct: 545 PSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRPLQSPGFE 604
Query: 635 ---FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSG 689
+C C +++S L +++ + E + L I+K + L T D+ V +L
Sbjct: 605 IGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSEDMPV---IL-- 656
Query: 690 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILT 747
E+ + L+ A+ + ++CF+P+ D + D++ VY G + + G Y +L
Sbjct: 657 -----ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLE 711
Query: 748 VNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAA 807
+ ++S +LR G+++AE+P T +G + L +EK+L+ L+V+++V+PA
Sbjct: 712 KDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAV 771
Query: 808 EEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
+ W F F+ + E+ +K LV GT++LQK + A
Sbjct: 772 ADLVETWKRSFSFRPMQAEVRDEAKKLS--LVAITGTTLLQKPISA 815
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 109/474 (22%), Positives = 191/474 (40%), Gaps = 92/474 (19%)
Query: 444 KDQRLHKLVFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEA 499
K + + D L D +V Y +K++ G GI C CCN+ V + FE
Sbjct: 372 KKHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFET 431
Query: 500 HADGGNLLPCDGCPRAFHK----ECASLSSIPQGDWYCKYCQNMFERKRFL----QHDAN 551
HA C+ + + K LS Q W + M R++ + Q
Sbjct: 432 HAR------CERPGQPWEKLLLMSGKPLSKCMQEAWAQERVTAMRAREKAMASLEQEKEK 485
Query: 552 AVEAGR----------VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 601
+ +A R + GV V + R ++ KN + C C + G
Sbjct: 486 SSQAKRKLAKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG---- 538
Query: 602 TILLCDQCEREFHVGCL------KKHKMADLRELP----KGKW----------------- 634
+L CD C FH CL K + D ++L + W
Sbjct: 539 QLLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCR 598
Query: 635 -----------FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSDID 681
+C C +++S L +++ + E + L I+K + L T D+
Sbjct: 599 PLQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSEDMP 655
Query: 682 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 739
V +L E+ + L+ A+ + ++CF+P+ D + D++ VY G + +
Sbjct: 656 V---IL-------ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYE 705
Query: 740 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 799
G Y +L + ++S +LR G+++AE+P T +G + L +EK+L+ L+V
Sbjct: 706 GFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQV 765
Query: 800 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 853
+++V+PA + W F F+ + E+ +K LV GT++LQK + A
Sbjct: 766 ENLVIPAVADLVETWKRSFSFRPMQAEVRDEAKKLS--LVAITGTTLLQKPISA 817
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 109/224 (48%), Gaps = 31/224 (13%)
Query: 606 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 665
C+QCER FH GC +A G +F C +S NL + AE +
Sbjct: 164 CEQCERCFHPGCCDDAILA-------GDFFFC------SSGCWNLFQRLAEMVA------ 204
Query: 666 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 725
++ + ++ W LL + LL++A+ + FDP++D + D +
Sbjct: 205 -------TVNPLGRSELSWSLLRRGRCDDK---LLAEALQLISSRFDPVLDCWTQLDYLD 254
Query: 726 SMVYGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 783
+MV+ R+ +F G Y A+L + VV +LR+ +AE+P +AT G+G
Sbjct: 255 AMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGIC 314
Query: 784 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 827
+ LF +E++L+ L V+++ L AA++ E +W + F F +D +L
Sbjct: 315 RSLFTAVEEMLARLGVETMALLAAKDTEKMWKNSFEFHAVDRKL 358
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+GG L+ C+ CP FH EC SL +P+ W+C C
Sbjct: 116 EGGELVCCETCPLTFHMECVSLLEVPKDAWFCFRC 150
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 151/381 (39%), Gaps = 78/381 (20%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 559
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCL 618
LS C LC C S P T+ C + G +
Sbjct: 713 ----------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTV 744
Query: 619 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 677
+ D FC C + LQ L + LPE F + ++++ S V
Sbjct: 745 PGERSTDS--------FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EV 793
Query: 678 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 735
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 794 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHR 845
Query: 736 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLS 795
+F A+L +++ +R+ G ++AE+P + T + +G + L IE ++
Sbjct: 846 LDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMYRRQGMCRRLMDGIESFVA 905
Query: 796 FLRVKSIVLPAAEEAESIWT----------------DKFGFKKI-DPELLSIYRKRCSQL 838
+ + L +E +W FGF + D E +I + L
Sbjct: 906 YF--SQMFLAISEVLLDVWQFCCYPACFGDGPFCFFSGFGFAPVNDSEKKTI---KNLNL 960
Query: 839 VTFKGTSMLQKRVPACRIGSS 859
+ F G ML K + +I S
Sbjct: 961 LVFPGVDMLGKSLVKEKITDS 981
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 3/136 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
S + + KK +LK KK+ N P V L TG+LDGV V Y I + LRG+I+
Sbjct: 259 SGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKY---IAWSREELRGVIKG 315
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LC C CN +VI +FE HA + + + +I FENGK++ +++ RS+P ML
Sbjct: 316 SGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLF 375
Query: 322 ATLQSALSSLPEEKSF 337
+Q+ S +KSF
Sbjct: 376 EVIQTITGSPINQKSF 391
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/427 (23%), Positives = 152/427 (35%), Gaps = 132/427 (30%)
Query: 482 GIICHCCNSEVSPSQFEAHA--------------DGGNLLPC------------------ 509
GI C CCN + ++FEAHA G +LL C
Sbjct: 800 GIQCDCCNKTFTSAEFEAHAGGKSCQPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFHF 859
Query: 510 ---DG----------------------CPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 544
DG CP FH+ C + P G W+C YC F
Sbjct: 860 IDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKF---- 915
Query: 545 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
G V G C R + ++ ++
Sbjct: 916 ----------CGMVGG--------NTCQR-----DGNMAAV--------------SHALV 938
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 664
C CE ++H C ++ + + P FC +C + LQ L + E F
Sbjct: 939 TCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQELYERLQMLFGVKQELEAGFSWT 996
Query: 665 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 724
++++ + SDI V SG + + ++ A+ I +CF P+VD SG +LI
Sbjct: 997 FVRRF-----DVSSDISV-----SGMSWKVDCNSKVAVALQIMDECFVPMVDHKSGVNLI 1046
Query: 725 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 782
++VY G N + G + A+L +++A +R F +P+ S ++ G
Sbjct: 1047 RNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF----YSMPVSFHSSLSMG--- 1099
Query: 783 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 842
L L V +V+PA E WT FGFK ++ I R ++ F
Sbjct: 1100 -----------LCSLNVGKLVIPAISELTGTWTSVFGFKHLEGSDKQIMRNM--NMMVFP 1146
Query: 843 GTSMLQK 849
G MLQK
Sbjct: 1147 GVDMLQK 1153
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 3/147 (2%)
Query: 191 SLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+LI I + T KK E K+SKK+ N P V L TG+LDGV V Y I +
Sbjct: 429 ALISTAQITASGSETVSKKKEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKY---IAW 485
Query: 251 QASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR 310
LRGII+ G LC C CN +VI +FE HA + + + +I FENGK++ +++
Sbjct: 486 SREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQ 545
Query: 311 ACRSVPLPMLKATLQSALSSLPEEKSF 337
+S P L +Q+ S +KSF
Sbjct: 546 ELKSTPQNSLFDVIQTITGSPINQKSF 572
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y I LRGII+ G LC
Sbjct: 306 KNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---ISLSRKELRGIIKGSGYLCG 362
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +ICFENGK++ ++++ R+ P ML +Q+
Sbjct: 363 CQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTV 422
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 423 FGAPINQKSF 432
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
+ + S KN ELK SKKI N P V L TGLLDGV+V Y+ + + LRG I+
Sbjct: 401 TTIDSASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKT--LRGTIKG 458
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LCSC +C G +V+ +FE HA + + + +I FENGK++ V++ ++ P ML
Sbjct: 459 TGYLCSCKVC-GNKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLF 517
Query: 322 ATLQSALSSLPEEKSFACVRC 342
+++ S +K+F +
Sbjct: 518 NAIETVTGSAINQKNFLSWKA 538
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 116/258 (44%), Gaps = 43/258 (16%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 662
++ CD C +H CL +LR SR+ + + F
Sbjct: 1056 LICCDNCPASYHQDCLPCQIYMNLR-------------SRVGIPIHTI--------DGFS 1094
Query: 663 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 722
++ + T +DI A E + L A++I +CF PI+D+ +G D
Sbjct: 1095 CTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGID 1144
Query: 723 LIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 778
+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT N
Sbjct: 1145 IIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1202
Query: 779 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS-- 836
+G + L IE++L L+V+ ++L A WT FGF ID + RK S
Sbjct: 1203 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRL 1258
Query: 837 QLVTFKGTSMLQKRVPAC 854
+LV+ GT +L++ + C
Sbjct: 1259 RLVSVPGTVLLKRNLYEC 1276
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 59/87 (67%), Gaps = 2/87 (2%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG++ PC CPR+FH C LS +P +WYC C+N+ ++++ L + NA AGR +GVD
Sbjct: 24 GGDIFPCKICPRSFHPACVGLSKVPS-EWYCDNCRNLVQKEKALAENKNAKAAGRQAGVD 82
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCR 589
S+EQI KR IRIV + +L GC LC+
Sbjct: 83 SIEQIMKRAIRIVP-ISDDLGGCALCK 108
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +KK + N P V L TG+LDGV V Y + LRG
Sbjct: 302 VAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKY---VSVSREELRG 358
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 359 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 418
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 419 SLLFDTIQTVFGAPINQKAF 438
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 4/152 (2%)
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGI 757
L A+++ H+CF+P+ +S+S RDL+ +++ R L F G Y +L N ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 758 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 817
+RV+G++VAE+PLV T G +L +EK L L V+ +VLPA WT
Sbjct: 1039 VRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRS 1098
Query: 818 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
FGF K+ S + + F+G M QK
Sbjct: 1099 FGFAKMTNLERSQFLD--YTFLDFQGAIMCQK 1128
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 55/141 (39%), Gaps = 49/141 (34%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CP +FHK C L IP GDW+C C +R + D D
Sbjct: 650 GGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQRKIDRD------------D 697
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
VEQ +L C QCE ++HV CL ++
Sbjct: 698 EVEQ------------------------------------LLPCIQCEHKYHVRCL-ENG 720
Query: 623 MADLRELPKGKWFCCMDCSRI 643
AD+ G WFC DC ++
Sbjct: 721 AADISTRYLGNWFCGKDCEKL 741
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 86/172 (50%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYGQETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVTY---VSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +K + N P V L TG+LDGV V Y I LRG
Sbjct: 341 VAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKY---ISVSREELRG 397
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 458 SLLFDTIQTVFGAPIHQKAF 477
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y + LRGII+ G LC
Sbjct: 243 KNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---VSLSREELRGIIKGSGYLCG 299
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P ML +Q+
Sbjct: 300 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTV 359
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 360 FGAPINQKSF 369
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 82/159 (51%), Gaps = 3/159 (1%)
Query: 179 VTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLD 238
V Q E +++ + +A+ + + E+K +K + N P V L TG+LD
Sbjct: 251 VQQKEFDASDAHATASNTRVAKSKTESVSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLD 310
Query: 239 GVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYIC 298
GV V Y I LRG+I+ G LCSC CN +V+ +FE HA + + + +I
Sbjct: 311 GVPVKY---IALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIY 367
Query: 299 FENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
FENGK++ ++++ RS P ML +Q+ + +KSF
Sbjct: 368 FENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSF 406
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENG+++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 72/133 (54%), Gaps = 3/133 (2%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
++ K E KMSKK + N P V L TG+LDGV V Y + L GII+ G
Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGY 388
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +
Sbjct: 389 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAI 448
Query: 325 QSALSSLPEEKSF 337
Q+ S +KSF
Sbjct: 449 QTVTGSPINQKSF 461
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K E KMSKK + N P V L TG+LDGV V Y + L GII+ G LC
Sbjct: 265 KNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGYLCG 321
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +Q+
Sbjct: 322 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 381
Query: 328 LSSLPEEKSF 337
S +KSF
Sbjct: 382 TGSPINQKSF 391
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 428 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 485
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 546 GSTINQKNF 554
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 6/146 (4%)
Query: 707 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 764
+CF PI+D +G +LI ++VY G N +F G Y IL +++A +R+ G +
Sbjct: 1 MDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTK 60
Query: 765 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI- 823
+AE+P + T + +G + L IE +LS L V+ +++PA E WT KFGF +
Sbjct: 61 LAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLE 120
Query: 824 DPELLSIYRKRCSQLVTFKGTSMLQK 849
D E + K S LV F GT +LQK
Sbjct: 121 DSEKQEV--KSISMLV-FPGTGLLQK 143
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 417 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 474
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 475 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 534
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 535 QEMLFDAIQNVTGSPINQKNF 555
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 396 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 453
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 454 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 513
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 514 QEMLFDAIQNVTGSPINQKNF 534
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 74/140 (52%), Gaps = 7/140 (5%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM-------GGIKFQASGLRG 257
++ K E KMSKK + N P V L TG+LDGV V Y+ G I L G
Sbjct: 443 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHG 502
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 503 IIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 562
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L +Q+ S +KSF
Sbjct: 563 SLLFBAIQTVTGSPINQKSF 582
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 82/150 (54%), Gaps = 6/150 (4%)
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 760
AV I H+CF I++ + D+ +V+ R LR F G Y +L +VS G R+
Sbjct: 11 AVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGTFRI 70
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
GQ+ AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 71 CGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGF 130
Query: 821 KKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
+ + E L + + +++F+GT+M QK
Sbjct: 131 TVMSNSERLELAG---NSILSFQGTTMCQK 157
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 85/164 (51%), Gaps = 4/164 (2%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V+Y+ +A L+G+I I C C CNG + +
Sbjct: 403 LTKHPGNIRELLNTGLLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGSKAVSAY 461
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + S C+
Sbjct: 462 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCL 521
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRISSS 384
C P+ ++ LC C++SK+PQ +T + SSS
Sbjct: 522 NCNE--PVLPSSQSE-NVLCQVCLESKQPQDPLTASYACNGSSS 562
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 68/137 (49%), Gaps = 30/137 (21%)
Query: 468 ACGQKLLEGYKNGLGIICHCCNSEVS----PSQF--EAHA---------DGGNLLPCDGC 512
+ G++ ++GY I C+ CN VS S F AH G N PC G
Sbjct: 583 SAGKRKVDGYIKDQRIYCNHCNRVVSLFSHLSYFFRLAHQHLKLMRVRDQGAN--PCVG- 639
Query: 513 PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCI 572
L +P +WYC C N+ ++++ L + NA AGR +GVDS+EQI KR I
Sbjct: 640 ----------LRKVP-SEWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAI 688
Query: 573 RIVKNLEAELSGCLLCR 589
RIV + +L GC LC+
Sbjct: 689 RIVP-ISDDLGGCALCK 704
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 113/300 (37%), Gaps = 67/300 (22%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG+L+ CD C FH +C + +P GDWYC+ C
Sbjct: 761 GDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSC------------------------ 795
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCEREFHVGCL 618
LCR C F K P +L C QC R++H C
Sbjct: 796 --------------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCS 829
Query: 619 KKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 676
+P FC C +I L LL + F + + +A +
Sbjct: 830 SGTGTDSGCTMPGTSIDCFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLVHCFANDQAMP 889
Query: 677 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 734
+ + K A ++ L A + +CF P +D SG ++I ++ Y G +
Sbjct: 890 NKNKE--------KLAQCNSKTAL--AFTVLDECFQPHIDDRSGINMIHNVAYNCGSDFS 939
Query: 735 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 794
+F G Y IL V++A +R+ G ++AE+P + T + +G + L IE ++
Sbjct: 940 RLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIGTRGMYRHQGMCRRLLNGIESVI 999
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/139 (34%), Positives = 73/139 (52%), Gaps = 2/139 (1%)
Query: 204 LTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGG 263
+ S KN ELK SKK+ N P V L TGLLDGV V Y+ + + L GII+ G
Sbjct: 432 IDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT--LEGIIKGTG 489
Query: 264 ILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKAT 323
LC C C + + +FE HA + + + +I FENGK++ V++ ++ P +L
Sbjct: 490 YLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNA 549
Query: 324 LQSALSSLPEEKSFACVRC 342
+Q+ S +K+F +
Sbjct: 550 IQTVTGSHINQKNFRIWKA 568
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 119/269 (44%), Gaps = 36/269 (13%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKL 658
+L+CD+C FH C+ L P+G WFC C C + L
Sbjct: 423 LLMCDRCPSMFHHACVG------LESTPQGDWFCPACTCAICG-------------SSDL 463
Query: 659 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 718
+ + + +S R G+ E L A+ + +CF +++
Sbjct: 464 DDPPATTTTQGFSSDRMVISCEQCRRESRDGEE---EEHAKLCMALDVLRECFVTLIEPR 520
Query: 719 SGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 776
+ DL +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 521 TQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFA 580
Query: 777 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRKRC 835
+G +LL I+KLL + V+ +VLPA E + WT FG + E+ R+
Sbjct: 581 RRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIR----EMGQADRQDV 636
Query: 836 SQ--LVTFKGTSMLQKRVPA-CRIGSSST 861
+ ++ F+GT M K++P ++G ++T
Sbjct: 637 AHHAILRFQGTIMCHKQLPPQPQLGHTTT 665
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 21/35 (60%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
D G LL CD CP FH C L S PQGDW+C C
Sbjct: 419 DCGELLMCDRCPSMFHHACVGLESTPQGDWFCPAC 453
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 429 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 486
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ ML +Q+
Sbjct: 487 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVT 546
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 547 GSTINQKNF 555
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 83/161 (51%), Gaps = 8/161 (4%)
Query: 182 SEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGL 236
SE G + + A S +T K ++ LK + + K+ P V L TG+
Sbjct: 279 SEADGVPELESSSFDVPASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGM 338
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDGV V Y+ + + LRGII+ G LC C CN +++ +FE HA + + + +
Sbjct: 339 LDGVPVKYVSVTREE---LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNH 395
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
I FENGK++ ++++ RS P +L T+Q+ + +KSF
Sbjct: 396 IYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPINQKSF 436
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 83/161 (51%), Gaps = 8/161 (4%)
Query: 182 SEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGL 236
SE G + + A S +T K ++ LK + + K+ P V L TG+
Sbjct: 285 SEADGVPELESSSFDVPASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGM 344
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDGV V Y+ + + LRGII+ G LC C CN +++ +FE HA + + + +
Sbjct: 345 LDGVPVKYVSVTREE---LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNH 401
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
I FENGK++ ++++ RS P +L T+Q+ + +KSF
Sbjct: 402 IYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPINQKSF 442
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK S N P V L TG+ DG+ V Y + L+G+I+ G LCSC
Sbjct: 386 KNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYC--TWSREKNLQGVIKGTGYLCSC 443
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
+C G + + +FE HA + + + +I FENGKS+ V++ ++ P ML +Q+
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 329 SSLPEEKSF 337
+ +++F
Sbjct: 504 GATINQRNF 512
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 84.0 bits (206), Expect = 3e-13, Method: Composition-based stats.
Identities = 42/73 (57%), Positives = 51/73 (69%), Gaps = 1/73 (1%)
Query: 780 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 839
+GYFQ LF CIE+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y K +
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTT 61
Query: 840 TFKGTSMLQKRVP 852
F+GTS L K VP
Sbjct: 62 VFQGTSTLHKLVP 74
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 72/134 (53%), Gaps = 2/134 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN + + SKK++ N P V L TG+LDGV V Y+ + L+G+I+ G LC C
Sbjct: 339 KNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSR--EKNLKGLIKGAGYLCGC 396
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 397 QECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTVT 456
Query: 329 SSLPEEKSFACVRC 342
S +K+F +
Sbjct: 457 GSPINQKNFRSWKA 470
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 82.4 bits (202), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%)
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGI 757
L A + H+CF +V+ + DL +V+ R LR F G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 758 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 817
LRV+G++VAELPLV T + +G LL +E LL V+ +VLPA E WT
Sbjct: 69 LRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQTWTGS 128
Query: 818 FGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 849
FGF+ + + L I + ++ F+GT+M K
Sbjct: 129 FGFQVMTQSQKLDIAQH---TIMCFQGTTMCHK 158
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 81.6 bits (200), Expect = 2e-12, Method: Composition-based stats.
Identities = 52/135 (38%), Positives = 75/135 (55%), Gaps = 10/135 (7%)
Query: 722 DLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 780
D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT K
Sbjct: 104 DDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKK 163
Query: 781 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CS 836
GYF LL + IE L V+ + P E IW++K G+ +LS +K
Sbjct: 164 GYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAH 218
Query: 837 QLVTFKGTSMLQKRV 851
LV F S++QK +
Sbjct: 219 PLVMFANLSLVQKSL 233
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 1/134 (0%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
K+ + K +KK S N P V L TG+ DGV+V Y + + L+GII+ G LC C
Sbjct: 367 KSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR-EVRNLKGIIKGTGYLCGC 425
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN RV+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+
Sbjct: 426 GNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVT 485
Query: 329 SSLPEEKSFACVRC 342
S K+F +
Sbjct: 486 GSDINHKNFNTWKA 499
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ++K ++K S N P V L TG+LDGV V Y+ + + LRGII+ LC
Sbjct: 293 KNKQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE---LRGIIKGTTYLCG 349
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN + + +FE HA + + + +I FENGK++ ++++ RS P L T+Q+
Sbjct: 350 CQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTI 409
Query: 328 LSSLPEEKSF 337
+ +K+F
Sbjct: 410 FGAPINQKAF 419
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 63/250 (25%)
Query: 647 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 706
LQNLL + + PE+ +++ E V +D R E ++ A+++
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDKR----------VECNSKIAVALSL 1291
Query: 707 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR----- 759
+CF PIVD +G +LI ++VY G N +F G Y IL +++A +R
Sbjct: 1292 MDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLKEKN 1351
Query: 760 ------------------------------------VFGQEVAELPLVATSKINHGKGYF 783
+ G ++AE+P + T + +G
Sbjct: 1352 ILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQGMC 1411
Query: 784 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLV 839
+ L IE +LS L ++ +++PA E WT KFGF +D E+ S+ ++
Sbjct: 1412 RRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKSV------SML 1465
Query: 840 TFKGTSMLQK 849
F GT +LQK
Sbjct: 1466 VFPGTGLLQK 1475
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGGNL+ CDGCP FH C L ++P W C C F H+ ++ +A +
Sbjct: 1013 GDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNCSCKF------CHEHSSDDAEDTAD 1066
Query: 561 VDS 563
VDS
Sbjct: 1067 VDS 1069
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + Q + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQRN-LKGMIKGTGYLCGCGNC 444
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 445 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 504
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 505 INHKNFNTWKA 515
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 74.3 bits (181), Expect = 3e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 46/71 (64%)
Query: 519 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 579 EAELSGCLLCR 589
E GC LC+
Sbjct: 64 ETGFGGCALCK 74
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 66/131 (50%), Gaps = 2/131 (1%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSR--ERNLKGMIKGTGYLCGCGNC 443
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 444 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 503
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 504 INHKNFNTWKA 514
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/273 (22%), Positives = 95/273 (34%), Gaps = 83/273 (30%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 965 GDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT----------------------- 1001
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
C +C G K I C QC +H C+++
Sbjct: 1002 ------------------------CQVCGGPFSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1037
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 680
K+ L + WFC C I ++ + G + + D
Sbjct: 1038 EKLP-LEDQISQTWFCGKYCKEI-------------------FIGLRSHVGT--DNILDS 1075
Query: 681 DVRWRLL----SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 728
D+ W +L G+ A E + L+ A+ + +CF +VD +G D+IP ++
Sbjct: 1076 DLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVL 1135
Query: 729 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 759
Y G N ++ G Y IL ++ +R
Sbjct: 1136 YNKGSNFARVDYQGFYTVILEKGDEILCVASIR 1168
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 448 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 505
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F +
Sbjct: 506 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQTWKA 563
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 473 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 530
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F +
Sbjct: 531 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQTWKA 588
>gi|154309635|ref|XP_001554151.1| hypothetical protein BC1G_07288 [Botryotinia fuckeliana B05.10]
Length = 765
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 88/207 (42%), Gaps = 59/207 (28%)
Query: 482 GIICHCCN--SEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +C C V+P++ +L CDGCP H++C S+ IP+GDW+CK CQ
Sbjct: 291 GPVCEICTKPDSVAPNK---------ILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKA 341
Query: 540 FERKRFLQHDANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 598
+ N GV DS ++I+ C +C+G D K
Sbjct: 342 RVAAEAARAAEN-------DGVTDSDDEIS----------------CAVCQGLDSEK--- 375
Query: 599 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 658
P I+LC+ C+ H C ++ + P+G+W C C ++ V +LL +E +
Sbjct: 376 -PNEIILCENCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLG 425
Query: 659 P-----------EFHLNAIKKYAGNSL 674
P E HL +++ N L
Sbjct: 426 PISNEVPSIEGFETHLKTMQRVLLNRL 452
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 71.2 bits (173), Expect = 2e-09, Method: Composition-based stats.
Identities = 65/252 (25%), Positives = 104/252 (41%), Gaps = 34/252 (13%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQC-EREFHVGC---LKKHKMADLRELPKGKWFCCMDC 640
C C G D G R ++LC C H GC + ++ G +FC +C
Sbjct: 16 CTHCGGGDVEPEG--RRVLVLCSACFAAGTHTGCHEDVTGEPLSSEITHGDGLYFCGKEC 73
Query: 641 SRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 700
R L+ + + E ++Y + L+ K R +
Sbjct: 74 QRSYEALEAATGRRSRIRDE-----PEQYT-------------FELVHYKQDDRTVRSAV 115
Query: 701 SQAVAIFHDCFDPIVDSISGRDLI---------PSMVYGRNLRGQEFGGMYCAILTVNSS 751
A+ +F F P++ +GRDL+ P G F AIL + +
Sbjct: 116 ETAMRMFRTSFAPLIME-NGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFRLAILRMGGT 174
Query: 752 VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAE 811
+++A LRVFG + AE+P V+T + + G+ + L +E LL V +V+P+ E
Sbjct: 175 IITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCLVIPSINELL 234
Query: 812 SIWTDKFGFKKI 823
+WT+KFGF KI
Sbjct: 235 PMWTNKFGFAKI 246
>gi|347838360|emb|CCD52932.1| hypothetical protein [Botryotinia fuckeliana]
Length = 886
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 88/207 (42%), Gaps = 59/207 (28%)
Query: 482 GIICHCCN--SEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +C C V+P++ +L CDGCP H++C S+ IP+GDW+CK CQ
Sbjct: 291 GPVCEICTKPDSVAPNK---------ILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKA 341
Query: 540 FERKRFLQHDANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 598
+ N GV DS ++I+ C +C+G D K
Sbjct: 342 RVAAEAARAAEN-------DGVTDSDDEIS----------------CAVCQGLDSEK--- 375
Query: 599 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 658
P I+LC+ C+ H C ++ + P+G+W C C ++ V +LL +E +
Sbjct: 376 -PNEIILCENCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLG 425
Query: 659 P-----------EFHLNAIKKYAGNSL 674
P E HL +++ N L
Sbjct: 426 PISNEVPSIEGFETHLKTMQRVLLNRL 452
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 65/133 (48%), Gaps = 10/133 (7%)
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR---------GQEFGGMYCAILTVNS 750
+ QA+ +F F P++ +GRDL+ + G G F G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMD-NGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRG 279
Query: 751 SVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEA 810
+VV+A LRVFG+ AELP VAT + G + L +E LL V +V+P+ +
Sbjct: 280 AVVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPL 339
Query: 811 ESIWTDKFGFKKI 823
+W KFGF +
Sbjct: 340 LPMWAAKFGFTPL 352
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 95/219 (43%), Gaps = 31/219 (14%)
Query: 603 ILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL-- 658
++ CD C FH CL + H M D L + + C + I ++ V + K+
Sbjct: 650 LVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGIAPIVHANSVGQLLKMLL 708
Query: 659 -PEFHLNA----------IKKYAGNSLETVSDIDVRWRLLSGKAATP-----------ET 696
P + A +KKY G E + W L+ + A E
Sbjct: 709 RPRMQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRECADSDLFLGEHPHIVEN 766
Query: 697 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 754
L+ A+ + +CF PIVD SG +++ +++Y G N FGG Y A+L VV+
Sbjct: 767 NSKLALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVA 826
Query: 755 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 793
+ +R G +AE+P + T + +G + LF+ IE +
Sbjct: 827 SASIRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESV 865
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 69.7 bits (169), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 68/115 (59%), Gaps = 4/115 (3%)
Query: 695 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG--QEFGGMYCAILTVNSSV 752
E R+ LS A+ + H C++P+ DS +G D++P ++ G L G ++ GM+ A+L +
Sbjct: 420 ELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGPAA 479
Query: 753 VSAGILRVFGQEVAELPLVAT-SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 806
V+ + R FG ++AE+P++A ++ G +LL A +E+LL K+I PA
Sbjct: 480 VAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPA 533
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 12/54 (22%)
Query: 484 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+CH C GG+L+ C+ CP FH C L++ P+GD++C C+
Sbjct: 121 LCHICGL------------GGDLMCCETCPGVFHAACLGLAAPPEGDYHCPLCR 162
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 760
A+ + H+ F I++ + RDL +V+ R LR F G Y + V
Sbjct: 11 ALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL--------------V 56
Query: 761 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 820
G++ AELPL+ T +G +LL +EKLLS L V+ ++LP + WT FGF
Sbjct: 57 CGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGSFGF 116
Query: 821 KKIDPELLSIYRKRCSQLVTFKGTSMLQK 849
++ ++ + +++F+GT+M QK
Sbjct: 117 TEMSYS--DRFQYAANIILSFQGTTMCQK 143
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 107/279 (38%), Gaps = 81/279 (29%)
Query: 607 DQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPEFH 662
D+ ER+ H+ ++ P G W C C C + E+ +
Sbjct: 132 DKVERDCHL---------HIQMFPHGDWHCPNCTCKFCRAV-----------VEECSQTL 171
Query: 663 LNAIKKYAG--NSLE------------TVSDIDVRWRLLSGKAATPETRLLLSQA----V 704
+KKY G + LE T SD +RW TP QA +
Sbjct: 172 FEGVKKYVGVKHELEARFSWSLVHRECTDSDFILRW--------TPSYCGKQFQAGHSSL 223
Query: 705 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 764
+ +CF PI+D SG G YC + L+ G
Sbjct: 224 TVMDECFLPIIDRRSG-------------------GKYC----------TKCPLQFHGNR 254
Query: 765 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 824
+AE+ + T + +G + LF+ +E L L+V+ +V+PA + +W KFGFK ++
Sbjct: 255 LAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATADLSHVWISKFGFKYVE 314
Query: 825 PELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 863
L R L+ F G +LQK + A R S+ D+
Sbjct: 315 DSLKK--ELRSMNLLAFPGIDVLQKELLAPRHAKSAADT 351
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
+LDGV V Y+ + + LRGII+ G LC C CN +VI +FE HA + + +
Sbjct: 1 MLDGVPVKYIAWSREKE--LRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNN 58
Query: 296 YICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
+I FENGK++ +++ +S P L +Q+ S +KSF
Sbjct: 59 HIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSF 100
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 85/203 (41%), Gaps = 42/203 (20%)
Query: 485 CHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ--NMF 540
C C + +P Q +L CDGC R +H C LS IPQGDW+C C +
Sbjct: 978 CRVCRKKSNPEQ---------MLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQLS 1028
Query: 541 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 600
RKR VE D+ E++ + + + C +C P
Sbjct: 1029 PRKR----TKAPVEVSSEEEDDN-EKVDEDGDEDEEEEDLNQEVCNICE---------SP 1074
Query: 601 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKL 658
++LCD C + FH+ C+ DL+ LP+G W C C+ + N L K+
Sbjct: 1075 GELILCDFCPKSFHLDCI------DLKRLPRGTWKCPPCVLGKKKNKRGSPPLT----KV 1124
Query: 659 PEFHLNAIKKYAGNSLETVSDID 681
N I+KY L TV+D+D
Sbjct: 1125 KVRSRNNIRKY---DLATVTDVD 1144
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 72/181 (39%), Gaps = 31/181 (17%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 371 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAPEGKWS 418
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGC 591
C +C+ K +Q +A + I++ +R+ E E + CR C
Sbjct: 419 CPHCE-----KEGIQWEAKDEDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVC 473
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C I +Q +
Sbjct: 474 ---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKGRVQKI 523
Query: 651 L 651
L
Sbjct: 524 L 524
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 34/132 (25%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRVSGVDSV 564
+L CDGCP A H++C S+ IP GDW+CK CQ N + ++ +A+ DS
Sbjct: 305 ILFCDGCPLAVHQKCYSVPKIPDGDWFCKKCQRNRVAAEAARANENDALN-------DSD 357
Query: 565 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 624
++I C +CRG D K P I+LC+ C+ H C
Sbjct: 358 DEIK----------------CAVCRGLDSKK----PNEIILCENCDYAVHQTC------G 391
Query: 625 DLRELPKGKWFC 636
D+ + P+ +W C
Sbjct: 392 DIPKKPREEWLC 403
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 6/98 (6%)
Query: 759 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 818
R+ G ++AE+P + T I +G L IE LS L V+ +V+PA E ++ WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 819 GFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPAC 854
GFK ++P R++ L + GT +L+KR+ A
Sbjct: 61 GFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLAT 94
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 16/126 (12%)
Query: 737 EFGGMYCAILTVNSSVVSAGI------------LRVFGQEVAELPLVATSKINHGKGYFQ 784
+F G Y L + V A LR+ G +VAE+PLVAT+ +G Q
Sbjct: 62 DFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYRRQGMCQ 121
Query: 785 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKG 843
+L +EK+LS L V+ +VLPA E +W FGF ++ E L + R + F+G
Sbjct: 122 VLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLR---FPFLGFQG 178
Query: 844 TSMLQK 849
T+M QK
Sbjct: 179 TTMFQK 184
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 27/35 (77%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
DGG+LL CD CPR+FH +C L SIP+ DWYCK C
Sbjct: 129 DGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 69/181 (38%), Gaps = 40/181 (22%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 354 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWS 401
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A + +I V + E E CR C
Sbjct: 402 CPHCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC- 445
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 446 --KDGG---ELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKIQKIL 496
Query: 652 V 652
Sbjct: 497 T 497
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 406
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A R G D E ++E + CR C
Sbjct: 407 CPHCE-----KEGIQWEA------REDGSDGEEDNGD-----AGDMEEDDHHMEFCRVC- 449
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 450 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCM-CPPMKGKVQKI 499
Query: 651 LV 652
L
Sbjct: 500 LT 501
>gi|357510881|ref|XP_003625729.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
gi|355500744|gb|AES81947.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
Length = 290
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 184 GFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKK-PMTVTELFETGLLD 238
G G E+++ ++ E A S AL + +ELK SKKI+++KK P T+ ELF TGLLD
Sbjct: 203 GSGEETVTKLDQEGAAVESEIDGALAVRRNKMELKTSKKIAVDKKRPTTMKELFRTGLLD 262
Query: 239 GVSVVYMGGIKFQAS 253
GVSVVY+ GIK + S
Sbjct: 263 GVSVVYVSGIKKEES 277
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 137/379 (36%), Gaps = 106/379 (27%)
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 532
+LEG+ GI C CC+ ++ S+FE HA P
Sbjct: 574 MLEGWITRDGIHCGCCSKILAVSKFEIHAGSKLRQPF----------------------- 610
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
QN+F N+ AG + G SV+ I + C +C
Sbjct: 611 ----QNIF---------LNSGGAGNI-GFCSVDVIAD---------DPNDDACGIC---- 643
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRI------N 644
G G ++ CD C FH CL + H M D L + + C + I N
Sbjct: 644 ----GDGG-DLVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGIAPIVHAN 697
Query: 645 SVLQ------NLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDIDVRWRL----------- 686
SV Q L VQ K +L+ +KKY G E + W L
Sbjct: 698 SVRQLLKMLLRLWVQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRECTNSDLS 755
Query: 687 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG-------RDLIPSMVYG--------- 730
LSG E L+ A+ + +CF PI+D SG R+ + +G
Sbjct: 756 LSGHPHIVENNSKLALALTVMDECFLPIIDRRSGHCKKFCLRNFTTVIFFGISLCWFVCL 815
Query: 731 ------RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQ 784
N FGG Y A+L +V++ +R G +AE+P + T + +G +
Sbjct: 816 YIAFRRSNFNRLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCR 875
Query: 785 LLFACIEKLLSFLRVKSIV 803
LF+ +E + S V +
Sbjct: 876 RLFSVVESVSSTADVAKLT 894
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 71/148 (47%), Gaps = 31/148 (20%)
Query: 703 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFG 762
AV I H+CF I++ + D+ +V+ R + GQ+F
Sbjct: 774 AVDILHECFVTIIEPRTQSDISEDIVFNREICGQKF------------------------ 809
Query: 763 QEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK 822
AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 810 ---AELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGFTV 866
Query: 823 I-DPELLSIYRKRCSQLVTFKGTSMLQK 849
+ + E L + + +++F+GT+M QK
Sbjct: 867 MSNSERLEL---AGNSILSFQGTTMCQK 891
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 73/186 (39%), Gaps = 31/186 (16%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G + +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 379 GDEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 426
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 586
+G W C +C+ K +Q +A E I++ + + E E +
Sbjct: 427 EGKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPTGAEEEDDDHME 481
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
CR C K G +L CD C +H+ CL L E+P G+W C C I
Sbjct: 482 FCRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKG 531
Query: 646 VLQNLL 651
+Q +L
Sbjct: 532 RVQKIL 537
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 59/135 (43%), Gaps = 31/135 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
DGG+L+ CD CP++FH+ C +L+ IP GDW C C L D ++ + +
Sbjct: 465 DGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTG-----EGLPEDGDSSNSAQEEEE 519
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
E + ++ K RG D ++LCD C FH+ CL
Sbjct: 520 GEEETEHDQFCKVCK------------RGGD----------VILCDFCSCVFHLRCLN-- 555
Query: 622 KMADLRELPKGKWFC 636
L E+P+G W C
Sbjct: 556 --PPLGEVPEGDWKC 568
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 64/151 (42%), Gaps = 50/151 (33%)
Query: 489 NSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRF 545
NS++S QF D + LL CDGC + +H C + +IP+GDWYC C N +R
Sbjct: 1785 NSKLSNCQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHECMNKATGER- 1843
Query: 546 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 605
C++C G S SG ++L
Sbjct: 1844 --------------------------------------NCIVC-GKKSSTSGT---RLIL 1861
Query: 606 CDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C+ C R +H C+ H + + ++P+GKW+C
Sbjct: 1862 CELCPRAYHTDCI--HPI--MHKVPRGKWYC 1888
Score = 46.2 bits (108), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 10/86 (11%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQF------EAHADGGNLLPCDGC 512
DG + GY+ C + +E G CH C ++ + + ++ G L+ C+ C
Sbjct: 1807 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKATGERNCIVCGKKSSTSGTRLILCELC 1865
Query: 513 PRAFHKECAS--LSSIPQGDWYCKYC 536
PRA+H +C + +P+G WYC C
Sbjct: 1866 PRAYHTDCIHPIMHKVPRGKWYCSKC 1891
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 639
C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1790 NCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1837
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 69/181 (38%), Gaps = 40/181 (22%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 353 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWS 400
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A + +I V + E E CR C
Sbjct: 401 CPHCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC- 444
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 445 --KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKIQKIL 495
Query: 652 V 652
Sbjct: 496 T 496
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 73/186 (39%), Gaps = 31/186 (16%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G + +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 262 GDEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 309
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 586
+G W C +C+ K +Q +A E I++ + + E E +
Sbjct: 310 EGKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPMGAEEEDDDHME 364
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
CR C K G +L CD C +H+ CL L E+P G+W C C I
Sbjct: 365 FCRVC---KDGG---ELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKG 414
Query: 646 VLQNLL 651
+Q +L
Sbjct: 415 RVQRIL 420
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDSDMEKAPEGKW 383
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 428
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 429 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 478
Query: 650 LLV 652
+L+
Sbjct: 479 ILI 481
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 497
Query: 650 LLV 652
+L+
Sbjct: 498 ILI 500
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 344 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 391
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 392 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 436
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 437 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 486
Query: 650 LLV 652
+L+
Sbjct: 487 ILI 489
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
L GII GG LC CS CN +V+ +FE HA + R + +I ENGK + +++ ++
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 315 VPLPMLKATLQSALSSLPEEKSF 337
PL M+ ++ S E+ F
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFF 89
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 406
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 407 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 451
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 452 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 501
Query: 650 LLV 652
+L+
Sbjct: 502 ILI 504
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 497
Query: 650 LLV 652
+L+
Sbjct: 498 ILI 500
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 297 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 344
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 345 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 389
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 390 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 439
Query: 650 LLV 652
+L+
Sbjct: 440 ILI 442
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 497
Query: 650 LLV 652
+L+
Sbjct: 498 ILI 500
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 42/92 (45%), Gaps = 19/92 (20%)
Query: 456 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRA 515
SG DG E G A +G+ + C+ C DGG LL CD CPRA
Sbjct: 137 SGEEDGNESGETAD-----DGWADHNRWYCNICK------------DGGQLLCCDRCPRA 179
Query: 516 FHKEC--ASLSSIPQGDWYCKYCQNMFERKRF 545
FH C S+ IP +WYCK C +R+R
Sbjct: 180 FHMSCLGMSVDMIPDSEWYCKMCTECLDRRRL 211
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
GG LL CDGCPRAFH C L+ IP +W+C C
Sbjct: 1248 GGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC 1281
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 65/164 (39%), Gaps = 36/164 (21%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN----MFERKRFLQHDANAVEAGR 557
+GG LL CDGCP FH C L IP+G +C C +F ++ A + R
Sbjct: 1122 EGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVKPVFPVNGAKKNGKAASKRPR 1181
Query: 558 VSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKS----------------- 596
S + + ++ ++ K + E+E SG + S
Sbjct: 1182 RSNSPTSRRRPRKQAKLGKAKSDDSESEDSGAESDSAVSTTSSARPASRPKAPEDQWDVD 1241
Query: 597 ----GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
G G +L CD C R FHV C+ L E+P +WFC
Sbjct: 1242 CSVCGLGGE-LLCCDGCPRAFHVNCIG------LAEIPDTEWFC 1278
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 496 QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 540
+ ++H G+ GC R FH +CA L ++P DWYCK C+
Sbjct: 1295 RLDSHVICGSEDGTKGCDRVFHLKCAKLDAVPADDWYCKKCRTKL 1339
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 106/265 (40%), Gaps = 70/265 (26%)
Query: 626 LRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG--NSLE------ 675
++ P G W C C C +V++++ K +KKY G + LE
Sbjct: 136 IKMFPHGDWHCPNCT-CKFCRAVVEDVSQTVGAKCL---FEGVKKYVGVKHELEARFSWS 191
Query: 676 ------TVSDIDVRWRLLSGKAATPETRLLLSQA----VAIFHDCFDPIVDSISGRDLIP 725
T SD +RW TP QA + + +CF PI+D SG
Sbjct: 192 LVHRECTDSDFILRW--------TPSYCGKQFQAGHSSLTVMDECFLPIIDRRSG----- 238
Query: 726 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVF-GQEVAELPLVATSKINHGKGYFQ 784
G YC + L++F G +AE+ + T + +G +
Sbjct: 239 --------------GKYC----------TKCPLQLFHGNRLAEMQFIGTRHVYRHQGMCR 274
Query: 785 LLFACIE------KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL 838
LF+ +E K L L+V+ +V+PA + +W KFGFK ++ L R L
Sbjct: 275 RLFSVVESMSFDVKTLQNLKVELLVIPATADLSHVWISKFGFKYVEDSLKK--ELRSMNL 332
Query: 839 VTFKGTSMLQKRVPACRIGSSSTDS 863
+ F G +LQK + A R S+ D+
Sbjct: 333 LAFPGIDVLQKELLAPRHAKSAADT 357
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 497
Query: 650 LLV 652
+L+
Sbjct: 498 ILI 500
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 360 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 407
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 408 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 452
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 453 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 502
Query: 650 LLV 652
+L+
Sbjct: 503 ILI 505
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 406
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 407 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 451
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 452 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 501
Query: 650 LLV 652
+L+
Sbjct: 502 ILI 504
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 352 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 399
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 400 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 444
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 445 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 494
Query: 650 LLV 652
+L+
Sbjct: 495 ILI 497
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 504
Query: 650 LLV 652
+L+
Sbjct: 505 ILI 507
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 59.7 bits (143), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 75/183 (40%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 356 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 403
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 404 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 448
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 449 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 498
Query: 650 LLV 652
+L+
Sbjct: 499 ILI 501
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 346 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 393
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 394 CPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC---- 436
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 437 --KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 486
Query: 651 LV 652
L
Sbjct: 487 LT 488
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 75/183 (40%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 363 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 410
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 411 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 455
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 456 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 505
Query: 650 LLV 652
+L+
Sbjct: 506 ILI 508
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 59.3 bits (142), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 75/183 (40%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 363 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 410
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 411 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 455
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 649
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 456 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 505
Query: 650 LLV 652
+L+
Sbjct: 506 ILI 508
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 346 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 393
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 394 CPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC---- 436
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 437 --KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 486
Query: 651 LV 652
L
Sbjct: 487 LT 488
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 59.3 bits (142), Expect = 9e-06, Method: Composition-based stats.
Identities = 30/97 (30%), Positives = 37/97 (38%), Gaps = 46/97 (47%)
Query: 483 IICHCCNSEVSPSQFEAHA----------------------------------------- 501
I+C CC E+SPSQFE+HA
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 502 -----DGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 533
DGG+L+ C CPRAFH C L P+G W+C
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 55/134 (41%), Gaps = 49/134 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP+GDWYC C N +R
Sbjct: 1949 LLLCDGCDKGYHTYCFKPKMENIPEGDWYCHECMNKATGER------------------- 1989
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S SG ++LC+ C R +H C+ H +
Sbjct: 1990 --------------------NCIVC-GKKSSTSGT---RLILCELCPRAYHTDCI--HPI 2023
Query: 624 ADLRELPKGKWFCC 637
+ ++P+GKW+C
Sbjct: 2024 --MHKVPRGKWYCS 2035
Score = 46.6 bits (109), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 10/86 (11%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQF------EAHADGGNLLPCDGC 512
DG + GY+ C + +E G CH C ++ + + ++ G L+ C+ C
Sbjct: 1953 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKATGERNCIVCGKKSSTSGTRLILCELC 2011
Query: 513 PRAFHKECAS--LSSIPQGDWYCKYC 536
PRA+H +C + +P+G WYC C
Sbjct: 2012 PRAYHTDCIHPIMHKVPRGKWYCSKC 2037
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 6/57 (10%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 639
++ C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1931 SIMKANCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1983
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 71/184 (38%), Gaps = 47/184 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 347 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 394
Query: 533 CKYCQNM---FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 589
C +C+ M +E + DA+ E +G ++ E + + C +C+
Sbjct: 395 CPHCEKMGIQWEAR----EDASEGEEDNEAGGEAEED------------DHHMEFCRVCK 438
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQ 648
+L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 439 DGG---------ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCTCPSMKGKVQ 485
Query: 649 NLLV 652
+L
Sbjct: 486 KILT 489
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 40/166 (24%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 40/166 (24%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 69/154 (44%), Gaps = 29/154 (18%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A + G
Sbjct: 366 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS---EG 417
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 619
+ +E++ +LE E + CR C K G +L CD C +H+ CL
Sbjct: 418 EEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCLN 464
Query: 620 KHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 652
L E+P G+W C C + +Q +L+
Sbjct: 465 ----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 494
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 53/136 (38%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDW+C C+N +R
Sbjct: 1623 LLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQR------------------- 1663
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI-LLCDQCEREFHVGCLKKHK 622
C++C G +TI +LCDQC + +H+ CL+
Sbjct: 1664 --------------------NCIVC-------GKPGNKTISVLCDQCPKAYHIECLQ--- 1693
Query: 623 MADLRELPKGKWFCCM 638
L ++P+GKW C +
Sbjct: 1694 -PPLAKVPRGKWLCVL 1708
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 8/84 (9%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV- 558
DGG LL CD CPRAFH C +S IP +WYCK C +R+R + + E RV
Sbjct: 166 DGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMCSECLDRRRLKK---ESKEKARVM 222
Query: 559 SGVDSVEQITKRCIRIVKNLEAEL 582
+ +E+ +R R+ + + EL
Sbjct: 223 RETEKLERDARR--RMAEQMREEL 244
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
GG LL CDGCPRAFH C L IP+ +W+C C
Sbjct: 1201 GGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC 1234
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 496 QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+ ++H G+ GC R FH +CA L ++P DWYCK C+
Sbjct: 1248 RLDSHVICGSEDGTKGCDRVFHLKCAKLDAVPADDWYCKKCR 1289
Score = 44.3 bits (103), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
++GG L+ CDGCP FH C L +P+G +C C
Sbjct: 1066 SEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 68/166 (40%), Gaps = 40/166 (24%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 357 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 404
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 590
C +C+ K +Q +A + G +++E V + E E + CR
Sbjct: 405 SCPHCE-----KEGIQWEAKEDNS---EGEETMED-------AVGDAEEEDDHHMEFCRV 449
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 450 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLC 485
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 452 DGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 700 LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGI 757
++ AVA+ +CF+P++D + +++ S++Y G N F G Y AIL +S
Sbjct: 22 IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVAS 81
Query: 758 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 796
+R+ G ++AE+P +AT +G L IE + S
Sbjct: 82 MRIHGNKLAEMPFIATRPSYRRQGMCHKLLVAIESVSSL 120
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 366 MDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 413
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 414 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 459
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 460 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 509
Query: 651 LV 652
L+
Sbjct: 510 LI 511
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 71/187 (37%), Gaps = 33/187 (17%)
Query: 468 ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSS 525
A G + +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 373 ALGDEDGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELEK 420
Query: 526 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 585
P+G W C +C+ K +Q +A + + + + + + + C
Sbjct: 421 APEGKWSCPHCE-----KEGIQWEAKEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFC 475
Query: 586 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRIN 644
+C+ +L CD C +H+ CL L E+P G+W C C I
Sbjct: 476 RVCKDGG---------ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIK 522
Query: 645 SVLQNLL 651
+Q +L
Sbjct: 523 GRVQKIL 529
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRV 558
DGG L+ CDGCPRAFH C L+SIP+G W C+ CQ N + + + A E
Sbjct: 300 DGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLCQSNRLKDRTYTHVQPPATETSSG 359
Query: 559 SGVD 562
S VD
Sbjct: 360 SAVD 363
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGGDA------EEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2616 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2656
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2657 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2688
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2689 PPLLKVPRGKWYC 2701
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2670 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2704
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2602 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2645
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2580 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2620
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2621 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2652
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2653 PPLLKVPRGKWYC 2665
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2634 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2668
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2566 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2609
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2669 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2709
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2710 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2741
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2742 PPLLKVPRGKWYC 2754
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2723 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2757
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2651 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2698
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2662 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2702
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2703 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2734
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2735 PPLLKVPRGKWYC 2747
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2716 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2750
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2648 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2691
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2602 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2642
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2643 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2674
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2675 PPLLKVPRGKWYC 2687
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2656 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2690
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2588 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2631
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2732 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2772
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2773 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2804
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2805 PPLLKVPRGKWYC 2817
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2786 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2820
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2714 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2761
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2466 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2506
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2507 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2538
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2539 PPLLKVPRGKWYC 2551
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2520 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2554
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2448 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2495
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2653 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2693
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2694 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2725
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2726 PPLLKVPRGKWYC 2738
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2707 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2741
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2639 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2682
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2648 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2688
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2689 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2720
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2721 PPLLKVPRGKWYC 2733
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2702 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2736
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2634 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2677
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 MDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2521 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2561
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2562 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2593
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2594 PPLLKVPRGKWYC 2606
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2575 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2609
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2503 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2550
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2885 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2925
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2926 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2957
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2958 PPLLKVPRGKWYC 2970
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2939 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2973
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2871 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2914
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2554 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2594
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2595 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2626
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2627 PPLLKVPRGKWYC 2639
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2608 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2642
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2536 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2583
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2889 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2929
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2930 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2961
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2962 PPLLKVPRGKWYC 2974
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2943 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2977
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2875 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2918
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 383
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 429
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 430 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 479
Query: 651 LV 652
L+
Sbjct: 480 LI 481
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 498
Query: 651 LV 652
L+
Sbjct: 499 LI 500
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 498
Query: 651 LV 652
L+
Sbjct: 499 LI 500
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 67/170 (39%), Gaps = 30/170 (17%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 368 GEEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 415
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 586
+G W C +C+ K +Q +A + I++ + + E +
Sbjct: 416 EGKWSCPHCE-----KEGIQWEAKDEDFEDFEEDSEDRVISEVSSGVPAGGDDEDDDHME 470
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CR C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 471 FCRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLC 510
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 477 DGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 513
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 498
Query: 651 LV 652
L+
Sbjct: 499 LI 500
>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
Length = 3086
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2597 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2637
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2638 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2669
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2670 PPLLKVPRGKWYC 2682
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2651 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2685
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2579 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2626
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 356 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 403
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 404 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 449
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 450 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 499
Query: 651 LV 652
L+
Sbjct: 500 LI 501
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 498
Query: 651 LV 652
L+
Sbjct: 499 LI 500
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 498
Query: 651 LV 652
L+
Sbjct: 499 LI 500
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 383 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 430
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 431 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 476
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 477 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 526
Query: 651 LV 652
L+
Sbjct: 527 LI 528
>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
Length = 2123
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 1645 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 1685
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 1686 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 1717
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 1718 PPLLKVPRGKWYC 1730
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 1699 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 1733
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 586 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 1628 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 1674
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 343 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 390
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 391 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 436
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 437 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 486
Query: 651 LV 652
L+
Sbjct: 487 LI 488
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 383
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 429
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 430 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 479
Query: 651 LV 652
L+
Sbjct: 480 LI 481
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2632 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2672
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 2673 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2704
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 2705 PPLLKVPRGKWYC 2717
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2686 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2720
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2614 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2661
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 413 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 460
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 461 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 506
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 507 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 556
Query: 651 LV 652
L+
Sbjct: 557 LI 558
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 64/164 (39%), Gaps = 40/164 (24%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 420
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A R D+ E + +E + CR C
Sbjct: 421 CPHCE-----KEGIQWEA------REDVSDAEEDNGE-----TGEMEEDDHHMEFCRVC- 463
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 464 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWIC 498
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 501
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 66/164 (40%), Gaps = 36/164 (21%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 297 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 344
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A +S + ++ +R + + + + C +C+
Sbjct: 345 CPHCE-----KEGIQWEAR----DDLSDGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG 395
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+L CD C +H+ CL L E+P G+W C
Sbjct: 396 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 426
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 393 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 430
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 651 LV 652
L+
Sbjct: 506 LI 507
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 63/153 (41%), Gaps = 28/153 (18%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A
Sbjct: 370 GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-----KEGVQWEAK--------- 415
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ ++ V + E E CR C K G +L CD C +H+ CL
Sbjct: 416 -EDNSELDDDLDDAVGDPEEEDHHMEFCRVC---KDG---GELLCCDVCPSSYHIHCLN- 467
Query: 621 HKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 652
L E+P G+W C C + +Q +L
Sbjct: 468 ---PPLPEIPNGEWLCPRCTCPPLKGKIQKILT 497
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 59/143 (41%), Gaps = 24/143 (16%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
D LL CD C + FH C +LS +P+GDW C C+ R+ D + G S
Sbjct: 1100 DEDKLLLCDECNQPFHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDYAELNGG--S 1157
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 619
D ++ + E+E +C CD + ++ C +C +H C
Sbjct: 1158 DSDEYDE--------TDSDESEAEHDEMCCMCD------DDQELVYCSRCPAAYHRECHD 1203
Query: 620 KHKMADLRELPKGKWFC--CMDC 640
LR P+GKW C C +C
Sbjct: 1204 ----PPLRNFPRGKWVCSACTNC 1222
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 10/57 (17%)
Query: 580 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
AE + C +CR K G + +LLCD+C + FH+ CL+ L +PKG W C
Sbjct: 1087 AENAKCKICR-----KKGDEDK-LLLCDECNQPFHLYCLR----PALSYVPKGDWMC 1133
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 205 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 245
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G S G ++ CD C R +H C +
Sbjct: 246 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 277
Query: 624 ADLRELPKGKWFC 636
L ++P+GKW+C
Sbjct: 278 PPLLKVPRGKWYC 290
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 259 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 293
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKP----KMDNIPDGDWYC 234
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 10/135 (7%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
G NLL D +K A+ + +G C YCQ+ ER RF QH+ + + G
Sbjct: 164 GDNLLSLDH--NDIYKTGATYDN-DKGILLCDYCQSTVERNRFGQHEELLI--CKDCGNK 218
Query: 563 SVEQITKRCIRIVKNLEAELSG-CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
+ +V+ + ++ S C+ C+ C + P T+L CD C++ +H+ C +
Sbjct: 219 AHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDKGYHMNCHE-- 276
Query: 622 KMADLRELPKGKWFC 636
L ++P GKW C
Sbjct: 277 --PKLTQMPSGKWAC 289
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 281 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 328
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 329 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 374
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 375 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 424
Query: 651 LV 652
L+
Sbjct: 425 LI 426
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 67/155 (43%), Gaps = 27/155 (17%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 558
+ GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A +
Sbjct: 288 SQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS--- 339
Query: 559 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 618
G + +E++ E + CR C K G +L CD C +H+ CL
Sbjct: 340 EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCL 387
Query: 619 KKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 652
L E+P G+W C C + +Q +L+
Sbjct: 388 N----PPLPEIPNGEWLCPRCTCPSLKGKVQKILI 418
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 66/153 (43%), Gaps = 27/153 (17%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A + G
Sbjct: 233 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS---EG 284
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ +E++ E + CR C K G +L CD C +H+ CL
Sbjct: 285 EEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCLN- 331
Query: 621 HKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 652
L E+P G+W C C + +Q +L+
Sbjct: 332 ---PPLPEIPNGEWLCPRCTCPALKGKVQKILI 361
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 66/165 (40%), Gaps = 38/165 (23%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|14586370|emb|CAC42901.1| putative protein [Arabidopsis thaliana]
Length = 1595
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 489 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 539
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 596 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 653
Query: 540 FERK 543
F+R+
Sbjct: 654 FDRR 657
>gi|334187637|ref|NP_568273.2| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
gi|332004422|gb|AED91805.1| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
Length = 1602
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 489 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 539
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 596 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 653
Query: 540 FERK 543
F+R+
Sbjct: 654 FDRR 657
>gi|297807283|ref|XP_002871525.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
gi|297317362|gb|EFH47784.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
Length = 1581
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 489 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 539
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 593 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 650
Query: 540 FERK 543
F+R+
Sbjct: 651 FDRR 654
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 69/152 (45%), Gaps = 38/152 (25%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE------------RKRFLQHD 549
+GG ++ CDGC R FH C ++ +P+G YCK+C R++ L+
Sbjct: 1085 EGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCSEGDTKGAEEKSVGGDGRRQSLRLS 1144
Query: 550 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG-----CLLCRGCDFSKSGFGPRTIL 604
A+ GR V+ ++I R + LE+ G C +C+ +G +L
Sbjct: 1145 AD----GRHDDVEENDEI--RMKSSNRELESGAVGPWDVECFICK-------LYGE--LL 1189
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CD C + FH+ C+ ++ P+ +WFC
Sbjct: 1190 GCDGCPKAFHLACI------GIKSWPQEEWFC 1215
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 54/133 (40%), Gaps = 49/133 (36%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
G LL CDGCP+AFH C + S PQ +W+C C ++ V G +
Sbjct: 1186 GELLGCDGCPKAFHLACIGIKSWPQEEWFCDECD---------------MQTCGVCGRNK 1230
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
+ +L+ ++C D SK C++ FH+ C+K
Sbjct: 1231 I----------------KLNSHVICGSEDGSKG------------CDKVFHLKCVK---- 1258
Query: 624 ADLRELPKGKWFC 636
L ++P+ WFC
Sbjct: 1259 --LEKVPESDWFC 1269
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 26/56 (46%), Gaps = 21/56 (37%)
Query: 502 DGGNLLPCDGCPRAFH-----------------KECASL----SSIPQGDWYCKYC 536
DGG LL CD CPRAFH ASL IP+ +WYCK+C
Sbjct: 141 DGGELLCCDRCPRAFHLKWYVGCFPSAVVAHQASRYASLGLQKEEIPESEWYCKFC 196
>gi|307169034|gb|EFN61879.1| Bromodomain adjacent to zinc finger domain protein 2B [Camponotus
floridanus]
Length = 2352
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 59/153 (38%), Gaps = 53/153 (34%)
Query: 488 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2062 TTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2121
Query: 545 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
CL+C ++G + ++
Sbjct: 2122 ---------------------------------------NCLVC----GKRAG---KNLV 2135
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 637
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2136 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2164
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2085 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRAGKNLVLCELCPRA 2143
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 2144 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2174
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 566 QITKRCIRI-VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 624
++ RC+ + N +L C F SG +LLCD C+R +H C +
Sbjct: 2043 KLRNRCVSLKATNQYKQLLTTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----P 2098
Query: 625 DLRELPKGKWFC--CMD 639
+ +P G W+C CM+
Sbjct: 2099 KMENIPDGDWYCHECMN 2115
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 474 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 531
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 107 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 154
Query: 532 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 591
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 155 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 200
Query: 592 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 649
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 201 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 249
Query: 650 LLV 652
+L+
Sbjct: 250 ILI 252
>gi|332022570|gb|EGI62872.1| Bromodomain adjacent to zinc finger domain protein 2B [Acromyrmex
echinatior]
Length = 2202
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 52/134 (38%), Gaps = 52/134 (38%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1931 LLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------------------- 1971
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
CL+C ++G + ++LC+ C R +H C H
Sbjct: 1972 --------------------NCLVC----GKRAG---KNLVLCELCPRAYHTDC---HNP 2001
Query: 624 ADLRELPKGKWFCC 637
+ ++P+GKW+C
Sbjct: 2002 V-MPKMPRGKWYCS 2014
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 1935 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRAGKNLVLCELCPRA 1993
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 1994 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2024
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 578 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 635
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1904 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 1959
Query: 636 C--CMD 639
C CM+
Sbjct: 1960 CHECMN 1965
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 71/181 (39%), Gaps = 42/181 (23%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 405 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 452
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E G D+ + V +E + CR C
Sbjct: 453 CPHCE-----KEGIQWEAR--EEGSEGEDDNGD---------VGEMEDD-HHMEFCRVC- 494
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 495 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCTCPSLKGKVQRIL 545
Query: 652 V 652
Sbjct: 546 T 546
>gi|340709835|ref|XP_003393506.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Bombus terrestris]
Length = 2263
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 53/153 (34%)
Query: 488 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1973 TTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2032
Query: 545 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
CL+C G K+ ++
Sbjct: 2033 ---------------------------------------NCLVC-GKRVGKN------LV 2046
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 637
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2047 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2075
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 1996 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2054
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 2055 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2085
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 639
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 1979 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2026
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 51/134 (38%), Gaps = 52/134 (38%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2001 LLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------------------- 2041
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
CL+C G K+ ++LC+ C R +H C H
Sbjct: 2042 --------------------NCLVC-GKRVGKN------LVLCELCPRAYHTDC---HNP 2071
Query: 624 ADLRELPKGKWFCC 637
+ ++P+GKW+C
Sbjct: 2072 V-MPKMPRGKWYCS 2084
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2005 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2063
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 2064 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2094
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 578 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 635
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1974 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 2029
Query: 636 C--CMD 639
C CM+
Sbjct: 2030 CHECMN 2035
>gi|328792710|ref|XP_623473.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis mellifera]
Length = 2293
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 53/153 (34%)
Query: 488 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2003 TTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2062
Query: 545 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
CL+C G K+ ++
Sbjct: 2063 ---------------------------------------NCLVC-GKRVGKN------LV 2076
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 637
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2077 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2105
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2026 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2084
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 2085 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2115
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 639
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2009 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2056
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 64/137 (46%), Gaps = 13/137 (9%)
Query: 681 DVRWRLLSGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR- 731
++ W LL A+ E LS A+ + H+CF P+ + + D + +++ +
Sbjct: 29 NLTWTLLKSNHSSDHKPDASDIENYSKLSIALHVMHECFQPVEEPRTKGDFLKDVIFRKR 88
Query: 732 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 790
L F G Y +L + ++ +RV+G++VAE+PLV T G +L +
Sbjct: 89 SELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVAEIPLVGTRVQYRRLGMCGILMNVL 148
Query: 791 EKLL---SFLRVKSIVL 804
EK L SFL + V+
Sbjct: 149 EKNLKDYSFLDFQDTVM 165
>gi|350407087|ref|XP_003487980.1| PREDICTED: hypothetical protein LOC100749908 [Bombus impatiens]
Length = 2303
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 51/134 (38%), Gaps = 52/134 (38%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2032 LLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------------------- 2072
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
CL+C G K+ ++LC+ C R +H C H
Sbjct: 2073 --------------------NCLVC-GKRVGKN------LVLCELCPRAYHTDC---HNP 2102
Query: 624 ADLRELPKGKWFCC 637
+ ++P+GKW+C
Sbjct: 2103 V-MPKMPRGKWYCS 2115
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 515
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2036 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2094
Query: 516 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 544
+H +C + + +P+G WYC C + +KR
Sbjct: 2095 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2125
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 639
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2019 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2066
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 72/182 (39%), Gaps = 44/182 (24%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 249 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 296
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E G D+ + V +E + CR C
Sbjct: 297 CPHCE-----KEGIQWEAR--EEGSEGDEDNGD---------VGEMEDD-HHMEFCRVC- 338
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 339 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 388
Query: 651 LV 652
L
Sbjct: 389 LT 390
>gi|224090647|ref|XP_002309044.1| predicted protein [Populus trichocarpa]
gi|222855020|gb|EEE92567.1| predicted protein [Populus trichocarpa]
Length = 467
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
KK ++K +KK+ N P V L TG+LDGV V Y+ Q LRG+I+ G LC
Sbjct: 378 KKKDDIKTAKKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWS--QEKELRGVIKGSGYLCG 435
Query: 268 CSLCNGCRVI 277
C CN +VI
Sbjct: 436 CQTCNFSKVI 445
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 66/164 (40%), Gaps = 36/164 (21%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 378 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 425
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A +S + ++ +R + + + + C +C+
Sbjct: 426 CPHCE-----KEGIQWEAK----DELSEGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG 476
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+L CD C +H+ CL L E+P G+W C
Sbjct: 477 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 507
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 474 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 511
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 26/136 (19%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C + P+G W C +C+ + + ++ E G
Sbjct: 268 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVG 327
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D E+ ++E CR C K G +L CD C +H+ CL
Sbjct: 328 GDPEEEDD-------HHME-------FCRVC---KDG---GELLCCDPCPSSYHIHCLN- 366
Query: 621 HKMADLRELPKGKWFC 636
L E+P G+W C
Sbjct: 367 ---PPLPEIPNGEWLC 379
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C +C
Sbjct: 346 DGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQ 547
DGG L+ CDGCPRAFH C LSSIP G W C++C+ +K Q
Sbjct: 268 DGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCRGHRVKKEKAQ 315
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 320 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 367
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 368 CPHCE-----KEGIQWEAKDDEEEEEEAPGEEEDDHMEFCRVCKD-----GGELLC---- 413
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 414 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 455
Query: 651 L 651
L
Sbjct: 456 L 456
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 344 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 391
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 392 CPHCE-----KEGIQWEAKDDEEEEDEVAGEEEDDHMEFCRVCKD-----GGELLC---- 437
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 438 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 479
Query: 651 L 651
L
Sbjct: 480 L 480
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 65/164 (39%), Gaps = 36/164 (21%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 379 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 426
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A +S + + +R + + + + C +C+
Sbjct: 427 CPHCE-----KEGIQWEAR----DDLSEAEGEDDDDRRDEGMEEEDDHHIEFCRVCKDGG 477
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+L CD C +H+ CL L E+P G+W C
Sbjct: 478 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 508
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 475 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 512
>gi|380485054|emb|CCF39608.1| origin recognition complex subunit 4 [Colletotrichum higginsianum]
Length = 863
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 8/73 (10%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
G +L CDGC +A+H++C + +P+GDWYC C Q + A AG +
Sbjct: 359 GNRILFCDGCDKAYHQKCYKVPKVPRGDWYCNEC--------VQQKQSRAAAAGEAVKIP 410
Query: 563 SVEQITKRCIRIV 575
+VEQ R++
Sbjct: 411 NVEQHLNSLKRVL 423
Score = 43.1 bits (100), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 642
CL+C D SK G IL CD C++ +H C K K +P+G W+C C+ +
Sbjct: 348 CLVCSKPD-SKPG---NRILFCDGCDKAYHQKCYKVPK------VPRGDWYCNECVQQKQ 397
Query: 643 INSVLQNLLVQEAEKLP--EFHLNAIKK 668
+ EA K+P E HLN++K+
Sbjct: 398 SRAAAAG----EAVKIPNVEQHLNSLKR 421
>gi|348690302|gb|EGZ30116.1| hypothetical protein PHYSODRAFT_474458 [Phytophthora sojae]
Length = 239
Score = 53.9 bits (128), Expect = 3e-04, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 503 GGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 538
GG LL CDGC RA+H C SL +P+GDW+C YC++
Sbjct: 200 GGKLLCCDGCERAYHLNCVRPSLLDVPEGDWFCPYCRD 237
Score = 40.0 bits (92), Expect = 6.1, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 641
+L CD CER +H+ C++ L ++P+G WFC C D S
Sbjct: 203 LLCCDGCERAYHLNCVR----PSLLDVPEGDWFCPYCRDAS 239
>gi|345489407|ref|XP_001604290.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Nasonia vitripennis]
Length = 1443
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 60/154 (38%), Gaps = 32/154 (20%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-----------------QNMFER 542
D N+L CDGC + H C L+S+P GDW+C C ++ E
Sbjct: 1075 DAENMLLCDGCNKGHHLYCLKPKLTSVPAGDWFCHLCKPRETKAKEKAKKRRKFEDEIEE 1134
Query: 543 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 602
L + A RV V+S E ++ E E LC C+
Sbjct: 1135 DTTLTKETRHNRAKRV--VESDEDAEADSDE-NQDEEMEHETTQLCSVCE------SDGK 1185
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ CD C + FH CL+ L P+G+W C
Sbjct: 1186 LIECDMCSKFFHTDCLE----PPLARAPRGRWSC 1215
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 322 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELEKAPEGKWS 369
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 370 CPHCE-----KEGIQWEAKGEEEEEEEAAGEEEDDHMEFCRVCKD-----GGELLC---- 415
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 416 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 457
Query: 651 L 651
L
Sbjct: 458 L 458
>gi|345489409|ref|XP_003426132.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Nasonia vitripennis]
Length = 1407
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 60/154 (38%), Gaps = 32/154 (20%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-----------------QNMFER 542
D N+L CDGC + H C L+S+P GDW+C C ++ E
Sbjct: 1075 DAENMLLCDGCNKGHHLYCLKPKLTSVPAGDWFCHLCKPRETKAKEKAKKRRKFEDEIEE 1134
Query: 543 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 602
L + A RV V+S E ++ E E LC C+
Sbjct: 1135 DTTLTKETRHNRAKRV--VESDEDAEADSDE-NQDEEMEHETTQLCSVCE------SDGK 1185
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ CD C + FH CL+ L P+G+W C
Sbjct: 1186 LIECDMCSKFFHTDCLE----PPLARAPRGRWSC 1215
>gi|383863769|ref|XP_003707352.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Megachile rotundata]
Length = 1448
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 35/161 (21%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 558
D +L CDGC + H C LS++P+GDWYCK C+ + K L+ E
Sbjct: 1082 GDAEKMLLCDGCNKGHHLYCLKPKLSTVPEGDWYCKVCKPPIKSKEKLKQRKKFEEELEE 1141
Query: 559 SGVDSVEQITKRCIRIVKNL-------------------EAELSGCLLCRGCDFSKSGFG 599
+ + E R RI+++ +++ C CR SG
Sbjct: 1142 EVILTKETRHNRAKRILESEGEEERDDDELEEDSDMDISSQQVNVCTACR------SG-- 1193
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
++ CD C +H+ C++ + P+GKW C DC
Sbjct: 1194 -GKLISCDACSSYYHIECIE----PPIARAPRGKW-SCSDC 1228
>gi|302832357|ref|XP_002947743.1| hypothetical protein VOLCADRAFT_87924 [Volvox carteri f.
nagariensis]
gi|300267091|gb|EFJ51276.1| hypothetical protein VOLCADRAFT_87924 [Volvox carteri f.
nagariensis]
Length = 305
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 11/118 (9%)
Query: 742 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 801
+ A+LT + VSA I+ V+G + AEL LVAT + +GY L + L + V+
Sbjct: 190 FAALLTEGTPAVSAAIVDVYGADAAELYLVATRTVLQRQGYGTSLVRQLSAELGKIGVRR 249
Query: 802 IVLPA---AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTF-----KGTSMLQKRV 851
+++ EE + +W DKFGFK + +Y CS TF KGT L +++
Sbjct: 250 LLVSVDDDDEENQRLWRDKFGFKSLSAS--ELYELGCS-FGTFSAPATKGTVFLVRKL 304
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 68/181 (37%), Gaps = 48/181 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 497 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 544
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A E V E R+ K+ G LLC
Sbjct: 545 CPHCE-----KEGIQWEAKDEEEDEEEPVGEEEDDHMEFCRVCKD-----GGELLC---- 590
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 650
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 591 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 632
Query: 651 L 651
L
Sbjct: 633 L 633
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 69/183 (37%), Gaps = 42/183 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ--------------NMFERKRF 545
DGG+LL CD C ++FH C + L IP+GDWYC C+ + K +
Sbjct: 74 DGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSCKYKKSKTNVTKSPSTTIINNKTY 133
Query: 546 LQHDANAVE---------------AGRVSGVDSVEQITKRCIRI----VKNLEAELSGCL 586
+ + E + S VD++ + ++ KN+ S L
Sbjct: 134 FKESEQSPEEMSPPYLPISSSPIGSTMSSLVDNLSSVNPSTFQLPQEYTKNVNRNSSKKL 193
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 646
C C+ S S G IL C++C +H C+ + + W C I S
Sbjct: 194 NCLVCEES-SNSG--DILQCNKCNAAYHSTCVDSSSLGNKT----SAWLCPKHNKNIESQ 246
Query: 647 LQN 649
N
Sbjct: 247 SAN 249
>gi|255544948|ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis]
gi|223547443|gb|EEF48938.1| hypothetical protein RCOM_1578820 [Ricinus communis]
Length = 1915
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 4/61 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNL+ CDGCP A+H +C ++ S+P+GDW+C C +R + N++ + GV
Sbjct: 741 GNLICCDGCPAAYHSKCVGVANDSLPEGDWFCPEC--AIDRHKPWMKTRNSLRGAELLGV 798
Query: 562 D 562
D
Sbjct: 799 D 799
>gi|350410934|ref|XP_003489181.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Bombus impatiens]
Length = 1416
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 547
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 548 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSQQINVCCICKSG---GKLI 1195
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG L+ CD C +H EC L+ P+G W C C++ +RK +++D++ E
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKDRKTNIRYDSSTSE------ 1244
Query: 561 VDSVEQITKRCIRIVKNLEAE-----LSGCL 586
D+ + T+R + +E E + GC+
Sbjct: 1245 -DTEPRQTRRAAKRAAEIEQEEDKGTIKGCM 1274
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 44/126 (34%), Gaps = 50/126 (39%)
Query: 488 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 547
C+ E S DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 459 CSEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------- 508
Query: 548 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTIL 604
C +C G D + GF +TI+
Sbjct: 509 -------------------------------------CGVCGGSDLDDDTAEGFTDKTII 531
Query: 605 LCDQCE 610
C+QCE
Sbjct: 532 YCEQCE 537
>gi|340714616|ref|XP_003395822.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Bombus terrestris]
Length = 1416
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 547
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 548 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSRQINVCCICKSG---GKLI 1195
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG L+ CD C +H EC L+ P+G W C C++ ERK +++D++ E
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKERKTNIRYDSSTSE------ 1244
Query: 561 VDSVEQITKRCIRIVKNLEAE-----LSGCL 586
D+ + T+R + +E E + GC+
Sbjct: 1245 -DTEPRQTRRAAKRAAEIEQEEDKGTIKGCM 1274
>gi|380025897|ref|XP_003696700.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Apis florea]
Length = 1447
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 64/157 (40%), Gaps = 29/157 (18%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-------QNMFERKRF------- 545
DG +L CDGC + H C LS +P GDWYCK C + + +RK+F
Sbjct: 1075 DGDKMLLCDGCNKGHHLYCLQPKLSCVPDGDWYCKVCKPSTKPKEKIXKRKKFEDELEED 1134
Query: 546 --LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 603
L + A RV + + + + +C C KSG +
Sbjct: 1135 VILTKETRHNRAKRVLESEEEDNSEDEELEEDSDDNISNQQINVCSAC---KSG---GKL 1188
Query: 604 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
+ CD C +H+ C++ + P+G+W C DC
Sbjct: 1189 ISCDICPNFYHIECIE----PPITRAPRGRWICS-DC 1220
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 548
GG L+ CD CP +H EC ++ P+G W C C++ +RK +++
Sbjct: 1185 GGKLISCDICPNFYHIECIEPPITRAPRGRWICSDCKDRRDRKMNIKY 1232
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 53/136 (38%), Gaps = 37/136 (27%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C + P G W C +C+N L +D +AV + +
Sbjct: 348 GGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCEND------LVNDNDAVTSKEAAP 401
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ + C LCR +L CD C +H CL
Sbjct: 402 AKA----------------GNMEFCRLCRDGG---------ELLCCDSCPSSYHRYCL-- 434
Query: 621 HKMADLRELPKGKWFC 636
+ L +P+G W C
Sbjct: 435 --IPPLTTIPEGDWHC 448
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG LL CD CP ++H+ C L++IP+GDW+C C
Sbjct: 415 DGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|341902027|gb|EGT57962.1| hypothetical protein CAEBREN_23443 [Caenorhabditis brenneri]
Length = 642
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 566 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 625
++ R +R+VK+ E C+ CR C +I+ CD C+R FH C A
Sbjct: 556 EMPDRMVRVVKSYEW---NCIECRTCSICHKKDNEDSIVSCDWCDRAFHYLC------AG 606
Query: 626 LRELPKGKWFCCMDCS 641
LR +P+G W C + CS
Sbjct: 607 LRAMPRGMWMCQVYCS 622
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 13/54 (24%)
Query: 484 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK-YC 536
ICH ++E S ++ CD C RAFH CA L ++P+G W C+ YC
Sbjct: 580 ICHKKDNEDS------------IVSCDWCDRAFHYLCAGLRAMPRGMWMCQVYC 621
>gi|350410937|ref|XP_003489182.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Bombus impatiens]
Length = 1454
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 547
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 548 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSQQINVCCICKSG---GKLI 1195
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 40.0 bits (92), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 548
GG L+ CD C +H EC L+ P+G W C C++ +RK +++
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKDRKTNIRY 1238
>gi|340714618|ref|XP_003395823.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Bombus terrestris]
Length = 1454
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 547
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 548 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 604
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSRQINVCCICKSG---GKLI 1195
Query: 605 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 548
GG L+ CD C +H EC L+ P+G W C C++ ERK +++
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKERKTNIRY 1238
>gi|221484244|gb|EEE22540.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
gi|221505773|gb|EEE31418.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 527
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE----AGRVSGV 561
+L CDGC A H+ C + ++P+ DWYC+YC+ + K AN + A + SG
Sbjct: 193 MLLCDGCDVAVHQTCYYVKTVPKADWYCQYCEEKNQAK------ANVAKLQRLAAKASGK 246
Query: 562 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREF----HVG 616
+ +Q+ V L E+ S C+L + C FG +C +F HV
Sbjct: 247 ATDKQVETTLKTEVDRLTGEMESVCVLPKRCPLCPRSFGAHV-----RCGEDFRMWVHVN 301
Query: 617 C 617
C
Sbjct: 302 C 302
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 541 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 595
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 596 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 638
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 50/134 (37%), Gaps = 54/134 (40%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + SIP GDWYC C+N ++
Sbjct: 1696 LLLCDGCDRGYHMYCFKPKMESIPDGDWYCHECKNKSNGEK------------------- 1736
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP-RTILLCDQCEREFHVGCLKKHK 622
C++C G P + ++C+ C R +H+ CL
Sbjct: 1737 --------------------NCIVC--------GKRPIKNYVICEHCPRIYHIECLN--- 1765
Query: 623 MADLRELPKGKWFC 636
L ++P+ KW C
Sbjct: 1766 -PPLSKVPRAKWNC 1778
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 586 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F +SG +LLCD C+R +H+ C K + +P G W+C
Sbjct: 1679 IMKANCQFCQSGDNEDKLLLCDGCDRGYHMYCFK----PKMESIPDGDWYC 1725
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 514 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 568
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 569 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 611
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 431 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILGT 487
Query: 562 D 562
D
Sbjct: 488 D 488
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYC 536
GNL+ CDGCP AFH C ++ S+P+GDWYC C
Sbjct: 806 GNLICCDGCPAAFHSRCVGIASDSLPEGDWYCPEC 840
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 41/144 (28%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYC--------KYCQNMFERKRFLQHDANAVEAGR 557
L+ C CPR+FH C + P+ DW C KY Q + + K+ ++ + A EA
Sbjct: 1004 LISCGSCPRSFHLICIQMKRAPRRDWRCLACTAGVKKYKQELKDLKKIIE-EKEAFEAKD 1062
Query: 558 VSGVD-SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 616
+ D S+ Q C++ G LL RG + C C R++H+
Sbjct: 1063 SNEEDFSINQ----CLKC---------GELLSRGH------------IECIGCGRKYHLA 1097
Query: 617 CLKKHKMADLRELPKGKWFCCMDC 640
C ADL + PKG W+C C
Sbjct: 1098 C------ADLTKRPKGDWYCKKRC 1115
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/173 (19%), Positives = 64/173 (36%), Gaps = 40/173 (23%)
Query: 466 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASL 523
+Y+ + +E K C C + +P + ++ C+ C FH C +L
Sbjct: 897 FYSSLESSIEWQKGLTNARCKVCRGKATPDR---------MIRCETCDLVFHLPCIKPAL 947
Query: 524 SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS 583
IP+G+W+CK C ++V ++ ++ + E S
Sbjct: 948 REIPRGEWFCKACT-----------------------PETVPDSPRKKPKVTSAEDEEES 984
Query: 584 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ DF + ++ C C R FH+ C++ ++ P+ W C
Sbjct: 985 TGEVPESNDFCEVCLNDEQLISCGSCPRSFHLICIQ------MKRAPRRDWRC 1031
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 431 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILGT 487
Query: 562 D 562
D
Sbjct: 488 D 488
>gi|427795843|gb|JAA63373.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1933
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 51/139 (36%), Gaps = 55/139 (39%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + IP GDWYC C N + ++
Sbjct: 1676 LLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKV------------------ 1717
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C+LC K G ++ CD C + FH CL
Sbjct: 1718 ---------------------CILC-----GKKG----KLVRCDACPKVFHHTCLD---- 1743
Query: 624 ADLRELPKGKWFCCMDCSR 642
L + PKGKW CC C++
Sbjct: 1744 PPLSKPPKGKW-CCSGCAK 1761
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1664 CQFCHSGDNEQMLLLCDGCDKGYHTYCFK----PKMDKIPDGDWYC 1705
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1527 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1562
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1938 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1976
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 58/164 (35%), Gaps = 48/164 (29%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 408 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCFDPELEEAPEGRWS 455
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + +T++ R + + C +C+
Sbjct: 456 CPHCEG--------------------EGI-TAATVTEKAGRNAADDDEHSEFCRICKDGG 494
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+L CD C +H CL L E+P G W C
Sbjct: 495 ---------ELLCCDSCTSAYHTFCLN----PPLSEIPDGDWKC 525
Score = 44.7 bits (104), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 492 DGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1716 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1751
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2127 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2165
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 66/178 (37%), Gaps = 53/178 (29%)
Query: 501 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC---QNMFERK--------RFLQ 547
+GG L+ CD CP +FH EC + L +P GDW+C+ C FE + R LQ
Sbjct: 34 GEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQDTPFEPQSMEMTIMGRLLQ 93
Query: 548 H-----------------------------DANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
H DA+A G + DS + + R +R K
Sbjct: 94 HLEGRNTVAFSLPLGVIKAVEPEDYADGQDDADA--PGESNLFDSDQSDSDRGLRQGKRK 151
Query: 579 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
S C +C S+ C +C +H+ CL +A P KW C
Sbjct: 152 RRHDSYCSVCSLPSPSRDDLA-----QCTRCPHSYHLWCLDPPLLAK----PTVKWLC 200
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1611 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1646
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2022 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1611 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1646
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2022 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1718 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1753
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2129 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2167
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|157125686|ref|XP_001660731.1| hypothetical protein AaeL_AAEL002037 [Aedes aegypti]
gi|108882584|gb|EAT46809.1| AAEL002037-PA [Aedes aegypti]
Length = 2884
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2347 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2387
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G G ++ C+ C R +H C +
Sbjct: 2388 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2419
Query: 624 ADLRELPKGKWFC 636
L + P+GKW+C
Sbjct: 2420 PPLLKYPRGKWYC 2432
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 8/88 (9%)
Query: 553 VEAGRVSGVDSVEQ--ITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGFGPRTILLCDQ 608
+ G VS D+VE+ T + + LE+ ++ C F +SG +LLCD
Sbjct: 2293 IPKGLVSWRDAVERSVTTAQLSMALYVLESCVAWDKSIMKANCQFCQSGEQEDKLLLCDG 2352
Query: 609 CEREFHVGCLKKHKMADLRELPKGKWFC 636
C+R +H C K + ++P G W+C
Sbjct: 2353 CDRGYHTYCFKPR----MDKIPDGDWYC 2376
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ C+ CPRA+H++C L P+G WYC+ C
Sbjct: 2401 GKMVYCELCPRAYHQDCYIPPLLKYPRGKWYCQNC 2435
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
GG +L CD CPR FH C L IP+G W C C + F R+
Sbjct: 830 GGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQ 870
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 13/90 (14%)
Query: 551 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 610
N +++ ++ +E+I + ++ V+ + E C+ C G G + +LLCD C
Sbjct: 787 NTIQSQQLQPSLIIEEIYRDQVKKVQIQDGENIWEEQCKVC-----GQGGK-VLLCDTCP 840
Query: 611 REFHVGCLKKHKMADLRELPKGKWFCCMDC 640
R FH CLK L+E+PKGKW CM C
Sbjct: 841 RVFHPRCLK------LKEIPKGKW-SCMIC 863
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|298707514|emb|CBJ30116.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1227
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 30/45 (66%), Gaps = 3/45 (6%)
Query: 500 HADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 544
HA GG+L+ CDGC +H EC LS +P+GDW+C C + RK+
Sbjct: 838 HA-GGDLICCDGCEAVYHPECVGLSVVPEGDWFCPAC--VIRRKK 879
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1609 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1644
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2020 GDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2058
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1430 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1465
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1841 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1879
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1444 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1479
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1855 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1893
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1448 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1483
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1859 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1897
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1345 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1380
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1756 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1794
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
>gi|237838373|ref|XP_002368484.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211966148|gb|EEB01344.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 527
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE----AGRVSGV 561
+L CDGC A H+ C + ++P+ DWYC+YC+ + K AN + A + SG
Sbjct: 193 MLLCDGCDVAVHQTCYYVKTVPKADWYCQYCEEKNKAK------ANVAKLQRLAAKASGK 246
Query: 562 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREF----HVG 616
+ +Q+ V L E+ S C+L + C FG +C +F HV
Sbjct: 247 ATDKQVETTLKTEVDRLTGEMESVCVLPKRCPLCPRSFGAHV-----RCGEDFRMWVHVN 301
Query: 617 C 617
C
Sbjct: 302 C 302
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1358 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1393
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 68/180 (37%), Gaps = 42/180 (23%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
EGY+ C C GG ++ CD CPRA+H C L PQG W
Sbjct: 394 EGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELERAPQGKWS 441
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ V+ + + KR R ++ E CR C
Sbjct: 442 CPHCEK------------EGVQWEAKELEEEEMEEPKRERREEEDDHME-----FCRVC- 483
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L ++P G+W C C ++ +Q +L
Sbjct: 484 --KDG---GELLCCDACVSSYHIHCLN----PPLPDIPHGEWLCPRCTCPQLKGKVQKIL 534
>gi|427795587|gb|JAA63245.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1435
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 51/139 (36%), Gaps = 55/139 (39%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + +H C + IP GDWYC C N + ++
Sbjct: 1178 LLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKV------------------ 1219
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C+LC K G ++ CD C + FH CL
Sbjct: 1220 ---------------------CILC-----GKKG----KLVRCDACPKVFHHTCLD---- 1245
Query: 624 ADLRELPKGKWFCCMDCSR 642
L + PKGKW CC C++
Sbjct: 1246 PPLSKPPKGKW-CCSGCAK 1263
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1166 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPK----MDKIPDGDWYC 1207
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 460 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHK 518
DG + GY+ C + ++ +G C N G L+ CD CP+ FH
Sbjct: 1182 DGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKVCILCGKKGKLVRCDACPKVFHH 1241
Query: 519 EC--ASLSSIPQGDWYCKYC 536
C LS P+G W C C
Sbjct: 1242 TCLDPPLSKPPKGKWCCSGC 1261
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1358 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1393
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1758 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1796
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1446 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1481
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1857 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1446 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1481
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1857 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1406 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1441
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1817 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1855
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1449 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1484
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1860 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1898
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1445 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1480
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1856 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1718 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1753
>gi|312376807|gb|EFR23792.1| hypothetical protein AND_12238 [Anopheles darlingi]
Length = 3049
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + IP GDWYC C N +R
Sbjct: 2439 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECNNKATGER------------------- 2479
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G G ++ C+ C R +H C +
Sbjct: 2480 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2511
Query: 624 ADLRELPKGKWFC 636
+ + P+GKW+C
Sbjct: 2512 PPMLKYPRGKWYC 2524
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2425 QNCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2468
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ C+ CPRA+H++C + P+G WYC+ C
Sbjct: 2493 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCQNC 2527
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1445 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1480
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1856 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1409 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1444
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1706 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1741
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1404 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1439
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1815 GDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1853
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1408 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1443
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1163 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1198
>gi|397596945|gb|EJK56894.1| hypothetical protein THAOC_23124 [Thalassiosira oceanica]
Length = 1752
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 24/138 (17%)
Query: 505 NLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRVSGV 561
+L+ CDGCP+ +H C + +P G+W C +C+ +RK+ Q ++ G
Sbjct: 541 DLVCCDGCPKVYHSNCHKPKIRELPDGEWLCMHCKPKGADRKKKYQ----GFRLAKIPG- 595
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP---RTILLCDQCEREFHVGCL 618
++V+ + V+ E E C++C G + + GP + C C+ +H C+
Sbjct: 596 ETVDSPARHVKCTVRWPEME---CIICEGTEVT----GPLKDNDWVTCATCDDAYHTRCV 648
Query: 619 KKHKMADLRELPKGKWFC 636
L P GKW C
Sbjct: 649 ------GLETRPGGKWRC 660
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 12/69 (17%)
Query: 489 NSEVSPSQ--FEAHAD--------GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
N E +PS+ F+ D GG+LL CD C +A+H +C L+ IP+G+W C+ C
Sbjct: 676 NKENAPSKPLFKGEHDDTCYMCYQGGDLLCCDYCSKAYHMKCHLPPLTEIPEGNWKCQEC 735
Query: 537 QNMFERKRF 545
+ ++ F
Sbjct: 736 AAVEMKRMF 744
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 7/57 (12%)
Query: 483 IICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
++ HC + E + + DGG L+ CDGCPRAFH C + L SIP G W C+ C+
Sbjct: 151 MVIHCNDDECAVCK-----DGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERCR 202
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 46/164 (28%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 268 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 315
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + + G + + + C R+ K+ G LLC
Sbjct: 316 CPHCE-----KEGIQWEPKEEDEEEEEGGEEEDDHMEFC-RVCKD-----GGELLC---- 360
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CD C +H+ CL L E+P G+W C
Sbjct: 361 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 387
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 354 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 62/170 (36%), Gaps = 39/170 (22%)
Query: 473 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 530
L+ GY C C GG ++ CD CPRAFH C L P+G
Sbjct: 356 LMSGYDTDHQDYCEVCQQ------------GGEIMLCDTCPRAFHLVCLDPELEEAPEGS 403
Query: 531 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK----NLEAELSGCL 586
W C +C+ V A R + + +++ I K N E +
Sbjct: 404 WSCPHCEK-----------EGVVAASRSTTPATGGDMSQNPQNIRKSAQPNEEEKDEHQE 452
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C C K G ++ C +C +H CL L E+P+G W C
Sbjct: 453 FCNEC---KDG---GDLICCAKCPVSYHPECL----YPPLSEIPEGPWLC 492
Score = 42.7 bits (99), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG+L+ C CP ++H EC LS IP+G W C C
Sbjct: 459 DGGDLICCAKCPVSYHPECLYPPLSEIPEGPWLCPRC 495
>gi|307203232|gb|EFN82387.1| Bromodomain adjacent to zinc finger domain protein 1A [Harpegnathos
saltator]
Length = 1466
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 33/164 (20%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRF------ 545
D N+L CD C + H C L+++P+GDW+C C + +RKRF
Sbjct: 1091 DAENMLLCDECNKGHHLYCLKPKLNAVPEGDWFCTTCRPPVIKPKEKTQKRKRFEDEMED 1150
Query: 546 ---LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLCRGCDFSKSGFGP 600
L + A RV E I+ + + + + +++ +C C KSG
Sbjct: 1151 EAILTKETRHNRAKRVVTYSDDEAISDQEDDVDEESDQDINVRSENICASC---KSG--- 1204
Query: 601 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 642
++ CD C +H+ C++ L P+G+W C C D R
Sbjct: 1205 GKLITCDTCPDRYHLECVE----PPLSRAPRGRWSCTKCKDKRR 1244
>gi|170053718|ref|XP_001862805.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874114|gb|EDS37497.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 3017
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 47/134 (35%), Gaps = 49/134 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2483 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2523
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G G ++ C+ C R +H C +
Sbjct: 2524 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2555
Query: 624 ADLRELPKGKWFCC 637
+ + P+GKW+C
Sbjct: 2556 PPMLKYPRGKWYCT 2569
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 589 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2469 QNCQFCQSGEQEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2512
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ C+ CPRA+H++C + P+G WYC C
Sbjct: 2537 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCTNC 2571
>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 588
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 538 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 573
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 874 GGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQ 914
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 869 GGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQ 909
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 677 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 712
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 60/164 (36%), Gaps = 45/164 (27%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 327 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 374
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + E G + E R+ K+ G LLC
Sbjct: 375 CPHCE-----KEGIQWEPKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 420
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CD C +H+ CL L E+P G+W C
Sbjct: 421 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 447
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 414 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|320167629|gb|EFW44528.1| hypothetical protein CAOG_02553 [Capsaspora owczarzaki ATCC 30864]
Length = 1716
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASL--SSIPQGDWYCKYC 536
G LL CDGCPR +H C L +S+PQGDW+C C
Sbjct: 437 GELLCCDGCPRVYHATCLKLDTASLPQGDWFCPTC 471
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 538
GN+L CD CPR++H +C +S P+GDW C C++
Sbjct: 1445 GNVLCCDYCPRSYHLKCLKPPMSKPPRGDWKCPICKS 1481
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 61/164 (37%), Gaps = 46/164 (28%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 348 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEMEKAPEGKWS 395
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + + + + + C R+ K+ G LLC
Sbjct: 396 CPHCE-----KEGIQWEPKDDDEEDEDLCEEADDHMEFC-RVCKD-----GGELLC---- 440
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CD C +H+ CL L E+P G+W C
Sbjct: 441 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLC 467
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 434 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 470
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 60/164 (36%), Gaps = 45/164 (27%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 312 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 359
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + E G + E R+ K+ G LLC
Sbjct: 360 CPHCE-----KEGIQWEPKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 405
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CD C +H+ CL L E+P G+W C
Sbjct: 406 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 432
Score = 42.7 bits (99), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 399 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|307109592|gb|EFN57830.1| hypothetical protein CHLNCDRAFT_143249 [Chlorella variabilis]
Length = 295
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+GG L+ CDGC A+H++CA L ++P+ DW+C C
Sbjct: 200 EGGELVCCDGCTAAYHEQCAGLEAVPETDWFCPMC 234
>gi|291234752|ref|XP_002737311.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 281
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 35/133 (26%), Positives = 51/133 (38%), Gaps = 54/133 (40%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC + FH C ++SIP+GDWYC C
Sbjct: 33 LLLCDGCDKGFHTYCFKPKMNSIPEGDWYCYEC--------------------------- 65
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
+ + T I C+LCR ++ C+ C R +H C+
Sbjct: 66 IYKATGEYI------------CVLCR---------HKGRLVKCENCPRAYHPDCID---- 100
Query: 624 ADLRELPKGKWFC 636
L ++P+G+WFC
Sbjct: 101 PPLLKMPRGRWFC 113
Score = 44.3 bits (103), Expect = 0.31, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 504 GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
G L+ C+ CPRA+H +C L +P+G W+C+ CQ
Sbjct: 82 GRLVKCENCPRAYHPDCIDPPLLKMPRGRWFCQACQ 117
Score = 39.7 bits (91), Expect = 7.0, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C F G +LLCD C++ FH C K + +P+G W+C
Sbjct: 21 CQFCLKGDNEELLLLCDGCDKGFHTYCFK----PKMNSIPEGDWYC 62
>gi|301093217|ref|XP_002997457.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110713|gb|EEY68765.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 248
Score = 51.6 bits (122), Expect = 0.002, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 503 GGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 538
GG LL CDGC RA+H C +L +P+GDW+C +C++
Sbjct: 196 GGKLLCCDGCERAYHLNCVRPALLDVPEGDWFCSHCRD 233
Score = 39.7 bits (91), Expect = 7.2, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 6/43 (13%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRI 643
+L CD CER +H+ C++ L ++P+G WFC C D S +
Sbjct: 199 LLCCDGCERAYHLNCVR----PALLDVPEGDWFCSHCRDASPV 237
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 489 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 524
>gi|158297171|ref|XP_317442.4| AGAP008017-PA [Anopheles gambiae str. PEST]
gi|157015066|gb|EAA12387.4| AGAP008017-PA [Anopheles gambiae str. PEST]
Length = 2930
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2399 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2439
Query: 564 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 623
C++C G G ++ C+ C R +H C +
Sbjct: 2440 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2471
Query: 624 ADLRELPKGKWFC 636
+ + P+GKW+C
Sbjct: 2472 PPMLKYPRGKWYC 2484
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 586 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2382 IMKANCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2428
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
G ++ C+ CPRA+H++C + P+G WYC+ C
Sbjct: 2453 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCQNC 2487
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 59/173 (34%), Gaps = 44/173 (25%)
Query: 466 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASL 523
++ G +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 53 FFMMGVDDGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEL 100
Query: 524 SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS 583
P+G W C +C+ G+ + E E
Sbjct: 101 EKAPEGKWSCPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDD 140
Query: 584 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CR C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 141 HMEFCRVC---KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 183
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 150 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 186
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 65/186 (34%), Gaps = 42/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 425 AGEEEADGYETDHQDYCEVCQQ------------GGEIILCDSCPRAYHLVCLDPELDKA 472
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C E V E+ E E
Sbjct: 473 PEGKWSCPHC-----------------EKEGVQWEPKEEEDEYEGEMDDAEKEEEDDHME 515
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
CR C K G +L CD C +H+ CL L E+P G+W C C +
Sbjct: 516 YCRVC---KDG---GELLCCDACISSYHIHCLN----PPLPEIPNGEWLCPRCTCPMLKG 565
Query: 646 VLQNLL 651
+Q +L
Sbjct: 566 RVQKIL 571
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 49/136 (36%), Gaps = 46/136 (33%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C L P+G W C YCQ E +
Sbjct: 384 GGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQ---------------AEGNQEQE 428
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D Q RI K+ G LLC CD C +H CL
Sbjct: 429 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHRFCLN- 462
Query: 621 HKMADLRELPKGKWFC 636
L E+P G+W C
Sbjct: 463 ---PPLEEVPDGEWKC 475
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H+ C + L +P G+W C C
Sbjct: 442 DGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRC 478
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 87 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 122
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 25/36 (69%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+GGNL+ CD CPR H C LS IP+GD+YC C+
Sbjct: 1767 EGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDCE 1802
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 473
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 292 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 327
>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
Length = 2062
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ +R + V+N + SGC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1767 MPQRMVGRVRNYNWQCSGCKCCIKC---RSNQRPGKMLFCEQCDRGYHIYCLG------L 1817
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1818 RTVPDGRWSCERCCVCMRC 1836
>gi|429862948|gb|ELA37533.1| origin recognition complex subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 830
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
G +L CDGC R +H++C ++ +P+GDWYC C Q ++ + AG V+ +
Sbjct: 348 GNKILFCDGCDRCYHQKCHNVPKVPKGDWYCDDC--------VQQKESRVLAAGEVAKIP 399
Query: 563 SVEQITKRCIRIV 575
+ Q R++
Sbjct: 400 NFAQHLNSMKRVL 412
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 24/91 (26%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC---- 640
C++C D SK+G IL CD C+R +H C ++ ++PKG W+C DC
Sbjct: 337 CVICSKPD-SKAG---NKILFCDGCDRCYHQKC------HNVPKVPKGDWYCD-DCVQQK 385
Query: 641 -SRINSVLQNLLVQEAEKLPEF--HLNAIKK 668
SR+ L E K+P F HLN++K+
Sbjct: 386 ESRV------LAAGEVAKIPNFAQHLNSMKR 410
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 63/180 (35%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 727 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 774
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + + E E CR C
Sbjct: 775 CPHCEK--------------------EGIQWEPKDDEDEEEEGGCEEEEDDHMEFCRVC- 813
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 814 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 864
>gi|302774224|ref|XP_002970529.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
gi|300162045|gb|EFJ28659.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
Length = 1340
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 482 GNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 66/180 (36%), Gaps = 46/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 339 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEMEKAPEGKWS 386
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + + G + E R+ K+ G LLC
Sbjct: 387 CPHCE-----KEGIQWEPKDDDEEDEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 432
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 433 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 475
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 474 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 521
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 522 CPHCEK--------------------EGIQWEPKEDDEEEEEGGCEEEEDDHMEFCRVC- 560
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 561 --KDG---GELLCCDACPSSYHLHCLN----PPLAEIPNGEWLCPRCTCPPLKGKVQRIL 611
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 546
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 394 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 45 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 546
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 394 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 482 GNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-QNMFERKRFLQHD 549
DGG L+ CDGCPRAFH C L +P G W C C +N+ E + L+ D
Sbjct: 255 DGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSCVENVTEPGQLLEAD 305
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 31/58 (53%), Gaps = 5/58 (8%)
Query: 481 LGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
+G+ C C +E + DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 89 MGVSCLCQKNE---DECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|358386993|gb|EHK24588.1| hypothetical protein TRIVIDRAFT_114174, partial [Trichoderma virens
Gv29-8]
Length = 633
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
LL CDGC A+H C L IP GDWYC C ++F+
Sbjct: 141 LLLCDGCDAAYHTHCIGLDYIPDGDWYCMECAHLFQ 176
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 42 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 42 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 322 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 369
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 370 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 408
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 409 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 459
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG LL CDGCPRAFH C LS IP G W C C
Sbjct: 441 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 477
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 334 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 381
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 382 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 420
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 421 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 471
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 62/171 (36%), Gaps = 42/171 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 189 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 236
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C E V E+ R K E +
Sbjct: 237 PEGKWSCPHC-----------------EKEGVQWEAKEEEEDYEEERGGKERRREEDDHM 279
Query: 587 -LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CR C K G +L CD C +H+ CL L ++P G+W C
Sbjct: 280 EYCRVC---KDGG---ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 320
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 287 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 586 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 633
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 634 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 672
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 673 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 723
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 386 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 424
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 425 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 475
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 347 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 394
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 395 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 433
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 434 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 468
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 435 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 471
>gi|310792256|gb|EFQ27783.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 312
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G ++ CDGC +A+H++C + +P+GDWYC C
Sbjct: 197 GNQIMFCDGCDKAYHQKCYKVPKVPRGDWYCNEC 230
Score = 46.2 bits (108), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 18/88 (20%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 642
CL+C D SK+G I+ CD C++ +H C K + ++P+G W+C C+D +
Sbjct: 186 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYK------VPKVPRGDWYCNECLDQKQ 235
Query: 643 INSVLQNLLVQEAEKLPEF--HLNAIKK 668
+ + EA K+P F HL+ +K+
Sbjct: 236 SRAAAAD----EAVKIPNFQQHLSKLKR 259
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 9/80 (11%)
Query: 466 YYACGQ--KLLEGYKNGLGIICH-----CCNSEVSPSQFEAHADGGNLLPCDGCPRAFHK 518
+Y+CGQ + K I H + V+ + A DGG L+ CDGCP+AFH
Sbjct: 217 FYSCGQSEETKRASKAVESIFHHKGEPLTDGAHVNDDECAACKDGGELICCDGCPQAFHL 276
Query: 519 ECAS--LSSIPQGDWYCKYC 536
C L+SIP G W C +C
Sbjct: 277 TCLDPPLTSIPSGPWQCDWC 296
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
GG ++ CD CP+ FH +C L IP+G W C C + FER+
Sbjct: 853 GGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNFERQ 893
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 70/186 (37%), Gaps = 42/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E + + + C R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEGEEEGEKEEEDDHMEYC-RVCKD-----GGEL 467
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 468 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 506
Query: 646 VLQNLL 651
+Q +L
Sbjct: 507 RVQKIL 512
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 320 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 367
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 368 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 406
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 407 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 457
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 321 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 368
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 369 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 407
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 408 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 442
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 409 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 445
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 35/53 (66%), Gaps = 4/53 (7%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 552
+GG LL CD CP+ FH C +L++ P G+WYC +C+++ + +Q++ NA
Sbjct: 705 NGGELLCCDKCPKVFHLSCHVPTLTASPSGEWYCTFCRDLNSPE--MQYNVNA 755
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 386 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 424
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 425 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 426 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 62/171 (36%), Gaps = 42/171 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 443 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 490
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C E V E+ R K E +
Sbjct: 491 PEGKWSCPHC-----------------EKEGVQWEAKEEEEDYEEERGGKERRREEDDHM 533
Query: 587 -LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
CR C K G +L CD C +H+ CL L ++P G+W C
Sbjct: 534 EYCRVC---KDGG---ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 574
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 541 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 420
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 421 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 459
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 460 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 461 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 424 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 471
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 472 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 510
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 511 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 561
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 310 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 357
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 358 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 396
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 397 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 431
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 398 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 434
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 420
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 421 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 459
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 460 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 510
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 189 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 236
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 237 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 290
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 291 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 319
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 286 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|440799762|gb|ELR20806.1| PHD-finger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 482
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 54/154 (35%), Gaps = 65/154 (42%)
Query: 485 CHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFER 542
C C P G +L CDGC R FH C + L S+P G+WYCK C
Sbjct: 241 CQICRRSTQP---------GCMLLCDGCDRGFHTFCLNPRLKSVPSGEWYCKSC------ 285
Query: 543 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 602
L S C +C G G R
Sbjct: 286 -----------------------------------LANSKSACEVCEG--------GGR- 301
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+L C+ C R +H+ CL L+++PK KW C
Sbjct: 302 LLCCEVCPRVYHLKCLD----PPLKQVPKEKWTC 331
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 439 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 486
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 487 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 525
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 526 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 576
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C L P+G W C +C+N G
Sbjct: 382 GGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------------EGPAEQ 425
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D Q RI K+ G LLC CD C +H CL
Sbjct: 426 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHTHCLN- 459
Query: 621 HKMADLRELPKGKWFC 636
L E+P G W C
Sbjct: 460 ---PPLVEIPDGDWKC 472
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 466 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------DGGNLLPCDGCPRAFHK 518
+ C LE G HC N + + H DGG LL CD CP A+H
Sbjct: 396 HLVCLDPELEDTPEGKWSCPHCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHT 455
Query: 519 ECAS--LSSIPQGDWYCKYC 536
C + L IP GDW C C
Sbjct: 456 HCLNPPLVEIPDGDWKCPRC 475
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 383 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 430
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 431 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 484
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 485 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 513
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 51/138 (36%), Gaps = 46/138 (33%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG L+ CD CPRA+H C A++ P+GDW C +C ++H V+
Sbjct: 264 GGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHGPEIVKEEPAKQ 314
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D C +C+ + +LLCD C FH C+
Sbjct: 315 NDDF--------------------CKICKETE---------NLLLCDNCTCSFHAYCMD- 344
Query: 621 HKMADLRELP--KGKWFC 636
L ELP W C
Sbjct: 345 ---PPLLELPPQDESWAC 359
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 51/138 (36%), Gaps = 46/138 (33%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG L+ CD CPRA+H C A++ P+GDW C +C ++H V+
Sbjct: 264 GGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHGPEIVKEEPAKQ 314
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D C +C+ + +LLCD C FH C+
Sbjct: 315 NDDF--------------------CKICKETE---------NLLLCDNCTCSFHAYCMD- 344
Query: 621 HKMADLRELP--KGKWFC 636
L ELP W C
Sbjct: 345 ---PPLLELPPQDESWAC 359
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 63/180 (35%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 363 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 410
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + + E E CR C
Sbjct: 411 CPHCEK--------------------EGIQWEPKDDEDDEEEGGCEEEEDDHMEFCRVC- 449
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 450 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 500
>gi|359479239|ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260139 [Vitis vinifera]
Length = 1976
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNL+ CDGCP A+H C ++S +P GDWYC C ++ + ++ + GV
Sbjct: 603 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC--AIDKDKPWMKQRKSLRGAELLGV 660
Query: 562 D 562
D
Sbjct: 661 D 661
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C L P+G W C +C+N G
Sbjct: 385 GGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------------EGPAEQ 428
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D Q RI K+ G LLC CD C +H CL
Sbjct: 429 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHTHCLN- 462
Query: 621 HKMADLRELPKGKWFC 636
L E+P G W C
Sbjct: 463 ---PPLVEIPDGDWKC 475
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 466 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------DGGNLLPCDGCPRAFHK 518
+ C LE G HC N + + H DGG LL CD CP A+H
Sbjct: 399 HLVCLDPELEDTPEGKWSCPHCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHT 458
Query: 519 ECAS--LSSIPQGDWYCKYC 536
C + L IP GDW C C
Sbjct: 459 HCLNPPLVEIPDGDWKCPRC 478
>gi|328788377|ref|XP_395223.4| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Apis mellifera]
Length = 1449
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 60/157 (38%), Gaps = 29/157 (18%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC----------------QNMFERK 543
DG +L CDGC + H C L+ +P GDWYCK C ++ E
Sbjct: 1077 DGDKMLLCDGCNKGHHLYCLQPKLNCVPDGDWYCKVCKPSTKPKEKIKKRKKFEDELEED 1136
Query: 544 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 603
L + A RV + + + + +C C KSG +
Sbjct: 1137 VILTKETRHNRAKRVLESEEEDNSEDEELEEDSDDNISNQQINVCSAC---KSG---GKL 1190
Query: 604 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 640
+ CD C +H+ C++ + P+G+W C DC
Sbjct: 1191 ISCDMCPNFYHIECIE----PPITRAPRGRWICS-DC 1222
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 548
GG L+ CD CP +H EC ++ P+G W C C++ +RK +++
Sbjct: 1187 GGKLISCDMCPNFYHIECIEPPITRAPRGRWICSDCKDRRDRKMNIKY 1234
>gi|296083821|emb|CBI24209.3| unnamed protein product [Vitis vinifera]
Length = 1805
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNL+ CDGCP A+H C ++S +P GDWYC C ++ + ++ + GV
Sbjct: 589 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC--AIDKDKPWMKQRKSLRGAELLGV 646
Query: 562 D 562
D
Sbjct: 647 D 647
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 546
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 312 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 358
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 58/143 (40%), Gaps = 44/143 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG LL CD CPRA+H C +S+ P+GDW C +C +E G
Sbjct: 260 GGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHC----------------IEHG---- 299
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+I K + V + C +C+ + +LLCD C FH C+
Sbjct: 300 ----PEIVKEEPQKVND-----DFCKICKETE---------NLLLCDTCVCAFHAYCMD- 340
Query: 621 HKMADLRELPKGKWFCCMDCSRI 643
L ++P+ + + C C +
Sbjct: 341 ---PPLTQVPQEETWNCPRCELV 360
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 406 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 453
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 454 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 492
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 493 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 527
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 494 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 530
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 807 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 854
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 855 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 893
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 894 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 944
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 41/136 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CP+A+H C + P+G W C C EA +
Sbjct: 265 GGEIILCDTCPKAYHMVCLDPDMEEAPEGHWSCPSC-----------------EAAGIPQ 307
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D E+ ++ N+E C +C+ + +L CD C +H C+
Sbjct: 308 KDEEEE-----KKVATNMEY----CRVCKDVGW---------LLCCDTCPSSYHAYCMN- 348
Query: 621 HKMADLRELPKGKWFC 636
L E+P+G+W C
Sbjct: 349 ---PPLTEVPEGEWSC 361
Score = 39.7 bits (91), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
D G LL CD CP ++H C + L+ +P+G+W C C
Sbjct: 328 DVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPRC 364
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 44 DGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|224059262|ref|XP_002299795.1| predicted protein [Populus trichocarpa]
gi|222847053|gb|EEE84600.1| predicted protein [Populus trichocarpa]
Length = 89
Score = 50.4 bits (119), Expect = 0.004, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 26/35 (74%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNL+ CDGCP A+H +C +++ +P+GDWYC C
Sbjct: 16 GNLICCDGCPAAYHAKCVGVANNYLPEGDWYCPEC 50
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 44 DGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|156033159|ref|XP_001585416.1| hypothetical protein SS1G_13655 [Sclerotinia sclerotiorum 1980]
gi|154699058|gb|EDN98796.1| hypothetical protein SS1G_13655 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 670
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 549
LL CDGC +H C LSSIP G WYC C E F QHD
Sbjct: 179 LLLCDGCDAPYHTHCIGLSSIPTGHWYCMEC---VESGAFTQHD 219
>gi|147864569|emb|CAN78969.1| hypothetical protein VITISV_022739 [Vitis vinifera]
Length = 1318
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNL+ CDGCP A+H C ++S +P GDWYC C ++ + ++ + GV
Sbjct: 587 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC--AIDKDKPWMKQRKSLRGAELLGV 644
Query: 562 D 562
D
Sbjct: 645 D 645
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 424 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 471
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 472 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 510
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 511 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 512 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
GG ++ CD CP+ FH +C L +P+G W C C + FER+
Sbjct: 853 GGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNFERQ 893
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 539
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 824 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 863
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 539
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 929 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 968
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 539
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 942 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 981
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 329 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 376
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 377 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 415
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 416 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 466
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 419 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 466
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 467 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 520
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 521 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 549
>gi|310792252|gb|EFQ27779.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 897
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G ++ CDGC +A+H++C + +P+GDWYC C
Sbjct: 395 GNQIMFCDGCDKAYHQKCYKVPKVPRGDWYCNEC 428
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 18/88 (20%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 642
CL+C D SK+G I+ CD C++ +H C K K +P+G W+C C+D +
Sbjct: 384 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYKVPK------VPRGDWYCNECLDQKQ 433
Query: 643 INSVLQNLLVQEAEKLPEF--HLNAIKK 668
+ + EA K+P F HL+ +K+
Sbjct: 434 SRAAAAD----EAVKIPNFQQHLSKLKR 457
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNLL CDGCP AFH +C + +P+GDWYC C
Sbjct: 436 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 470
>gi|356544359|ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max]
Length = 1702
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
G+L+ CDGCP AFH C ++S +P+GDWYC C
Sbjct: 685 GSLICCDGCPAAFHSRCVGIASDHLPEGDWYCPEC 719
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 538
GG+L+ C+ CP AFH +C IP G WYC+ C N
Sbjct: 1095 GGSLMCCESCPAAFHPDCIGYDEIPDGSWYCRDCTN 1130
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 60/164 (36%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 312 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 359
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ E + D E+ + ++ E CR C
Sbjct: 360 CPHCEK---------------EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC- 398
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 399 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 433
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 400 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 436
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 66/180 (36%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 360 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 407
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ E + D E+ + ++ E CR C
Sbjct: 408 CPHCEK---------------EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC- 446
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 447 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 497
>gi|408391355|gb|EKJ70734.1| hypothetical protein FPSE_09104 [Fusarium pseudograminearum CS3096]
Length = 710
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS-V 564
LL CD C A+H C L IP GDWYC C ++F + + A EAG S S V
Sbjct: 177 LLLCDSCDAAYHTHCIGLEVIPDGDWYCMECAHLF----HMVDEPEATEAGESSPRPSYV 232
Query: 565 EQITKRCIR 573
+ R +R
Sbjct: 233 RRPNPRNVR 241
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 35/164 (21%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 368 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMERAPEGTWS 415
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A R + E+ E + CR C
Sbjct: 416 CPHCE-----KEGIQWEA------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC- 463
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L ++P G+W C
Sbjct: 464 --KDG---GELLCCDTCPSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
Length = 209
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 501 ADGGNLLPCDG-CPRAFHKECASLSSIPQGD-WYCKYCQNMFERKR 544
DGG+LL CDG C R +H C +L+S+P+G+ W C YC E+ R
Sbjct: 62 GDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLCPYCARQKEKAR 107
>gi|397610251|gb|EJK60736.1| hypothetical protein THAOC_18861, partial [Thalassiosira oceanica]
Length = 578
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 6/40 (15%)
Query: 503 GGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 536
GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 448 GGDLIVCDGGDNEGGCGRAFHLECINLRTLPKGDWICKDC 487
>gi|357616639|gb|EHJ70297.1| hypothetical protein KGM_09919 [Danaus plexippus]
Length = 1569
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 52/140 (37%), Gaps = 49/140 (35%)
Query: 505 NLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
LL CDGC + +H C + IP GDWYC C N G
Sbjct: 1267 QLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWEC-------------VNKARGG------ 1307
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
S E++ C++C G ++ L C C R +H+ C
Sbjct: 1308 SRERV-----------------CIVCGGAARGRA-------LPCALCVRAYHLDC----H 1339
Query: 623 MADLRELPKGKWFCCMDCSR 642
L ++P+GKW+C SR
Sbjct: 1340 YPPLTKMPRGKWYCSQCASR 1359
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 8/91 (8%)
Query: 454 DESGLPDGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFE-----AHADGGNLL 507
D+ L DG + GY+ C + +E +G C N S+ A G L
Sbjct: 1266 DQLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWECVNKARGGSRERVCIVCGGAARGRAL 1325
Query: 508 PCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
PC C RA+H +C L+ +P+G WYC C
Sbjct: 1326 PCALCVRAYHLDCHYPPLTKMPRGKWYCSQC 1356
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 590 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 642
C F SG +LLCD C++ +H C K + ++P G W+C C++ +R
Sbjct: 1255 NCQFCLSGDNEDQLLLCDGCDKGYHTYCFKPR----MEKIPDGDWYCWECVNKAR 1305
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 46 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 93
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 94 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 132
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 133 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 167
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 134 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 170
>gi|342877621|gb|EGU79070.1| hypothetical protein FOXB_10409 [Fusarium oxysporum Fo5176]
Length = 673
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
LL CD C A+H C L +IP GDWYC C ++F+
Sbjct: 167 LLLCDSCDAAYHTHCIGLDAIPDGDWYCMECSHLFQ 202
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 62/155 (40%), Gaps = 28/155 (18%)
Query: 510 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD--SVEQI 567
DGC H C + S Q C C+N F R N V+ +S D +Q+
Sbjct: 77 DGCNHIIHDAC--IRSWAQKTNTCPICRNPFHSVRVY----NGVDGTAISKYDVQDKKQV 130
Query: 568 TKRCIR-------IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+ +R + E + + C +C + +LLCD C+ +H C+
Sbjct: 131 AEFDVRQWLGENPEDEEEEEQGNPCPICNSSERED------VLLLCDSCDAAYHTHCIG- 183
Query: 621 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 655
L +P G W+ CM+CS + +++ E+
Sbjct: 184 -----LDAIPDGDWY-CMECSHLFQLVEEPRTTES 212
>gi|302855516|ref|XP_002959250.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
gi|300255380|gb|EFJ39692.1| hypothetical protein VOLCADRAFT_100661 [Volvox carteri f.
nagariensis]
Length = 358
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 108/249 (43%), Gaps = 17/249 (6%)
Query: 604 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 663
L CD C H+GC+ ++A+ ++P WF C C L+ AE+ P H
Sbjct: 123 LRCDTCGCWVHLGCVGV-EVAE--QVPPRPWFHCRACRSTYLRLE----AAAERNP--HP 173
Query: 664 NAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 723
++ + T SD + + TP P++ D
Sbjct: 174 SSASPTHALYVLTPSDHQAAMAVALNRGLTPAVAAGGGAGGRGPVGAVLPLLAQGFNEDA 233
Query: 724 IPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 783
I +G+ R + GG Y A+L V+A VFG + A+L L+AT+ + KG
Sbjct: 234 IRG--FGQPAREYD-GGKYSAVLLNRGQPVAAATFNVFGAD-AQLCLLATAVQHRLKGNG 289
Query: 784 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP-ELLSIYRKRCSQLVTFK 842
L A +E LL+ + V +++ + A +W + G++ + P E L ++R + + +
Sbjct: 290 SALVADLEALLADVGVSRLLVQSRGVALPLWLGRLGYRLVPPQEALQLHR---TLPIAYY 346
Query: 843 GTSMLQKRV 851
+++QK++
Sbjct: 347 DCALMQKQL 355
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
GNLL CDGCP AFH +C + +P+G+WYC C + +R ++ A V G+
Sbjct: 432 GNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPEC--LMQRNNGSRNMAKLGRGAEVLGI 489
Query: 562 D 562
D
Sbjct: 490 D 490
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 61/164 (37%), Gaps = 42/164 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 332 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 379
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + + G + E+ + CR C
Sbjct: 380 CPHCE-----KEGIQWEPKEDDEDEEDGAEEEEEEEDDHME-------------FCRVC- 420
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 421 --KDG---GELLCCDTCPSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 42.7 bits (99), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 422 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
Length = 591
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 59/150 (39%), Gaps = 27/150 (18%)
Query: 506 LLPCDGCPRAFHKECAS--LSSIPQ--------------GDWYCKYCQNMFERKRFLQHD 549
LL CD C R +H C + LS P+ + YC +C E +
Sbjct: 433 LLFCDDCDRGYHMYCLNPPLSEPPEEKSGRSGMDKRDISANNYCDFCLGDSEENKKSNQP 492
Query: 550 ANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 606
V + GR SG + Q T I VK + C+ C+ C + +L C
Sbjct: 493 EELVSCSDCGR-SGHPTCLQFTANMIISVKKYPWQ---CIECKSCGLCGTSDNDDQLLFC 548
Query: 607 DQCEREFHVGCLKKHKMADLRELPKGKWFC 636
D C+R +H+ CL L E P+G W C
Sbjct: 549 DDCDRGYHMYCLN----PPLSEPPEGNWSC 574
>gi|313247391|emb|CBY15642.1| unnamed protein product [Oikopleura dioica]
Length = 1498
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
GG LL C+ CPR +H +C + + IP GDW+C YC
Sbjct: 157 GGELLACESCPRVYHPKCLNPPQTEIPDGDWFCPYC 192
>gi|47214709|emb|CAG01062.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1036
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 29/164 (17%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQG 529
GQ L+ + + C C + + AD +L CD C A H+EC + IP+G
Sbjct: 205 GQSDLQAMVDEDAVCCICMDGD--------GADSNVILFCDSCNIAVHQECYGVPYIPEG 256
Query: 530 DWYCKYCQN---MFERKRFLQ--------HDANAVEAGRVSGVDSVEQITKRCIRIVKNL 578
W C++C + +++R L+ H A A+ V D+V +R +
Sbjct: 257 QWLCRHCLQVRLLPQQRRSLKKTDDGRWGHVACALWVPEVGFSDTVFIEPIDGVRNIPPA 316
Query: 579 EAELSGCLLCRGCDFSKSGFGPRTILLCDQ--CEREFHVGCLKK 620
+L+ C LCR + G G + CD+ C FHV C +K
Sbjct: 317 RWKLT-CYLCR-----EKGAG--ACIQCDKVNCYTAFHVSCAQK 352
>gi|17562600|ref|NP_504523.1| Protein LET-418 [Caenorhabditis elegans]
gi|403399446|sp|G5EBZ4.1|LE418_CAEEL RecName: Full=Protein let-418; AltName: Full=Lethal protein 418
gi|11095333|gb|AAG29838.1| LET-418 [Caenorhabditis elegans]
gi|351020697|emb|CCD62685.1| Protein LET-418 [Caenorhabditis elegans]
Length = 1829
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 53/142 (37%), Gaps = 44/142 (30%)
Query: 504 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
G LL CD CPRA+H C ++ P+GDW C +C ++H V+
Sbjct: 266 GELLLCDTCPRAYHTVCIDENMEEPPEGDWSCAHC---------IEHGPEVVKEEPAKQN 316
Query: 562 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
D C +C+ + +LLCD C FH C+
Sbjct: 317 DEF--------------------CKICKETE---------NLLLCDSCVCSFHAYCID-- 345
Query: 622 KMADLRELPKGKWFCCMDCSRI 643
L E+PK + + C C +
Sbjct: 346 --PPLTEVPKEETWSCPRCETV 365
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 316 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 363
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 364 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 402
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 651
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 403 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 453
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 896 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 953
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q ++C R++
Sbjct: 954 KTVQGLSPVDQ--RKCERLL 971
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 364 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 411
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 412 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 458
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 459 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPMLKG 505
Query: 646 VLQNLL 651
+Q +L
Sbjct: 506 RVQKIL 511
>gi|397574031|gb|EJK48991.1| hypothetical protein THAOC_32170 [Thalassiosira oceanica]
Length = 884
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 6/40 (15%)
Query: 503 GGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 536
GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 754 GGDLIVCDGGDNEGGCGRAFHLECINLRTLPEGDWICKDC 793
>gi|46124757|ref|XP_386932.1| hypothetical protein FG06756.1 [Gibberella zeae PH-1]
Length = 717
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS-V 564
LL CD C A+H C L +IP GDWYC C ++F + + EAG S S V
Sbjct: 176 LLLCDSCDAAYHTHCIGLEAIPDGDWYCMECAHLF----HMVDEPETTEAGESSPRPSYV 231
Query: 565 EQITKRCIR 573
+ R +R
Sbjct: 232 RRPNPRNVR 240
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 35/164 (21%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 368 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMERAPEGTWS 415
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q +A R + E+ E + CR C
Sbjct: 416 CPHCE-----KEGIQWEA------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC- 463
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L ++P G+W C
Sbjct: 464 --KDG---GELLCCDTCPSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 384 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 431
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 432 PEGKWSCPHCEKEGVQWEAKEEEEDYEEDGEEEGEKEEEDDHMEYCRVCKD-----GGEL 486
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 487 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 525
Query: 646 VLQNLL 651
+Q +L
Sbjct: 526 RVQKIL 531
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.1 bits (100), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 896 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 953
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q ++C R++
Sbjct: 954 KTVQGLSPVDQ--RKCERLL 971
>gi|346327633|gb|EGX97229.1| PHD and RING finger domain protein, putative [Cordyceps militaris
CM01]
Length = 754
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 555
LL CD C A+H C L IP GDWYC C + FE Q+ + V++
Sbjct: 256 LLLCDSCDAAYHTHCLGLDHIPDGDWYCMECAHAFELTEESQNGSQPVDS 305
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG+LL CD CP +H+ C + L SIP+GDW C C
Sbjct: 16 DGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|322707184|gb|EFY98763.1| PHD and RING finger domain protein, putative [Metarhizium
anisopliae ARSEF 23]
Length = 651
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
LL CD C A+H C L IP+GDWYC C ++F+
Sbjct: 152 LLLCDSCDAAYHTHCIGLDHIPEGDWYCMECAHLFQ 187
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 62/148 (41%), Gaps = 27/148 (18%)
Query: 510 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE--QI 567
DGC H C + S Q C C+ F R N V+ +S D ++ Q+
Sbjct: 63 DGCEHIIHDAC--IRSWAQKTNTCPICRTPFHCVRVY----NGVDGTAISTYDVIDKKQV 116
Query: 568 TKRCIR------IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 621
+ ++ IV E E + C +C + +LLCD C+ +H C+
Sbjct: 117 AEFDVQAWLGENIVDQEEEECNPCPICNSAERED------ILLLCDSCDAAYHTHCIG-- 168
Query: 622 KMADLRELPKGKWFCCMDCSRINSVLQN 649
L +P+G W+ CM+C+ + + Q+
Sbjct: 169 ----LDHIPEGDWY-CMECAHLFQLTQD 191
>gi|449547717|gb|EMD38685.1| hypothetical protein CERSUDRAFT_113863 [Ceriporiopsis subvermispora
B]
Length = 906
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 75/203 (36%), Gaps = 65/203 (32%)
Query: 504 GNLLPCDGCPRAFHKEC----ASLSSIPQGD--WYCKYCQN-------MFERKRF----L 546
G+L+ CDGCPRAFH C + S +P+GD WYC C N + + +F L
Sbjct: 225 GSLVYCDGCPRAFHLWCLDPPMAASDLPEGDERWYCPACTNQQKPPPKISAKLKFIAPLL 284
Query: 547 QHDANAVEA------------------GRVSGVDSVEQITKRCIRIVK-------NLEAE 581
+H A + A R + VD+ E R R+ + L+
Sbjct: 285 EHLATIIPAEYSLPNEIKTHFKDVATGPRGAYVDTSEIKAPRLNRLGQVEDRDPYRLKDR 344
Query: 582 LSGCLLCRGC---------------------DFSKSGFGPRTILLCDQCEREFHVGCLKK 620
+LC C D + PR I+ CD C +H+ CL
Sbjct: 345 NGDPVLCFQCGTSALPPAVAATSPAAKRTKRDHNTFHDNPRAIITCDYCHLHWHLDCLDP 404
Query: 621 HKMADLRELPKGKWFCCMDCSRI 643
+A + K KW C ++
Sbjct: 405 -PLACMPPWSK-KWMCPNHADQV 425
>gi|400602572|gb|EJP70174.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
2860]
Length = 633
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 22/36 (61%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
LL CD C A+H C L IP GDWYC C ++FE
Sbjct: 140 LLLCDSCDAAYHTHCIGLDHIPDGDWYCIECAHLFE 175
>gi|358336343|dbj|GAA54879.1| tyrosine-protein kinase BAZ1B, partial [Clonorchis sinensis]
Length = 1921
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 537
+D NLL CDGC RAFH C L +P GDWYC C+
Sbjct: 1433 SDDDNLLLCDGCNRAFHLYCLRPPLRRVPAGDWYCPSCR 1471
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 38/74 (51%), Gaps = 14/74 (18%)
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
CIR K++E + C +CR KS +LLCD C R FH+ CL+ LR +P
Sbjct: 1414 CIRWEKSVED--ARCRICR----HKSDDD--NLLLCDGCNRAFHLYCLR----PPLRRVP 1461
Query: 631 KGKWFC--CMDCSR 642
G W+C C SR
Sbjct: 1462 AGDWYCPSCRPASR 1475
>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
Length = 3661
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 27/159 (16%)
Query: 502 DGGNLLPCDG-CPRAFHKECASLSSIPQGD-WYCKYCQNMFER-----KRFLQHDANAVE 554
DGG + CDG C R+FH C + P+ D W C C N ++ K+ + D++
Sbjct: 2745 DGGVTIMCDGPCQRSFHPACLGMDDNPEEDPWMCNRCMNKVQKCLECGKKGSEMDSHN-R 2803
Query: 555 AGRVSGVDSVEQIT-------KRCIRIVKNLEAELS--GCL-----LCRGCDFSKSGFGP 600
A ++ G S Q++ K C+ + S G C C + + GP
Sbjct: 2804 AVKIPGGVSRCQLSSCGRYYHKECLDKITPNRTSYSKEGNFKCPQHFCIDCGKTSTNLGP 2863
Query: 601 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 639
RT++ C +C + CLK R + KGKW C D
Sbjct: 2864 RTLVKCLRCAKARCPDCLKT-----ARYVKKGKWMVCSD 2897
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 64/132 (48%), Gaps = 17/132 (12%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 662
+L+CD C+ E+H+ CL+ L +PKG+W C + C+ + Q L + E
Sbjct: 551 MLVCDTCDAEYHLKCLR------LSSVPKGQWLCPI-CTVMLRKGQTLFSHQT----EVE 599
Query: 663 LNAIKKYAGNSLETVSDID--VRWRLLSGKAATPETRLLLSQAVAI--FHDCFD--PIVD 716
+ + ++E V ++ ++W LS + T ETR L+ AI FH D P+
Sbjct: 600 KAKLSQMPQPTVEVVDELKYLIKWSGLSYQFCTWETREELNNDGAIDRFHKLNDHPPLSP 659
Query: 717 SISGRDLIPSMV 728
+S +L+ ++
Sbjct: 660 PMSEEELMRTLA 671
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 6/74 (8%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE 565
+L CD C +H +C LSS+P+G W C C M + + L VE ++S
Sbjct: 551 MLVCDTCDAEYHLKCLRLSSVPKGQWLCPICTVMLRKGQTLFSHQTEVEKAKLS------ 604
Query: 566 QITKRCIRIVKNLE 579
Q+ + + +V L+
Sbjct: 605 QMPQPTVEVVDELK 618
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 649 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 706
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q ++C R++
Sbjct: 707 KTVQGLSPVDQ--RKCERLL 724
>gi|332018342|gb|EGI58947.1| Bromodomain adjacent to zinc finger domain protein 1A [Acromyrmex
echinatior]
Length = 1453
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 70/169 (41%), Gaps = 40/169 (23%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRF------ 545
D N+L CDGC R H C L+++P GDW+C C + +RKRF
Sbjct: 1082 DAENMLLCDGCNRGHHLYCLKPKLTAVPAGDWFCTACRPPEIKPKEKTQKRKRFEDEIEE 1141
Query: 546 --------LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 597
Q+ A + + +S ++ + I LE LC C K+G
Sbjct: 1142 ETMLTKETRQNRAKRIHSDDEDDQESDDEDDESEEDINIRLEN------LCASC---KNG 1192
Query: 598 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 646
++ CD C FH+ C++ L P+G+W C + + N++
Sbjct: 1193 ---GKLIACDTCPNRFHLECVE----PPLSRAPRGRWSCTICKKKKNAI 1234
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
G LL CDGCPR +H +C + L+ +P+GDW+C C
Sbjct: 728 GELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPAC 762
>gi|170592228|ref|XP_001900871.1| CHD4 protein [Brugia malayi]
gi|158591738|gb|EDP30342.1| CHD4 protein, putative [Brugia malayi]
Length = 1846
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 265 GGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCES--------------------TG 304
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
++ K +I N+E CR C + G+ +L CD C +H CL
Sbjct: 305 ATKDDEEEK---KITTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 347
Query: 621 HKMADLRELPKGKWFC 636
L E+P+G W C
Sbjct: 348 ---PSLTEIPEGDWSC 360
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 327 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 363
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 323 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 370
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 371 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 424
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 425 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 453
Score = 39.3 bits (90), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 420 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 338
>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
Length = 2002
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1731 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCL------GL 1781
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1782 RTVPDGRWSCERCCVCMRC 1800
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 381 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 428
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 429 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 475
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 476 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 522
Query: 646 VLQNLL 651
+Q +L
Sbjct: 523 RVQKIL 528
>gi|401402451|ref|XP_003881253.1| hypothetical protein NCLIV_042870 [Neospora caninum Liverpool]
gi|325115665|emb|CBZ51220.1| hypothetical protein NCLIV_042870 [Neospora caninum Liverpool]
Length = 476
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query: 488 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 546
C S+ Q + +G + +L CDGC A H+ C + ++P+ DWYC+YC+
Sbjct: 133 CQSDDVSKQASSEQEGHDEILLCDGCDVAVHQTCYYVETVPKADWYCQYCE--------- 183
Query: 547 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL-- 604
D N +A V + ++ K + K +EA L D S P +L
Sbjct: 184 --DRNQAQA----NVTKLRRLAKSSGKTDKQVEATFRTEL-----DRMASAKEPFCVLPK 232
Query: 605 LCDQCEREF--HVGCLKKHKM 623
C C R F HV C + +M
Sbjct: 233 RCPLCPRSFGAHVRCGEDFRM 253
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 323 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 370
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 371 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 424
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 425 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 453
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 420 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 72/186 (38%), Gaps = 42/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 433 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 480
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 481 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 534
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 535 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 573
Query: 646 VLQNLL 651
+Q +L
Sbjct: 574 RVQKIL 579
>gi|429328961|gb|AFZ80720.1| hypothetical protein BEWA_001270 [Babesia equi]
Length = 238
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 44/93 (47%), Gaps = 12/93 (12%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 644
C +C GCD S T+L+CD C+R FH+ C K+ E+PKG WF C DCS
Sbjct: 75 CKICAGCD-DNSSKRDHTMLICDACDRSFHMECTKEK----YSEVPKGAWF-CDDCSICQ 128
Query: 645 SVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 677
L +E+ + L GN L TV
Sbjct: 129 ICDIKLTERESNNPTNYSLE------GNKLCTV 155
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 325 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 372
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 373 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 426
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 427 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 455
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 422 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 458
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + D + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 198 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 245
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 246 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 300
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 301 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 329
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 296 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 364 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 411
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 412 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 458
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 459 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 495
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 462 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 372 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 419
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 420 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 466
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 467 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 513
Query: 646 VLQNLL 651
+Q +L
Sbjct: 514 RVQKIL 519
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 322 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 369
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 370 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 423
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 424 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 452
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 419 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 455
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + D + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 18/73 (24%)
Query: 482 GIICHCCNSEVSPSQFEAH-----------------ADGGNLLPCDGCPRAFHKECASLS 524
G + NS + P+ F A ++GG+LL C+ CP AFH+EC ++
Sbjct: 1109 GTVILASNSMICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1167
Query: 525 SIPQGDWYCKYCQ 537
+P+G WYC C+
Sbjct: 1168 DMPEGSWYCNDCK 1180
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C SL+ P G W C + Q
Sbjct: 1555 GDGGQLVSCKRPGCPKVYHADCLSLTRRPAGKWECPWHQ 1593
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 60/154 (38%), Gaps = 42/154 (27%)
Query: 502 DGGNLLPCDGCPRAFHKECASLS---SIPQGDWYCKYC---------------------- 536
+ G++L CDGCPR+FH EC +L+ +P DWYC C
Sbjct: 904 NAGDVLCCDGCPRSFHFECVNLTQSEDLPD-DWYCSECIMRRFPSRVPIHKGAFAPALNA 962
Query: 537 -----QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-------ELSG 584
F + +Q+ V+AG G D E + K R E + +
Sbjct: 963 LEKSIPRAFSLPKHIQNRFEGVKAG--PGGDYEEIVGKTVKRRTGFDETPDLFKQRDENQ 1020
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 618
+LC C KS R IL C C +H+ CL
Sbjct: 1021 PVLCHAC--QKSSNDTRAILPCSLCSYYWHLDCL 1052
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 646 VLQNLL 651
+Q +L
Sbjct: 508 RVQKIL 513
>gi|302915931|ref|XP_003051776.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
gi|256732715|gb|EEU46063.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
Length = 1194
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 89/235 (37%), Gaps = 67/235 (28%)
Query: 427 ASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP----DGTEVGYYACGQKLLEGYKNGLG 482
A+P L P K R + K + K +G+P DGT A Q + + ++
Sbjct: 703 ATPSLRPPKKQRTGLRVKSSPVKKRGGTAAGVPRAMGDGTATSAAAKDQ-ISDNDED--- 758
Query: 483 IICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGD----WYCKYCQN 538
C C + G+++ CDGCPR+FH EC + +P D WYC C
Sbjct: 759 --CSACGA------------AGDVVCCDGCPRSFHFECVGM--VPSEDLPDEWYCNEC-- 800
Query: 539 MFER--KRFLQHDA------NAVEAG-------------RVSGVDS-----VEQITKRCI 572
+F+R R H N +E R GV + E++T
Sbjct: 801 LFKRYPSRVPVHKGVFGPALNNLEKSIPRAFSLPKKLQTRFEGVKAGPDGEYEEVTTAKT 860
Query: 573 RIVKNLEAELSG---------CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 618
KN E+ +LC GC K+ R I+ C C R +H+ CL
Sbjct: 861 TKRKNGYEEVPDFFRQRDDGQPVLCHGC--QKAATDVRAIIPCSVCPRYWHIDCL 913
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 424 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 471
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 472 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 518
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 519 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 565
Query: 646 VLQNLL 651
+Q +L
Sbjct: 566 RVQKIL 571
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 360 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 407
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 408 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 454
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 455 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 501
Query: 646 VLQNLL 651
+Q +L
Sbjct: 502 RVQKIL 507
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 425 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 472
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 473 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 519
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 520 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 566
Query: 646 VLQNLL 651
+Q +L
Sbjct: 567 RVQKIL 572
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 411 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 458
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 459 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 505
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 506 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 552
Query: 646 VLQNLL 651
+Q +L
Sbjct: 553 RVQKIL 558
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 377 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 424
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 425 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 479
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 480 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 508
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 362 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 409
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 410 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 456
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 457 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 493
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 460 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 496
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 333 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 380
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 381 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 435
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 436 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 464
>gi|307172331|gb|EFN63819.1| Bromodomain adjacent to zinc finger domain protein 1A [Camponotus
floridanus]
Length = 1460
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 37/158 (23%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRFLQHDAN 551
D N+L CDGC R H C L+++P GDW+C C + +RKRF +
Sbjct: 1093 DAENMLLCDGCNRGHHLYCLKPKLNAVPAGDWFCTACRPPEIKLKEKAQKRKRFEDEIED 1152
Query: 552 AVEAGRVSGVDSVEQITKRCIRIVK-------------NLEAELSGCLLCRGCDFSKSGF 598
V + + + ++I + + N+ E + C LC KSG
Sbjct: 1153 EVILTKETRHNRAKRIPQSDDENDQEDDEDDEDSEEDINMRLE-NLCALC------KSG- 1204
Query: 599 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ CD C +H+ C++ L P+G+W C
Sbjct: 1205 --GKVISCDTCPNYYHLECVE----PPLSRAPRGRWSC 1236
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GG ++ CD CP +H EC LS P+G W C C+
Sbjct: 1204 GGKVISCDTCPNYYHLECVEPPLSRAPRGRWSCSKCK 1240
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 538
GG+L+ C+ CP A+H +C S+P G+W+C+ C N
Sbjct: 1004 GGDLICCESCPAAYHAKCLGFDSVPDGNWFCRDCVN 1039
>gi|307110583|gb|EFN58819.1| hypothetical protein CHLNCDRAFT_19495 [Chlorella variabilis]
Length = 176
Score = 49.3 bits (116), Expect = 0.010, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+GG LL CDGC A+H C +L + P GDW+C C
Sbjct: 20 EGGELLCCDGCTAAYHFSCVNLDAAPPGDWFCPLC 54
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|336270508|ref|XP_003350013.1| hypothetical protein SMAC_00903 [Sordaria macrospora k-hell]
gi|380095404|emb|CCC06877.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 990
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G ++ CDGC +A H++C + +P+GDWYCK C
Sbjct: 444 GNQIVFCDGCDKAVHQKCYGIPRLPRGDWYCKEC 477
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 61/164 (37%), Gaps = 43/164 (26%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ K +Q + + E+ ++E CR C
Sbjct: 386 CPHCE-----KEGIQWEPKDDDD-------EEEEGGCEEEEEDDHME-------FCRVC- 425
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 426 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 460
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 427 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 463
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 308 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSC 344
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 341 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 388
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 389 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 435
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 436 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 472
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 439 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 475
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 517 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 564
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 565 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 619
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 620 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 648
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 615 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 651
>gi|312077956|ref|XP_003141528.1| CHromoDomain protein family member [Loa loa]
Length = 1696
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 120 GGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES--------------------TG 159
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
++ K ++ N+E CR C + G+ +L CD C +H CL
Sbjct: 160 APKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 202
Query: 621 HKMADLRELPKGKWFC 636
L E+P+G W C
Sbjct: 203 ---PSLTEIPEGDWSC 215
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 182 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 218
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 259 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 318
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 319 AQGLSPVDQ-----RKCERLL 334
>gi|393911013|gb|EJD76123.1| LET-418 protein [Loa loa]
Length = 1755
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 179 GGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES--------------------TG 218
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
++ K ++ N+E CR C + G+ +L CD C +H CL
Sbjct: 219 APKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 261
Query: 621 HKMADLRELPKGKWFC 636
L E+P+G W C
Sbjct: 262 ---PSLTEIPEGDWSC 274
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 241 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 341 DGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 377
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 338
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 272 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 319
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 320 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 374
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 375 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 403
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 370 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 406
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 361 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 408
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + D E G G E R+ K+ G L
Sbjct: 409 PEGKWSCPHCEKEGVQWEAKEEDEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 463
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 464 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 492
Score = 39.3 bits (90), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 459 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 495
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 401 NGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 460
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 461 AQGLSPVDQ-----RKCERLL 476
>gi|198433831|ref|XP_002121767.1| PREDICTED: similar to zinc finger, MYND-type containing 8 [Ciona
intestinalis]
Length = 1878
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 541
G +L C+ CPR FH +C + S P+GDW+C C+ + E
Sbjct: 269 GEVLCCELCPRVFHAKCLRMQSEPEGDWFCPECEKITE 306
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 552
+GG L+ CD CP+ FH C SL++ P G+WYC C+++ + +Q++ NA
Sbjct: 705 NGGELICCDKCPKVFHLSCHVPSLTASPSGEWYCTLCRDLNSPE--MQYNVNA 755
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 64/169 (37%), Gaps = 40/169 (23%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 364 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 411
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 587
+G W C +C+ K +Q +A E + + + + C +
Sbjct: 412 EGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRV 458
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 459 CKDGG---------ELLCCDTCISSYHIHCLN----PPLPDIPNGEWLC 494
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G+ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 460 AGEDEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 507
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 508 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 554
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 555 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 601
Query: 646 VLQNLL 651
+Q +L
Sbjct: 602 RVQKIL 607
>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
Length = 2891
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 79/209 (37%), Gaps = 53/209 (25%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 502
C + L + KN + I C CN V PS Q AD
Sbjct: 2674 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2733
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV-SGV 561
+L CD C R +H C L +PQG W+C+ C + + ++ E G + S
Sbjct: 2734 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC--------AVCANCSSREPGGINSDR 2785
Query: 562 DSVEQITKRCIRIVKNLEAELSG-CLLC-------RGCDF-SKSGFGPR-----TILLCD 607
+SV Q + KN +S C+ C R C S+ PR ++ C
Sbjct: 2786 NSVAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEANLVHCS 2845
Query: 608 QCEREFHVGCLKKHKMADLRELPKGKWFC 636
C++ H+GC++ M L + + C
Sbjct: 2846 ACDKYLHLGCVETKGMP----LDRKNYLC 2870
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2705 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 2758
Query: 631 KGKWFC-----CMDCS 641
+G+W C C +CS
Sbjct: 2759 QGRWHCQECAVCANCS 2774
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CPRA+H C L P+G W C +C+ E +
Sbjct: 360 GGEIILCDTCPRAYHLVCLDPELEETPEGKWSCPHCE---------------AEGTQEQD 404
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D + + C + G LLC CD C +H+ CL
Sbjct: 405 DDEHNEFCRLC---------KDGGELLC-----------------CDSCTSAYHIFCLN- 437
Query: 621 HKMADLRELPKGKWFC 636
L E+P G W C
Sbjct: 438 ---PPLSEIPDGDWKC 450
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 417 DGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 343
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
S ++++ L AE G C G +L C C FH C
Sbjct: 344 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 393
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 755 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDL--SKPEVEYDCDNLQHSKKG 812
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 813 KTAQGLSPVDQ--RKCERLL 830
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 348
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
S ++++ L AE G C G +L C C FH C
Sbjct: 349 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 398
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 352 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 399
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 400 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 446
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 645
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 447 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 493
Query: 646 VLQNLL 651
+Q +L
Sbjct: 494 RVQKIL 499
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 347
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
S ++++ L AE G C G +L C C FH C
Sbjct: 348 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 397
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 344
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
S ++++ L AE G C G +L C C FH C
Sbjct: 345 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 394
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 30/60 (50%), Gaps = 7/60 (11%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-----QNMFERKRFLQHDANAVE 554
DGG L+ CDGCPRAFH C S L IP G W C C Q+M R + VE
Sbjct: 435 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQATVQDMRPRAEEPRPQEPPVE 494
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 45/76 (59%), Gaps = 6/76 (7%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
+GG LL CD CP+ FH C +L P G+W+C +C+++ + ++++ N+ ++
Sbjct: 714 NGGELLCCDRCPKVFHLSCHIPALHEPPSGEWFCSFCRDLVSPE--MEYNCNSNDSPVSD 771
Query: 560 GVDSVEQITKRCIRIV 575
G +++ ++C R++
Sbjct: 772 GFPPIDR--RKCERLL 785
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 64/169 (37%), Gaps = 40/169 (23%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 364 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 411
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 587
+G W C +C+ K +Q +A E + + + + C +
Sbjct: 412 EGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRV 458
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 459 CKDGG---------ELLCCDTCISSYHIHCLN----PPLPDIPNGEWLC 494
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 71/193 (36%), Gaps = 44/193 (22%)
Query: 498 EAHADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQNMFERKRFL---QHDA 550
+A DGG L+ CD CP +FH +C LS IP G+W C C+ +++ + +
Sbjct: 62 DACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACRCAMKKENSIGNKRKKK 121
Query: 551 NAVEAGRVS----------------------GVDSVEQIT------KRCIRIVKNLEAEL 582
NA+E ++ G D ++ ++ I +
Sbjct: 122 NALEVLALAASLVNPKEFELPRELQIPITFPGTDKIDPVSFKRGKHHNSNNINGKIHYHE 181
Query: 583 SGC---LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 639
G L R C + ++ CD C FH CL L P G+W C
Sbjct: 182 YGSITPLPARLCFVCRKSCRKAPLIACDYCPLYFHQDCLD----PPLTAFPSGRWMCPNH 237
Query: 640 CSRINSVLQNLLV 652
+ + QNLL
Sbjct: 238 LNHF--IDQNLLT 248
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG L+ CDGCPR+FH C L+ IP G W C C
Sbjct: 37 DGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDAC 73
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 340
>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
Length = 2012
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1736 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1786
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1787 RTVPDGRWSCERCCFCMRC 1805
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 748 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 807
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 808 AQGLSPVDQ-----RKCERLL 823
>gi|242011982|ref|XP_002426722.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
gi|212510893|gb|EEB13984.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
Length = 1196
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 537
DG N+L CD C R FH C LSS+P GDW+C C+
Sbjct: 1084 GDGENMLLCDSCDRGFHLYCLKPKLSSVPLGDWFCSGCR 1122
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C + G +LLCD C+R FH+ CLK L +P G WFC
Sbjct: 1077 CKVCRRGGDGENMLLCDSCDRGFHLYCLK----PKLSSVPLGDWFC 1118
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 333 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 380
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 381 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 427
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 428 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 464
>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
Length = 2011
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1735 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1785
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1786 RTVPDGRWSCERCCFCMRC 1804
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G+LL CDGCP +FH++C ++ +P+G W C C+ +
Sbjct: 1063 GDLLCCDGCPGSFHRQCIGVARLPEGKWLCPECKTV 1098
>gi|118384512|ref|XP_001025404.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89307171|gb|EAS05159.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 1453
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 544
G +L CD CP FH +C L S+P GDW C CQ ++R
Sbjct: 1401 GEVLMCDTCPSVFHLKCIGLKSLPDGDWSCLECQQKLLKQR 1441
>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
Full=Protein enhancer of yellow 3
gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
Length = 2006
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1730 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1780
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1781 RTVPDGRWSCERCCFCMRC 1799
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1418 SEGGSLLCCEACPAAFHRECLNM-EMPQGSWFCNDCK 1453
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1829 GDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 1867
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
Length = 1843
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1567 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1617
Query: 627 RELPKGKWFC-----CMDC 640
R +P G+W C CM C
Sbjct: 1618 RTVPDGRWSCERCCFCMRC 1636
>gi|356540950|ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800973 [Glycine max]
Length = 1735
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
G L+ CDGCP AFH C ++S +P+GDWYC C
Sbjct: 687 GCLICCDGCPAAFHSRCVGIASGHLPEGDWYCPEC 721
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 389 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 425
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 63/166 (37%), Gaps = 37/166 (22%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
EGY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 43 EGYETDHQDYCEVCQQ------------GGEIMLCDTCPRAYHLVCLDPELEEAPEGSWS 90
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVD-SVEQITKRCIRIVKNLEAELSGCLLCR-G 590
C +C+ K + + G+ +G + ++ K+ + C C G
Sbjct: 91 CPHCE-----KEGISMGSQV--EGKATGTKMAPDKSAKQVAAASPEKDEHQEFCTECHDG 143
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
D ++ C+ C +H+ CL + L +P+G W C
Sbjct: 144 GD----------LICCENCPVSYHLDCL----IPPLTNIPEGVWLC 175
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG+L+ C+ CP ++H +C L++IP+G W C C
Sbjct: 142 DGGDLICCENCPVSYHLDCLIPPLTNIPEGVWLCPRC 178
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|223996275|ref|XP_002287811.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976927|gb|EED95254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1562
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
+GG+L+ CD CPR FH C + S+P+G+W+C C
Sbjct: 311 EGGDLICCDNCPRVFHSNCHIPKIYSLPEGEWFCMLC 347
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 800 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 859
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 860 AQGLSPVDQ-----RKCERLL 875
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 812 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 871
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 872 AQGLSPVDQ-----RKCERLL 887
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 450 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 509
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 510 AQGLSPVDQ-----RKCERLL 525
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 899 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 958
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 959 AQGLSPVDQ-----RKCERLL 974
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 316 DGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 352
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 762 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 821
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 822 AQGLSPVDQ-----RKCERLL 837
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 919 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 978
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 979 AQGLSPVDQ-----RKCERLL 994
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL CD CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 687 NGGELLCCDKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHSSEK 744
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + + KR C R++
Sbjct: 745 RKADGLVKLTPVDKRKCERLL 765
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 919 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 978
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 979 AQGLSPVDQ-----RKCERLL 994
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 261 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 297
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 780 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 839
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 840 AQGLSPVDQ-----RKCERLL 855
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1417 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 1452
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1828 GDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1866
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 812 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 871
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 872 AQGLSPVDQ-----RKCERLL 887
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G+ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 241 AGEDEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 288
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 289 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 335
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 336 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 372
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 339 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 375
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 894 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 953
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 954 AQGLSPVDQ-----RKCERLL 969
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 979 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1038
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 1039 AQGLSPVDQ-----RKCERLL 1054
>gi|449456180|ref|XP_004145828.1| PREDICTED: uncharacterized protein LOC101215849 [Cucumis sativus]
gi|449510841|ref|XP_004163779.1| PREDICTED: uncharacterized LOC101215849 [Cucumis sativus]
Length = 1719
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 561
G+LL CDGCP A+H C + IPQG WYC C + +A+ V G+
Sbjct: 438 GSLLCCDGCPSAYHLRCIGMVKVLIPQGPWYCPECS--INKSEPTITKGSALRGAEVFGI 495
Query: 562 DSVEQI 567
D E I
Sbjct: 496 DPYEHI 501
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
GG+LL C+ CP AFH +C L +P+G W+C+ C
Sbjct: 908 GGDLLCCEMCPAAFHPQCLGLEDLPEGTWFCRDC 941
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1429 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 1464
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1840 GDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1878
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 44/79 (55%), Gaps = 6/79 (7%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
+GG LL CD CP+ FH C +L+ P G+W+C +C+++ + +++D N +
Sbjct: 722 NGGELLCCDKCPKVFHLSCHIPALNESPSGEWFCSFCRDLLNPE--MEYDCNRQDRPPSE 779
Query: 560 GVDSVEQITKRCIRIVKNL 578
VE+ ++C R++ L
Sbjct: 780 KFPLVER--RKCERLLLRL 796
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 824 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 883
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 884 AQGLSPVDQ-----RKCERLL 899
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 894 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 953
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 954 AQGLSPVDQ-----RKCERLL 969
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 526
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 356 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 403
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 404 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 458
Query: 587 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
LC CD C +H+ CL L ++P G+W C
Sbjct: 459 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 487
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 454 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|312380117|gb|EFR26201.1| hypothetical protein AND_07843 [Anopheles darlingi]
Length = 2310
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 20/95 (21%)
Query: 573 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 632
R V+ + + S C LC C+ + + ++ CDQC+R +H+ C LR LP G
Sbjct: 2030 RRVQQYKWQCSECKLCMKCNRKPAAIDSK-MVYCDQCDRGYHLAC------KGLRNLPDG 2082
Query: 633 KWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNA 665
+W C C CS+ + + PE H NA
Sbjct: 2083 RWHCSLCTICSQCGA-----------QTPEGHPNA 2106
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 767 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 826
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 827 AQGLSPVDQ-----RKCERLL 842
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|402590896|gb|EJW84826.1| chromodomain-helicase-DNA-binding protein 4 [Wuchereria bancrofti]
Length = 1519
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 54/136 (39%), Gaps = 42/136 (30%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG ++ CD CP+A+H C + P+G W C C E+ +
Sbjct: 179 GGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTC-----------------ESTGAAK 221
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
D E+ +I N+E CR C + G+ +L CD C +H CL
Sbjct: 222 DDEEEK------KITTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 261
Query: 621 HKMADLRELPKGKWFC 636
L E+P+G W C
Sbjct: 262 ---PSLTEIPEGDWSC 274
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 241 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 641
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GG L+ CD CP+AFH EC L +P+G W C+ C+
Sbjct: 1431 GGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCK 1467
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 809 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 868
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 869 AQGLSPVDQ-----RKCERLL 884
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 32/45 (71%), Gaps = 2/45 (4%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 545
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+ +R RF
Sbjct: 549 SEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDCK-AGKRPRF 591
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Query: 504 GNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 536
G+LL CDG C AFH +C LS+ P+G ++C+ C
Sbjct: 389 GDLLACDGHCYGAFHPQCIGLSAAPKGKFFCREC 422
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 841 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 900
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 901 AQGLSPVDQ-----RKCERLL 916
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 308 DGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSC 344
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 527 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 586
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 587 AQGLSPVDQ-----RKCERLL 602
>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
Length = 3312
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 502
C + L + KN + I C CN V PS Q AD
Sbjct: 3095 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 3154
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+L CD C R +H C L +PQG W+C+ C
Sbjct: 3155 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 3188
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 3126 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 3179
Query: 631 KGKWFC-----CMDCS 641
+G+W C C +CS
Sbjct: 3180 QGRWHCQECAVCANCS 3195
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 678 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 735
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 736 KTAQGLSPVDQ--RKCERLL 753
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 46/79 (58%), Gaps = 6/79 (7%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
+GG LL CD CP+ FH C +L+ P G+W+C +C+++ + +++D ++ +A
Sbjct: 718 NGGELLCCDKCPKVFHLACHIPTLNESPSGEWFCSFCRDLVSPE--MEYDCDSKDAPISE 775
Query: 560 GVDSVEQITKRCIRIVKNL 578
V++ ++C R++ L
Sbjct: 776 KFPPVDR--RKCERLLLRL 792
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 527 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 586
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 587 AQGLSPVDQ-----RKCERLL 602
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 912 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 969
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 970 KTAQGLSPVDQ--RKCERLL 987
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 1026 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1085
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 1086 AQGLSPVDQ-----RKCERLL 1101
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 290 DGGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRC 326
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 908 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 965
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 966 KTAQGLSPVDQ--RKCERLL 983
>gi|255079372|ref|XP_002503266.1| JmjN/JmjC protein [Micromonas sp. RCC299]
gi|226518532|gb|ACO64524.1| JmjN/JmjC protein [Micromonas sp. RCC299]
Length = 2663
Score = 48.1 bits (113), Expect = 0.019, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 574 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 633
+ + LE + +GC+ C G +S ++LCD C+R +H+ CL L ELP+G
Sbjct: 246 VAQQLEEQPAGCVNCGGTSHEES------MILCDGCDRGYHMYCLS----PPLDELPQGD 295
Query: 634 WFCCMDC 640
WF C DC
Sbjct: 296 WF-CPDC 301
Score = 43.5 bits (101), Expect = 0.46, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 12/72 (16%)
Query: 505 NLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
+++ CDGC R +H C S L +PQGDW+C C + +A + G SG
Sbjct: 268 SMILCDGCDRGYHMYCLSPPLDELPQGDWFCPDC---------IAAANDAEDIGFNSGKT 318
Query: 563 -SVEQITKRCIR 573
++EQ + C R
Sbjct: 319 FTIEQFKEECAR 330
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 377 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 436
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 437 AQGLSPVDQ-----RKCERLL 452
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 719 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 776
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 777 KTAQGLSPVDQ--RKCERLL 794
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 558 --VSGVDSVEQITKRCIRIV 575
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 910 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 967
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 968 KTAQGLSPVDQ--RKCERLL 985
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 856 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 913
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 914 KTAQGLSPVDQ--RKCERLL 931
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 908 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 965
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 966 KTAQGLSPVDQ--RKCERLL 983
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 283 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 319
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 641
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 47/163 (28%)
Query: 478 KNGLGIICH--CCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYC 533
+ GLG + H C S HA G L+ C CP +H EC LS + Q W+C
Sbjct: 1415 RRGLGALDHNELCQSC-------GHA--GQLILCHDCPIVYHCECLDPPLSKLTQDHWFC 1465
Query: 534 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 593
C V +G DS E++ + E+ +C C
Sbjct: 1466 PLC----------------VMDRTTNGADSEEEMGSN--------DGEIEHEDVCSRCRH 1501
Query: 594 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
++ CD C + FH+ C K LR++PKG W C
Sbjct: 1502 GGE------LICCDTCPKAFHMECCK----PVLRKVPKGHWEC 1534
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GG L+ CD CP+AFH EC L +P+G W C+ C+
Sbjct: 1502 GGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCK 1538
>gi|357616541|gb|EHJ70254.1| putative zinc finger protein [Danaus plexippus]
Length = 1432
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 58/148 (39%), Gaps = 39/148 (26%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC---QNMFERKRFLQHDANAVEAG 556
D N+L CD C + H C L+ +P+GDW+C C + +++R L D +
Sbjct: 1098 DPDNMLLCDSCNKGHHLYCLKPKLTKVPEGDWFCDQCKPTEKTPKKRRKLYTDPD----- 1152
Query: 557 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 616
D+++ ++ C A + C LC G G R C C R FH
Sbjct: 1153 -----DTLDDSSESCS------SAPVELCALC--------GSGGRLAASCRSCGRRFHAE 1193
Query: 617 CLKKHKMADLRELPKGKWFCCMDCSRIN 644
C G+ C DC++ N
Sbjct: 1194 CAPS----------GGRRAVCGDCAKPN 1211
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 583 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
S +L C + P +LLCD C + H+ CLK L ++P+G WFC
Sbjct: 1082 SASVLHASCRLCRRRTDPDNMLLCDSCNKGHHLYCLK----PKLTKVPEGDWFC 1131
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 776 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 833
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 834 KTAQGLSPVDQ--RKCERLL 851
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1772 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1807
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2182 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2220
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +E
Sbjct: 709 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPTHNLEK 766
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 767 KKTEGLVKLTPIDKRKCERLL 787
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 44/114 (38%), Gaps = 3/114 (2%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-QNMFERKRFLQHDANAVEAGRV 558
DGG L+ CDGCPRAFH C S L IP G W C C Q ++ ++ +E
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRIQQNLSQPEESRPLEPSAE 361
Query: 559 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 612
+ ++ C L C F + P T L C C E
Sbjct: 362 TPGPTLSARCGVCGDSTDVLRCAHCAAAFHWRCHFPMAAVRPGTNLRCKSCSAE 415
>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
Length = 1472
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 26/103 (25%)
Query: 460 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS------------------------ 495
D T++ C + L + KN + I C CN V PS
Sbjct: 1246 DDTKLKCKMCLKVLNKHNKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1305
Query: 496 --QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
Q AD +L CD C R +H C L +PQG W+C+ C
Sbjct: 1306 CVQCHDPADEDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 1348
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 572 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 631
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P+
Sbjct: 1287 LDMVPHIQSYAWQCTDCKTCVQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVPQ 1340
Query: 632 GKWFC-----CMDC 640
G+W C C +C
Sbjct: 1341 GRWHCQECAVCANC 1354
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1397 SEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDCR 1432
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +LS P G W C + Q
Sbjct: 1807 GDGGQIVSCKKPGCPKVYHADCLNLSKRPAGRWECPWHQ 1845
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 554
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 971 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1030
Query: 555 AGRVSGVDSVEQITKRCIRIV 575
A +S VD ++C R++
Sbjct: 1031 AQGLSPVDQ-----RKCERLL 1046
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 11/85 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 678 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 735
Query: 556 GRVSGVDSVEQITKRC---IRIVKN 577
+ G+ + I KR +I+KN
Sbjct: 736 KKTEGLVKLTPIDKRVPDYYKIIKN 760
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 128 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 185
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 186 KTAQGLSPVDQ--RKCERLL 203
>gi|219110357|ref|XP_002176930.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411465|gb|EEC51393.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 2413
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G+LL CDGC H +C L+S P+GDW+C+ C
Sbjct: 1904 GDLLCCDGCANVVHGKCIGLTSFPEGDWFCEEC 1936
>gi|407918848|gb|EKG12110.1| Zinc finger PHD-type protein [Macrophomina phaseolina MS6]
Length = 565
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 498 EAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ----NMFERKRFLQHDANAV 553
E D L+ CD C + H CA L +P G+WYC++C + +R+R QH+ +AV
Sbjct: 66 EDFGDEDQLMLCDSCDKLCHVFCAGLDEVPAGEWYCQHCMEDPYTLGQRERERQHNRSAV 125
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 14/64 (21%)
Query: 578 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC- 636
LE + C++C DF ++LCD C++ HV C A L E+P G+W+C
Sbjct: 55 LEPPVDPCMVCE--DFGDED----QLMLCDSCDKLCHVFC------AGLDEVPAGEWYCQ 102
Query: 637 -CMD 639
CM+
Sbjct: 103 HCME 106
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 59/148 (39%), Gaps = 44/148 (29%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 560
GG L+ CD CP +FH +C L +P W C+ C L+ +++ +E G G
Sbjct: 1010 GGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLC--------VLEAESSPLE-GCSDG 1060
Query: 561 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 620
DS + RC + + ++LCD C FH+ C
Sbjct: 1061 TDSHCDVCARCYKHGQ--------------------------LILCDVCPLAFHLRCTD- 1093
Query: 621 HKMADLRELPKGKW---FCCMDCSRINS 645
L ++P GKW C DC ++S
Sbjct: 1094 ---PPLLKVPSGKWTCQICVKDCQPVSS 1118
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 67/161 (41%), Gaps = 30/161 (18%)
Query: 489 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNM 539
+ E + S+ ++H D G L+ C+ CP A+H +CA+ L IP G W C+ C +
Sbjct: 899 DDESNNSEEDSHCDECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVCTDK 958
Query: 540 FERK----RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 595
++K +F + S + + IV + ++ C CR +
Sbjct: 959 SQKKPTGIKFKGKHRKGLLPTSSSPSSLSSDL--ETLGIVADGHSDR--CARCR-----R 1009
Query: 596 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
G ++LCD C FH+ C+ L +P W C
Sbjct: 1010 GG----ELILCDSCPLSFHLDCVD----PPLLGVPPDIWLC 1042
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 28/60 (46%), Gaps = 8/60 (13%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC------QNMFERKRFLQHDANA 552
D +L CDGC R H C + SIP GDWYC C QN R++ D ++
Sbjct: 785 GDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDCRPKIVKQNSRRRRKSTLEDYDS 844
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 16 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 52
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1637 SEGGSLLCCEACPAAFHRECLNI-EMPQGSWFCNDCK 1672
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2048 GDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 2086
>gi|320170020|gb|EFW46919.1| hypothetical protein CAOG_04877 [Capsaspora owczarzaki ATCC 30864]
Length = 1096
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 7/63 (11%)
Query: 476 GYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 535
G + I+C C S S DG N+L CDGC A H+ C + S+P+G W+C
Sbjct: 507 GVEFNHDIVCDVCLSGDS-------EDGNNILFCDGCNLAVHQACYGVESVPEGAWFCYP 559
Query: 536 CQN 538
C +
Sbjct: 560 CAH 562
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 222 DGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 266
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +
Sbjct: 738 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHNSDK 795
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
++ G+ + I KR C R++
Sbjct: 796 RKIEGLSKLTPIDKRKCERLL 816
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 311 DGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCC 347
>gi|432852260|ref|XP_004067159.1| PREDICTED: uncharacterized protein LOC101164387 [Oryzias latipes]
Length = 1310
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQHDANAVEAGRVS 559
GG LL CD CP+ FH C L S P GDW C C++ + + ++ + A A +
Sbjct: 1115 GGELLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDAIQPEVQYNCENERASGANPLH 1174
Query: 560 GVDSVEQITKRCIRIVK 576
G+ + +Q + +I+K
Sbjct: 1175 GLSACDQRARHYYQIIK 1191
>gi|443689527|gb|ELT91900.1| hypothetical protein CAPTEDRAFT_216422 [Capitella teleta]
Length = 1564
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 24/111 (21%)
Query: 556 GRVSGVDSVEQ------------ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 603
GRV V++V+ + + C++ K+ AE + C +CR K G + +
Sbjct: 1140 GRVRWVEAVKSCTTWSRLHLLMSVMESCMKWEKS--AENAKCKICR-----KKGEEEK-V 1191
Query: 604 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 654
LLCD C + FH+ CL+ L E+PKG+WFC R V N+ +E
Sbjct: 1192 LLCDDCNQPFHLYCLR----PALYEVPKGEWFCAACAPRTRRVKTNVNYRE 1238
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 60/139 (43%), Gaps = 32/139 (23%)
Query: 506 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE-AG-----R 557
+L CD C + FH C +L +P+G+W+C C R R ++ + N E AG R
Sbjct: 1191 VLLCDDCNQPFHLYCLRPALYEVPKGEWFCAACA---PRTRRVKTNVNYRELAGEENDKR 1247
Query: 558 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 617
+ +S E+ R + IV E C +C G + ++ C C FH+ C
Sbjct: 1248 IVDSNSEEE---REVDIVHEQE-----CTMCGGDE---------GLVNCSTCVCAFHLEC 1290
Query: 618 LKKHKMADLRELPKGKWFC 636
LR +P+ W C
Sbjct: 1291 HD----PPLRHIPRSIWRC 1305
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 233 DGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 277
>gi|145546835|ref|XP_001459100.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426923|emb|CAK91703.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 509 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 543
CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 880 CDTCPKVFHPKCINLKEVPQGKWNCLNCLKNFERQ 914
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 502 DGGNLLPCDGCPRAFHKE--CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 554
DGG L+ CDGCPRAFH C L IP G W C C + R LQ +A E
Sbjct: 312 DGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCL----QGRALQDTPHAEE 362
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +
Sbjct: 277 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHNSDK 334
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
++ G+ + I KR C R++
Sbjct: 335 RKIEGLSKLTPIDKRKCERLL 355
>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
Length = 2805
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 76/211 (36%), Gaps = 57/211 (27%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 502
C + L + KN + I C CN V PS Q AD
Sbjct: 2588 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2647
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ---NMFERKRFLQHDANAVEAGRVS 559
+L CD C R +H C L +PQG W+C+ C N R+ G S
Sbjct: 2648 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQECAVCVNCGSREP----------GGINS 2697
Query: 560 GVDSVEQITKRCIRIVKNLEAELSG-CLLC-------RGCDF-SKSGFGPR-----TILL 605
+SV Q + KN +S C+ C R C S+ PR ++
Sbjct: 2698 DRNSVAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEVNLVH 2757
Query: 606 CDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
C C++ H+GC++ M L + + C
Sbjct: 2758 CSACDKYLHLGCVETKGMP----LDRKNYLC 2784
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2619 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2672
Query: 631 KGKWFC-----CMDC 640
+G+W C C++C
Sbjct: 2673 QGRWHCQECAVCVNC 2687
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1597 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1632
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2007 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2045
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 10/83 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 15 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 72
Query: 558 --VSGVDSVEQITKRCIRIVKNL 578
G+ V+Q ++C R++ L
Sbjct: 73 KTAQGLSPVDQ--RKCERLLLYL 93
>gi|255084682|ref|XP_002504772.1| SNF2 super family [Micromonas sp. RCC299]
gi|226520041|gb|ACO66030.1| SNF2 super family [Micromonas sp. RCC299]
Length = 1710
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 13/95 (13%)
Query: 456 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRA 515
+G P+G +A G++ L CH C E P + D ++ CDGC
Sbjct: 399 AGGPEGQRATLWAAGEQDL----------CHICG-EADPDFWNIEND--CIVMCDGCDVQ 445
Query: 516 FHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 550
H C LS +P+G+W+C+ C + + R DA
Sbjct: 446 VHLSCYGLSEVPEGEWFCQGCIDGIKVGRDRPDDA 480
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1720 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1755
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2166
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 55/164 (33%), Gaps = 59/164 (35%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 532
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 318 DGYETEHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCFDPELEEPPEGKWS 365
Query: 533 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 592
C +C+ G+ E E CR C
Sbjct: 366 CPHCEG--------------------EGIKEQE---------------EDDHMEFCRVC- 389
Query: 593 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
K G +L CD C +HV CL ++ +P G+W C
Sbjct: 390 --KDG---GELLCCDTCPSAYHVHCLN----PPMKMIPDGEWHC 424
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + + IP G+W+C C
Sbjct: 391 DGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRC 427
>gi|255086445|ref|XP_002509189.1| predicted protein [Micromonas sp. RCC299]
gi|226524467|gb|ACO70447.1| predicted protein [Micromonas sp. RCC299]
Length = 517
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G+ + CDGCPR H +C L +P GDW+C C
Sbjct: 351 GDFVLCDGCPRGGHYDCLGLDGVPAGDWFCAGC 383
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 245 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 290
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 9/80 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 902 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 959
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ VEQ K C R++
Sbjct: 960 KTAQGLFEVEQKIK-CERLL 978
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG LL CD CP A+H C +L +IP GDW C C
Sbjct: 300 DGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 241 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 276
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 12 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 69
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 70 KKTEGLVKLTPIDKRKCERLL 90
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG LL CD CP A+H C +L +IP GDW C C
Sbjct: 442 DGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 383 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 418
>gi|326432230|gb|EGD77800.1| hypothetical protein PTSG_08890 [Salpingoeca sp. ATCC 50818]
Length = 1086
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 47/110 (42%), Gaps = 12/110 (10%)
Query: 439 WNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLE--------GYKNGLGIICHCC-- 488
+++ +D+ KL + L DG E+ Y+ + + E K G CH
Sbjct: 272 YDLDTEDEIWRKLFNERHRLKDGVEISYHHMSRAIFEFERAAFRGMTKAGGKASCHGLEY 331
Query: 489 --NSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
N+ Q +G ++ CDGC H+ C + IP+G+WYC C
Sbjct: 332 DENTVCDVCQLPDSEEGNEMVFCDGCNLCVHQVCYGIKVIPEGNWYCCAC 381
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 443 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 478
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPC--DGCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 855 DGGQLVSCKKSGCPKVYHADCLNLTKRPAGKWECPWHQ 892
>gi|410898760|ref|XP_003962865.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Takifugu rubripes]
Length = 1329
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 537
D N+L CDGC R H C L ++PQGDW+C C+
Sbjct: 1059 DADNMLLCDGCDRGHHTHCLRPRLKAVPQGDWFCPDCR 1096
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 69/164 (42%), Gaps = 38/164 (23%)
Query: 508 PCDGC------PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV--S 559
PC G R+ H +LS I QG ER RFL+ + E GRV +
Sbjct: 963 PCSGSTTPQVISRSVHHLAEALSHIEQG----------IER-RFLKPPLDGSEGGRVCKT 1011
Query: 560 GVDSVEQITKRCIRI------VKNLEAEL--SGCLLCRGCDFSKSGFGPRTILLCDQCER 611
++ + + C + + +LE + S +L C + +LLCD C+R
Sbjct: 1012 VLERWRESLQSCTSLSQVFVHLSSLERSVLWSRSILNARCRICRRKGDADNMLLCDGCDR 1071
Query: 612 EFHVGCLKKHKMADLRELPKGKWFCCMDC------SRINSVLQN 649
H CL+ L+ +P+G WF C DC SR+ S Q+
Sbjct: 1072 GHHTHCLRPR----LKAVPQGDWF-CPDCRPKQRSSRLTSRQQH 1110
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 64/166 (38%), Gaps = 42/166 (25%)
Query: 475 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWY 532
+GY+ C C GG ++ CDGCPRA+H C L P+G W
Sbjct: 337 DGYETDHQDYCEVCKQ------------GGEIILCDGCPRAYHLVCLEPPLDQPPEGSWP 384
Query: 533 CKYCQNMFERKRFLQHDANAVEAG-RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-G 590
C C N ++ R + D + +N++ + C C+ G
Sbjct: 385 CPTCVK------------NGIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDG 432
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
D +L+CD C +H+ CL + ++P+G+W C
Sbjct: 433 GD----------LLICDTCPHSYHLNCLN----PPVEKVPEGEWSC 464
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQHDANAVEAGRV 558
+GG LL C+ CP+ FH C +L + P G+W C +C+++ + + D + E ++
Sbjct: 828 NGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSRPEVEYDCDDPSLAEKRKL 887
Query: 559 SGVDSVEQITKR-CIRIV 575
G S+ I +R C RI+
Sbjct: 888 GGAQSMAPIDQRKCERIL 905
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 71/185 (38%), Gaps = 42/185 (22%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 379 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 426
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 587
+G W C +C+ + + + E G + + + C R+ K+ G LL
Sbjct: 427 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELL 480
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 646
C CD C +H+ CL L ++P G+W C C +
Sbjct: 481 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 519
Query: 647 LQNLL 651
+Q +L
Sbjct: 520 VQKIL 524
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 82 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 139
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 140 KKTEGLVKLTPIDKRKCERLL 160
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 122 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN 158
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 251 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCC 287
>gi|260791426|ref|XP_002590730.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
gi|229275926|gb|EEN46741.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
Length = 1073
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECASL----SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 557
+GG+LL CD CP AFH +C +P+G+W C C + + D A A
Sbjct: 58 EGGDLLCCDRCPAAFHLQCCDPPLCEEDLPEGEWLCHRCMVLEQFPELDDRDETASNASV 117
Query: 558 VSGVDSVEQITKRCIRIVK 576
S S +Q R +IV+
Sbjct: 118 ASSTASYKQRNLRDKKIVR 136
>gi|47220585|emb|CAG05611.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 47.8 bits (112), Expect = 0.029, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 537
+GG LL CD CP+ +H C LS PQGDW C C+
Sbjct: 10 NGGELLCCDRCPKVYHLSCHLPPLSGFPQGDWVCTLCR 47
>gi|346970552|gb|EGY14004.1| origin recognition complex subunit 4 [Verticillium dahliae VdLs.17]
Length = 851
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 509 CDGCPRAFHKECASLSSIPQGDWYCKYC 536
CDGC +A H++C + IP+GDW+CK C
Sbjct: 374 CDGCDKAVHQKCYDVHDIPEGDWFCKEC 401
>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
Length = 2857
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 502
C + L + KN + I C CN V PS Q AD
Sbjct: 2700 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2759
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+L CD C R +H C L +PQG W+C+ C
Sbjct: 2760 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 2793
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 29/61 (47%), Gaps = 11/61 (18%)
Query: 585 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC-----CMD 639
C C+ C +L CD C+R +H+ C+ LR +P+G+W C C++
Sbjct: 2745 CTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVPQGRWHCQECAVCVN 2798
Query: 640 C 640
C
Sbjct: 2799 C 2799
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 59/129 (45%), Gaps = 20/129 (15%)
Query: 498 EAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 555
+A DGG+LL C+ C AFH C +SS+P+GDW+C C+ K +H + +
Sbjct: 93 DACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSCEQNKNPKP--KHSKSILS- 149
Query: 556 GRVSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKSGFGPRTILLC--DQC 609
S DS++ + C + + N + S C +C G D + +L C +C
Sbjct: 150 ---SLFDSLDTLNPSCFTLPEEYLLNNSFKQSFCNVCDGDDSMED------MLHCSHSKC 200
Query: 610 EREFHVGCL 618
H CL
Sbjct: 201 RISVHTYCL 209
>gi|397511403|ref|XP_003826065.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Pan
paniscus]
Length = 1105
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|296200654|ref|XP_002747672.1| PREDICTED: protein kinase C-binding protein 1 isoform 9 [Callithrix
jacchus]
Length = 1107
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|426392025|ref|XP_004062362.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Gorilla
gorilla gorilla]
Length = 1105
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 537
++GG+LL C+ CP AFH+EC ++ +P+G W+C C+
Sbjct: 1533 SEGGSLLCCESCPAAFHRECLNI-DMPEGSWFCNDCK 1568
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 537
DGG L+ C GCP+ +H EC L+ P G W C + Q
Sbjct: 1941 GDGGQLVSCKKPGCPKVYHAECLKLTRRPAGKWECPWHQ 1979
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|119596117|gb|EAW75711.1| protein kinase C binding protein 1, isoform CRA_h [Homo sapiens]
Length = 1105
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|114682470|ref|XP_001163886.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Pan
troglodytes]
Length = 1105
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|297259644|ref|XP_002798155.1| PREDICTED: protein kinase C-binding protein 1-like isoform 7
[Macaca mulatta]
Length = 1105
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|403290903|ref|XP_003936546.1| PREDICTED: protein kinase C-binding protein 1 isoform 9 [Saimiri
boliviensis boliviensis]
Length = 1108
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|193784671|dbj|BAG53824.1| unnamed protein product [Homo sapiens]
Length = 1105
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 554
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
GG+++ CD CP +H +C L SIP G+W+C C
Sbjct: 186 GGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPAC 219
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 501 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 544
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 272 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAELGRLR 317
>gi|307111603|gb|EFN59837.1| hypothetical protein CHLNCDRAFT_133598 [Chlorella variabilis]
Length = 1305
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 502 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 538
D G+LL CDGCP +H C L +P G W+C C +
Sbjct: 1191 DQGDLLCCDGCPSVYHPRCCGLGGVPPGRWFCPVCSD 1227
>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
Length = 1982
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 9/70 (12%)
Query: 567 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 626
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1718 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCL------GL 1768
Query: 627 RELPKGKWFC 636
R +P G+W C
Sbjct: 1769 RTVPDGRWSC 1778
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 799 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDATGHSSEK 856
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 857 KKSDGLGRLMPIDKRKCERLL 877
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 122 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN 158
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 833 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDATGHSSEK 890
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 891 KKSDGLGRLMPIDKRKCERLL 911
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 603 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 661
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 55/139 (39%), Gaps = 23/139 (16%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
D G LL CD C ++H C L ++P+ W CK+C + + A G
Sbjct: 1050 DPGRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCTQ--------CGATTPGLRC 1101
Query: 560 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 619
+ + C A LS C +C D+S+ I+ C QC+R FH C
Sbjct: 1102 EWQNNYTLCAPC--------ASLSTCPICL-VDYSEGTI----IVQCRQCDRWFHASCQS 1148
Query: 620 KHKMADLRELPKGKWFCCM 638
H D+ + + C M
Sbjct: 1149 LHSEEDIEKAADSSFDCTM 1167
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 572 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 631
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1023 IKITKVVLSKGWRCLECTVCEACGQATDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 1078
Query: 632 GKWFC--CMDCSRINSVLQNL 650
W C C+ C++ + L
Sbjct: 1079 DSWKCKWCVSCTQCGATTPGL 1099
>gi|432937609|ref|XP_004082462.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Oryzias latipes]
Length = 1475
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 537
D N+L CDGC R H C L S+P+GDW+C C+
Sbjct: 1148 DADNMLLCDGCDRGHHTHCLRPRLKSVPEGDWFCPDCR 1185
>gi|268557732|ref|XP_002636856.1| C. briggsae CBR-LET-418 protein [Caenorhabditis briggsae]
Length = 1849
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 55/138 (39%), Gaps = 43/138 (31%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 562
GG L+ CD CPRA+H C P+GDW C +C ++H ++
Sbjct: 264 GGELVLCDTCPRAYHTGCMD-EDPPEGDWSCPHC---------IEHGPEVIK-------- 305
Query: 563 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
E+ TK+ C +C+ + +LLCD C FH C+
Sbjct: 306 --EEPTKQNDDF----------CKICKETE---------NLLLCDSCVCAFHAYCID--- 341
Query: 623 MADLRELPKGKWFCCMDC 640
L ++P+ + + C C
Sbjct: 342 -PPLTQVPQEETWACPRC 358
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSC 338
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 418 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSC 454
>gi|302423122|ref|XP_003009391.1| origin recognition complex subunit 4 [Verticillium albo-atrum
VaMs.102]
gi|261352537|gb|EEY14965.1| origin recognition complex subunit 4 [Verticillium albo-atrum
VaMs.102]
Length = 869
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 509 CDGCPRAFHKECASLSSIPQGDWYCKYC 536
CDGC +A H++C + IP+GDW+CK C
Sbjct: 373 CDGCDKAVHQKCYGVHDIPEGDWFCKEC 400
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 451 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 487
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 392 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 427
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 503 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 537
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 129 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN 165
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 802 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 859
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 860 KKTEGLVKLTPIDKRKCERLL 880
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 784 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 841
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 842 KKTEGLVKLTPIDKRKCERLL 862
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 14/55 (25%)
Query: 484 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
CH C DGG LL CD CP ++H C + L IP+GDW C C
Sbjct: 2 FCHVC------------KDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRC 44
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 18/87 (20%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------ERKRFLQHDAN 551
+GG LL CD CP+ FH C +L S P G+W C C+N E R+ H N
Sbjct: 930 NGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCENTRYA-HSYN 988
Query: 552 AVEAGRVSGVDSVEQITKRCIRIVKNL 578
A G+D +Q K+C ++V +L
Sbjct: 989 A-----QYGLDDYDQ--KKCEKLVLSL 1008
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 557
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + ++ +
Sbjct: 715 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--SKPEVEYDCDNLQHSKKG 772
Query: 558 --VSGVDSVEQITKRCIRIV 575
G+ V+Q ++C R++
Sbjct: 773 KTAQGLSPVDQ--RKCERLL 790
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 13 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 49
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 745 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 802
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 803 KKTEGLVKLTPIDKRKCERLL 823
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 440 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 381 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHISEK 857
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 436 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 377 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 412
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 432 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 373 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 408
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 430 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 371 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 406
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHISEK 891
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 745 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 802
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 803 KKTEGLVKLTPIDKRKCERLL 823
>gi|400599137|gb|EJP66841.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
2860]
Length = 1226
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 95/247 (38%), Gaps = 42/247 (17%)
Query: 372 TMTYT---TGIRISSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLI---SKPFE 425
T YT + +R SS R ++TPV+ +S +S T ++ ++ +P E
Sbjct: 633 TNNYTAAESAVRGSSQR----RDTTPVSKTIRSRRSLANPPPTTRTTRSATKRPNDQPDE 688
Query: 426 NASP-PLSF---------------PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYAC 469
SP PLS P R PK R+ +SG P G G
Sbjct: 689 TVSPIPLSLTGDETSTVGGSRAVTPTAQRQAKRPKGLRVKSSPVKKSGGPAG---GLPRL 745
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLS---SI 526
+ ++ G N E A + G++L CDGCPR+FH EC +L+ +
Sbjct: 746 ASESQSSFRAGTPKDPATDNDEF----CSACGNAGDVLCCDGCPRSFHFECVNLAQSEDL 801
Query: 527 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 586
P DWYC C R H + S ++++E+ R + K ++ G
Sbjct: 802 PD-DWYCNECIVRRFPSRVPIH-----KGAFASALNNLEKSIPRAFSLPKRIQNRFEGVK 855
Query: 587 LCRGCDF 593
D+
Sbjct: 856 AGPDGDY 862
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 18/87 (20%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------ERKRFLQHDAN 551
+GG LL CD CP+ FH C +L S P G+W C C+N E R+ H N
Sbjct: 929 NGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCENTRYA-HSYN 987
Query: 552 AVEAGRVSGVDSVEQITKRCIRIVKNL 578
A G+D +Q K+C ++V +L
Sbjct: 988 A-----QYGLDDYDQ--KKCEKLVLSL 1007
>gi|346971621|gb|EGY15073.1| hypothetical protein VDAG_06563 [Verticillium dahliae VdLs.17]
Length = 653
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 540
LL CDGC A+H C L+ +P G WYC C ++F
Sbjct: 134 LLLCDGCDAAYHTHCVGLNHVPAGSWYCLECVDIF 168
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 440 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 381 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 796 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 853
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 854 KKTEGLVKLTPIDKRKCERLL 874
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 436 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 377 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 412
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 446 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 387 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 422
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 37/57 (64%), Gaps = 4/57 (7%)
Query: 501 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 557
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+ +R RF D V+ GR
Sbjct: 214 SEGGSLLCCESCPAAFHQECLNM-EMPQGSWFCNDCK-AGKRPRF--KDILWVKWGR 266
>gi|47195997|emb|CAF91487.1| unnamed protein product [Tetraodon nigroviridis]
Length = 138
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 538
+ G L PC CPRAFH C L + P+G WYC CQ
Sbjct: 97 EDGELQPCRSCPRAFHPSCLHPPLKTPPRGPWYCPKCQK 135
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 713 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 770
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 771 KKTEGLVKLTPIDKRKCERLL 791
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 159 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 216
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 217 KKTEGLVKLTPIDKRKCERLL 237
>gi|312371268|gb|EFR19500.1| hypothetical protein AND_22323 [Anopheles darlingi]
Length = 1628
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 11/61 (18%)
Query: 580 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 639
AE C++CR P LLCD+C R H+ CLK L+E+P G WF CM
Sbjct: 1168 AERISCMICR------RKVDPDLTLLCDECNRACHIYCLK----PKLKEVPAGDWF-CMK 1216
Query: 640 C 640
C
Sbjct: 1217 C 1217
>gi|384487310|gb|EIE79490.1| hypothetical protein RO3G_04195 [Rhizopus delemar RA 99-880]
Length = 339
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 506 LLPCDGCPRAFHKECASLSSIPQGDWYC-KYCQNMFERKRFL 546
L+ CDGCP+AFH+EC L P WYC C + +RKR +
Sbjct: 272 LIFCDGCPKAFHQECKELDKQPDTPWYCTDTCCDNLKRKRVV 313
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 739 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 796
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 797 KKTEGLVKLTPIDKRKCERLL 817
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 295 DGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCC 331
>gi|291230097|ref|XP_002735005.1| PREDICTED: tripartite motif-containing 28 protein-like
[Saccoglossus kowalevskii]
Length = 995
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 559
+GG+LL CD CP+ FH +C SL++ P+ W C CQ++ + + + + + S
Sbjct: 801 NGGDLLCCDTCPKVFHLQCHIPSLTATPKETWICGLCQDLCKEIQGISENDEHGKRKASS 860
Query: 560 GVDSVEQITKRCIRIVKNL 578
G+ Q K C RI+ L
Sbjct: 861 GLSEAHQ--KICERILLEL 877
>gi|71028052|ref|XP_763669.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350623|gb|EAN31386.1| hypothetical protein TP04_0034 [Theileria parva]
Length = 250
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 574 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 633
IV + C +C C+ S+SG RT+L+CD C+R FH+ C + E+P G
Sbjct: 83 IVIRYPWHCNSCKICVKCNDSESGVS-RTLLICDSCDRAFHMECTRNK----YTEVPSGN 137
Query: 634 WFC 636
WFC
Sbjct: 138 WFC 140
Score = 42.7 bits (99), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 55/134 (41%), Gaps = 8/134 (5%)
Query: 506 LLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 563
LL CD C RAFH EC + +P G+W+C CQ + E G
Sbjct: 111 LLICDSCDRAFHMECTRNKYTEVPSGNWFCDECQYCKSCDIKFSEQSIITEGYDNEG--- 167
Query: 564 VEQITKRC-IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 622
++ + C ++ + +A +S C +C + G ++C +C + H C + +
Sbjct: 168 -NKLCQSCMLKRKRGKDAGISYCCVCSK-SINPIGIHQNRRVVCQKCCQNVHPNCSRLEE 225
Query: 623 MADLRELPKGKWFC 636
AD + W C
Sbjct: 226 KADKFREGEESWIC 239
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 69/185 (37%), Gaps = 41/185 (22%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 243 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 290
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 587
+G W C +C+ + + + E G G E R+ K+ G LL
Sbjct: 291 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELL 345
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 646
C CD C +H+ CL L ++P G+W C C +
Sbjct: 346 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 384
Query: 647 LQNLL 651
+Q +L
Sbjct: 385 VQKIL 389
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 717 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 774
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 775 KKTEGLVKLTPIDKRKCERLL 795
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 44/95 (46%), Gaps = 16/95 (16%)
Query: 572 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 631
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 767 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 822
Query: 632 GKWFC-----------CMDCSR-INSVLQNLLVQE 654
G W C C C R +++V QNL +E
Sbjct: 823 GGWKCKCYREEDLILQCRQCDRWMHAVCQNLSTEE 857
>gi|363745215|ref|XP_003643224.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Gallus gallus]
Length = 137
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 555
DGG L+ CDGCPRAFH C L +P G W C C R R A + A
Sbjct: 10 DGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLREADTAAEQLPA 65
>gi|222637620|gb|EEE67752.1| hypothetical protein OsJ_25457 [Oryza sativa Japonica Group]
Length = 1646
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 438 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 472
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 726 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 783
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 784 KKTEGLVKLTPIDKRKCERLL 804
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 196 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 253
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 254 KKTEGLVKLTPIDKRKCERLL 274
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 315 DGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCC 351
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 107 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 69/185 (37%), Gaps = 41/185 (22%)
Query: 470 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 527
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 243 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 290
Query: 528 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 587
+G W C +C+ + + + E G G E R+ K+ G LL
Sbjct: 291 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELL 345
Query: 588 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 646
C CD C +H+ CL L ++P G+W C C +
Sbjct: 346 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 384
Query: 647 LQNLL 651
+Q +L
Sbjct: 385 VQKIL 389
>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
Length = 3066
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 469 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 502
C + L + KN + I C CN V PS Q AD
Sbjct: 2848 CLKVLNKHNKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2907
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
+L CD C R +H C L +PQG W+C+ C
Sbjct: 2908 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 2941
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 11/75 (14%)
Query: 571 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 630
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2879 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2932
Query: 631 KGKWFC-----CMDC 640
+G+W C C +C
Sbjct: 2933 QGRWHCQECAVCANC 2947
>gi|164423528|ref|XP_962530.2| hypothetical protein NCU08317 [Neurospora crassa OR74A]
gi|157070133|gb|EAA33294.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 961
Score = 47.0 bits (110), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G ++ CDGC +A H++C + +P+GDW+C+ C
Sbjct: 406 GNQIVFCDGCDKAIHQKCYGIPRLPKGDWFCREC 439
>gi|297259646|ref|XP_002798156.1| PREDICTED: protein kinase C-binding protein 1-like isoform 8
[Macaca mulatta]
Length = 1054
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|297259642|ref|XP_002798154.1| PREDICTED: protein kinase C-binding protein 1-like isoform 6
[Macaca mulatta]
Length = 1135
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|383417149|gb|AFH31788.1| protein kinase C-binding protein 1 isoform c [Macaca mulatta]
Length = 1135
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|380786007|gb|AFE64879.1| protein kinase C-binding protein 1 isoform c [Macaca mulatta]
Length = 1135
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|50510226|dbj|BAD31424.1| PHD finger transcription factor-like protein [Oryza sativa Japonica
Group]
Length = 1696
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|345790163|ref|XP_003433333.1| PREDICTED: protein kinase C-binding protein 1 [Canis lupus
familiaris]
Length = 1094
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|115473879|ref|NP_001060538.1| Os07g0661500 [Oryza sativa Japonica Group]
gi|113612074|dbj|BAF22452.1| Os07g0661500 [Oryza sativa Japonica Group]
Length = 1752
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|383417151|gb|AFH31789.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
gi|384939240|gb|AFI33225.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
Length = 1160
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|296200646|ref|XP_002747668.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Callithrix
jacchus]
Length = 1188
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|125559491|gb|EAZ05027.1| hypothetical protein OsI_27209 [Oryza sativa Indica Group]
Length = 1696
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 504 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 536
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|444706940|gb|ELW48255.1| Protein kinase C-binding protein 1 [Tupaia chinensis]
Length = 997
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 47 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 82
>gi|86143624|gb|ABC86691.1| RACK7 isoform l [Homo sapiens]
Length = 1134
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|403290905|ref|XP_003936547.1| PREDICTED: protein kinase C-binding protein 1 isoform 10 [Saimiri
boliviensis boliviensis]
Length = 1189
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|452822553|gb|EME29571.1| NuA3 HAT complex component NTO1 [Galdieria sulphuraria]
Length = 1342
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G +++ CDGC A H+ C + SIP+GDW+C C
Sbjct: 939 GNDIILCDGCHVAVHQTCYGVRSIPEGDWFCSSC 972
>gi|426392027|ref|XP_004062363.1| PREDICTED: protein kinase C-binding protein 1 isoform 6 [Gorilla
gorilla gorilla]
Length = 1186
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
Length = 690
Score = 47.0 bits (110), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 556
GG L+ C+ CP AFH EC S IP+G +YCK C E K L D V+ G
Sbjct: 154 GGTLICCESCPAAFHPECISYEGIPEGRFYCKDC---VEGKSLLYGDIVWVKLG 204
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 501 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYC--KYCQNMFERKRFL 546
DGG L+ CD GC + +H +C +L PQG W C +C N +R L
Sbjct: 560 GDGGQLIMCDRSGCLKCYHVDCLNLDKKPQGRWQCPWHFCDNCGKRATVL 609
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 446 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 503 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 536
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 387 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 422
>gi|86143432|gb|ABC86688.1| RACK7 isoform i [Homo sapiens]
Length = 1088
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|73992168|ref|XP_866849.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Canis
lupus familiaris]
Length = 1141
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|34335264|ref|NP_898869.1| protein kinase C-binding protein 1 isoform c [Homo sapiens]
gi|86143420|gb|ABC86682.1| RACK7 isoform c [Homo sapiens]
gi|119596121|gb|EAW75715.1| protein kinase C binding protein 1, isoform CRA_l [Homo sapiens]
Length = 1135
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|410255052|gb|JAA15493.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410341077|gb|JAA39485.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1160
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|410216320|gb|JAA05379.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410306756|gb|JAA31978.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1160
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|384946172|gb|AFI36691.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1163
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|380786053|gb|AFE64902.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
Length = 1160
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|332858654|ref|XP_003317032.1| PREDICTED: protein kinase C-binding protein 1 [Pan troglodytes]
Length = 1186
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|301614223|ref|XP_002936596.1| PREDICTED: hypothetical protein LOC100485119 [Xenopus (Silurana)
tropicalis]
Length = 1043
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 43/88 (48%), Gaps = 19/88 (21%)
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV-- 652
P ILLCD C+ +H CL+ M +P G+WFC C C +++ LQNL V
Sbjct: 834 PELILLCDSCDSGYHTACLRPPLML----IPDGEWFCPPCQHKLLCEKLDEQLQNLDVVL 889
Query: 653 ---QEAEKLPEFHLNAIKKYAGNSLETV 677
+ AE+ E + Y G SLE +
Sbjct: 890 KKKERAERRKERLV-----YVGISLENI 912
>gi|224120882|ref|XP_002318442.1| SET domain protein [Populus trichocarpa]
gi|222859115|gb|EEE96662.1| SET domain protein [Populus trichocarpa]
Length = 319
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 27/197 (13%)
Query: 591 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 650
C+ SG P +LLCD+C++ FH+ CL+ +A +PKG WF C CS+
Sbjct: 7 CEKCGSGESPGELLLCDKCDKGFHLFCLRPILVA----VPKGSWF-CPSCSKQKMPNSFP 61
Query: 651 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 710
LVQ K+ +F I++ + + DI + + S + + R LL
Sbjct: 62 LVQ--TKIIDFF--RIQRSTESIQKLSQDIQKKRKRSSSLVVSKKRRKLLP--------- 108
Query: 711 FDPIVDSISGRDLIPSMVYGRNLRGQEFG-------GMYCAILTVNSSVVSAGILRVFGQ 763
F P D + + S+ G EF GM A +VN + G ++V +
Sbjct: 109 FSPSEDPEKRLEQMRSLATALTASGTEFSNELTYRPGM--APRSVNQPALEKGGMQVLSK 166
Query: 764 EVAELPLVATSKINHGK 780
E AE + +N G+
Sbjct: 167 EDAETLNLCKRMMNRGE 183
>gi|219519075|gb|AAI44290.1| ZMYND8 protein [Homo sapiens]
gi|223460518|gb|AAI36609.1| ZMYND8 protein [Homo sapiens]
Length = 1054
Score = 47.0 bits (110), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|86143622|gb|ABC86690.1| RACK7 isoform k [Homo sapiens]
Length = 1206
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|73992170|ref|XP_866862.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Canis
lupus familiaris]
Length = 1166
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|336470944|gb|EGO59105.1| hypothetical protein NEUTE1DRAFT_60169 [Neurospora tetrasperma FGSC
2508]
gi|350292016|gb|EGZ73211.1| hypothetical protein NEUTE2DRAFT_107503 [Neurospora tetrasperma
FGSC 2509]
Length = 962
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G ++ CDGC +A H++C + +P+GDW+C+ C
Sbjct: 406 GNQIVFCDGCDKAVHQKCYGIPRLPKGDWFCREC 439
>gi|297259640|ref|XP_002798153.1| PREDICTED: protein kinase C-binding protein 1-like isoform 5
[Macaca mulatta]
Length = 1160
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|34335266|ref|NP_036540.3| protein kinase C-binding protein 1 isoform b [Homo sapiens]
gi|86143418|gb|ABC86681.1| RACK7 isoform b [Homo sapiens]
gi|119596112|gb|EAW75706.1| protein kinase C binding protein 1, isoform CRA_d [Homo sapiens]
Length = 1160
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|345790161|ref|XP_003433332.1| PREDICTED: protein kinase C-binding protein 1 [Canis lupus
familiaris]
Length = 1137
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296480966|tpg|DAA23081.1| TPA: zinc finger, MYND-type containing 8-like isoform 3 [Bos
taurus]
Length = 1140
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|119596111|gb|EAW75705.1| protein kinase C binding protein 1, isoform CRA_c [Homo sapiens]
Length = 1200
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|6329749|dbj|BAA86439.1| KIAA1125 protein [Homo sapiens]
Length = 1205
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 117 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 152
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 728 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPIHNSEK 785
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 786 RKTEGLVKLTPIDKRKCERLL 806
>gi|403290889|ref|XP_003936539.1| PREDICTED: protein kinase C-binding protein 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 1163
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|345790157|ref|XP_866912.2| PREDICTED: protein kinase C-binding protein 1 isoform 8 [Canis
lupus familiaris]
Length = 1184
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 829 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 886
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 887 RKTEGLVKLTPIDKRKCERLL 907
>gi|219112141|ref|XP_002177822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410707|gb|EEC50636.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 462
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 501 ADGGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 538
DGG+LL CDGC +H +C SL+ IP+G W C C N
Sbjct: 353 GDGGSLLICDGCEGEYHMDCVQPSLAEIPEGHWECDDCVN 392
>gi|73992186|ref|XP_866949.1| PREDICTED: protein kinase C-binding protein 1 isoform 10 [Canis
lupus familiaris]
Length = 1209
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 794 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 851
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 852 RKTEGLVKLTPIDKRKCERLL 872
>gi|384494855|gb|EIE85346.1| hypothetical protein RO3G_10056 [Rhizopus delemar RA 99-880]
Length = 1060
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 502 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 536
D NLL CDGC R +H C + LSS+P+ DWYC C
Sbjct: 233 DEENLLLCDGCNRGYHLYCLTPPLSSVPKTDWYCLQC 269
>gi|297259632|ref|XP_002798149.1| PREDICTED: protein kinase C-binding protein 1-like isoform 1
[Macaca mulatta]
Length = 1241
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|403290893|ref|XP_003936541.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 1209
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|397511405|ref|XP_003826066.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Pan
paniscus]
Length = 1186
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|348563925|ref|XP_003467757.1| PREDICTED: protein kinase C-binding protein 1-like isoform 2 [Cavia
porcellus]
Length = 1137
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|301783259|ref|XP_002927043.1| PREDICTED: protein kinase C-binding protein 1-like isoform 1
[Ailuropoda melanoleuca]
Length = 1140
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|86143426|gb|ABC86685.1| RACK7 isoform f [Homo sapiens]
Length = 1181
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|338719339|ref|XP_003363990.1| PREDICTED: LOW QUALITY PROTEIN: protein kinase C-binding protein
1-like [Equus caballus]
Length = 1186
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 502 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 555
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 902 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 959
Query: 556 GRVSGVDSVEQITKR-CIRIV 575
+ G+ + I KR C R++
Sbjct: 960 KKTEGLVKLTPIDKRKCERLL 980
>gi|221040998|dbj|BAH12176.1| unnamed protein product [Homo sapiens]
Length = 1241
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|426392023|ref|XP_004062361.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Gorilla
gorilla gorilla]
Length = 1241
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|45946211|gb|AAH30721.2| ZMYND8 protein [Homo sapiens]
Length = 1094
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 24 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 59
>gi|301783261|ref|XP_002927044.1| PREDICTED: protein kinase C-binding protein 1-like isoform 2
[Ailuropoda melanoleuca]
Length = 1165
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|296200638|ref|XP_002747664.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Callithrix
jacchus]
Length = 1243
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|119596116|gb|EAW75710.1| protein kinase C binding protein 1, isoform CRA_g [Homo sapiens]
Length = 1187
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 117 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 152
>gi|115696716|ref|XP_783470.2| PREDICTED: uncharacterized protein LOC578189 isoform 2
[Strongylocentrotus purpuratus]
gi|390342402|ref|XP_003725656.1| PREDICTED: uncharacterized protein LOC578189 isoform 1
[Strongylocentrotus purpuratus]
Length = 1640
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1025 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1057
>gi|44917000|dbj|BAD12142.1| unichrom [Hemicentrotus pulcherrimus]
Length = 1637
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 600 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 636
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1022 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1054
>gi|403290901|ref|XP_003936545.1| PREDICTED: protein kinase C-binding protein 1 isoform 8 [Saimiri
boliviensis boliviensis]
Length = 1244
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|355784415|gb|EHH65266.1| hypothetical protein EGM_02000, partial [Macaca fascicularis]
Length = 1231
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 114 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 149
>gi|296480965|tpg|DAA23080.1| TPA: zinc finger, MYND-type containing 8-like isoform 2 [Bos
taurus]
Length = 1165
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|119596119|gb|EAW75713.1| protein kinase C binding protein 1, isoform CRA_j [Homo sapiens]
Length = 1100
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 30 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 65
>gi|56203004|emb|CAI23168.1| protein kinase C binding protein 1 [Homo sapiens]
Length = 1115
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 108
>gi|86143422|gb|ABC86683.1| RACK7 isoform d [Homo sapiens]
Length = 1163
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|403290891|ref|XP_003936540.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Saimiri
boliviensis boliviensis]
Length = 1237
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|332858652|ref|XP_001164593.2| PREDICTED: protein kinase C-binding protein 1 isoform 22 [Pan
troglodytes]
Length = 1241
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|171690472|ref|XP_001910161.1| hypothetical protein [Podospora anserina S mat+]
gi|170945184|emb|CAP71295.1| unnamed protein product [Podospora anserina S mat+]
Length = 810
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 503 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 536
G +L CD C A H++C ++ IP+GDW+CK C
Sbjct: 288 GNQILFCDSCDMAVHQKCYGVARIPKGDWFCKDC 321
>gi|300794091|ref|NP_001178100.1| protein kinase C-binding protein 1 [Bos taurus]
Length = 1193
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|25453223|sp|Q9ULU4.2|PKCB1_HUMAN RecName: Full=Protein kinase C-binding protein 1; AltName:
Full=Cutaneous T-cell lymphoma-associated antigen
se14-3; Short=CTCL-associated antigen se14-3; AltName:
Full=Rack7; AltName: Full=Zinc finger MYND
domain-containing protein 8
gi|56203005|emb|CAI23169.1| protein kinase C binding protein 1 [Homo sapiens]
gi|119596110|gb|EAW75704.1| protein kinase C binding protein 1, isoform CRA_b [Homo sapiens]
gi|168269692|dbj|BAG09973.1| protein kinase C-binding protein 1 [synthetic construct]
Length = 1186
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|403290895|ref|XP_003936542.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 1109
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|397511401|ref|XP_003826064.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Pan
paniscus]
Length = 1241
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|384946170|gb|AFI36690.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1186
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|384939242|gb|AFI33226.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1188
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|297259636|ref|XP_002798151.1| PREDICTED: protein kinase C-binding protein 1-like isoform 3
[Macaca mulatta]
Length = 1188
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|291409975|ref|XP_002721267.1| PREDICTED: zinc finger, MYND-type containing 8 [Oryctolagus
cuniculus]
Length = 1137
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 94 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 129
>gi|426392017|ref|XP_004062358.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Gorilla
gorilla gorilla]
Length = 1188
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|68137404|gb|AAY85631.1| transcriptional repressor BSR/RACK7/PRKCBP1 isoform o [Homo
sapiens]
Length = 1107
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 10/97 (10%)
Query: 554 EAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQC 609
EA VS + + C+ + + + + G CL C C+ +G +LLCD+C
Sbjct: 454 EASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNGGDEEKLLLCDEC 513
Query: 610 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRIN 644
+ +HV C+K L +P G W C C C R N
Sbjct: 514 DVSYHVYCMK----PPLESVPSGPWRCHWCSRCRRCN 546
>gi|410953588|ref|XP_003983452.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Felis
catus]
Length = 1129
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|395829157|ref|XP_003787727.1| PREDICTED: protein kinase C-binding protein 1 isoform 2 [Otolemur
garnettii]
Length = 1053
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|355563056|gb|EHH19618.1| hypothetical protein EGK_02318, partial [Macaca mulatta]
Length = 1231
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 114 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 149
>gi|348563927|ref|XP_003467758.1| PREDICTED: protein kinase C-binding protein 1-like isoform 3 [Cavia
porcellus]
Length = 1162
Score = 46.6 bits (109), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|34335262|ref|NP_898868.1| protein kinase C-binding protein 1 isoform a [Homo sapiens]
gi|86143160|gb|ABC86680.1| RACK7 isoform a [Homo sapiens]
gi|119596118|gb|EAW75712.1| protein kinase C binding protein 1, isoform CRA_i [Homo sapiens]
Length = 1188
Score = 46.6 bits (109), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 504 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 539
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,129,479,907
Number of Sequences: 23463169
Number of extensions: 547584567
Number of successful extensions: 1368858
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1054
Number of HSP's successfully gapped in prelim test: 3317
Number of HSP's that attempted gapping in prelim test: 1352519
Number of HSP's gapped (non-prelim): 14701
length of query: 873
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 721
effective length of database: 8,792,793,679
effective search space: 6339604242559
effective search space used: 6339604242559
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)