BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003113
(846 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/916 (49%), Positives = 577/916 (62%), Gaps = 132/916 (14%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P ++
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEP-----TTSK 387
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S S + R ++K V+K+ SKS ++ +RSP
Sbjct: 388 SSGSASVSISSRYKRK--------WVTKAKSKSSEYTSI--SRSP--------------R 423
Query: 456 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
S P+ K+ + + RK KS K L+ K ++AS S K++W IT KDQRLHKL
Sbjct: 424 SAPMRIPSKNKSALKMRK---KSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKL 480
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 565
VF+E GLPDGTEV Y+A GQKLL+GYK G GI+C CCN VSPSQFE HA
Sbjct: 481 VFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYA 540
Query: 566 ------------------------------------DGGNLLPCDGCPRAFHKECASLSS 589
DGGNLL CDGCPRAFHKECASLSS
Sbjct: 541 YIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSS 600
Query: 590 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 649
P+GDWYCK+CQNMF+R++F++H+ NAV AGRV GVD +EQITKRCIRIV+N+E +LSGC
Sbjct: 601 TPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGC 660
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 709
+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGCLK HKMA L+ELP+GKWFC + C+RI+S
Sbjct: 661 VLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHS 720
Query: 710 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSGKAATPETRLLLSQAV 768
LQ LL++ EKLP L A+ + G + + D+DV WRL+SGK A+PETRLLLS+A+
Sbjct: 721 ALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPETRLLLSEAI 780
Query: 769 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 828
AIFHD FDPIVD SGRDLIP+MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ+
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQD 840
Query: 829 VAELPLVATSKINHGK 844
+AELPLVATS NHGK
Sbjct: 841 IAELPLVATSNGNHGK 856
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 770 bits (1987), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/925 (49%), Positives = 578/925 (62%), Gaps = 141/925 (15%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P ++
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEP-----TTSK 387
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S S + R ++K V+K+ SKS ++ +RSP
Sbjct: 388 SSGSASVSISSRYKRK--------WVTKAKSKSSEYTSI--SRSP--------------R 423
Query: 456 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
S P+ K+ + + RK KS K L+ K ++AS S K++W IT KDQRLHKL
Sbjct: 424 SAPMRIPSKNKSALKMRK---KSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKL 480
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 565
VF+E GLPDGTEV Y+A GQKLL+GYK G GI+C CCN VSPSQFE HA
Sbjct: 481 VFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYA 540
Query: 566 ------------------------------------DGGNLLPCDGCPRAFHKECASLSS 589
DGGNLL CDGCPRAFHKECASLSS
Sbjct: 541 YIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSS 600
Query: 590 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 649
IP+GDWYCK+CQNMF+R++F++H+ NAV AGRV GVD +EQITKRCIRIV+N+E +LSGC
Sbjct: 601 IPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGC 660
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 709
+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGCLK HKMA L+ELP+GKWFC + C+RI+S
Sbjct: 661 VLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHS 720
Query: 710 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSGKAATPETRLLLSQAV 768
LQ LL++ EKLP L A+ + G + + D+DV WRL+SGK A+PETRLLLS+A+
Sbjct: 721 ALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPETRLLLSEAI 780
Query: 769 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ- 827
AIFHD FDPIVD SGRDLIP+MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQY 840
Query: 828 --------EVAELPLVATSKINHGK 844
++AELPLVATS NHGK
Sbjct: 841 CRAAIGCXDIAELPLVATSNGNHGK 865
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/681 (56%), Positives = 467/681 (68%), Gaps = 81/681 (11%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
++F C RCKG FP CVGK GP LCN C +SK+ T+T G ++SR +
Sbjct: 121 RTFTCKRCKGIFPSICVGKIGP--LCNLCAESKESHPTLTI---------GSSIISRYCQ 169
Query: 395 N--DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGL 452
N + IS+ N + + LL + + L+ S L
Sbjct: 170 NLPSLILISWIINLKTITSGQFLL------------------MLAHCSFRLSFLSPEQVL 211
Query: 453 IANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN-ITPKDQR 511
S+ SQ RK +K ++ LI+KP + AS LS P K ++ I+P+DQR
Sbjct: 212 ALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVASVHLS-PRKRKYKKISPRDQR 270
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEV-SPSQFEAHA----- 565
LH+LVF+E GLPDGTE+ YYA GQKLL GYK G GI+CHCCN EV SPS FEAHA
Sbjct: 271 LHRLVFEEGGLPDGTELAYYARGQKLLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATR 330
Query: 566 -----------------------------------------DGGNLLPCDGCPRAFHKEC 584
DGG+LL CDGCPRAFHK C
Sbjct: 331 KKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCIICADGGDLLLCDGCPRAFHKGC 390
Query: 585 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 644
ASLS++P GDWYC++CQN F+R++F++H+ANA AGRVS +DS+EQITKRC RIVKN+EA
Sbjct: 391 ASLSTVPSGDWYCQHCQNTFQREKFVEHNANAFAAGRVSEIDSIEQITKRCFRIVKNVEA 450
Query: 645 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
EL+GC LCRG DF +SGFGPRTI+LCDQCE+EFHVGCL+ HKMA+L+ELPKG WFCCMDC
Sbjct: 451 ELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELPKGNWFCCMDC 510
Query: 705 SRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLL 763
SRI+S LQ LL++ AEKLP+ LN I KK+ L + IDVRW LLSGK A+PE +LL
Sbjct: 511 SRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISNSIDVRWTLLSGKIASPENKLL 570
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILR 823
LS+A++IF +CFDPIVDS GRDLIP MVYG+N +GQ++GGMYCA+L VNS +VSAGILR
Sbjct: 571 LSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLIVNSCIVSAGILR 630
Query: 824 VFGQEVAELPLVATSKINHGK 844
VFG+EVAELPLVAT +HGK
Sbjct: 631 VFGEEVAELPLVATRNGDHGK 651
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 54/118 (45%), Gaps = 11/118 (9%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQA 252
E + IA+ S + S + + KKIS + + E GL DG + Y G K
Sbjct: 239 EPDLIAKPSKVASVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKL-- 296
Query: 253 SGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF--ENGKSLLEV 308
L G R GILC C CN C V+ PS FE HA R+ Y C NG SL ++
Sbjct: 297 --LGGYKRGFGILCHC--CN-CEVVSPSTFEAHAGWATRK-KPYACIYTSNGVSLHDL 348
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/847 (49%), Positives = 515/847 (60%), Gaps = 199/847 (23%)
Query: 45 KVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILES 104
KV+ VNG+IVY+R KRS S NG E
Sbjct: 21 KVSVVNGYIVYTRAKRSLDSC---------------------------------NGFSEH 47
Query: 105 VVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLKR 164
++N V++ EN EC K +EV K+ R
Sbjct: 48 AELKDNAEVEVKTENG---------------------ECEKLKNESTEVVARTRKR--SR 84
Query: 165 PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
SA++ KVE + +V +E+ ++ AL +P+ +ELKMSKKI +N+K
Sbjct: 85 RSALEAKVECCDQMVV------SETEQVVANGGSGINGALGAPRNKMELKMSKKIVVNRK 138
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
PMTV +LF+TG LDGVSVVYMGGIK +ASGLRG+IRDGGILCSC LCNG RVIPPS+FEI
Sbjct: 139 PMTVKKLFDTGFLDGVSVVYMGGIK-KASGLRGVIRDGGILCSCCLCNGRRVIPPSQFEI 197
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HACKQYRRA+QYIC ENGKSLL++LRACR L L+ T+Q+ + S EE+ F C RCKG
Sbjct: 198 HACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYFTCKRCKG 257
Query: 345 TFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQN 404
FP + V + GP +C SCV+ SR SE S N
Sbjct: 258 CFPSSFVERVGP--ICRSCVE------------------------SRKSEESS------N 285
Query: 405 NKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHK 464
N +V K + RSP RP +++N + + +
Sbjct: 286 N--------------VVGK------------RVRSP--------RPVVLSNPSSTSELSV 311
Query: 465 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 524
SSQ +R RK K+T L+ F + S L DQRLHKLVF+E+GLPD
Sbjct: 312 SSQVKRHRK-----KRTKLV---FISISSVL-------------DQRLHKLVFEENGLPD 350
Query: 525 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------- 565
GTEV YYA GQKLLEG+K G GI+C CCN+E+SPSQFE HA
Sbjct: 351 GTEVAYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVS 410
Query: 566 ---------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 598
DGGNLL CDGCPRAFHKECA+LSSIP+GDWYC+
Sbjct: 411 LHELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSIPRGDWYCQ 470
Query: 599 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 658
+CQNMF+R++F+ H+ANAV AGRV GVD +EQI RCIRIVK++EA+LS C LCRG DFS
Sbjct: 471 FCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDFS 530
Query: 659 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 718
+SGFGPRTI+LCDQCE+E+HVGCL+ HKMA L+ELP+G W CC DC+RI+S L+NLLV+
Sbjct: 531 RSGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVKG 590
Query: 719 AEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 777
AE+LPE L I KK LE + IDVRWRLL+GK A+PETR LL +AV+IFH+CF+P
Sbjct: 591 AERLPESLLGVIKKKQEEKGLEPI--IDVRWRLLNGKIASPETRPLLLEAVSIFHECFNP 648
Query: 778 IVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
IVD+ SGRDLIP+MVYGRN+RGQEFGGMYCA+L VNSSVVSAG+LR+FG +VAELPLVAT
Sbjct: 649 IVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVAT 708
Query: 838 SKINHGK 844
S NHGK
Sbjct: 709 SNGNHGK 715
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/669 (52%), Positives = 430/669 (64%), Gaps = 94/669 (14%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L PMTV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
++F C RCK + + P +S SS+
Sbjct: 121 RTFTCKRCKEQV--------------------------------LALEYFKPASLSTSSQ 148
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+++ RKK RK E D ++K S KS S+ + R +++ P L+
Sbjct: 149 DNT----------PRKKKRKPEEQDSITKPS-KSASVYLSSRKRKYKKIS-----PRLVC 192
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
P+ + K V I K + SP F +DQRLH+
Sbjct: 193 FFYPIDILFGLVMLS-PFPFLWLVKIFVFIRK-YAYLSPFCPFSGYQS-----QDQRLHR 245
Query: 515 LVFDESGLPDGTEVGYYACGQ-------------------------KLLEGYK---NGLG 546
LVF+E GLPDGTE+ YYA GQ KLL GY G
Sbjct: 246 LVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQKLLGGYAYIYTSNG 305
Query: 547 IICHCCNSEVSPSQFEAH----------ADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 596
+ H +S S+ + ADGGNLL CDGCPRAFHK CAS+ ++P GDWY
Sbjct: 306 VSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPRAFHKGCASIPTVPSGDWY 365
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C+YCQN FER++ ++H+ANA AGR SG+DS+EQITKRC RIVKN+EAEL+GC LCRG D
Sbjct: 366 CQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKRCFRIVKNIEAELTGCALCRGYD 425
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 716
F +SGFGPRTI+LCDQCE+EFHVGCL+ HKM +L+ELPKG WFCCMDCSRI+S LQ LL+
Sbjct: 426 FMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKGNWFCCMDCSRIHSTLQKLLI 485
Query: 717 QEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 775
+ AEKLP+ LN I KK+ L ++IDVRW LLSGK A+PE +LLLS+A++IF +CF
Sbjct: 486 RGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSGKIASPENKLLLSRALSIFQECF 545
Query: 776 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 835
DPIVDS GRDLIP MVYG+N +GQ++GGMYCA+LT+NSS+VSAGILRVFG+EVAELPLV
Sbjct: 546 DPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTINSSIVSAGILRVFGEEVAELPLV 605
Query: 836 ATSKINHGK 844
AT HGK
Sbjct: 606 ATRNGEHGK 614
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/680 (53%), Positives = 456/680 (67%), Gaps = 85/680 (12%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
C RCKG FP +C+ + GP +C+SCV+S K + + RIR+ P LVS+SS
Sbjct: 118 TCKRCKGRFPSSCMERVGP--ICSSCVESSKSEESSKNVVSKRIRSPRPVLVSKSSCASE 175
Query: 398 MCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANST 457
M IS + +R RK+ + + + S S SV + K + S + +NS
Sbjct: 176 MSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILPRRKVTPKTKKKSLSVKLKTTSNSN 235
Query: 458 PVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVF 517
+ S Q + + KITKK L+ P SFP T D RLHKLVF
Sbjct: 236 CL-----SPQIKSEWKITKK-----LV---------PYSFP-------TCGDNRLHKLVF 269
Query: 518 DESGLPDGTEVGYYACGQ------KLLEGYKNGLGIICHCCNSEVSPSQFEAHA------ 565
+E+GLPDG+E+ YYA GQ KLLEG+K G GI+C CCN+E+SPSQFE HA
Sbjct: 270 EENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRK 329
Query: 566 ----------------------------------------DGGNLLPCDGCPRAFHKECA 585
DGGNLL CDGCPRAFHKECA
Sbjct: 330 KPYAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGCPRAFHKECA 389
Query: 586 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 645
SLSSIP+GDWYC++CQNMF+R++F+ ++ NA AGRV GVD +EQITKRCIRIVK+++AE
Sbjct: 390 SLSSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAE 449
Query: 646 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 705
LS C LCRG DFSKSGFGPRTI+LCDQCE+E+HVGCL+ HKM L+ELPKG W CC DC+
Sbjct: 450 LSACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCT 509
Query: 706 RINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 764
RI+S L+N+LV+ AE+LP+ L I KK L+ ++DI+VRWRLLSGK A+PETR LL
Sbjct: 510 RIHSTLENVLVRGAERLPKSLLAVIKKKQEEKGLDPINDINVRWRLLSGKKASPETRPLL 569
Query: 765 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
+AV+IFH+CFDPIVD++SGRDLI +MVYG+++RGQEFGGMYCA+L VNSSVVSAG+LR+
Sbjct: 570 LEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLIVNSSVVSAGMLRI 629
Query: 825 FGQEVAELPLVATSKINHGK 844
FG ++AELPLVATS HGK
Sbjct: 630 FGTDIAELPLVATSNSQHGK 649
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/891 (42%), Positives = 525/891 (58%), Gaps = 131/891 (14%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDD-------LLEDDVIDKRINSK----IHEGRI- 88
CKR K T+VNGFIVY+R ++++F+ + LLE+ + + SK + G I
Sbjct: 42 CKRIKTTQVNGFIVYTRTRKTKFTKLHEQGDENAGLLENRMSNHLEESKPTIGVTNGSIG 101
Query: 89 ------NKVVKNVLNENGILESVVEEE----------------NQLVQMTVENVIEETVK 126
N +KN E+ + VEE + LV + ++++ +
Sbjct: 102 ETNVSGNSCIKNTFVESPAGKIAVEERLVTGSLAESPAVETDSSSLVDVVIDDINFVELL 161
Query: 127 GKKAPICKEEPISKVEC-FPRKEGGSEV-----SNGLNKKCLKRPSAMKPKVEPVEVLVT 180
++ P+ E +S+ F K G++V S +++K KR + K + + ++
Sbjct: 162 HEEIPV---EILSEGSLDFEVKRLGTKVRTMGKSYSVSEK--KRHGSFKRTAQIYKSILR 216
Query: 181 QSEGFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKKPMTVTELFETGL 236
+ ++ VE ++E L ++ L K I + ++P TV ELFETG+
Sbjct: 217 MKK---VNNLVPENVEVLSEPDFGREGLDEQSHSVSLA-DKSILIRRRPETVRELFETGI 272
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDG+SVVYMG +K QA GLRGII+DGGILCSCS C+ VI SKFEIHACKQYRRASQY
Sbjct: 273 LDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHVISTSKFEIHACKQYRRASQY 332
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGP 356
ICFENGKSLL+VL R+ PL L+AT+ A+ +EK F C RCKG FP + +G G
Sbjct: 333 ICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKCFTCKRCKGAFPFSSLGHRGF 392
Query: 357 GPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
LC SC + + Q + P + S+ + + CI+ R +
Sbjct: 393 --LCMSCSEVETSQAS-------------PAAMWTSTSSPA-CIASPVKSRLK------- 429
Query: 417 EADLVSKSSSKSVSLRNLLKT---RSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRK 473
+++ S+S+S+ + + S +TR + R L+ + S + SSQ++ + K
Sbjct: 430 ----ITRKPSESMSISPVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSK 485
Query: 474 ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYAC 533
K + + K ++ S +S +S + +T KDQ LHKLVF+ GLP+GTE+GYYA
Sbjct: 486 FKKMLTQYSVTPKAVKSVSLSVSSKKRS-YRLTRKDQGLHKLVFERGGLPEGTELGYYAR 544
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------------------------- 565
GQKLL GYK G GI C+CC SEVSPS FEAHA
Sbjct: 545 GQKLLGGYKMGAGIYCYCCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFS 604
Query: 566 ------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
DGGNLL CD CPRAFH EC SL SIP+G+W+CKYC+N F +
Sbjct: 605 QGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSE 664
Query: 608 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 667
+++ N+ G++ GVD V+Q RCIR+VKN+EAE +GC+LC G DF +SGFGPRTI
Sbjct: 665 IAGEYNVNSSAVGQLEGVDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTI 724
Query: 668 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 727
++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AE L + L
Sbjct: 725 IICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSL 784
Query: 728 NAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 786
I +K + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SGR+
Sbjct: 785 GIIQRKQERTDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGRN 844
Query: 787 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT
Sbjct: 845 LIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVAT 895
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 335/668 (50%), Positives = 430/668 (64%), Gaps = 72/668 (10%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C RCKG FP + +G G LC SC + + Q ++ T R+S +
Sbjct: 373 FTCKRCKGPFPFSSLGHRGF--LCKSCSEVETSQASLAAT--------------RTSTSA 416
Query: 397 SMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANS 456
CI+ R K TRK E+ +S S+ S ++TR + R L+ +
Sbjct: 417 PACITSPVKSR-LKITRKPSESTSISPVFMSSLG-------NSTRKITRKALRQALVGKA 468
Query: 457 TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 516
S + SSQ + + K K + + K ++ S +S +S + + KDQ LHKLV
Sbjct: 469 YLSASTNVSSQKKCRSKFKKMLTQHSVTPKALKSVSLSVSSKKRS-YRLARKDQGLHKLV 527
Query: 517 FDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------- 565
FD GLP+GTE+GYYA GQKLL GYK G GI C+CC EVSPS FEAHA
Sbjct: 528 FDRGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKCEVSPSLFEAHAGWASRRKPYFY 587
Query: 566 -----------------------------------DGGNLLPCDGCPRAFHKECASLSSI 590
DGGNLL CD CPRAFH EC SL SI
Sbjct: 588 IYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSI 647
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G+W+CKYC+N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +GC+
Sbjct: 648 PRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCV 707
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 710
LC G DF +SGFGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS
Sbjct: 708 LCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINST 767
Query: 711 LQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 769
LQ LL+ AEKL + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+A
Sbjct: 768 LQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALA 827
Query: 770 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 829
IFHDCFDPIVD +SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EV
Sbjct: 828 IFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREV 887
Query: 830 AELPLVAT 837
AELPLVAT
Sbjct: 888 AELPLVAT 895
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 556 bits (1433), Expect = e-155, Method: Compositional matrix adjust.
Identities = 268/440 (60%), Positives = 321/440 (72%), Gaps = 62/440 (14%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + N+S NKS IT KDQR
Sbjct: 1903 NNSPATSIHRTSERARLLKPIPVTKSSGSALY------NSSE-----NKSLGKITKKDQR 1951
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------ 565
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHA
Sbjct: 1952 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 2011
Query: 566 ----------------------------------------DGGNLLPCDGCPRAFHKECA 585
DGGNLL CDGCPRAFH+ CA
Sbjct: 2012 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 2071
Query: 586 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 645
SL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 2072 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 2130
Query: 646 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 705
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 2131 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 2190
Query: 706 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 764
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 2191 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 2250
Query: 765 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 2251 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 2310
Query: 825 FGQEVAELPLVATSKINHGK 844
FGQEVAELPLVATS N G+
Sbjct: 2311 FGQEVAELPLVATSVDNQGR 2330
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 141/380 (37%), Positives = 190/380 (50%), Gaps = 63/380 (16%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1589 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1628
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKE--- 148
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+
Sbjct: 1629 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1684
Query: 149 GGS-------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
GS E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1685 DGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1743
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1744 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1800
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1801 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1860
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1861 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1909
Query: 374 TYTTGIRIRASGPKLVSRSS 393
+ T R R P V++SS
Sbjct: 1910 IHRTSERARLLKPIPVTKSS 1929
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 267/440 (60%), Positives = 319/440 (72%), Gaps = 62/440 (14%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + S NKS IT KDQR
Sbjct: 371 NNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSE-----------NKSLGKITKKDQR 419
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------ 565
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHA
Sbjct: 420 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 479
Query: 566 ----------------------------------------DGGNLLPCDGCPRAFHKECA 585
DGGNLL CDGCPRAFH+ CA
Sbjct: 480 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 539
Query: 586 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 645
SL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 540 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 598
Query: 646 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 705
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 599 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 658
Query: 706 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 764
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 659 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 718
Query: 765 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 719 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 778
Query: 825 FGQEVAELPLVATSKINHGK 844
FGQEVAELPLVATS N G+
Sbjct: 779 FGQEVAELPLVATSVDNQGR 798
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 134/368 (36%), Positives = 186/368 (50%), Gaps = 61/368 (16%)
Query: 48 KVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVE 107
+ +G I YSR KR + LE+ D+R + + E ES +
Sbjct: 69 RWDGVIQYSRNKRLK------RLEESKNDER--------------RTIAEEPKDDESTTD 108
Query: 108 EENQLVQMTVEN---VIEETVKGK-KAPICKEEPIS----------------KVECFPRK 147
EE Q T EN V+E+ G PIC+EEP S K++
Sbjct: 109 EE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESNDGSLKLQTAGLI 164
Query: 148 EGGSEVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALT 205
+ E+ + +K KR SA+K K + VE L + F N ++ + +LT
Sbjct: 165 DESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGVDEKTNGAVRSLT 223
Query: 206 SPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGIL 265
SPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G + + L+G I+ GIL
Sbjct: 224 SPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDG--RKKGYRLQGTIKGNGIL 280
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
CSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+ PL L+AT+Q
Sbjct: 281 CSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATIQ 340
Query: 326 SALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASG 385
SA+ S P ++S P K PL NSC+K T + T R R
Sbjct: 341 SAIGSFPVKRS---------LPADEAAKM--DPLGNSCIKRNNSPATSIHRTSERARLLK 389
Query: 386 PKLVSRSS 393
P V++SS
Sbjct: 390 PIPVTKSS 397
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 306/661 (46%), Positives = 393/661 (59%), Gaps = 107/661 (16%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C RCK F CV + R SE+
Sbjct: 373 FTCKRCKEGF----------------CVSHAR----------------------RPSEST 394
Query: 397 SMCISYQNN--KRERKKTRKLLEADLVSKSS-SKSVSLRNLLKTRSPWELTRNSSRPGLI 453
S+ + ++ RK TRK L LV K+ S S ++ + K RS ++
Sbjct: 395 SISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFK----------- 443
Query: 454 ANSTPVTSVHKSSQSQRQRKITKKSKKTV-LISKPFENASPPLSFPNKSRWNITPKDQRL 512
V +H + + KS + L+S P P KDQ L
Sbjct: 444 --KMLVFLLHNDVLMLAEPTLMIKSLILLHLVSLYVLKVDPTFCDPQGFE-----KDQGL 496
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLL-------EGYKNGL------------------GI 547
HKLVFD GLP+GTE+GYYA GQ + E K L G+
Sbjct: 497 HKLVFDRGGLPEGTELGYYARGQTYITVDRNCSEATKWALEYIVTVASASYFYIYTSNGV 556
Query: 548 ICHCCNSEVSPS-QFEAH---------ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 597
H + S ++ A+ ADGGNLL CD CPRAFH EC SL SIP+G+W+C
Sbjct: 557 SLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHC 616
Query: 598 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 657
KYC+N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +G
Sbjct: 617 KYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNG--------- 667
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 717
SGFGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+
Sbjct: 668 --SGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLG 725
Query: 718 EAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 776
AEKL + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFD
Sbjct: 726 GAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFD 785
Query: 777 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 836
PIVD +SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVA
Sbjct: 786 PIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVA 845
Query: 837 T 837
T
Sbjct: 846 T 846
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 256/421 (60%), Positives = 313/421 (74%), Gaps = 48/421 (11%)
Query: 471 QRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY 530
+ I +S K +SK +A +S NK +W IT KDQRLHKLVF++ GLPDGTEV Y
Sbjct: 349 ETNIKSRSSKPATVSKSLNSALEGVSSENKCQWKITTKDQRLHKLVFEDGGLPDGTEVAY 408
Query: 531 YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------------- 565
YA GQKLL GYK G GI+C CCN EVSPS FEAHA
Sbjct: 409 YARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVSLHELAI 468
Query: 566 ---------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 604
DGG+L+ CDGCPRAFHK CASLSSIP+G W+C++CQNMF
Sbjct: 469 SLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQFCQNMF 528
Query: 605 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 664
+R++F++H+ANAV AGR+SGVD +EQIT+RCIRIVKN+EAEL+GC+LCRG DFS+SGFGP
Sbjct: 529 QREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYDFSRSGFGP 588
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 724
RTI+LCDQC +EFHVGCL+ HK+A+L+ELPKGKWFCC DC RI+S L+ LL +EAE +P
Sbjct: 589 RTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLAREAEIIPN 648
Query: 725 FHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSIS 783
L + KK LETV++IDVRW+LL+GK+A+PET+LLLSQA+AIF +CFDPIVD+ +
Sbjct: 649 KLLEVVMKKNEEKGLETVNNIDVRWKLLTGKSASPETKLLLSQALAIFQECFDPIVDT-T 707
Query: 784 GRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
GRDLIP MVYG+N +GQ++GGMYCA+L VNS VVSA I+R+FGQEVAELPLVATS NHG
Sbjct: 708 GRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLVATSNGNHG 767
Query: 844 K 844
K
Sbjct: 768 K 768
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 200/419 (47%), Positives = 253/419 (60%), Gaps = 57/419 (13%)
Query: 1 MKRELDYELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIVYSRVKR 60
MKRE+ G E+ +Q QA D + N CKRFKV VNGF VYSR+++
Sbjct: 1 MKREVGAFDGGIQFESESQ-------QAQD----SNNNNNCKRFKV--VNGFFVYSRLRK 47
Query: 61 SRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQMTVENV 120
++ S+ + H+ + K + I+++V E E V++V
Sbjct: 48 NKPSSRE---------------CHDDKDRKCQQ-------IIQTVSEVETVNKDPQVKDV 85
Query: 121 IEE-TVKGKKAPICKEEPISK-VECFPRKEGGSE-------------VSNGLNKKCLKRP 165
E ++ + PICK E S+ EGG+E SN L + L R
Sbjct: 86 SRELSLCNVQLPICKIESFSEPSRVILANEGGTEDTERKLAHVGTEGKSNKLRQ--LTR- 142
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S KVEPVEV V E +E +S ++VE IAEGSALT PKKNLELKMSKKI+L+ P
Sbjct: 143 SNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKIALDNIP 202
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
MTV ELFETGLL+GV VVYMGG K A LRG I+D GILC CS C GCRVIPPS+FEIH
Sbjct: 203 MTVKELFETGLLEGVPVVYMGGKK--AFCLRGTIKDVGILCYCSFCKGCRVIPPSQFEIH 260
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
A KQYRRA+QYICFENGKSLL+VL ACR+ PL L+AT+QSA+S LP+EK+F C RCKGT
Sbjct: 261 AIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKTFTCKRCKGT 320
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQN 404
+P VGK GPLC+SCV+SK+ G+ T I+ R+S P VS+S + +S +N
Sbjct: 321 YPTILVGKV--GPLCSSCVESKESNGSPACETNIKSRSSKPATVSKSLNSALEGVSSEN 377
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 245/432 (56%), Positives = 310/432 (71%), Gaps = 54/432 (12%)
Query: 460 TSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDE 519
+S+H + R K++ K K + SK LS NKS+W I+ + QRLHKL+F+E
Sbjct: 240 SSIHAVGKRIRSSKLSVKLKTAPITSKC-------LSPQNKSQWRISKRYQRLHKLIFEE 292
Query: 520 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------------- 565
GLP+G EV YYA GQKLLEG K GI+C CCN+E+SPSQFE HA
Sbjct: 293 DGLPNGAEVAYYARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYT 352
Query: 566 --------------------------------DGGNLLPCDGCPRAFHKECASLSSIPQG 593
DGGNLL CDGCPRAFHKECAS+SSIP+G
Sbjct: 353 SNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRG 412
Query: 594 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 653
+WYC+ CQ+ F R+R + H+A+AV AGRV GVD +EQI KRCIRIVK++ AE+ GC+LCR
Sbjct: 413 EWYCQICQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCR 472
Query: 654 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN 713
DFS+SGFGPRTI++CDQCE+E+HVGCL+ HKMA L+ELP+G WFCC DC+RI+S L+N
Sbjct: 473 SSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLEN 532
Query: 714 LLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 772
LL++ AE+LPE L+ I KK G LE +++IDVRW+LL+GK A+PETR LL +AV++FH
Sbjct: 533 LLIRVAERLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGKIASPETRPLLLEAVSMFH 592
Query: 773 DCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 832
+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L VNSSVVSAG++R+FG+++AEL
Sbjct: 593 ECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAEL 652
Query: 833 PLVATSKINHGK 844
PLVAT N GK
Sbjct: 653 PLVATRYKNRGK 664
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/197 (54%), Positives = 140/197 (71%), Gaps = 2/197 (1%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I ++KKP TV ELF+TGLLDGV VVY+G K + LRG I+DGGILCSCSLCNG RV
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
IPPS+FEIHAC Y+RA+QYIC ENGKS+LE++RACR+ PL L+AT+Q+ ++S PEEK
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C C+G FP + V + G LC SCV+S+K + + + G RIR+S + +++
Sbjct: 207 FTCKNCRGCFPSSNVERV--GLLCLSCVESRKSEKSSIHAVGKRIRSSKLSVKLKTAPIT 264
Query: 397 SMCISYQNNKRERKKTR 413
S C+S QN + R R
Sbjct: 265 SKCLSPQNKSQWRISKR 281
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 241/402 (59%), Positives = 282/402 (70%), Gaps = 70/402 (17%)
Query: 490 NASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIIC 549
N SP S S +DQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C
Sbjct: 1968 NNSPATSIHRTSE---RARDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFC 2024
Query: 550 HCCNSEVSPSQFEAHA-------------------------------------------- 565
CC+ EVS SQFEAHA
Sbjct: 2025 WCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSI 2084
Query: 566 --DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
DGGNLL CDGCPRAFH+ CASL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVS
Sbjct: 2085 CGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVS 2144
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 683
GVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGFGPRTI+LCDQ
Sbjct: 2145 GVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQ----------- 2192
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVS 742
ELP GKWFCC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++
Sbjct: 2193 --------ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIA 2244
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 802
D +VRWRLLSGK A+PETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F
Sbjct: 2245 DYNVRWRLLSGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDF 2304
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
G+YCA++TVNS VVSAGILRVFGQEVAELPLVATS N G+
Sbjct: 2305 SGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGR 2346
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 135/369 (36%), Positives = 182/369 (49%), Gaps = 63/369 (17%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1654 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1693
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPIS------------ 139
E ES +EE Q T EN V+E+ G PIC+EEP S
Sbjct: 1694 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1749
Query: 140 ----KVECFPRKEGGSEVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
K++ + E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1750 DGSLKLQTAXLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1808
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1809 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1865
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1866 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1925
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1926 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1974
Query: 374 TYTTGIRIR 382
+ T R R
Sbjct: 1975 IHRTSERAR 1983
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 313/831 (37%), Positives = 447/831 (53%), Gaps = 149/831 (17%)
Query: 98 ENGILESVVEEENQLVQMTV---ENVIEETVKGKKAPICKEEPISKVECFPRKEGG---S 151
E +L+ V+ EE +V T+ E ++ ET+K + E+P+ E + G +
Sbjct: 142 EPKVLDDVINEEEAIVAETLKEQEPIVPETLKEEVVDEMAEQPLCIEESEEKDSNGVALA 201
Query: 152 EVSNGLN-----KKCLKRP--------SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAI 198
V++G KK L+RP SA+K K E E G +S
Sbjct: 202 LVNDGAKGKKSMKKRLERPQSERRFTRSALKVKSEET----NDGEHVGVAGISDGVKRET 257
Query: 199 AEGSAL--TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIK--FQAS- 253
G++L T+P +K S + L K P + +L TG+L+G+ V+YM G+K F
Sbjct: 258 EAGASLVMTTPSS---VKFSNRGKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEK 314
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLEVLR 310
GL+G+I+D G+LC C +C G V+ P+ FE+HA +R +YI +G K+L +V+
Sbjct: 315 GLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMN 374
Query: 311 ACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKK 368
AC PL + +Q L +KS C+ C+G C G + +C+SC+ S
Sbjct: 375 ACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSRL--VCDSCLVSPA 428
Query: 369 PQGTMTYTTGI----RIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKS 424
Q + GI + R+ P ++ +S +N+ S N + K +
Sbjct: 429 -QTAVASNKGISQPVQPRSPEPVVIQKSLDNEVQPNSLHNEVQPNKLDTGM--------- 478
Query: 425 SSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLI 484
+ SL N ++ S NS +P +N H +S+ + Q ++T+K
Sbjct: 479 --QPNSLDNGMEPDS----LNNSMKPKSFSNGMK----HSASRGKSQGRLTRK------- 521
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
D RLHKLVF+ LPDGTE+ YYA GQKLL GYK G
Sbjct: 522 ------------------------DLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKG 557
Query: 545 LGIICHCCNSEVSPSQFEAHA--------------------------------------- 565
GI C CCN +VS SQFEAHA
Sbjct: 558 YGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDN 617
Query: 566 --------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 617
DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++ R QH+ NA+
Sbjct: 618 DDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKDRHGQHEVNAL 677
Query: 618 EA-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 676
A GR++G D +E + KRCIR+V+ LE + GC LC +FSKS FGP+T+++CDQCE+E
Sbjct: 678 AAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSRPNFSKS-FGPQTVIICDQCEKE 736
Query: 677 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK-YAG 735
+HVGCLK H M +L ELP G WFC +CS+I++ L +L+ + + +P+ LN IKK +
Sbjct: 737 YHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDLVASKEKDVPDPLLNLIKKKHEE 796
Query: 736 NSLETVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 793
SL+ + +DV+WR+++ K + + ETR LLS+AVAIFH+ FDPIVDS SGRD IP+M++
Sbjct: 797 KSLDIGAGLDVKWRVINWKLDSDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPAMLF 856
Query: 794 GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G+
Sbjct: 857 GRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQGQ 907
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 493 bits (1270), Expect = e-136, Method: Compositional matrix adjust.
Identities = 284/713 (39%), Positives = 394/713 (55%), Gaps = 128/713 (17%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
E EA AE S L +P + + S+ L K P + +L TG+L+G+ V+YM G K +
Sbjct: 286 ETEASAEASLLMTPPSSAKFSNSR---LKKFPSKLKDLLATGILEGLPVMYMKGAKVLFA 342
Query: 254 G---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLE 307
G L+G+I+D G+LC C +CNG V+ P+ FE+HA +R +YI +G K+L +
Sbjct: 343 GEKGLQGVIQDSGVLCFCKICNGVEVVTPTVFELHAGSANKRPPEYIYIHDGNCGKTLRD 402
Query: 308 VLRACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVK 365
V+ AC PL + +Q L +KS C+ C+G C G + +C+ C+
Sbjct: 403 VMNACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSKL--VCDLCLA 456
Query: 366 SKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSS 425
S PQ M +RK++ + +S
Sbjct: 457 SP-PQTAMA-------------------------------------SRKVISQPVQPRSP 478
Query: 426 SKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLIS 485
V ++L N +P + N P S+ ++ S T +
Sbjct: 479 EPVVIQKSL----------DNEVQPNSLDNEVPPNSL--------DNEVQPNSLDTGVQP 520
Query: 486 KPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
K F N + KS+ +T KD RLHKLVF+ LPDGTE+ YYA GQKLL GYK G
Sbjct: 521 KSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGC 580
Query: 546 GIICHCCNSEVSPSQFEAHA---------------------------------------- 565
GI C CCN +VS SQFEAHA
Sbjct: 581 GIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDND 640
Query: 566 -------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 618
DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++ R QH+ NA+
Sbjct: 641 DLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALA 700
Query: 619 A-GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 677
A GR++G D +E + KRCIR+VK +E + GC LC +FSKS FGPRT+++CDQCE+E+
Sbjct: 701 AAGRIAGPDILELMNKRCIRVVKTVEVDHGGCALCSRPNFSKS-FGPRTVIICDQCEKEY 759
Query: 678 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK-YAGN 736
HVGCLK+H M +L +LP+G WFC +CS I++ L +L+ + + +P+ L+ IKK +
Sbjct: 760 HVGCLKEHNMENLEKLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEK 819
Query: 737 SLETVSDIDVRWRLLSGKAATP-----ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 791
SLE + +DV+WR+++ K + ETR LLS+AVAIFH+ FDPIVDS SGRD IP+M
Sbjct: 820 SLEIGAGLDVKWRVMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTM 879
Query: 792 VYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G+
Sbjct: 880 LFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQ 932
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/397 (58%), Positives = 293/397 (73%), Gaps = 47/397 (11%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
LS NKS+W I+ + QRLHKL+F+E GLP+G EV YYA GQKLLEG K GI+C CCN+
Sbjct: 276 LSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYYARGQKLLEGIKTRCGIVCRCCNT 335
Query: 555 EVSPSQFEAHA----------------------------------------------DGG 568
EVSPSQFE HA DGG
Sbjct: 336 EVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGG 395
Query: 569 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 628
NLL CDGCPRAFHKECAS+SSIP+G+WYC+ CQ+ F R+R + ++A+AV AGRV GVD +
Sbjct: 396 NLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPI 455
Query: 629 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 688
EQI KRCIRIVK++ AE+ GC+LCR DFS+SGFGPRTI++CDQCE+E+HVGCL+ HK A
Sbjct: 456 EQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKA 515
Query: 689 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVR 747
L+ELP+G WFCC DC+ I+S L+NLL++ AE+LPE L+ I KK LE +++IDVR
Sbjct: 516 YLKELPEGDWFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVR 575
Query: 748 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYC 807
W+LL+GK A+PETR LL +AV++FH+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYC
Sbjct: 576 WKLLNGKIASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYC 635
Query: 808 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
A+L VNSSVVSAG++R+FG+++AELPLVAT N GK
Sbjct: 636 ALLIVNSSVVSAGMVRIFGRDIAELPLVATRYKNRGK 672
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/138 (63%), Positives = 108/138 (78%), Gaps = 1/138 (0%)
Query: 208 KKNLELKMSKKI-SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILC 266
K LK +KKI ++KKP+TV ELF+TGLLDGV VVY+G K + LRG I+DGGILC
Sbjct: 85 KTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILC 144
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
SC LCNG RVIPPS+FEIHAC Y+RA+QYIC ENGKSLL+++RACR+ PL L+AT+Q+
Sbjct: 145 SCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQN 204
Query: 327 ALSSLPEEKSFACVRCKG 344
++S PEEK F C C+G
Sbjct: 205 FINSPPEEKYFTCKSCRG 222
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 55/118 (46%), Gaps = 14/118 (11%)
Query: 193 IEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQ 251
++++ + S SP+ + ++SK+ K + E GL +G V Y G K
Sbjct: 264 VKLKTVPITSKCLSPQNKSQWRISKRYQRLHKLI----FEEDGLPNGAEVAYYARGQKL- 318
Query: 252 ASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRR-ASQYICFENGKSLLEV 308
L GI GI+C C CN + PS+FE+HA RR YI NG SL E+
Sbjct: 319 ---LEGIKTRCGIVCRC--CNT--EVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHEL 369
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 293/718 (40%), Positives = 417/718 (58%), Gaps = 77/718 (10%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
+ +QSA+ +++ C+ CKG P + G LC SC+ SKKPQ +
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDSKKPQAIDLLSL 458
Query: 378 GIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSS---SKSVSLRNL 434
+ D + I+ + N ++SKSS +KSVS R
Sbjct: 459 S--------HYYMKEFWADHLIITPKPN--------------VLSKSSDTITKSVSTRGK 496
Query: 435 LK---TRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENA 491
+ TR L + ++ + T V + ++ Q+ + K + + +
Sbjct: 497 IHGRITRKDLRLHKLVFEEDILPDGTEV-----AYYARGQKLLVGYKKGSGIFCSCCNSE 551
Query: 492 SPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL---GII 548
P F + W + + LH ++ +G V + L +G K L +
Sbjct: 552 VSPSQFEAHAGW-ASRRKPYLH--IYTSNG------VSLHELSISLSKGRKFSLTDNDDL 602
Query: 549 CHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
C C ADGG+LL CDGCPR+FH++C L IP G WYCKYCQN+F++++
Sbjct: 603 CSIC------------ADGGDLLCCDGCPRSFHRDCVPLQCIPTGIWYCKYCQNLFQKEK 650
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
F++H+ANAV AGRV+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++
Sbjct: 651 FVEHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVI 710
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 728
LCDQCE+EFHVGCLK++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L
Sbjct: 711 LCDQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILV 770
Query: 729 AI-KKYAGNSLETVSDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRD 786
++ KK +++D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD
Sbjct: 771 SVQKKIEDQGSASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRD 830
Query: 787 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
IPSM+YGRN+RGQEFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+
Sbjct: 831 FIPSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQ 888
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 236/416 (56%), Positives = 292/416 (70%), Gaps = 48/416 (11%)
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQK 536
++ K ++SK + + +S K IT KD RLHKLVF+E LPDGTEV YYA GQK
Sbjct: 473 RTPKPNVLSKSSDTITKSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQK 532
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHA------------------------------- 565
LL GYK G GI C CCNSEVSPSQFEAHA
Sbjct: 533 LLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGR 592
Query: 566 ---------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 610
DGG+LL CDGCPR+FH++C L IP G WYCKYCQN+F++++F+
Sbjct: 593 KFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNLFQKEKFV 652
Query: 611 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 670
+H+ANAV AGRV+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++LC
Sbjct: 653 EHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILC 712
Query: 671 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI 730
DQCE+EFHVGCLK++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L ++
Sbjct: 713 DQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSV 772
Query: 731 -KKYAGNSLETVSDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 788
KK +++D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD I
Sbjct: 773 QKKIEDQGSASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFI 832
Query: 789 PSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
PSM+YGRN+RGQEFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+
Sbjct: 833 PSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQ 888
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 145/289 (50%), Gaps = 29/289 (10%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEPVEVLVTQSEGFGNESMSLI--EVEAIAE--- 200
+V+ L KK ++P SA+K VEP L S+ +M +I + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTS-LEHLSKCNTGVAMQVITNDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ------- 370
+ +QSA+ +++ C+ CKG P + G LC SC+ S+KPQ
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDSRKPQVSSSPSP 458
Query: 371 ---GTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
+ T + R P ++S+SS+ + +S + K + TRK L
Sbjct: 459 SPSPSPTPIVFSKDRTPKPNVLSKSSDTITKSVSTR-GKIHGRITRKDL 506
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 283/757 (37%), Positives = 404/757 (53%), Gaps = 132/757 (17%)
Query: 154 SNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGN-------ESMSLIEVEAIA---EGSA 203
S G++KK + + KP LV Q N E L++V+ A E
Sbjct: 297 SQGVDKKAVN-DTVDKPLRRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDG 355
Query: 204 LTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGII 259
SP K + K L P + ++F+ G+L+G+ V Y+ G K + +G L+G+I
Sbjct: 356 FQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVI 415
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ G+LC CS C G +V+ P+ FE+HA +R +YI E+G +L +V+ AC+ PL
Sbjct: 416 KGSGVLCFCSACIGIQVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLAT 475
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
L+ L+ + + +KS C+ C+G C T +C SC++SK+P+
Sbjct: 476 LEEKLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKSLVVCKSCLESKEPE--------- 523
Query: 380 RIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRS 439
+ N+ + D ++ SS SV +++L+
Sbjct: 524 ----------------------FHNSPSKAN--------DALNGSSRPSVDPKSILR--- 550
Query: 440 PWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPN 499
R+ S P QS R+ + T+KS + ++ + S S +
Sbjct: 551 -----RSKSSP---------------RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKS 590
Query: 500 KSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS 559
S +T KD RLHKLVF++ LPDGTEVGY+ G+K+L GYK G GI C CCN VSPS
Sbjct: 591 NSHGKLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPS 650
Query: 560 QFEAHA----------------------------------------------DGGNLLPC 573
FEAHA DGG L+ C
Sbjct: 651 TFEAHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCC 710
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
D CPR++HK CASL S+P W CKYC NM ER++F+ + NA+ AGRV GVD++ +IT
Sbjct: 711 DTCPRSYHKVCASLPSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITN 770
Query: 634 RCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 692
RCIRIV + EL S C+LCRG F + GF RT+++CDQCE+EFHVGCLK+ +ADL+E
Sbjct: 771 RCIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKE 830
Query: 693 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRW 748
LP+ KWFC + C IN+ L NL+V+ EKL LN ++K + E D D+RW
Sbjct: 831 LPEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRW 890
Query: 749 RLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYC 807
R+LSGK ++ +T++LL++A++I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC
Sbjct: 891 RVLSGKLTSSDDTKILLAKALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYC 950
Query: 808 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+L V+ +VS GI RVFG E+AELPLVATSK G+
Sbjct: 951 TMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQ 987
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 216/381 (56%), Positives = 266/381 (69%), Gaps = 49/381 (12%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD LHKLVF E+GLP+GTEVGYY GQ+LL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 366 VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQLLVGYKRGSGIFCTCCNSEVSPSQFEAH 425
Query: 565 A----------------------------------------------DGGNLLPCDGCPR 578
A DGGNLL CDGCPR
Sbjct: 426 AGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLLCCDGCPR 485
Query: 579 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 638
FHKEC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV+GVD +EQITKRCIRI
Sbjct: 486 VFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRI 545
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
V E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL++H M DL+E+PKGKW
Sbjct: 546 VNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKW 605
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSG-KAA 756
FCC DC RINS LQ L+V E+LP L IK KY N D D++WRL+ G +A+
Sbjct: 606 FCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGRRAS 665
Query: 757 TPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSV 816
+ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G++ R +FGGMYCAILT++S V
Sbjct: 666 SIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHGKSTREWDFGGMYCAILTISSQV 724
Query: 817 VSAGILRVFGQEVAELPLVAT 837
VSA R+FG+EVAELPLVAT
Sbjct: 725 VSAAAFRIFGKEVAELPLVAT 745
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 91/167 (54%), Gaps = 10/167 (5%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKT 354
V+ A ++ L L ++ A+ +KS C+ CKG +G +
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS 341
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/392 (56%), Positives = 267/392 (68%), Gaps = 53/392 (13%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD R HKLVF+E LPDGTEVGYY G+KLL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 501 VTKKDIRTHKLVFEEEVLPDGTEVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAH 560
Query: 565 A----------------------------------------------DGGNLLPCDGCPR 578
A DGG LL CD CPR
Sbjct: 561 AGWASRRKPYLNIYTSNGVSLHELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPR 620
Query: 579 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 638
AFH+EC SL SIP+G WYCKYC N FE+++F++ +ANA+ AGRV+GVD +EQIT+RCIRI
Sbjct: 621 AFHQECLSLPSIPRGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRI 680
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
VK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+HKM DL+ELPKGKW
Sbjct: 681 VKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKW 739
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 758
FCC C RI+S LQ L+++ EKLP+ LN IKK+ ++ E+ DVRWRLLS K +
Sbjct: 740 FCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKHEESASESGCSDDVRWRLLSKKTDSS 799
Query: 759 E-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEFGGMYCAILTV 812
+ T LLS AVAIFH+CFDPI S R D IPSMV G N++GQ+ GGMYCA+L V
Sbjct: 800 DVTEALLSDAVAIFHECFDPITVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLV 859
Query: 813 NSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
N VVS ++R+FGQE+AELP+VATS G+
Sbjct: 860 NHVVVSVAVVRIFGQELAELPIVATSSRWQGQ 891
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 115/233 (49%), Gaps = 15/233 (6%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+E V++ + S+G ++V+ SA + N KM K P
Sbjct: 266 SALKPKIETVDI--SSSDG--------VKVDDRGSSSAAAATTTNTPTKMFSIDGSKKFP 315
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++R+ GILC C C G V+ P+ F
Sbjct: 316 TKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVTPAIF 375
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
+HA +R +YIC ENG +L +V+ AC++ L L ++ + P +KS C C
Sbjct: 376 VLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNFCWNC 435
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEN 395
+G+ IT G LC+ C K Q T + R + P V SS N
Sbjct: 436 RGS--ITGAGSRKSKVLCSQCFGLKDFQAGSAPKTAKKERTAKPHSVPESSCN 486
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 212/399 (53%), Positives = 267/399 (66%), Gaps = 54/399 (13%)
Query: 499 NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSP 558
+KS+ +T KD R+HKLVF+E LPDGTEV YY+ GQKLL GYK G GI C CCN+EVSP
Sbjct: 563 SKSQGRLTVKDLRMHKLVFEEDVLPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSP 622
Query: 559 SQFEAHA----------------------------------------------DGGNLLP 572
SQFEAHA DGG+LL
Sbjct: 623 SQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLC 682
Query: 573 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 632
CD CPRA+HK+C +L IP G WYCK+C N F++++F++H+ANA+ AGRV+GVD ++QIT
Sbjct: 683 CDVCPRAYHKDCLALPEIPTGRWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQIT 742
Query: 633 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 692
+RCIRIVK ++A+ GC+ CRG DF K FGPRT+LLCDQCE+EFHVGCLK H M DL+E
Sbjct: 743 RRCIRIVKTMDADFGGCVFCRGHDFDKI-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKE 801
Query: 693 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV--SDIDVRWRL 750
LPKG WFCC DC RI+S L+ L+++ E+L + LN I K + S+IDVRWRL
Sbjct: 802 LPKGNWFCCSDCCRIHSALEKLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRL 861
Query: 751 LSGKA-ATPETRLLLSQAVAIFHDCFDPIV----DSISGRDLIPSMVYGRNLRGQEFGGM 805
L+ K +T LLS+A+AI H+ F+PI+ S + RDLI SMV+G NL+GQEFGGM
Sbjct: 862 LNDKINPAGDTAALLSEALAILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGM 921
Query: 806 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
YCA+L +N +VVS I+R FG E+AELPLVATS GK
Sbjct: 922 YCAVLMINQAVVSCAIIRFFGLELAELPLVATSSKAQGK 960
Score = 99.0 bits (245), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 104/216 (48%), Gaps = 17/216 (7%)
Query: 166 SAMKPKVE-PVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
S +KPK+E E V S +++ GS + LK+ K + K
Sbjct: 321 SLLKPKMEIGQEYAVKDSSSAADDA-----------GSPSAASNSGTMLKVWKNDTSKKF 369
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVIPPSK 281
P + +L ++G+L+G V YM G K + +G L+G+I ILC C C G V+ PS
Sbjct: 370 PTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSCRGNEVVTPSI 429
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI ENG +L +V+ AC++ L L L + + S C++
Sbjct: 430 FEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCSSLKNSTFCLK 489
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
C+G G++ LC+ C+ K Q ++ TT
Sbjct: 490 CRGKLAEASTGRS--MTLCSQCMVLKDSQASIPATT 523
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 214/400 (53%), Positives = 269/400 (67%), Gaps = 57/400 (14%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S+ +T KD R+HKLVF+E LPDGTEVGYY+ G+KLL GYK G GI C CCN+EVSPSQ
Sbjct: 473 SQGRLTKKDIRMHKLVFEEEVLPDGTEVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSPSQ 532
Query: 561 FEAHA----------------------------------------------DGGNLLPCD 574
FEAHA DGG LL CD
Sbjct: 533 FEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKCRRHSTKENDDLCQICRDGGKLLCCD 592
Query: 575 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 634
CPRAFH+EC SL SIP+G WYCKYC N FE+++F++ +ANA+ AGRV+G D +EQIT+R
Sbjct: 593 VCPRAFHQECLSLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQITRR 652
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR--- 691
CIRIVK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+H+M DL+
Sbjct: 653 CIRIVKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHQMQDLKAIC 711
Query: 692 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 750
ELP GKWFCC C RI+S LQ L+++ EKLP+ LN I KK+ ++ E+ D+RWRL
Sbjct: 712 ELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKKHEESASESGGGDDIRWRL 771
Query: 751 LSGKAATPE-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEFGG 804
LS K + T LLS+AVAIFH+ F PI S R D IPSMV G +++GQ+ GG
Sbjct: 772 LSKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQDLGG 831
Query: 805 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
MYCA+L VN VVSA ++R+FGQE+AELP+VATS + G+
Sbjct: 832 MYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKSQGQ 871
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 120/231 (51%), Gaps = 17/231 (7%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+EP+++ + S+G + V AI T+P K + L K P
Sbjct: 244 SALKPKIEPLDI--SSSDGVKVDDTGSSSVAAIT-----TTPTKMFAID-----GLKKFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++++ GILC C C G V+ P+ F
Sbjct: 292 TKLKDLLDSGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIF 351
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA +R +YI ENG +L +V+ AC++ L +L ++ ++ P +KS C+ C
Sbjct: 352 ELHAGSANKRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSC 411
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
+G+ IT G LC+ C++ K Q + T + R P V SS
Sbjct: 412 RGS--ITGAGTRKSKVLCSQCLELKDSQAILAPETDTKERTPRPSPVPESS 460
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 213/395 (53%), Positives = 264/395 (66%), Gaps = 49/395 (12%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
P+K +T KD R+HKLVF+ LPDGT + YY G+KLLEGYK G I C+CC SEVS
Sbjct: 581 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGKKLLEGYKKGGAIFCYCCQSEVS 640
Query: 558 PSQFEAHA----------------------------------------------DGGNLL 571
PSQFEAHA DGG+LL
Sbjct: 641 PSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGDLL 700
Query: 572 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 631
CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D++EQI
Sbjct: 701 CCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQI 759
Query: 632 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 691
TK IRIV L AE+ C+LCR DFS SGFGP+T+++CDQCE+E+HV CL++H M DL+
Sbjct: 760 TKCSIRIVDTLHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLK 819
Query: 692 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 750
ELPK KWFCC +C+ I+ LQ L+ + LP+ + I +K +LE S DV+WRL
Sbjct: 820 ELPKDKWFCCKECNSIHYALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSINDVKWRL 879
Query: 751 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR-DLIPSMVYGRNLRGQEFGGMYCAI 809
LSGK +T ETR+ LS AV+IFHD FDPI DS + R DLIP+MVYGRN + Q+FGGM CAI
Sbjct: 880 LSGKNSTEETRVWLSGAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLCAI 939
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
L VNS VVSAG++R+FG+EVAELPLVATS GK
Sbjct: 940 LMVNSLVVSAGVIRIFGKEVAELPLVATSLDCQGK 974
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 105/176 (59%), Gaps = 6/176 (3%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLR 256
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y+ K + GL+
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQ 404
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
G+I+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 405 GVIQGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAP 464
Query: 317 LPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 465 ADALEAAIRNATGAGDARKSTFCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 518
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 404 bits (1038), Expect = e-109, Method: Compositional matrix adjust.
Identities = 214/412 (51%), Positives = 263/412 (63%), Gaps = 71/412 (17%)
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ- 535
+S K IS+ E+ S S +KS +T KD LHKLVF E+GLP+GTEVGYY GQ
Sbjct: 370 RSPKPSTISRSAESVSKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQV 429
Query: 536 --KLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------------------------- 565
+LL GYK G GI C CCNSEVSPSQFEAHA
Sbjct: 430 VTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLS 489
Query: 566 ------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
DGGNLL CDGCPR FHKEC SL++IP+G W+CK+C NM +++
Sbjct: 490 RGREISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKE 549
Query: 608 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 667
+F++H+ANAV AGRV+GVD +EQITKRCIRIV E+ GC LCR +FS+SGFGPRT+
Sbjct: 550 KFVEHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTV 609
Query: 668 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 727
+LCDQCE+EFHVGCL++H M DL+E+PKGKWFCC DC RINS LQ L+V E+LP L
Sbjct: 610 MLCDQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVL 669
Query: 728 NAIK-KYAGNSLETVSDIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGR 785
IK KY N D D++WRL+ G +A++ E LLSQA++IFH+ FDPI D+ +GR
Sbjct: 670 TTIKEKYGRNGSACSKDPDIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA-AGR 728
Query: 786 DLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
DL+P MV+G S VVSA R+FG+EVAELPLVAT
Sbjct: 729 DLLPDMVHG-------------------SQVVSAAAFRIFGKEVAELPLVAT 761
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 114/210 (54%), Gaps = 15/210 (7%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
V+ A ++ L L ++ A+ +KS C+ CKG +G + LC SC++ K
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS--KVLCLSCLQLK 352
Query: 368 KPQGTMTYTTG---IRIRASGPKLVSRSSE 394
+ Q + + TG +R+ P +SRS+E
Sbjct: 353 ESQASPSQVTGSSDSHLRSPKPSTISRSAE 382
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 246/667 (36%), Positives = 355/667 (53%), Gaps = 131/667 (19%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVI 277
L P + E+F G+L+G++V Y+ G K + +G L+G+I+ G+LC C C G +V+
Sbjct: 363 LRNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVV 422
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPE-EKS 336
+ +E+HA +R +YI E+G +L +V+ AC+ P L+ L+ + P +KS
Sbjct: 423 STAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVG--PNLKKS 480
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIRIRASGPKLVSRSSE 394
C+ C+G C T +C SC++SK+P+ + + G +S P + +S
Sbjct: 481 SLCLNCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSIL 538
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+ S Q+N++E+ TR S+ PG++
Sbjct: 539 SRSKSSPRQSNRQEQP---------------------------------TRKSTEPGVVP 565
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
+ S S +S Q K+T+ KD RLHK
Sbjct: 566 GTILSESKSSSIKSNSQGKLTR-------------------------------KDVRLHK 594
Query: 515 LVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA--------- 565
LVF++ LPDGTEVGY+ G EVSPS FEAHA
Sbjct: 595 LVFEDDILPDGTEVGYFVAG--------------------EVSPSSFEAHAGCASRRKPF 634
Query: 566 ------DGGNLLPC----------------DGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
+G +L D C CASLSS+P W CKYC NM
Sbjct: 635 QHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGVCASLSSLPSERWSCKYCVNM 694
Query: 604 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGF 662
ER++F+ + NA+ AGRV GVD++ +IT RCIR+V + EL S C+LCRG F + GF
Sbjct: 695 VEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPSVCVLCRGHSFCRLGF 754
Query: 663 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 722
RT+++CDQCE+EFHVGCLK+H +ADL+ELP+ KWFC +DC +IN+ L NL+++ EKL
Sbjct: 755 NSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEKINTTLGNLIIRGEEKL 814
Query: 723 PEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDP 777
LN I+ + E+ D D+RWR+LSGK ++ ET++LL++AV+I H+ FDP
Sbjct: 815 TNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGKLTSSDETKILLAKAVSILHERFDP 874
Query: 778 IVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
I ++ + DLIP+MVYGR +GQ+F GMYC +L V+ +VS GI RVFG E+AELPLVAT
Sbjct: 875 ISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVAT 934
Query: 838 SKINHGK 844
SK G+
Sbjct: 935 SKDCQGQ 941
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 237/615 (38%), Positives = 330/615 (53%), Gaps = 117/615 (19%)
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI E+G +L +V+ AC+ PL L+ L+ + + ++ S
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSS----- 56
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
LC SC GP + +++ +C S
Sbjct: 57 -----------------LCLSC--------------------QGPMIEPCDTKSLVVCKS 79
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+K D ++ SS SV +++L+ R+ S P
Sbjct: 80 CLESKEPEFHNSPSKANDALNGSSRPSVDPKSILR--------RSKSSP----------- 120
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
QS R+ + T+KS + ++ + S S + S +T KD RLHKLVF++
Sbjct: 121 ----RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDI 176
Query: 522 LPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------------- 565
LPDGTEVGY+ G+K+L GYK G GI C CCN VSPS FEAHA
Sbjct: 177 LPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTN 236
Query: 566 ------------------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDW 595
DGG L+ CD CPR++HK CASL S+P W
Sbjct: 237 GVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERW 296
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRG 654
CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG
Sbjct: 297 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRG 356
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 714
F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL
Sbjct: 357 HSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNL 416
Query: 715 LVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVA 769
+V+ EKL LN ++K + E D D+RWR+LSGK ++ +T++LL++A++
Sbjct: 417 IVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALS 476
Query: 770 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 829
I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+
Sbjct: 477 ILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSEL 536
Query: 830 AELPLVATSKINHGK 844
AELPLVATSK G+
Sbjct: 537 AELPLVATSKDCQGQ 551
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/411 (49%), Positives = 258/411 (62%), Gaps = 56/411 (13%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 368 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 420
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------------- 565
++LL G+K G I+C CCNSEVSPSQFEAH+
Sbjct: 421 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 480
Query: 566 -----------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
DGG LL CDGCPR FHKEC SL +IP+G W+CK+C N ++ +
Sbjct: 481 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 540
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 541 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 600
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 728
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 601 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 660
Query: 729 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 786
IKK G S + D++WRLL G +A+ E LLSQA+++FH+ F+PI D+ GRD
Sbjct: 661 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFHEQFNPIADA-EGRD 719
Query: 787 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
L+ MV+ + EFGGMYCAILTV VVSA RV G+EVAELPLVAT
Sbjct: 720 LLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVAT 770
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 65 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 119
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 120 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 173
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 174 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 211
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 330
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 331 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 385
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/408 (46%), Positives = 253/408 (62%), Gaps = 52/408 (12%)
Query: 487 PFENASPPLSFP----NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYK 542
P +A P +F + S +T KD LHKLVF LP+GT+VGYY G++LL+GY
Sbjct: 259 PTSSARVPKNFSPGAKSTSAGRLTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYI 318
Query: 543 NGLGIICHCCNSEVSPSQFEAHA------------------------------------- 565
GI CHCCN+ VSPSQFE HA
Sbjct: 319 KEPGIHCHCCNTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQ 378
Query: 566 ---------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 616
DGG LL CD CPRAFH+EC L+++P+G W C+YC+ +R+ L ++ NA
Sbjct: 379 SDDLCSICSDGGELLLCDTCPRAFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNA 438
Query: 617 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 676
+ AGR+ G+DS+EQI R IRI E GC LC+ DF K F RT+LLCDQC RE
Sbjct: 439 IAAGRIDGIDSMEQIFTRSIRIATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGRE 498
Query: 677 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAG 735
+HVGCLK+H MADL LP+G W+C DC RI+ +++LL AE +P + I KK
Sbjct: 499 YHVGCLKEHSMADLTALPEGAWYCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKRED 558
Query: 736 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 795
L D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ + +GRDLIP+MVYGR
Sbjct: 559 KGLNEDGDLDVRWRVLRDKSSE-DSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGR 617
Query: 796 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
++R Q++ GMYCA+LTV ++VVSAG+ R+ G+E AELPLVATS+ N G
Sbjct: 618 SVRDQDYTGMYCAVLTVGNTVVSAGLFRIMGREAAELPLVATSRDNQG 665
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/396 (49%), Positives = 246/396 (62%), Gaps = 56/396 (14%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQ-------KLLEGYKNGLGIICHCCNSEVS 557
I D LHKL F E+ LP+G+EV YY G+ +LL G+K G I+C CCNSEVS
Sbjct: 383 IMGGDVGLHKLAFGENDLPEGSEVSYYVRGEVGTMRSKRLLSGHKKGCRILCDCCNSEVS 442
Query: 558 PSQFEAHA----------------------------------------------DGGNLL 571
PSQFEAH+ DGG LL
Sbjct: 443 PSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLRGREPSINTNDEICSICLDGGTLL 502
Query: 572 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 631
CDGCPR FHKEC SL +IP+G W+CK+C N ++ +F++ +ANAV AGR+ GVD +EQI
Sbjct: 503 CCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQI 562
Query: 632 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 691
KRCIRIVK+ E GC LCR +FS SGFGP T+++CDQCE+EFHVGCLK H + DL+
Sbjct: 563 RKRCIRIVKSQTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLK 622
Query: 692 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRL 750
+PKGKWFCC DC INS L+ ++V+ E+LP+ L IKK G S + D++WRL
Sbjct: 623 AVPKGKWFCCRDCKDINSSLRKIVVRREEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRL 682
Query: 751 LSGKAATP-ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 809
L G+ A+ E LLSQA+++FH+ F+PI D+ GRDL+ MV+ + EFGGMYCAI
Sbjct: 683 LHGRXASATEAGSLLSQALSLFHEQFNPIADA-EGRDLLLDMVHSNSTGELEFGGMYCAI 741
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKV 845
LTV VVSA RV G+EVAELPLVAT G+V
Sbjct: 742 LTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQV 777
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 88/162 (54%), Gaps = 6/162 (3%)
Query: 220 SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRV 276
+L P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V
Sbjct: 208 TLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNV 267
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
+ P+ FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +
Sbjct: 268 VTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKST 327
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTG 378
F C CKG+ I+ VG + LC SCV K+ + TG
Sbjct: 328 F-CFNCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG 366
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 244/702 (34%), Positives = 352/702 (50%), Gaps = 152/702 (21%)
Query: 184 GFGNESMSLIEVEAIA---EGSALTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDG 239
G E L++V+ A E SP K + K L P + ++F+ G+L+G
Sbjct: 333 GNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEG 392
Query: 240 VSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF 299
+ V Y+ G +V+ P+ FE+HA +R +YI
Sbjct: 393 LIVYYV--------------------------RGAKVVSPAMFELHASSNNKRPPEYILL 426
Query: 300 ENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPL 359
E+G +L +V+ AC+ PL L+ L+ + + +KS C+ C+G C T +
Sbjct: 427 ESGFTLRDVMNACKENPLATLEEKLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKSLVV 483
Query: 360 CNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEAD 419
C SC++SK+P+ + N+ + D
Sbjct: 484 CKSCLESKEPE-------------------------------FHNSPSKAN--------D 504
Query: 420 LVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSK 479
++ SS SV +++L+ R+ S P QS R+ + T+KS
Sbjct: 505 ALNGSSRPSVDPKSILR--------RSKSSP---------------RQSNRREQPTRKST 541
Query: 480 KTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLE 539
+ ++ + S S + S +T KD RLHKLVF++ LPDGTEVGY+ G
Sbjct: 542 EPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILPDGTEVGYFVAG----- 596
Query: 540 GYKNGLGIICHCCNSEVSPSQFEAHA---------------DGGNLLPC----------- 573
EVSPS FEAHA +G +L
Sbjct: 597 ---------------EVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQRFS 641
Query: 574 -----DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 628
D C CASL S+P W CKYC NM ER++F+ + NA+ AGRV GVD++
Sbjct: 642 IHENDDLCSICRDGVCASLPSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAI 701
Query: 629 EQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
+IT RCIRIV + EL S C+LCRG F + GF RT+++CDQCE+EFHVGCLK+ +
Sbjct: 702 AEITNRCIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDI 761
Query: 688 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI--- 744
ADL+ELP+ KWFC + C IN+ L NL+V+ EKL LN ++K + E D
Sbjct: 762 ADLKELPEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTT 821
Query: 745 -DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 802
D+RWR+LSGK ++ +T++LL++A++I H+ FDPI +S + DLIP+MVYGR + Q+F
Sbjct: 822 PDIRWRVLSGKLTSSDDTKILLAKALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDF 881
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
GMYC +L V+ +VS GI RVFG E+AELPLVATSK G+
Sbjct: 882 SGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQ 923
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 352 bits (902), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 179/383 (46%), Positives = 231/383 (60%), Gaps = 68/383 (17%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D+ +HK+VF L +GT+VGYY G+ VSPSQFEAHA
Sbjct: 443 RDKGMHKVVFMSGVLQEGTDVGYYVGGK--------------------VSPSQFEAHAGR 482
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG LL CD CPRAFH
Sbjct: 483 AARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTCPRAFH 542
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 641
+EC SLSS P+G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI
Sbjct: 543 RECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATT 602
Query: 642 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 603 PETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCS 662
Query: 702 MDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPET 760
DC RIN LQ+LL E +P L+ I KK +D+DVRWR+L K++ ++
Sbjct: 663 TDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDKSSD-DS 721
Query: 761 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 820
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYCA+LTVN++VVSAG
Sbjct: 722 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNTVVSAG 781
Query: 821 ILRVFGQEVAELPLVATSKINHG 843
+ R+ G E+AELPLVATS+ + G
Sbjct: 782 LFRIMGNEIAELPLVATSRDSQG 804
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 57/93 (61%), Gaps = 2/93 (2%)
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
+V+ P FE+HA + S YI ENG +L +VLRAC +V L ML++ ++ A+ P++
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
++F C CK +F C GK C+SC++SK
Sbjct: 406 RTFRCKGCKSSFSTLCSGKF--ALFCDSCLESK 436
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 342 bits (878), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 244/437 (55%), Gaps = 72/437 (16%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPNKSRWN-------ITPKDQRLHKL 515
K S Q K+ KS T L+ P N P N +D LH+L
Sbjct: 357 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNSGVLQKRSADGCTKRRDNDLHRL 416
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 565
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 417 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYR 476
Query: 566 ------------------------------------DGGNLLPCDGCPRAFHKECASLSS 589
DGG+L+ C+GCPRAFH C L
Sbjct: 477 HIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQC 536
Query: 590 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 649
+P W C C + + N GR S + V I R R+ K E E+ GC
Sbjct: 537 VPDSGWQCLNC---------IDNAGN----GRESSI--VRPIMIRLTRVDKTPEVEMGGC 581
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 709
++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI +
Sbjct: 582 VVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYA 641
Query: 710 VLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQA 767
LQN + AE +P F I+K+ L T + D++WR+LSGK+ PE LLS+A
Sbjct: 642 ALQNSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRA 701
Query: 768 VAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 827
AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VN VVSAG+LR+FG+
Sbjct: 702 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFGR 760
Query: 828 EVAELPLVATSKINHGK 844
VAELPLVATS+ + GK
Sbjct: 761 NVAELPLVATSRAHQGK 777
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 227 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 281
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 282 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIK 341
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 342 NVAGSSVNEESF 353
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 194/437 (44%), Positives = 244/437 (55%), Gaps = 72/437 (16%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPNKSRWN-------ITPKDQRLHKL 515
K S Q K+ KS T L+ P N P N +D LH+L
Sbjct: 360 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNTGVLQKRSADGCTKRRDNDLHRL 419
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA---------- 565
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 420 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYR 479
Query: 566 ------------------------------------DGGNLLPCDGCPRAFHKECASLSS 589
DGG+L+ C+GCPRAFH C L
Sbjct: 480 HIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQC 539
Query: 590 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 649
+P W C C++ NA GR S + V I R R+ K E E+ GC
Sbjct: 540 VPDSGWQCLNCRD------------NAGN-GRESSI--VRPIMIRLTRVDKTPEFEMGGC 584
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 709
++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI
Sbjct: 585 VVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYV 644
Query: 710 VLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQA 767
LQN + AE +P I+K+ L T + D++WR+LSGK+ PE LLS+A
Sbjct: 645 ALQNSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRA 704
Query: 768 VAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 827
AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSAG+LR+FG+
Sbjct: 705 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGR 763
Query: 828 EVAELPLVATSKINHGK 844
VAELPLVATS+ + GK
Sbjct: 764 NVAELPLVATSRAHQGK 780
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 230 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 284
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 285 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIK 344
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 345 NVAGSSVNEESF 356
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/385 (45%), Positives = 227/385 (58%), Gaps = 67/385 (17%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 458 RDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGM 517
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+L+ C+GCPRAFH
Sbjct: 518 AARRQPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFH 577
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 641
C L S+P+ W+C C++ +R I R R+ K
Sbjct: 578 AACLGLHSVPESGWHCLNCEDNTGDER------------------GARPIMIRLTRVDKE 619
Query: 642 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
E E+ GC++CR DFS F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC
Sbjct: 620 PEYEVGGCVVCRANDFSVDKFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCC 679
Query: 702 MDCSRINSVLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPE 759
DC+RI LQN + A+ +P I+K+ L T D+ D++WR+LSGK+ E
Sbjct: 680 DDCNRIYVALQNSVSAGADTIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAE 739
Query: 760 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 819
LLS+A AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA
Sbjct: 740 HLPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSA 798
Query: 820 GILRVFGQEVAELPLVATSKINHGK 844
G+LR+FG+ +AELPLVATS+ + GK
Sbjct: 799 GLLRIFGRNIAELPLVATSREHQGK 823
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 72/127 (56%), Gaps = 12/127 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILC 266
N+ELKMSKK+ N P V +L TG+LDG +V Y+ G ++ L GII DGG LC
Sbjct: 240 NMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVE-----LDGIIGDGGYLC 294
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR----ACRSVPLPMLKA 322
CS+C+ RV+ +FE HA + R + +I ENGK + ++ A S P ++K
Sbjct: 295 GCSMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKN 354
Query: 323 TLQSALS 329
S+++
Sbjct: 355 VAGSSIN 361
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 183/398 (45%), Positives = 229/398 (57%), Gaps = 67/398 (16%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
L S I +D LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+S
Sbjct: 388 LQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDS 447
Query: 555 EVSPSQFEAHA----------------------------------------------DGG 568
EVSPSQFEAHA DGG
Sbjct: 448 EVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGG 507
Query: 569 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 628
+L+ CDGCPRAFH C L +P+GDW C C F R +
Sbjct: 508 DLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------A 550
Query: 629 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 688
I + R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ +
Sbjct: 551 RPIRIQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLC 610
Query: 689 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDV 746
DL+ELPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D+
Sbjct: 611 DLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DI 669
Query: 747 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMY 806
+W +LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMY
Sbjct: 670 QWCILSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMY 728
Query: 807 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
C +L S+VVSAG++RVFGQEVAELP+VATSK + GK
Sbjct: 729 CVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGK 766
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 340 bits (871), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 181/414 (43%), Positives = 244/414 (58%), Gaps = 63/414 (15%)
Query: 484 ISKPFENASPPLS-----FPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLL 538
+S P E +P S + +D LH+L+F +GLPDG E+ Y+ GQ++L
Sbjct: 364 LSHPVERPNPNFSNAVLQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRIL 423
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHA--------------------------------- 565
G+K G GI+C CN E+SPSQFEAHA
Sbjct: 424 GGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLASGQKL 483
Query: 566 -------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH 612
+GG+L+ CD CPRA+H C L ++P+G W C C++ +
Sbjct: 484 TTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDK------VGS 537
Query: 613 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 672
++ A+ G +S + I R R+VK E E+ GC++CR DFS + F RT+LLCDQ
Sbjct: 538 NSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQ 594
Query: 673 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN-AIK 731
CEREFHVGCL+ + DL+ELPK KWFCC +CS I+ LQN ++ A+ +P+ + I+
Sbjct: 595 CEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIR 654
Query: 732 KYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 790
K+ G L + DVRW++LSGK+ PE LS+A AIF +CFDPIV + SGRDLIP
Sbjct: 655 KHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECFDPIV-AKSGRDLIPV 713
Query: 791 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+EVAELP+VATS+ + GK
Sbjct: 714 MVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREHQGK 767
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E AE S + +ELKMSKK+ N P V +L TG+LDG V Y+ L
Sbjct: 199 EGSAESSRYSLGPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK--L 256
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG +C CS CN ++ +FE HA + R + +I ENG+ + V++ +S
Sbjct: 257 QGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSA 316
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
PL +L + S SF
Sbjct: 317 PLSILDEVIMEVAGSSVNMNSF 338
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 183/398 (45%), Positives = 229/398 (57%), Gaps = 67/398 (16%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
L S I +D LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+S
Sbjct: 443 LQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDS 502
Query: 555 EVSPSQFEAHA----------------------------------------------DGG 568
EVSPSQFEAHA DGG
Sbjct: 503 EVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGG 562
Query: 569 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 628
+L+ CDGCPRAFH C L +P+GDW C C F R +
Sbjct: 563 DLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------A 605
Query: 629 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 688
I + R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ +
Sbjct: 606 RPIRIQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLC 665
Query: 689 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDV 746
DL+ELPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D+
Sbjct: 666 DLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DI 724
Query: 747 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMY 806
+W +LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMY
Sbjct: 725 QWCILSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMY 783
Query: 807 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
C +L S+VVSAG++RVFGQEVAELP+VATSK + GK
Sbjct: 784 CVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGK 821
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 174/387 (44%), Positives = 230/387 (59%), Gaps = 63/387 (16%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY QKLL GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 402 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYKQGSGIVCSCCSREISPSQFEAHAGM 461
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+LL C GCP+AFH
Sbjct: 462 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 521
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 638
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 522 TACLKFQSVPEGTWYCSSCND------------GPISSKKATATDPSGNARPIVIRLSRV 569
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ + DL+E+P+ KW
Sbjct: 570 VKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGLCDLKEIPQEKW 629
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 757
FCC DCSRI++ +QN + + +P L+ I +K + T + V WR+LSGK+
Sbjct: 630 FCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNGDIVEWRILSGKSRY 689
Query: 758 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 817
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 690 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 748
Query: 818 SAGILRVFGQEVAELPLVATSKINHGK 844
SA +LR+FGQ+VAELP+VATS+ G+
Sbjct: 749 SAALLRIFGQQVAELPIVATSREYQGR 775
Score = 76.3 bits (186), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 72/123 (58%), Gaps = 3/123 (2%)
Query: 213 LKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN 272
+KM KKI P V +L ETG+L+G V Y+ + L+GII GG LC C+ C+
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPVRE--LQGIIHSGGYLCGCTTCS 216
Query: 273 GCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALSSL 331
+V+ +FE+HA + R + +I ENG+++ +++ ++ P +L+ +++ A S+L
Sbjct: 217 FSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAGSAL 276
Query: 332 PEE 334
EE
Sbjct: 277 NEE 279
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 338 bits (867), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 176/385 (45%), Positives = 236/385 (61%), Gaps = 58/385 (15%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDG E+ Y+ GQ++L G+K G GI+C CN E+SPSQFEAHA
Sbjct: 332 RDNDLHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGM 391
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
+GG+L+ CD CPRA+H
Sbjct: 392 AARRQPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYH 451
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 641
C L ++P+G W C C++ + ++ A+ G +S + I R R+VK
Sbjct: 452 TGCLHLQNVPEGVWSCPNCRDK------VGSNSKAISGGSLS---FSKPIVFRLTRVVKA 502
Query: 642 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
E E+ GC++CR DFS + F RT+LLCDQCEREFHVGCL+ + DL+ELPK KWFCC
Sbjct: 503 PEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCC 562
Query: 702 MDCSRINSVLQNLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDI-DVRWRLLSGKAATPE 759
+CS I+ LQN ++ A+ +P+ + I+K+ G L + DVRW++LSGK+ PE
Sbjct: 563 DECSNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPE 622
Query: 760 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 819
LS+A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V S VVSA
Sbjct: 623 DLPFLSRATAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSA 681
Query: 820 GILRVFGQEVAELPLVATSKINHGK 844
G+LR+FG+EVAELP+VATS+ + GK
Sbjct: 682 GLLRIFGREVAELPIVATSREHQGK 706
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 338 bits (866), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 174/387 (44%), Positives = 229/387 (59%), Gaps = 63/387 (16%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 372 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 431
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+LL C GCP+AFH
Sbjct: 432 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 491
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 638
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 492 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 539
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 540 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 599
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 757
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 600 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 659
Query: 758 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 817
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 660 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 718
Query: 818 SAGILRVFGQEVAELPLVATSKINHGK 844
SA +LR+FGQEVAELP+VATS+ G+
Sbjct: 719 SAALLRIFGQEVAELPIVATSREYQGR 745
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 160 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 216
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 217 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 276
Query: 330 SLPEE 334
+L EE
Sbjct: 277 ALNEE 281
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 184/410 (44%), Positives = 236/410 (57%), Gaps = 55/410 (13%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
SK F PL+ S +T KD LHKLVF L DGTEV YY GQ+ ++GY
Sbjct: 537 SKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKD 594
Query: 545 LGIICHCCNSEVSPSQFEAHA--------------------------------------- 565
I C+ CN VSPS FEAHA
Sbjct: 595 QRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDMELSERETD 654
Query: 566 -------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 618
GG++ PC CPR+FH C LS +P +WYC C N+ ++++ L + NA
Sbjct: 655 DLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKALAENKNAKA 713
Query: 619 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 678
AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++LCDQCE+E+H
Sbjct: 714 AGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVILCDQCEKEYH 772
Query: 679 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNS 737
VGCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+
Sbjct: 773 VGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKG 832
Query: 738 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR-- 795
L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLIP MV GR
Sbjct: 833 LSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRP 892
Query: 796 --NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
+ GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G
Sbjct: 893 KDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQG 942
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 76/151 (50%), Gaps = 2/151 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYM-GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P + EL TGLL+G+ V Y+ K Q + L+G+I I C C CNG + +
Sbjct: 369 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCS 428
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C
Sbjct: 429 YFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNC 488
Query: 340 VRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
+ C + G LC C++ K+ Q
Sbjct: 489 LNCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 518
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 184/410 (44%), Positives = 236/410 (57%), Gaps = 55/410 (13%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
SK F PL+ S +T KD LHKLVF L DGTEV YY GQ+ ++GY
Sbjct: 546 SKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKD 603
Query: 545 LGIICHCCNSEVSPSQFEAHA--------------------------------------- 565
I C+ CN VSPS FEAHA
Sbjct: 604 QRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDMELSERETD 663
Query: 566 -------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 618
GG++ PC CPR+FH C LS +P +WYC C N+ ++++ L + NA
Sbjct: 664 DLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKALAENKNAKA 722
Query: 619 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 678
AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++LCDQCE+E+H
Sbjct: 723 AGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVILCDQCEKEYH 781
Query: 679 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNS 737
VGCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+
Sbjct: 782 VGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKG 841
Query: 738 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR-- 795
L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLIP MV GR
Sbjct: 842 LSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRP 901
Query: 796 --NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
+ GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G
Sbjct: 902 KDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQG 951
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 2/150 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V Y+ +A L+G+I I C C CNG + +
Sbjct: 380 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKKAV-LKGVITGCNIRCFCLSCNGSKDVCSY 438
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C+
Sbjct: 439 FFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCL 498
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C + G LC C++ K+ Q
Sbjct: 499 NCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 527
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 173/385 (44%), Positives = 232/385 (60%), Gaps = 60/385 (15%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAHA
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 566 ---------------------------------------------DGGNLLPCDGCPRAF 580
+GG+LL C GCP+AF
Sbjct: 449 AGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAF 508
Query: 581 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 640
H C S+P+G WYC C + + A A + +++ I R R+VK
Sbjct: 509 HTACLKFQSMPEGTWYCSSCNDGPTSCKI----ATASDP-------NLKPIVIRLTRVVK 557
Query: 641 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++++ DL+ +P+ KWFC
Sbjct: 558 APESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFC 617
Query: 701 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 759
C DCSRI+ VLQ+ + +P L+ I +KY + + V WR+LSGK+ PE
Sbjct: 618 CSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPE 677
Query: 760 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 819
LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA
Sbjct: 678 HLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSA 736
Query: 820 GILRVFGQEVAELPLVATSKINHGK 844
+LR+FGQ+VAELP+VATS+ G+
Sbjct: 737 ALLRIFGQKVAELPIVATSREYQGR 761
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQAW--KGSF 300
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 335 bits (858), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 174/387 (44%), Positives = 229/387 (59%), Gaps = 63/387 (16%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHA
Sbjct: 77 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 136
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+LL C GCP+AFH
Sbjct: 137 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 196
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 638
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 197 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 244
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 245 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 304
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 757
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 305 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 364
Query: 758 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 817
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 365 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 423
Query: 818 SAGILRVFGQEVAELPLVATSKINHGK 844
SA +LR+FGQEVAELP+VATS+ G+
Sbjct: 424 SAALLRIFGQEVAELPIVATSREYQGR 450
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 171/385 (44%), Positives = 217/385 (56%), Gaps = 67/385 (17%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY GQK+L GYK G GI+C CC E+SPSQFE+HA
Sbjct: 427 RDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGM 486
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+L+ C CPRAFH
Sbjct: 487 SARRQPYRHIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFH 546
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 641
C L P+G W+C C L H N I R R+VK
Sbjct: 547 AACLDLHDTPEGAWHCPNCNK-------LGHGGNFARP-----------IVIRLTRVVKT 588
Query: 642 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
E ++ GC +CR DFS F RT++LCDQCE+EFHVGCL++ + DL+E+PK WFCC
Sbjct: 589 PEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCC 648
Query: 702 MDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPE 759
DC+ I L+N + + +P LN I K L + DV+W++L GK+ E
Sbjct: 649 QDCNNIYVALRNSVSTGVQTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILMGKSRNRE 708
Query: 760 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 819
LLS A AIF +CFDPIV + +GRDLIP MVYGRN+ GQEFGGMYC +LTV VVSA
Sbjct: 709 DLSLLSGAAAIFRECFDPIV-AKTGRDLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSA 767
Query: 820 GILRVFGQEVAELPLVATSKINHGK 844
G+LR+FG+EVAELPLVAT++ + GK
Sbjct: 768 GLLRIFGREVAELPLVATNREHQGK 792
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 202 ERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 258
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS C+ +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 259 GCSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 318
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 319 VAGSSINEEFF 329
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 175/394 (44%), Positives = 232/394 (58%), Gaps = 55/394 (13%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD RLHKLVF+ L DGTEV YY GQ+ ++GY I C+ CN VSPS
Sbjct: 386 SAGKVTNKDNRLHKLVFNV--LLDGTEVAYYVDGQRKVDGYIKDHRIYCNHCNRVVSPSA 443
Query: 561 FEAHA----------------------------------------------DGGNLLPCD 574
FEAHA GG++ PC
Sbjct: 444 FEAHAGEGSRRKPYDNIFTSNGVSLHELAMKISKDMELSERETDDLCRECGQGGDIFPCK 503
Query: 575 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 634
CPR+FH C LS +P +WYC C+N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 504 ICPRSFHPACVGLSKVP-AEWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKR 562
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP
Sbjct: 563 AIRIVP-ISDDLGGCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELP 621
Query: 695 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 753
+ +WFCC CS S L ++ A+ L + L I KK+ L + D++W+LLSG
Sbjct: 622 EEEWFCCSSCSETRSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSG 681
Query: 754 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 809
K AT + +LLS AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+
Sbjct: 682 KRATEDGSILLSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCAL 741
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
LTV S+VVSA +LRV G +VAELPLVATS+ G
Sbjct: 742 LTVGSTVVSAALLRVMGGDVAELPLVATSQDVQG 775
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 78/154 (50%), Gaps = 5/154 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TG+L+G+ V+Y+ +A L+G+I I C C CNG + I
Sbjct: 207 LTKHPGNIRELLNTGMLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGAKAISAY 265
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + C+
Sbjct: 266 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIRSSIDPVVKRSRINCL 325
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
C + + LC C++SK+PQ +T
Sbjct: 326 NCNELV----LPSSHENVLCQVCLESKQPQDPLT 355
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 188/455 (41%), Positives = 242/455 (53%), Gaps = 73/455 (16%)
Query: 439 SPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFP 498
SP L SS+ + P +S S+ +RQ+ + + +P LS P
Sbjct: 351 SPHSLGSYSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRP-----SSLSHP 405
Query: 499 NKSRWNITPK-DQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
K+ T + D LH+L+F +GLPDG E+ YY GQK+L GYK G GI+C CC+ E+S
Sbjct: 406 KKTNEGGTRRRDNDLHRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCDREIS 465
Query: 558 PSQFEAHA----------------------------------------------DGGNLL 571
PSQFEAHA DGG+L+
Sbjct: 466 PSQFEAHAGMAARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLI 525
Query: 572 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 631
C+ CPRAFH C L +P W+C C F + I
Sbjct: 526 FCESCPRAFHLVCLGLKYVPSDVWHCPNCNKFGHGGNFSR------------------SI 567
Query: 632 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 691
R R+VK E E+ GC+ CR DFS F RT++LCDQCEREFHVGCL+ + + DL+
Sbjct: 568 VIRLTRVVKTPEYEVGGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLK 627
Query: 692 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI-DVRWR 749
E+PK WFC DC+RI LQN + + +P LN I K+A L D +WR
Sbjct: 628 EIPKDNWFCSNDCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWR 687
Query: 750 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 809
+L GK+ E LLS A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +
Sbjct: 688 ILMGKSRYQEDLSLLSAAAAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVL 746
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
L V + VVSAG+LR+FG++VAELPLVATS+ + GK
Sbjct: 747 LLVKNVVVSAGLLRIFGRDVAELPLVATSREHQGK 781
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/346 (47%), Positives = 212/346 (61%), Gaps = 55/346 (15%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 430 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 482
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------------- 565
++LL G+K G I+C CCNSEVSPSQFEAH+
Sbjct: 483 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 542
Query: 566 -----------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
DGG LL CDGCPR FHKEC SL +IP+G W+CK+C N ++ +
Sbjct: 543 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 602
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 603 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 662
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 728
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 663 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 722
Query: 729 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFH 772
IKK G S + D++WRLL G +A+ E LLSQA+++FH
Sbjct: 723 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFH 768
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 127 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 181
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 182 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 235
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 236 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 273
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 274 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 333
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 334 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 392
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 393 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 447
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 316 bits (810), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 169/389 (43%), Positives = 229/389 (58%), Gaps = 54/389 (13%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQK ++GY I C+ CN VSPS
Sbjct: 697 STGKVTTKDTGLHKLVF--KVLLDGTEVNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSA 754
Query: 561 FEAHAD----------------------------------------------GGNLLPCD 574
FEAHA GG++ PC
Sbjct: 755 FEAHAGEGSRRKPYDNIFTSNGVSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCR 814
Query: 575 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 634
CPR+FH C L +P +W+C C + ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 815 MCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKR 874
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP
Sbjct: 875 AIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELP 933
Query: 695 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 753
+G+WFCC CS I S L ++ + A L E ++ I KK+ L ++ ++RW+L++G
Sbjct: 934 EGEWFCCDSCSEIRSSLDKMISEGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAG 993
Query: 754 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 809
++AT + LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ++ GMYCA+
Sbjct: 994 RSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAV 1053
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATS 838
LTV S+VVSA +LRV G +VAELPLVATS
Sbjct: 1054 LTVGSTVVSAALLRVMGGDVAELPLVATS 1082
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 103/209 (49%), Gaps = 21/209 (10%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEV-EAIAEGSALTSPK-KNLELKMSKKIS-LN 222
S +KPKVE + SL+ V E + + T P +E+KMSKK++ L+
Sbjct: 472 SLLKPKVE------------APPASSLVVVPEEPVDSTPETPPSVTKMEMKMSKKVAFLS 519
Query: 223 KKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
K P +L TGLL+G+ V+Y+ + L+G+I I C C C+G + I F
Sbjct: 520 KHPGNTRDLLSTGLLEGMPVMYIIP-NSKKPVLKGVIAGCNIRCFCVKCDGSKTITTYFF 578
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA + ++YI NG L +VLRAC S PL L T+QS + + C+ C
Sbjct: 579 ELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCLNC 638
Query: 343 KGTFPITCVGKTGPGPLCNSCV-KSKKPQ 370
G P +T LC+ C +S +PQ
Sbjct: 639 NGELP----SQTEEQFLCHDCCPESNQPQ 663
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 172/389 (44%), Positives = 228/389 (58%), Gaps = 55/389 (14%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQ+ ++GY I C+ C+ VSPS
Sbjct: 723 SSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCSRVVSPSA 780
Query: 561 FEAHAD----------------------------------------------GGNLLPCD 574
FEAHA GG++ PC
Sbjct: 781 FEAHAGEGSRRKPYDNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGLGGDIFPCK 840
Query: 575 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 634
CPR+FH C LS P +W+C C N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 841 MCPRSFHPACVRLSEFPS-EWFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKR 899
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP
Sbjct: 900 AIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELP 958
Query: 695 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 753
G+WFCC CS I S L ++ A+ L L I KK+ L +DID+RW+LL+G
Sbjct: 959 DGEWFCCSSCSEIRSCLDKMISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAG 1018
Query: 754 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 809
++AT + LLLS AV I H FDPI+++ +GRDLIP MV GR + GQ++ GMYCA+
Sbjct: 1019 RSATEDGSLLLSSAVPIIHQSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAV 1078
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVATS 838
+T+ S+VVSA +LR+ G +VAELPLVATS
Sbjct: 1079 ITLGSTVVSAALLRIMGGDVAELPLVATS 1107
Score = 82.8 bits (203), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 77/150 (51%), Gaps = 5/150 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
K P V EL +TGLL+G+ V+Y+ +A ++G+I I C C CNG R +
Sbjct: 544 FTKHPGNVKELLQTGLLEGMPVMYIIPNSKKAV-VKGVITGCNIRCFCIKCNGSRALSTY 602
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA + +++I NG SL +VLRAC L L+ T +S++ + C+
Sbjct: 603 FFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCL 662
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C G P + LC+ C+ SK+PQ
Sbjct: 663 NCGGHLPSSETEHF----LCHCCLDSKQPQ 688
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 168/386 (43%), Positives = 226/386 (58%), Gaps = 56/386 (14%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ Y+ QKLL+GYK G GI+C CC++E+SPSQFEAHA
Sbjct: 376 RDNDLHRLLFLPNGLPDGTELAYFVKSQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGM 435
Query: 566 ---------------------------------------------DGGNLLPCDGCPRAF 580
DGG+LL C GCP+AF
Sbjct: 436 AGRRQPYRHIHISSGLSLHDIAMSLADGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAF 495
Query: 581 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC-IRIV 639
H C S+P+G WYC C + + A A + S V S+ + IR++
Sbjct: 496 HTACLKFQSMPEGTWYCSSCNDGPTSCK----TATATDPNLKSIVGSIAIFSLSAHIRVL 551
Query: 640 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 699
+ A + R DFS F RT++LCDQCE+E+HVGCL+++ + DL+ +P+ KWF
Sbjct: 552 HS--AYCFSPISDRSLDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLCDLKGIPQDKWF 609
Query: 700 CCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATP 758
CC DCSRI++ LQ+ + +P L+ I +KY + + +V WR+LSGK+
Sbjct: 610 CCSDCSRIHTALQSSASCGPQTIPTVLLDTISRKYREKGICIDNGDNVEWRMLSGKSRYA 669
Query: 759 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVS 818
E LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVS
Sbjct: 670 EHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVS 728
Query: 819 AGILRVFGQEVAELPLVATSKINHGK 844
A +LR+FGQ+VAELP+VATS+ G+
Sbjct: 729 AALLRIFGQKVAELPIVATSREYQGR 754
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 70/142 (49%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EGS L +KM KI P V +L TG+LDG V Y+ A L
Sbjct: 147 EHTWEGSPSNVASSTLGVKMLDKIDSTNFPSNVKKLLATGILDGARVKYLS--ISPAREL 204
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C++C+ +V+ +FE HA + + + +I ENG+ + +++ R
Sbjct: 205 QGIIHSGGYLCGCTVCDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVA 264
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
P +L+ ++ S E+ F
Sbjct: 265 PPDVLEEVIRKVAGSALSEEGF 286
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 169/393 (43%), Positives = 222/393 (56%), Gaps = 78/393 (19%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAHA
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 566 ---------------------------------------------DGGNLLPCDGCPRAF 580
+GG+LL C GCP+AF
Sbjct: 449 AGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAF 508
Query: 581 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 640
H C S+P+G WYC C + G S + T +
Sbjct: 509 HTACLKFQSMPEGTWYCSSCND---------------------GPTSCKIATASWLYTYF 547
Query: 641 NLEAEL----SGCLLC----RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 692
NL A + S L R DFS F RT++LCDQCE+E+HVGCL+++++ DL+
Sbjct: 548 NLNANILVLHSAYSLSPISDRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKG 607
Query: 693 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLL 751
+P+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY + + V WR+L
Sbjct: 608 IPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRML 667
Query: 752 SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILT 811
SGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 668 SGKSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLM 726
Query: 812 VNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
VNS VVSA +LR+FGQ+VAELP+VATS+ G+
Sbjct: 727 VNSLVVSAALLRIFGQKVAELPIVATSREYQGR 759
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQA--WKGSF 300
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 297 bits (761), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 160/387 (41%), Positives = 214/387 (55%), Gaps = 83/387 (21%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LH+L+F +GLPDGTE+ YY +++SPSQFEAHA
Sbjct: 388 RDNDLHRLLFMPNGLPDGTELAYYV--------------------KTQISPSQFEAHAGM 427
Query: 566 --------------------------------------------DGGNLLPCDGCPRAFH 581
DGG+LL C GCP+AFH
Sbjct: 428 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 487
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 638
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 488 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 535
Query: 639 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 698
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 536 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 595
Query: 699 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 757
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 596 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 655
Query: 758 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 817
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 656 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 714
Query: 818 SAGILRVFGQEVAELPLVATSKINHGK 844
SA +LR+FGQEVAELP+VATS+ G+
Sbjct: 715 SAALLRIFGQEVAELPIVATSREYQGR 741
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 151 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 207
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 208 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 267
Query: 330 SLPEE 334
+L EE
Sbjct: 268 ALNEE 272
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 174/465 (37%), Positives = 237/465 (50%), Gaps = 117/465 (25%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 301 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 358
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------- 565
HKL F LP+GT+VGYY G++LL+GY GI CHCCN+ VSPSQFEAHA
Sbjct: 359 HKLAFMSGVLPEGTDVGYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRK 418
Query: 566 ---------------------------------------DGGNLLPCDGCPRAFHKECAS 586
DGG LL CD CPRAFH+EC
Sbjct: 419 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 478
Query: 587 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 646
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 479 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 538
Query: 647 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 679
GC LC DFSK F RT+LLCDQ
Sbjct: 539 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 592
Query: 680 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 738
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 593 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 639
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 798
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 640 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 694
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
++VVSAG+ RV G E+AELPLVATS+ + G
Sbjct: 695 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQG 724
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 202
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 203 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 262
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 263 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 316
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 317 RRSKNESPGSKYCNS 331
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 161/363 (44%), Positives = 208/363 (57%), Gaps = 61/363 (16%)
Query: 535 QKLLEGYKNGLGIICHC--CN-SEVSPSQFEAHA-------------------------- 565
+ +L+G G I C C CN S+VSPS FEAHA
Sbjct: 326 KAVLKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMK 385
Query: 566 --------------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
GG++ PC CPR+FH C LS +P +WYC C N+ +
Sbjct: 386 ISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQ 444
Query: 606 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 665
+++ L + NA AGR +GVDS+EQI KR IRIV NL EL DF+ S F R
Sbjct: 445 KEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQK------DFNNSVFDER 498
Query: 666 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
T++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ A L E
Sbjct: 499 TVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAES 558
Query: 726 HLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 784
++ I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SG
Sbjct: 559 DIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSG 618
Query: 785 RDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
RDLIP MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK
Sbjct: 619 RDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKD 678
Query: 841 NHG 843
G
Sbjct: 679 LQG 681
Score = 40.4 bits (93), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 49/119 (41%), Gaps = 22/119 (18%)
Query: 248 IKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF-ENGKSLL 306
+K Q + L+G+I I C C CNG +V PS FE HA + RR F NG SL
Sbjct: 322 LKLQKAVLKGVIAGCNIRCFCLSCNGSKV-SPSAFEAHAGEGTRRKPYDNIFTSNGVSLH 380
Query: 307 EVLRACRSVPLPMLKATLQSALSSLPEE----------KSFACVRCKGTFPITCVGKTG 355
E+ +K + LS + F C C +F CVG +G
Sbjct: 381 EL----------SMKISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSG 429
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 139/263 (52%), Positives = 183/263 (69%), Gaps = 3/263 (1%)
Query: 583 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 642
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 643 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 702
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 64 ETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCST 123
Query: 703 DCSRINSVLQNLLVQEAEKLPEFHLNAIKK--YAGNSLETVSDIDVRWRLLSGKAATPET 760
DC RIN LQ+LL E + L+ IKK + +D+DVRWR+L K ++ ++
Sbjct: 124 DCVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDK-SSDDS 182
Query: 761 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 820
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYC +LTVN+ VVSAG
Sbjct: 183 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSAG 242
Query: 821 ILRVFGQEVAELPLVATSKINHG 843
+ R+ G E+AELPLVATS+ G
Sbjct: 243 LFRIMGSEIAELPLVATSRDRQG 265
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 157/359 (43%), Positives = 198/359 (55%), Gaps = 67/359 (18%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHA----------------------------- 565
QK+L GYK G GI+C CC E+SPSQFE+HA
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 566 -----------------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
DGG+L+ C CPRAFH C L P+G W+C C
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNK------ 130
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
L H N I R R+VK E ++ GC +CR DFS F RT++
Sbjct: 131 -LGHGGNFARP-----------IVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVI 178
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 728
LCDQCE+EFHVGCL++ + DL+E+PK WFCC DC+ I L+N + + +P LN
Sbjct: 179 LCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLN 238
Query: 729 AI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 786
I K L + DV+W++L GK+ E LLS A AIF +CFDPIV + +GRD
Sbjct: 239 IINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDPIV-AKTGRD 297
Query: 787 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKV 845
LIP MVYGRN+ GQEFGGMYC +LTV VVSAG+LR+FG+EVAELPLVAT++ + GKV
Sbjct: 298 LIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGKV 356
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 279 bits (714), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 145/230 (63%), Positives = 172/230 (74%), Gaps = 4/230 (1%)
Query: 150 GSEVSNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKK 209
G SN K+ K S +K K++ VEV V E E++S ++VE IAEGSALT PKK
Sbjct: 10 GEPNSNNRPKRVTK--SKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKK 67
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
NLELKMSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+
Sbjct: 68 NLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCA 127
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CNG RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S
Sbjct: 128 FCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAIS 187
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
LP E++F C RCKG FP CVGK GPLCN C +SK+ T+T + I
Sbjct: 188 GLPVERTFTCKRCKGIFPSICVGKI--GPLCNLCAESKESHPTLTIGSSI 235
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 161/396 (40%), Positives = 218/396 (55%), Gaps = 63/396 (15%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN--GLGIICHCC---------- 552
I +DQ LHKLVF E+ L DG VGY+ +K L+G N GI+C CC
Sbjct: 83 INYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQSGILCDCCKEVILEFFFC 142
Query: 553 -----NSEVSPSQFEAHA---------------DG------------------------- 567
+ +VSPS+FEAHA DG
Sbjct: 143 QIKKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAINHRISISNSDEHCSKCK 202
Query: 568 --GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNLL CDGC RAFH C + S P+ WYC+YC+N ++ + ++H N V ++
Sbjct: 203 QRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIES 262
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
D EQI K C VK+ E E S C LC F+ F P T+++CDQCE+++HVGCLK H
Sbjct: 263 DPSEQIAKICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLKDH 322
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI 744
MA+L+++PK WFC +DC I+ L+N + + L + L+ IK K LET +
Sbjct: 323 NMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGL 382
Query: 745 DVRWRLLSGKAATPE--TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-E 801
D++W++ + + + T LLS V IFH+ FD IV + + DLIP+MV GR ++ +
Sbjct: 383 DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKDKYY 442
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
FGGMYCA+L VN VVSAGI RVFG+EVAEL L+AT
Sbjct: 443 FGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIAT 478
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 269 bits (687), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 133/277 (48%), Positives = 188/277 (67%), Gaps = 6/277 (2%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG++ PC CPR+FH C L +P +W+C C + ++++ L + NA AGR +GVD
Sbjct: 32 GGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVD 91
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
S+EQI KR IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+
Sbjct: 92 SIEQILKRAIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEW 150
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDID 745
DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+ L ++ +
Sbjct: 151 QVDLKELPEGEWFCCDSCSEIRSSLDKMISGGAHPLSESDVDIIRKKHESKGLVMDANTE 210
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQE 801
+RW+L++G++AT + LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ+
Sbjct: 211 IRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQD 270
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
+ GMYCA+LTV S+VVSA +LRV G +VAELPLVATS
Sbjct: 271 YSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVATS 307
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 120/190 (63%), Positives = 154/190 (81%), Gaps = 1/190 (0%)
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 715
DFSKSGF RT+++CDQCERE+HVGCLK+HKMADL+ELP+G+WFC DC RI+S LQ LL
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 716 VQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 774
++ A+ LP ++ I+K ++ D+RW+LLSGK A E+RLLLS+AVAIFH+
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAESRLLLSKAVAIFHES 122
Query: 775 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 834
FDPIVD+ +GRDLIP+MVYGR +R Q++GG+YCA+LTV SSVVSAGILRV G E+AELPL
Sbjct: 123 FDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAELPL 182
Query: 835 VATSKINHGK 844
VATS+ + G+
Sbjct: 183 VATSREHQGQ 192
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 153/388 (39%), Positives = 201/388 (51%), Gaps = 85/388 (21%)
Query: 496 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 555
++ +KS IT KD+ LHKLVF E+ L DG VGY+ G+
Sbjct: 174 TYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGK-------------------- 213
Query: 556 VSPSQFEAHA---------------DG---------------------------GNLLPC 573
VSPS+FEAHA DG GNLL C
Sbjct: 214 VSPSKFEAHAGRASRRKPYSYIRTADGVSLHELANNRRISMSDSDERCSHCEQVGNLLWC 273
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
D C R+FH EC L S P+ YC+YC+N F + + ++H N V GR++ D EQIT+
Sbjct: 274 DRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITE 333
Query: 634 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 693
C K E + C LC DF+ + GPRT+++C QCE+EFHV CLK H MA+L EL
Sbjct: 334 VCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVEL 393
Query: 694 PKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSG 753
PK KWFC +DC I+ LQ L+ + +L +D++WRLL+
Sbjct: 394 PKDKWFCGIDCDDIHMKLQKLMARGEAEL--------------------GLDIKWRLLNT 433
Query: 754 KAATPETRL--LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAIL 810
K P+ + L+S+A AIFH+ F I D + DLI +M+YG + GQ F GMYCA+L
Sbjct: 434 KLNNPKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVL 493
Query: 811 TVNSSVVSAGILRVFGQEVAELPLVATS 838
+ AGI RV GQEVAELPLVAT+
Sbjct: 494 YFKKVIACAGIFRVLGQEVAELPLVATT 521
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 163/465 (35%), Positives = 223/465 (47%), Gaps = 137/465 (29%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 300 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 357
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA------- 565
HKL F LP+GT+VGYY G+ VSPSQFEAHA
Sbjct: 358 HKLAFMSGVLPEGTDVGYYVGGK--------------------VSPSQFEAHAGRAARRK 397
Query: 566 ---------------------------------------DGGNLLPCDGCPRAFHKECAS 586
DGG LL CD CPRAFH+EC
Sbjct: 398 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 457
Query: 587 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 646
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 458 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 517
Query: 647 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 679
GC LC DFSK F RT+LLCDQ
Sbjct: 518 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 571
Query: 680 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 738
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 572 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 618
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 798
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 619 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 673
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
++VVSAG+ RV G E+AELPLVATS+ + G
Sbjct: 674 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQG 703
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 201
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 202 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 261
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 262 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 315
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 316 RRSKNESPGSKYCNS 330
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 246 bits (628), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 169/280 (60%), Gaps = 40/280 (14%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
+DGG LL CD CPRAFH+EC ++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G
Sbjct: 249 SDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDG 308
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+D +EQI R IRI GC LCR DFSK F RT+LLCDQ
Sbjct: 309 IDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLLCDQ------------ 356
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 743
LP+G W+C DC RI+ L++LL + AE + + IK KY +L D
Sbjct: 357 -------ALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKALNKDGD 409
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 803
+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 410 LDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG--------- 459
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 843
++VVSAG+ RV G E+AELPLVATS+ + G
Sbjct: 460 ----------NTVVSAGLFRVMGSEIAELPLVATSRDSQG 489
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 6/143 (4%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
LRG+I+ GILCSCS C G V+ P FE+HA + S YI ENG +L ++LRAC
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 315 VPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
L ML++ +Q+A+ P++++F C CK +F GK LC+SC++SK Q + T
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-T 191
Query: 375 YTTGIRIRASGPKLVSRSSENDS 397
T+ I P +R S+N+S
Sbjct: 192 RTSKI---GRNPTSSARRSKNES 211
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 134/404 (33%), Positives = 197/404 (48%), Gaps = 71/404 (17%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS----QFEA 563
+D LHK +F GL +GTE+GYY Q L+G K G GI C CCN E S +FE
Sbjct: 487 RDAHLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQ 546
Query: 564 HA----------------DG--------------------------------------GN 569
HA DG G
Sbjct: 547 HAGCEARRNPYGNILVLVDGRSLKDVCKDLTHKNKLGEQQNCEPLARDVNCCYECSSSGE 606
Query: 570 LLPCDGCPRAFHKECASLSSI-PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 628
L C+GC A+ C + WYC+ C+N + + + V S+
Sbjct: 607 LKTCNGCEEAWCDNCTKGEEVDSDSKWYCRMCRNDTLK---VAQNGQKVSGKHQEESSSI 663
Query: 629 EQITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKH 685
+I +R R +++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK
Sbjct: 664 TEIDERG-RCIRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDS 722
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVS 742
M +L ELP+G+WFC C I+ +L L+ E L ++ + ++ +E
Sbjct: 723 GMDNLNELPEGEWFCQKGCKVIDEILTQLVAIGPESLSHSIISELPENRQQKSGVIEKAE 782
Query: 743 DID--VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 800
I W++L GK ++P L++AV IF +C DPI D+ +G++LIP MV R +
Sbjct: 783 SISPSFEWQILCGKGSSPANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDY 842
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+F G++C +L +N VVSA +L++FG+E AE+PLVATS + G+
Sbjct: 843 DFEGVFCVVLKLNGKVVSAALLQIFGREFAEVPLVATSLPHQGQ 886
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 69/177 (38%), Gaps = 29/177 (16%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C G ++
Sbjct: 286 LLKAPRNAKELMATRLMEGHHVRCSCRGIQ-----LTGMLKDMGVQCDCRNCRGSVIVSI 340
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRA------CRSVPLPMLKATL--------- 324
S FE H+ S I ENGK+L ++L A C L LK +
Sbjct: 341 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDVQGVEKS 400
Query: 325 --QSALSSLPEEKSF---ACVRCKGTFPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
+ A PEE RC CV P C C K+KKP ++
Sbjct: 401 KSKCAKCGNPEEGDLIYCKGARCSVVAHSGCVEIANPHLGDWFCGKCEKTKKPHASV 457
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 202 bits (513), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/198 (51%), Positives = 134/198 (67%), Gaps = 5/198 (2%)
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 710
LCR DF+ + F RT++LCDQCE+E+HVGCLK +L+ELP+G+WFCC CS S
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 711 LQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 769
L ++ A+ L E L I KK+ L + D++W+LLSGK T + +LLS AV
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRTTEDGSILLSAAVP 131
Query: 770 IFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVF 825
IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA +LRV
Sbjct: 132 IFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVM 191
Query: 826 GQEVAELPLVATSKINHG 843
G +VAELPLVATS+ G
Sbjct: 192 GGDVAELPLVATSQDVQG 209
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 197/402 (49%), Gaps = 79/402 (19%)
Query: 506 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 565
+ +D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E+S S FE HA
Sbjct: 738 SARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEISCSAFEQHA 797
Query: 566 D-------GGNLLPCDGCPRAFHKECASLS-----------------------SIPQG-- 593
G++L DG R+ C L+ S QG
Sbjct: 798 GCESRRNPYGSILLADG--RSLKDMCKELAYQSKLGDRAHQVARTGDVKSSSGSEEQGVL 855
Query: 594 --------DW------------------YCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
W CK CQ E A+ RV +
Sbjct: 856 ASSQRCESTWCINFGTRFSCQEADSGHPLCKICQKNVE-------GAHKTSKKRVDATAN 908
Query: 628 VEQI--TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
+ T R +R+ + ++ SGC +C+ K GF T+L+CDQC RE+HVGCL++
Sbjct: 909 IPATDDTGRNVRLFQAPDSS-SGCAICKKWTLKKCGFD-MTMLVCDQCGREYHVGCLRES 966
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETV- 741
+ D ELP+ +W+C +C I VL L+ E L + +N + +++ +E
Sbjct: 967 GILD--ELPEAEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESRQHQQGIVEMAE 1024
Query: 742 -SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 800
S W++L G P L+QAV IF +C DPI D+ SG+++IP MVY R +
Sbjct: 1025 SSSPVFGWQILHGAGENPVNGRTLAQAVEIFTECSDPIKDAPSGQNMIPIMVYSRRFKDY 1084
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 842
+F G+YC +LT+N VVS +L++FG+EVAE+PL+ATS ++H
Sbjct: 1085 DFDGIYCVVLTLNEKVVSTALLQIFGREVAEVPLIATS-VDH 1125
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRA--- 311
L GI++D G++C+C +C G +V+ S FE H+ S I ENGK+L ++L A
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 312 ---CRSVPLPMLKATLQSALSSLPEEKSFAC----------------VRCKGTFPITCVG 352
C L LK + + +P +K AC +C + CVG
Sbjct: 630 SADCGGDILGALKHAI-GEIQGIP-KKEGACGKCGKREGGDFVSCKEPKCSAVYHAECVG 687
Query: 353 KTGPGPLCNSCVKSKKPQGTMTYT 376
P + C K +K Q M T
Sbjct: 688 LPSPHRVDWFCAKCEKAQVKMPKT 711
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/201 (50%), Positives = 135/201 (67%), Gaps = 5/201 (2%)
Query: 648 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 707
G L R DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP+G+WFCC CS
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 708 NSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 766
S L ++ A+ L E L I KK+ L + D++W+LLSGK AT E +LLS
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKRATEEGSILLSA 174
Query: 767 AVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGIL 822
AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA ++
Sbjct: 175 AVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALM 234
Query: 823 RVFGQEVAELPLVATSKINHG 843
RV G +VAELPLVATS+ G
Sbjct: 235 RVMGGDVAELPLVATSQDVQG 255
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 195 bits (495), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 129/406 (31%), Positives = 193/406 (47%), Gaps = 81/406 (19%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D LHK +F GL DGTE+GYY Q L+G K G E++ +FE HA
Sbjct: 588 RDAHLHKALFLPGGLEDGTELGYYTKSQLKLKGVKRGEAF--KKVVVEINCYKFEQHAGC 645
Query: 566 --------------DG--------------------------------------GNLLPC 573
DG G L C
Sbjct: 646 EARRNPYGNILLVADGRSLKDVCKELAHKNKLGEKEKRVARAGKVNSCYECGTRGELKNC 705
Query: 574 DGCPRAFHKECA-SLSSIPQGDWYCKYCQN------MFERKRFLQHDANAVEAGRVSGVD 626
GC + C L + G WYC+ C+ E+KR +H + G+
Sbjct: 706 HGCVETWCNSCTKGLETDSDGKWYCRMCRQDTLNVAQIEQKRSNKH---------IEGMS 756
Query: 627 SVEQITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLK 683
++ + +R R V++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK
Sbjct: 757 NIAETDER-DRCVRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLK 815
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-----KKYAGNSL 738
+ DL ELP+G+WFC DC I+ +L L+ E L + ++ + ++
Sbjct: 816 DSGVDDLNELPEGEWFCQKDCKVIDEILTQLVANGPELLTDSIISELLESRQQQTGAKDK 875
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 798
S W++L GK+ L++A+ IF +C DPI D+ +G++LIP MV R +
Sbjct: 876 AESSCPSFAWQILCGKSGNTANTQTLAEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSK 935
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+F G++C +L +N VVSA +L++FG E+AE+PLVATS + G+
Sbjct: 936 DHDFEGVFCIVLKLNEKVVSAALLQIFGGEIAEVPLVATSLTHQGQ 981
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 75/178 (42%), Gaps = 31/178 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C ++
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVRCDCRNCKSSVIVSI 441
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 442 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 500
Query: 335 KSFACVR-----------CKGT-----FPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
C + CKG C+G + P C+ C K KKP T+
Sbjct: 501 WKVTCAKCWNSDEGDLIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATV 558
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 183 bits (464), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 98/260 (37%), Positives = 154/260 (59%), Gaps = 10/260 (3%)
Query: 592 QGDWYCKYC-QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
+G WYC+ C Q+ + + Q + + G + ++ EQ RCIR + E+ GC
Sbjct: 983 EGRWYCRMCRQDSLKVAQNGQKGSEKIMEGMSNIAETNEQ--GRCIRHLDGPR-EVGGCA 1039
Query: 651 LCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 709
+C+ + SK+GF TIL+CDQC RE+HV CLK + DL ELP G+WFC DC I+
Sbjct: 1040 ICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDCKVIDE 1099
Query: 710 VLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDIDVR--WRLLSGKAATPETRLLL 764
+L L+ E L ++ + ++ ++ E + + R W++L GK ++P L
Sbjct: 1100 ILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSPADVQTL 1159
Query: 765 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
++A IF DC DPI D +G++LIP MV R + +F G++C +L +N VVSA +L++
Sbjct: 1160 AEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVVSAALLQI 1219
Query: 825 FGQEVAELPLVATSKINHGK 844
FG+E+AE+PL+ATS +HG+
Sbjct: 1220 FGREIAEVPLIATSLSHHGQ 1239
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADG 567
+D LHK +F GL DGTE+GYY Q L+G K G GI C CCN E + + G
Sbjct: 779 RDAHLHKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNEET--RHVQGGSRG 836
Query: 568 GNLLP 572
G + P
Sbjct: 837 GLMFP 841
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 64/131 (48%), Gaps = 12/131 (9%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L + P EL T L++G V GI+ L G+++D G+ C+C C G ++
Sbjct: 563 LVRAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVQCNCRNCKGSVIVSI 617
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 618 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 676
Query: 335 KSFACVRCKGT 345
+ C +C+ +
Sbjct: 677 RKVTCAKCESS 687
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 176 bits (446), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 178/400 (44%), Gaps = 123/400 (30%)
Query: 565 ADGGNLLPCDGCPRAFHK-------------------------EC-ASLSSIPQGDWYCK 598
D G+L C GCP A+H+ +C S S G+++C
Sbjct: 1005 GDSGDLQLCTGCPNAYHQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCP 1064
Query: 599 YCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-ELSGCLLC 652
CQ +R+R + + G + + +++ RC R+++ EA L GC+ C
Sbjct: 1065 DCQEQRFGGTKDRRRSMTKRRSK---GAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCVFC 1121
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVG------------------------------CL 682
+ DF+K+GFGP+T LLCDQCERE+HVG C+
Sbjct: 1122 KSGDFAKTGFGPKTTLLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCV 1181
Query: 683 KKHKMAD------------------------LRELPKGKWFCCMDCSRINSVLQNLLVQE 718
++ D +ELP+G+WFC DC I+S+L L+
Sbjct: 1182 ATVRLRDCDRGCESQGIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNG 1241
Query: 719 AEKLPEFHLNAIKKYAGNSLETVSDI------DVRWRLLSGKAATPETRLLLSQAVAIFH 772
E L + ++ + + LE D W+LL G+ P L++AV IF
Sbjct: 1242 PEPLADSIISKVLRTNQARLERSEDATESSCSGFEWQLLHGRGGDPSNGKALAEAVQIFS 1301
Query: 773 ----------------------------DCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 804
+CFDPI D +SG DLIP MVY R+LR Q+FGG
Sbjct: 1302 VRNLSDPGFPVRTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGG 1361
Query: 805 MYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+YC +L ++ VVS ++RVFG+++AELPL+AT+ + G+
Sbjct: 1362 IYCVVLKYDNRVVSTALIRVFGRQLAELPLLATNPSHQGQ 1401
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 127/291 (43%), Gaps = 43/291 (14%)
Query: 58 VKRSRFSNSDDLLEDDVIDKR----------INSKIHEGRINKVVKNVL-------NENG 100
V SR + + L DD +D R N + E + ++ + V+ NG
Sbjct: 386 VAGSRNDATSNPLSDDGLDGREQDGNGGKGLTNGTVEEDKCTELFEPVVLLPEHSGGTNG 445
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKK 160
+ES V+ E+ E+ + G A SK FP+ G + + ++++
Sbjct: 446 FVESAVDSEDLSSDPAREDSLGNQELGGSAGAAVAS--SKDLEFPQHSGSEKRGDSMDRQ 503
Query: 161 C--LKRPSA----------MKPKVEPVEVLVTQSE---------GFGNESMSLIEVEAIA 199
+ PSA +K V P+++ + +G + S ++++
Sbjct: 504 DSDVATPSAATSAQSRSPGLKVGVVPLKIWLWAVGWLGGWAWLVSYGADGNSRLKLKGQG 563
Query: 200 EGSALTSPKKNLELK-MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGI 258
+ S SP N + K++ L + P + L ++GLLDG V YMG + L GI
Sbjct: 564 DVSMRQSPTLNGARGFVVKEVLLKEAPASAKLLLQSGLLDGHHVRYMG--RGGHIMLTGI 621
Query: 259 IRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVL 309
I++GG+LC CS C G +V+ S FE HA R S +I ENGK L ++L
Sbjct: 622 IQEGGVLCDCSSCKGVQVVNVSAFEKHAGSSARHPSDFIFLENGKCLKDIL 672
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 35/64 (54%), Gaps = 7/64 (10%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D TEVGYY GQK L G K G GI+
Sbjct: 784 ETASPPVLSRESSGAN-------LHKALFLPGGLEDDTEVGYYVKGQKSLAGVKKGAGIL 836
Query: 549 CHCC 552
C CC
Sbjct: 837 CSCC 840
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 173/631 (27%), Positives = 265/631 (41%), Gaps = 92/631 (14%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+R+ R+ + D + + + S S G +ST TS
Sbjct: 281 ------KRRSGRRFRQGDTGTSTPTFS-----------------GSPGKGGFGHSTD-TS 316
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
K ++ + ++ K T +P N S +KS+ + T +D LH L+F E G
Sbjct: 317 EKKGTEETHRLSLSSPVKIT---QRPLRNCS----IDSKSKESKT-RDTTLHPLIFKEDG 368
Query: 522 LPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGG-------NLLPC 573
L D T + Y G+ L +GYK G IIC+CCN E SPS FE HA G N+
Sbjct: 369 LADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNIYTL 428
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
+G + HK L + + N H+ + GR S +
Sbjct: 429 EGL--SLHKLALQLQDHLNPNGF----DNASVSSVSDYHNLTSSGCGREPSTTSGPIVPL 482
Query: 634 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKM 687
+ R ++ E C C + P TI+ C+QCER H+ C KK +
Sbjct: 483 K--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPL 540
Query: 688 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVR 747
L+E + CC +C + + L+ ++K G + ++
Sbjct: 541 EILKEYMCFHFLCCQECQSLRARLE---------------EGLEKCVGITFLRRIRSNIC 585
Query: 748 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 806
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 586 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 643
Query: 807 CAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
CA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 644 CALLTASTHVVSAAILKVRIEQIAELVLIAT 674
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 177/644 (27%), Positives = 270/644 (41%), Gaps = 83/644 (12%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP----GLIA 454
+ + R ++ S S K S K + + N+ P G+ +
Sbjct: 281 KRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDGVKS 340
Query: 455 NS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRWNITPK 508
+S T VT+ H S + + +S P + PL S +KS+ + T +
Sbjct: 341 DSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKSKESKT-R 390
Query: 509 DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADG 567
D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS FE HA
Sbjct: 391 DTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGM 450
Query: 568 G-------NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 620
G N+ +G + HK L + + N H+ + G
Sbjct: 451 GRRRQPYHNIYTLEGL--SLHKLALQLQDHLNPNGF----DNASVSSVSDYHNLTSSGCG 504
Query: 621 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 680
R S + + R ++ E C C + P TI+ C+QCER H+
Sbjct: 505 REPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIK 562
Query: 681 CL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 734
C KK + L+E + CC +C + + L+ ++K
Sbjct: 563 CYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE---------------EGLEKCV 607
Query: 735 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 794
G + ++ WRLLSG A+ + +L + Q + IF D F D S D+I MV G
Sbjct: 608 GITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNG 665
Query: 795 RN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 666 KNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIAT 709
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 168/349 (48%), Gaps = 57/349 (16%)
Query: 513 HKLVF--DE-SGLPDGTEVGYYAC-GQKLLEGY-------KNGLGIICHCCNSEVSPSQF 561
HK +F DE GL DG V Y G++LL+G GI+C CCN +S SQF
Sbjct: 445 HKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCACCNGVISCSQF 504
Query: 562 EAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 621
EAHA G+ RA + + + + C E + + H A+ A
Sbjct: 505 EAHAGRGSR-------RAPYDNIFTAAGVSLRKLAC--LMPASEAESPISHRPAALCAVA 555
Query: 622 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
E +T V A GC+LC+ DF + GFG RT+++CDQCERE+H+GC
Sbjct: 556 DRRALEPELVT------VSGEAALHGGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGC 609
Query: 682 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA-------IKKYA 734
L +H A L ELP+GK + L ++L+ HLN K A
Sbjct: 610 LAEHGRAHLTELPEGK-----------ASLYDILLT-------LHLNGEWHCSPECKGIA 651
Query: 735 GNSLETVSDIDV------RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 788
E VS + V W++L GK T T L A I + FDPI+D ++G DL+
Sbjct: 652 TRMRERVSSVPVPLQGEYSWQVLRGKDGTHATTWALKAAQEILTESFDPILDLVTGADLM 711
Query: 789 PSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+MVY + L ++ GMY A+L V + + RVFG+++AE+PLVAT
Sbjct: 712 MAMVYAQELGDWDYTGMYTAVLRRRGKAVCSAVFRVFGRQLAEVPLVAT 760
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 25/35 (71%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
DGG+LL CDGCP A H CA L +P+GDW+C C
Sbjct: 335 DGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDAC 369
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 73/115 (63%), Positives = 93/115 (80%)
Query: 730 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 789
+KK LE + +IDVRWRLL+G+ A+PET+ LL +AV++FH+CFDPIVD +GRDLIP
Sbjct: 1 MKKQEERCLEPLREIDVRWRLLNGRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIP 60
Query: 790 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+MV GRNLR Q+FGGMYCA+L VNSSV SA +LR+FG ++AELPL+AT N GK
Sbjct: 61 AMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGK 115
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 161/356 (45%), Gaps = 64/356 (17%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAD 566
+D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE HA
Sbjct: 170 RDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAG 229
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH---DANAVEAGRVS 623
G P + Y + + L H ++N VS
Sbjct: 230 MGK------------------RRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSNASVS 271
Query: 624 GVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PRTILL 669
+T + + + L+ + C C + + G P I+
Sbjct: 272 SFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHTELGKIDPNMIVF 331
Query: 670 CDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP 723
C+QCER HV C KK + L++ + CC +C + L+ + EK
Sbjct: 332 CNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGM-----EKCE 386
Query: 724 EF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 782
E L I+ ++ WRLLS A+ + +L LSQ + IF D F ++S
Sbjct: 387 EIAFLGRIRS------------NICWRLLSSADASRDVKLYLSQVIDIFKDAF---LEST 431
Query: 783 SGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
I MVYG+N G+ +F GMYC +LT ++ VVSA IL+V + AEL L+AT
Sbjct: 432 DAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIAT 487
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 126/420 (30%), Positives = 184/420 (43%), Gaps = 68/420 (16%)
Query: 447 SSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRW 503
SS G+ S + K ++ R + T S P +A PL S + SR
Sbjct: 76 SSDKGISGLSAGTSKEKKGTEETRSAQNTGDLGLISSSSSPVTSAQRPLPSSSVGSNSRE 135
Query: 504 NITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFE 562
+ +D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE
Sbjct: 136 S-KKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFE 194
Query: 563 AHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH---DANAVEA 619
HA G P + Y + + L H ++N
Sbjct: 195 KHAGMGK------------------RRQPYQNIYTSQGLTLHDVALQLHHLNLNSNGFSN 236
Query: 620 GRVSGVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRGCDFSKSGFG---PR 665
VS +T + + + L+ + C C + + G P
Sbjct: 237 ASVSSFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYFCGYGHTELGKIDPN 296
Query: 666 TILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 719
I+ C+QCER HV C KK + L++ + CC +C + L+ +
Sbjct: 297 MIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRVRLEGM----- 351
Query: 720 EKLPEF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 778
EK E L I+ ++ WRLLS A+ + +L LSQ + IF D F
Sbjct: 352 EKCEEIAFLGRIRS------------NICWRLLSSADASRDVKLYLSQVIDIFKDAF--- 396
Query: 779 VDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
++S I MVYG+N G+ +F GMYC +LT ++ VVSA IL+V + AEL L+AT
Sbjct: 397 LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVEHAAELVLIAT 456
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 101/139 (72%), Gaps = 4/139 (2%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGP 356
C RCKG FP +C+ + GP
Sbjct: 118 TCKRCKGRFPSSCMERVGP 136
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 85/207 (41%), Positives = 105/207 (50%), Gaps = 72/207 (34%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
P+K +T KD R+HKLVF+ LPDGT + YY G+ VS
Sbjct: 576 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGK--------------------VS 615
Query: 558 PSQFEAHA---------------------------------------------------D 566
PSQFEAHA D
Sbjct: 616 PSQFEAHAGCASRRKPGWYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICAD 675
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+LL CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D
Sbjct: 676 GGDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGID 734
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCR 653
++EQITKR IRIV L AE+ C+LCR
Sbjct: 735 ALEQITKRSIRIVDTLHAEVGVCVLCR 761
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 103/173 (59%), Gaps = 5/173 (2%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGII 259
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y G GL+G+I
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYARG--RPEKGLQGVI 402
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 403 QGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADA 462
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 463 LEAAIRNATGAGDARKSTVCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 513
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 73/198 (36%), Positives = 106/198 (53%), Gaps = 25/198 (12%)
Query: 647 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 706
+ C+LC DF + GF +T+L+CDQCE+E+H+GCL++HKM D++ +P+G+WFC +C R
Sbjct: 1510 AACVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVR 1569
Query: 707 INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 766
I L + +P GN RW++L GK +T LS
Sbjct: 1570 IRDALGEDVAAGEVLMP-----------GNPA-------YRWQILRGKNGRQQTWHALST 1611
Query: 767 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-------TVNSSVVSA 819
+ I + FDPI+D+ SG DL+P+MV +F GMY +L V A
Sbjct: 1612 VLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVLA 1671
Query: 820 GILRVFGQEVAELPLVAT 837
+ RV G +AE+PLVAT
Sbjct: 1672 ALFRVLGSSMAEMPLVAT 1689
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 148/310 (47%), Gaps = 33/310 (10%)
Query: 493 PPLSFPNKS-------RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
P L FP + + +D LH+L+F +GLPDGTE+ YY GQK+L GYK G
Sbjct: 407 PSLYFPGSATKQKKTAESGVRKRDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGN 466
Query: 546 GIICHCCNSEVSPSQFEAHADGG-------NLLPCDGCPRAFHKECASLS-----SIPQG 593
GI+C CC E+SPSQFE+HA ++ +G H SL+ + G
Sbjct: 467 GIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGL--TLHDIAISLANGQNITTGIG 524
Query: 594 DWYCKYC---QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
D C C ++ + + ++ G + R R+ K E + C+
Sbjct: 525 DDMCAECGDGGDLMWHVWIYRILLKVLGIVQIDGGNFARPTVIRLTRVGKIPEYNVGDCV 584
Query: 651 LCR------GCDFSKSGFGP-RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 703
+CR DF+ + ++L ++EFHVGCL++ + DL E+P+ WFCC D
Sbjct: 585 VCRLNLLKFLIDFTLANSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFCCQD 644
Query: 704 CSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAATPETR 761
C+ I L+N + +K+P LN I K L + DV+W++L GK+ E
Sbjct: 645 CNNIYVALRNSVSTGVQKIPASLLNIINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDL 704
Query: 762 LLLSQAVAIF 771
LLS A AIF
Sbjct: 705 SLLSGAAAIF 714
Score = 83.2 bits (204), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 204 ERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 260
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS CN +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 261 GCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 320
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 321 VAGSSINEEFF 331
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 144 bits (362), Expect = 2e-31, Method: Composition-based stats.
Identities = 81/220 (36%), Positives = 121/220 (55%), Gaps = 26/220 (11%)
Query: 626 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
DSV + + + L A+L S C+LC +F + GF +T+L+CDQCE+E+H+GCL+K
Sbjct: 1762 DSVTLTGRDEEHVSEALAADLASSCVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRK 1821
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
HKM D++ +P+G+WFC +C RI +L L +E E +GN
Sbjct: 1822 HKMVDMQAVPEGEWFCSDECVRIRELLTKSL-EEGE----------TTMSGNPA------ 1864
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 804
RW+ + G+ T T L + I + FDPI+D+ SG DL+P MV+ + +F G
Sbjct: 1865 -YRWQFIRGRDGTKATARALKTVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQG 1923
Query: 805 MYCAILTVNSS-------VVSAGILRVFGQEVAELPLVAT 837
MY +L + V AG++RV G +AE+PLVAT
Sbjct: 1924 MYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPLVAT 1963
Score = 40.8 bits (94), Expect = 3.2, Method: Composition-based stats.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 522 LPDGTEVGYYACGQKLLEGY------KNGLGIICHCCNSEVSPSQFEAHA 565
L DG V Y GQ+LL G G GI+C CC+ +S S FE+HA
Sbjct: 1535 LQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHA 1584
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 169/362 (46%), Gaps = 40/362 (11%)
Query: 493 PPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHC 551
P S KS+ + T +D LH L+F E GL D T + Y G+ L +GYK G GIIC+C
Sbjct: 91 PNYSIGTKSKESKT-RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNC 149
Query: 552 CNSEVSPSQFEAHADGG-------NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 604
C+ E +PS FE HA G N+ +G HK L + + + F
Sbjct: 150 CSQEFAPSHFEEHAGMGRRRQPYHNIYTPEG--STLHKLALQLQDHLNSNGFDNASVSSF 207
Query: 605 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 664
L +A GR S + + R ++ E C C + P
Sbjct: 208 SDYPNL---TSASGCGRQPSTTSGPIVPLK--RTLQGRVVETESCYFCGYGHTTIGNIDP 262
Query: 665 RTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 718
I+ C+QCER HV C KK + L+E ++ CC +C +L++ L +
Sbjct: 263 DMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQECQ----LLRDRLEEG 318
Query: 719 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 778
EK E +++ N + WRLLSG A+ + +L + Q + IF D F
Sbjct: 319 LEKCEEIAF--LRRIRSN---------ICWRLLSGMDASRDVKLFMPQVIDIFKDAFVES 367
Query: 779 VDSISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
D S D+ MV +N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 368 TDEHS--DIFSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRMEQIAELVLIAT 425
Query: 838 SK 839
+
Sbjct: 426 RR 427
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/463 (25%), Positives = 187/463 (40%), Gaps = 134/463 (28%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-- 565
+D L K +F GL DGTE+GYY Q L+G K G GI C CCN E+S +FE HA
Sbjct: 714 RDAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYEFEQHAGC 773
Query: 566 --------------DG-------------------------------------GNLLPCD 574
DG G L C
Sbjct: 774 EARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVCCYECSNSGELKRCH 833
Query: 575 GCPRAFHKECA-SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
C A+ +C + + +G WYC+ C+ + + Q+ + + G+ ++ + +
Sbjct: 834 SCEEAWCDKCTKGMETDSEGRWYCRMCRQ--DSLKVAQNGHKGTDK-IIEGMSNIAETDE 890
Query: 634 RCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHV----------- 679
+ R V++LE E+ GC +C+ + SK+GF TIL+CDQ +
Sbjct: 891 KG-RCVRHLEGPREVGGCAICKKWNLSKTGFVDGMTILVCDQVRSLNQMLPGTRITWKVN 949
Query: 680 -----GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 734
L + +ELP+G+WFC DC I+ +L L+ PE N+I
Sbjct: 950 GFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANG----PELLSNSIISEL 1005
Query: 735 GNSLETVSDIDVR---------WRLLSGKAATPETRLLLSQAVAIF-------------- 771
S + S + V+ W++L G+ + L++A IF
Sbjct: 1006 LESRQQQSSVKVKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFTSIDDINLPYLWLV 1065
Query: 772 ---------------------HDC-----FDPIVDSISGRDL----IPSMVYGRNLRGQE 801
C FDP S D+ + +V R + +
Sbjct: 1066 VGNYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNVALVVCSRRAKDHD 1125
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
F G++C +L +N VVSA +L++FG+E+AE+PL+ATS + G+
Sbjct: 1126 FEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQ 1168
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 69/158 (43%), Gaps = 13/158 (8%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL T L++G + + L G+++D G+ C+C C G ++ S
Sbjct: 515 LLKAPRNAKELMATKLMEG----HFVRCSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSIS 570
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EEK 335
FE H+ S I ENGK+L +VL A + +L+A L+ A+ + E+
Sbjct: 571 AFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADCGDNILRA-LKMAIGDIQGVEKS 629
Query: 336 SFACVRCKGTFP---ITCVGKTGPGPLCNSCVKSKKPQ 370
C C G+ I C G + CV S PQ
Sbjct: 630 KVTCAECGGSEEGDLIYCKGARCSVVSHSRCVGSANPQ 667
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 156/608 (25%), Positives = 239/608 (39%), Gaps = 82/608 (13%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPP 279
L+ V L TGLL+G V YM K + + G I G C CS CN ++
Sbjct: 62 LDSDLRDVRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNA 117
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
+FE H + + +I + G SL V+ A + L ML ++ + P +
Sbjct: 118 CEFEEHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN- 176
Query: 340 VRCKGTFP-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
+ K +F + V G C + G M Y+ ++ S VS S
Sbjct: 177 -KWKASFQKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSIS 227
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP- 450
N + S + + R ++ S S K S K + + N+ P
Sbjct: 228 NLNWSASKRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPL 287
Query: 451 ---GLIANS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKS 501
G+ ++S T VT+ H S + + +S P + PL S +KS
Sbjct: 288 SIDGVKSDSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKS 338
Query: 502 RWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
+ + T +D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS
Sbjct: 339 KESKT-RDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSH 397
Query: 561 FEAHADGG-------NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 613
FE HA G N+ +G + HK L + + N H+
Sbjct: 398 FEEHAGMGRRRQPYHNIYTLEGL--SLHKLALQLQDHLNPNGF----DNASVSSVSDYHN 451
Query: 614 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 673
+ GR S + + R ++ E C C + P TI+ C+QC
Sbjct: 452 LTSSGCGREPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQC 509
Query: 674 EREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 727
ER H+ C KK + L+E + CC +C + + L+
Sbjct: 510 ERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE--------------- 554
Query: 728 NAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDL 787
++K G + ++ WRLLSG A+ + +L + Q + IF D F D S D+
Sbjct: 555 EGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DI 612
Query: 788 IPSMVYGR 795
I MV G+
Sbjct: 613 ISDMVNGK 620
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 159/629 (25%), Positives = 255/629 (40%), Gaps = 97/629 (15%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
V L TGLL+G V Y K G I G C CS C ++ +FE H+
Sbjct: 246 VRGLLSTGLLEGFRVTY----KKNEVERIGRINGQGYSCGCSECGYRNIMNACEFEQHSG 301
Query: 288 KQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFP 347
+ + +I ++G SL V++ + L ML + +S P + + K +F
Sbjct: 302 ESSNNQNNHIFLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYE--KWKASFQ 359
Query: 348 IT-CVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNK 406
+ P + C + + T ++ S+ N S +++ + +
Sbjct: 360 LEKDYFDDAPS---DPCSTQSSQESNIALTDSLK----------DSTSNASSILNWSSFR 406
Query: 407 RERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSS 466
R + K R +T +P L+R+ + I++ + TS+ KS
Sbjct: 407 RRSDRQFK-----------------RGGTETSTPI-LSRSPEKE--ISDLSTSTSM-KSE 445
Query: 467 QSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK--DQRLHKLVFDESGLPD 524
++ + + +I P P + + S+ PK D LH ++F E GLPD
Sbjct: 446 ETPSENTAGLLTTDVTVIQDP----PPDHNVDSNSKDLGQPKVRDNTLHPMLFKEGGLPD 501
Query: 525 GTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHADGGNLLPCDGCPRAFH 581
T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H G
Sbjct: 502 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMG------------- 548
Query: 582 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV-SGVDSVEQITKRCI---- 636
+ SI D + + + + L + N E + SG T R I
Sbjct: 549 RRRQPYRSIYTSDGLTLH-ELALKLQDGLSSNVNIDELPTLTSGSGKEYSTTSRPIIVPL 607
Query: 637 -RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC----LKKHK--MAD 689
R ++ + C +CR I+ C+QCER HV C L+K K +
Sbjct: 608 KRTLQERVLTVESCYMCRKPHTVLGVISVDMIVFCNQCERALHVKCYNNGLQKPKAPLKV 667
Query: 690 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWR 749
L E + + CC C + + L +KK + ++ W+
Sbjct: 668 LGEYTQFNFMCCEKCQLLRASLH---------------EGLKKREDIAFLRRIRYNICWQ 712
Query: 750 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCA 808
LL+G + + Q + IF D F + D I +MV ++ G+ +F G+YCA
Sbjct: 713 LLNGTNMRSDVQ---HQVIEIFKDAFAET--APQDIDDIRNMVNSKDTTGEKDFRGIYCA 767
Query: 809 ILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+LT ++ VVSA IL+V +EVAEL L+AT
Sbjct: 768 VLTTSTFVVSAAILKVRTEEVAELVLIAT 796
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 115/277 (41%), Gaps = 60/277 (21%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
DGG+LL CD CP ++H +C L +IP+G+WYC C+
Sbjct: 686 DGGDLLLCDNCPSSYHHDCVGLEAIPEGNWYCPSCR------------------------ 721
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS--KSGFGPRTILLCDQCEREFHVGCLK 683
C +C D+ S F +TI+ CDQCERE+HVGC +
Sbjct: 722 -----------------------CSICNLSDYDPDTSQFTEKTIVYCDQCEREYHVGCTR 758
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
L P+G WFC CS + LQ L+ + E I K+ + D
Sbjct: 759 NSD-NQLICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEGVSWTILKFCSGNGSDHGD 817
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQE 801
D + L AV I H+CF I++ + D+ +V+ R LR
Sbjct: 818 YD--------DEIMADHYGKLCVAVGILHECFVTIIEPRTQSDISEDIVFNRESELRRLN 869
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
F G Y +L +S G R+ GQ+ AELPL+ TS
Sbjct: 870 FRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTS 906
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 121/275 (44%), Gaps = 61/275 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
DGG+L+ CD CP FHK C L IP+G+W+C C + +++ + R+
Sbjct: 630 DGGDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGICGENKFKYNVQEPKDSRL--- 686
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
L CDQCER++H+GCL+
Sbjct: 687 ------------------------------------------LSCDQCERKYHIGCLRNK 704
Query: 686 KMADL-RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ L R+ PK WFC C I LQ LL + P+ + K+ + D+
Sbjct: 705 GVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVGPDNLTWTLWKFMDSD---SCDV 761
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
+ +GK + L AV + H+CF+P ++ +GRD+ +++ R NL F
Sbjct: 762 EAP----TGKHSK------LDLAVEVIHECFEPATETYTGRDIAEDVIFSRECNLNRLNF 811
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N +++ +RVFG +VAE+PLV T
Sbjct: 812 RGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGT 846
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 119 bits (297), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 128/313 (40%), Gaps = 82/313 (26%)
Query: 552 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 611
C+ E S D G LL CD CP AFH C L + P+GDW C C+
Sbjct: 435 CSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------- 484
Query: 612 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTIL 668
C +C G D + GF +TI+
Sbjct: 485 -------------------------------------CGVCGGSDLDDDTAEGFTDKTII 507
Query: 669 LCDQCEREFHVGCLKKHKMAD------LRELPKGK--------WFCCMDCSRINSVLQNL 714
C+QCERE+HVGC+++ + R L + + W C +C + LQ L
Sbjct: 508 YCEQCEREYHVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQAL 567
Query: 715 LV-QEAEKLPEFHLNAI-------KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 766
+ A +P + A ++Y + TV+ I RW+ AA L
Sbjct: 568 VASSRARSIPHYSRGAYHSAPCGRRRY----MSTVTRI-TRWQHEEEDAADHGQ---LCA 619
Query: 767 AVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRV 824
A+ + H+CFD +V+ + DL +V+ + LR F G Y L +++ G LRV
Sbjct: 620 ALDVLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRV 679
Query: 825 FGQEVAELPLVAT 837
FG +VAELPLV T
Sbjct: 680 FGNQVAELPLVGT 692
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 121/283 (42%), Gaps = 62/283 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP +H C L +P+G+W+C C+
Sbjct: 103 GDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR----------------------- 139
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCL 682
C +C G +++ G F T+LLCDQCERE+HV CL
Sbjct: 140 ------------------------CAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCL 175
Query: 683 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 742
MA + P WFC C +I L+ LV + + E + + + L + S
Sbjct: 176 YSRGMAKMTSCPDDSWFCGDHCDKIFEGLRK-LVGISNTIGEGLSWTLLRSGEDDLPSAS 234
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRG 799
++ + E R L+ A+ + +CF P+VD + DL+ ++Y R +
Sbjct: 235 SMN--------REQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 286
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 842
F G Y +L + V+S +R+ G +AE+PL+ T + +H
Sbjct: 287 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGT-RFHH 328
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 121/283 (42%), Gaps = 62/283 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP +H C L +P+G+W+C C+
Sbjct: 128 GDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR----------------------- 164
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG--FGPRTILLCDQCEREFHVGCL 682
C +C G +++ G F T+LLCDQCERE+HV CL
Sbjct: 165 ------------------------CAICGGSEYNADGSSFNEMTVLLCDQCEREYHVSCL 200
Query: 683 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 742
MA + P WFC C +I L+ LV + + E + + + L + +
Sbjct: 201 YSRGMAKMTSCPDDSWFCGDHCDKIFQGLRK-LVGISNNIGEGLSWTLLRSGEDDLPSAN 259
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN---LRG 799
++ + E R L+ A+ + +CF P+VD + DL+ ++Y R +
Sbjct: 260 SMN--------REQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 311
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 842
F G Y +L + V+S +R+ G +AE+PL+ T + +H
Sbjct: 312 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGT-RFHH 353
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 119/274 (43%), Gaps = 61/274 (22%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP +FHK C L IP GDW+C C +R ++ G D
Sbjct: 746 GGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR------------KIDGDD 793
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
V Q+ L C QCE ++HV CL+ +
Sbjct: 794 EVGQL------------------------------------LPCIQCEHKYHVRCLE-NG 816
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDID 745
AD+ G WFC DC +I L LL + + +K +S E S
Sbjct: 817 AADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWTLVKFINPDSCEHDS--- 873
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFG 803
S E+ L+ A+++ H+CF+P+ +S++ RDL+ +++ R L F
Sbjct: 874 ------SKSDLLAESYSKLNLAISVMHECFEPLKESLTNRDLVEDVIFSRWSELNRLNFQ 927
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N ++S +RV+G++VAE+PLV T
Sbjct: 928 GFYTVLLERNEELISVATVRVYGKKVAEIPLVGT 961
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 116/277 (41%), Gaps = 64/277 (23%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP +FHK C L +P GDW+C C
Sbjct: 439 GGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSC-------------------------- 472
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
C +I G L R D S G +L C QCER++HV CL +
Sbjct: 473 --------CCKIC--------GQCLKRDSDLSMEDDG---VLDCTQCERKYHVVCLGNKR 513
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP----EFHLNAIKKYAGNSLETVS 742
L PK WFC C +I L LL +K+P +K N S
Sbjct: 514 EECLEYFPKEHWFCSKRCQQIFLGLHELL---GKKIPVGLHNLTWTLLKSIQFNDQCEAS 570
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQ 800
DI+ A E +L+ A+ + H+ FDP+ + + RDL+ +++ + L
Sbjct: 571 DIE----------ALSENYSMLNIALDMMHEFFDPVEEPHTKRDLLKDVIFSKRSELNRL 620
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L + +S +RV+G++VAE+PLV T
Sbjct: 621 NFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGT 657
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 114 bits (286), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/70 (74%), Positives = 63/70 (90%), Gaps = 1/70 (1%)
Query: 603 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 662
MF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGF
Sbjct: 1 MFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGF 59
Query: 663 GPRTILLCDQ 672
GPR I++CDQ
Sbjct: 60 GPRMIIMCDQ 69
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 127/307 (41%), Gaps = 83/307 (27%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 596
L EG + IC CN GG L+ CD CP A+HK C +L IP GDW+
Sbjct: 779 LFEGENDN---ICSVCNY------------GGELILCDQCPSAYHKNCLNLEGIPDGDWF 823
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C C+ C +C
Sbjct: 824 CPSCR-----------------------------------------------CGICGQNK 836
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 716
++ G L C QCE ++HV CL+ + D R K WFC +C R+ + LQNLL
Sbjct: 837 IEETEDG--HFLTCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLG 893
Query: 717 QEAEKLPEFHLNAIKKYAGNSLETV----SDIDVRWRLLSGKAATPETRLLLSQAVAIFH 772
+ + + KY + V SD+ V E LS A+++ H
Sbjct: 894 KPVLVGADNLTWTLVKYVNSETCGVGGAESDLVV------------ENYSKLSVALSVMH 941
Query: 773 DCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 830
+CF+P+ + S RD++ +++ + L F G Y +L N ++S +R+FG+++A
Sbjct: 942 ECFEPLHNPFSSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIA 1001
Query: 831 ELPLVAT 837
E+PLV T
Sbjct: 1002 EVPLVGT 1008
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 120/273 (43%), Gaps = 57/273 (20%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+L+ CD CP FH+ C L +P+GDW+C C R+ G +
Sbjct: 360 GGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSC------------------CCRICGEN 401
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
++ ++ N + FS C QCE ++HVGCL+K +
Sbjct: 402 RFDEYSEE-----DNFK-------------FS-----------CHQCELQYHVGCLRKQR 432
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 746
L P G FC C +I LL + +P N ++ D+DV
Sbjct: 433 HVKLETYPDGTRFCSTQCEKI---FLGLLKLLGKPIPVGVDNLTWTLLKPTISEWFDMDV 489
Query: 747 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 804
A E L+ A+ + H+CF+PI + +GRDL+ +++ G +L+ F G
Sbjct: 490 -----PDNKALTEVYSKLNIALNVMHECFEPIKEPHTGRDLVEDVIFCRGSDLKRLNFRG 544
Query: 805 MYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
Y +L N ++S +RV G++VAE+PLV T
Sbjct: 545 FYIVLLERNDELISVATIRVHGEKVAEVPLVGT 577
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 113/274 (41%), Gaps = 59/274 (21%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+L+ CD CP +FHK C L GDW+C C
Sbjct: 675 GGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSC-------------------------- 708
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
C C +C F + C QCER++HVGCL+K
Sbjct: 709 --------C-------------CGICGENKFDGGSEQDNVVFSCYQCERQYHVGCLRKWG 747
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDID 745
L P G WFC C +I LQ LL + + +K LE ID
Sbjct: 748 HVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLE----ID 803
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 803
L A E L+ A+ + H+CF+P+ + + RD++ +++ G +L F
Sbjct: 804 -----LPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRLNFQ 858
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N ++S +RV+G++VAE+PL+ T
Sbjct: 859 GFYTVLLERNDELISVATVRVYGEKVAEVPLIGT 892
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 115/299 (38%), Gaps = 74/299 (24%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
DGG+LL CD CP AFH C L + P+GDW+C C+
Sbjct: 512 DGGDLLLCDNCPSAFHHACVGLQATPEGDWFCPSCR------------------------ 547
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS-----GFGPRTILLCDQCEREFHVG 680
C +C G DF + GF +TI+ CDQCERE+HVG
Sbjct: 548 -----------------------CGVCGGSDFDATAAGGGGFTDKTIIYCDQCEREYHVG 584
Query: 681 CLKKHKMADL-----------RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 729
C+++ + E + W C +C + LQ L E+
Sbjct: 585 CVRRRGSEEEEESAAEWCRRPEEQEEWPWLCSPECGEVFRHLQGLAAVARERSIPIPTTV 644
Query: 730 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LSQAVAIFHDCFDPIVD 780
G SL + R + + + L A+ + H+CF +++
Sbjct: 645 PTTVEGVSLSILRRRRRRPISMVATGSGCQEEEEEEDAAEHGQLCSALDVLHECFVTLIE 704
Query: 781 SISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ DL +V+ R LR F G Y L +++ G LRV G EVAELPLV T
Sbjct: 705 PRTQTDLTADIVFNRESELRRLNFRGYYVVGLEKAGELITVGTLRVLGTEVAELPLVGT 763
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 151/346 (43%), Gaps = 50/346 (14%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAH 564
+D LH ++F E GLPD T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H
Sbjct: 584 RDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKH 643
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV-S 623
G + SI D + + + + L + N E + S
Sbjct: 644 VGMG-------------RRRQPYRSIYTSDGLTLH-ELALKLQDGLSSNVNIDELPTLTS 689
Query: 624 GVDSVEQITKRCI-----RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 678
G T R I R ++ + C +CR I+ C+QCER H
Sbjct: 690 GSGKEYSTTSRPIIVPLKRTLQERVLTVESCYMCRKPHTVLGVISVDMIVFCNQCERALH 749
Query: 679 VGC----LKKHK--MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKK 732
V C L+K K + L E + + CC C + + L +KK
Sbjct: 750 VKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLHE---------------GLKK 794
Query: 733 YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 792
+ ++ W+LL+G + + Q + IF D F D I +MV
Sbjct: 795 REDIAFLRRIRYNICWQLLNGTNMRSDVQ---HQVIEIFKDAFAETAPQ--DIDDIRNMV 849
Query: 793 YGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT
Sbjct: 850 NSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIAT 895
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC----RVIPPSKFEIHACKQYR 291
LL GV V Y + + L G + GG C+C GC +V+ +FE HA +
Sbjct: 31 LLQGVPVTYR--FEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGVTSK 88
Query: 292 RASQYICFENGKSLLEVLRACRSVP 316
+ +I NG+SL E+ R VP
Sbjct: 89 NQNGHIFLRNGRSLYELFHKLREVP 113
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 116/275 (42%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 810 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 863
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H GC +
Sbjct: 864 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 885
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 886 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 945
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 946 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 995
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L +++A +R+ G ++AE+P + T
Sbjct: 996 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGT 1030
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 97/172 (56%), Gaps = 3/172 (1%)
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK P + EL TG+L+G V Y+ K + + LRG+I+ GILCSCS C G +V+ P
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIM-RKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYY 315
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA + S YI ENG +L +VLRAC + L ML+ ++ A+ P+E+ F C
Sbjct: 316 FEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKS 375
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
CK +F GK C+SC++SK + ++ + R + S K+ +S
Sbjct: 376 CKSSFSTLRSGKF--ALFCDSCLESKGAKNNISSSKVGRSQTSSAKVYKSAS 425
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 57/81 (70%), Gaps = 2/81 (2%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
+K +++ASP + S +T KD+ +HK+VF LP+GT+VGYY G++LL+GY
Sbjct: 418 AKVYKSASP--GAKSSSVGRLTRKDKGMHKVVFMSGILPEGTDVGYYVGGKRLLDGYIKE 475
Query: 545 LGIICHCCNSEVSPSQFEAHA 565
LGI CHCC++ VSPSQFE HA
Sbjct: 476 LGIYCHCCSTVVSPSQFEGHA 496
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 115/275 (41%), Gaps = 57/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L +P DW C C + L HDA
Sbjct: 1036 GDGGNLICCDGCPSTFHMSCLELEELPSDDWRCTNCSCKLCHE-HLNHDA---------- 1084
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D+ E P + C QCE+++H C +
Sbjct: 1085 PDNAE--------------------------------IDP--LHSCSQCEKKYHPSCSPE 1110
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ G FC C + LQNLL E + PE+ I+ ++ ETV D+
Sbjct: 1111 TEKLSSVSSQAGNHFCQQSCRLLFEELQNLLAVEKDLGPEYACRIIQCIHEDAPETVLDL 1170
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1171 DGRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFLRLDF 1220
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y IL +VSA +R+ G ++AE+P + T
Sbjct: 1221 RGFYIFILERGDEIVSAASVRIHGTKLAEMPFIGT 1255
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 116/275 (42%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 418 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 471
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H GC +
Sbjct: 472 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 493
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 494 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 553
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 554 DER----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 603
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L +++A +R+ G ++AE+P + T
Sbjct: 604 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGT 638
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 116/275 (42%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 965 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 1018
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H GC +
Sbjct: 1019 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 1040
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 1041 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 1100
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1101 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 1150
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L +++A +R+ G ++AE+P + T
Sbjct: 1151 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGT 1185
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 116/275 (42%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 946 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 999
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H GC +
Sbjct: 1000 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 1021
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 1022 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 1081
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1082 DERV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 1131
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L +++A +R+ G ++AE+P + T
Sbjct: 1132 RGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGT 1166
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 117/278 (42%), Gaps = 56/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P DW C C F ++ Q +A ++
Sbjct: 281 GDGGNLICCDGCPSTFHMSCLELEALPSDDWRCAKCSCKFCQEHSRQ------DAQDIAE 334
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H GC +
Sbjct: 335 VDS--------------------------------------SLCTCSQCEEKYHPGCSPE 356
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + L+NLL + + PEF I++ N ETV +
Sbjct: 357 TTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVKKDLEPEFSCRIIQRIHENVPETVVAL 416
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 417 DER----------VECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFVRMDF 466
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G Y +L +++A +R+ G ++AE+P + T +
Sbjct: 467 HGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIGTRNM 504
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 125/462 (27%), Positives = 191/462 (41%), Gaps = 90/462 (19%)
Query: 421 VSKSSSKSVSLRNLLKTRSP-WELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSK 479
+ +S+S +S N R P W+ R + +T ++ +S + T S
Sbjct: 394 MKESTSNGISNLNWSAFRRPRWQYKRGGT-------ATSTQTLSRSPEKGISGLSTGTSM 446
Query: 480 KTVLISKPFENASPPL--------------SFPNKSRWNITPK--DQRLHKLVFDESGLP 523
K P EN + PL S KS+ + T K D LH+LVF E G+P
Sbjct: 447 KINTEETPSENTAGPLHSEVTIVQEPPRGHSVGPKSKESRTSKVRDNSLHQLVFKEGGVP 506
Query: 524 DGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHADGGNLLPCDGCPRAF 580
+ T + Y G+ L +GYK G I+C CC+ EV +PS FE HA G
Sbjct: 507 ELTILTYKLKHGEVLKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGK----------- 555
Query: 581 HKECASLSSIPQGDWYCKYCQNMFERKRFLQH--DANAVEAGRVSGVDSVEQITKRCI-- 636
P + Y + E LQ ++N + G D ++
Sbjct: 556 -------RRQPYRNIYTPEGLTLHELALKLQGGLNSNGNSSANFPGGDEPPNLSSGSSRE 608
Query: 637 -------------RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-- 681
R ++ + + C LC + I+ C+QCER HV C
Sbjct: 609 SSTTYRPSIVPLKRTLQQIADKTESCRLCGDACTTIGTISEDMIVFCNQCERPCHVKCYN 668
Query: 682 --LKKHK--MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNS 737
L+K K + L E + +FCC C + + L +L + EK+ + + Y
Sbjct: 669 NGLQKQKGPLNVLAEYMQFHFFCCQKCQLLRASLHEVL-NKREKIRQ-----KRSY---- 718
Query: 738 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNL 797
V W++L+G + + Q + IF F S G +I MV +++
Sbjct: 719 --------VFWQILNGMNPGINVQKYIHQVIEIFKVAFPKTAASDFG--VIQDMVNAKDV 768
Query: 798 RGQ-EFGGMYCAILTVNSS-VVSAGILRVFGQEVAELPLVAT 837
G+ +F GMYCA+LT +S VVSA +L+V +EVAEL +VAT
Sbjct: 769 GGEKDFRGMYCAVLTTSSKLVVSAAVLKVRTEEVAELVIVAT 810
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 113/274 (41%), Gaps = 62/274 (22%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+L+ CD CP +FHK C L ++P+GDW+C C
Sbjct: 930 GGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSC-------------------------- 963
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
C C +C F + C QCER+ CL+K
Sbjct: 964 --------C-------------CGICGENKFDGGSEQDNVVFSCYQCERQC---CLRKWG 999
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPEFHLNAIKKYAGNSLETVSDID 745
L P G WFC C +I LQ LL + + +K LE ID
Sbjct: 1000 HVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDNLTWTLLKPIRSKGLE----ID 1055
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 803
L A E L+ A+ + H+CF+P+ + + RD++ +++ G +L F
Sbjct: 1056 -----LPDIEALTEVYSKLNIALGVMHECFEPVKEPHTRRDVVEDVIFCRGSDLNRLNFQ 1110
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N ++S +RV+G++VAE+PL+ T
Sbjct: 1111 GFYTVLLERNDELISVATVRVYGEKVAEVPLIGT 1144
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 108/209 (51%), Gaps = 26/209 (12%)
Query: 637 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-------LKKHKMAD 689
R ++ + C CR D P TI C+QCER HV C +KK +
Sbjct: 63 RTLQERVVQTESCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 690 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWR 749
L+E ++ CC +C + + L+ V++ E++ L I+ ++ WR
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG--VEKGEEIA--FLRQIRS------------NICWR 166
Query: 750 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCA 808
LLS A+ + +L +SQA+ IF D F D+ S D+ MVYG+N G+ +F GMYC
Sbjct: 167 LLSKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGMYCV 224
Query: 809 ILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+LT ++ VVSA IL+V ++ AEL L+AT
Sbjct: 225 VLTASTHVVSAAILKVRVEQFAELVLIAT 253
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 119/296 (40%), Gaps = 102/296 (34%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D EVGYY GQK L G K G GI+
Sbjct: 840 EAASPPVPSRESSGAN-------LHKALFLPGGLEDDIEVGYYVKGQKFLAGLKRGAGIL 892
Query: 549 CHCCNS------------------------EVSPSQFEAHADGG-------NLLPCDGCP 577
C CC +S S FE HA G ++ DG
Sbjct: 893 CSCCQQVARNGMSSILMDSVVCGLTYEGERMISCSLFEQHAGWGSRRNPYTSIYLADG-- 950
Query: 578 RAFHKECASL---------SSIP--------------QGDW-YCKYCQNMFERKRFLQHD 613
R+ H SL + P +GD C C N + + + D
Sbjct: 951 RSLHDAAQSLVVEQTVKQEGNTPAKIEHLDQCVECGDRGDLQLCTRCPNAYHQDCLGKVD 1010
Query: 614 A---------------------------NAVEAGRVSGVDSVEQITKRCIRIVKNLEA-E 645
+ N G + S +++T RC R+++ EA
Sbjct: 1011 SYSSGEFFCPDCQEQRYGGTKDRRRSMVNRRSKGAAKTLLSKDRVTGRCTRLLQVPEAVV 1070
Query: 646 LSGCLLCRGCDFSKSGFGPRTILLCDQ----------CEREFHVGCLKKHKMADLR 691
L GC+ C+ DF+K+GFGP+T LLCDQ CERE+HVGCLKKH + DL+
Sbjct: 1071 LGGCVFCKSGDFAKTGFGPKTTLLCDQVSVGDMKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 690 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----- 744
+ELP+G+WFC DC I+S+L L+ E L + ++ + K LE D
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 745 -DVRWRLLSGKAATPETRLLLSQAVAIF 771
W+LL G+ P L++ V IF
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKALAEVVQIF 1289
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 56/146 (38%), Gaps = 53/146 (36%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
M K++ L + P + L ++GLLDG V Y+G + L GII++GG+LC CS C G
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLG--RGGHIMLTGIIQEGGVLCDCSSCKGV 634
Query: 275 ---------------------------------------------------RVIPPSKFE 283
+V+ S FE
Sbjct: 635 QVTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFE 694
Query: 284 IHACKQYRRASQYICFENGKSLLEVL 309
HA R S +I ENGK L ++L
Sbjct: 695 KHAGSSARHPSDFIFLENGKCLKDIL 720
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 135/357 (37%), Gaps = 114/357 (31%)
Query: 546 GIICHCCNSEVSPSQFEAHAD--------------------------------------- 566
GI+C+CC + S FEAHA
Sbjct: 418 GIVCNCCRVNFTVSGFEAHAGCTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEANS 477
Query: 567 -------------GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY-CQNMFERKRFLQH 612
GG+L+ CD CP AFH C L +P GDW+C C + R + Q
Sbjct: 478 EANNDNVCSICGFGGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTCCCKICYRPKCKQE 537
Query: 613 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 672
A+ E L+C Q
Sbjct: 538 CADGNE----------------------------------------------NNFLVCVQ 551
Query: 673 CEREFHVGCLKKHKMADLR---ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 729
CE++FH GC+K + + K WFC + C + L+ LL + + +
Sbjct: 552 CEQKFHFGCVKTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINWTL 611
Query: 730 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 789
+K VS D S + + + +L + A+ + ++ F+P +D++SGR+LI
Sbjct: 612 LK--------NVSSDDDGGDFTSNEFSQEKHKL--NAALGVLYEGFNPTIDALSGRELIK 661
Query: 790 SMVYGRNLRGQE--FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+V+ R+ + F G Y IL V+S +R+FGQ+VAE+ VAT + + G+
Sbjct: 662 DLVFSRDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGR 718
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 109/278 (39%), Gaps = 60/278 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H+ C S IP G+WYC C
Sbjct: 990 GDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCL----------------------- 1026
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C SK L C QCER++HV C+
Sbjct: 1027 ------------------------CDICGEVIDSKELVTSLPALDCSQCERQYHVKCVSA 1062
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
E G WFC C I ++ + + ++ + T +I
Sbjct: 1063 K--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI 1120
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-RNLRGQEFG 803
A E + L A +I +CF PI+D +G D+IPS++Y R+ +
Sbjct: 1121 ----------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYK 1170
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 841
G Y +L + S+VS +R+ G +AE+PLVATSK N
Sbjct: 1171 GFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKEN 1208
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 113/275 (41%), Gaps = 57/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L +P DW C C +F Q +N +A ++
Sbjct: 982 GDGGNLICCDGCPSTFHMSCLELEELPSDDWRCANCCC-----KFCQEHSND-DAPDIAE 1035
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS LC C QCE +H C +
Sbjct: 1036 VDS-----------------------LC----------------TCSQCEENYHPVCSPE 1056
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ G FC C + LQNLL + + PEF IK + ET +
Sbjct: 1057 TENPSSVPSQAGDLFCQQSCRLLFEELQNLLAVKKDLEPEFACRIIKCIHEDVPETALAL 1116
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PI+D +G +LI ++VY G N +F
Sbjct: 1117 DERV----------ECNSKIAVALSLMDECFLPIIDQRTGINLIRNVVYNCGSNFLRLDF 1166
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y IL +VSA +R+ G + AE+P + T
Sbjct: 1167 RGFYIFILERGDEIVSAASVRIHGTKCAEMPFIGT 1201
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 127/365 (34%), Gaps = 117/365 (32%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPC------DGCP---------- 577
G +L EG GI C CCN + FE HA LP DG P
Sbjct: 728 GPRLKEGAVTKDGIKCRCCNELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVELMG 787
Query: 578 -----------------------------------------RAFHKECASLSSIPQGDWY 596
+FH C L S P+G WY
Sbjct: 788 ENKPKESLHVRLKTNYSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWY 847
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C C+ C +C D
Sbjct: 848 CPSCR-----------------------------------------------CSICDSSD 860
Query: 657 F--SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 714
+ + F +TI+ CDQCERE+HVGC+ ++K L P+G WFC CS I LQ L
Sbjct: 861 YDPDTNKFTEKTIMYCDQCEREYHVGCM-RNKGDQLTCCPEGCWFCSRGCSEIFQHLQGL 919
Query: 715 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 774
+ + E I ++ + D E L A+ + H+C
Sbjct: 920 IGKSIPTPVEGLSCTILRFDRENASQHGDF--------YNEIIAEQYGKLCIALDVLHEC 971
Query: 775 FDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 832
F I++ + RDL +V+ R LR F G Y IL + ++S G RV G++ AEL
Sbjct: 972 FVTIIEPSTRRDLSEDIVFNRESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAEL 1031
Query: 833 PLVAT 837
PL+ T
Sbjct: 1032 PLIGT 1036
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 112/275 (40%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L +P W C C F +H + E +
Sbjct: 1009 GDGGNLICCDGCPSTFHMSCLGLEELPSDYWCCANCSCKF----CHEHSNDGAED--TAD 1062
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE ++H C +
Sbjct: 1063 VDS--------------------------------------SLHTCSQCEEQYHEACSPE 1084
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ G FC C + LQNLL + + PE+ +++ + E V +
Sbjct: 1085 NDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLPL 1144
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PIVD +G +LI ++VY G N +F
Sbjct: 1145 DTRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYSCGSNFARLDF 1194
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y IL +++A +R+ G ++AE+P + T
Sbjct: 1195 RGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGT 1229
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 115/275 (41%), Gaps = 60/275 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H C IP G WYC C+
Sbjct: 1008 GDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR----------------------- 1044
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C SK P C QCER++H+ C+
Sbjct: 1045 ------------------------CDVCGEVVSSKEPRTPLHAFECSQCERQYHIKCISG 1080
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + P G WFC C +I + L++ + +P+ HL+ ++ L D
Sbjct: 1081 KVLCNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPD-HLD--DGFSCTILHNNGDQ 1131
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEF 802
VR + + A E + L A++I +CF PI D +G D++P ++Y N ++
Sbjct: 1132 KVR--MAADIALLAECNMKLIIALSILEECFLPIFDPRTGMDIMPLILYNWRSNFVHLDY 1189
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L + S++S +R+ G VAE+PL+AT
Sbjct: 1190 KGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIAT 1224
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 116/285 (40%), Gaps = 82/285 (28%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP +FH+ C L +P+GDW+C C + L AN V+
Sbjct: 908 GGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICGQNKLSEHANIVD-------- 959
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
GP L C QCE ++HV CL+ K
Sbjct: 960 ------------------------------------GP--FLTCYQCECKYHVQCLRGTK 981
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 746
K WFC C +I LQ LL G S+ D ++
Sbjct: 982 --KFGSCSKPHWFCNKHCKQIYWGLQKLL-------------------GKSIPVGGD-NL 1019
Query: 747 RWRLLSGKAATP------------ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 794
W LL ++ E + L+ A+ + H+CF+P+ + + RD++ +++
Sbjct: 1020 TWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHECFEPVREQHTRRDIVEDVIFS 1079
Query: 795 R--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
R L+ F G Y +L N +++ +RV+G++VAE+PLV T
Sbjct: 1080 RRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGT 1124
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 115/278 (41%), Gaps = 60/278 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H+ C S IP G+WYC C G V
Sbjct: 16 GDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCLCDI--------------CGEV-- 59
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+DS E +T +L A L C QCER++HV C+
Sbjct: 60 IDSKELVT--------SLPA-----------------------LDCSQCERQYHVKCVSA 88
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
E G WFC C I ++ + + ++ + T +I
Sbjct: 89 K--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDKKVRTAEEI 146
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-RNLRGQEFG 803
A E + L A +I +CF PI+D +G D+IPS++Y R+ +
Sbjct: 147 ----------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWRSDLHFNYK 196
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 841
G Y +L + S+VS +R+ G +AE+PLVATSK N
Sbjct: 197 GFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKEN 234
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 111/291 (38%), Gaps = 90/291 (30%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFLQHDANAVEAGR 621
GG L+ CDGCP AFH C L +P GDW+ C C F + NA E
Sbjct: 712 GGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT----STNAKEEKF 767
Query: 622 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
+S C QCE ++H C
Sbjct: 768 IS----------------------------------------------CKQCELKYHPSC 781
Query: 682 LKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 740
L+ D L ++ KWFC DC I +L +L+ + E
Sbjct: 782 LRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE-------------------- 821
Query: 741 VSDIDVRWRLL------------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 788
VS + WRL+ S A E +LS A+ + H+ F+P+ GRDL
Sbjct: 822 VSVEKLTWRLVQSLEPNMYGDDASKIEAAAENHCILSVALDVMHELFEPVKRPHGGRDLA 881
Query: 789 PSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+++ R + F G Y +L N+ +VS +R+ G++VAE+P + T
Sbjct: 882 EDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 932
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 115/275 (41%), Gaps = 61/275 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 1038 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 1073
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1074 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 1110
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 1111 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 1158
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 1159 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1218
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L + ++VS +RV G VAE+PL+AT
Sbjct: 1219 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIAT 1253
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 115/275 (41%), Gaps = 61/275 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 877 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 912
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 913 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 949
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 950 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 997
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 998 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1057
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L + ++VS +RV G VAE+PL+AT
Sbjct: 1058 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIAT 1092
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 118/279 (42%), Gaps = 57/279 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + P GDW+C YC F G SG
Sbjct: 256 GDGGDLICCDGCPSTFHQSCLDIQKFPSGDWHCIYCSCKF--------------CGMFSG 301
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK- 683
+ +Q+ NL+ S +L C CE ++H C +
Sbjct: 302 --NTDQMN-------YNLDVNDSA------------------LLTCQLCEEKYHHMCTQG 334
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
+ + D P FC C + LQ LL + E F +++ E D
Sbjct: 335 EDSILDDSSSPS---FCGKTCRELFEQLQMLLGVKHELEDGFSWTLVQR-----TEVGFD 386
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
I L+G E L+ A++I +CF PIVD SG +LI +++Y G N
Sbjct: 387 IS-----LNGIPQKVECNSKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLN 441
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G + AIL ++SA +R+ G ++AE+P + T I
Sbjct: 442 YSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHI 480
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 115/275 (41%), Gaps = 61/275 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 974 GDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC------------------------ 1009
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ C +VK+ EA S F L C QCE ++H+ CLK+
Sbjct: 1010 ------TCRICGDLVKDREA--------------SSSF---LALKCSQCEHKYHMPCLKE 1046
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
++E+ FC +C I S LQ LL F + + L + D
Sbjct: 1047 KC---VKEVGGDARFCGENCQEIYSGLQGLL--------GFVNHIADGFTWTLLRCIHD- 1094
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
D + A E L+ A+ I +CF +VD +G D+IP ++Y R + F
Sbjct: 1095 DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDPRTGIDMIPHVLYNRGSDFARLNF 1154
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L + ++VS +RV G VAE+PL+AT
Sbjct: 1155 NGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIAT 1189
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 108/279 (38%), Gaps = 61/279 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H+ C IP G+WYC C
Sbjct: 979 GDGGELICCDNCPASYHQACLPCQDIPDGNWYCSSCL----------------------- 1015
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C SK L C QCER++HV C+
Sbjct: 1016 ------------------------CNICGEVITSKELRTSLPALECSQCERQYHVKCVSA 1051
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
E G WFC C +I + ++ + + ++ + T +I
Sbjct: 1052 K--VSCNEDGPGTWFCGRKCQQIYMIFRSRVGVPDHVDNDLSCTILRNNGDKKVRTAGEI 1109
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG- 803
A E + L A++I +CF PI+D +G D+IPS++Y F
Sbjct: 1110 ----------ALMAECNMKLMIALSIMEECFLPILDPRTGIDIIPSILYNWRSDFIHFNH 1159
Query: 804 -GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 841
G Y +L + S+VS +R+ G VAE+PLVATS N
Sbjct: 1160 KGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVATSTEN 1198
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 114/284 (40%), Gaps = 71/284 (25%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 632 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------------------------ 667
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 680
C+ CD + + G ++L C CER +H
Sbjct: 668 --------------------------TCKFCDAAVASGGKDGNSISLLSCGMCERRYHQL 701
Query: 681 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 738
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 702 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 753
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 796
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 754 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 808
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA LR G ++AE+P + T I
Sbjct: 809 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHI 852
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 114/284 (40%), Gaps = 71/284 (25%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 632 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNC------------------------ 667
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 680
C+ CD + + G ++L C CER +H
Sbjct: 668 --------------------------TCKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 701
Query: 681 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 738
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 702 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 753
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 796
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 754 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 808
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA LR G ++AE+P + T I
Sbjct: 809 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHI 852
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 140/357 (39%), Gaps = 79/357 (22%)
Query: 530 YYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 586
YY G L EG + GI C+CC S + F+AH G N+ C F S
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGNNI--CRPAENLFLGNGKS 106
Query: 587 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI-------TKRCIRIV 639
L S CQ RK+ + D A R +G S + ++ C V
Sbjct: 107 LVS----------CQVELMRKKIMMFDQGP--AVRAAGTGSRSKFRSLAPLGSENCNDYV 154
Query: 640 KNLEAELSGCLLCRGCD-----------------------------------FSKSG--F 662
++ G L+C CD F K F
Sbjct: 155 CSI-CHYGGDLIC--CDRCPSSFHAACLNIESVPEGDWFCPCCCCGICGDSQFDKMAEQF 211
Query: 663 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 722
++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 212 ADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILVG 267
Query: 723 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 782
IK + E D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 268 QNLTCTLIKPMQYQA-EDREDYDL--------AAMAENYSKLSVALEVMHECFDPVKDPK 318
Query: 783 SGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 319 TKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGT 375
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 115/278 (41%), Gaps = 59/278 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP +H+ C + +P GDW+C C F DA G+
Sbjct: 615 GDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPNCTCKF-------CDAAVASGGK--- 664
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
G L ++L C CER +H CL
Sbjct: 665 ----------------------DGNFL--------------SLLSCSMCERRYHQLCLSD 688
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
++ FC C + LQ L + E + + I + ++T SDI
Sbjct: 689 EAQK-VQSFGSASSFCGPKCLELFEKLQKYLGVKNEIEGGYSWSLIHR-----VDTDSDI 742
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
+ + LS + ++L + +AI +CF PIVD SG +LI +++Y G N +
Sbjct: 743 NSQ---LSAQRIENNSKLAV--GLAIMDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINY 797
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G Y AIL ++SA LR G ++AE+P + T I
Sbjct: 798 TGFYTAILERGDEIISAASLRFHGTQLAEMPFIGTRHI 835
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 142/355 (40%), Gaps = 75/355 (21%)
Query: 530 YYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 586
YY G L EG + GI C+CC S S F+AH G N+ C F S
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGNNI--CRPAENLFLGNGKS 106
Query: 587 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI-----TKRCIRIVKN 641
L S CQ RK+ ++ + V +G S ++ ++ C V +
Sbjct: 107 LVS----------CQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENCNDYVCS 156
Query: 642 LEAELSGCLLCRGCD-----------------------------------FSKSG--FGP 664
+ G L+C CD F K F
Sbjct: 157 I-CHYGGDLIC--CDRCPSSFHATCLNIERVPEGDWFCPCCCCGICGDSQFDKMAEQFAD 213
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 724
++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 214 DSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILVGHN 269
Query: 725 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 784
IK + + V D D+ AA E LS A+ + H+CFDP+ D +
Sbjct: 270 LTCTLIKPMQYQAEDRV-DYDL--------AAMAENYSKLSVALEVMHECFDPVKDPKTK 320
Query: 785 RDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 321 RDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGT 375
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 113/275 (41%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L +P W C C F +H ++ E
Sbjct: 1008 GDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCCANCSCKF----CHEHSSDGAE------ 1057
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D+ + D+S + C QCE ++H C +
Sbjct: 1058 -DTAD-------------------------VDYS--------LHTCSQCEEQYHEACSPE 1083
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
G FC C + LQNLL + + PE+ +++ + E V +
Sbjct: 1084 TDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLAL 1143
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D R E ++ A+++ +CF PI+D +G +LI ++VY G N +F
Sbjct: 1144 DKRV----------ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDF 1193
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y IL +++A +R+ G ++AE+P + T
Sbjct: 1194 RGFYIFILERGDEIIAAASVRIHGTKLAEMPFIGT 1228
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 118/275 (42%), Gaps = 54/275 (19%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CD CP FH+ C + P G W+C YC + +V G
Sbjct: 674 GDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------------------SCKVCG 715
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
Q+T + + EA + +LC+ CD CE ++H C++
Sbjct: 716 -----QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQM 752
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + + FC C ++ LQ LL + F I++ SD+
Sbjct: 753 NNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDV 801
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D L + A + L+ A+ + +CF P++D SG +LI +++Y G N F
Sbjct: 802 DSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNF 861
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y AIL + V+ A LR+ G E+AE+P + T
Sbjct: 862 SGFYTAILEKDDEVICAASLRIHGNELAEMPFIGT 896
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 118/275 (42%), Gaps = 54/275 (19%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CD CP FH+ C + P G W+C YC + +V G
Sbjct: 647 GDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC------------------SCKVCG 688
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
Q+T + + EA + +LC+ CD CE ++H C++
Sbjct: 689 -----QVTIGLHPMDDHHEA--AADVLCK----------------CDLCEEKYHPICVQM 725
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + + FC C ++ LQ LL + F I++ SD+
Sbjct: 726 NNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQDMKEGFSWTLIRR---------SDV 774
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D L + A + L+ A+ + +CF P++D SG +LI +++Y G N F
Sbjct: 775 DSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVIDHRSGINLIHNILYNCGSNFTRLNF 834
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y AIL + V+ A LR+ G E+AE+P + T
Sbjct: 835 SGFYTAILEKDDEVICAASLRIHGNELAEMPFIGT 869
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 102/208 (49%), Gaps = 24/208 (11%)
Query: 637 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADL 690
R ++ E C C + P TI+ C+QCER H+ C KK + L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 691 RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRL 750
+E + CC +C + + L+ ++K G + ++ WRL
Sbjct: 130 KEYMCFHFLCCQECQSLRARLEE---------------GLEKCVGITFLRRIRSNICWRL 174
Query: 751 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAI 809
LSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMYCA+
Sbjct: 175 LSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCAL 232
Query: 810 LTVNSSVVSAGILRVFGQEVAELPLVAT 837
LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 233 LTASTHVVSAAILKVRIEQIAELVLIAT 260
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 114/277 (41%), Gaps = 60/277 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + P GDW+C YC F G VSG
Sbjct: 572 GDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIYCCCKF--------------CGSVSG 617
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
S Q IV L L C CE ++H C++
Sbjct: 618 --SSNQRDDNDELIVSKL-------------------------LTCQLCEEKYHRSCIEA 650
Query: 685 H--KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 742
+ D R++ +FC C ++ L+ LL + E + I++
Sbjct: 651 NDANTDDSRDV----FFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRRS--------- 697
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQ 800
DV + K E L+ AV+I +CF P +D SG +LI S++Y R N
Sbjct: 698 --DVGFDASQIKPQMVECNSKLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGSNFNRL 755
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ G AIL ++SA +R+ G ++AE+P + T
Sbjct: 756 NYSGFVTAILERGDEIISAASIRIRGNQLAEMPFIGT 792
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 114/284 (40%), Gaps = 71/284 (25%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 114 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNCT----------------------- 150
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 680
C+ CD + + G ++L C CER +H
Sbjct: 151 ---------------------------CKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 183
Query: 681 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 738
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 184 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 235
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 796
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 236 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 290
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA LR G ++AE+P + T I
Sbjct: 291 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHI 334
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 114/284 (40%), Gaps = 71/284 (25%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP +H+ C + +P GDW+C C
Sbjct: 94 GDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPNCT----------------------- 130
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVG 680
C+ CD + + G ++L C CER +H
Sbjct: 131 ---------------------------CKFCDAAVASGGKDGNFISLLSCGMCERRYHQL 163
Query: 681 CL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSL 738
CL + HK ++ FC C + LQ L + E + + I + +
Sbjct: 164 CLNDEAHK---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----V 215
Query: 739 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 796
+T SD + + A E L+ +AI +CF PIVD SG DLI +++Y G N
Sbjct: 216 DTDSDTNSQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSN 270
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA LR G ++AE+P + T I
Sbjct: 271 FNRINYTGFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHI 314
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 114/275 (41%), Gaps = 56/275 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FHK C + P GDW+C YC F R+ G
Sbjct: 924 GDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYCCCKF---------------CRLVG 968
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
S + + F +L C CE +FH+ C++
Sbjct: 969 GSSNQSVVN--------------------------DEFTMPALLTCHLCEEKFHISCVEA 1002
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + K FC C ++ L+ LL + E F + I++ SD+
Sbjct: 1003 N--GGKTDDSKDALFCGNKCQELSERLEMLLGVKHEIEDGFSWSFIRR---------SDV 1051
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
L + + ++L + A++I ++CF P +D SG +L+ S++Y G N + ++
Sbjct: 1052 GCDLSLTNPQLVECNSKLAV--ALSIMNECFMPYIDHRSGTNLLRSILYNCGSNFKRLDY 1109
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G IL ++ +RV G +AE+P + T
Sbjct: 1110 SGFITVILERGDEIICVASIRVHGNRLAEMPYIGT 1144
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 111/277 (40%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H++C IP G WYC
Sbjct: 1051 GDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY-------------------------- 1084
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
RC+ C +C K L C QCER++H C+
Sbjct: 1085 ---------RCL------------CDICGEVINLKELRSSLPALECAQCERQYHAKCIYG 1123
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + P WFC C +I L++ + + F ++ + T +DI
Sbjct: 1124 KLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTVLRNNGDQRVSTAADI 1182
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 800
A E + L A++I +CF PI+D+ +G D+IP ++Y N R
Sbjct: 1183 ----------AILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILY--NWRSDFVHL 1230
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
++ G Y +L + ++S +R+ G VAE+PL+AT
Sbjct: 1231 DYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIAT 1267
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 99.4 bits (246), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 111/277 (40%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP ++H++C IP G WYC
Sbjct: 1000 GDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY-------------------------- 1033
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
RC+ C +C K L C QCER++H C+
Sbjct: 1034 ---------RCL------------CDICGEVINLKELRSSLPALECAQCERQYHAKCIYG 1072
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + P WFC C +I L++ + + F ++ + T +DI
Sbjct: 1073 KLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSCTVLRNNGDQRVSTAADI 1131
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 800
A E + L A++I +CF PI+D+ +G D+IP ++Y N R
Sbjct: 1132 ----------AILAECNMKLVIALSIMEECFLPIIDARTGIDIIPPILY--NWRSDFVHL 1179
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
++ G Y +L + ++S +R+ G VAE+PL+AT
Sbjct: 1180 DYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIAT 1216
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 108/273 (39%), Gaps = 82/273 (30%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP +FHK C + +P GDW+C C ++ G +
Sbjct: 708 GGELILCDHCPSSFHKRCLGMKDVPDGDWFCPSC------------------CCKICGQN 749
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
+++ TK I V N C QCE ++H+ CL
Sbjct: 750 KLKKDTKDFIDGVLN----------------------------CTQCEHQYHIMCLSNSW 781
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 746
++ PK FC C + + + KL F + +ET S + +
Sbjct: 782 TDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDAFDDETL-------VETYSKLKI 827
Query: 747 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGG 804
A+ + H+CF+PI + +GRDL+ +++ G L F G
Sbjct: 828 --------------------ALDVVHECFEPIEEPRTGRDLMKDVIFSNGSELNRLNFQG 867
Query: 805 MYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
Y +L N +VS +R+ G +VAE+PLV T
Sbjct: 868 FYTILLEKNDELVSVATVRIHGDKVAEIPLVGT 900
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 121/277 (43%), Gaps = 68/277 (24%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 364 GGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC-------------------------- 397
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
K C ++ + + ++S F S + C QCE+ H+GC+K +
Sbjct: 398 ----CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIGCVKSIQ 439
Query: 687 MADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + + + + WFC C I+ LQNLL ++ +P G++ E ++
Sbjct: 440 VLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDARENLT-- 484
Query: 745 DVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE- 801
W L+ + R L++A+ + H F P+ D I+ DLI + + +
Sbjct: 485 ---WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKRSESKRL 541
Query: 802 -FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y AIL ++VV+ +RV+G EVAE+PLVAT
Sbjct: 542 NFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVAT 578
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 121/277 (43%), Gaps = 68/277 (24%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 341 GGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC-------------------------- 374
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
K C ++ + + ++S F S + C QCE+ H+GC+K +
Sbjct: 375 ----CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQNVHIGCVKSIQ 416
Query: 687 MADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + + + + WFC C I+ LQNLL ++ +P G++ E ++
Sbjct: 417 VLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------VGDARENLT-- 461
Query: 745 DVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE- 801
W L+ + R L++A+ + H F P+ D I+ DLI + + +
Sbjct: 462 ---WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLSKRSESKRL 518
Query: 802 -FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y AIL ++VV+ +RV+G EVAE+PLVAT
Sbjct: 519 NFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVAT 555
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 112/288 (38%), Gaps = 82/288 (28%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
D G LL CD CP FH C L S PQGDW+C C
Sbjct: 419 DCGELLMCDRCPSMFHHACVGLESTPQGDWFCPACT------------------------ 454
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDF-------SKSGFGP-RTILLCDQCEREF 677
C +C D + GF R ++ C+QC RE+
Sbjct: 455 -----------------------CAICGSSDLDDPPATTTTQGFSSDRMVISCEQCRREY 491
Query: 678 HVGCLKK------HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 731
HVGC+++ + AD +G W C CS+I L+ L V +A
Sbjct: 492 HVGCMRERDNGLWYPEAD----GEGPWLCSEACSKIYLRLEELAVVQAP--------CRS 539
Query: 732 KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 791
+G SL V R + + E L A+ + +CF +++ + DL +
Sbjct: 540 VASGLSL-------VVLRRGAARDGEEEEHAKLCMALDVLRECFVTLIEPRTQTDLTADI 592
Query: 792 VYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 593 VFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 640
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 117/279 (41%), Gaps = 59/279 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + P GDW+C C + G
Sbjct: 711 GDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNCTCKY------------------CG 752
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-LK 683
V S++ +C+G + S S I C CE++FH C L+
Sbjct: 753 VASID---------------------ICQGDNTSVS-----EISTCILCEKKFHESCNLE 786
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
L FC C + LQ L + E F + I++ T D
Sbjct: 787 MDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------TSED 837
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
DV R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N
Sbjct: 838 SDVSVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLN 894
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA +R G ++AE+P + T I
Sbjct: 895 YSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHI 933
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/286 (25%), Positives = 114/286 (39%), Gaps = 86/286 (30%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH +C L ++P+GDW+C C
Sbjct: 135 GDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC------------------------ 170
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT---ILLCDQCEREFHVGC 681
C C +S + P IL CDQCERE+H C
Sbjct: 171 --------------------------CCASC--GRSLYDPTIQTEILYCDQCEREYHSNC 202
Query: 682 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKY----AG 735
+ M + FC C +I L+ L+ V + + + + L + Y
Sbjct: 203 VPGSAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSEEN 259
Query: 736 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 795
+ LE+V+D++ R L+ A+ + +CF P++D S D++ ++Y R
Sbjct: 260 SKLESVADLNTR----------------LALALTVIQECFRPMIDPRSNIDMVSHILYNR 303
Query: 796 NLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
RG+ +F G Y +L ++S +RV G AE+P + T
Sbjct: 304 --RGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGT 347
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 115/282 (40%), Gaps = 74/282 (26%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH C S+ +P+G+WYC C
Sbjct: 952 GDGGELICCDNCPSTFHHSCLSIQELPEGNWYCLNC------------------------ 987
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
T R + N E E+S L C QCE+++H CLK+
Sbjct: 988 -------TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQ 1024
Query: 685 HKMADLRELPKGK----WFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLE 739
R++ G WFC C +I + LQ+ L ++A G S
Sbjct: 1025 ------RDIDSGVESHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWT 1066
Query: 740 TVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--R 795
+ I ++LS A E L A+ I +CF +VD +G D+IP +VY
Sbjct: 1067 LLRCIHYDQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKS 1126
Query: 796 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ +F G Y IL + ++ +RV G E+AE+PL+AT
Sbjct: 1127 SFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIAT 1168
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 112/276 (40%), Gaps = 57/276 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P W C C F H+ ++ +A +
Sbjct: 1013 GDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNCSCKF------CHEHSSDDAEDTAD 1066
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
VDS ++ C QCE + C
Sbjct: 1067 VDS--------------------------------------SLHTCSQCEEQCTEACSPD 1088
Query: 685 -HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
+A G FC C + LQNLL + + PE+ +++ E V
Sbjct: 1089 IDSIATNLSSQTGNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLA 1148
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
+D R E ++ A+++ +CF PIVD +G +LI ++VY G N +
Sbjct: 1149 LDKRV----------ECNSKIAVALSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLD 1198
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y IL +++A +R+ G ++AE+P + T
Sbjct: 1199 FRGFYIIILERGDEIIAAASVRIHGTKLAEMPFIGT 1234
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 117/279 (41%), Gaps = 59/279 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + P GDW+C C + G
Sbjct: 593 GDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPNC------------------TCKYCG 634
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-LK 683
V S++ +C+G + S S I C CE++FH C L+
Sbjct: 635 VASID---------------------ICQGDNTSVS-----EISTCILCEKKFHESCNLE 668
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
L FC C + LQ L + E F + I++ T D
Sbjct: 669 MDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVKHELDAGFSWSLIRR-------TSED 719
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
DV R LS + E+ L+ A+ + +CF PIVD SG +LI +++Y G N
Sbjct: 720 SDVSVRGLSQRI---ESNSKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLN 776
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA +R G ++AE+P + T I
Sbjct: 777 YSGFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHI 815
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 113/278 (40%), Gaps = 66/278 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH C S+ +P+G+WYC C
Sbjct: 952 GDGGELICCDNCPSTFHHSCLSIQELPEGNWYCLNC------------------------ 987
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
T R + N E E+S L C QCE+++H CLK+
Sbjct: 988 -------TCRICGDLVNFE-EISS---------------SSDALKCFQCEQKYHGQCLKQ 1024
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVSD 743
+ E WFC C +I + LQ+ L ++A G S +
Sbjct: 1025 RDINSGVE--SHIWFCSGSCQKIYAALQS------------QLGLTNQFANGFSWTLLRC 1070
Query: 744 IDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRG 799
I ++LS A E L A+ I +CF +VD +G D+IP +VY +
Sbjct: 1071 IHYDQKILSTARLAMMAECNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPR 1130
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F G Y IL + ++ +RV G E+AE+PL+AT
Sbjct: 1131 LDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLIAT 1168
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/54 (81%), Positives = 49/54 (90%)
Query: 791 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
MVYGRN+RGQEFGGMYCA+L VNSSVVSA +LR+FG +VAELPLVATS NHGK
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGK 54
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 109/281 (38%), Gaps = 62/281 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF---ERKRFLQHDANAVEAGR 621
DGG+L+ CDGCP FH+ C + +P GDW+C C F + F+Q D V
Sbjct: 768 GDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCPNCTCKFCGIASEDFVQEDGTNVSE-- 825
Query: 622 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
+L C C +++H C
Sbjct: 826 ---------------------------------------------LLTCSLCAKKYHKSC 840
Query: 682 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 741
L+ + FC C + LQ L + E F + + +
Sbjct: 841 LQDVDAPCIDFNNSTPCFCGKTCRELFEQLQKYLGIKHELESGFSWSLVHRM-------- 892
Query: 742 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 799
DID+ L G E L+ A+++ +CF PIVD SG ++I +++Y G N
Sbjct: 893 -DIDLDMSL-QGLPQRVECNSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNR 950
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G Y AIL ++SA +R G ++AE+P + T +
Sbjct: 951 LNYSGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHV 991
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 96.3 bits (238), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 103/278 (37%), Gaps = 66/278 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
+GG L+ CD CP FH C S IP GDWYC C
Sbjct: 805 GEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----------------------- 841
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C K L C QCE ++H CL+
Sbjct: 842 ------------------------CRICGNLVIDKDTLDAHDSLQCSQCEHKYHEKCLED 877
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLETV 741
+ L WFC C + S LQ+ L+ Q A+ + L I + + +
Sbjct: 878 RDKQEGAILDT--WFCGQSCQEVYSGLQSQVGLVNQVADGISWTLLRCI--HDDQKVHSA 933
Query: 742 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 799
+W L T L+ A+ I +CF + D +G LIP ++Y G
Sbjct: 934 -----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHLIPQVLYNWGSEFAR 983
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L + ++S +RV G VAE+PL+AT
Sbjct: 984 LNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIAT 1021
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 95.9 bits (237), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 103/279 (36%), Gaps = 92/279 (32%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFLQHDANAVEAGR 621
GG L+ CDGCP AFH C L +P GDW+ C C F + NA E
Sbjct: 703 GGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT----STNAKEEKF 758
Query: 622 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
+S C QCE ++H C
Sbjct: 759 IS----------------------------------------------CKQCELKYHPSC 772
Query: 682 LKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 740
L+ D L ++ KWFC DC S+ N+ +A K+
Sbjct: 773 LRYDGACDSLDKILGEKWFCSKDCE--ESLEPNMYGDDASKI------------------ 812
Query: 741 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLR 798
A E +LS A+ + H+ F+P+ GRDL +++ R +
Sbjct: 813 --------------EAAAENHCILSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFK 858
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L N+ +VS +R+ G++VAE+P + T
Sbjct: 859 RLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 897
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 111/278 (39%), Gaps = 61/278 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + P GDW+C C F
Sbjct: 692 GDGGDLVCCDGCPSTFHQRCLDIRMFPLGDWHCPNCTCKF-------------------- 731
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +++++ + G T C CE+++H C+ K
Sbjct: 732 ----------CKAVIEDVTQTV----------------GANT---CKMCEKKYHKSCMPK 762
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ FC C ++ ++ + + E F + + + NS D+
Sbjct: 763 ANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHRECTNS-----DL 817
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
LSG E L+ A+ + +CF PI+D SG +++ +++Y G N F
Sbjct: 818 S-----LSGHPHIVENNSKLALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNRLNF 872
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
GG Y A+L +V++ +R G +AE+P + T +
Sbjct: 873 GGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHV 910
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 3/136 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
S + + KK +LK KK+ N P V L TG+LDGV V Y I + LRG+I+
Sbjct: 259 SGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKY---IAWSREELRGVIKG 315
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LC C CN +VI +FE HA + + + +I FENGK++ +++ RS+P ML
Sbjct: 316 SGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLF 375
Query: 322 ATLQSALSSLPEEKSF 337
+Q+ S +KSF
Sbjct: 376 EVIQTITGSPINQKSF 391
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 103/279 (36%), Gaps = 68/279 (24%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
+GG L+ CD CP FH C S IP GDWYC C
Sbjct: 805 GEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT----------------------- 841
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C K L C QCE ++H CL+
Sbjct: 842 ------------------------CRICGNLVIDKDTSDAHDSLQCSQCEHKYHEKCLED 877
Query: 685 HKMADLRELP-KGKWFCCMDCSRINSVLQN---LLVQEAEKLPEFHLNAIKKYAGNSLET 740
D +E+ WFC C + S LQ L+ Q A+ + L I + + +
Sbjct: 878 R---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISWTLLRCI--HDDQKVHS 932
Query: 741 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 798
+W L T L+ A+ I +CF + D +G +IP ++Y G
Sbjct: 933 A-----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGIHMIPQVLYNWGSEFA 982
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L ++S +RV G VAE+PL+AT
Sbjct: 983 RLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIAT 1021
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 110/278 (39%), Gaps = 56/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + +P GDW+C C F G
Sbjct: 94 GDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPNCSCKF------------------CG 135
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
V S + +F + +L C C +++H C+++
Sbjct: 136 VASDK--------------------------NFQRDDTTVSKLLTCSLCVKKYHKSCMQE 169
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ FC C + LQ L + E F + I + T +D
Sbjct: 170 INTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKHELEAGFSWSLIHR-------TDADS 222
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
D L G E L+ ++++ +CF PIVD SG +LI +++Y G N F
Sbjct: 223 DTS---LQGLPQRVECNSKLAVSLSVMDECFLPIVDRRSGINLIQNVLYNCGSNFNRLNF 279
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
GG Y IL ++SA +R G +AE+P + T +
Sbjct: 280 GGFYALILERGDEIISAASIRFHGTRLAEMPFIGTRHM 317
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 3/147 (2%)
Query: 191 SLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+LI I + T KK E K+SKK+ N P V L TG+LDGV V Y I +
Sbjct: 429 ALISTAQITASGSETVSKKKEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKY---IAW 485
Query: 251 QASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR 310
LRGII+ G LC C CN +VI +FE HA + + + +I FENGK++ +++
Sbjct: 486 SREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQ 545
Query: 311 ACRSVPLPMLKATLQSALSSLPEEKSF 337
+S P L +Q+ S +KSF
Sbjct: 546 ELKSTPQNSLFDVIQTITGSPINQKSF 572
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 109/279 (39%), Gaps = 57/279 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CD CP FH+ C + +P G W C YC F G G
Sbjct: 94 GDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYCSCKF--------------CGMAGG 139
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL-CDQCEREFHVGCLK 683
C ++ R LL C CE ++H C+
Sbjct: 140 ----------------------------DACQMDENDAAARPALLTCCLCEEKYHHSCIP 171
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
+ + FC C ++ LQ LL + E F ++++ + SD
Sbjct: 172 AEDT--INDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAWTVVRRF-----DVGSD 224
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
I LSG E ++ A+ I +CF P+ D SG +LI ++VY G N
Sbjct: 225 I-----TLSGMHRKVECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLN 279
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+ G AIL V+SA +R+ G ++AE+P + T +
Sbjct: 280 YCGFLTAILERGDEVISAASIRIHGNQLAEMPFIGTRHM 318
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y I LRGII+ G LC
Sbjct: 306 KNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---ISLSRKELRGIIKGSGYLCG 362
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +ICFENGK++ ++++ R+ P ML +Q+
Sbjct: 363 CQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTV 422
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 423 FGAPINQKSF 432
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
+ + S KN ELK SKKI N P V L TGLLDGV+V Y+ + + LRG I+
Sbjct: 401 TTIDSASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKT--LRGTIKG 458
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LCSC +C G +V+ +FE HA + + + +I FENGK++ V++ ++ P ML
Sbjct: 459 TGYLCSCKVC-GNKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLF 517
Query: 322 ATLQSALSSLPEEKSFACVRC 342
+++ S +K+F +
Sbjct: 518 NAIETVTGSAINQKNFLSWKA 538
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 92.4 bits (228), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/54 (79%), Positives = 48/54 (88%)
Query: 791 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGK
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGK 54
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 106/288 (36%), Gaps = 83/288 (28%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 965 GDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT----------------------- 1001
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C G K I C QC +H C+++
Sbjct: 1002 ------------------------CQVCGGPFSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1037
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
K+ L + WFC C I ++ + G + + D
Sbjct: 1038 EKLP-LEDQISQTWFCGKYCKEI-------------------FIGLRSHVGT--DNILDS 1075
Query: 745 DVRWRLL----SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 792
D+ W +L G+ A E + L+ A+ + +CF +VD +G D+IP ++
Sbjct: 1076 DLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVL 1135
Query: 793 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
Y G N ++ G Y IL ++ +RV G + AELP +ATS
Sbjct: 1136 YNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 1183
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 59/87 (67%), Gaps = 2/87 (2%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG++ PC CPR+FH C LS +P +WYC C+N+ ++++ L + NA AGR +GVD
Sbjct: 24 GGDIFPCKICPRSFHPACVGLSKVPS-EWYCDNCRNLVQKEKALAENKNAKAAGRQAGVD 82
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCR 653
S+EQI KR IRIV + +L GC LC+
Sbjct: 83 SIEQIMKRAIRIVP-ISDDLGGCALCK 108
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 110/278 (39%), Gaps = 57/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C ++ +P GDW+C C F G G
Sbjct: 506 GDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADG 551
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
++ + T +EL C LC E+++H C++
Sbjct: 552 SNAEDDTTV----------SELVTCSLC---------------------EKKYHTSCIQG 580
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + LQ + + E F + I + + SD
Sbjct: 581 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDT 635
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
VR G E+ L+ A+ + +CF IVD S +LI +++Y R N +
Sbjct: 636 SVR-----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNY 690
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G Y AIL ++ A +R+ G ++AE+P + T I
Sbjct: 691 SGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHI 728
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 112/278 (40%), Gaps = 57/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + +P G+W C C F G SG
Sbjct: 717 GDGGDLICCDGCPSTFHQSCLDIQMLPPGEWRCMNCTCKF--------------CGIASG 762
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ + S C+L +C+ CE+++H C K+
Sbjct: 763 TSEKD---------------DASVCVL----------------HICNLCEKKYHDSCTKE 791
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC +C ++ L+ L + E F + I + T D
Sbjct: 792 MDTLPNNINSSSLSFCGKECKELSEHLKKYLGTKHELESGFSWSLIHR-------TDDDS 844
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
+ R +S + E L+ + + +CF P++D SG +LI +++Y G N +
Sbjct: 845 EAACRGISQRV---ECNSKLAITLTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLSY 901
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G Y AIL +++A +R G ++AE+P + T I
Sbjct: 902 SGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHI 939
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 110/278 (39%), Gaps = 57/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C ++ +P GDW+C C F G G
Sbjct: 674 GDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADG 719
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
++ + T +EL C LC E+++H C++
Sbjct: 720 SNAEDDTTV----------SELVTCSLC---------------------EKKYHTSCIQG 748
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
FC C + LQ + + E F + I + + SD
Sbjct: 749 VDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHR-----TDPGSDT 803
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEF 802
VR G E+ L+ A+ + +CF IVD S +LI +++Y R N +
Sbjct: 804 SVR-----GFPQRVESNSKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNY 858
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G Y AIL ++ A +R+ G ++AE+P + T I
Sbjct: 859 SGFYTAILERGDEIICAASIRIHGTQLAEMPFIGTRHI 896
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +KK + N P V L TG+LDGV V Y + LRG
Sbjct: 302 VAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKY---VSVSREELRG 358
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 359 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 418
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 419 SLLFDTIQTVFGAPINQKAF 438
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 105/277 (37%), Gaps = 61/277 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H C S +P+G WYC C
Sbjct: 989 GDGGELLCCDNCPSTYHPACLSAKELPEGSWYCHNCT----------------------- 1025
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C G K I C QC +H C+++
Sbjct: 1026 ------------------------CQICGGPVSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1061
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 743
K+ L + WFC C I L++ + E E + ++ G L +V
Sbjct: 1062 EKLP-LEDQISQTWFCGKYCKEIFIGLRSHVGTENILDSELSWSILRCNNDGQKLHSVQK 1120
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
I A E + L+ A+ + +CF +VD +G D+IP ++Y G N +
Sbjct: 1121 I----------ACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVD 1170
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
+ G Y IL ++ +RV G + AELP +ATS
Sbjct: 1171 YQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 1207
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 100/277 (36%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH+ C S +P+G WYC C
Sbjct: 873 GDGGELICCDNCPSTFHQACLSTEELPEGSWYCPNCT----------------------- 909
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C K C QCE ++H C K
Sbjct: 910 ------------------------CWICGELVNDKEDINSSNAFKCSQCEHKYHDSCWKN 945
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + WFC C + LQ+ + +N I +L
Sbjct: 946 KTIG--KGGASDTWFCGGSCQAVYFGLQSRVGI---------INHIADGVCWTLLKCIHE 994
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ---- 800
D + A E L+ A+ I +CF +VD +G D+IP ++Y N R +
Sbjct: 995 DQKVHSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHVLY--NWRSEFARL 1052
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L + ++S +R+ G VAE+PL+AT
Sbjct: 1053 NFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLIAT 1089
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 86/172 (50%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYGQETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVTY---VSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 111/277 (40%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 623
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 683
LS C LC E ++H C+
Sbjct: 713 ----------------------LSSCRLC---------------------EEKYHQACIN 729
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVS 742
+ FC C + LQ L + LPE F + ++++ S V+
Sbjct: 730 QDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVA 784
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 785 DCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRL 836
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F A+L +++ +R+ G ++AE+P + T
Sbjct: 837 DFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGT 873
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +K + N P V L TG+LDGV V Y I LRG
Sbjct: 341 VAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKY---ISVSREELRG 397
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 458 SLLFDTIQTVFGAPIHQKAF 477
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 111/277 (40%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 623
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 504 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 559
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 683
LS C LC E ++H C+
Sbjct: 560 ----------------------LSSCRLC---------------------EEKYHQACIN 576
Query: 684 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVS 742
+ FC C + LQ L + LPE F + ++++ S V+
Sbjct: 577 QDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EVA 631
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 632 DCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHRL 683
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F A+L +++ +R+ G ++AE+P + T
Sbjct: 684 DFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGT 720
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 107/274 (39%), Gaps = 56/274 (20%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CDGCP AFH C L +P GDW+C+ C + FL+ + + +
Sbjct: 712 GGKLILCDGCPSAFHANCLGLEEVPDGDWFCESCCCGACGQFFLKATSKYAKEEKF---- 767
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
+ C+ C+ ++H CL+
Sbjct: 768 -----------------------ISCKQCEL------------------KYHPSCLRYDG 786
Query: 687 MAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 745
D L KWFC DC I L L+ K E + + S E D
Sbjct: 787 AGDSLDTFLGEKWFCSKDCEEIFVNLCELI----GKPREVGVEKLTWRLVQSFEPNMYGD 842
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFG 803
+++ A E LS A+ + H+ F+P+ GRDL +++ R + F
Sbjct: 843 DAYKI----EAVAENHCKLSVALDVMHELFEPVKRPHGGRDLAEDVIFSRWSKFKRLNFS 898
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N +V+ +R+ G++VAE+P + T
Sbjct: 899 GFYTVLLERNEELVTVATVRILGKKVAEMPFIGT 932
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y + LRGII+ G LC
Sbjct: 243 KNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---VSLSREELRGIIKGSGYLCG 299
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P ML +Q+
Sbjct: 300 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTV 359
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 360 FGAPINQKSF 369
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENG+++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 89.7 bits (221), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 38/194 (19%)
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNL 714
D + GF IL CDQC R+FHV C + + L R+ WFC C + S LQ+L
Sbjct: 40 DVQQDGF----ILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHL 95
Query: 715 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LS 765
L G S+ +D ++ W LL K P+ L L
Sbjct: 96 L-------------------GKSVPVGTD-NLTWTLL--KRVEPDCFDLEVLSANNSKLK 133
Query: 766 QAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 823
A+ + H+CF+P D+ +G+DL+ +++ G NL F G Y +L N+ + + +R
Sbjct: 134 LALEVMHECFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVR 193
Query: 824 VFGQEVAELPLVAT 837
VFG +VAE+P VAT
Sbjct: 194 VFGDKVAEVPFVAT 207
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 110/278 (39%), Gaps = 55/278 (19%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CD CP FH+ C + P G W C YC F ++G
Sbjct: 94 GDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYCSCKF---------------CGMAG 138
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D+ + D + + P +L C CE ++H C+
Sbjct: 139 GDTCQM-------------------------DENDTAAQP-ALLACCLCEEKYHHSCILA 172
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + FC C + LQ LL + E F ++++ + SDI
Sbjct: 173 ENTVN--DGYSSVSFCGKKCQELYDKLQALLGVKHEMEEGFAWTLVRRF-----DVGSDI 225
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
LSG E ++ A+ I +CF P+ D SG +LI ++VY G N +
Sbjct: 226 S-----LSGMHRKVECNSKVAVALHIMDECFLPMPDHRSGVNLIRNIVYNFGSNFNRLNY 280
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G AIL ++SA +R+ G +AE+P + T +
Sbjct: 281 SGFLTAILERGDEIISAASIRIHGNHLAEMPFIGTRHM 318
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K E KMSKK + N P V L TG+LDGV V Y + L GII+ G LC
Sbjct: 265 KNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGYLCG 321
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +Q+
Sbjct: 322 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 381
Query: 328 LSSLPEEKSF 337
S +KSF
Sbjct: 382 TGSPINQKSF 391
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 72/133 (54%), Gaps = 3/133 (2%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
++ K E KMSKK + N P V L TG+LDGV V Y + L GII+ G
Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGY 388
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +
Sbjct: 389 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAI 448
Query: 325 QSALSSLPEEKSF 337
Q+ S +KSF
Sbjct: 449 QTVTGSPINQKSF 461
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 89.4 bits (220), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 82/159 (51%), Gaps = 3/159 (1%)
Query: 179 VTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLD 238
V Q E +++ + +A+ + + E+K +K + N P V L TG+LD
Sbjct: 251 VQQKEFDASDAHATASNTRVAKSKTESVSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLD 310
Query: 239 GVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYIC 298
GV V Y I LRG+I+ G LCSC CN +V+ +FE HA + + + +I
Sbjct: 311 GVPVKY---IALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIY 367
Query: 299 FENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
FENGK++ ++++ RS P ML +Q+ + +KSF
Sbjct: 368 FENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSF 406
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 109/280 (38%), Gaps = 67/280 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CDGCP FH+ C + +P G+W+C C F
Sbjct: 732 GDGGDLICCDGCPSTFHQSCLDIQMLPLGEWHCPNCTCKF-------------------- 771
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C + G + K + +C+ CE+++H C K
Sbjct: 772 ------------------------CGIASG-NSEKDDASVYVLQICNLCEKKYHDSCTK- 805
Query: 685 HKMADLRELPKG-----KWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLE 739
++ LP FC +C ++ L+ L + E F + I + +S
Sbjct: 806 ----EMDNLPNNINTSSLSFCGKECKELSEHLKKYLGTKHELEAGFSWSLIHRIDEDSEA 861
Query: 740 TVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNL 797
I R E L+ A+ + +CF P++D SG +LI +++Y G N
Sbjct: 862 ACRGISQR----------VECNSKLAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNF 911
Query: 798 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ G Y A L ++++ +R G ++AE+P + T
Sbjct: 912 SRLNYSGFYTATLERGDEIIASASIRFHGTQIAEMPFIGT 951
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 115/278 (41%), Gaps = 56/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 623
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCL 682
LS C LC C S P T+ C + G +
Sbjct: 713 ----------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTV 744
Query: 683 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 741
+ D FC C + LQ L + LPE F + ++++ S V
Sbjct: 745 PGERSTDS--------FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EV 793
Query: 742 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 799
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 794 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHR 845
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F A+L +++ +R+ G ++AE+P + T
Sbjct: 846 LDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGT 883
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 115/278 (41%), Gaps = 56/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 623
DGG+L+ CDGCP FH+ C + P G WYC C F E+ +H+ + + +
Sbjct: 657 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEKDEAAKHETSTLPS---- 712
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQCEREFHVGCL 682
LS C LC C S P T+ C + G +
Sbjct: 713 ----------------------LSSCRLCEEKC----SKHYPHTLADHQACINQ--DGTV 744
Query: 683 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 741
+ D FC C + LQ L + LPE F + ++++ S V
Sbjct: 745 PGERSTDS--------FCGKYCQELFEELQ-LFIGVKHPLPEGFSWSFLRRFELPS--EV 793
Query: 742 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 799
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY G N
Sbjct: 794 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFGSNFHR 845
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F A+L +++ +R+ G ++AE+P + T
Sbjct: 846 LDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGT 883
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 428 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 485
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 546 GSTINQKNF 554
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 112/282 (39%), Gaps = 75/282 (26%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
ADGGNL+ CD CP +H C + PQG+W C C
Sbjct: 270 ADGGNLICCDKCPSTYHISCLQMEDEPQGEWRCPAC------------------------ 305
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C C F S + C QC++++H C ++
Sbjct: 306 -----------------------ACKFCHTHAFDIS------VFTCSQCDKKYHWECFRE 336
Query: 685 HK--MADLR-ELPKGKW-FCCMDCSRINSVLQNLL---VQEAEKLPEFHLNAIKKYAGNS 737
++ + DL + P FC CS+I L+ L+ + E L L + AG
Sbjct: 337 NEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDEGLTWTLLRRMDPEAGVY 396
Query: 738 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 795
LE D R L ++ AVA+ +CF+P++D + +++ S++Y G
Sbjct: 397 LEESYD-----RTLCNSK--------IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGA 443
Query: 796 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
N F G Y AIL +S +R+ G ++AE+P +AT
Sbjct: 444 NFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIAT 485
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 113/279 (40%), Gaps = 71/279 (25%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
ADGG+L+ C+ C H +C L IPQGDW C YC
Sbjct: 217 ADGGDLICCEKCWSTSHLKCMGLERIPQGDWICPYC------------------------ 252
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ K C + K+L+ C QC++++H CL
Sbjct: 253 ------VCKHCNKNDKDLQT-------------------------CVQCDKKYHCQCLVS 281
Query: 685 HKMADLRELPKGKWFCC-MDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSD 743
+K DL G+ C C + LQ+L+ + E F +++ ++L+ D
Sbjct: 282 NKELDLN--ASGETLACDSHCGEVYEKLQSLVGVKHELEGGFCWTLLQRMEPDNLD-FKD 338
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQE 801
+ + E ++ A + +CF I+D + +++ S+ Y R NL
Sbjct: 339 LHL----------ITECNSKIALAWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRIN 388
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
F G Y AIL N ++SA +RV G ++AE+P + T +
Sbjct: 389 FRGFYTAILEKNDDIISAATIRVHGTDLAEMPFIGTRHL 427
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 74/140 (52%), Gaps = 7/140 (5%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM-------GGIKFQASGLRG 257
++ K E KMSKK + N P V L TG+LDGV V Y+ G I L G
Sbjct: 443 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHG 502
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 503 IIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 562
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L +Q+ S +KSF
Sbjct: 563 SLLFBAIQTVTGSPINQKSF 582
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 114/278 (41%), Gaps = 56/278 (20%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF-ERKRFLQHDANAVEAGRVS 623
DGG+L+ CDGCP FH+ C + P G WYC C F E+ HD +A+ +
Sbjct: 754 GDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEKVEAAIHDTSALHS---- 809
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLC-RGCDFSKSGFGPRTILLCDQCEREFHVGCL 682
LS C LC C S P T+ C + G +
Sbjct: 810 ----------------------LSSCRLCEEKC----SNHYPHTLADHQACINQ--DGTV 841
Query: 683 KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETV 741
+ D FC C + LQ LL+ LPE F + ++++ S V
Sbjct: 842 PGERSTDS--------FCGKYCQELFEELQ-LLIGVKHPLPEGFSWSFLRRFELPS--EV 890
Query: 742 SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRG 799
+D D+ ++ ++ A ++ +CF P+VD SG +L+ ++VY N
Sbjct: 891 ADCDISEKIAYNAK--------MAVAFSVMDECFSPLVDHRSGVNLLQNIVYNFWSNFHR 942
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F A+L +++ +R+ G ++AE+P + T
Sbjct: 943 LNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGT 980
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 102/277 (36%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG L+ CD CP FH+ C +P+G WYC C
Sbjct: 226 GDGGELICCDNCPSTFHQACLCTEDLPEGSWYCPNCT----------------------- 262
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C K C QCE ++H C +
Sbjct: 263 ------------------------CWICGDLVNDKEASSSVGAYKCLQCEHKYHGACQQG 298
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+ + L WFC C + S L + + N I G + I
Sbjct: 299 KQTHE--GLVSDAWFCSGSCQEVYSGLHSRVGIN---------NPIAD--GFCWTLLRCI 345
Query: 745 DVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
++LS + A E L+ A+ I +CF +VD +G D+IP +Y G +
Sbjct: 346 HEDQKVLSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGIDMIPHALYNWGSDFARL 405
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y +L + +VSA +RV G VAE+PL+AT
Sbjct: 406 NFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIAT 442
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 417 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 474
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 475 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 534
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 535 QEMLFDAIQNVTGSPINQKNF 555
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 396 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 453
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 454 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 513
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 514 QEMLFDAIQNVTGSPINQKNF 534
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 72/134 (53%), Gaps = 2/134 (1%)
Query: 204 LTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGG 263
+ S KN ELK SKK+ N P V L TGLLDGV V Y+ + + L GII+ G
Sbjct: 432 IDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT--LEGIIKGTG 489
Query: 264 ILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKAT 323
LC C C + + +FE HA + + + +I FENGK++ V++ ++ P +L
Sbjct: 490 YLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNA 549
Query: 324 LQSALSSLPEEKSF 337
+Q+ S +K+F
Sbjct: 550 IQTVTGSHINQKNF 563
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V+Y+ +A L+G+I I C C CNG + +
Sbjct: 403 LTKHPGNIRELLNTGLLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGSKAVSAY 461
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + S C+
Sbjct: 462 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCL 521
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYT 376
C P+ ++ LC C++SK+PQ +T +
Sbjct: 522 NCNE--PVLPSSQSE-NVLCQVCLESKQPQDPLTAS 554
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 68/137 (49%), Gaps = 30/137 (21%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNSEVS----PSQF--EAHA---------DGGNLLPCDGC 576
+ G++ ++GY I C+ CN VS S F AH G N PC G
Sbjct: 583 SAGKRKVDGYIKDQRIYCNHCNRVVSLFSHLSYFFRLAHQHLKLMRVRDQGAN--PCVG- 639
Query: 577 PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCI 636
L +P +WYC C N+ ++++ L + NA AGR +GVDS+EQI KR I
Sbjct: 640 ----------LRKVP-SEWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAI 688
Query: 637 RIVKNLEAELSGCLLCR 653
RIV + +L GC LC+
Sbjct: 689 RIVP-ISDDLGGCALCK 704
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 106/290 (36%), Gaps = 89/290 (30%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CD C FH +C + +P GDWYC+ C
Sbjct: 741 GDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSC------------------------ 775
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCEREFHVGCL 682
LCR C F K P +L C QC R++H C
Sbjct: 776 --------------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCS 809
Query: 683 KKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 740
+P FC C +I L LL IK +
Sbjct: 810 SGTGTDFDCTIPGTSIDCFCSPGCRKIYKRLNKLL-------------GIKNHM------ 850
Query: 741 VSDIDVRWRLL----SGKAATPETRLLLSQ-------AVAIFHDCFDPIVDSISGRDLIP 789
+ W L+ + +A P+ + ++Q A + +CF P +D SG ++I
Sbjct: 851 --EAGFSWSLVHCFPNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHIDERSGINMIH 908
Query: 790 SMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
++ Y G + +F G Y IL V+SA +R+ G ++AE+P + T
Sbjct: 909 NVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGT 958
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 429 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 486
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ ML +Q+
Sbjct: 487 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVT 546
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 547 GSTINQKNF 555
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 199 AEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGLLDGVSVVYMGGIKFQAS 253
A S +T K ++ LK + + K+ P V L TG+LDGV V Y+ + +
Sbjct: 296 ASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE-- 353
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
LRGII+ G LC C CN +++ +FE HA + + + +I FENGK++ ++++ R
Sbjct: 354 -LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELR 412
Query: 314 SVPLPMLKATLQSALSSLPEEKSF 337
S P +L T+Q+ + +KSF
Sbjct: 413 STPESLLFDTIQTIFGAPINQKSF 436
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 199 AEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGLLDGVSVVYMGGIKFQAS 253
A S +T K ++ LK + + K+ P V L TG+LDGV V Y+ + +
Sbjct: 302 ASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE-- 359
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
LRGII+ G LC C CN +++ +FE HA + + + +I FENGK++ ++++ R
Sbjct: 360 -LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELR 418
Query: 314 SVPLPMLKATLQSALSSLPEEKSF 337
S P +L T+Q+ + +KSF
Sbjct: 419 STPESLLFDTIQTIFGAPINQKSF 442
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK S N P V L TG+ DG+ V Y + L+G+I+ G LCSC
Sbjct: 386 KNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYC--TWSREKNLQGVIKGTGYLCSC 443
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
+C G + + +FE HA + + + +I FENGKS+ V++ ++ P ML +Q+
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 329 SSLPEEKSF 337
+ +++F
Sbjct: 504 GATINQRNF 512
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/279 (24%), Positives = 104/279 (37%), Gaps = 67/279 (24%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+L+ CD C FH +C + +P GDWYC+ C
Sbjct: 761 GDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSC------------------------ 795
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPRTILLCDQCEREFHVGCL 682
LCR C F K P +L C QC R++H C
Sbjct: 796 --------------------------LCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCS 829
Query: 683 KKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLET 740
+P FC C +I L LL + F + + +A +
Sbjct: 830 SGTGTDSGCTMPGTSIDCFCSPGCRKIYKRLNKLLGIKNHMEAGFSWSLVHCFANDQAMP 889
Query: 741 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 798
+ + K A ++ L A + +CF P +D SG ++I ++ Y G +
Sbjct: 890 NKNKE--------KLAQCNSKTAL--AFTVLDECFQPHIDDRSGINMIHNVAYNCGSDFS 939
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F G Y IL V++A +R+ G ++AE+P + T
Sbjct: 940 RLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIGT 978
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 104/281 (37%), Gaps = 67/281 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S+ +P WYC C
Sbjct: 662 GDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHNCI----------------------- 698
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK---SGFGPRTILLCDQCEREFHVGC 681
C +C GC ++ S F I+ C QC H C
Sbjct: 699 ------------------------CRIC-GCPVTEKEISSFS--AIIKCLQCGAAHHDTC 731
Query: 682 LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLET 740
++ A E+ +WFC C I L + E+ ++ G + +
Sbjct: 732 VEMGATA-FEEMDSDEWFCGTHCKEIYLGLHGCVGVESSLGDGLSWTILRCNSGGQKMHS 790
Query: 741 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 798
V I A E L+ A+ + +CF +VD+ +G ++IP ++Y G
Sbjct: 791 VQKI----------AHAIECNSKLAVALTLMEECFAQMVDTRTGINMIPHVLYNQGSKYA 840
Query: 799 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 839
+ G Y IL ++ A +RV G + AELP +AT +
Sbjct: 841 RLNYQGFYTVILEKGEEILCAASIRVHGMKAAELPFIATCR 881
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN + + SKK++ N P V L TG+LDGV V Y+ + L+G+I+ G LC C
Sbjct: 339 KNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSR--EKNLKGLIKGAGYLCGC 396
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 397 QECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTVT 456
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 457 GSPINQKNF 465
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 82.8 bits (203), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 110/277 (39%), Gaps = 83/277 (29%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+LL CDGCP AFH C LSS+P+ D + F + S V+
Sbjct: 263 GGDLLLCDGCPSAFHHTCLGLSSLPEEDLW------------FCPCCCCDICGSMESPVN 310
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
S +L C C+ R FH+ CLK+
Sbjct: 311 S-----------------KLMACEQCQ---------------------RRFHLKCLKEEP 332
Query: 687 -MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 745
+ R WFC C+R++S L+NL+ G + ++ D
Sbjct: 333 GIVSCR-----GWFCSSQCNRVSSALENLI-------------------GCKIAVGNNGD 368
Query: 746 VRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE 801
+ W L+ G+ E L AV I H F+P D SGRDL+ +++ ++ G
Sbjct: 369 LVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTKDVFSGRDLVEELIFRKDRTG-- 426
Query: 802 FG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G G Y ++ ++ +RV ++V E+PLVAT
Sbjct: 427 VGRGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVAT 462
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 82.8 bits (203), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 103/277 (37%), Gaps = 61/277 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S +P+G WY C N R N + VS
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GNPLSEKEVST 63
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ IL C QC +H C+ +
Sbjct: 64 FSA---------------------------------------ILKCLQCGDSYHDTCIDQ 84
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 743
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKK 143
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
I A E L+ A+ I +CF +VD +G D+IP ++Y G N +
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVLYNKGSNFARLD 193
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
+ G Y IL ++ +RV G + AELP +ATS
Sbjct: 194 YQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 230
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 85/176 (48%), Gaps = 31/176 (17%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 726
++ C+QC+R FH+ CLK+ D + WFC C+R+ S L+NLL
Sbjct: 315 LMACEQCQRRFHLTCLKE----DSCIVSSRGWFCSSQCNRVFSALENLL----------- 359
Query: 727 LNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSI 782
G+ + +D D+ W L+ G+ E L AV I H F+P D
Sbjct: 360 --------GSKIAVGNDGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTNDVF 411
Query: 783 SGRDLIPSMVYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
SGRDL+ ++Y ++ G G G Y ++ + ++ +RV ++V E+PLVAT
Sbjct: 412 SGRDLVEELIYRKDRTG--VGRGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVAT 464
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 23/31 (74%), Gaps = 1/31 (3%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGD-WY 596
GG+LL CDGCP AFH C LSS+P+ D W+
Sbjct: 265 GGDLLLCDGCPSAFHHACLGLSSLPEEDLWF 295
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/277 (24%), Positives = 104/277 (37%), Gaps = 64/277 (23%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+LL CD C FH C + +P GDW+C+
Sbjct: 856 GDGGDLLCCDRCTSTFHVACLGIE-MPSGDWFCR-------------------------- 888
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
CI C C + S P +L C QC R++H C +
Sbjct: 889 ---------NCI------------CKFCGSAEERTSS--PAELLSCLQCSRKYHQVCAQG 925
Query: 685 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 742
+ + P FC C++I L+ LL + + F + ++ +A
Sbjct: 926 IEREFVSTTPSASIDCFCSPGCTKIYKRLKRLLGLKNDLEAGFSWSLVRCFA-------- 977
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
D KA + A ++ +CF P +D SG ++I ++VY G +
Sbjct: 978 --DTEATSTKKKAQLVHCNSKTALAFSVLDECFLPRIDERSGINIIHNVVYNCGSDFSRL 1035
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
F G Y IL V+SA +R+ G + AE+P + T
Sbjct: 1036 NFSGFYTFILERGDEVISAATVRIHGTDFAEMPFIGT 1072
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 1/134 (0%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
K+ + K +KK S N P V L TG+ DGV+V Y + + L+GII+ G LC C
Sbjct: 367 KSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR-EVRNLKGIIKGTGYLCGC 425
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN RV+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+
Sbjct: 426 GNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVT 485
Query: 329 SSLPEEKSFACVRC 342
S K+F +
Sbjct: 486 GSDINHKNFNTWKA 499
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ++K ++K S N P V L TG+LDGV V Y+ + + LRGII+ LC
Sbjct: 293 KNKQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE---LRGIIKGTTYLCG 349
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN + + +FE HA + + + +I FENGK++ ++++ RS P L T+Q+
Sbjct: 350 CQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTI 409
Query: 328 LSSLPEEKSF 337
+ +K+F
Sbjct: 410 FGAPINQKAF 419
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 110/276 (39%), Gaps = 76/276 (27%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
DGG LL CD CP FH C ++ +PQG W C YC
Sbjct: 461 DGGELLCCDSCPSTFHPACLAMK-VPQGWWACHYC------------------------- 494
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
RC+ + N + LS C C ++H C ++
Sbjct: 495 --------RCVLCMANDDQGLS---------------------TCQHCSLKYHEVC-RRP 524
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVSD 743
+++ R + +C C ++++ L +++ E + L I+K E VS
Sbjct: 525 SLSNGRGIGA---YCSETCKKVSARLSDMVGVTNHTEDGFSWALLKIQKD-----EAVSS 576
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQE 801
D AA E + L+ A+ + ++CF+P D + D++ VY G +
Sbjct: 577 QDT--------AAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAVYSLGSEFKRVS 628
Query: 802 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ G Y +L + ++A +LR+ G +VAE+P AT
Sbjct: 629 YEGFYTMVLDKDGETIAAALLRIHGTKVAEMPFAAT 664
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/275 (22%), Positives = 106/275 (38%), Gaps = 73/275 (26%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
ADGG LL CD CP FH C ++ +P+G W C YC
Sbjct: 519 ADGGELLCCDSCPSTFHPACLAMK-VPEGLWACHYC------------------------ 553
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
RC+ + N + LS C C ++H C +
Sbjct: 554 ---------RCVLCMANDDQGLS---------------------RCQHCTLKYHEIC--R 581
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
+++ R +C C ++++ L +++ F +K + + +
Sbjct: 582 PSLSNGR---GNGAYCSETCKKVSAQLSDMIGITNHTEDGFSWALLKIQKDEPVSSQNSP 638
Query: 745 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEF 802
DV E + L+ A+ + ++CF+P+ D + D++ VY G + +
Sbjct: 639 DVL-----------ECNVKLAVALGVLNECFNPVKDRRTKIDMLHQAVYSLGSEFKRVSY 687
Query: 803 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G Y +L N ++SA +LR+ G +VAE+P T
Sbjct: 688 EGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGT 722
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 60/91 (65%), Gaps = 3/91 (3%)
Query: 748 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 806
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 40 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 97
Query: 807 CAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
CA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 98 CALLTASTHVVSAAILKVRIEQIAELVLIAT 128
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 101/275 (36%), Gaps = 61/275 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S +P+G WY C N R N + VS
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GNPLSEKEVST 63
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ IL C QC +H C+ +
Sbjct: 64 FSA---------------------------------------ILKCLQCGDSYHDTCIDQ 84
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 743
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGQKLHSSKK 143
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 803
I A E L+ A+ I +CF +VD +G D+IP ++ N ++
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDYQ 191
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
G Y IL ++ +RV G + AELP +ATS
Sbjct: 192 GFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 226
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 76.6 bits (187), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 111/288 (38%), Gaps = 80/288 (27%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG+LL CD CP FH C + +P GDW+C+ C F
Sbjct: 316 GDGGDLLCCDNCPSTFHLACLGIK-MPSGDWHCRSCICRF-------------------- 354
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
S ++IT AEL CL QC R++H C
Sbjct: 355 CGSTQEITTS--------SAELLSCL---------------------QCSRKYHQVCAPG 385
Query: 685 HKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS 742
++ FC C +I L+ LL + NAI+ AG S V
Sbjct: 386 TMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK---------NAIE--AGFSWSLV- 433
Query: 743 DIDVRWRLLSGK-AATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSMVY- 793
R K AA P+ + L + A ++ +CF P +D SG ++I +++Y
Sbjct: 434 ------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYN 487
Query: 794 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
G + F Y IL V+SA +R+ G ++AE+P + T I
Sbjct: 488 CGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGI 535
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 76.3 bits (186), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 107/281 (38%), Gaps = 87/281 (30%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
+GG L+ C+ CP FH EC SL +P+ W+
Sbjct: 277 EGGELVCCETCPLTFHMECVSLLEVPKDAWF----------------------------- 307
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
C R CL C + P C+QCER FH GC
Sbjct: 308 ---------CFR-----------CLCCHCGE-------PLRTQPCEQCERCFHPGCCDDA 340
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 745
+A G +F C +S N+ + AE + ++ + +
Sbjct: 341 ILA-------GDFFFC------SSGCWNIFQRLAEMVA-------------TVNPLGRSE 374
Query: 746 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ--EFG 803
+ W LL + LL++A+ + FDP++D + D + +MV+ R+ +F
Sbjct: 375 LSWSLLRRGRCDDK---LLAEALQVISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRLDFS 431
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
G Y A+L + VV +LR+ G +AE+P +AT G+
Sbjct: 432 GFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQ 472
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 99/275 (36%), Gaps = 61/275 (22%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S +P+G W YC H+ G
Sbjct: 12 GDGGELLCCDNCPSTYHQTCLSDQELPEGSW---YC-----------HNCTCRSCGNPLS 57
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
V + IL C QC +H C+ +
Sbjct: 58 EKEVSTFS---------------------------------AILKCLQCGDSYHDTCIDQ 84
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSD 743
+M + WFC C I L N + E E + +K G L +
Sbjct: 85 -EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTDGRKLHSSKK 143
Query: 744 IDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFG 803
I A E L+ A+ I +CF +VD +G D+IP ++ N ++
Sbjct: 144 I----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SNFARLDYQ 191
Query: 804 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
G Y IL ++ +RV G + AELP +ATS
Sbjct: 192 GFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 226
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 96/281 (34%), Gaps = 84/281 (29%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG+L+ CD CP FH C L +P +W+C C
Sbjct: 152 GGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC-------------------------- 185
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
C LC D S S C QC R +HV CL K
Sbjct: 186 ---------------------CCELCGKGDSSTSTNA------CLQCARAYHVHCLTKDG 218
Query: 687 MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDV 746
+ P + FC C + + L LL G S T D +
Sbjct: 219 CLLPTDYP-SENFCSKSCYELCAQLHQLL-------------------GISNPTSVD-GL 257
Query: 747 RWRLLSGKAAT--------PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRN 796
W L T + Q + + H+CF + + + +D++ ++Y G
Sbjct: 258 TWTLTRSSKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDMVTDLIYNSGSK 317
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ F G Y +L +VS LR+ G + AE+PLVAT
Sbjct: 318 FKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVAT 358
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + Q + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQRN-LKGMIKGTGYLCGCGNC 444
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 445 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 504
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 505 INHKNFNTWKA 515
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 97/226 (42%), Gaps = 58/226 (25%)
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI---------- 707
S GFG + LCD+C FH+GCL L +P G+WFC C +I
Sbjct: 130 SICGFGG-DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQEC 182
Query: 708 -----NSVLQNLLVQEAEKLPEFHLNAIKKYA---------------GNSLETVSDI--- 744
N++L + VQ +K +H +K GN + +
Sbjct: 183 KDHKDNNIL--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGK 237
Query: 745 ------DVRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 793
++ W L+ ++ + L+ A+ + ++ F+P D++SGR+LI +V+
Sbjct: 238 PIKVADNLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVF 297
Query: 794 GRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
R F G Y IL V+S +R++GQ+VAE+ VAT
Sbjct: 298 SRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVAT 343
Score = 46.2 bits (108), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG+L CD CP AFH C L+ +P G+W+C C
Sbjct: 135 GGDLALCDRCPSAFHLGCLGLNRVPIGEWFCPTC 168
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 97/226 (42%), Gaps = 58/226 (25%)
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI---------- 707
S GFG + LCD+C FH+GCL L +P G+WFC C +I
Sbjct: 130 SICGFGG-DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCCCKICYRPKCKQEC 182
Query: 708 -----NSVLQNLLVQEAEKLPEFHLNAIKKYA---------------GNSLETVSDI--- 744
N++L + VQ +K +H +K GN + +
Sbjct: 183 KDHKDNNIL--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVCGNMFLCLKKLLGK 237
Query: 745 ------DVRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 793
++ W L+ ++ + L+ A+ + ++ F+P D++SGR+LI +V+
Sbjct: 238 PIKVADNLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTFDALSGRELIKDVVF 297
Query: 794 GRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
R F G Y IL V+S +R++GQ+VAE+ VAT
Sbjct: 298 SRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVAT 343
Score = 46.2 bits (108), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG+L CD CP AFH C L+ +P G+W+C C
Sbjct: 135 GGDLALCDRCPSAFHLGCLGLNRVPIGEWFCPTC 168
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 74.3 bits (181), Expect = 3e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 46/71 (64%)
Query: 583 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 642
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 643 EAELSGCLLCR 653
E GC LC+
Sbjct: 64 ETGFGGCALCK 74
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/273 (22%), Positives = 95/273 (34%), Gaps = 83/273 (30%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 965 GDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT----------------------- 1001
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
C +C G K I C QC +H C+++
Sbjct: 1002 ------------------------CQVCGGPFSEKEVSTFSAIFKCFQCGDAYHDTCIEQ 1037
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 744
K+ L + WFC C I ++ + G + + D
Sbjct: 1038 EKLP-LEDQISQTWFCGKYCKEI-------------------FIGLRSHVGT--DNILDS 1075
Query: 745 DVRWRLL----SGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 792
D+ W +L G+ A E + L+ A+ + +CF +VD +G D+IP ++
Sbjct: 1076 DLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALTLLEECFIRMVDPRTGVDMIPHVL 1135
Query: 793 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 823
Y G N ++ G Y IL ++ +R
Sbjct: 1136 YNKGSNFARVDYQGFYTVILEKGDEILCVASIR 1168
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 66/131 (50%), Gaps = 2/131 (1%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSR--ERNLKGMIKGTGYLCGCGNC 443
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 444 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 503
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 504 INHKNFNTWKA 514
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 140/371 (37%), Gaps = 110/371 (29%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS 586
+V Y + +LEG+ GI C CC+ ++ S+FE HA G+ LP
Sbjct: 621 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHA--GSKLP-------------- 664
Query: 587 LSSIPQGDWYCKYCQNMFERKRFLQHDA-NAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 645
P + Y + ++ + Q DA N E G SV+ + +
Sbjct: 665 ---QPYQNIYLESGVSLLQ----CQIDAWNRQEHAEKIGFHSVD---------IDGNDPN 708
Query: 646 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----- 700
C +C G G ++ CD C FH CL D++ LP G+W C
Sbjct: 709 DDTCGIC--------GDGG-DLICCDGCPSTFHQSCL------DIQMLPPGEWHCPNCTC 753
Query: 701 ----------------------CMDCSR----------------INSVLQNLLVQEAEKL 722
C+ C + INS + +E ++L
Sbjct: 754 KFCGIASETSDKDDASVNVLRTCILCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKEL 813
Query: 723 PEFHLNAIKKYAGNSLETVSDIDVRWRLLS-----------GKAATPETRLLLSQAVAIF 771
E+ +KKY G E + W L+ G E L+ A+ +
Sbjct: 814 SEY----LKKYLGTKHEL--EAGFSWCLIHRSDEDSEAACRGLTQRVECNSKLAIALTVM 867
Query: 772 HDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 829
+CF P++D SG +LI +++Y G N + G Y AIL +++A +R G ++
Sbjct: 868 DECFLPVIDRRSGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKI 927
Query: 830 AELPLVATSKI 840
AE+P + T I
Sbjct: 928 AEMPFIGTRHI 938
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/339 (22%), Positives = 118/339 (34%), Gaps = 112/339 (33%)
Query: 546 GIICHCCNSEVSPSQFEAHA--------------DGGNLLPC------------------ 573
GI C CCN + ++FEAHA G +LL C
Sbjct: 800 GIQCDCCNKTFTSAEFEAHAGGKSCQPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFHF 859
Query: 574 ---DG----------------------CPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
DG CP FH+ C + P G W+C YC F
Sbjct: 860 IDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKF---- 915
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
G V G C R + ++ ++
Sbjct: 916 ----------CGMVGG--------NTCQR-----DGNMAAV--------------SHALV 938
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 728
C CE ++H C ++ + + P FC +C + LQ L + E F
Sbjct: 939 TCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQELYERLQMLFGVKQELEAGFSWT 996
Query: 729 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 788
++++ + SDI V SG + + ++ A+ I +CF P+VD SG +LI
Sbjct: 997 FVRRF-----DVSSDISV-----SGMSWKVDCNSKVAVALQIMDECFVPMVDHKSGVNLI 1046
Query: 789 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVF 825
++VY G N + G + A+L +++A +R F
Sbjct: 1047 RNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF 1085
>gi|154309635|ref|XP_001554151.1| hypothetical protein BC1G_07288 [Botryotinia fuckeliana B05.10]
Length = 765
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 88/207 (42%), Gaps = 59/207 (28%)
Query: 546 GIICHCCN--SEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +C C V+P++ +L CDGCP H++C S+ IP+GDW+CK CQ
Sbjct: 291 GPVCEICTKPDSVAPNK---------ILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKA 341
Query: 604 FERKRFLQHDANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 662
+ N GV DS ++I+ C +C+G D K
Sbjct: 342 RVAAEAARAAEN-------DGVTDSDDEIS----------------CAVCQGLDSEK--- 375
Query: 663 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 722
P I+LC+ C+ H C ++ + P+G+W C C ++ V +LL +E +
Sbjct: 376 -PNEIILCENCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLG 425
Query: 723 P-----------EFHLNAIKKYAGNSL 738
P E HL +++ N L
Sbjct: 426 PISNEVPSIEGFETHLKTMQRVLLNRL 452
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 448 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 505
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F +
Sbjct: 506 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQTWKA 563
>gi|347838360|emb|CCD52932.1| hypothetical protein [Botryotinia fuckeliana]
Length = 886
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 88/207 (42%), Gaps = 59/207 (28%)
Query: 546 GIICHCCN--SEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +C C V+P++ +L CDGCP H++C S+ IP+GDW+CK CQ
Sbjct: 291 GPVCEICTKPDSVAPNK---------ILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKA 341
Query: 604 FERKRFLQHDANAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 662
+ N GV DS ++I+ C +C+G D K
Sbjct: 342 RVAAEAARAAEN-------DGVTDSDDEIS----------------CAVCQGLDSEK--- 375
Query: 663 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL 722
P I+LC+ C+ H C ++ + P+G+W C C ++ V +LL +E +
Sbjct: 376 -PNEIILCENCDYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLG 425
Query: 723 P-----------EFHLNAIKKYAGNSL 738
P E HL +++ N L
Sbjct: 426 PISNEVPSIEGFETHLKTMQRVLLNRL 452
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 69/141 (48%), Gaps = 13/141 (9%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 473 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 530
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F K
Sbjct: 531 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQT--WKA 588
Query: 345 TFPITCV---------GKTGP 356
++ V G TGP
Sbjct: 589 SYQAATVELQRIYGKDGATGP 609
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 128/341 (37%), Gaps = 94/341 (27%)
Query: 546 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
GI+C+CC +S S F+AHA GC +SL Q CQ
Sbjct: 967 GILCNCCTKTLSISDFKAHA---------GC----RLRLSSLGLFLQSGKSYTLCQ---- 1009
Query: 606 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 665
VEA S E +++R + +EA C C G G
Sbjct: 1010 -----------VEAW------SAELMSRRSDAYGRKVEAVDENDDTCGFC-----GDGGE 1047
Query: 666 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQ 712
+L CD C +H CL +ELP+G W+C C C S + +L+
Sbjct: 1048 -LLCCDNCPSTYHEACLSS------QELPEGSWYCHNCTCRSCGNPVNEKEVSSFSDILK 1100
Query: 713 NLLVQEAEK--------LPEFHLNAIKKYAGN-------------SLETVSDIDVRW--- 748
L +A LP + + G +E V + D+ W
Sbjct: 1101 CLQCGDAYHNTCIDRVMLPSDGKRSDTWFCGRYCKEIFMGLHSQVGVENVINNDLSWTIL 1160
Query: 749 -------RLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNL 797
RL S + E L+ A+ + +CF +VD +G D+IP ++Y G N
Sbjct: 1161 RCNSDGQRLHSAQKIGLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIPHVLYNKGSNF 1220
Query: 798 RGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
++ G Y IL ++ +R+ G + AELP +ATS
Sbjct: 1221 ARLDYKGFYTVILEKGDEILCVASIRLHGTKAAELPFIATS 1261
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 69.3 bits (168), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 40/60 (66%)
Query: 506 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 565
+ +D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E++ S FE HA
Sbjct: 507 SARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHA 566
Score = 65.9 bits (159), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 12/157 (7%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL TGLL+G V + L GI +D G++C+C +C G +V+ S
Sbjct: 308 LTKPPRNAKELMATGLLEGHYVH----CSCRGEQLTGIFQDMGVVCNCRICKGTQVVSIS 363
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLPE-EKS 336
FE H+ S I ENGK+L ++L A + +L+A LQ A+ + K
Sbjct: 364 AFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRA-LQHAIGEIQGISKE 422
Query: 337 FACVRC---KGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
CV+C +G I+C G CV K P
Sbjct: 423 MTCVKCGKHEGGEFISCKGAKCSAAYHAECVGVKSPH 459
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 4/77 (5%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDG--CPRAFHKECASLSSI 590
CG +L ++ +G I + E++ + H +GG + C G C A+H EC + S
Sbjct: 401 CGDNILRALQHAIGEI-QGISKEMTCVKCGKH-EGGEFISCKGAKCSAAYHAECVGVKSP 458
Query: 591 PQGDWYCKYCQNMFERK 607
DW+C C+ RK
Sbjct: 459 HLEDWFCAKCEKTQARK 475
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 72/147 (48%), Gaps = 12/147 (8%)
Query: 696 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKA 755
G FC C + LQNLL + + PE+ +++ + E V +D R
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLALDKRV------- 76
Query: 756 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVN 813
E ++ A+++ +CF PI+D +G +LI ++VY G N +F G Y IL
Sbjct: 77 ---ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERG 133
Query: 814 SSVVSAGILRVFGQEVAELPLVATSKI 840
+++A +R+ G ++AE+P + T +
Sbjct: 134 DEIIAAASVRIHGTKLAEMPFIGTRNM 160
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/402 (24%), Positives = 153/402 (38%), Gaps = 102/402 (25%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQF 561
K + + D + DG V Y G +K++ G G+ C CC+ V F
Sbjct: 367 KKHTILTWLIDGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVHCSCCDGVVPLPVF 426
Query: 562 EAHA---------------------DGGNLLPCDGCP------RAFHKECASLSSIPQGD 594
EAHA G +LL C R FH + +++ Q +
Sbjct: 427 EAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFHAQAKVRAALEQEE 486
Query: 595 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLC 652
C ++R L A ++ G V VE+I + +K E + S C +C
Sbjct: 487 DKCSQA-----KRRLL---AKHLKKGVV-----VERIMSPRMEKIKAGEKDSSDDACGVC 533
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRIN 708
+ G +L CD C FH CL ++P+G W C C+ C N
Sbjct: 534 -----ADGG----ELLCCDSCTSTFHPECLAI-------KVPEGSWSCHYCRCVLCMS-N 576
Query: 709 SVLQNLLV--QEAEKLPE----FHLNA--IKKYAGNS-------LETVSDI------DVR 747
LQ L Q A K E N I Y G + L V+ +
Sbjct: 577 DDLQGLSTCQQCARKYHESCRPLPGNGCDIGTYCGETCKKLFSQLAQVTGVTNPTGDGFW 636
Query: 748 WRLLSGKAATP----------ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 795
W LL + P E + L+ A+ +F++CF+P+ D + D++ VY G
Sbjct: 637 WALLRIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGS 696
Query: 796 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ + G Y +L + +VSA +LR+ G +VAE+P T
Sbjct: 697 QFKRLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGT 738
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGI 821
L A+++ H+CF+P+ +S+S RDL+ +++ R L F G Y +L N ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 822 LRVFGQEVAELPLVAT 837
+RV+G++VAE+PLV T
Sbjct: 1039 VRVYGKKVAEIPLVGT 1054
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 55/141 (39%), Gaps = 49/141 (34%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CP +FHK C L IP GDW+C C +R + D D
Sbjct: 650 GGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQRKIDRD------------D 697
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
VEQ +L C QCE ++HV CL ++
Sbjct: 698 EVEQ------------------------------------LLPCIQCEHKYHVRCL-ENG 720
Query: 687 MADLRELPKGKWFCCMDCSRI 707
AD+ G WFC DC ++
Sbjct: 721 AADISTRYLGNWFCGKDCEKL 741
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 85/189 (44%), Gaps = 30/189 (15%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM------------DCSRINSVLQNL 714
+LLCD+C FH C+ L+ P+G W C + D + + +
Sbjct: 576 LLLCDKCPSAFHHACVG------LQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 715 LVQEAEKLPE----FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 770
+ EA +P L+ +++ + TV+ I RW+ AA L A+ +
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRR--RYMSTVTRI-TRWQHEEEDAADHGQ---LCAALDV 683
Query: 771 FHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 828
H+CFD +V+ + DL +V+ + L F G Y L +++ G LRVFG +
Sbjct: 684 LHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVFGNQ 743
Query: 829 VAELPLVAT 837
VAELPLV T
Sbjct: 744 VAELPLVGT 752
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 27/50 (54%)
Query: 552 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
C+ E S DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 558 CSEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR 607
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 87/378 (23%), Positives = 153/378 (40%), Gaps = 92/378 (24%)
Query: 512 LHKLVFDESGLPDGTEVGYY-ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNL 570
+ + D L +G V Y ++ G + GI+C CCN S + F+ HA
Sbjct: 3 IFSWLIDGEILSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGD--- 59
Query: 571 LPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE-RKRFLQHDANAVEAGRVSGVDSVE 629
H+ A+L ++ G +++ E +K+ L+ A +G +V+
Sbjct: 60 --------EVHRT-AALLTLEDG-------RSVLECQKQALKKIEQAKCDEPANGQLTVD 103
Query: 630 QITKRCIR------IVKNLEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
+ + + +V ++E + + C +C G G + ++ CD C FH+ C
Sbjct: 104 ETALKAMELKESELVVDDVEMDENDDTCAVC--------GDGGQ-LVCCDHCPSTFHLKC 154
Query: 682 LKKHKMADLRELPKGKWFC----CMDCSR--INSVLQN-LLVQEAEKLP----------- 723
L+ L +P+G WFC C C R + +Q +L + +P
Sbjct: 155 LR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHSNCVPGCAMKYESSDN 208
Query: 724 EF-------HLNAIKKYAGNSLETVSDIDVRWRLLSGK-------------AATPETRLL 763
+F ++K G + V D+ W LL + A TRL
Sbjct: 209 QFCSRKCFKIFRGLRKLVGR-VNKVDDM-YSWTLLRSEHYDQSAENSKLESVADLNTRLA 266
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE----FGGMYCAILTVNSSVVSA 819
L A+ + +CF P++D S D++ ++Y R RG++ F G Y +L ++S
Sbjct: 267 L--ALTVIQECFRPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYTVVLEKEQELISV 322
Query: 820 GILRVFGQEVAELPLVAT 837
+RV G AE+P + T
Sbjct: 323 ASMRVHGSHAAEIPFIGT 340
Score = 39.7 bits (91), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1189
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 125/346 (36%), Gaps = 109/346 (31%)
Query: 546 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC--QNM 603
G++C CCN VS S+F+ HA PC F +S W +Y +N
Sbjct: 657 GVVCTCCNKTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEYKARRNG 713
Query: 604 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFG 663
+ ++ D N G V G G L+C
Sbjct: 714 WRLEKASDDDPNDDSCG-VCGD---------------------GGELIC----------- 740
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEA 719
CD C FH CL ++ LP+G W+C C CS L+ A
Sbjct: 741 ------CDNCPSTFHQACLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNA 781
Query: 720 EKLPEFHLNA-IKKYAGNSLETVS--------------------------------DID- 745
E+ +F + KY G L+ +S + D
Sbjct: 782 ERSQDFKCSQCAHKYHGTCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADG 841
Query: 746 VRWRLL----------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 793
+ W +L S + A E L+ A++I + F +VD +G D+IP ++Y
Sbjct: 842 LSWSILKCFQEDGMVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLY 901
Query: 794 --GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G +F G Y ++ + ++S +RV G +AE+PLVAT
Sbjct: 902 NWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVAT 947
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 124/338 (36%), Gaps = 93/338 (27%)
Query: 546 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
G++C CCN VS S+F+ HA PC F +S W +Y
Sbjct: 645 GVVCTCCNRTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEY------ 695
Query: 606 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 665
R +G S E + + C +C G G
Sbjct: 696 -------------KARRNGWRSEE---------ASDDDPNDDSCGVC--------GDGGE 725
Query: 666 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC--------------------MDCS 705
++ CD C FH CL ++ LP+G W+C CS
Sbjct: 726 -LICCDNCPSTFHQACLS------MQVLPEGSWYCSSCSCQICSELVSDNGERSQDFKCS 778
Query: 706 RINSVLQNLLVQEAEK----LPEFHL---NAIKKYAGNSLETVSDID-----VRWRLL-- 751
+ + +Q K PE + N K Y G S V I+ + W +L
Sbjct: 779 QCAHKYHGICLQGISKRRKLFPETYFCGKNCEKVYTGLS-SRVGVINPNADGLSWSILKC 837
Query: 752 ---SGK-------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRG 799
GK A E L+ A++I + F +VD +G D+IP ++Y G N
Sbjct: 838 FQEDGKVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSNFAR 897
Query: 800 QEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+F G Y +L + ++S +RV G VAE+PLVAT
Sbjct: 898 LDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVAT 935
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 73/182 (40%), Gaps = 33/182 (18%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 371 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAPEGKWS 418
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGC 655
C +C+ K +Q +A + I++ +R+ E E + CR C
Sbjct: 419 CPHCE-----KEGIQWEAKDEDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVC 473
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C I +Q
Sbjct: 474 ---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQK 522
Query: 714 LL 715
+L
Sbjct: 523 IL 524
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 66.6 bits (161), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 125/346 (36%), Gaps = 109/346 (31%)
Query: 546 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC--QNM 603
G++C CCN VS S+F+ HA PC F +S W +Y +N
Sbjct: 613 GVVCTCCNKTVSLSEFKNHAGFNQNCPC---LNLFMGSGKPFASCQLEAWSAEYKARRNG 669
Query: 604 FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFG 663
+ ++ D N G V G G L+C
Sbjct: 670 WRLEKASDDDPNDDSCG-VCGD---------------------GGELIC----------- 696
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEA 719
CD C FH CL ++ LP+G W+C C CS L+ A
Sbjct: 697 ------CDNCPSTFHQACLS------MQVLPEGSWYCSSCTCWICSE-------LVSDNA 737
Query: 720 EKLPEFHLNA-IKKYAGNSLETVS--------------------------------DID- 745
E+ +F + KY G L+ +S + D
Sbjct: 738 ERSQDFKCSQCAHKYHGTCLQGISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADG 797
Query: 746 VRWRLL----------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 793
+ W +L S + A E L+ A++I + F +VD +G D+IP ++Y
Sbjct: 798 LSWSILKCFQEDGMVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLY 857
Query: 794 --GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
G +F G Y ++ + ++S +RV G +AE+PLVAT
Sbjct: 858 NWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVAT 903
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 56/102 (54%), Gaps = 2/102 (1%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
+LDGV V Y+ + LRGII+ G LC C CN +VI +FE HA + + +
Sbjct: 1 MLDGVPVKYIAWSR--EKELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNN 58
Query: 296 YICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
+I FENGK++ +++ +S P L +Q+ S +KSF
Sbjct: 59 HIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSF 100
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 66.2 bits (160), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 85/203 (41%), Gaps = 42/203 (20%)
Query: 549 CHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ--NMF 604
C C + +P Q +L CDGC R +H C LS IPQGDW+C C +
Sbjct: 978 CRVCRKKSNPEQ---------MLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQLS 1028
Query: 605 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 664
RKR VE D+ E++ + + + C +C P
Sbjct: 1029 PRKR----TKAPVEVSSEEEDDN-EKVDEDGDEDEEEEDLNQEVCNICE---------SP 1074
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKL 722
++LCD C + FH+ C+ DL+ LP+G W C C+ + N L K+
Sbjct: 1075 GELILCDFCPKSFHLDCI------DLKRLPRGTWKCPPCVLGKKKNKRGSPPLT----KV 1124
Query: 723 PEFHLNAIKKYAGNSLETVSDID 745
N I+KY L TV+D+D
Sbjct: 1125 KVRSRNNIRKY---DLATVTDVD 1144
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 130/342 (38%), Gaps = 100/342 (29%)
Query: 546 GIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
GI+C+CC S S F+ H GG LP +SL Q CQ
Sbjct: 956 GILCNCCTKTFSISDFKVH--GGCSLPK-----------SSLGLFLQSGKSYTLCQ---- 998
Query: 606 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 665
VEA S E ++++C + +EA C C G G
Sbjct: 999 -----------VEAW------SAEFLSRKCDASGRKVEAMDENDDTCGFC-----GDGGE 1036
Query: 666 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDC---------SRINSVLQ 712
+L CD C +H CL +ELP+G W+C C C S +++L+
Sbjct: 1037 -LLCCDNCPSTYHQTCLSD------QELPEGSWYCHNCTCRSCGNPLSEKEVSTFSAILK 1089
Query: 713 NLLVQEA--------EKLP----------------EFHLNAIKKYAGNSLETVSDIDVRW 748
L ++ E LP E + + + G +E D ++ W
Sbjct: 1090 CLQCGDSYHDTCIDQEMLPCGDKQSNIWFCGRYCKEIFI-GLHNHVG--IENFLDNELSW 1146
Query: 749 RLL----------SGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN 796
+L S K A E L+ A+ I +CF +VD +G D+IP ++ N
Sbjct: 1147 SILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL--SN 1204
Query: 797 LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 838
++ G Y IL ++ +RV G + AELP +ATS
Sbjct: 1205 FARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATS 1246
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 61/132 (46%), Gaps = 34/132 (25%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRVSGVDSV 628
+L CDGCP A H++C S+ IP GDW+CK CQ N + ++ +A+ DS
Sbjct: 305 ILFCDGCPLAVHQKCYSVPKIPDGDWFCKKCQRNRVAAEAARANENDALN-------DSD 357
Query: 629 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 688
++I C +CRG D K P I+LC+ C+ H C
Sbjct: 358 DEIK----------------CAVCRGLDSKK----PNEIILCENCDYAVHQTC------G 391
Query: 689 DLRELPKGKWFC 700
D+ + P+ +W C
Sbjct: 392 DIPKKPREEWLC 403
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 27/35 (77%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
DGG+LL CD CPR+FH +C L SIP+ DWYCK C
Sbjct: 129 DGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 70/182 (38%), Gaps = 42/182 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 354 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWS 401
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A + +I V + E E CR C
Sbjct: 402 CPHCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC- 445
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 446 --KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKI 495
Query: 715 LV 716
L
Sbjct: 496 LT 497
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 87/202 (43%), Gaps = 31/202 (15%)
Query: 667 ILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL-- 722
++ CD C FH CL + H M D L + + C + I ++ V + K+
Sbjct: 650 LVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGIAPIVHANSVGQLLKMLL 708
Query: 723 -PEFHLNA----------IKKYAGNSLETVSDIDVRWRLLSGKAATPETRL--------- 762
P + A +KKY G E + W L+ + A + L
Sbjct: 709 RPRMQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRECADSDLFLGEHPHIVEN 766
Query: 763 --LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 818
L+ A+ + +CF PIVD SG +++ +++Y G N FGG Y A+L VV+
Sbjct: 767 NSKLALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVA 826
Query: 819 AGILRVFGQEVAELPLVATSKI 840
+ +R G +AE+P + T +
Sbjct: 827 SASIRFHGNHLAEMPFIGTRHV 848
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDW 595
DGG+L+ CDGCP FH+ C + DW
Sbjct: 645 GDGGDLVCCDGCPSTFHQRCLDIRGHLMPDW 675
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 406
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A R G D E ++E + CR C
Sbjct: 407 CPHCE-----KEGIQWEA------REDGSDGEEDNGD-----AGDMEEDDHHMEFCRVC- 449
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 450 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCM-CPPMKGKVQKI 499
Query: 715 LV 716
L
Sbjct: 500 LT 501
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/306 (25%), Positives = 110/306 (35%), Gaps = 73/306 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 596
LLEG+ + GI C CC+ + S+FE HA + C+ + SL W
Sbjct: 208 LLEGWISRDGIRCGCCSEIFTISKFEIHA---GMKLCEPSQNIILETGISLLQCQLDSWN 264
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
Q ER F D G D T C +C
Sbjct: 265 K---QEESERSGFHLVDV---------GADDPNDDT----------------CGIC---- 292
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 716
G G ++ CD C FH CL L+ L K + LV
Sbjct: 293 ----GDG-GDLICCDGCPSTFHQSCLDIQLFEQLQMLLGVK-------HELEDGFSWTLV 340
Query: 717 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 776
Q E + LN I + E L+ A++I +CF
Sbjct: 341 QRTEVGFDISLNGIPQKV------------------------ECNSKLAVALSIMDECFL 376
Query: 777 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 834
PIVD SG +LI +++Y G N + G + AIL ++SA +R+ G ++AE+P
Sbjct: 377 PIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPF 436
Query: 835 VATSKI 840
+ T I
Sbjct: 437 IGTRHI 442
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 62.8 bits (151), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 79/177 (44%), Gaps = 31/177 (17%)
Query: 670 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 729
C+QCER FH GC +A G +F C +S NL + AE +
Sbjct: 164 CEQCERCFHPGCCDDAILA-------GDFFFC------SSGCWNLFQRLAEMVA------ 204
Query: 730 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 789
++ + ++ W LL + LL++A+ + FDP++D + D +
Sbjct: 205 -------TVNPLGRSELSWSLLRRGRCDDK---LLAEALQLISSRFDPVLDCWTQLDYLD 254
Query: 790 SMVYGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
+MV+ R+ +F G Y A+L + VV +LR+ +AE+P +AT G+
Sbjct: 255 AMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQ 311
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+GG L+ C+ CP FH EC SL +P+ W+C C
Sbjct: 116 EGGELVCCETCPLTFHMECVSLLEVPKDAWFCFRC 150
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 74/187 (39%), Gaps = 33/187 (17%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G + +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 379 GDEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 426
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 650
+G W C +C+ K +Q +A E I++ + + E E +
Sbjct: 427 EGKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPTGAEEEDDDHME 481
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRIN 708
CR C K G +L CD C +H+ CL L E+P G+W C C C I
Sbjct: 482 FCRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIK 530
Query: 709 SVLQNLL 715
+Q +L
Sbjct: 531 GRVQKIL 537
>gi|357510881|ref|XP_003625729.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
gi|355500744|gb|AES81947.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
Length = 290
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 184 GFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKK-PMTVTELFETGLLD 238
G G E+++ ++ E A S AL + +ELK SKKI+++KK P T+ ELF TGLLD
Sbjct: 203 GSGEETVTKLDQEGAAVESEIDGALAVRRNKMELKTSKKIAVDKKRPTTMKELFRTGLLD 262
Query: 239 GVSVVYMGGIKFQAS 253
GVSVVY+ GIK + S
Sbjct: 263 GVSVVYVSGIKKEES 277
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 59/135 (43%), Gaps = 31/135 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
DGG+L+ CD CP++FH+ C +L+ IP GDW C C L D ++ + +
Sbjct: 465 DGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTG-----EGLPEDGDSSNSAQEEEE 519
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
E + ++ K RG D ++LCD C FH+ CL
Sbjct: 520 GEEETEHDQFCKVCK------------RGGD----------VILCDFCSCVFHLRCLN-- 555
Query: 686 KMADLRELPKGKWFC 700
L E+P+G W C
Sbjct: 556 --PPLGEVPEGDWKC 568
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 90/402 (22%), Positives = 146/402 (36%), Gaps = 103/402 (25%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG-------QKLLEGYKNGLGIICHCCNSEVSPSQ 560
K + + D L DG V YY G +K++ G G+ C+CC++ V
Sbjct: 277 KKHTILTWLIDGGFLSDGETV-YYVPGDSGGAGKEKIVSGAVTRAGVHCNCCDAVVPLPV 335
Query: 561 FEAHA---------------------DGGNLLPC------DGCPRAFHKECASLSSIPQG 593
FE HA G +LL + R FH + +++ Q
Sbjct: 336 FEVHAGRVPGTGQQQQQVAWEKLLLVSGDSLLQSMQEAWQNEKVRTFHAQAKVRAALEQ- 394
Query: 594 DWYCKYCQNMFERKRFL-QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCL 650
+ +N ++R L +H V VE+I + +K E + S C
Sbjct: 395 ----EEEKNSQAKRRLLAKHQKKGV---------VVERIMSPRMEKIKAGEKDSSDDACG 441
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC---------- 700
+C D + +L CD C FH CL E+P G W C
Sbjct: 442 VC--ADGGE-------LLCCDFCTSTFHPECLAI-------EVPDGSWSCHYCRCTLCMS 485
Query: 701 --------CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-----VR 747
C +C+ L+ + + KK + E + ++
Sbjct: 486 NDDQDLSTCQECACKYHESCRPLLGNGRDIGAYCGEICKKLSAKLSEVIGVMNSTEDGFS 545
Query: 748 WRLL----------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 795
W LL G A E + L+ A+ + + CF+P+ D + D++ VY G
Sbjct: 546 WSLLRIHEDEPASSQGMPAVLERNVKLAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLGS 605
Query: 796 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ + G Y IL + +VS +LR+ G++VAE+P T
Sbjct: 606 QFKRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGT 647
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 70/182 (38%), Gaps = 42/182 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 353 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWS 400
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A + +I V + E E CR C
Sbjct: 401 CPHCE-----KEGVQWEAK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC- 444
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 445 --KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKI 494
Query: 715 LV 716
L
Sbjct: 495 LT 496
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/175 (25%), Positives = 75/175 (42%), Gaps = 37/175 (21%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 726
++ CD C +H CL +LR SR+ + + F
Sbjct: 1056 LICCDNCPASYHQDCLPCQIYMNLR-------------SRVGIPIHTI--------DGFS 1094
Query: 727 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 786
++ + T +DI A E + L A++I +CF PI+D+ +G D
Sbjct: 1095 CTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGID 1144
Query: 787 LIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT
Sbjct: 1145 IIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIAT 1197
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 73/186 (39%), Gaps = 31/186 (16%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G + +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 262 GDEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 309
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 650
+G W C +C+ K +Q +A E I++ + + E E +
Sbjct: 310 EGKWSCPHCE-----KEGIQWEAKDEEFEDFEEDSEDRVISEVSLGVPMGAEEEDDDHME 364
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
CR C K G +L CD C +H+ CL L E+P G+W C C I
Sbjct: 365 FCRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKG 414
Query: 710 VLQNLL 715
+Q +L
Sbjct: 415 RVQRIL 420
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/44 (68%), Positives = 32/44 (72%)
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 844
EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQ 60
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 64/151 (42%), Gaps = 50/151 (33%)
Query: 553 NSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRF 609
NS++S QF D + LL CDGC + +H C + +IP+GDWYC C N +R
Sbjct: 1785 NSKLSNCQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHECMNKATGER- 1843
Query: 610 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 669
C++C G S SG ++L
Sbjct: 1844 --------------------------------------NCIVC-GKKSSTSG---TRLIL 1861
Query: 670 CDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C+ C R +H C+ H + + ++P+GKW+C
Sbjct: 1862 CELCPRAYHTDCI--HPI--MHKVPRGKWYC 1888
Score = 46.2 bits (108), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 10/86 (11%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQF------EAHADGGNLLPCDGC 576
DG + GY+ C + +E G CH C ++ + + ++ G L+ C+ C
Sbjct: 1807 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKATGERNCIVCGKKSSTSGTRLILCELC 1865
Query: 577 PRAFHKECAS--LSSIPQGDWYCKYC 600
PRA+H +C + +P+G WYC C
Sbjct: 1866 PRAYHTDCIHPIMHKVPRGKWYCSKC 1891
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 6/51 (11%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 703
C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1791 CQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1837
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 78/184 (42%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDSDMEKAPEGKW 383
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 428
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 429 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 477
Query: 713 NLLV 716
+L+
Sbjct: 478 KILI 481
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 406
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 407 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 451
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 452 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 501
Query: 714 LLV 716
+L+
Sbjct: 502 ILI 504
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 496
Query: 713 NLLV 716
+L+
Sbjct: 497 KILI 500
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 297 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 344
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 345 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 389
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 390 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 438
Query: 713 NLLV 716
+L+
Sbjct: 439 KILI 442
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 344 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 391
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 392 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 436
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 437 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 486
Query: 714 LLV 716
+L+
Sbjct: 487 ILI 489
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 496
Query: 713 NLLV 716
+L+
Sbjct: 497 KILI 500
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 447
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 448 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 496
Query: 713 NLLV 716
+L+
Sbjct: 497 KILI 500
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 359 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 406
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 407 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 451
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 452 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 501
Query: 714 LLV 716
+L+
Sbjct: 502 ILI 504
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 77/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 503
Query: 713 NLLV 716
+L+
Sbjct: 504 KILI 507
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 360 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 407
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 408 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 452
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 453 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 502
Query: 714 LLV 716
+L+
Sbjct: 503 ILI 505
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 76/183 (41%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 352 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 399
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 400 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 444
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQN 713
C K G +L CD C +H+ CL L E+P G+W C C + +Q
Sbjct: 445 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQK 494
Query: 714 LLV 716
+L+
Sbjct: 495 ILI 497
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 42/92 (45%), Gaps = 19/92 (20%)
Query: 520 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRA 579
SG DG E G A +G+ + C+ C DGG LL CD CPRA
Sbjct: 137 SGEEDGNESGETAD-----DGWADHNRWYCNICK------------DGGQLLCCDRCPRA 179
Query: 580 FHKECASLS--SIPQGDWYCKYCQNMFERKRF 609
FH C +S IP +WYCK C +R+R
Sbjct: 180 FHMSCLGMSVDMIPDSEWYCKMCTECLDRRRL 211
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG LL CDGCPRAFH C L+ IP +W+C C
Sbjct: 1248 GGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC 1281
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 65/164 (39%), Gaps = 36/164 (21%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN----MFERKRFLQHDANAVEAGR 621
+GG LL CDGCP FH C L IP+G +C C +F ++ A + R
Sbjct: 1122 EGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVKPVFPVNGAKKNGKAASKRPR 1181
Query: 622 VSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKS----------------- 660
S + + ++ ++ K + E+E SG + S
Sbjct: 1182 RSNSPTSRRRPRKQAKLGKAKSDDSESEDSGAESDSAVSTTSSARPASRPKAPEDQWDVD 1241
Query: 661 ----GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
G G +L CD C R FHV C+ L E+P +WFC
Sbjct: 1242 CSVCGLGGE-LLCCDGCPRAFHVNCIG------LAEIPDTEWFC 1278
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 560 QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 604
+ ++H G+ GC R FH +CA L ++P DWYCK C+
Sbjct: 1295 RLDSHVICGSEDGTKGCDRVFHLKCAKLDAVPADDWYCKKCRTKL 1339
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 76/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 363 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 410
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 411 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 455
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 504
Query: 713 NLLV 716
+L+
Sbjct: 505 KILI 508
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 76/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 356 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 403
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 404 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 448
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 497
Query: 713 NLLV 716
+L+
Sbjct: 498 KILI 501
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 76/184 (41%), Gaps = 43/184 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 363 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 410
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E +V + E E + CR
Sbjct: 411 SCPHCE-----KEGIQWEAKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRV 455
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQ 712
C K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQ 504
Query: 713 NLLV 716
+L+
Sbjct: 505 KILI 508
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 346 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 393
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 394 CPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC---- 436
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 437 --KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 486
Query: 715 LV 716
L
Sbjct: 487 LT 488
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
L GII GG LC CS CN +V+ +FE HA + R + +I ENGK + +++ ++
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 315 VPLPMLKATLQSALSSLPEEKSF 337
PL M+ ++ S E+ F
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFF 89
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 72/182 (39%), Gaps = 43/182 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 346 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 393
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ M + + DA+ E +G ++ E + + C +C
Sbjct: 394 CPHCEKMGIQWE-AREDASEGEEDNEAGGEAEED------------DHHMEFCRVC---- 436
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 437 --KDGG---ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 486
Query: 715 LV 716
L
Sbjct: 487 LT 488
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 72/185 (38%), Gaps = 49/185 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 347 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 394
Query: 597 CKYCQNM---FERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 653
C +C+ M +E + DA+ E +G ++ E + + C +C+
Sbjct: 395 CPHCEKMGIQWEAR----EDASEGEEDNEAGGEAEED------------DHHMEFCRVCK 438
Query: 654 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVL 711
+L CD C +H+ CL L E+P G+W C C C + +
Sbjct: 439 DGG---------ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKV 484
Query: 712 QNLLV 716
Q +L
Sbjct: 485 QKILT 489
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 59.7 bits (143), Expect = 7e-06, Method: Composition-based stats.
Identities = 30/97 (30%), Positives = 37/97 (38%), Gaps = 46/97 (47%)
Query: 547 IICHCCNSEVSPSQFEAHA----------------------------------------- 565
I+C CC E+SPSQFE+HA
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 566 -----DGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 597
DGG+L+ C CPRAFH C L P+G W+C
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 40/166 (24%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 40/166 (24%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G + +E++ +LE E + CR
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRV 454
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 455 C---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 55/133 (41%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP+GDWYC C N +R
Sbjct: 1949 LLLCDGCDKGYHTYCFKPKMENIPEGDWYCHECMNKATGER------------------- 1989
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S SG ++LC+ C R +H C+ H +
Sbjct: 1990 --------------------NCIVC-GKKSSTSG---TRLILCELCPRAYHTDCI--HPI 2023
Query: 688 ADLRELPKGKWFC 700
+ ++P+GKW+C
Sbjct: 2024 --MHKVPRGKWYC 2034
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 10/86 (11%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQF------EAHADGGNLLPCDGC 576
DG + GY+ C + +E G CH C ++ + + ++ G L+ C+ C
Sbjct: 1953 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKATGERNCIVCGKKSSTSGTRLILCELC 2011
Query: 577 PRAFHKECAS--LSSIPQGDWYCKYC 600
PRA+H +C + +P+G WYC C
Sbjct: 2012 PRAYHTDCIHPIMHKVPRGKWYCSKC 2037
Score = 43.1 bits (100), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 703
++ C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1932 IMKANCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1983
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 70/155 (45%), Gaps = 31/155 (20%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A + G
Sbjct: 366 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS---EG 417
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 683
+ +E++ +LE E + CR C K G +L CD C +H+ CL
Sbjct: 418 EEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCLN 464
Query: 684 KHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 716
L E+P G+W C C C + +Q +L+
Sbjct: 465 ----PPLPEIPNGEWLCPRCT-CPALKGKVQKILI 494
>gi|321470558|gb|EFX81534.1| hypothetical protein DAPPUDRAFT_347174 [Daphnia pulex]
Length = 1890
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 53/136 (38%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDW+C C+N +R
Sbjct: 1623 LLLCDGCDKGYHTYCFRPPMDNIPDGDWFCYECRNKATGQR------------------- 1663
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI-LLCDQCEREFHVGCLKKHK 686
C++C G +TI +LCDQC + +H+ CL+
Sbjct: 1664 --------------------NCIVC-------GKPGNKTISVLCDQCPKAYHIECLQ--- 1693
Query: 687 MADLRELPKGKWFCCM 702
L ++P+GKW C +
Sbjct: 1694 -PPLAKVPRGKWLCVL 1708
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%), Gaps = 2/31 (6%)
Query: 573 CDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
CD CP+A+H EC L+ +P+G W C C
Sbjct: 1680 CDQCPKAYHIECLQPPLAKVPRGKWLCVLCH 1710
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 71/187 (37%), Gaps = 33/187 (17%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSS 589
A G + +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 373 ALGDEDGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELEK 420
Query: 590 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 649
P+G W C +C+ K +Q +A + + + + + + + C
Sbjct: 421 APEGKWSCPHCE-----KEGIQWEAKEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFC 475
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRIN 708
+C+ +L CD C +H+ CL L E+P G+W C C I
Sbjct: 476 RVCKDGG---------ELLCCDSCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIK 522
Query: 709 SVLQNLL 715
+Q +L
Sbjct: 523 GRVQKIL 529
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 68/166 (40%), Gaps = 40/166 (24%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 357 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 404
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRG 654
C +C+ K +Q +A + G +++E V + E E + CR
Sbjct: 405 SCPHCE-----KEGIQWEAKEDNS---EGEETMED-------AVGDAEEEDDHHMEFCRV 449
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 450 C---KDG---GELLCCDACPSSYHIHCLN----PPLPEIPNGEWLC 485
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 452 DGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGGDA------EEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 366 MDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 413
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 414 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 459
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 460 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 508
Query: 714 LLV 716
+L+
Sbjct: 509 ILI 511
>gi|195436452|ref|XP_002066182.1| GK22224 [Drosophila willistoni]
gi|194162267|gb|EDW77168.1| GK22224 [Drosophila willistoni]
Length = 3148
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2669 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2709
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2710 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2741
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2742 PPLLKVPRGKWYC 2754
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2723 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2757
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2651 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2698
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2616 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2656
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2657 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2688
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2689 PPLLKVPRGKWYC 2701
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2670 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2704
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2602 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2645
>gi|194883931|ref|XP_001976050.1| GG22641 [Drosophila erecta]
gi|190659237|gb|EDV56450.1| GG22641 [Drosophila erecta]
Length = 3148
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2662 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2702
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2703 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2734
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2735 PPLLKVPRGKWYC 2747
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2716 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2750
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2648 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2691
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2580 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2620
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2621 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2652
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2653 PPLLKVPRGKWYC 2665
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2634 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2668
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2566 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2609
>gi|198457110|ref|XP_001360553.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
gi|198135863|gb|EAL25128.2| GA10623 [Drosophila pseudoobscura pseudoobscura]
Length = 3214
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2732 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2772
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2773 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2804
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2805 PPLLKVPRGKWYC 2817
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2786 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2820
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2714 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2761
>gi|12642598|gb|AAK00302.1|AF314193_1 Toutatis [Drosophila melanogaster]
Length = 3109
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2602 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2642
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2643 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2674
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2675 PPLLKVPRGKWYC 2687
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2656 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2690
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2588 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2631
>gi|195582482|ref|XP_002081057.1| GD25895 [Drosophila simulans]
gi|194193066|gb|EDX06642.1| GD25895 [Drosophila simulans]
Length = 2944
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2466 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2506
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2507 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2538
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2539 PPLLKVPRGKWYC 2551
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2520 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2554
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2448 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2495
>gi|161076540|ref|NP_001097270.1| toutatis, isoform E [Drosophila melanogaster]
gi|157400285|gb|ABV53763.1| toutatis, isoform E [Drosophila melanogaster]
Length = 3131
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2653 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2693
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2694 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2725
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2726 PPLLKVPRGKWYC 2738
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2707 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2741
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2639 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2682
>gi|195485690|ref|XP_002091194.1| GE13512 [Drosophila yakuba]
gi|194177295|gb|EDW90906.1| GE13512 [Drosophila yakuba]
Length = 3129
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2648 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2688
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2689 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2720
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2721 PPLLKVPRGKWYC 2733
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2702 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2736
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2634 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2677
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRF 609
DGG LL CD CPRAFH C +S IP +WYCK C +R+R
Sbjct: 166 DGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMCSECLDRRRL 211
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 20/34 (58%), Positives = 23/34 (67%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG LL CDGCPRAFH C L IP+ +W+C C
Sbjct: 1201 GGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC 1234
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 560 QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+ ++H G+ GC R FH +CA L ++P DWYCK C+
Sbjct: 1248 RLDSHVICGSEDGTKGCDRVFHLKCAKLDAVPADDWYCKKCR 1289
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
++GG L+ CDGCP FH C L +P+G +C C
Sbjct: 1066 SEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 MDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|161076538|ref|NP_523701.3| toutatis, isoform A [Drosophila melanogaster]
gi|157400284|gb|AAF58638.3| toutatis, isoform A [Drosophila melanogaster]
Length = 2999
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2521 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2561
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2562 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2593
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2594 PPLLKVPRGKWYC 2606
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2575 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2609
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2503 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2550
>gi|195027235|ref|XP_001986489.1| GH20497 [Drosophila grimshawi]
gi|193902489|gb|EDW01356.1| GH20497 [Drosophila grimshawi]
Length = 3415
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2889 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2929
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2930 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2961
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2962 PPLLKVPRGKWYC 2974
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2943 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2977
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2875 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2918
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 67/170 (39%), Gaps = 30/170 (17%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ +GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 368 GEEEGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELDKAP 415
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL- 650
+G W C +C+ K +Q +A + I++ + + E +
Sbjct: 416 EGKWSCPHCE-----KEGIQWEAKDEDFEDFEEDSEDRVISEVSSGVPAGGDDEDDDHME 470
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CR C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 471 FCRVC---KDG---GELLCCDTCTSSYHIHCLN----PPLPEIPNGEWLC 510
Score = 39.7 bits (91), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 477 DGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 513
>gi|194752946|ref|XP_001958780.1| GF12391 [Drosophila ananassae]
gi|190620078|gb|EDV35602.1| GF12391 [Drosophila ananassae]
Length = 3047
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2554 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2594
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2595 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2626
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2627 PPLLKVPRGKWYC 2639
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2608 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2642
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2536 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2583
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 36/64 (56%), Gaps = 3/64 (4%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRV 622
DGG L+ CDGCPRAFH C L+SIP+G W C+ CQ N + + + A E
Sbjct: 300 DGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLCQSNRLKDRTYTHVQPPATETSSG 359
Query: 623 SGVD 626
S VD
Sbjct: 360 SAVD 363
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 505
Query: 715 LV 716
L+
Sbjct: 506 LI 507
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 88/397 (22%), Positives = 149/397 (37%), Gaps = 96/397 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEA 563
K + + D L D +V Y +K++ G GI C CCN+ V + FE
Sbjct: 370 KKHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFET 429
Query: 564 HA-------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL----QH 612
HA LL G P LS Q W + M R++ + Q
Sbjct: 430 HARCERPGQPWEKLLLMSGKP---------LSKCMQEAWAQERVTAMRAREKAMASLEQE 480
Query: 613 DANAVEAGR----------VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 662
+ +A R + GV V + R ++ KN + C C + G
Sbjct: 481 KEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG- 536
Query: 663 GPRTILLCDQCEREFHVGCL------KKHKMADLRELP----KGKW-------------- 698
+L CD C FH CL K + D ++L + W
Sbjct: 537 ---QLLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQ 593
Query: 699 --------------FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVS 742
+C C +++S L +++ + E + L I+K + L T
Sbjct: 594 YCRPLQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSE 650
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
D+ V +L E+ + L+ A+ + ++CF+P+ D + D++ VY G +
Sbjct: 651 DMPV---IL-------ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRV 700
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ G Y +L + ++S +LR G+++AE+P T
Sbjct: 701 NYEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGT 737
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 88/397 (22%), Positives = 149/397 (37%), Gaps = 96/397 (24%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEA 563
K + + D L D +V Y +K++ G GI C CCN+ V + FE
Sbjct: 372 KKHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFET 431
Query: 564 HA-------DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL----QH 612
HA LL G P LS Q W + M R++ + Q
Sbjct: 432 HARCERPGQPWEKLLLMSGKP---------LSKCMQEAWAQERVTAMRAREKAMASLEQE 482
Query: 613 DANAVEAGR----------VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 662
+ +A R + GV V + R ++ KN + C C + G
Sbjct: 483 KEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADGG- 538
Query: 663 GPRTILLCDQCEREFHVGCL------KKHKMADLRELP----KGKW-------------- 698
+L CD C FH CL K + D ++L + W
Sbjct: 539 ---QLLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQ 595
Query: 699 --------------FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLETVS 742
+C C +++S L +++ + E + L I+K + L T
Sbjct: 596 YCRPLQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVTSE 652
Query: 743 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 800
D+ V +L E+ + L+ A+ + ++CF+P+ D + D++ VY G +
Sbjct: 653 DMPV---IL-------ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRV 702
Query: 801 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ G Y +L + ++S +LR G+++AE+P T
Sbjct: 703 NYEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGT 739
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 383
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 429
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 430 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 479
Query: 715 LV 716
L+
Sbjct: 480 LI 481
>gi|195150317|ref|XP_002016101.1| GL10676 [Drosophila persimilis]
gi|194109948|gb|EDW31991.1| GL10676 [Drosophila persimilis]
Length = 3244
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2885 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2925
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2926 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2957
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2958 PPLLKVPRGKWYC 2970
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2939 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2973
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2871 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFKPK----MDNIPDGDWYC 2914
>gi|195382825|ref|XP_002050129.1| GJ21968 [Drosophila virilis]
gi|194144926|gb|EDW61322.1| GJ21968 [Drosophila virilis]
Length = 3086
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2597 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2637
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2638 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2669
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2670 PPLLKVPRGKWYC 2682
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2651 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2685
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2580 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2626
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 336 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 383
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 384 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 429
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 430 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 478
Query: 714 LLV 716
+L+
Sbjct: 479 ILI 481
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 356 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 403
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 404 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 449
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 450 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 498
Query: 714 LLV 716
+L+
Sbjct: 499 ILI 501
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 355 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 402
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 403 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 448
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 449 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 497
Query: 714 LLV 716
+L+
Sbjct: 498 ILI 500
>gi|195123885|ref|XP_002006432.1| GI21040 [Drosophila mojavensis]
gi|193911500|gb|EDW10367.1| GI21040 [Drosophila mojavensis]
Length = 2976
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2632 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 2672
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 2673 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 2704
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 2705 PPLLKVPRGKWYC 2717
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2686 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2720
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 4/52 (7%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2614 SIMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2661
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 343 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 390
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 391 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 436
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 437 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 486
Query: 715 LV 716
L+
Sbjct: 487 LI 488
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|195333469|ref|XP_002033414.1| GM20421 [Drosophila sechellia]
gi|194125384|gb|EDW47427.1| GM20421 [Drosophila sechellia]
Length = 2123
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 1645 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 1685
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 1686 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 1717
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 1718 PPLLKVPRGKWYC 1730
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 20/91 (21%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVS-----------PSQFEAHADGGNLL 571
DG + GY+ C + ++ +G C N + PS G ++
Sbjct: 1649 DGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNERKCIVCGGHRPSPV------GKMI 1702
Query: 572 PCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
CD CPRA+H +C L +P+G WYC C
Sbjct: 1703 YCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 1733
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 4/51 (7%)
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 1628 IMKANCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 1674
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 383 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 430
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 431 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 476
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 477 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 526
Query: 715 LV 716
L+
Sbjct: 527 LI 528
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/351 (21%), Positives = 123/351 (35%), Gaps = 95/351 (27%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 597
LEG GI C CC+ ++ ++FE HA P + A+LS W
Sbjct: 655 LEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENI--FLEDGGATLSQCLVDAWKK 712
Query: 598 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 657
Q+ E+K F ++ + + C +C
Sbjct: 713 ---QSQSEKKGFY--------------------------KVDPGDDPDDDTCGIC----- 738
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 717
G G +L CD C FH+ CL ++P G W C R Q +
Sbjct: 739 ---GDGG-DLLCCDNCPSTFHLACLG-------IKMPSGDWHCSSCICRFCGSTQEITTS 787
Query: 718 EAEKL--------------PEFHLNAIKKYAGNS---------------------LETVS 742
AE L P +++K + +S ++
Sbjct: 788 SAELLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVKNAI 847
Query: 743 DIDVRWRLL----SGKAATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSM 791
+ W L+ AA P+ + L + A ++ +CF P +D SG ++I ++
Sbjct: 848 EAGFSWSLVRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNV 907
Query: 792 VY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+Y G + F Y IL V+SA +R+ G ++AE+P + T I
Sbjct: 908 IYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGI 958
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/182 (25%), Positives = 73/182 (40%), Gaps = 39/182 (21%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 413 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 460
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 461 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 506
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C + +Q +
Sbjct: 507 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKI 556
Query: 715 LV 716
L+
Sbjct: 557 LI 558
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 115/496 (23%), Positives = 181/496 (36%), Gaps = 130/496 (26%)
Query: 410 KKTRKLLEADL------VSKSSSKSVSLRNLLKTRSPW--ELTRNSSRPGLIANSTPVT- 460
+KT+ +E D+ S S K+ RN L ++ + +R+SS+ + +P+
Sbjct: 437 RKTKSKIEKDMKRELHSASDSDGKATFARNFLAIKNEVGNDDSRDSSQGTTSKSESPLHH 496
Query: 461 SVHKSSQSQRQR----KITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 516
KS+ S R K +K + T+L+ + N P ++ L
Sbjct: 497 QTEKSTGSSSHRVDGGKSSKHGRSTLLVRRSVRG-------DNSESDGFVPSSEKRTVLA 549
Query: 517 F-DESGLPDGTEVGYYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLP 572
+ +SG +E Y ++ +LEG+ GI C CC+ ++ S+FE HA P
Sbjct: 550 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAGSKLRQP 609
Query: 573 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 632
QN+F N+ AG + G SV+ I
Sbjct: 610 F---------------------------QNIF---------LNSGGAGNI-GFCSVDVIA 632
Query: 633 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL--KKHKMADL 690
+ C +C G G ++ CD C FH CL + H M D
Sbjct: 633 D---------DPNDDACGIC--------GDGG-DLVCCDGCPSTFHQRCLDIRGHLMPDW 674
Query: 691 RELPKGKWFCCMDCSRI------NSVLQ------NLLVQEAEKLPEFHLN-AIKKYAGNS 737
L + + C + I NSV Q L VQ K +L+ +KKY G
Sbjct: 675 IFL-RFNYRCFLLVIGIAPIVHANSVRQLLKMLLRLWVQIPAKCVRKNLSEGVKKYVGVK 733
Query: 738 LETVSDIDVRWRL-----------LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG-- 784
E + W L LSG E L+ A+ + +CF PI+D SG
Sbjct: 734 HEL--EAGFSWSLVHRECTNSDLSLSGHPHIVENNSKLALALTVMDECFLPIIDRRSGHC 791
Query: 785 -----RDLIPSMVYG---------------RNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
R+ + +G N FGG Y A+L +V++ +R
Sbjct: 792 KKFCLRNFTTVIFFGISLCWFVCLYIAFRRSNFNRLNFGGFYTALLERGDEIVASASIRF 851
Query: 825 FGQEVAELPLVATSKI 840
G +AE+P + T +
Sbjct: 852 HGNRLAEMPFIGTRHV 867
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/351 (21%), Positives = 123/351 (35%), Gaps = 95/351 (27%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 597
LEG GI C CC+ ++ ++FE HA P + A+LS W
Sbjct: 654 LEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENI--FLEDGGATLSQCLVDAWKK 711
Query: 598 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 657
Q+ E+K F ++ + + C +C
Sbjct: 712 ---QSQSEKKGFY--------------------------KVDPGDDPDDDTCGIC----- 737
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 717
G G +L CD C FH+ CL ++P G W C R Q +
Sbjct: 738 ---GDGG-DLLCCDNCPSTFHLACLG-------IKMPSGDWHCSSCICRFCGSTQEITTS 786
Query: 718 EAEKL--------------PEFHLNAIKKYAGNS---------------------LETVS 742
AE L P +++K + +S ++
Sbjct: 787 SAELLSCLQCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVKNAI 846
Query: 743 DIDVRWRLL----SGKAATPETRLLL-------SQAVAIFHDCFDPIVDSISGRDLIPSM 791
+ W L+ AA P+ + L + A ++ +CF P +D SG ++I ++
Sbjct: 847 EAGFSWSLVRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNV 906
Query: 792 VY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 840
+Y G + F Y IL V+SA +R+ G ++AE+P + T I
Sbjct: 907 IYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGI 957
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 504
Query: 714 LLV 716
+L+
Sbjct: 505 ILI 507
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 64/164 (39%), Gaps = 40/164 (24%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 420
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A R D+ E + +E + CR C
Sbjct: 421 CPHCE-----KEGIQWEA------REDVSDAEEDNGE-----TGEMEEDDHHMEFCRVC- 463
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 464 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWIC 498
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 501
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 66/164 (40%), Gaps = 36/164 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 297 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 344
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A +S + ++ +R + + + + C +C+
Sbjct: 345 CPHCE-----KEGIQWEAR----DDLSDGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG 395
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+L CD C +H+ CL L E+P G+W C
Sbjct: 396 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 426
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 393 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 430
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 64/154 (41%), Gaps = 30/154 (19%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A
Sbjct: 370 GGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-----KEGVQWEAK--------- 415
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ ++ V + E E CR C K G +L CD C +H+ CL
Sbjct: 416 -EDNSELDDDLDDAVGDPEEEDHHMEFCRVC---KDG---GELLCCDVCPSSYHIHCLN- 467
Query: 685 HKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 716
L E+P G+W C C C + +Q +L
Sbjct: 468 ---PPLPEIPNGEWLCPRCT-CPPLKGKIQKILT 497
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 59/143 (41%), Gaps = 24/143 (16%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
D LL CD C + FH C +LS +P+GDW C C+ R+ D + G S
Sbjct: 1100 DEDKLLLCDECNQPFHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDYAELNGG--S 1157
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 683
D ++ + E+E +C CD + ++ C +C +H C
Sbjct: 1158 DSDEYDE--------TDSDESEAEHDEMCCMCD------DDQELVYCSRCPAAYHRECHD 1203
Query: 684 KHKMADLRELPKGKWFC--CMDC 704
LR P+GKW C C +C
Sbjct: 1204 ----PPLRNFPRGKWVCSACTNC 1222
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 10/57 (17%)
Query: 644 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
AE + C +CR K G + +LLCD+C + FH+ CL+ L +PKG W C
Sbjct: 1087 AENAKCKICR-----KKGDEDK-LLLCDECNQPFHLYCLR----PALSYVPKGDWMC 1133
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 205 LLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------------------- 245
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G S G ++ CD C R +H C +
Sbjct: 246 --------------------KCIVCGGHRPSPVG----KMIYCDLCPRAYHADCY----I 277
Query: 688 ADLRELPKGKWFC 700
L ++P+GKW+C
Sbjct: 278 PPLLKVPRGKWYC 290
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 259 GKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 293
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKP----KMDNIPDGDWYC 234
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 281 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 328
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 329 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 374
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 375 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 423
Query: 714 LLV 716
+L+
Sbjct: 424 ILI 426
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 10/135 (7%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
G NLL D +K A+ + +G C YCQ+ ER RF QH+ + + G
Sbjct: 164 GDNLLSLDH--NDIYKTGATYDN-DKGILLCDYCQSTVERNRFGQHEELLI--CKDCGNK 218
Query: 627 SVEQITKRCIRIVKNLEAELSG-CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
+ +V+ + ++ S C+ C+ C + P T+L CD C++ +H+ C +
Sbjct: 219 AHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDKGYHMNCHE-- 276
Query: 686 KMADLRELPKGKWFC 700
L ++P GKW C
Sbjct: 277 --PKLTQMPSGKWAC 289
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 67/155 (43%), Gaps = 27/155 (17%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 622
+ GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A +
Sbjct: 288 SQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS--- 339
Query: 623 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 682
G + +E++ E + CR C K G +L CD C +H+ CL
Sbjct: 340 EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCL 387
Query: 683 KKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 716
L E+P G+W C C + +Q +L+
Sbjct: 388 N----PPLPEIPNGEWLCPRCTCPSLKGKVQKILI 418
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 66/153 (43%), Gaps = 27/153 (17%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A + G
Sbjct: 233 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS---EG 284
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ +E++ E + CR C K G +L CD C +H+ CL
Sbjct: 285 EEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHCLN- 331
Query: 685 HKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 716
L E+P G+W C C + +Q +L+
Sbjct: 332 ---PPLPEIPNGEWLCPRCTCPALKGKVQKILI 361
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 66/165 (40%), Gaps = 38/165 (23%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 362 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 409
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 410 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 455
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 456 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 457 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/177 (25%), Positives = 72/177 (40%), Gaps = 28/177 (15%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKL 722
+L+CD+C FH C+ L P+G WFC C C + L
Sbjct: 423 LLMCDRCPSMFHHACVG------LESTPQGDWFCPACTCAICG-------------SSDL 463
Query: 723 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 782
+ + + +S R G+ E L A+ + +CF +++
Sbjct: 464 DDPPATTTTQGFSSDRMVISCEQCRRESRDGEE---EEHAKLCMALDVLRECFVTLIEPR 520
Query: 783 SGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
+ DL +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 521 TQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 577
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 21/35 (60%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
D G LL CD CP FH C L S PQGDW+C C
Sbjct: 419 DCGELLMCDRCPSMFHHACVGLESTPQGDWFCPAC 453
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 28/35 (80%)
Query: 690 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 724
L+ELPKG WFC MDC+RINS LQ LL+ AEKL +
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSD 75
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 21/30 (70%), Positives = 27/30 (90%)
Query: 808 AILTVNSSVVSAGILRVFGQEVAELPLVAT 837
I+ N++VV+AG+LRVFG+EVAELPLVAT
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVAT 108
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 69/152 (45%), Gaps = 38/152 (25%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE------------RKRFLQHD 613
+GG ++ CDGC R FH C ++ +P+G YCK+C R++ L+
Sbjct: 1085 EGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCSEGDTKGAEEKSVGGDGRRQSLRLS 1144
Query: 614 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG-----CLLCRGCDFSKSGFGPRTIL 668
A+ GR V+ ++I R + LE+ G C +C+ +G +L
Sbjct: 1145 AD----GRHDDVEENDEI--RMKSSNRELESGAVGPWDVECFICK-------LYGE--LL 1189
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CD C + FH+ C+ ++ P+ +WFC
Sbjct: 1190 GCDGCPKAFHLACI------GIKSWPQEEWFC 1215
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 54/133 (40%), Gaps = 49/133 (36%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
G LL CDGCP+AFH C + S PQ +W+C C ++ V G +
Sbjct: 1186 GELLGCDGCPKAFHLACIGIKSWPQEEWFCDECD---------------MQTCGVCGRNK 1230
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
+ +L+ ++C D SK C++ FH+ C+K
Sbjct: 1231 I----------------KLNSHVICGSEDGSKG------------CDKVFHLKCVK---- 1258
Query: 688 ADLRELPKGKWFC 700
L ++P+ WFC
Sbjct: 1259 --LEKVPESDWFC 1269
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 26/56 (46%), Gaps = 21/56 (37%)
Query: 566 DGGNLLPCDGCPRAFH-----------------KECASL----SSIPQGDWYCKYC 600
DGG LL CD CPRAFH ASL IP+ +WYCK+C
Sbjct: 141 DGGELLCCDRCPRAFHLKWYVGCFPSAVVAHQASRYASLGLQKEEIPESEWYCKFC 196
>gi|334187637|ref|NP_568273.2| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
gi|332004422|gb|AED91805.1| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
Length = 1602
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 553 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 603
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 596 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 653
Query: 604 FERK 607
F+R+
Sbjct: 654 FDRR 657
>gi|14586370|emb|CAC42901.1| putative protein [Arabidopsis thaliana]
Length = 1595
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 553 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 603
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 596 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 653
Query: 604 FERK 607
F+R+
Sbjct: 654 FDRR 657
>gi|307169034|gb|EFN61879.1| Bromodomain adjacent to zinc finger domain protein 2B [Camponotus
floridanus]
Length = 2352
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 59/153 (38%), Gaps = 53/153 (34%)
Query: 552 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2062 TTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2121
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
CL+C ++G + ++
Sbjct: 2122 ---------------------------------------NCLVC----GKRAG---KNLV 2135
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2136 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2164
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2085 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRAGKNLVLCELCPRA 2143
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 2144 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2174
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 630 QITKRCIRI-VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 688
++ RC+ + N +L C F SG +LLCD C+R +H C +
Sbjct: 2043 KLRNRCVSLKATNQYKQLLTTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----P 2098
Query: 689 DLRELPKGKWFC--CMD 703
+ +P G W+C CM+
Sbjct: 2099 KMENIPDGDWYCHECMN 2115
>gi|297807283|ref|XP_002871525.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
gi|297317362|gb|EFH47784.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
Length = 1581
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 40/64 (62%), Gaps = 11/64 (17%)
Query: 553 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNM 603
N+EV + F+ ++D G+LL CDGCP A+H +C L+S +P+GDWYC C
Sbjct: 593 NNEVIDTSFDRNSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--A 650
Query: 604 FERK 607
F+R+
Sbjct: 651 FDRR 654
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 74/183 (40%), Gaps = 41/183 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDW 595
++GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 107 VDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKW 154
Query: 596 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGC 655
C +C+ K +Q +A + G + +E++ E + CR C
Sbjct: 155 SCPHCE-----KEGIQWEAKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC 200
Query: 656 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQN 713
K G +L CD C +H+ CL L E+P G+W C C C + +Q
Sbjct: 201 ---KDG---GELLCCDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQK 249
Query: 714 LLV 716
+L+
Sbjct: 250 ILI 252
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 72/182 (39%), Gaps = 44/182 (24%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 405 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 452
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E G D+ + V +E + CR C
Sbjct: 453 CPHCE-----KEGIQWEAR--EEGSEGEDDNGD---------VGEMEDD-HHMEFCRVC- 494
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 495 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSLKGKVQRI 544
Query: 715 LV 716
L
Sbjct: 545 LT 546
>gi|332022570|gb|EGI62872.1| Bromodomain adjacent to zinc finger domain protein 2B [Acromyrmex
echinatior]
Length = 2202
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 52/134 (38%), Gaps = 52/134 (38%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1931 LLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------------------- 1971
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
CL+C ++G + ++LC+ C R +H C H
Sbjct: 1972 --------------------NCLVC----GKRAG---KNLVLCELCPRAYHTDC---HNP 2001
Query: 688 ADLRELPKGKWFCC 701
+ ++P+GKW+C
Sbjct: 2002 V-MPKMPRGKWYCS 2014
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 1935 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRAGKNLVLCELCPRA 1993
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 1994 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2024
Score = 43.9 bits (102), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 642 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 699
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1904 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 1959
Query: 700 C--CMD 703
C CM+
Sbjct: 1960 CHECMN 1965
>gi|340709835|ref|XP_003393506.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Bombus terrestris]
Length = 2263
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 53/153 (34%)
Query: 552 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1973 TTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2032
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
CL+C G K+ ++
Sbjct: 2033 ---------------------------------------NCLVC-GKRVGKN------LV 2046
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2047 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2075
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 1996 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2054
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 2055 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2085
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 654 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 703
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 1979 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2026
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 45/76 (59%), Gaps = 2/76 (2%)
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGI 821
++ AVA+ +CF+P++D + +++ S++Y G N F G Y AIL +S
Sbjct: 22 IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVAS 81
Query: 822 LRVFGQEVAELPLVAT 837
+R+ G ++AE+P +AT
Sbjct: 82 MRIHGNKLAEMPFIAT 97
>gi|328792710|ref|XP_623473.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis mellifera]
Length = 2293
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 53/153 (34%)
Query: 552 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2003 TTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2062
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
CL+C G K+ ++
Sbjct: 2063 ---------------------------------------NCLVC-GKRVGKN------LV 2076
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2077 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2105
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2026 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2084
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 2085 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2115
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 6/51 (11%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 703
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2010 CQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2056
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 759 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG--QEFGGMYCAILTVNSSV 816
E R+ LS A+ + H C++P+ DS +G D++P ++ G L G ++ GM+ A+L +
Sbjct: 420 ELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGPAA 479
Query: 817 VSAGILRVFGQEVAELPLVAT 837
V+ + R FG ++AE+P++A
Sbjct: 480 VAVAVFRSFG-DLAEVPVLAV 499
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 12/54 (22%)
Query: 548 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+CH C GG+L+ C+ CP FH C L++ P+GD++C C+
Sbjct: 121 LCHICGL------------GGDLMCCETCPGVFHAACLGLAAPPEGDYHCPLCR 162
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 51/134 (38%), Gaps = 52/134 (38%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2001 LLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------------------- 2041
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
CL+C G K+ ++LC+ C R +H C H
Sbjct: 2042 --------------------NCLVC-GKRVGKN------LVLCELCPRAYHTDC---HNP 2071
Query: 688 ADLRELPKGKWFCC 701
+ ++P+GKW+C
Sbjct: 2072 V-MPKMPRGKWYCS 2084
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2005 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2063
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 2064 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2094
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 642 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 699
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1974 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 2029
Query: 700 C--CMD 703
C CM+
Sbjct: 2030 CHECMN 2035
>gi|350407087|ref|XP_003487980.1| PREDICTED: hypothetical protein LOC100749908 [Bombus impatiens]
Length = 2303
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 58/153 (37%), Gaps = 53/153 (34%)
Query: 552 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
S+ S QF D + LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2013 TTSQASNCQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER 2072
Query: 609 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
CL+C G K+ ++
Sbjct: 2073 ---------------------------------------NCLVC-GKRVGKN------LV 2086
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 701
LC+ C R +H C H + ++P+GKW+C
Sbjct: 2087 LCELCPRAYHTDC---HNPV-MPKMPRGKWYCS 2115
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQ---FEAHADGGNLLPCDGCPRA 579
DG + GY+ C + +E +G CH C ++ + + G NL+ C+ CPRA
Sbjct: 2036 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKATGERNCLVCGKRVGKNLVLCELCPRA 2094
Query: 580 FHKECAS--LSSIPQGDWYCKYCQNMFERKR 608
+H +C + + +P+G WYC C + +KR
Sbjct: 2095 YHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2125
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 654 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 703
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2019 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2066
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 72/182 (39%), Gaps = 44/182 (24%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 249 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGTWS 296
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E G D+ + V +E + CR C
Sbjct: 297 CPHCE-----KEGIQWEAR--EEGSEGDEDNGD---------VGEMEDD-HHMEFCRVC- 338
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
K G +L CD C +H+ CL L E+P G+W C C C + +Q +
Sbjct: 339 --KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKI 388
Query: 715 LV 716
L
Sbjct: 389 LT 390
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 66/164 (40%), Gaps = 36/164 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 378 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 425
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A +S + ++ +R + + + + C +C+
Sbjct: 426 CPHCE-----KEGIQWEAK----DELSEGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG 476
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+L CD C +H+ CL L E+P G+W C
Sbjct: 477 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 507
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 474 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 511
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 320 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 367
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 368 CPHCE-----KEGIQWEAKDDEEEEEEAPGEEEDDHMEFCRVCKD-----GGELLC---- 413
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 414 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 455
Query: 715 L 715
L
Sbjct: 456 L 456
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 344 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 391
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 392 CPHCE-----KEGIQWEAKDDEEEEDEVAGEEEDDHMEFCRVCKD-----GGELLC---- 437
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 438 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 479
Query: 715 L 715
L
Sbjct: 480 L 480
>gi|224090647|ref|XP_002309044.1| predicted protein [Populus trichocarpa]
gi|222855020|gb|EEE92567.1| predicted protein [Populus trichocarpa]
Length = 467
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
KK ++K +KK+ N P V L TG+LDGV V Y+ Q LRG+I+ G LC
Sbjct: 378 KKKDDIKTAKKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWS--QEKELRGVIKGSGYLCG 435
Query: 268 CSLCNGCRVI 277
C CN +VI
Sbjct: 436 CQTCNFSKVI 445
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 57/136 (41%), Gaps = 26/136 (19%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C + P+G W C +C+ + + ++ E G
Sbjct: 268 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVG 327
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D E+ ++E CR C K G +L CD C +H+ CL
Sbjct: 328 GDPEEEDD-------HHME-------FCRVC---KDG---GELLCCDPCPSSYHIHCLN- 366
Query: 685 HKMADLRELPKGKWFC 700
L E+P G+W C
Sbjct: 367 ---PPLPEIPNGEWLC 379
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG LL CD CP ++H C L IP G+W C +C
Sbjct: 346 DGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQ 611
DGG L+ CDGCPRAFH C LSSIP G W C++C+ +K Q
Sbjct: 268 DGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCRGHRVKKEKAQ 315
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 65/164 (39%), Gaps = 36/164 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 379 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWS 426
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A +S + + +R + + + + C +C+
Sbjct: 427 CPHCE-----KEGIQWEAR----DDLSEAEGEDDDDRRDEGMEEEDDHHIEFCRVCKDGG 477
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+L CD C +H+ CL L E+P G+W C
Sbjct: 478 ---------ELLCCDTCPSSYHIHCLN----PPLPEIPNGEWIC 508
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 475 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 512
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 67/181 (37%), Gaps = 48/181 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 322 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELEKAPEGKWS 369
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E E R+ K+ G LLC
Sbjct: 370 CPHCE-----KEGIQWEAKGEEEEEEEAAGEEEDDHMEFCRVCKD-----GGELLC---- 415
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 416 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 457
Query: 715 L 715
L
Sbjct: 458 L 458
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 68/181 (37%), Gaps = 48/181 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 497 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 544
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A E V E R+ K+ G LLC
Sbjct: 545 CPHCE-----KEGIQWEAKDEEEDEEEPVGEEEDDHMEFCRVCKD-----GGELLC---- 590
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNL 714
CD C +H+ CL L E+P G+W C CM C + +Q +
Sbjct: 591 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKI 632
Query: 715 L 715
L
Sbjct: 633 L 633
>gi|380485054|emb|CCF39608.1| origin recognition complex subunit 4 [Colletotrichum higginsianum]
Length = 863
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 8/73 (10%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
G +L CDGC +A+H++C + +P+GDWYC C Q + A AG +
Sbjct: 359 GNRILFCDGCDKAYHQKCYKVPKVPRGDWYCNEC--------VQQKQSRAAAAGEAVKIP 410
Query: 627 SVEQITKRCIRIV 639
+VEQ R++
Sbjct: 411 NVEQHLNSLKRVL 423
Score = 43.1 bits (100), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 18/88 (20%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 706
CL+C D SK G IL CD C++ +H C K K +P+G W+C C+ +
Sbjct: 348 CLVCSKPD-SKPG---NRILFCDGCDKAYHQKCYKVPK------VPRGDWYCNECVQQKQ 397
Query: 707 INSVLQNLLVQEAEKLP--EFHLNAIKK 732
+ EA K+P E HLN++K+
Sbjct: 398 SRAAAAG----EAVKIPNVEQHLNSLKR 421
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 767 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 824
AV I H+CF I++ + D+ +V+ R LR F G Y +L +VS G R+
Sbjct: 11 AVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGTFRI 70
Query: 825 FGQEVAELPLVATSKI 840
GQ+ AELPL+ T +
Sbjct: 71 CGQKFAELPLIGTRSL 86
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 66/170 (38%), Gaps = 42/170 (24%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ--------------NMFERKRF 609
DGG+LL CD C ++FH C + L IP+GDWYC C+ + K +
Sbjct: 74 DGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSCKYKKSKTNVTKSPSTTIINNKTY 133
Query: 610 LQHDANAVE---------------AGRVSGVDSVEQITKRCIRI----VKNLEAELSGCL 650
+ + E + S VD++ + ++ KN+ S L
Sbjct: 134 FKESEQSPEEMSPPYLPISSSPIGSTMSSLVDNLSSVNPSTFQLPQEYTKNVNRNSSKKL 193
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C C+ S S G IL C++C +H C+ + + W C
Sbjct: 194 NCLVCEES-SNSG--DILQCNKCNAAYHSTCVDSSSLGNKT----SAWLC 236
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 44/126 (34%), Gaps = 50/126 (39%)
Query: 552 CNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 611
C+ E S DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 459 CSEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR---------- 508
Query: 612 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTIL 668
C +C G D + GF +TI+
Sbjct: 509 -------------------------------------CGVCGGSDLDDDTAEGFTDKTII 531
Query: 669 LCDQCE 674
C+QCE
Sbjct: 532 YCEQCE 537
>gi|345489409|ref|XP_003426132.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Nasonia vitripennis]
Length = 1407
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 60/154 (38%), Gaps = 32/154 (20%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-----------------QNMFER 606
D N+L CDGC + H C L+S+P GDW+C C ++ E
Sbjct: 1075 DAENMLLCDGCNKGHHLYCLKPKLTSVPAGDWFCHLCKPRETKAKEKAKKRRKFEDEIEE 1134
Query: 607 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 666
L + A RV V+S E ++ E E LC C+
Sbjct: 1135 DTTLTKETRHNRAKRV--VESDEDAEADSDE-NQDEEMEHETTQLCSVCE------SDGK 1185
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ CD C + FH CL+ L P+G+W C
Sbjct: 1186 LIECDMCSKFFHTDCLE----PPLARAPRGRWSC 1215
>gi|345489407|ref|XP_001604290.2| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Nasonia vitripennis]
Length = 1443
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 60/154 (38%), Gaps = 32/154 (20%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-----------------QNMFER 606
D N+L CDGC + H C L+S+P GDW+C C ++ E
Sbjct: 1075 DAENMLLCDGCNKGHHLYCLKPKLTSVPAGDWFCHLCKPRETKAKEKAKKRRKFEDEIEE 1134
Query: 607 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 666
L + A RV V+S E ++ E E LC C+
Sbjct: 1135 DTTLTKETRHNRAKRV--VESDEDAEADSDE-NQDEEMEHETTQLCSVCE------SDGK 1185
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ CD C + FH CL+ L P+G+W C
Sbjct: 1186 LIECDMCSKFFHTDCLE----PPLARAPRGRWSC 1215
>gi|255544948|ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis]
gi|223547443|gb|EEF48938.1| hypothetical protein RCOM_1578820 [Ricinus communis]
Length = 1915
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 4/61 (6%)
Query: 568 GNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNL+ CDGCP A+H +C ++ S+P+GDW+C C +R + N++ + GV
Sbjct: 741 GNLICCDGCPAAYHSKCVGVANDSLPEGDWFCPEC--AIDRHKPWMKTRNSLRGAELLGV 798
Query: 626 D 626
D
Sbjct: 799 D 799
>gi|383863769|ref|XP_003707352.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Megachile rotundata]
Length = 1448
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 35/161 (21%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 622
D +L CDGC + H C LS++P+GDWYCK C+ + K L+ E
Sbjct: 1082 GDAEKMLLCDGCNKGHHLYCLKPKLSTVPEGDWYCKVCKPPIKSKEKLKQRKKFEEELEE 1141
Query: 623 SGVDSVEQITKRCIRIVKNLEA-------------------ELSGCLLCRGCDFSKSGFG 663
+ + E R RI+++ +++ C CR SG
Sbjct: 1142 EVILTKETRHNRAKRILESEGEEERDDDELEEDSDMDISSQQVNVCTACR------SG-- 1193
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
++ CD C +H+ C++ + P+GKW C DC
Sbjct: 1194 -GKLISCDACSSYYHIECIE----PPIARAPRGKW-SCSDC 1228
>gi|348690302|gb|EGZ30116.1| hypothetical protein PHYSODRAFT_474458 [Phytophthora sojae]
Length = 239
Score = 53.5 bits (127), Expect = 5e-04, Method: Composition-based stats.
Identities = 21/38 (55%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 567 GGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 602
GG LL CDGC RA+H C SL +P+GDW+C YC++
Sbjct: 200 GGKLLCCDGCERAYHLNCVRPSLLDVPEGDWFCPYCRD 237
Score = 39.7 bits (91), Expect = 7.6, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 705
+L CD CER +H+ C++ L ++P+G WFC C D S
Sbjct: 203 LLCCDGCERAYHLNCVR----PSLLDVPEGDWFCPYCRDAS 239
>gi|350410934|ref|XP_003489181.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Bombus impatiens]
Length = 1416
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 611
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 612 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSQQINVCCICKSG---GKLI 1195
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 45.8 bits (107), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG L+ CD C +H EC L+ P+G W C C++ +RK +++D++ E
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKDRKTNIRYDSSTSE------ 1244
Query: 625 VDSVEQITKRCIRIVKNLEAE-----LSGCL 650
D+ + T+R + +E E + GC+
Sbjct: 1245 -DTEPRQTRRAAKRAAEIEQEEDKGTIKGCM 1274
>gi|380025897|ref|XP_003696700.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Apis florea]
Length = 1447
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 64/157 (40%), Gaps = 29/157 (18%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-------QNMFERKRF------- 609
DG +L CDGC + H C LS +P GDWYCK C + + +RK+F
Sbjct: 1075 DGDKMLLCDGCNKGHHLYCLQPKLSCVPDGDWYCKVCKPSTKPKEKIXKRKKFEDELEED 1134
Query: 610 --LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 667
L + A RV + + + + +C C KSG +
Sbjct: 1135 VILTKETRHNRAKRVLESEEEDNSEDEELEEDSDDNISNQQINVCSAC---KSG---GKL 1188
Query: 668 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
+ CD C +H+ C++ + P+G+W C DC
Sbjct: 1189 ISCDICPNFYHIECIE----PPITRAPRGRWICS-DC 1220
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 612
GG L+ CD CP +H EC ++ P+G W C C++ +RK +++
Sbjct: 1185 GGKLISCDICPNFYHIECIEPPITRAPRGRWICSDCKDRRDRKMNIKY 1232
>gi|341902027|gb|EGT57962.1| hypothetical protein CAEBREN_23443 [Caenorhabditis brenneri]
Length = 642
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 630 QITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 689
++ R +R+VK+ E C+ CR C +I+ CD C+R FH C A
Sbjct: 556 EMPDRMVRVVKSYEW---NCIECRTCSICHKKDNEDSIVSCDWCDRAFHYLC------AG 606
Query: 690 LRELPKGKWFCCMDCS 705
LR +P+G W C + CS
Sbjct: 607 LRAMPRGMWMCQVYCS 622
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 13/54 (24%)
Query: 548 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK-YC 600
ICH ++E S ++ CD C RAFH CA L ++P+G W C+ YC
Sbjct: 580 ICHKKDNEDS------------IVSCDWCDRAFHYLCAGLRAMPRGMWMCQVYC 621
>gi|350410937|ref|XP_003489182.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Bombus impatiens]
Length = 1454
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 611
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 612 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSQQINVCCICKSG---GKLI 1195
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 40.0 bits (92), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 612
GG L+ CD C +H EC L+ P+G W C C++ +RK +++
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKDRKTNIRY 1238
>gi|340714616|ref|XP_003395822.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 1 [Bombus terrestris]
Length = 1416
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 611
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 612 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSRQINVCCICKSG---GKLI 1195
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG L+ CD C +H EC L+ P+G W C C++ ERK +++D++ E
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKERKTNIRYDSSTSE------ 1244
Query: 625 VDSVEQITKRCIRIVKNLEAE-----LSGCL 650
D+ + T+R + +E E + GC+
Sbjct: 1245 -DTEPRQTRRAAKRAAEIEQEEDKGTIKGCM 1274
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 53/136 (38%), Gaps = 37/136 (27%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C + P G W C +C+N L +D +AV + +
Sbjct: 348 GGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCEND------LVNDNDAVTSKEAAP 401
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ + C LCR +L CD C +H CL
Sbjct: 402 AKA----------------GNMEFCRLCRDGG---------ELLCCDSCPSSYHRYCL-- 434
Query: 685 HKMADLRELPKGKWFC 700
+ L +P+G W C
Sbjct: 435 --IPPLTTIPEGDWHC 448
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG LL CD CP ++H+ C L++IP+GDW+C C
Sbjct: 415 DGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 68/180 (37%), Gaps = 42/180 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
EGY+ C C GG ++ CD CPRA+H C L PQG W
Sbjct: 394 EGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLEPELERAPQGKWS 441
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ V+ + + KR R ++ E CR C
Sbjct: 442 CPHCEK------------EGVQWEAKELEEEEMEEPKRERREEEDDHME-----FCRVC- 483
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L ++P G+W C C ++ +Q +L
Sbjct: 484 --KDG---GELLCCDACVSSYHIHCLN----PPLPDIPHGEWLCPRCTCPQLKGKVQKIL 534
>gi|340714618|ref|XP_003395823.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
isoform 2 [Bombus terrestris]
Length = 1454
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 62/156 (39%), Gaps = 27/156 (17%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ------------NMFERKRFLQ 611
DG +L CDGC + H C L+S+P GDWYCK C+ FE + L+
Sbjct: 1081 DGDKMLLCDGCNKGHHLYCLQPKLNSVPDGDWYCKVCKPPTKPKEKIKKRKKFEDE--LE 1138
Query: 612 HD---ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 668
D R V E+ ++ + G C KSG ++
Sbjct: 1139 EDVILTKETRHNRAKRVLESEEEGDSVDEELEEDSDDDMGSRQINVCCICKSG---GKLI 1195
Query: 669 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
CD C +HV C++ L P+G+W C DC
Sbjct: 1196 SCDTCSNFYHVECIE----PPLTRAPRGRWVCS-DC 1226
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 612
GG L+ CD C +H EC L+ P+G W C C++ ERK +++
Sbjct: 1191 GGKLISCDTCSNFYHVECIEPPLTRAPRGRWVCSDCKDRKERKTNIRY 1238
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 58/164 (35%), Gaps = 48/164 (29%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 408 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCFDPELEEAPEGRWS 455
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + +T++ R + + C +C+
Sbjct: 456 CPHCEG--------------------EGI-TAATVTEKAGRNAADDDEHSEFCRICKDGG 494
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+L CD C +H CL L E+P G W C
Sbjct: 495 ---------ELLCCDSCTSAYHTFCLN----PPLSEIPDGDWKC 525
Score = 44.7 bits (104), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 492 DGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 53.1 bits (126), Expect = 6e-04, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGI 821
L A + H+CF +V+ + DL +V+ R LR F G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 822 LRVFGQEVAELPLVAT 837
LRV+G++VAELPLV T
Sbjct: 69 LRVYGKKVAELPLVGT 84
>gi|157125686|ref|XP_001660731.1| hypothetical protein AaeL_AAEL002037 [Aedes aegypti]
gi|108882584|gb|EAT46809.1| AAEL002037-PA [Aedes aegypti]
Length = 2884
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2347 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2387
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G G ++ C+ C R +H C +
Sbjct: 2388 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2419
Query: 688 ADLRELPKGKWFC 700
L + P+GKW+C
Sbjct: 2420 PPLLKYPRGKWYC 2432
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 8/88 (9%)
Query: 617 VEAGRVSGVDSVEQ--ITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGFGPRTILLCDQ 672
+ G VS D+VE+ T + + LE+ ++ C F +SG +LLCD
Sbjct: 2293 IPKGLVSWRDAVERSVTTAQLSMALYVLESCVAWDKSIMKANCQFCQSGEQEDKLLLCDG 2352
Query: 673 CEREFHVGCLKKHKMADLRELPKGKWFC 700
C+R +H C K + ++P G W+C
Sbjct: 2353 CDRGYHTYCFKPR----MDKIPDGDWYC 2376
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ C+ CPRA+H++C L P+G WYC+ C
Sbjct: 2401 GKMVYCELCPRAYHQDCYIPPLLKYPRGKWYCQNC 2435
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 431 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILGT 487
Query: 626 D 626
D
Sbjct: 488 D 488
>gi|427795843|gb|JAA63373.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1933
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 51/139 (36%), Gaps = 55/139 (39%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + IP GDWYC C N + ++
Sbjct: 1676 LLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKV------------------ 1717
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C+LC K G ++ CD C + FH CL
Sbjct: 1718 ---------------------CILC-----GKKG----KLVRCDACPKVFHHTCLD---- 1743
Query: 688 ADLRELPKGKWFCCMDCSR 706
L + PKGKW CC C++
Sbjct: 1744 PPLSKPPKGKW-CCSGCAK 1761
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1664 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPK----MDKIPDGDWYC 1705
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHK 582
DG + GY+ C + ++ +G C N G L+ CD CP+ FH
Sbjct: 1680 DGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKVCILCGKKGKLVRCDACPKVFHH 1739
Query: 583 EC--ASLSSIPQGDWYCKYC 600
C LS P+G W C C
Sbjct: 1740 TCLDPPLSKPPKGKWCCSGC 1759
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYC 600
GNL+ CDGCP AFH C ++ S+P+GDWYC C
Sbjct: 806 GNLICCDGCPAAFHSRCVGIASDSLPEGDWYCPEC 840
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 431 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILGT 487
Query: 626 D 626
D
Sbjct: 488 D 488
>gi|221484244|gb|EEE22540.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
gi|221505773|gb|EEE31418.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 527
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE----AGRVSGV 625
+L CDGC A H+ C + ++P+ DWYC+YC+ + K AN + A + SG
Sbjct: 193 MLLCDGCDVAVHQTCYYVKTVPKADWYCQYCEEKNQAK------ANVAKLQRLAAKASGK 246
Query: 626 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREF----HVG 680
+ +Q+ V L E+ S C+L + C FG +C +F HV
Sbjct: 247 ATDKQVETTLKTEVDRLTGEMESVCVLPKRCPLCPRSFGAHV-----RCGEDFRMWVHVN 301
Query: 681 C 681
C
Sbjct: 302 C 302
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 50/134 (37%), Gaps = 54/134 (40%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + SIP GDWYC C+N ++
Sbjct: 1696 LLLCDGCDRGYHMYCFKPKMESIPDGDWYCHECKNKSNGEK------------------- 1736
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP-RTILLCDQCEREFHVGCLKKHK 686
C++C G P + ++C+ C R +H+ CL
Sbjct: 1737 --------------------NCIVC--------GKRPIKNYVICEHCPRIYHIECLN--- 1765
Query: 687 MADLRELPKGKWFC 700
L ++P+ KW C
Sbjct: 1766 -PPLSKVPRAKWNC 1778
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F +SG +LLCD C+R +H+ C K + +P G W+C
Sbjct: 1679 IMKANCQFCQSGDNEDKLLLCDGCDRGYHMYCFK----PKMESIPDGDWYC 1725
>gi|298707514|emb|CBJ30116.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1227
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 30/45 (66%), Gaps = 3/45 (6%)
Query: 564 HADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
HA GG+L+ CDGC +H EC LS +P+GDW+C C + RK+
Sbjct: 838 HA-GGDLICCDGCEAVYHPECVGLSVVPEGDWFCPAC--VIRRKK 879
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 541 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 595
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 596 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 638
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 231 LFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
L E+G+L+G++V YM G + G+++ G ILC+C C + S FE H
Sbjct: 514 LLESGVLEGLTVRYMPRPGEVLGS-----GVVKSGVILCNCRHCKSHQGFNASSFEKHVG 568
Query: 288 KQYRRASQYICFENGKSLLEVLR-ACRSVPLPMLKATLQSALS 329
R S +I +NG+ L EVL R P + L+ A+S
Sbjct: 569 STARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVS 611
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1716 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1751
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2127 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2165
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 65/178 (36%), Gaps = 53/178 (29%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC---QNMFERK--------RFLQ 611
+GG L+ CD CP +FH EC L +P GDW+C+ C FE + R LQ
Sbjct: 34 GEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQDTPFEPQSMEMTIMGRLLQ 93
Query: 612 H-----------------------------DANAVEAGRVSGVDSVEQITKRCIRIVKNL 642
H DA+A G + DS + + R +R K
Sbjct: 94 HLEGRNTVAFSLPLGVIKAVEPEDYADGQDDADA--PGESNLFDSDQSDSDRGLRQGKRK 151
Query: 643 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
S C +C S+ C +C +H+ CL +A P KW C
Sbjct: 152 RRHDSYCSVCSLPSPSRDDLA-----QCTRCPHSYHLWCLDPPLLAK----PTVKWLC 200
>gi|427795587|gb|JAA63245.1| Putative bromodomain adjacent to zinc finger domain protein 2b,
partial [Rhipicephalus pulchellus]
Length = 1435
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 51/139 (36%), Gaps = 55/139 (39%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + +H C + IP GDWYC C N + ++
Sbjct: 1178 LLLCDGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKV------------------ 1219
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C+LC K G ++ CD C + FH CL
Sbjct: 1220 ---------------------CILC-----GKKG----KLVRCDACPKVFHHTCLD---- 1245
Query: 688 ADLRELPKGKWFCCMDCSR 706
L + PKGKW CC C++
Sbjct: 1246 PPLSKPPKGKW-CCSGCAK 1263
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C F SG + +LLCD C++ +H C K + ++P G W+C
Sbjct: 1166 CQFCHSGDNEQMLLLCDGCDKGYHTYCFKPK----MDKIPDGDWYC 1207
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHK 582
DG + GY+ C + ++ +G C N G L+ CD CP+ FH
Sbjct: 1182 DGCDKGYHTYCFKPKMDKIPDGDWYCYECLNKTQDEKVCILCGKKGKLVRCDACPKVFHH 1241
Query: 583 EC--ASLSSIPQGDWYCKYC 600
C LS P+G W C C
Sbjct: 1242 TCLDPPLSKPPKGKWCCSGC 1261
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1527 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1562
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1938 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1976
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
GG +L CD CPR FH C L IP+G W C C + F R+
Sbjct: 830 GGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQ 870
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 13/90 (14%)
Query: 615 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 674
N +++ ++ +E+I + ++ V+ + E C+ C G G + +LLCD C
Sbjct: 787 NTIQSQQLQPSLIIEEIYRDQVKKVQIQDGENIWEEQCKVC-----GQGGK-VLLCDTCP 840
Query: 675 REFHVGCLKKHKMADLRELPKGKWFCCMDC 704
R FH CLK L+E+PKGKW CM C
Sbjct: 841 RVFHPRCLK------LKEIPKGKW-SCMIC 863
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|312376807|gb|EFR23792.1| hypothetical protein AND_12238 [Anopheles darlingi]
Length = 3049
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + IP GDWYC C N +R
Sbjct: 2439 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECNNKATGER------------------- 2479
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G G ++ C+ C R +H C +
Sbjct: 2480 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2511
Query: 688 ADLRELPKGKWFC 700
+ + P+GKW+C
Sbjct: 2512 PPMLKYPRGKWYC 2524
Score = 44.3 bits (103), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2425 QNCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2468
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ C+ CPRA+H++C + P+G WYC+ C
Sbjct: 2493 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCQNC 2527
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1612 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1647
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2023 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1611 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1646
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2022 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1611 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1646
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2022 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1718 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1753
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2129 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2167
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1609 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1644
Score = 41.6 bits (96), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2020 GDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2058
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1430 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1465
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1841 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1879
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1713 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1748
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2124 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 41/144 (28%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYC--------KYCQNMFERKRFLQHDANAVEAGR 621
L+ C CPR+FH C + P+ DW C KY Q + + K+ ++ + A EA
Sbjct: 1004 LISCGSCPRSFHLICIQMKRAPRRDWRCLACTAGVKKYKQELKDLKKIIE-EKEAFEAKD 1062
Query: 622 VSGVD-SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 680
+ D S+ Q C++ G LL RG + C C R++H+
Sbjct: 1063 SNEEDFSINQ----CLKC---------GELLSRGH------------IECIGCGRKYHLA 1097
Query: 681 CLKKHKMADLRELPKGKWFCCMDC 704
C ADL + PKG W+C C
Sbjct: 1098 C------ADLTKRPKGDWYCKKRC 1115
Score = 43.9 bits (102), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/173 (19%), Positives = 64/173 (36%), Gaps = 40/173 (23%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASL 587
+Y+ + +E K C C + +P + ++ C+ C FH C +L
Sbjct: 897 FYSSLESSIEWQKGLTNARCKVCRGKATPDR---------MIRCETCDLVFHLPCIKPAL 947
Query: 588 SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS 647
IP+G+W+CK C ++V ++ ++ + E S
Sbjct: 948 REIPRGEWFCKACT-----------------------PETVPDSPRKKPKVTSAEDEEES 984
Query: 648 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ DF + ++ C C R FH+ C++ ++ P+ W C
Sbjct: 985 TGEVPESNDFCEVCLNDEQLISCGSCPRSFHLICIQ------MKRAPRRDWRC 1031
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1715 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1750
Score = 39.3 bits (90), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2126 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1714 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1749
Score = 39.3 bits (90), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2125 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1448 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1483
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1859 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1897
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1345 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1380
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1756 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1794
>gi|170053718|ref|XP_001862805.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167874114|gb|EDS37497.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 3017
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 47/134 (35%), Gaps = 49/134 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2483 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2523
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G G ++ C+ C R +H C +
Sbjct: 2524 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2555
Query: 688 ADLRELPKGKWFCC 701
+ + P+GKW+C
Sbjct: 2556 PPMLKYPRGKWYCT 2569
Score = 43.5 bits (101), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2469 QNCQFCQSGEQEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2512
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ C+ CPRA+H++C + P+G WYC C
Sbjct: 2537 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCTNC 2571
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1358 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1393
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 46/164 (28%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 268 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 315
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + + G + + + C R+ K+ G LLC
Sbjct: 316 CPHCE-----KEGIQWEPKEEDEEEEEGGEEEDDHMEFC-RVCKD-----GGELLC---- 360
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CD C +H+ CL L E+P G+W C
Sbjct: 361 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 387
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 354 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1444 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1479
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1855 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1893
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1446 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1481
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1857 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1409 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1444
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1820 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1858
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1718 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1753
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2129 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2167
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1358 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1393
Score = 39.3 bits (90), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1758 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1796
>gi|237838373|ref|XP_002368484.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211966148|gb|EEB01344.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 527
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE----AGRVSGV 625
+L CDGC A H+ C + ++P+ DWYC+YC+ + K AN + A + SG
Sbjct: 193 MLLCDGCDVAVHQTCYYVKTVPKADWYCQYCEEKNKAK------ANVAKLQRLAAKASGK 246
Query: 626 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREF----HVG 680
+ +Q+ V L E+ S C+L + C FG +C +F HV
Sbjct: 247 ATDKQVETTLKTEVDRLTGEMESVCVLPKRCPLCPRSFGAHV-----RCGEDFRMWVHVN 301
Query: 681 C 681
C
Sbjct: 302 C 302
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1446 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1481
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1857 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1406 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1441
Score = 39.7 bits (91), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1817 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1855
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1449 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1484
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1860 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1898
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1445 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1480
Score = 39.3 bits (90), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1856 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1445 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1480
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1856 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1706 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1741
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2117 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2155
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1717 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1752
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|255577029|ref|XP_002529399.1| hypothetical protein RCOM_0623590 [Ricinus communis]
gi|223531147|gb|EEF32995.1| hypothetical protein RCOM_0623590 [Ricinus communis]
Length = 275
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 745 DVRWRLLSGK--------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR- 795
++ W LL A+ E LS A+ + H+CF P+ + + D + +++ +
Sbjct: 29 NLTWTLLKSNHSSDHKPDASDIENYSKLSIALHVMHECFQPVEEPRTKGDFLKDVIFRKR 88
Query: 796 -NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
L F G Y +L + ++ +RV+G++VAE+PLV T
Sbjct: 89 SELNRLNFRGFYTVLLQKDDEFITVATVRVYGEKVAEIPLVGT 131
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1404 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1439
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1815 GDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1853
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1408 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1443
Score = 39.3 bits (90), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1819 GDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1857
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 1163 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 1198
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 7/57 (12%)
Query: 547 IICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
++ HC + E + + DGG L+ CDGCPRAFH C + L SIP G W C+ C+
Sbjct: 151 MVIHCNDDECAVCK-----DGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERCR 202
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 60/164 (36%), Gaps = 45/164 (27%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 327 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 374
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + E G + E R+ K+ G LLC
Sbjct: 375 CPHCE-----KEGIQWEPKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 420
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CD C +H+ CL L E+P G+W C
Sbjct: 421 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 447
Score = 43.1 bits (100), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 414 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|397596945|gb|EJK56894.1| hypothetical protein THAOC_23124 [Thalassiosira oceanica]
Length = 1752
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 24/138 (17%)
Query: 569 NLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVEAGRVSGV 625
+L+ CDGCP+ +H C + +P G+W C +C+ +RK+ Q ++ G
Sbjct: 541 DLVCCDGCPKVYHSNCHKPKIRELPDGEWLCMHCKPKGADRKKKYQ----GFRLAKIPG- 595
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP---RTILLCDQCEREFHVGCL 682
++V+ + V+ E E C++C G + + GP + C C+ +H C+
Sbjct: 596 ETVDSPARHVKCTVRWPEME---CIICEGTEVT----GPLKDNDWVTCATCDDAYHTRCV 648
Query: 683 KKHKMADLRELPKGKWFC 700
L P GKW C
Sbjct: 649 ------GLETRPGGKWRC 660
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 12/76 (15%)
Query: 553 NSEVSPSQ--FEAHAD--------GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
N E +PS+ F+ D GG+LL CD C +A+H +C L+ IP+G+W C+ C
Sbjct: 676 NKENAPSKPLFKGEHDDTCYMCYQGGDLLCCDYCSKAYHMKCHLPPLTEIPEGNWKCQEC 735
Query: 601 QNMFERKRFLQHDANA 616
+ ++ F D A
Sbjct: 736 AAVEMKRMFKCGDCTA 751
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 60/164 (36%), Gaps = 45/164 (27%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 312 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 359
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + E G + E R+ K+ G LLC
Sbjct: 360 CPHCE-----KEGIQWEPKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 405
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CD C +H+ CL L E+P G+W C
Sbjct: 406 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLC 432
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 399 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 62/170 (36%), Gaps = 39/170 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 594
L+ GY C C GG ++ CD CPRAFH C L P+G
Sbjct: 356 LMSGYDTDHQDYCEVCQQ------------GGEIMLCDTCPRAFHLVCLDPELEEAPEGS 403
Query: 595 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK----NLEAELSGCL 650
W C +C+ V A R + + +++ I K N E +
Sbjct: 404 WSCPHCEK-----------EGVVAASRSTTPATGGDMSQNPQNIRKSAQPNEEEKDEHQE 452
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C C K G ++ C +C +H CL L E+P+G W C
Sbjct: 453 FCNEC---KDG---GDLICCAKCPVSYHPECL----YPPLSEIPEGPWLC 492
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG+L+ C CP ++H EC LS IP+G W C C
Sbjct: 459 DGGDLICCAKCPVSYHPECLYPPLSEIPEGPWLCPRC 495
>gi|158297171|ref|XP_317442.4| AGAP008017-PA [Anopheles gambiae str. PEST]
gi|157015066|gb|EAA12387.4| AGAP008017-PA [Anopheles gambiae str. PEST]
Length = 2930
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 47/133 (35%), Gaps = 49/133 (36%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC R +H C + IP GDWYC C+N R
Sbjct: 2399 LLLCDGCDRGYHTYCFKPRMDKIPDGDWYCFECKNKATGDR------------------- 2439
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
C++C G G ++ C+ C R +H C +
Sbjct: 2440 --------------------KCIVCGGLRPPPLG----KMVYCELCPRAYHQDCY----I 2471
Query: 688 ADLRELPKGKWFC 700
+ + P+GKW+C
Sbjct: 2472 PPMLKYPRGKWYC 2484
Score = 43.9 bits (102), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 650 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ C F +SG +LLCD C+R +H C K + ++P G W+C
Sbjct: 2382 IMKANCQFCQSGESEDKLLLCDGCDRGYHTYCFKPR----MDKIPDGDWYC 2428
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
G ++ C+ CPRA+H++C + P+G WYC+ C
Sbjct: 2453 GKMVYCELCPRAYHQDCYIPPMLKYPRGKWYCQNC 2487
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 59/173 (34%), Gaps = 44/173 (25%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASL 587
++ G +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 53 FFMMGVDDGDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEL 100
Query: 588 SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS 647
P+G W C +C+ G+ + E E
Sbjct: 101 EKAPEGKWSCPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDD 140
Query: 648 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CR C K G +L CD C +H+ CL L E+P G+W C
Sbjct: 141 HMEFCRVC---KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 183
Score = 43.9 bits (102), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 150 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 186
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 61/164 (37%), Gaps = 46/164 (28%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 348 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEMEKAPEGKWS 395
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + + + + + C R+ K+ G LLC
Sbjct: 396 CPHCE-----KEGIQWEPKDDDEEDEDLCEEADDHMEFC-RVCKD-----GGELLC---- 440
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CD C +H+ CL L E+P G+W C
Sbjct: 441 -------------CDTCPSSYHIHCLN----PPLPEIPNGEWLC 467
Score = 43.1 bits (100), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 434 DGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 470
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 874 GGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQ 914
>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 588
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 538 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 573
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 677 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 712
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 69/186 (37%), Gaps = 42/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ +GY+ C C GG ++ CD CPRA+H C L
Sbjct: 425 AGEEEADGYETDHQDYCEVCQQ------------GGEIILCDSCPRAYHLVCLDPELDKA 472
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + + + + C R+ K+ G L
Sbjct: 473 PEGKWSCPHCEKEGVQWEPKEEEDEYEGEMDDAEKEEEDDHMEYC-RVCKD-----GGEL 526
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L E+P G+W C C +
Sbjct: 527 LC-----------------CDACISSYHIHCLN----PPLPEIPNGEWLCPRCTCPMLKG 565
Query: 710 VLQNLL 715
+Q +L
Sbjct: 566 RVQKIL 571
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 869 GGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQ 909
>gi|320167629|gb|EFW44528.1| hypothetical protein CAOG_02553 [Capsaspora owczarzaki ATCC 30864]
Length = 1716
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASL--SSIPQGDWYCKYC 600
G LL CDGCPR +H C L +S+PQGDW+C C
Sbjct: 437 GELLCCDGCPRVYHATCLKLDTASLPQGDWFCPTC 471
Score = 43.5 bits (101), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 602
GN+L CD CPR++H +C +S P+GDW C C++
Sbjct: 1445 GNVLCCDYCPRSYHLKCLKPPMSKPPRGDWKCPICKS 1481
>gi|307109592|gb|EFN57830.1| hypothetical protein CHLNCDRAFT_143249 [Chlorella variabilis]
Length = 295
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 26/35 (74%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+GG L+ CDGC A+H++CA L ++P+ DW+C C
Sbjct: 200 EGGELVCCDGCTAAYHEQCAGLEAVPETDWFCPMC 234
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 63/180 (35%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 727 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 774
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + + E E CR C
Sbjct: 775 CPHCEK--------------------EGIQWEPKDDEDEEEEGGCEEEEDDHMEFCRVC- 813
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 814 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 864
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 49/136 (36%), Gaps = 46/136 (33%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C L P+G W C YCQ E +
Sbjct: 384 GGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQ---------------AEGNQEQE 428
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D Q RI K+ G LLC CD C +H CL
Sbjct: 429 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHRFCLN- 462
Query: 685 HKMADLRELPKGKWFC 700
L E+P G+W C
Sbjct: 463 ---PPLEEVPDGEWKC 475
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H+ C + L +P G+W C C
Sbjct: 442 DGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRC 478
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 25/36 (69%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+GGNL+ CD CPR H C LS IP+GD+YC C+
Sbjct: 1767 EGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDCE 1802
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 489 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 524
>gi|358386993|gb|EHK24588.1| hypothetical protein TRIVIDRAFT_114174, partial [Trichoderma virens
Gv29-8]
Length = 633
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
LL CDGC A+H C L IP GDWYC C ++F+
Sbjct: 141 LLLCDGCDAAYHTHCIGLDYIPDGDWYCMECAHLFQ 176
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 474 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 521
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 522 CPHCEK--------------------EGIQWEPKEDDEEEEEGGCEEEEDDHMEFCRVC- 560
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 561 --KDG---GELLCCDACPSSYHLHCLN----PPLAEIPNGEWLCPRCTCPPLKGKVQRIL 611
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 66/180 (36%), Gaps = 46/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 339 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPEMEKAPEGKWS 386
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + + G + E R+ K+ G LLC
Sbjct: 387 CPHCE-----KEGIQWEPKDDDEEDEEGGEEEEDDHMEFCRVCKD-----GGELLC---- 432
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 433 -------------CDTCPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 475
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 347 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 394
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 395 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 433
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 434 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 468
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 435 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 471
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 87 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 122
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|194762684|ref|XP_001963464.1| GF20276 [Drosophila ananassae]
gi|190629123|gb|EDV44540.1| GF20276 [Drosophila ananassae]
Length = 2062
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ +R + V+N + SGC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1767 MPQRMVGRVRNYNWQCSGCKCCIKC---RSNQRPGKMLFCEQCDRGYHIYCLG------L 1817
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1818 RTVPDGRWSCERCCVCMRC 1836
>gi|307203232|gb|EFN82387.1| Bromodomain adjacent to zinc finger domain protein 1A [Harpegnathos
saltator]
Length = 1466
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 70/164 (42%), Gaps = 33/164 (20%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRF------ 609
D N+L CD C + H C L+++P+GDW+C C + +RKRF
Sbjct: 1091 DAENMLLCDECNKGHHLYCLKPKLNAVPEGDWFCTTCRPPVIKPKEKTQKRKRFEDEMED 1150
Query: 610 ---LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLCRGCDFSKSGFGP 664
L + A RV E I+ + + + + +++ +C C KSG
Sbjct: 1151 EAILTKETRHNRAKRVVTYSDDEAISDQEDDVDEESDQDINVRSENICASC---KSG--- 1204
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 706
++ CD C +H+ C++ L P+G+W C C D R
Sbjct: 1205 GKLITCDTCPDRYHLECVE----PPLSRAPRGRWSCTKCKDKRR 1244
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-QNMFERKRFLQHD 613
DGG L+ CDGCPRAFH C L +P G W C C +N+ E + L+ D
Sbjct: 254 GDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSCVENVTEPGQLLEAD 305
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 40/67 (59%), Gaps = 2/67 (2%)
Query: 773 DCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 830
+CF PI+D +G +LI ++VY G N +F G Y IL +++A +R+ G ++A
Sbjct: 3 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 62
Query: 831 ELPLVAT 837
E+P + T
Sbjct: 63 EMPFIGT 69
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 334 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 381
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 382 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 420
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 421 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 422 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 322 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 369
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 370 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 408
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 409 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 459
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL CD CP AFH+EC ++ IP+G+WYC C+
Sbjct: 292 SEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDCK 327
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 189 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 236
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 237 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 290
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 291 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 319
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 286 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 321 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 368
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 369 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 407
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 408 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 442
Score = 43.5 bits (101), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 409 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 445
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 45 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 383 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 430
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 431 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 484
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 485 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 513
Score = 39.3 bits (90), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 480 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 516
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 70/186 (37%), Gaps = 42/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E + + + C R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEGEEEGEKEEEDDHMEYC-RVCKD-----GGEL 467
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 468 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 506
Query: 710 VLQNLL 715
+Q +L
Sbjct: 507 RVQKIL 512
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 610
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 394 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|429862948|gb|ELA37533.1| origin recognition complex subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 830
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 8/73 (10%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
G +L CDGC R +H++C ++ +P+GDWYC C Q ++ + AG V+ +
Sbjct: 348 GNKILFCDGCDRCYHQKCHNVPKVPKGDWYCDDC--------VQQKESRVLAAGEVAKIP 399
Query: 627 SVEQITKRCIRIV 639
+ Q R++
Sbjct: 400 NFAQHLNSMKRVL 412
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 24/91 (26%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC---- 704
C++C D SK+G IL CD C+R +H C ++ ++PKG W+C DC
Sbjct: 337 CVICSKPD-SKAG---NKILFCDGCDRCYHQKC------HNVPKVPKGDWYCD-DCVQQK 385
Query: 705 -SRINSVLQNLLVQEAEKLPEF--HLNAIKK 732
SR+ L E K+P F HLN++K+
Sbjct: 386 ESRV------LAAGEVAKIPNFAQHLNSMKR 410
>gi|302774224|ref|XP_002970529.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
gi|300162045|gb|EFJ28659.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
Length = 1340
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 482 GNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 482 GNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 610
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 394 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 386 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 424
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 425 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 426 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 420
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 421 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 459
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 460 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 43.9 bits (102), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 461 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 51.2 bits (121), Expect = 0.002, Method: Composition-based stats.
Identities = 27/53 (50%), Positives = 39/53 (73%), Gaps = 1/53 (1%)
Query: 786 DLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 837
D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT
Sbjct: 104 DDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIAT 156
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 310 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 357
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 358 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 396
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 397 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 431
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 398 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 434
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 386 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 424
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 425 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 426 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 320 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 367
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 368 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 406
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 407 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 441
Score = 43.9 bits (102), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 408 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 444
>gi|47214709|emb|CAG01062.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1036
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 71/164 (43%), Gaps = 29/164 (17%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQG 593
GQ L+ + + C C + + AD +L CD C A H+EC + IP+G
Sbjct: 205 GQSDLQAMVDEDAVCCICMDGD--------GADSNVILFCDSCNIAVHQECYGVPYIPEG 256
Query: 594 DWYCKYCQN---MFERKRFLQ--------HDANAVEAGRVSGVDSVEQITKRCIRIVKNL 642
W C++C + +++R L+ H A A+ V D+V +R +
Sbjct: 257 QWLCRHCLQVRLLPQQRRSLKKTDDGRWGHVACALWVPEVGFSDTVFIEPIDGVRNIPPA 316
Query: 643 EAELSGCLLCRGCDFSKSGFGPRTILLCDQ--CEREFHVGCLKK 684
+L+ C LCR + G G + CD+ C FHV C +K
Sbjct: 317 RWKLT-CYLCR-----EKGAG--ACIQCDKVNCYTAFHVSCAQK 352
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 42 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 62/171 (36%), Gaps = 42/171 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 189 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 236
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C E V E+ R K E +
Sbjct: 237 PEGKWSCPHC-----------------EKEGVQWEAKEEEEDYEEERGGKERRREEDDHM 279
Query: 651 -LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CR C K G +L CD C +H+ CL L ++P G+W C
Sbjct: 280 EYCRVC---KDG---GELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 320
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 287 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 424 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 471
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 472 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 510
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 511 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 561
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG LL CDGCPRAFH C LS IP G W C C
Sbjct: 441 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 477
>gi|291234752|ref|XP_002737311.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 281
Score = 51.2 bits (121), Expect = 0.002, Method: Composition-based stats.
Identities = 35/133 (26%), Positives = 51/133 (38%), Gaps = 54/133 (40%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS 627
LL CDGC + FH C ++SIP+GDWYC C
Sbjct: 33 LLLCDGCDKGFHTYCFKPKMNSIPEGDWYCYEC--------------------------- 65
Query: 628 VEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 687
+ + T I C+LCR ++ C+ C R +H C+
Sbjct: 66 IYKATGEYI------------CVLCR---------HKGRLVKCENCPRAYHPDCID---- 100
Query: 688 ADLRELPKGKWFC 700
L ++P+G+WFC
Sbjct: 101 PPLLKMPRGRWFC 113
Score = 44.3 bits (103), Expect = 0.33, Method: Composition-based stats.
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 568 GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
G L+ C+ CPRA+H +C L +P+G W+C+ CQ
Sbjct: 82 GRLVKCENCPRAYHPDCIDPPLLKMPRGRWFCQACQ 117
Score = 39.7 bits (91), Expect = 7.5, Method: Composition-based stats.
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C F G +LLCD C++ FH C K + +P+G W+C
Sbjct: 21 CQFCLKGDNEELLLLCDGCDKGFHTYCFK----PKMNSIPEGDWYC 62
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 554 SEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
+ V+ + A DGG L+ CDGCP+AFH C L+SIP G W C +C
Sbjct: 248 AHVNDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWC 296
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 51.2 bits (121), Expect = 0.002, Method: Composition-based stats.
Identities = 26/58 (44%), Positives = 31/58 (53%), Gaps = 5/58 (8%)
Query: 545 LGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
+G+ C C +E + DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 89 MGVSCLCQKNE---DECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 63/180 (35%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 363 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 410
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + + E E CR C
Sbjct: 411 CPHCEK--------------------EGIQWEPKDDEDDEEEGGCEEEEDDHMEFCRVC- 449
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 450 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 500
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 26/38 (68%), Gaps = 2/38 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 42 SDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 373 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 420
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 421 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 459
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 460 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 461 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
GG ++ CD CP+ FH +C L IP+G W C C + FER+
Sbjct: 853 GGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNFERQ 893
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 439 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 486
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 487 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 525
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 526 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 576
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 35/53 (66%), Gaps = 4/53 (7%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 616
+GG LL CD CP+ FH C +L++ P G+WYC +C+++ + +Q++ NA
Sbjct: 705 NGGELLCCDKCPKVFHLSCHVPTLTASPSGEWYCTFCRDLNSPE--MQYNVNA 755
>gi|310792256|gb|EFQ27783.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 312
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G ++ CDGC +A+H++C + +P+GDWYC C
Sbjct: 197 GNQIMFCDGCDKAYHQKCYKVPKVPRGDWYCNEC 230
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 18/88 (20%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 706
CL+C D SK+G I+ CD C++ +H C K + ++P+G W+C C+D +
Sbjct: 186 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYK------VPKVPRGDWYCNECLDQKQ 235
Query: 707 INSVLQNLLVQEAEKLPEF--HLNAIKK 732
+ + EA K+P F HL+ +K+
Sbjct: 236 SRAAAAD----EAVKIPNFQQHLSKLKR 259
>gi|156033159|ref|XP_001585416.1| hypothetical protein SS1G_13655 [Sclerotinia sclerotiorum 1980]
gi|154699058|gb|EDN98796.1| hypothetical protein SS1G_13655 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 670
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 613
LL CDGC +H C LSSIP G WYC C E F QHD
Sbjct: 179 LLLCDGCDAPYHTHCIGLSSIPTGHWYCMEC---VESGAFTQHD 219
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 62/171 (36%), Gaps = 42/171 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 443 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 490
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C E V E+ R K E +
Sbjct: 491 PEGKWSCPHC-----------------EKEGVQWEAKEEEEDYEEERGGKERRREEDDHM 533
Query: 651 -LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
CR C K G +L CD C +H+ CL L ++P G+W C
Sbjct: 534 EYCRVC---KDG---GELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 574
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 541 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 586 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 633
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 634 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 672
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 673 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 723
>gi|301093217|ref|XP_002997457.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110713|gb|EEY68765.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 248
Score = 51.2 bits (121), Expect = 0.003, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 567 GGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 602
GG LL CDGC RA+H C +L +P+GDW+C +C++
Sbjct: 196 GGKLLCCDGCERAYHLNCVRPALLDVPEGDWFCSHCRD 233
Score = 39.3 bits (90), Expect = 9.4, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 6/43 (13%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRI 707
+L CD CER +H+ C++ L ++P+G WFC C D S +
Sbjct: 199 LLCCDGCERAYHLNCVR----PALLDVPEGDWFCSHCRDASPV 237
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C L P+G W C +C+N G
Sbjct: 385 GGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------------EGPAEQ 428
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D Q RI K+ G LLC CD C +H CL
Sbjct: 429 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHTHCLN- 462
Query: 685 HKMADLRELPKGKWFC 700
L E+P G W C
Sbjct: 463 ---PPLVEIPDGDWKC 475
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------DGGNLLPCDGCPRAFHK 582
+ C LE G HC N + + H DGG LL CD CP A+H
Sbjct: 399 HLVCLDPELEDTPEGKWSCPHCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHT 458
Query: 583 ECAS--LSSIPQGDWYCKYC 600
C + L IP GDW C C
Sbjct: 459 HCLNPPLVEIPDGDWKCPRC 478
>gi|440799762|gb|ELR20806.1| PHD-finger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 482
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 54/154 (35%), Gaps = 65/154 (42%)
Query: 549 CHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFER 606
C C P G +L CDGC R FH C + L S+P G+WYCK C
Sbjct: 241 CQICRRSTQP---------GCMLLCDGCDRGFHTFCLNPRLKSVPSGEWYCKSC------ 285
Query: 607 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 666
L S C +C G G R
Sbjct: 286 -----------------------------------LANSKSACEVCEG--------GGR- 301
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+L C+ C R +H+ CL L+++PK KW C
Sbjct: 302 LLCCEVCPRVYHLKCLD----PPLKQVPKEKWTC 331
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 51/138 (36%), Gaps = 46/138 (33%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG L+ CD CPRA+H C A++ P+GDW C +C ++H V+
Sbjct: 264 GGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHGPEIVKEEPAKQ 314
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D C +C+ + +LLCD C FH C+
Sbjct: 315 NDDF--------------------CKICKETE---------NLLLCDNCTCSFHAYCMD- 344
Query: 685 HKMADLRELP--KGKWFC 700
L ELP W C
Sbjct: 345 ---PPLLELPPQDESWAC 359
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 65/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 419 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 466
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G + + + C R+ K+ G L
Sbjct: 467 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 520
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 521 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 549
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 516 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 552
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C L P+G W C +C+N G
Sbjct: 382 GGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------------EGPAEQ 425
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D Q RI K+ G LLC CD C +H CL
Sbjct: 426 DDDEHQ---EFCRICKD-----GGELLC-----------------CDSCPSAYHTHCLN- 459
Query: 685 HKMADLRELPKGKWFC 700
L E+P G W C
Sbjct: 460 ---PPLVEIPDGDWKC 472
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA-------DGGNLLPCDGCPRAFHK 582
+ C LE G HC N + + H DGG LL CD CP A+H
Sbjct: 396 HLVCLDPELEDTPEGKWSCPHCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHT 455
Query: 583 ECAS--LSSIPQGDWYCKYC 600
C + L IP GDW C C
Sbjct: 456 HCLNPPLVEIPDGDWKCPRC 475
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 51/138 (36%), Gaps = 46/138 (33%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG L+ CD CPRA+H C A++ P+GDW C +C ++H V+
Sbjct: 264 GGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHGPEIVKEEPAKQ 314
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D C +C+ + +LLCD C FH C+
Sbjct: 315 NDDF--------------------CKICKETE---------NLLLCDNCTCSFHAYCMD- 344
Query: 685 HKMADLRELP--KGKWFC 700
L ELP W C
Sbjct: 345 ---PPLLELPPQDESWAC 359
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 58/143 (40%), Gaps = 44/143 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG LL CD CPRA+H C +S+ P+GDW C +C +E G
Sbjct: 260 GGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHC----------------IEHG---- 299
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+I K + V + C +C+ + +LLCD C FH C+
Sbjct: 300 ----PEIVKEEPQKVND-----DFCKICKETE---------NLLLCDTCVCAFHAYCMD- 340
Query: 685 HKMADLRELPKGKWFCCMDCSRI 707
L ++P+ + + C C +
Sbjct: 341 ---PPLTQVPQEETWNCPRCELV 360
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 424 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 471
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 472 CPHCEK--------------------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC- 510
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 511 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 43.5 bits (101), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 512 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|328788377|ref|XP_395223.4| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
[Apis mellifera]
Length = 1449
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 60/157 (38%), Gaps = 29/157 (18%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC----------------QNMFERK 607
DG +L CDGC + H C L+ +P GDWYCK C ++ E
Sbjct: 1077 DGDKMLLCDGCNKGHHLYCLQPKLNCVPDGDWYCKVCKPSTKPKEKIKKRKKFEDELEED 1136
Query: 608 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 667
L + A RV + + + + +C C KSG +
Sbjct: 1137 VILTKETRHNRAKRVLESEEEDNSEDEELEEDSDDNISNQQINVCSAC---KSG---GKL 1190
Query: 668 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 704
+ CD C +H+ C++ + P+G+W C DC
Sbjct: 1191 ISCDMCPNFYHIECIE----PPITRAPRGRWICS-DC 1222
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQH 612
GG L+ CD CP +H EC ++ P+G W C C++ +RK +++
Sbjct: 1187 GGKLISCDMCPNFYHIECIEPPITRAPRGRWICSDCKDRRDRKMNIKY 1234
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 807 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 854
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 855 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 893
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 894 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 944
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 41/136 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CP+A+H C + P+G W C C EA +
Sbjct: 265 GGEIILCDTCPKAYHMVCLDPDMEEAPEGHWSCPSC-----------------EAAGIPQ 307
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D E+ ++ N+E C +C+ + +L CD C +H C+
Sbjct: 308 KDEEEE-----KKVATNMEY----CRVCKDVGW---------LLCCDTCPSSYHAYCMN- 348
Query: 685 HKMADLRELPKGKWFC 700
L E+P+G+W C
Sbjct: 349 ---PPLTEVPEGEWSC 361
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
D G LL CD CP ++H C + L+ +P+G+W C C
Sbjct: 328 DVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPRC 364
>gi|359479239|ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260139 [Vitis vinifera]
Length = 1976
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNL+ CDGCP A+H C ++S +P GDWYC C ++ + ++ + GV
Sbjct: 603 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC--AIDKDKPWMKQRKSLRGAELLGV 660
Query: 626 D 626
D
Sbjct: 661 D 661
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 60/164 (36%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 312 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 359
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ E + D E+ + ++ E CR C
Sbjct: 360 CPHCEK---------------EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC- 398
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 399 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 433
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 400 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 436
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
GG ++ CD CP+ FH +C L +P+G W C C + FER+
Sbjct: 853 GGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNFERQ 893
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 406 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 453
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 454 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 492
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 493 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 527
Score = 43.5 bits (101), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 494 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 530
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNLL CDGCP AFH +C + +P+GDWYC C
Sbjct: 436 GNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC 470
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 35/164 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 368 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMERAPEGTWS 415
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A R + E+ E + CR C
Sbjct: 416 CPHCE-----KEGIQWEA------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC- 463
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L ++P G+W C
Sbjct: 464 --KDG---GELLCCDTCPSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 43.1 bits (100), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 10/83 (12%)
Query: 764 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR---------GQEFGGMYCAILTVNS 814
+ QA+ +F F P++ +GRDL+ + G G F G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMD-NGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRG 279
Query: 815 SVVSAGILRVFGQEVAELPLVAT 837
+VV+A LRVFG+ AELP VAT
Sbjct: 280 AVVTAATLRVFGRRFAELPFVAT 302
>gi|296083821|emb|CBI24209.3| unnamed protein product [Vitis vinifera]
Length = 1805
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNL+ CDGCP A+H C ++S +P GDWYC C
Sbjct: 589 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC 623
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/180 (24%), Positives = 62/180 (34%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 329 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 376
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 377 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 415
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 416 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 466
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 44 DGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 66/180 (36%), Gaps = 45/180 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 360 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 407
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ E + D E+ + ++ E CR C
Sbjct: 408 CPHCEK---------------EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC- 446
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 715
K G +L CD C +H+ CL L E+P G+W C C + +Q +L
Sbjct: 447 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 497
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 603
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 929 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 968
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 610
DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 312 DGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 358
>gi|449547717|gb|EMD38685.1| hypothetical protein CERSUDRAFT_113863 [Ceriporiopsis subvermispora
B]
Length = 906
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 73/196 (37%), Gaps = 65/196 (33%)
Query: 568 GNLLPCDGCPRAFHKEC----ASLSSIPQGD--WYCKYCQN-------MFERKRF----L 610
G+L+ CDGCPRAFH C + S +P+GD WYC C N + + +F L
Sbjct: 225 GSLVYCDGCPRAFHLWCLDPPMAASDLPEGDERWYCPACTNQQKPPPKISAKLKFIAPLL 284
Query: 611 QHDANAVEA------------------GRVSGVDSVEQITKRCIRIVK-------NLEAE 645
+H A + A R + VD+ E R R+ + L+
Sbjct: 285 EHLATIIPAEYSLPNEIKTHFKDVATGPRGAYVDTSEIKAPRLNRLGQVEDRDPYRLKDR 344
Query: 646 LSGCLLCRGC---------------------DFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+LC C D + PR I+ CD C +H+ CL
Sbjct: 345 NGDPVLCFQCGTSALPPAVAATSPAAKRTKRDHNTFHDNPRAIITCDYCHLHWHLDCLDP 404
Query: 685 HKMADLRELPKGKWFC 700
+A + K KW C
Sbjct: 405 -PLACMPPWSK-KWMC 418
>gi|356544359|ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max]
Length = 1702
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
G+L+ CDGCP AFH C ++S +P+GDWYC C
Sbjct: 685 GSLICCDGCPAAFHSRCVGIASDHLPEGDWYCPEC 719
>gi|342877621|gb|EGU79070.1| hypothetical protein FOXB_10409 [Fusarium oxysporum Fo5176]
Length = 673
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
LL CD C A+H C L +IP GDWYC C ++F+
Sbjct: 167 LLLCDSCDAAYHTHCIGLDAIPDGDWYCMECSHLFQ 202
Score = 43.9 bits (102), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 62/155 (40%), Gaps = 28/155 (18%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD--SVEQI 631
DGC H C + S Q C C+N F R N V+ +S D +Q+
Sbjct: 77 DGCNHIIHDAC--IRSWAQKTNTCPICRNPFHSVRVY----NGVDGTAISKYDVQDKKQV 130
Query: 632 TKRCIR-------IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
+ +R + E + + C +C + +LLCD C+ +H C+
Sbjct: 131 AEFDVRQWLGENPEDEEEEEQGNPCPICNSSERED------VLLLCDSCDAAYHTHCIG- 183
Query: 685 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 719
L +P G W+ CM+CS + +++ E+
Sbjct: 184 -----LDAIPDGDWY-CMECSHLFQLVEEPRTTES 212
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 44 DGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|357616639|gb|EHJ70297.1| hypothetical protein KGM_09919 [Danaus plexippus]
Length = 1569
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 52/140 (37%), Gaps = 49/140 (35%)
Query: 569 NLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
LL CDGC + +H C + IP GDWYC C N G
Sbjct: 1267 QLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWEC-------------VNKARGG------ 1307
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
S E++ C++C G ++ L C C R +H+ C
Sbjct: 1308 SRERV-----------------CIVCGGAARGRA-------LPCALCVRAYHLDC----H 1339
Query: 687 MADLRELPKGKWFCCMDCSR 706
L ++P+GKW+C SR
Sbjct: 1340 YPPLTKMPRGKWYCSQCASR 1359
Score = 43.5 bits (101), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 8/91 (8%)
Query: 518 DESGLPDGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFE-----AHADGGNLL 571
D+ L DG + GY+ C + +E +G C N S+ A G L
Sbjct: 1266 DQLLLCDGCDKGYHTYCFKPRMEKIPDGDWYCWECVNKARGGSRERVCIVCGGAARGRAL 1325
Query: 572 PCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
PC C RA+H +C L+ +P+G WYC C
Sbjct: 1326 PCALCVRAYHLDCHYPPLTKMPRGKWYCSQC 1356
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 654 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 706
C F SG +LLCD C++ +H C K + ++P G W+C C++ +R
Sbjct: 1255 NCQFCLSGDNEDQLLLCDGCDKGYHTYCFKPR----MEKIPDGDWYCWECVNKAR 1305
>gi|147864569|emb|CAN78969.1| hypothetical protein VITISV_022739 [Vitis vinifera]
Length = 1318
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNL+ CDGCP A+H C ++S +P GDWYC C
Sbjct: 587 GNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC 621
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 603
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 942 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 981
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 2/40 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 603
+GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 824 NGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 863
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 364 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 411
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 412 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 466
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 467 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPMLKG 505
Query: 710 VLQNLL 715
+Q +L
Sbjct: 506 RVQKIL 511
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 602
GG+L+ C+ CP AFH +C IP G WYC+ C N
Sbjct: 1095 GGSLMCCESCPAAFHPDCIGYDEIPDGSWYCRDCTN 1130
>gi|408391355|gb|EKJ70734.1| hypothetical protein FPSE_09104 [Fusarium pseudograminearum CS3096]
Length = 710
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS-V 628
LL CD C A+H C L IP GDWYC C ++F + + A EAG S S V
Sbjct: 177 LLLCDSCDAAYHTHCIGLEVIPDGDWYCMECAHLF----HMVDEPEATEAGESSPRPSYV 232
Query: 629 EQITKRCIR 637
+ R +R
Sbjct: 233 RRPNPRNVR 241
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 61/164 (37%), Gaps = 42/164 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 332 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 379
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + + G + E+ + CR C
Sbjct: 380 CPHCE-----KEGIQWEPKEDDEDEEDGAEEEEEEEDDHME-------------FCRVC- 420
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 421 --KDG---GELLCCDTCPSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 422 DGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|224059262|ref|XP_002299795.1| predicted protein [Populus trichocarpa]
gi|222847053|gb|EEE84600.1| predicted protein [Populus trichocarpa]
Length = 89
Score = 50.4 bits (119), Expect = 0.004, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 26/35 (74%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNL+ CDGCP A+H +C +++ +P+GDWYC C
Sbjct: 16 GNLICCDGCPAAYHAKCVGVANNYLPEGDWYCPEC 50
>gi|310792252|gb|EFQ27779.1| PHD-finger domain-containing protein [Glomerella graminicola
M1.001]
Length = 897
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G ++ CDGC +A+H++C + +P+GDWYC C
Sbjct: 395 GNQIMFCDGCDKAYHQKCYKVPKVPRGDWYCNEC 428
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 18/88 (20%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSR 706
CL+C D SK+G I+ CD C++ +H C K K +P+G W+C C+D +
Sbjct: 384 CLICSKPD-SKAG---NQIMFCDGCDKAYHQKCYKVPK------VPRGDWYCNECLDQKQ 433
Query: 707 INSVLQNLLVQEAEKLPEF--HLNAIKK 732
+ + EA K+P F HL+ +K+
Sbjct: 434 SRAAAAD----EAVKIPNFQQHLSKLKR 457
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 46 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 93
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 94 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 132
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 133 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 167
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 134 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 170
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 384 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 431
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 432 PEGKWSCPHCEKEGVQWEAKEEEEDYEEDGEEEGEKEEEDDHMEYCRVCKD-----GGEL 486
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 487 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 525
Query: 710 VLQNLL 715
+Q +L
Sbjct: 526 RVQKIL 531
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
GNLL CDGCP AFH +C + +P+G+WYC C + +R ++ A V G+
Sbjct: 432 GNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPEC--LMQRNNGSRNMAKLGRGAEVLGI 489
Query: 626 D 626
D
Sbjct: 490 D 490
>gi|397610251|gb|EJK60736.1| hypothetical protein THAOC_18861, partial [Thalassiosira oceanica]
Length = 578
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 6/40 (15%)
Query: 567 GGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 600
GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 448 GGDLIVCDGGDNEGGCGRAFHLECINLRTLPKGDWICKDC 487
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 896 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 953
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q ++C R++
Sbjct: 954 KTVQGLSPVDQ--RKCERLL 971
>gi|313247391|emb|CBY15642.1| unnamed protein product [Oikopleura dioica]
Length = 1498
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
GG LL C+ CPR +H +C + + IP GDW+C YC
Sbjct: 157 GGELLACESCPRVYHPKCLNPPQTEIPDGDWFCPYC 192
>gi|17562600|ref|NP_504523.1| Protein LET-418 [Caenorhabditis elegans]
gi|403399446|sp|G5EBZ4.1|LE418_CAEEL RecName: Full=Protein let-418; AltName: Full=Lethal protein 418
gi|11095333|gb|AAG29838.1| LET-418 [Caenorhabditis elegans]
gi|351020697|emb|CCD62685.1| Protein LET-418 [Caenorhabditis elegans]
Length = 1829
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 53/142 (37%), Gaps = 44/142 (30%)
Query: 568 GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
G LL CD CPRA+H C ++ P+GDW C +C ++H V+
Sbjct: 266 GELLLCDTCPRAYHTVCIDENMEEPPEGDWSCAHC---------IEHGPEVVKEEPAKQN 316
Query: 626 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
D C +C+ + +LLCD C FH C+
Sbjct: 317 DEF--------------------CKICKETE---------NLLLCDSCVCSFHAYCID-- 345
Query: 686 KMADLRELPKGKWFCCMDCSRI 707
L E+PK + + C C +
Sbjct: 346 --PPLTEVPKEETWSCPRCETV 365
>gi|400602572|gb|EJP70174.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
2860]
Length = 633
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 22/36 (61%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
LL CD C A+H C L IP GDWYC C ++FE
Sbjct: 140 LLLCDSCDAAYHTHCIGLDHIPDGDWYCIECAHLFE 175
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 35/152 (23%), Positives = 57/152 (37%), Gaps = 19/152 (12%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
DGC H C + S Q C C+ F R N V+ VS D ++
Sbjct: 52 DGCEHIIHDAC--IRSWAQKTNTCPICRTPFHSVRVY----NGVDGTAVSTYDVQDKKQV 105
Query: 634 RCIRIVKNL-EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 692
+ + L + C S +LLCD C+ +H C+ L
Sbjct: 106 AEFDVQQWLGDNAEEEEEATNPCPVCNSAEREDILLLCDSCDAAYHTHCIG------LDH 159
Query: 693 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 724
+P G W+ C++C+ + L ++++ PE
Sbjct: 160 IPDGDWY-CIECAHLFE-----LTEDSQNAPE 185
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 896 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 953
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q ++C R++
Sbjct: 954 KTVQGLSPVDQ--RKCERLL 971
>gi|302915931|ref|XP_003051776.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
gi|256732715|gb|EEU46063.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
Length = 1194
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 87/240 (36%), Gaps = 77/240 (32%)
Query: 491 ASPPLSFPNKSRWNITPKDQRLHKLVFDESGLP----DGTEVGYYACGQKLLEGYKNGLG 546
A+P L P K R + K + K +G+P DGT A Q + + ++
Sbjct: 703 ATPSLRPPKKQRTGLRVKSSPVKKRGGTAAGVPRAMGDGTATSAAAKDQ-ISDNDED--- 758
Query: 547 IICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGD----WYCKYCQN 602
C C + G+++ CDGCPR+FH EC + +P D WYC C
Sbjct: 759 --CSACGA------------AGDVVCCDGCPRSFHFECVGM--VPSEDLPDEWYCNEC-- 800
Query: 603 MFER--KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC----------- 649
+F+R R H A ++++E+ R + K L+ G
Sbjct: 801 LFKRYPSRVPVHKGVFGPA-----LNNLEKSIPRAFSLPKKLQTRFEGVKAGPDGEYEEV 855
Query: 650 ---------------------------LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 682
+LC GC K+ R I+ C C R +H+ CL
Sbjct: 856 TTAKTTKRKNGYEEVPDFFRQRDDGQPVLCHGC--QKAATDVRAIIPCSVCPRYWHIDCL 913
>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
Length = 209
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 565 ADGGNLLPCDG-CPRAFHKECASLSSIPQGD-WYCKYCQNMFERKR 608
DGG+LL CDG C R +H C +L+S+P+G+ W C YC E+ R
Sbjct: 62 GDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLCPYCARQKEKAR 107
>gi|322707184|gb|EFY98763.1| PHD and RING finger domain protein, putative [Metarhizium
anisopliae ARSEF 23]
Length = 651
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 23/36 (63%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
LL CD C A+H C L IP+GDWYC C ++F+
Sbjct: 152 LLLCDSCDAAYHTHCIGLDHIPEGDWYCMECAHLFQ 187
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 62/148 (41%), Gaps = 27/148 (18%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE--QI 631
DGC H C + S Q C C+ F R N V+ +S D ++ Q+
Sbjct: 63 DGCEHIIHDAC--IRSWAQKTNTCPICRTPFHCVRVY----NGVDGTAISTYDVIDKKQV 116
Query: 632 TKRCIR------IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 685
+ ++ IV E E + C +C + +LLCD C+ +H C+
Sbjct: 117 AEFDVQAWLGENIVDQEEEECNPCPICNSAERED------ILLLCDSCDAAYHTHCIG-- 168
Query: 686 KMADLRELPKGKWFCCMDCSRINSVLQN 713
L +P+G W+ CM+C+ + + Q+
Sbjct: 169 ----LDHIPEGDWY-CMECAHLFQLTQD 191
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 48/80 (60%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 649 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 706
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q ++C R++
Sbjct: 707 KTVQGLSPVDQ--RKCERLL 724
>gi|346327633|gb|EGX97229.1| PHD and RING finger domain protein, putative [Cordyceps militaris
CM01]
Length = 754
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 619
LL CD C A+H C L IP GDWYC C + FE Q+ + V++
Sbjct: 256 LLLCDSCDAAYHTHCLGLDHIPDGDWYCMECAHAFELTEESQNGSQPVDS 305
Score = 39.3 bits (90), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 54/132 (40%), Gaps = 20/132 (15%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 633
DGC H C + S Q C C+ F R + + + +V+ D V+Q
Sbjct: 176 DGCEHIIHDTC--IRSWAQKTNTCPICRTPFHSVRVY-NGLDVKDKKQVAEFD-VQQW-- 229
Query: 634 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 693
+ N E E C C+ ++ +LLCD C+ +H CL L +
Sbjct: 230 ----LGDNAEEEDEVSNPCPVCNSAERE---DILLLCDSCDAAYHTHCLG------LDHI 276
Query: 694 PKGKWFCCMDCS 705
P G W+ CM+C+
Sbjct: 277 PDGDWY-CMECA 287
>gi|397574031|gb|EJK48991.1| hypothetical protein THAOC_32170 [Thalassiosira oceanica]
Length = 884
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%), Gaps = 6/40 (15%)
Query: 567 GGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 600
GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 754 GGDLIVCDGGDNEGGCGRAFHLECINLRTLPEGDWICKDC 793
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 60/154 (38%), Gaps = 42/154 (27%)
Query: 566 DGGNLLPCDGCPRAFHKECASLS---SIPQGDWYCKYC---------------------- 600
+ G++L CDGCPR+FH EC +L+ +P DWYC C
Sbjct: 904 NAGDVLCCDGCPRSFHFECVNLTQSEDLPD-DWYCSECIMRRFPSRVPIHKGAFAPALNA 962
Query: 601 -----QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-------ELSG 648
F + +Q+ V+AG G D E + K R E + +
Sbjct: 963 LEKSIPRAFSLPKHIQNRFEGVKAG--PGGDYEEIVGKTVKRRTGFDETPDLFKQRDENQ 1020
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 682
+LC C KS R IL C C +H+ CL
Sbjct: 1021 PVLCHAC--QKSSNDTRAILPCSLCSYYWHLDCL 1052
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 325 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 372
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 373 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 426
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 427 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 455
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 422 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 458
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + D + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|46124757|ref|XP_386932.1| hypothetical protein FG06756.1 [Gibberella zeae PH-1]
Length = 717
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDS-V 628
LL CD C A+H C L +IP GDWYC C ++F + + EAG S S V
Sbjct: 176 LLLCDSCDAAYHTHCIGLEAIPDGDWYCMECAHLF----HMVDEPETTEAGESSPRPSYV 231
Query: 629 EQITKRCIR 637
+ R +R
Sbjct: 232 RRPNPRNVR 240
Score = 40.0 bits (92), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 55/148 (37%), Gaps = 15/148 (10%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD--SVEQI 631
DGC H C + S + C C+ F R N V+ +S D +Q+
Sbjct: 87 DGCDHIIHDAC--IRSWAKKTNTCPICRCPFHSVRVY----NGVDGTAISKYDVQDKKQV 140
Query: 632 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 691
+ +R E C S +LLCD C+ +H C+ L
Sbjct: 141 AEFDVRQWLGENPEEDEEEQGNPCPICNSSEREDVLLLCDSCDAAYHTHCIG------LE 194
Query: 692 ELPKGKWFCCMDCSRINSVLQNLLVQEA 719
+P G W+ CM+C+ + ++ EA
Sbjct: 195 AIPDGDWY-CMECAHLFHMVDEPETTEA 221
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 323 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 370
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 371 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 424
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 425 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 453
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 420 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 62/164 (37%), Gaps = 35/164 (21%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C + P+G W
Sbjct: 368 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHMVCLDPDMERAPEGTWS 415
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q +A R + E+ E + CR C
Sbjct: 416 CPHCE-----KEGIQWEA------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC- 463
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L ++P G+W C
Sbjct: 464 --KDG---GELLCCDTCPSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 42.7 bits (99), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 465 DGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 322 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 369
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 370 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 423
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 424 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 452
Score = 39.7 bits (91), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 419 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 455
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
G LL CDGCPR +H +C + L+ +P+GDW+C C
Sbjct: 728 GELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPAC 762
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 372 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 419
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 420 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 466
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 467 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 513
Query: 710 VLQNLL 715
+Q +L
Sbjct: 514 RVQKIL 519
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 381 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 428
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 429 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 483
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 484 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 522
Query: 710 VLQNLL 715
+Q +L
Sbjct: 523 RVQKIL 528
>gi|170592228|ref|XP_001900871.1| CHD4 protein [Brugia malayi]
gi|158591738|gb|EDP30342.1| CHD4 protein, putative [Brugia malayi]
Length = 1846
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 265 GGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCES--------------------TG 304
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
++ K +I N+E CR C + G+ +L CD C +H CL
Sbjct: 305 ATKDDEEEK---KITTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 347
Query: 685 HKMADLRELPKGKWFC 700
L E+P+G W C
Sbjct: 348 ---PSLTEIPEGDWSC 360
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 327 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 363
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 72/186 (38%), Gaps = 42/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 433 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 480
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 481 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 534
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 535 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 573
Query: 710 VLQNLL 715
+Q +L
Sbjct: 574 RVQKIL 579
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + D + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 333 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 380
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 381 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 435
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 436 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 474
Query: 710 VLQNLL 715
+Q +L
Sbjct: 475 RVQKIL 480
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 336 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 383
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 384 CPHCEK--------------------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC- 422
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 423 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 424 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG+LL CD CP +H+ C + L SIP+GDW C C
Sbjct: 16 DGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 41/170 (24%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 323 TGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 370
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G + + + C R+ K+ G L
Sbjct: 371 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGEL 424
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 425 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 453
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 420 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 362 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 409
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 410 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 456
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 457 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 503
Query: 710 VLQNLL 715
+Q +L
Sbjct: 504 RVQKIL 509
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 198 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 245
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 246 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 300
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 301 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 329
Score = 39.7 bits (91), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 296 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 364 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 411
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 412 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 458
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 459 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 495
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 462 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 425 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 472
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 473 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 527
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 528 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 566
Query: 710 VLQNLL 715
+Q +L
Sbjct: 567 RVQKIL 572
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 377 AGEEEIDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 424
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + + E G G E R+ K+ G L
Sbjct: 425 PEGKWSCPHCEKEGVQWEAKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 479
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 480 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 518
Query: 710 VLQNLL 715
+Q +L
Sbjct: 519 RVQKIL 524
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 424 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 471
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 472 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 518
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 519 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 565
Query: 710 VLQNLL 715
+Q +L
Sbjct: 566 RVQKIL 571
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 56/164 (34%), Gaps = 44/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 316 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 363
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ + E E CR C
Sbjct: 364 CPHCEK--------------------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC- 402
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 403 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 437
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 404 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 440
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 411 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 458
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 459 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 505
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 506 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 552
Query: 710 VLQNLL 715
+Q +L
Sbjct: 553 RVQKIL 558
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 6/63 (9%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
DGGNL+ CDGCP FH C L ++P W C C F H+ ++ +A +
Sbjct: 1013 GDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCSNCSCKF------CHEHSSDDAEDTAD 1066
Query: 625 VDS 627
VDS
Sbjct: 1067 VDS 1069
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 56/116 (48%), Gaps = 12/116 (10%)
Query: 711 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 770
LQNLL + + PE+ +++ E V +D R E ++ A+++
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDKR----------VECNSKIAVALSL 1291
Query: 771 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRV 824
+CF PIVD +G +LI ++VY G N +F G Y IL +++A +R+
Sbjct: 1292 MDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRL 1347
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 341 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 388
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 389 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 443
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 444 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 472
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 439 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 475
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 272 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 319
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 320 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 374
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 375 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 403
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 370 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 406
>gi|195479715|ref|XP_002100999.1| GE17369 [Drosophila yakuba]
gi|194188523|gb|EDX02107.1| GE17369 [Drosophila yakuba]
Length = 2002
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1731 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1781
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1782 RTVPDGRWSCERCCVCMRC 1800
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|405972707|gb|EKC37461.1| Zinc finger protein ubi-d4 [Crassostrea gigas]
Length = 591
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 59/150 (39%), Gaps = 27/150 (18%)
Query: 570 LLPCDGCPRAFHKECAS--LSSIPQ--------------GDWYCKYCQNMFERKRFLQHD 613
LL CD C R +H C + LS P+ + YC +C E +
Sbjct: 433 LLFCDDCDRGYHMYCLNPPLSEPPEEKSGRSGMDKRDISANNYCDFCLGDSEENKKSNQP 492
Query: 614 ANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 670
V + GR SG + Q T I VK + C+ C+ C + +L C
Sbjct: 493 EELVSCSDCGR-SGHPTCLQFTANMIISVKKYPWQ---CIECKSCGLCGTSDNDDQLLFC 548
Query: 671 DQCEREFHVGCLKKHKMADLRELPKGKWFC 700
D C+R +H+ CL L E P+G W C
Sbjct: 549 DDCDRGYHMYCLN----PPLSEPPEGNWSC 574
>gi|358336343|dbj|GAA54879.1| tyrosine-protein kinase BAZ1B, partial [Clonorchis sinensis]
Length = 1921
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
+D NLL CDGC RAFH C L +P GDWYC C+
Sbjct: 1433 SDDDNLLLCDGCNRAFHLYCLRPPLRRVPAGDWYCPSCR 1471
Score = 45.1 bits (105), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 38/74 (51%), Gaps = 14/74 (18%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
CIR K++E + C +CR KS +LLCD C R FH+ CL+ LR +P
Sbjct: 1414 CIRWEKSVED--ARCRICR----HKSDDD--NLLLCDGCNRAFHLYCLR----PPLRRVP 1461
Query: 695 KGKWFC--CMDCSR 706
G W+C C SR
Sbjct: 1462 AGDWYCPSCRPASR 1475
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 61/164 (37%), Gaps = 43/164 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 338 DGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELEKAPEGKWS 385
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ K +Q + + E+ ++E CR C
Sbjct: 386 CPHCE-----KEGIQWEPKDDDD-------EEEEGGCEEEEEDDHME-------FCRVC- 425
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 426 --KDG---GELLCCDACPSSYHLHCLN----PPLPEIPNGEWLC 460
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 427 DGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 463
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 414 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 460
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 461 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/37 (62%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 338
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 352 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 399
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 400 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 454
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 455 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 493
Query: 710 VLQNLL 715
+Q +L
Sbjct: 494 RVQKIL 499
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 360 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 407
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 408 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 454
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
+C+ +L CD C +H+ CL L ++P G+W C C +
Sbjct: 455 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 501
Query: 710 VLQNLL 715
+Q +L
Sbjct: 502 RVQKIL 507
>gi|393911013|gb|EJD76123.1| LET-418 protein [Loa loa]
Length = 1755
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 179 GGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES--------------------TG 218
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
++ K ++ N+E CR C + G+ +L CD C +H CL
Sbjct: 219 APKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 261
Query: 685 HKMADLRELPKGKWFC 700
L E+P+G W C
Sbjct: 262 ---PSLTEIPEGDWSC 274
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 241 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 464 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 361 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 408
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + D E G G E R+ K+ G L
Sbjct: 409 PEGKWSCPHCEKEGVQWEAKEEDEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 463
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 464 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 492
Score = 39.3 bits (90), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 459 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 495
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 517 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 564
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 565 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 619
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 620 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 648
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 615 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 651
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 48/136 (35%), Gaps = 47/136 (34%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CPRA+H C L P+G W C +C+ E +
Sbjct: 360 GGEIILCDTCPRAYHLVCLDPELEETPEGKWSCPHCE---------------AEGTQEQD 404
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D + + C + G LLC CD C +H+ CL
Sbjct: 405 DDEHNEFCRLC---------KDGGELLC-----------------CDSCTSAYHIFCLN- 437
Query: 685 HKMADLRELPKGKWFC 700
L E+P G W C
Sbjct: 438 ---PPLSEIPDGDWKC 450
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 417 DGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 68/186 (36%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G+ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 460 AGEDEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 507
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 508 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 562
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 563 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 601
Query: 710 VLQNLL 715
+Q +L
Sbjct: 602 RVQKIL 607
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 18/73 (24%)
Query: 546 GIICHCCNSEVSPSQFEAH-----------------ADGGNLLPCDGCPRAFHKECASLS 588
G + NS + P+ F A ++GG+LL C+ CP AFH+EC ++
Sbjct: 1109 GTVILASNSMICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1167
Query: 589 SIPQGDWYCKYCQ 601
+P+G WYC C+
Sbjct: 1168 DMPEGSWYCNDCK 1180
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C SL+ P G W C + Q
Sbjct: 1555 GDGGQLVSCKRPGCPKVYHADCLSLTRRPAGKWECPWHQ 1593
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 602
GG+L+ C+ CP A+H +C S+P G+W+C+ C N
Sbjct: 1004 GGDLICCESCPAAYHAKCLGFDSVPDGNWFCRDCVN 1039
Score = 39.3 bits (90), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 599
A+GG L CD CP+A+H +C L+ P G W C +
Sbjct: 1420 AEGGELTMCDVKTCPKAYHLDCLGLTKQPYGKWQCPW 1456
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 69/186 (37%), Gaps = 41/186 (22%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 366 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 413
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 414 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 468
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINS 709
LC CD C +H+ CL L ++P G+W C C +
Sbjct: 469 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKG 507
Query: 710 VLQNLL 715
+Q +L
Sbjct: 508 RVQKIL 513
>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
Length = 3661
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 27/159 (16%)
Query: 566 DGGNLLPCDG-CPRAFHKECASLSSIPQGD-WYCKYCQNMFER-----KRFLQHDANAVE 618
DGG + CDG C R+FH C + P+ D W C C N ++ K+ + D++
Sbjct: 2745 DGGVTIMCDGPCQRSFHPACLGMDDNPEEDPWMCNRCMNKVQKCLECGKKGSEMDSHN-R 2803
Query: 619 AGRVSGVDSVEQIT-------KRCIRIVKNLEAELS--GCL-----LCRGCDFSKSGFGP 664
A ++ G S Q++ K C+ + S G C C + + GP
Sbjct: 2804 AVKIPGGVSRCQLSSCGRYYHKECLDKITPNRTSYSKEGNFKCPQHFCIDCGKTSTNLGP 2863
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 703
RT++ C +C + CLK R + KGKW C D
Sbjct: 2864 RTLVKCLRCAKARCPDCLKT-----ARYVKKGKWMVCSD 2897
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 64/132 (48%), Gaps = 17/132 (12%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 726
+L+CD C+ E+H+ CL+ L +PKG+W C + C+ + Q L + E
Sbjct: 551 MLVCDTCDAEYHLKCLR------LSSVPKGQWLCPI-CTVMLRKGQTLFSHQT----EVE 599
Query: 727 LNAIKKYAGNSLETVSDID--VRWRLLSGKAATPETRLLLSQAVAI--FHDCFD--PIVD 780
+ + ++E V ++ ++W LS + T ETR L+ AI FH D P+
Sbjct: 600 KAKLSQMPQPTVEVVDELKYLIKWSGLSYQFCTWETREELNNDGAIDRFHKLNDHPPLSP 659
Query: 781 SISGRDLIPSMV 792
+S +L+ ++
Sbjct: 660 PMSEEELMRTLA 671
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE 629
+L CD C +H +C LSS+P+G W C C M + + L VE ++S
Sbjct: 551 MLVCDTCDAEYHLKCLRLSSVPKGQWLCPICTVMLRKGQTLFSHQTEVEKAKLS------ 604
Query: 630 QITKRCIRIVKNL 642
Q+ + + +V L
Sbjct: 605 QMPQPTVEVVDEL 617
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 64/169 (37%), Gaps = 40/169 (23%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 364 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 411
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 651
+G W C +C+ K +Q +A E + + + + C +
Sbjct: 412 EGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRV 458
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 459 CKDGG---------ELLCCDTCISSYHIHCLN----PPLPDIPNGEWLC 494
>gi|332018342|gb|EGI58947.1| Bromodomain adjacent to zinc finger domain protein 1A [Acromyrmex
echinatior]
Length = 1453
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 70/169 (41%), Gaps = 40/169 (23%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRF------ 609
D N+L CDGC R H C L+++P GDW+C C + +RKRF
Sbjct: 1082 DAENMLLCDGCNRGHHLYCLKPKLTAVPAGDWFCTACRPPEIKPKEKTQKRKRFEDEIEE 1141
Query: 610 --------LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 661
Q+ A + + +S ++ + I LE LC C K+G
Sbjct: 1142 ETMLTKETRQNRAKRIHSDDEDDQESDDEDDESEEDINIRLEN------LCASC---KNG 1192
Query: 662 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 710
++ CD C FH+ C++ L P+G+W C + + N++
Sbjct: 1193 ---GKLIACDTCPNRFHLECVE----PPLSRAPRGRWSCTICKKKKNAI 1234
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 259 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 316
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 317 KTAQGLSPVDQ--RKCERLL 334
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 308 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSC 344
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 69/185 (37%), Gaps = 41/185 (22%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 364 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 411
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 651
+G W C +C+ + + + E G G E R+ K+ G LL
Sbjct: 412 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELL 466
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 710
C CD C +H+ CL L ++P G+W C C +
Sbjct: 467 C-----------------CDTCISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 505
Query: 711 LQNLL 715
+Q +L
Sbjct: 506 VQKIL 510
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 755 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDL--SKPEVEYDCDNLQHSKKG 812
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 813 KTAQGLSPVDQ--RKCERLL 830
>gi|312077956|ref|XP_003141528.1| CHromoDomain protein family member [Loa loa]
Length = 1696
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 42/136 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CP+A+H C + P+G W C C++ +G
Sbjct: 120 GGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES--------------------TG 159
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
++ K ++ N+E CR C + G+ +L CD C +H CL
Sbjct: 160 APKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 202
Query: 685 HKMADLRELPKGKWFC 700
L E+P+G W C
Sbjct: 203 ---PSLTEIPEGDWSC 215
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 182 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 218
>gi|336270508|ref|XP_003350013.1| hypothetical protein SMAC_00903 [Sordaria macrospora k-hell]
gi|380095404|emb|CCC06877.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 990
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G ++ CDGC +A H++C + +P+GDWYCK C
Sbjct: 444 GNQIVFCDGCDKAVHQKCYGIPRLPRGDWYCKEC 477
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 341 DGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 377
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 49.3 bits (116), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 338
>gi|401402451|ref|XP_003881253.1| hypothetical protein NCLIV_042870 [Neospora caninum Liverpool]
gi|325115665|emb|CBZ51220.1| hypothetical protein NCLIV_042870 [Neospora caninum Liverpool]
Length = 476
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query: 552 CNSEVSPSQFEAHADGGN-LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 610
C S+ Q + +G + +L CDGC A H+ C + ++P+ DWYC+YC+
Sbjct: 133 CQSDDVSKQASSEQEGHDEILLCDGCDVAVHQTCYYVETVPKADWYCQYCE--------- 183
Query: 611 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL-- 668
D N +A V + ++ K + K +EA L D S P +L
Sbjct: 184 --DRNQAQA----NVTKLRRLAKSSGKTDKQVEATFRTEL-----DRMASAKEPFCVLPK 232
Query: 669 LCDQCEREF--HVGCLKKHKM 687
C C R F HV C + +M
Sbjct: 233 RCPLCPRSFGAHVRCGEDFRM 253
>gi|328785896|ref|XP_003250672.1| PREDICTED: hypothetical protein LOC725681 [Apis mellifera]
Length = 2891
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 79/209 (37%), Gaps = 53/209 (25%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 566
C + L + KN + I C CN V PS Q AD
Sbjct: 2674 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2733
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV-SGV 625
+L CD C R +H C L +PQG W+C+ C + + ++ E G + S
Sbjct: 2734 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC--------AVCANCSSREPGGINSDR 2785
Query: 626 DSVEQITKRCIRIVKNLEAELSG-CLLC-------RGCDF-SKSGFGPR-----TILLCD 671
+SV Q + KN +S C+ C R C S+ PR ++ C
Sbjct: 2786 NSVAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEANLVHCS 2845
Query: 672 QCEREFHVGCLKKHKMADLRELPKGKWFC 700
C++ H+GC++ M L + + C
Sbjct: 2846 ACDKYLHLGCVETKGMP----LDRKNYLC 2870
Score = 42.7 bits (99), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2705 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 2758
Query: 695 KGKWFC-----CMDCS 705
+G+W C C +CS
Sbjct: 2759 QGRWHCQECAVCANCS 2774
>gi|429328961|gb|AFZ80720.1| hypothetical protein BEWA_001270 [Babesia equi]
Length = 238
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 44/93 (47%), Gaps = 12/93 (12%)
Query: 649 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRIN 708
C +C GCD S T+L+CD C+R FH+ C K+ E+PKG WF C DCS
Sbjct: 75 CKICAGCD-DNSSKRDHTMLICDACDRSFHMECTKEK----YSEVPKGAWF-CDDCSICQ 128
Query: 709 SVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV 741
L +E+ + L GN L TV
Sbjct: 129 ICDIKLTERESNNPTNYSLE------GNKLCTV 155
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 401 NGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 460
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 461 AQGLSPVDQ-----RKCERLL 476
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 347
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
S ++++ L AE G C G +L C C FH C
Sbjct: 348 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 397
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|198433831|ref|XP_002121767.1| PREDICTED: similar to zinc finger, MYND-type containing 8 [Ciona
intestinalis]
Length = 1878
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 605
G +L C+ CPR FH +C + S P+GDW+C C+ + E
Sbjct: 269 GEVLCCELCPRVFHAKCLRMQSEPEGDWFCPECEKITE 306
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 64/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 333 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 380
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 381 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 427
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 428 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 464
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 431 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 348
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
S ++++ L AE G C G +L C C FH C
Sbjct: 349 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 398
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 59/148 (39%), Gaps = 44/148 (29%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG L+ CD CP +FH +C L +P W C+ C L+ +++ +E G G
Sbjct: 1010 GGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLC--------VLEAESSPLE-GCSDG 1060
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
DS + RC + + ++LCD C FH+ C
Sbjct: 1061 TDSHCDVCARCYKHGQ--------------------------LILCDVCPLAFHLRCTD- 1093
Query: 685 HKMADLRELPKGKW---FCCMDCSRINS 709
L ++P GKW C DC ++S
Sbjct: 1094 ---PPLLKVPSGKWTCQICVKDCQPVSS 1118
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 67/161 (41%), Gaps = 30/161 (18%)
Query: 553 NSEVSPSQFEAHADG-------GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNM 603
+ E + S+ ++H D G L+ C+ CP A+H +CA+ L IP G W C+ C +
Sbjct: 899 DDESNNSEEDSHCDECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVCTDK 958
Query: 604 FERK----RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 659
++K +F + S + + IV + ++ C CR +
Sbjct: 959 SQKKPTGIKFKGKHRKGLLPTSSSPSSLSSDL--ETLGIVADGHSDR--CARCR-----R 1009
Query: 660 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
G ++LCD C FH+ C+ L +P W C
Sbjct: 1010 GG----ELILCDSCPLSFHLDCVD----PPLLGVPPDIWLC 1042
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 28/60 (46%), Gaps = 8/60 (13%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC------QNMFERKRFLQHDANA 616
D +L CDGC R H C + SIP GDWYC C QN R++ D ++
Sbjct: 785 GDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDCRPKIVKQNSRRRRKSTLEDYDS 844
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|307172331|gb|EFN63819.1| Bromodomain adjacent to zinc finger domain protein 1A [Camponotus
floridanus]
Length = 1460
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 37/158 (23%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC--------QNMFERKRFLQHDAN 615
D N+L CDGC R H C L+++P GDW+C C + +RKRF +
Sbjct: 1093 DAENMLLCDGCNRGHHLYCLKPKLNAVPAGDWFCTACRPPEIKLKEKAQKRKRFEDEIED 1152
Query: 616 AVEAGRVSGVDSVEQITKRCIRIVK-------------NLEAELSGCLLCRGCDFSKSGF 662
V + + + ++I + + N+ E + C LC KSG
Sbjct: 1153 EVILTKETRHNRAKRIPQSDDENDQEDDEDDEDSEEDINMRLE-NLCALC------KSG- 1204
Query: 663 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ CD C +H+ C++ L P+G+W C
Sbjct: 1205 --GKVISCDTCPNYYHLECVE----PPLSRAPRGRWSC 1236
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
GG ++ CD CP +H EC LS P+G W C C+
Sbjct: 1204 GGKVISCDTCPNYYHLECVEPPLSRAPRGRWSCSKCK 1240
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 616
+GG L+ CD CP+ FH C SL++ P G+WYC C+++ + +Q++ NA
Sbjct: 705 NGGELICCDKCPKVFHLSCHVPSLTASPSGEWYCTLCRDLNSPE--MQYNVNA 755
>gi|307110583|gb|EFN58819.1| hypothetical protein CHLNCDRAFT_19495 [Chlorella variabilis]
Length = 176
Score = 49.3 bits (116), Expect = 0.010, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+GG LL CDGC A+H C +L + P GDW+C C
Sbjct: 20 EGGELLCCDGCTAAYHFSCVNLDAAPPGDWFCPLC 54
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 344
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
S ++++ L AE G C G +L C C FH C
Sbjct: 345 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 394
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 45/76 (59%), Gaps = 6/76 (7%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
+GG LL CD CP+ FH C +L P G+W+C +C+++ + ++++ N+ ++
Sbjct: 714 NGGELLCCDRCPKVFHLSCHIPALHEPPSGEWFCSFCRDLVSPE--MEYNCNSNDSPVSD 771
Query: 624 GVDSVEQITKRCIRIV 639
G +++ ++C R++
Sbjct: 772 GFPPIDR--RKCERLL 785
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G++ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 356 AGEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 403
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ + + + E G G E R+ K+ G L
Sbjct: 404 PEGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGEL 458
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
LC CD C +H+ CL L ++P G+W C
Sbjct: 459 LC-----------------CDACISSYHIHCLN----PPLPDIPNGEWLC 487
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 454 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
DGG L+ CDGCPRAFH C S L IP G W C C LQ GRV
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--------GRVQ 343
Query: 624 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
S ++++ L AE G C G +L C C FH C
Sbjct: 344 QNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCAAAFHWRC 393
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|356540950|ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800973 [Glycine max]
Length = 1735
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
G L+ CDGCP AFH C ++S +P+GDWYC C
Sbjct: 687 GCLICCDGCPAAFHSRCVGIASGHLPEGDWYCPEC 721
>gi|242011982|ref|XP_002426722.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
gi|212510893|gb|EEB13984.1| bromodomain adjacent to zinc finger protein domain 1, baz1, putative
[Pediculus humanus corporis]
Length = 1196
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
DG N+L CD C R FH C LSS+P GDW+C C+
Sbjct: 1084 GDGENMLLCDSCDRGFHLYCLKPKLSSVPLGDWFCSGCR 1122
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C + G +LLCD C+R FH+ CLK L +P G WFC
Sbjct: 1077 CKVCRRGGDGENMLLCDSCDRGFHLYCLK----PKLSSVPLGDWFC 1118
>gi|118384512|ref|XP_001025404.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89307171|gb|EAS05159.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 1453
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 608
G +L CD CP FH +C L S+P GDW C CQ ++R
Sbjct: 1401 GEVLMCDTCPSVFHLKCIGLKSLPDGDWSCLECQQKLLKQR 1441
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 63/166 (37%), Gaps = 37/166 (22%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
EGY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 43 EGYETDHQDYCEVCQQ------------GGEIMLCDTCPRAYHLVCLDPELEEAPEGSWS 90
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVD-SVEQITKRCIRIVKNLEAELSGCLLCR-G 654
C +C+ K + + G+ +G + ++ K+ + C C G
Sbjct: 91 CPHCE-----KEGISMGSQV--EGKATGTKMAPDKSAKQVAAASPEKDEHQEFCTECHDG 143
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
D ++ C+ C +H+ CL + L +P+G W C
Sbjct: 144 GD----------LICCENCPVSYHLDCL----IPPLTNIPEGVWLC 175
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/37 (43%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG+L+ C+ CP ++H +C L++IP+G W C C
Sbjct: 142 DGGDLICCENCPVSYHLDCLIPPLTNIPEGVWLCPRC 178
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 748 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 807
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 808 AQGLSPVDQ-----RKCERLL 823
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 63/170 (37%), Gaps = 40/170 (23%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSI 590
G+ ++GY+ C C GG ++ CD CPRA+H C L
Sbjct: 241 AGEDEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRA 288
Query: 591 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 650
P+G W C +C+ K +Q +A E + + + + C
Sbjct: 289 PEGKWSCPHCE-----KEGVQWEAKEEE--------EEYEEEGEEEGEKEEEDDHMEYCR 335
Query: 651 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+C+ +L CD C +H+ CL L ++P G+W C
Sbjct: 336 VCKDGG---------ELLCCDACISSYHIHCLN----PPLPDIPNGEWLC 372
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD C ++H C + L IP G+W C C
Sbjct: 339 DGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 375
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 340
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG L+ CDGCPR+FH C L+ IP G W C C
Sbjct: 37 DGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDAC 73
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 26/36 (72%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G+LL CDGCP +FH++C ++ +P+G W C C+ +
Sbjct: 1063 GDLLCCDGCPGSFHRQCIGVARLPEGKWLCPECKTV 1098
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 73/194 (37%), Gaps = 46/194 (23%)
Query: 562 EAHADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQNMFERKRFL---QHDA 614
+A DGG L+ CD CP +FH +C LS IP G+W C C+ +++ + +
Sbjct: 62 DACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACRCAMKKENSIGNKRKKK 121
Query: 615 NAVEAGRVS----------------------GVDSVEQIT------KRCIRIVKNLEAEL 646
NA+E ++ G D ++ ++ I +
Sbjct: 122 NALEVLALAASLVNPKEFELPRELQIPITFPGTDKIDPVSFKRGKHHNSNNINGKIHYHE 181
Query: 647 SGC---LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 703
G L R C + ++ CD C FH CL L P G+W C
Sbjct: 182 YGSITPLPARLCFVCRKSCRKAPLIACDYCPLYFHQDCLD----PPLTAFPSGRWMC--- 234
Query: 704 CSRINSVL-QNLLV 716
+ +N + QNLL
Sbjct: 235 PNHLNHFIDQNLLT 248
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|223996275|ref|XP_002287811.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976927|gb|EED95254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1562
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
+GG+L+ CD CPR FH C + S+P+G+W+C C
Sbjct: 311 EGGDLICCDNCPRVFHSNCHIPKIYSLPEGEWFCMLC 347
>gi|449456180|ref|XP_004145828.1| PREDICTED: uncharacterized protein LOC101215849 [Cucumis sativus]
gi|449510841|ref|XP_004163779.1| PREDICTED: uncharacterized LOC101215849 [Cucumis sativus]
Length = 1719
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 625
G+LL CDGCP A+H C + IPQG WYC C + +A+ V G+
Sbjct: 438 GSLLCCDGCPSAYHLRCIGMVKVLIPQGPWYCPECS--INKSEPTITKGSALRGAEVFGI 495
Query: 626 DSVEQI 631
D E I
Sbjct: 496 DPYEHI 501
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG+LL C+ CP AFH +C L +P+G W+C+ C
Sbjct: 908 GGDLLCCEMCPAAFHPQCLGLEDLPEGTWFCRDC 941
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 899 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 958
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 959 AQGLSPVDQ-----RKCERLL 974
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 27/48 (56%), Gaps = 7/48 (14%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-----QNMFER 606
DGG L+ CDGCPRAFH C S L IP G W C C Q+M R
Sbjct: 435 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQATVQDMRPR 482
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 800 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 859
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 860 AQGLSPVDQ-----RKCERLL 875
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 812 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 871
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 872 AQGLSPVDQ-----RKCERLL 887
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1418 SEGGSLLCCEACPAAFHRECLNM-EMPQGSWFCNDCK 1453
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1829 GDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 1867
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL CD CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 687 NGGELLCCDKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHSSEK 744
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + + KR C R++
Sbjct: 745 RKADGLVKLTPVDKRKCERLL 765
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 261 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 297
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 762 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 819
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 820 KTAQGLSPVDQ--RKCERLL 837
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 979 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1038
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 1039 AQGLSPVDQ-----RKCERLL 1054
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 389 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 425
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 894 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 953
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 954 AQGLSPVDQ-----RKCERLL 969
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 780 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 839
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 840 AQGLSPVDQ-----RKCERLL 855
>gi|442616991|ref|NP_001259720.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
gi|440216957|gb|AGB95560.1| enhancer of yellow 3, isoform D [Drosophila melanogaster]
Length = 2011
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1735 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1785
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1786 RTVPDGRWSCERCCFCMRC 1804
>gi|442616989|ref|NP_001259719.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
gi|440216956|gb|AGB95559.1| enhancer of yellow 3, isoform C [Drosophila melanogaster]
Length = 2012
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1736 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1786
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1787 RTVPDGRWSCERCCFCMRC 1805
>gi|45550083|ref|NP_608334.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|442616987|ref|NP_001259718.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|442616993|ref|NP_001259721.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
gi|62901062|sp|Q9VWF2.3|SAYP_DROME RecName: Full=Supporter of activation of yellow protein; AltName:
Full=Protein enhancer of yellow 3
gi|45447061|gb|AAF48990.3| enhancer of yellow 3, isoform A [Drosophila melanogaster]
gi|257153436|gb|ACV44475.1| LD27440p [Drosophila melanogaster]
gi|440216955|gb|AGB95558.1| enhancer of yellow 3, isoform B [Drosophila melanogaster]
gi|440216958|gb|AGB95561.1| enhancer of yellow 3, isoform E [Drosophila melanogaster]
Length = 2006
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1730 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1780
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1781 RTVPDGRWSCERCCFCMRC 1799
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 450 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 509
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 510 AQGLSPVDQ-----RKCERLL 525
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 678 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 735
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 736 KTAQGLSPVDQ--RKCERLL 753
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 824 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 883
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 884 AQGLSPVDQ-----RKCERLL 899
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 767 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 826
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 827 AQGLSPVDQ-----RKCERLL 842
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 316 DGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 352
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 812 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 871
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 872 AQGLSPVDQ-----RKCERLL 887
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1417 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 1452
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1828 GDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1866
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 919 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 978
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 979 AQGLSPVDQ-----RKCERLL 994
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 894 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 953
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 954 AQGLSPVDQ-----RKCERLL 969
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 919 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 978
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 979 AQGLSPVDQ-----RKCERLL 994
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 776 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 833
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 834 KTAQGLSPVDQ--RKCERLL 851
>gi|402590896|gb|EJW84826.1| chromodomain-helicase-DNA-binding protein 4 [Wuchereria bancrofti]
Length = 1519
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 54/136 (39%), Gaps = 42/136 (30%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
GG ++ CD CP+A+H C + P+G W C C E+ +
Sbjct: 179 GGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTC-----------------ESTGAAK 221
Query: 625 VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 684
D E+ +I N+E CR C + G+ +L CD C +H CL
Sbjct: 222 DDEEEK------KITTNME-------YCRTC--KEGGW----LLCCDTCPSSYHAYCLN- 261
Query: 685 HKMADLRELPKGKWFC 700
L E+P+G W C
Sbjct: 262 ---PSLTEIPEGDWSC 274
Score = 45.1 bits (105), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
+GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 241 EGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 719 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 776
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 777 KTAQGLSPVDQ--RKCERLL 794
>gi|21391996|gb|AAM48352.1| LD10526p [Drosophila melanogaster]
Length = 1843
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1567 MPPRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCLG------L 1617
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C CM C
Sbjct: 1618 RTVPDGRWSCERCCFCMRC 1636
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 46/79 (58%), Gaps = 6/79 (7%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
+GG LL CD CP+ FH C +L+ P G+W+C +C+++ + +++D ++ +A
Sbjct: 718 NGGELLCCDKCPKVFHLACHIPTLNESPSGEWFCSFCRDLVSPE--MEYDCDSKDAPISE 775
Query: 624 GVDSVEQITKRCIRIVKNL 642
V++ ++C R++ L
Sbjct: 776 KFPPVDR--RKCERLLLRL 792
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 912 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 969
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 970 KTAQGLSPVDQ--RKCERLL 987
>gi|320170020|gb|EFW46919.1| hypothetical protein CAOG_04877 [Capsaspora owczarzaki ATCC 30864]
Length = 1096
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 7/63 (11%)
Query: 540 GYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 599
G + I+C C S S DG N+L CDGC A H+ C + S+P+G W+C
Sbjct: 507 GVEFNHDIVCDVCLSGDS-------EDGNNILFCDGCNLAVHQACYGVESVPEGAWFCYP 559
Query: 600 CQN 602
C +
Sbjct: 560 CAH 562
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 809 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 868
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 869 AQGLSPVDQ-----RKCERLL 884
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1429 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 1464
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1840 GDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1878
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 908 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 965
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 966 KTAQGLSPVDQ--RKCERLL 983
>gi|380029720|ref|XP_003698514.1| PREDICTED: uncharacterized protein LOC100870597 [Apis florea]
Length = 3312
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 566
C + L + KN + I C CN V PS Q AD
Sbjct: 3095 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 3154
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+L CD C R +H C L +PQG W+C+ C
Sbjct: 3155 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 3188
Score = 42.7 bits (99), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 11/76 (14%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 3126 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVP 3179
Query: 695 KGKWFC-----CMDCS 705
+G+W C C +CS
Sbjct: 3180 QGRWHCQECAVCANCS 3195
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 841 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 900
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 901 AQGLSPVDQ-----RKCERLL 916
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 308 DGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSC 344
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 895 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 954
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 955 AQGLSPVDQ-----RKCERLL 970
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 910 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 967
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 968 KTAQGLSPVDQ--RKCERLL 985
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 856 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 913
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 914 KTAQGLSPVDQ--RKCERLL 931
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 290 DGGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRC 326
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 908 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 965
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 966 KTAQGLSPVDQ--RKCERLL 983
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + + +
Sbjct: 858 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDCDNSQHSKKG 915
Query: 622 --VSGVDSVEQITKRCIRIV 639
V G+ V+Q+ +C R++
Sbjct: 916 KTVQGLSPVDQM--KCERLL 933
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 44/79 (55%), Gaps = 6/79 (7%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
+GG LL CD CP+ FH C +L+ P G+W+C +C+++ + +++D N +
Sbjct: 722 NGGELLCCDKCPKVFHLSCHIPALNESPSGEWFCSFCRDLLNPE--MEYDCNRQDRPPSE 779
Query: 624 GVDSVEQITKRCIRIVKNL 642
VE+ ++C R++ L
Sbjct: 780 KFPLVER--RKCERLLLRL 796
>gi|219110357|ref|XP_002176930.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411465|gb|EEC51393.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 2413
Score = 48.5 bits (114), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G+LL CDGC H +C L+S P+GDW+C+ C
Sbjct: 1904 GDLLCCDGCANVVHGKCIGLTSFPEGDWFCEEC 1936
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 11/85 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 678 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 735
Query: 620 GRVSGVDSVEQITKRC---IRIVKN 641
+ G+ + I KR +I+KN
Sbjct: 736 KKTEGLVKLTPIDKRVPDYYKIIKN 760
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 32/45 (71%), Gaps = 2/45 (4%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 609
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+ +R RF
Sbjct: 549 SEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDCK-AGKRPRF 591
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Query: 568 GNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 600
G+LL CDG C AFH +C LS+ P+G ++C+ C
Sbjct: 389 GDLLACDGHCYGAFHPQCIGLSAAPKGKFFCREC 422
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 527 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 586
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 587 AQGLSPVDQ-----RKCERLL 602
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 527 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 586
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 587 AQGLSPVDQ-----RKCERLL 602
>gi|312380117|gb|EFR26201.1| hypothetical protein AND_07843 [Anopheles darlingi]
Length = 2310
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 20/95 (21%)
Query: 637 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG 696
R V+ + + S C LC C+ + + ++ CDQC+R +H+ C LR LP G
Sbjct: 2030 RRVQQYKWQCSECKLCMKCNRKPAAIDSK-MVYCDQCDRGYHLAC------KGLRNLPDG 2082
Query: 697 KWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNA 729
+W C C CS+ + + PE H NA
Sbjct: 2083 RWHCSLCTICSQCGA-----------QTPEGHPNA 2106
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 377 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 436
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 437 AQGLSPVDQ-----RKCERLL 452
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 971 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1030
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 1031 AQGLSPVDQ-----RKCERLL 1046
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 71/185 (38%), Gaps = 42/185 (22%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 379 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 426
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 651
+G W C +C+ + + + E G + + + C R+ K+ G LL
Sbjct: 427 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELL 480
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 710
C CD C +H+ CL L ++P G+W C C +
Sbjct: 481 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 519
Query: 711 LQNLL 715
+Q +L
Sbjct: 520 VQKIL 524
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +E
Sbjct: 709 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPTHNLEK 766
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 767 KKTEGLVKLTPIDKRKCERLL 787
>gi|350400841|ref|XP_003485981.1| PREDICTED: hypothetical protein LOC100740971 [Bombus impatiens]
Length = 2805
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 76/211 (36%), Gaps = 57/211 (27%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 566
C + L + KN + I C CN V PS Q AD
Sbjct: 2588 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2647
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ---NMFERKRFLQHDANAVEAGRVS 623
+L CD C R +H C L +PQG W+C+ C N R+ G S
Sbjct: 2648 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQECAVCVNCGSREP----------GGINS 2697
Query: 624 GVDSVEQITKRCIRIVKNLEAELSG-CLLC-------RGCDF-SKSGFGPR-----TILL 669
+SV Q + KN +S C+ C R C S+ PR ++
Sbjct: 2698 DRNSVAQWQHEYKKGDKNTRVYVSTLCVPCSKLWRKGRYCPHCSRCHTAPRLDLEVNLVH 2757
Query: 670 CDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
C C++ H+GC++ M L + + C
Sbjct: 2758 CSACDKYLHLGCVETKGMP----LDRKNYLC 2784
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2619 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2672
Query: 695 KGKWFC-----CMDC 704
+G+W C C++C
Sbjct: 2673 QGRWHCQECAVCVNC 2687
>gi|407918848|gb|EKG12110.1| Zinc finger PHD-type protein [Macrophomina phaseolina MS6]
Length = 565
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 562 EAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ----NMFERKRFLQHDANAV 617
E D L+ CD C + H CA L +P G+WYC++C + +R+R QH+ +AV
Sbjct: 66 EDFGDEDQLMLCDSCDKLCHVFCAGLDEVPAGEWYCQHCMEDPYTLGQRERERQHNRSAV 125
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 14/64 (21%)
Query: 642 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC- 700
LE + C++C DF ++LCD C++ HV C A L E+P G+W+C
Sbjct: 55 LEPPVDPCMVCE--DFGDED----QLMLCDSCDKLCHVFC------AGLDEVPAGEWYCQ 102
Query: 701 -CMD 703
CM+
Sbjct: 103 HCME 106
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 283 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 319
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVE 618
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH
Sbjct: 1026 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKT 1085
Query: 619 AGRVSGVDSVEQITKRCIRIV 639
A +S VD ++C R++
Sbjct: 1086 AQGLSPVDQ-----RKCERLL 1101
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 705
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 45.8 bits (107), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
GG L+ CD CP+AFH EC L +P+G W C+ C+
Sbjct: 1431 GGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCK 1467
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +
Sbjct: 738 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHNSDK 795
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
++ G+ + I KR C R++
Sbjct: 796 RKIEGLSKLTPIDKRKCERLL 816
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 128 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKG 185
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 186 KTAQGLSPVDQ--RKCERLL 203
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1772 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1807
Score = 40.8 bits (94), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2182 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2220
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 9/80 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 902 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 959
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ VEQ K C R++
Sbjct: 960 KTAQGLFEVEQKIK-CERLL 978
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1397 SEGGSLLCCESCPAAFHRECLNI-EMPQGSWFCNDCR 1432
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +LS P G W C + Q
Sbjct: 1807 GDGGQIVSCKKPGCPKVYHADCLNLSKRPAGRWECPWHQ 1845
>gi|255084682|ref|XP_002504772.1| SNF2 super family [Micromonas sp. RCC299]
gi|226520041|gb|ACO66030.1| SNF2 super family [Micromonas sp. RCC299]
Length = 1710
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 13/96 (13%)
Query: 520 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRA 579
+G P+G +A G++ L CH C E P + D ++ CDGC
Sbjct: 399 AGGPEGQRATLWAAGEQDL----------CHICG-EADPDFWNIEND--CIVMCDGCDVQ 445
Query: 580 FHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 615
H C LS +P+G+W+C+ C + + R DA
Sbjct: 446 VHLSCYGLSEVPEGEWFCQGCIDGIKVGRDRPDDAG 481
>gi|332019339|gb|EGI59845.1| PHD finger protein 10 [Acromyrmex echinatior]
Length = 1472
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 26/103 (25%)
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS------------------------ 559
D T++ C + L + KN + I C CN V PS
Sbjct: 1246 DDTKLKCKMCLKVLNKHNKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKT 1305
Query: 560 --QFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
Q AD +L CD C R +H C L +PQG W+C+ C
Sbjct: 1306 CVQCHDPADEDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 1348
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 636 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 695
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P+
Sbjct: 1287 LDMVPHIQSYAWQCTDCKTCVQCHDPADEDKMLFCDMCDRGYHIYCV------GLRRVPQ 1340
Query: 696 GKWFC-----CMDC 704
G+W C C +C
Sbjct: 1341 GRWHCQECAVCANC 1354
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 44/114 (38%), Gaps = 3/114 (2%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-QNMFERKRFLQHDANAVEAGRV 622
DGG L+ CDGCPRAFH C S L IP G W C C Q ++ ++ +E
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRIQQNLSQPEESRPLEPSAE 361
Query: 623 SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 676
+ ++ C L C F + P T L C C E
Sbjct: 362 TPGPTLSARCGVCGDSTDVLRCAHCAAAFHWRCHFPMAAVRPGTNLRCKSCSAE 415
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 311 DGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCC 347
>gi|432852260|ref|XP_004067159.1| PREDICTED: uncharacterized protein LOC101164387 [Oryzias latipes]
Length = 1310
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQHDANAVEAGRVS 623
GG LL CD CP+ FH C L S P GDW C C++ + + ++ + A A +
Sbjct: 1115 GGELLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDAIQPEVQYNCENERASGANPLH 1174
Query: 624 GVDSVEQITKRCIRIVK 640
G+ + +Q + +I+K
Sbjct: 1175 GLSACDQRARHYYQIIK 1191
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 221 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 266
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1637 SEGGSLLCCEACPAAFHRECLNI-EMPQGSWFCNDCK 1672
Score = 39.7 bits (91), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2048 GDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 2086
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 705
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 47/163 (28%)
Query: 542 KNGLGIICH--CCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYC 597
+ GLG + H C S HA G L+ C CP +H EC LS + Q W+C
Sbjct: 1415 RRGLGALDHNELCQSC-------GHA--GQLILCHDCPIVYHCECLDPPLSKLTQDHWFC 1465
Query: 598 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 657
C V +G DS E++ + E+ +C C
Sbjct: 1466 PLC----------------VMDRTTNGADSEEEMGSN--------DGEIEHEDVCSRCRH 1501
Query: 658 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
++ CD C + FH+ C K LR++PKG W C
Sbjct: 1502 GGE------LICCDTCPKAFHMECCK----PVLRKVPKGHWEC 1534
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 2/37 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 601
GG L+ CD CP+AFH EC L +P+G W C+ C+
Sbjct: 1502 GGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCK 1538
>gi|255086445|ref|XP_002509189.1| predicted protein [Micromonas sp. RCC299]
gi|226524467|gb|ACO70447.1| predicted protein [Micromonas sp. RCC299]
Length = 517
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G+ + CDGCPR H +C L +P GDW+C C
Sbjct: 351 GDFVLCDGCPRGGHYDCLGLDGVPAGDWFCAGC 383
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 232 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 277
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 48.1 bits (113), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A +
Sbjct: 277 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDARSHNSDK 334
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
++ G+ + I KR C R++
Sbjct: 335 RKIEGLSKLTPIDKRKCERLL 355
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 16 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 52
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 64/166 (38%), Gaps = 42/166 (25%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWY 596
+GY+ C C GG ++ CDGCPRA+H C L P+G W
Sbjct: 337 DGYETDHQDYCEVCKQ------------GGEIILCDGCPRAYHLVCLEPPLDQPPEGSWP 384
Query: 597 CKYCQNMFERKRFLQHDANAVEAG-RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-G 654
C C N ++ R + D + +N++ + C C+ G
Sbjct: 385 CPTCVK------------NGIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDG 432
Query: 655 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
D +L+CD C +H+ CL + ++P+G+W C
Sbjct: 433 GD----------LLICDTCPHSYHLNCLN----PPVEKVPEGEWSC 464
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 55/164 (33%), Gaps = 59/164 (35%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 596
+GY+ C C GG ++ CD CPRA+H C L P+G W
Sbjct: 318 DGYETEHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCFDPELEEPPEGKWS 365
Query: 597 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 656
C +C+ G+ E E CR C
Sbjct: 366 CPHCEG--------------------EGIKEQE---------------EDDHMEFCRVC- 389
Query: 657 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
K G +L CD C +HV CL ++ +P G+W C
Sbjct: 390 --KDG---GELLCCDTCPSAYHVHCLN----PPMKMIPDGEWHC 424
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + + IP G+W+C C
Sbjct: 391 DGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRC 427
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 245 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 290
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GG+++ CD CP +H +C L SIP G+W+C C
Sbjct: 186 GGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPAC 219
>gi|403290903|ref|XP_003936546.1| PREDICTED: protein kinase C-binding protein 1 isoform 9 [Saimiri
boliviensis boliviensis]
Length = 1108
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 566 DGGNLLPCDGCPRAFHKE--CASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 618
DGG L+ CDGCPRAFH C L IP G W C C + R LQ +A E
Sbjct: 312 DGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCL----QGRALQDTPHAEE 362
>gi|326432230|gb|EGD77800.1| hypothetical protein PTSG_08890 [Salpingoeca sp. ATCC 50818]
Length = 1086
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 47/110 (42%), Gaps = 12/110 (10%)
Query: 503 WNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLE--------GYKNGLGIICHCC-- 552
+++ +D+ KL + L DG E+ Y+ + + E K G CH
Sbjct: 272 YDLDTEDEIWRKLFNERHRLKDGVEISYHHMSRAIFEFERAAFRGMTKAGGKASCHGLEY 331
Query: 553 --NSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
N+ Q +G ++ CDGC H+ C + IP+G+WYC C
Sbjct: 332 DENTVCDVCQLPDSEEGNEMVFCDGCNLCVHQVCYGIKVIPEGNWYCCAC 381
>gi|296200654|ref|XP_002747672.1| PREDICTED: protein kinase C-binding protein 1 isoform 9 [Callithrix
jacchus]
Length = 1107
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1720 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1755
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2128 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2166
>gi|297259644|ref|XP_002798155.1| PREDICTED: protein kinase C-binding protein 1-like isoform 7
[Macaca mulatta]
Length = 1105
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 10/83 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 15 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKG 72
Query: 622 --VSGVDSVEQITKRCIRIVKNL 642
G+ V+Q ++C R++ L
Sbjct: 73 KTAQGLSPVDQ--RKCERLLLYL 93
>gi|397511403|ref|XP_003826065.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Pan
paniscus]
Length = 1105
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|193784671|dbj|BAG53824.1| unnamed protein product [Homo sapiens]
Length = 1105
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|114682470|ref|XP_001163886.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Pan
troglodytes]
Length = 1105
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
+GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 1597 TEGGSLLCCESCPAAFHRECLNI-EMPKGSWYCNDCK 1632
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2007 GDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2045
>gi|357616541|gb|EHJ70254.1| putative zinc finger protein [Danaus plexippus]
Length = 1432
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 58/148 (39%), Gaps = 39/148 (26%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC---QNMFERKRFLQHDANAVEAG 620
D N+L CD C + H C L+ +P+GDW+C C + +++R L D +
Sbjct: 1098 DPDNMLLCDSCNKGHHLYCLKPKLTKVPEGDWFCDQCKPTEKTPKKRRKLYTDPD----- 1152
Query: 621 RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 680
D+++ ++ C A + C LC G G R C C R FH
Sbjct: 1153 -----DTLDDSSESCS------SAPVELCALC--------GSGGRLAASCRSCGRRFHAE 1193
Query: 681 CLKKHKMADLRELPKGKWFCCMDCSRIN 708
C G+ C DC++ N
Sbjct: 1194 CAPS----------GGRRAVCGDCAKPN 1211
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 647 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
S +L C + P +LLCD C + H+ CLK L ++P+G WFC
Sbjct: 1082 SASVLHASCRLCRRRTDPDNMLLCDSCNKGHHLYCLK----PKLTKVPEGDWFC 1131
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 58/158 (36%), Gaps = 32/158 (20%)
Query: 547 IICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF 604
+C C P G LL CD C ++H C L ++P+ W CK+C +
Sbjct: 1040 TVCEACGQATDP---------GRLLLCDDCDISYHTYCLDPPLQNVPKDSWKCKWCVSCT 1090
Query: 605 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 664
+ A G + + C A LS C +C D+S+
Sbjct: 1091 Q--------CGATTPGLRCEWQNNYTLCAPC--------ASLSTCPICL-VDYSEGTI-- 1131
Query: 665 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 702
I+ C QC+R FH C H D+ + + C M
Sbjct: 1132 --IVQCRQCDRWFHASCQSLHSEEDIEKAADSSFDCTM 1167
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 636 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 695
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 1023 IKITKVVLSKGWRCLECTVCEACGQATDPGRLLLCDDCDISYHTYCLD----PPLQNVPK 1078
Query: 696 GKWFC--CMDCSRINSVLQNL 714
W C C+ C++ + L
Sbjct: 1079 DSWKCKWCVSCTQCGATTPGL 1099
>gi|426392025|ref|XP_004062362.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Gorilla
gorilla gorilla]
Length = 1105
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|268557732|ref|XP_002636856.1| C. briggsae CBR-LET-418 protein [Caenorhabditis briggsae]
Length = 1849
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 55/138 (39%), Gaps = 43/138 (31%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
GG L+ CD CPRA+H C P+GDW C +C ++H ++
Sbjct: 264 GGELVLCDTCPRAYHTGCMD-EDPPEGDWSCPHC---------IEHGPEVIK-------- 305
Query: 627 SVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK 686
E+ TK+ C +C+ + +LLCD C FH C+
Sbjct: 306 --EEPTKQNDDF----------CKICKETE---------NLLLCDSCVCAFHAYCID--- 341
Query: 687 MADLRELPKGKWFCCMDC 704
L ++P+ + + C C
Sbjct: 342 -PPLTQVPQEETWACPRC 358
>gi|119596117|gb|EAW75711.1| protein kinase C binding protein 1, isoform CRA_h [Homo sapiens]
Length = 1105
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKRFLQHDANAVE 618
G +L C+ CPR +H +C L+S P+GDW+C C+ + F++ L+ + E
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQKPVPLEQHPDYAE 124
>gi|340719315|ref|XP_003398100.1| PREDICTED: hypothetical protein LOC100644567 [Bombus terrestris]
Length = 2857
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 566
C + L + KN + I C CN V PS Q AD
Sbjct: 2700 CLKTLNKHSKNEVLIQCGTCNGHVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2759
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+L CD C R +H C L +PQG W+C+ C
Sbjct: 2760 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 2793
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 11/75 (14%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2731 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2784
Query: 695 KGKWFC-----CMDC 704
+G+W C C++C
Sbjct: 2785 QGRWHCQECAVCVNC 2799
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG LL CD CP A+H C +L +IP GDW C C
Sbjct: 300 DGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 241 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 276
>gi|307111603|gb|EFN59837.1| hypothetical protein CHLNCDRAFT_133598 [Chlorella variabilis]
Length = 1305
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 22/37 (59%)
Query: 566 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 602
D G+LL CDGCP +H C L +P G W+C C +
Sbjct: 1191 DQGDLLCCDGCPSVYHPRCCGLGGVPPGRWFCPVCSD 1227
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 82 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 139
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 140 KKTEGLVKLTPIDKRKCERLL 160
>gi|145546835|ref|XP_001459100.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426923|emb|CAK91703.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 573 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 607
CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 880 CDTCPKVFHPKCINLKEVPQGKWNCLNCLKNFERQ 914
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 58/129 (44%), Gaps = 20/129 (15%)
Query: 562 EAHADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 619
+A DGG+LL C+ C AFH C +SS+P+GDW+C C+ K +H + +
Sbjct: 93 DACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSCEQNKNPKP--KHSKSILS- 149
Query: 620 GRVSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ--C 673
S DS++ + C + + N + S C +C G D + +L C C
Sbjct: 150 ---SLFDSLDTLNPSCFTLPEEYLLNNSFKQSFCNVCDGDDSMED------MLHCSHSKC 200
Query: 674 EREFHVGCL 682
H CL
Sbjct: 201 RISVHTYCL 209
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG LL CD CP A+H C +L +IP GDW C C
Sbjct: 442 DGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 383 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 418
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQHDANAVEAGRV 622
+GG LL C+ CP+ FH C +L + P G+W C +C+++ + + D + E ++
Sbjct: 828 NGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSRPEVEYDCDDPSLAEKRKL 887
Query: 623 SGVDSVEQITKR-CIRIV 639
G S+ I +R C RI+
Sbjct: 888 GGAQSMAPIDQRKCERIL 905
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 251 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCC 287
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 47.8 bits (112), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 306 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +P+G WYC C+
Sbjct: 443 SEGGSLLCCESCPAAFHRECLNI-EMPEGSWYCNDCK 478
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPC--DGCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 854 GDGGQLVSCKKSGCPKVYHADCLNLTKRPAGKWECPWHQ 892
>gi|443689527|gb|ELT91900.1| hypothetical protein CAPTEDRAFT_216422 [Capitella teleta]
Length = 1564
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 54/111 (48%), Gaps = 24/111 (21%)
Query: 620 GRVSGVDSVEQ------------ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 667
GRV V++V+ + + C++ K+ AE + C +CR K G + +
Sbjct: 1140 GRVRWVEAVKSCTTWSRLHLLMSVMESCMKWEKS--AENAKCKICR-----KKGEEEK-V 1191
Query: 668 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 718
LLCD C + FH+ CL+ L E+PKG+WFC R V N+ +E
Sbjct: 1192 LLCDDCNQPFHLYCLR----PALYEVPKGEWFCAACAPRTRRVKTNVNYRE 1238
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 60/139 (43%), Gaps = 32/139 (23%)
Query: 570 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE-AG-----R 621
+L CD C + FH C +L +P+G+W+C C R R ++ + N E AG R
Sbjct: 1191 VLLCDDCNQPFHLYCLRPALYEVPKGEWFCAACA---PRTRRVKTNVNYRELAGEENDKR 1247
Query: 622 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 681
+ +S E+ R + IV E C +C G + ++ C C FH+ C
Sbjct: 1248 IVDSNSEEE---REVDIVHEQE-----CTMCGGDE---------GLVNCSTCVCAFHLEC 1290
Query: 682 LKKHKMADLRELPKGKWFC 700
LR +P+ W C
Sbjct: 1291 HD----PPLRHIPRSIWRC 1305
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 305 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 40.4 bits (93), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|346970552|gb|EGY14004.1| origin recognition complex subunit 4 [Verticillium dahliae VdLs.17]
Length = 851
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 573 CDGCPRAFHKECASLSSIPQGDWYCKYC 600
CDGC +A H++C + IP+GDW+CK C
Sbjct: 374 CDGCDKAVHQKCYDVHDIPEGDWFCKEC 401
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 28/37 (75%), Gaps = 1/37 (2%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 601
++GG+LL C+ CP AFH+EC ++ +P+G W+C C+
Sbjct: 1533 SEGGSLLCCESCPAAFHRECLNI-DMPEGSWFCNDCK 1568
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 601
DGG L+ C GCP+ +H EC L+ P G W C + Q
Sbjct: 1941 GDGGQLVSCKKPGCPKVYHAECLKLTRRPAGKWECPWHQ 1979
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 47.8 bits (112), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 12 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 69
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 70 KKTEGLVKLTPIDKRKCERLL 90
>gi|255079372|ref|XP_002503266.1| JmjN/JmjC protein [Micromonas sp. RCC299]
gi|226518532|gb|ACO64524.1| JmjN/JmjC protein [Micromonas sp. RCC299]
Length = 2663
Score = 47.8 bits (112), Expect = 0.024, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 638 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 697
+ + LE + +GC+ C G +S ++LCD C+R +H+ CL L ELP+G
Sbjct: 246 VAQQLEEQPAGCVNCGGTSHEES------MILCDGCDRGYHMYCLS----PPLDELPQGD 295
Query: 698 WFCCMDC 704
WF C DC
Sbjct: 296 WF-CPDC 301
Score = 43.1 bits (100), Expect = 0.56, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 12/72 (16%)
Query: 569 NLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 626
+++ CDGC R +H C S L +PQGDW+C C + +A + G SG
Sbjct: 268 SMILCDGCDRGYHMYCLSPPLDELPQGDWFCPDC---------IAAANDAEDIGFNSGKT 318
Query: 627 -SVEQITKRCIR 637
++EQ + C R
Sbjct: 319 FTIEQFKEECAR 330
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 608
DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 272 GDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAELGRLR 317
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 301 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 725
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|346971621|gb|EGY15073.1| hypothetical protein VDAG_06563 [Verticillium dahliae VdLs.17]
Length = 653
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 604
LL CDGC A+H C L+ +P G WYC C ++F
Sbjct: 134 LLLCDGCDAAYHTHCVGLNHVPAGSWYCLECVDIF 168
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 122 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTC 157
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 47.8 bits (112), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 799 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDATGHSSEK 856
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 857 KKSDGLGRLMPIDKRKCERLL 877
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 302 DGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSC 338
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 802 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 859
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 860 KKTEGLVKLTPIDKRKCERLL 880
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHISEK 857
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|260791426|ref|XP_002590730.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
gi|229275926|gb|EEN46741.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
Length = 1073
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECASL----SSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 621
+GG+LL CD CP AFH +C +P+G+W C C + + D A A
Sbjct: 58 EGGDLLCCDRCPAAFHLQCCDPPLCEEDLPEGEWLCHRCMVLEQFPELDDRDETASNASV 117
Query: 622 VSGVDSVEQITKRCIRIVK 640
S S +Q R +IV+
Sbjct: 118 ASSTASYKQRNLRDKKIVR 136
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 47.8 bits (112), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 18/87 (20%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------ERKRFLQHDAN 615
+GG LL CD CP+ FH C +L S P G+W C C+N E R+ H N
Sbjct: 930 NGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCENTRYA-HSYN 988
Query: 616 AVEAGRVSGVDSVEQITKRCIRIVKNL 642
A G+D +Q K+C ++V +L
Sbjct: 989 A-----QYGLDDYDQ--KKCEKLVLSL 1008
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHISEK 891
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 745 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 802
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 803 KKTEGLVKLTPIDKRKCERLL 823
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 833 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDATGHSSEK 890
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 891 KKSDGLGRLMPIDKRKCERLL 911
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|410898760|ref|XP_003962865.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Takifugu rubripes]
Length = 1329
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
D N+L CDGC R H C L ++PQGDW+C C+
Sbjct: 1058 GDADNMLLCDGCDRGHHTHCLRPRLKAVPQGDWFCPDCR 1096
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 69/164 (42%), Gaps = 38/164 (23%)
Query: 572 PCDGCP------RAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV--S 623
PC G R+ H +LS I QG ER RFL+ + E GRV +
Sbjct: 963 PCSGSTTPQVISRSVHHLAEALSHIEQG----------IER-RFLKPPLDGSEGGRVCKT 1011
Query: 624 GVDSVEQITKRCIRI------VKNLEAEL--SGCLLCRGCDFSKSGFGPRTILLCDQCER 675
++ + + C + + +LE + S +L C + +LLCD C+R
Sbjct: 1012 VLERWRESLQSCTSLSQVFVHLSSLERSVLWSRSILNARCRICRRKGDADNMLLCDGCDR 1071
Query: 676 EFHVGCLKKHKMADLRELPKGKWFCCMDC------SRINSVLQN 713
H CL+ L+ +P+G WF C DC SR+ S Q+
Sbjct: 1072 GHHTHCLRPR----LKAVPQGDWF-CPDCRPKQRSSRLTSRQQH 1110
>gi|302423122|ref|XP_003009391.1| origin recognition complex subunit 4 [Verticillium albo-atrum
VaMs.102]
gi|261352537|gb|EEY14965.1| origin recognition complex subunit 4 [Verticillium albo-atrum
VaMs.102]
Length = 869
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 573 CDGCPRAFHKECASLSSIPQGDWYCKYC 600
CDGC +A H++C + IP+GDW+CK C
Sbjct: 373 CDGCDKAVHQKCYGVHDIPEGDWFCKEC 400
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 18/87 (20%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------ERKRFLQHDAN 615
+GG LL CD CP+ FH C +L S P G+W C C+N E R+ H N
Sbjct: 929 NGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCENTRYA-HSYN 987
Query: 616 AVEAGRVSGVDSVEQITKRCIRIVKNL 642
A G+D +Q K+C ++V +L
Sbjct: 988 A-----QYGLDDYDQ--KKCEKLVLSL 1007
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 800 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 857
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 858 KKTEGLVKLTPIDKRKCERLL 878
>gi|194893051|ref|XP_001977800.1| GG18040 [Drosophila erecta]
gi|190649449|gb|EDV46727.1| GG18040 [Drosophila erecta]
Length = 1982
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 14/79 (17%)
Query: 631 ITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL 690
+ +R + V+N + +GC C C +S P +L C+QC+R +H+ CL L
Sbjct: 1718 MPQRMVGRVRNYNWQCAGCKCCIKC---RSSQRPGKMLYCEQCDRGYHIYCL------GL 1768
Query: 691 RELPKGKWFC-----CMDC 704
R +P G+W C C+ C
Sbjct: 1769 RTVPDGRWSCERCCVCIRC 1787
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 47/80 (58%), Gaps = 10/80 (12%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR-- 621
+GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D + ++ +
Sbjct: 715 NGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--SKPEVEYDCDNLQHSKKG 772
Query: 622 --VSGVDSVEQITKRCIRIV 639
G+ V+Q ++C R++
Sbjct: 773 KTAQGLSPVDQ--RKCERLL 790
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 739 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 796
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 797 KKTEGLVKLTPIDKRKCERLL 817
>gi|47220585|emb|CAG05611.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 47.8 bits (112), Expect = 0.028, Method: Composition-based stats.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
+GG LL CD CP+ +H C LS PQGDW C C+
Sbjct: 10 NGGELLCCDRCPKVYHLSCHLPPLSGFPQGDWVCTLCR 47
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 418 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSC 454
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 834 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 891
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 892 KKTEGLVKLTPIDKRKCERLL 912
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 784 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 841
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 842 KKTEGLVKLTPIDKRKCERLL 862
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 745 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 802
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 803 KKTEGLVKLTPIDKRKCERLL 823
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 196 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 253
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 254 KKTEGLVKLTPIDKRKCERLL 274
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 713 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 770
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 771 KKTEGLVKLTPIDKRKCERLL 791
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 717 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 774
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 775 KKTEGLVKLTPIDKRKCERLL 795
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 159 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 216
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 217 KKTEGLVKLTPIDKRKCERLL 237
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 726 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 783
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 784 KKTEGLVKLTPIDKRKCERLL 804
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANA----VEA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 796 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 853
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 854 KKTEGLVKLTPIDKRKCERLL 874
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 430 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 371 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 406
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 122 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTC 157
>gi|291230097|ref|XP_002735005.1| PREDICTED: tripartite motif-containing 28 protein-like
[Saccoglossus kowalevskii]
Length = 995
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 623
+GG+LL CD CP+ FH +C SL++ P+ W C CQ++ + + + + + S
Sbjct: 801 NGGDLLCCDTCPKVFHLQCHIPSLTATPKETWICGLCQDLCKEIQGISENDEHGKRKASS 860
Query: 624 GVDSVEQITKRCIRIVKNL 642
G+ Q K C RI+ L
Sbjct: 861 GLSEAHQ--KICERILLEL 877
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 69/185 (37%), Gaps = 41/185 (22%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 243 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 290
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 651
+G W C +C+ + + + E G G E R+ K+ G LL
Sbjct: 291 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELL 345
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 710
C CD C +H+ CL L ++P G+W C C +
Sbjct: 346 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 384
Query: 711 LQNLL 715
+Q +L
Sbjct: 385 VQKIL 389
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 436 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 377 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 412
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 436 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 377 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 412
>gi|432937609|ref|XP_004082462.1| PREDICTED: LOW QUALITY PROTEIN: bromodomain adjacent to zinc finger
domain protein 1A-like [Oryzias latipes]
Length = 1475
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 601
D N+L CDGC R H C L S+P+GDW+C C+
Sbjct: 1147 GDADNMLLCDGCDRGHHTHCLRPRLKSVPEGDWFCPDCR 1185
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 21/37 (56%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 295 DGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCC 331
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 451 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 487
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 392 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 427
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 440 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 381 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
Length = 690
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAG 620
GG L+ C+ CP AFH EC S IP+G +YCK C E K L D V+ G
Sbjct: 154 GGTLICCESCPAAFHPECISYEGIPEGRFYCKDC---VEGKSLLYGDIVWVKLG 204
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 565 ADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYC--KYCQNMFERKRFL 610
DGG L+ CD GC + +H +C +L PQG W C +C N +R L
Sbjct: 560 GDGGQLIMCDRSGCLKCYHVDCLNLDKKPQGRWQCPWHFCDNCGKRATVL 609
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 446 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 387 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 422
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 69/185 (37%), Gaps = 41/185 (22%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKEC--ASLSSIP 591
G++ ++GY+ C C GG ++ CD CPRA+H C L P
Sbjct: 243 GEEEVDGYETDHQDYCEVCQQ------------GGEIILCDTCPRAYHLVCLDPELDRAP 290
Query: 592 QGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLL 651
+G W C +C+ + + + E G G E R+ K+ G LL
Sbjct: 291 EGKWSCPHCEKEGVQWEAKEEEEEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELL 345
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSV 710
C CD C +H+ CL L ++P G+W C C +
Sbjct: 346 C-----------------CDACISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGR 384
Query: 711 LQNLL 715
+Q +L
Sbjct: 385 VQKIL 389
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 47.4 bits (111), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 14/55 (25%)
Query: 548 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
CH C DGG LL CD CP ++H C + L IP+GDW C C
Sbjct: 2 FCHVC------------KDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRC 44
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
GGNLL CD CPR +H +C + L IP G W+C C
Sbjct: 129 GGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTC 164
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 440 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 381 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 416
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 432 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 373 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 408
>gi|145259007|ref|XP_001402242.1| PHD finger domain protein [Aspergillus niger CBS 513.88]
gi|134074859|emb|CAK38971.1| unnamed protein product [Aspergillus niger]
gi|350631896|gb|EHA20265.1| hypothetical protein ASPNIDRAFT_51426 [Aspergillus niger ATCC 1015]
Length = 848
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 62/165 (37%), Gaps = 48/165 (29%)
Query: 548 ICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ-- 601
CH CN G LL CDGCP +FH C + ++ P+GDW+C C
Sbjct: 464 FCHNCNG------------SGQLLCCDGCPNSFHFSCLNPPLDPANPPEGDWFCPKCSLS 511
Query: 602 ----------NMFERKRFL-----QHDANAVEAGRVSGVDSVEQITKRCIR--------- 637
+ +K FL + V G D V + K R
Sbjct: 512 KPMTSLLGALDNAPQKDFLLPLGVRDYFTGVATGDEDKYDEVVPLPKLYPRRGNRHGRYD 571
Query: 638 ---IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 679
+++ +A+ +LC C + +G P I+ CD C FH+
Sbjct: 572 DPYLLRTQDAK-GNVILCAACGRTSNGRQP--IIQCDYCPCAFHM 613
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 445 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 41.2 bits (95), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 386 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 421
>gi|222637620|gb|EEE67752.1| hypothetical protein OsJ_25457 [Oryza sativa Japonica Group]
Length = 1646
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 438 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 472
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 44/95 (46%), Gaps = 16/95 (16%)
Query: 636 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 695
I+I K + ++ CL C C+ P +LLCD C+ +H CL L+ +PK
Sbjct: 767 IKITKVVLSKGWRCLECTVCEACGKATDPGRLLLCDDCDISYHTYCLD----PPLQTVPK 822
Query: 696 GKWFC-----------CMDCSR-INSVLQNLLVQE 718
G W C C C R +++V QNL +E
Sbjct: 823 GGWKCKCYREEDLILQCRQCDRWMHAVCQNLSTEE 857
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 37/58 (63%), Gaps = 4/58 (6%)
Query: 565 ADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRV 622
++GG+LL C+ CP AFH+EC ++ +PQG W+C C+ +R RF D V+ GR
Sbjct: 214 SEGGSLLCCESCPAAFHQECLNM-EMPQGSWFCNDCK-AGKRPRF--KDILWVKWGRF 267
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 315 DGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCC 351
Score = 39.3 bits (90), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 6/48 (12%)
Query: 667 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 714
++ CD C R FH+ CL LRE+P G W C C V QNL
Sbjct: 319 LICCDGCPRAFHLACLSP----PLREIPSGLWRC--SCCLQGRVQQNL 360
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 13 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 49
>gi|301614223|ref|XP_002936596.1| PREDICTED: hypothetical protein LOC100485119 [Xenopus (Silurana)
tropicalis]
Length = 1043
Score = 47.4 bits (111), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 43/88 (48%), Gaps = 19/88 (21%)
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSRINSVLQNLLV-- 716
P ILLCD C+ +H CL+ M +P G+WFC C C +++ LQNL V
Sbjct: 834 PELILLCDSCDSGYHTACLRPPLML----IPDGEWFCPPCQHKLLCEKLDEQLQNLDVVL 889
Query: 717 ---QEAEKLPEFHLNAIKKYAGNSLETV 741
+ AE+ E + Y G SLE +
Sbjct: 890 KKKERAERRKERLV-----YVGISLENI 912
>gi|50510226|dbj|BAD31424.1| PHD finger transcription factor-like protein [Oryza sativa Japonica
Group]
Length = 1696
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|115473879|ref|NP_001060538.1| Os07g0661500 [Oryza sativa Japonica Group]
gi|113612074|dbj|BAF22452.1| Os07g0661500 [Oryza sativa Japonica Group]
Length = 1752
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|125559491|gb|EAZ05027.1| hypothetical protein OsI_27209 [Oryza sativa Indica Group]
Length = 1696
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 568 GNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 600
GNLL CDGCP AFH +C + +P+G+W+C C
Sbjct: 453 GNLLCCDGCPAAFHSKCVGVVEDLLPEGNWFCPEC 487
>gi|312371268|gb|EFR19500.1| hypothetical protein AND_22323 [Anopheles darlingi]
Length = 1628
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 31/61 (50%), Gaps = 11/61 (18%)
Query: 644 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 703
AE C++CR P LLCD+C R H+ CLK L+E+P G WF CM
Sbjct: 1168 AERISCMICR------RKVDPDLTLLCDECNRACHIYCLKPK----LKEVPAGDWF-CMK 1216
Query: 704 C 704
C
Sbjct: 1217 C 1217
>gi|383417149|gb|AFH31788.1| protein kinase C-binding protein 1 isoform c [Macaca mulatta]
Length = 1135
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296200646|ref|XP_002747668.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Callithrix
jacchus]
Length = 1188
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|86143432|gb|ABC86688.1| RACK7 isoform i [Homo sapiens]
Length = 1088
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|326912468|ref|XP_003202572.1| PREDICTED: PHD finger protein 21B-like, partial [Meleagris
gallopavo]
Length = 445
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 37/73 (50%), Gaps = 9/73 (12%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 624
G NL PC CPRA+H C L + P+G W C CQ ++ L+ D N G ++
Sbjct: 273 GVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQ-----QKVLKKDDNVPWTGTLAI 327
Query: 625 VDSVEQITKRCIR 637
V S +T + ++
Sbjct: 328 VHSY--VTHKTVK 338
>gi|307177781|gb|EFN66778.1| Supporter of activation of yellow protein [Camponotus floridanus]
Length = 3066
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 36/94 (38%), Gaps = 26/94 (27%)
Query: 533 CGQKLLEGYKNGLGIICHCCNSEVSPS--------------------------QFEAHAD 566
C + L + KN + I C CN V PS Q AD
Sbjct: 2848 CLKVLNKHNKNEILIQCGTCNGNVHPSCIDLTLDMVPHIQSYAWQCTDCKTCAQCHDPAD 2907
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
+L CD C R +H C L +PQG W+C+ C
Sbjct: 2908 EDKMLFCDMCDRGYHIYCVGLRRVPQGRWHCQEC 2941
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 11/75 (14%)
Query: 635 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 694
+ +V ++++ C C+ C +L CD C+R +H+ C+ LR +P
Sbjct: 2879 TLDMVPHIQSYAWQCTDCKTCAQCHDPADEDKMLFCDMCDRGYHIYCVG------LRRVP 2932
Query: 695 KGKWFC-----CMDC 704
+G+W C C +C
Sbjct: 2933 QGRWHCQECAVCANC 2947
>gi|164423528|ref|XP_962530.2| hypothetical protein NCU08317 [Neurospora crassa OR74A]
gi|157070133|gb|EAA33294.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 961
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G ++ CDGC +A H++C + +P+GDW+C+ C
Sbjct: 406 GNQIVFCDGCDKAIHQKCYGIPRLPKGDWFCREC 439
>gi|426392027|ref|XP_004062363.1| PREDICTED: protein kinase C-binding protein 1 isoform 6 [Gorilla
gorilla gorilla]
Length = 1186
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|380786007|gb|AFE64879.1| protein kinase C-binding protein 1 isoform c [Macaca mulatta]
Length = 1135
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|297259646|ref|XP_002798156.1| PREDICTED: protein kinase C-binding protein 1-like isoform 8
[Macaca mulatta]
Length = 1054
Score = 47.4 bits (111), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 794 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 851
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 852 RKTEGLVKLTPIDKRKCERLL 872
>gi|384487310|gb|EIE79490.1| hypothetical protein RO3G_04195 [Rhizopus delemar RA 99-880]
Length = 339
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYC-KYCQNMFERKRFL 610
L+ CDGCP+AFH+EC L P WYC C + +RKR +
Sbjct: 272 LIFCDGCPKAFHQECKELDKQPDTPWYCTDTCCDNLKRKRVV 313
>gi|383417151|gb|AFH31789.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
gi|384939240|gb|AFI33225.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
Length = 1160
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|345790163|ref|XP_003433333.1| PREDICTED: protein kinase C-binding protein 1 [Canis lupus
familiaris]
Length = 1094
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|297259642|ref|XP_002798154.1| PREDICTED: protein kinase C-binding protein 1-like isoform 6
[Macaca mulatta]
Length = 1135
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|73992168|ref|XP_866849.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Canis
lupus familiaris]
Length = 1141
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|403290905|ref|XP_003936547.1| PREDICTED: protein kinase C-binding protein 1 isoform 10 [Saimiri
boliviensis boliviensis]
Length = 1189
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|297259640|ref|XP_002798153.1| PREDICTED: protein kinase C-binding protein 1-like isoform 5
[Macaca mulatta]
Length = 1160
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 829 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 886
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 887 RKTEGLVKLTPIDKRKCERLL 907
>gi|86143624|gb|ABC86691.1| RACK7 isoform l [Homo sapiens]
Length = 1134
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|410255052|gb|JAA15493.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410341077|gb|JAA39485.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1160
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|363745215|ref|XP_003643224.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Gallus gallus]
Length = 137
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 619
DGG L+ CDGCPRAFH C L +P G W C C R R A + A
Sbjct: 10 DGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLREADTAAEQLPA 65
>gi|444706940|gb|ELW48255.1| Protein kinase C-binding protein 1 [Tupaia chinensis]
Length = 997
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 47 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 82
>gi|34335264|ref|NP_898869.1| protein kinase C-binding protein 1 isoform c [Homo sapiens]
gi|86143420|gb|ABC86682.1| RACK7 isoform c [Homo sapiens]
gi|119596121|gb|EAW75715.1| protein kinase C binding protein 1, isoform CRA_l [Homo sapiens]
Length = 1135
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|410216320|gb|JAA05379.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410306756|gb|JAA31978.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1160
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|403290889|ref|XP_003936539.1| PREDICTED: protein kinase C-binding protein 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 1163
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|380786053|gb|AFE64902.1| protein kinase C-binding protein 1 isoform b [Macaca mulatta]
Length = 1160
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|345790161|ref|XP_003433332.1| PREDICTED: protein kinase C-binding protein 1 [Canis lupus
familiaris]
Length = 1137
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|332858654|ref|XP_003317032.1| PREDICTED: protein kinase C-binding protein 1 [Pan troglodytes]
Length = 1186
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|168014304|ref|XP_001759692.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689231|gb|EDQ75604.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 776
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 12/74 (16%)
Query: 545 LGIICHCCNSEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLS-SIPQGDWYCKYCQNM 603
L ++C C + D G LL CDGC A H C L S+P+GDW+C C
Sbjct: 234 LDVVCSVC---------QEAGDEGVLLLCDGCDSAAHTYCVGLGLSVPRGDWFCNACS-- 282
Query: 604 FERKRFLQHDANAV 617
E + F D + V
Sbjct: 283 IEHRGFSTDDDDEV 296
>gi|6329749|dbj|BAA86439.1| KIAA1125 protein [Homo sapiens]
Length = 1205
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 117 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 152
>gi|452822553|gb|EME29571.1| NuA3 HAT complex component NTO1 [Galdieria sulphuraria]
Length = 1342
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 23/34 (67%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G +++ CDGC A H+ C + SIP+GDW+C C
Sbjct: 939 GNDIILCDGCHVAVHQTCYGVRSIPEGDWFCSSC 972
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 902 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 959
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 960 KKTEGLVKLTPIDKRKCERLL 980
>gi|219519075|gb|AAI44290.1| ZMYND8 protein [Homo sapiens]
gi|223460518|gb|AAI36609.1| ZMYND8 protein [Homo sapiens]
Length = 1054
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|119596111|gb|EAW75705.1| protein kinase C binding protein 1, isoform CRA_c [Homo sapiens]
Length = 1200
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|86143426|gb|ABC86685.1| RACK7 isoform f [Homo sapiens]
Length = 1181
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|34335266|ref|NP_036540.3| protein kinase C-binding protein 1 isoform b [Homo sapiens]
gi|86143418|gb|ABC86681.1| RACK7 isoform b [Homo sapiens]
gi|119596112|gb|EAW75706.1| protein kinase C binding protein 1, isoform CRA_d [Homo sapiens]
Length = 1160
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|44917000|dbj|BAD12142.1| unichrom [Hemicentrotus pulcherrimus]
Length = 1637
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1022 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1054
>gi|397511405|ref|XP_003826066.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Pan
paniscus]
Length = 1186
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|338719339|ref|XP_003363990.1| PREDICTED: LOW QUALITY PROTEIN: protein kinase C-binding protein
1-like [Equus caballus]
Length = 1186
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|86143622|gb|ABC86690.1| RACK7 isoform k [Homo sapiens]
Length = 1206
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|403290893|ref|XP_003936541.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 1209
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|321460287|gb|EFX71331.1| hypothetical protein DAPPUDRAFT_308933 [Daphnia pulex]
Length = 501
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 45/108 (41%), Gaps = 11/108 (10%)
Query: 596 YCKYCQNMFERKRFLQHDANAV---EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLC 652
YC +C + H V + GR SG S Q T I VK + C+ C
Sbjct: 392 YCDFCLGDATENKKSGHPEELVSCADCGR-SGHPSCLQFTANMIISVKQYRWQ---CIEC 447
Query: 653 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
+ C + +L CD C+R +H+ CLK L E P+G W C
Sbjct: 448 KCCSLCGNSDNDEQLLFCDDCDRGYHMYCLK----PPLSEPPEGSWSC 491
>gi|73992170|ref|XP_866862.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Canis
lupus familiaris]
Length = 1166
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|73992186|ref|XP_866949.1| PREDICTED: protein kinase C-binding protein 1 isoform 10 [Canis
lupus familiaris]
Length = 1209
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 1005 NGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDCDAPSHNSEK 1062
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 1063 KKTEGLVKLTPIDKRKCERLL 1083
>gi|345790157|ref|XP_866912.2| PREDICTED: protein kinase C-binding protein 1 isoform 8 [Canis
lupus familiaris]
Length = 1184
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296480966|tpg|DAA23081.1| TPA: zinc finger, MYND-type containing 8-like isoform 3 [Bos
taurus]
Length = 1140
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|384946172|gb|AFI36691.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1163
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|297259636|ref|XP_002798151.1| PREDICTED: protein kinase C-binding protein 1-like isoform 3
[Macaca mulatta]
Length = 1188
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 107 DGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|395829157|ref|XP_003787727.1| PREDICTED: protein kinase C-binding protein 1 isoform 2 [Otolemur
garnettii]
Length = 1053
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296200638|ref|XP_002747664.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Callithrix
jacchus]
Length = 1243
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|219112141|ref|XP_002177822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410707|gb|EEC50636.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 462
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 565 ADGGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 602
DGG+LL CDGC +H +C SL+ IP+G W C C N
Sbjct: 353 GDGGSLLICDGCEGEYHMDCVQPSLAEIPEGHWECDDCVN 392
>gi|119596116|gb|EAW75710.1| protein kinase C binding protein 1, isoform CRA_g [Homo sapiens]
Length = 1187
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 117 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 152
>gi|56203004|emb|CAI23168.1| protein kinase C binding protein 1 [Homo sapiens]
Length = 1115
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 73 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 108
>gi|300794091|ref|NP_001178100.1| protein kinase C-binding protein 1 [Bos taurus]
Length = 1193
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|45946211|gb|AAH30721.2| ZMYND8 protein [Homo sapiens]
Length = 1094
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 24 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 59
>gi|301783261|ref|XP_002927044.1| PREDICTED: protein kinase C-binding protein 1-like isoform 2
[Ailuropoda melanoleuca]
Length = 1165
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|119596119|gb|EAW75713.1| protein kinase C binding protein 1, isoform CRA_j [Homo sapiens]
Length = 1100
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 30 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 65
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV----EA 619
+GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D +A E
Sbjct: 728 NGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDCDAPIHNSEK 785
Query: 620 GRVSGVDSVEQITKR-CIRIV 639
+ G+ + I KR C R++
Sbjct: 786 RKTEGLVKLTPIDKRKCERLL 806
>gi|34335262|ref|NP_898868.1| protein kinase C-binding protein 1 isoform a [Homo sapiens]
gi|86143160|gb|ABC86680.1| RACK7 isoform a [Homo sapiens]
gi|119596118|gb|EAW75712.1| protein kinase C binding protein 1, isoform CRA_i [Homo sapiens]
Length = 1188
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|47195997|emb|CAF91487.1| unnamed protein product [Tetraodon nigroviridis]
Length = 138
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 566 DGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 602
+ G L PC CPRAFH C L + P+G WYC CQ
Sbjct: 97 EDGELQPCRSCPRAFHPSCLHPPLKTPPRGPWYCPKCQK 135
>gi|410255050|gb|JAA15492.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410341075|gb|JAA39484.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1188
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|397511401|ref|XP_003826064.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Pan
paniscus]
Length = 1241
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|348563925|ref|XP_003467757.1| PREDICTED: protein kinase C-binding protein 1-like isoform 2 [Cavia
porcellus]
Length = 1137
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|301783259|ref|XP_002927043.1| PREDICTED: protein kinase C-binding protein 1-like isoform 1
[Ailuropoda melanoleuca]
Length = 1140
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|296480965|tpg|DAA23080.1| TPA: zinc finger, MYND-type containing 8-like isoform 2 [Bos
taurus]
Length = 1165
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|115696716|ref|XP_783470.2| PREDICTED: uncharacterized protein LOC578189 isoform 2
[Strongylocentrotus purpuratus]
gi|390342402|ref|XP_003725656.1| PREDICTED: uncharacterized protein LOC578189 isoform 1
[Strongylocentrotus purpuratus]
Length = 1640
Score = 47.0 bits (110), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%), Gaps = 4/37 (10%)
Query: 664 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 700
PR ILLCD+C+ FH CL+ MA +P G WFC
Sbjct: 1025 PRWILLCDKCDSGFHTACLRPPLMA----IPDGNWFC 1057
>gi|426392017|ref|XP_004062358.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Gorilla
gorilla gorilla]
Length = 1188
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|25453223|sp|Q9ULU4.2|PKCB1_HUMAN RecName: Full=Protein kinase C-binding protein 1; AltName:
Full=Cutaneous T-cell lymphoma-associated antigen
se14-3; Short=CTCL-associated antigen se14-3; AltName:
Full=Rack7; AltName: Full=Zinc finger MYND
domain-containing protein 8
gi|56203005|emb|CAI23169.1| protein kinase C binding protein 1 [Homo sapiens]
gi|119596110|gb|EAW75704.1| protein kinase C binding protein 1, isoform CRA_b [Homo sapiens]
gi|168269692|dbj|BAG09973.1| protein kinase C-binding protein 1 [synthetic construct]
Length = 1186
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 98 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 133
>gi|384939242|gb|AFI33226.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1188
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|348563927|ref|XP_003467758.1| PREDICTED: protein kinase C-binding protein 1-like isoform 3 [Cavia
porcellus]
Length = 1162
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|297259632|ref|XP_002798149.1| PREDICTED: protein kinase C-binding protein 1-like isoform 1
[Macaca mulatta]
Length = 1241
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|221040998|dbj|BAH12176.1| unnamed protein product [Homo sapiens]
Length = 1241
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|86143620|gb|ABC86689.1| RACK7 isoform j [Homo sapiens]
Length = 1136
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|410216318|gb|JAA05378.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410306752|gb|JAA31976.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
gi|410306754|gb|JAA31977.1| zinc finger, MYND-type containing 8 [Pan troglodytes]
Length = 1188
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|403290895|ref|XP_003936542.1| PREDICTED: protein kinase C-binding protein 1 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 1109
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|114682428|ref|XP_001165143.1| PREDICTED: protein kinase C-binding protein 1 isoform 36 [Pan
troglodytes]
Length = 1188
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|86143422|gb|ABC86683.1| RACK7 isoform d [Homo sapiens]
Length = 1163
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|410953586|ref|XP_003983451.1| PREDICTED: protein kinase C-binding protein 1 isoform 2 [Felis
catus]
Length = 1154
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|384946170|gb|AFI36690.1| protein kinase C-binding protein 1 isoform a [Macaca mulatta]
Length = 1186
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|355563056|gb|EHH19618.1| hypothetical protein EGK_02318, partial [Macaca mulatta]
Length = 1231
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 114 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 149
>gi|340521998|gb|EGR52231.1| predicted protein [Trichoderma reesei QM6a]
Length = 641
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%)
Query: 570 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
LL CDGC A+H C L IP+GDW+C C ++
Sbjct: 138 LLLCDGCDAAYHTHCIGLDYIPEGDWFCMECAHL 171
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 20/139 (14%)
Query: 574 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG--VDSVEQI 631
DGC H C + S Q C C+N F R N V+ VS V+ +Q+
Sbjct: 50 DGCDHIIHDAC--IRSWAQKTNTCPICRNPFHTVRVY----NGVDGIAVSTYEVEDKKQV 103
Query: 632 TKRCIR--IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMAD 689
+ ++ + +N E E C C+ ++ +LLCD C+ +H C+
Sbjct: 104 AEFDVQQWLGENPEEEEEESNPCPICNSAERE---DILLLCDGCDAAYHTHCIG------ 154
Query: 690 LRELPKGKWFCCMDCSRIN 708
L +P+G WF CM+C+ +N
Sbjct: 155 LDYIPEGDWF-CMECAHLN 172
>gi|332858652|ref|XP_001164593.2| PREDICTED: protein kinase C-binding protein 1 isoform 22 [Pan
troglodytes]
Length = 1241
Score = 47.0 bits (110), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 566 DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 600
DGG LL CD CP A+H C + L +IP GDW C C
Sbjct: 446 DGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 567 GGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 600
GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 387 GGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHC 422
>gi|410953590|ref|XP_003983453.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Felis
catus]
Length = 1173
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 93 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 128
>gi|403290887|ref|XP_003936538.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 1191
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|336470944|gb|EGO59105.1| hypothetical protein NEUTE1DRAFT_60169 [Neurospora tetrasperma FGSC
2508]
gi|350292016|gb|EGZ73211.1| hypothetical protein NEUTE2DRAFT_107503 [Neurospora tetrasperma
FGSC 2509]
Length = 962
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 567 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
G ++ CDGC +A H++C + +P+GDW+C+ C
Sbjct: 406 GNQIVFCDGCDKAVHQKCYGIPRLPKGDWFCREC 439
>gi|322700900|gb|EFY92652.1| origin recognition complex subunit Orc4, putative [Metarhizium
acridum CQMa 102]
Length = 754
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 11/57 (19%)
Query: 546 GIICHCCN--SEVSPSQFEAHADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 600
GI+C C V+P++ ++ CD C A H++C ++ IP+GDW CK C
Sbjct: 237 GIVCEICYKPDSVAPNE---------IILCDNCDFAVHQQCYDIAEIPEGDWLCKSC 284
>gi|291409975|ref|XP_002721267.1| PREDICTED: zinc finger, MYND-type containing 8 [Oryctolagus
cuniculus]
Length = 1137
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 94 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 129
>gi|426392023|ref|XP_004062361.1| PREDICTED: protein kinase C-binding protein 1 isoform 4 [Gorilla
gorilla gorilla]
Length = 1241
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|86143424|gb|ABC86684.1| RACK7 isoform e [Homo sapiens]
Length = 1234
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|403290901|ref|XP_003936545.1| PREDICTED: protein kinase C-binding protein 1 isoform 8 [Saimiri
boliviensis boliviensis]
Length = 1244
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 125 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 160
>gi|355784415|gb|EHH65266.1| hypothetical protein EGM_02000, partial [Macaca fascicularis]
Length = 1231
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 114 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 149
>gi|68137402|gb|AAY85630.1| transcriptional repressor BSR/RACK7/PRKCBP1 isoform m [Homo
sapiens]
Length = 1234
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|348532584|ref|XP_003453786.1| PREDICTED: remodeling and spacing factor 1-like [Oreochromis
niloticus]
Length = 1596
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 33/68 (48%), Gaps = 12/68 (17%)
Query: 652 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD---CSR 706
C+ C S P ILLCD C+ +H CL+ M +P G+WFC C C +
Sbjct: 1055 CKHCGLSNH---PELILLCDSCDSGYHTACLRPPLMI----IPDGEWFCPPCQHKQLCDK 1107
Query: 707 INSVLQNL 714
+ LQNL
Sbjct: 1108 LEEQLQNL 1115
>gi|68137404|gb|AAY85631.1| transcriptional repressor BSR/RACK7/PRKCBP1 isoform o [Homo
sapiens]
Length = 1107
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 10/97 (10%)
Query: 618 EAGRVSGVDSVEQITKRCIRIVKNLEAELSG----CLLCRGCDFSKSGFGPRTILLCDQC 673
EA VS + + C+ + + + + G CL C C+ +G +LLCD+C
Sbjct: 454 EASMVSCANCSQTYHTYCVTLHDKMNSAILGRGWRCLDCTICEGCGNGGDEEKLLLCDEC 513
Query: 674 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRIN 708
+ +HV C+K L +P G W C C C R N
Sbjct: 514 DVSYHVYCMK----PPLESVPSGPWRCHWCSRCRRCN 546
>gi|410953584|ref|XP_003983450.1| PREDICTED: protein kinase C-binding protein 1 isoform 1 [Felis
catus]
Length = 1198
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 568 GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 603
G +L C+ CPR +H +C L+S P+GDW+C C+ +
Sbjct: 118 GQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECEKI 153
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,582,569,049
Number of Sequences: 23463169
Number of extensions: 520632765
Number of successful extensions: 1384619
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1027
Number of HSP's successfully gapped in prelim test: 3347
Number of HSP's that attempted gapping in prelim test: 1368112
Number of HSP's gapped (non-prelim): 14979
length of query: 846
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 695
effective length of database: 8,816,256,848
effective search space: 6127298509360
effective search space used: 6127298509360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)