BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002195
(954 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 929 bits (2401), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 522/992 (52%), Positives = 658/992 (66%), Gaps = 115/992 (11%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P ++
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEP-----TTSK 387
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S S + R ++K V+K+ SKS ++ +RSP
Sbjct: 388 SSGSASVSISSRYKRK--------WVTKAKSKSSEYTSI--SRSP--------------R 423
Query: 456 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
S P+ K+ + + RK KS K L+ K ++AS S K++W IT KDQRLHKL
Sbjct: 424 SAPMRIPSKNKSALKMRK---KSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKL 480
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------- 566
VF+E GLPDGTEV Y+A GQKLL+GYK G GI+C CCN VSPSQFE HAG
Sbjct: 481 VFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYA 540
Query: 567 --------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSS 606
R+Y KDNDDLC IC DGGNLL CDGCPRAFHKECASLSS
Sbjct: 541 YIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSS 600
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
P+GDWYCK+CQNMF+R++F++H+ NAV AGRV GVD +EQITKRCIRIV+N+E +LSGC
Sbjct: 601 TPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGC 660
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGCLK HKMA L+ELP+GKWFC + C+RI+S
Sbjct: 661 VLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHS 720
Query: 727 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSGKAATPETRLLLSQAV 785
LQ LL++ EKLP L A+ + G + + D+DV WRL+SGK A+PETRLLLS+A+
Sbjct: 721 ALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPETRLLLSEAI 780
Query: 786 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
AIFHD FDPIVD SGRDLIP+MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ+
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQD 840
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
+AELPLVATS NHGKGYFQ LF+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I
Sbjct: 841 IAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIK 900
Query: 906 PELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 901 PDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 932
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101203549 [Cucumis sativus]
Length = 946
Score = 921 bits (2381), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/1001 (52%), Positives = 659/1001 (65%), Gaps = 124/1001 (12%)
Query: 1 MKRELDY------ELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIV 54
MKREL + +L G+LD T ++ L +A S + + CKRFK + VNG IV
Sbjct: 1 MKRELAFALEVQSQLEGTLDHTRSEILAEAR-PGSSYLDETARSGGCKRFKGSVVNGLIV 59
Query: 55 YSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQ 114
Y+RV++S+ + LL D+ K+ +S +GR VL ES EE Q+
Sbjct: 60 YTRVRKSQINVYSGLL-DNGNRKKCDST--DGR------EVLGSFAPEESCRTEEVQI-- 108
Query: 115 MTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLK----------- 163
K + +CK+E VE KE G+E S+ + K +K
Sbjct: 109 ------------QKTSSVCKKESDEVVENSGNKEEGAEGSSLVIAKDIKVEGNLPGWEIK 156
Query: 164 --RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISL 221
S++ PKVEP+++ E +S + E ++L++PK LELKMSKKI+L
Sbjct: 157 RFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGGETSETVNSLSTPKNKLELKMSKKIAL 216
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK+PMTV ELFETGLL+GV V+YMG K GLRG I+D GILC+CS CNGCRVIPPS+
Sbjct: 217 NKRPMTVRELFETGLLEGVPVIYMGVKKADDFGLRGTIKDSGILCTCSSCNGCRVIPPSQ 276
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FEIHAC QY+RA+QYIC ENGKSLL++L+AC+ L+AT+QS +SS PEEK F C
Sbjct: 277 FEIHACNQYKRAAQYICLENGKSLLDLLKACKG-SRQTLEATVQSLISSSPEEKHFTCRD 335
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTY----TTGI--RIRASGPKLVSRSSEN 395
CKG FP + VG+ GP LC SC +SK+ + +T T+GI R+R + P ++
Sbjct: 336 CKGCFP-SSVGQVGP--LCPSCEESKRSKWMLTLPAPPTSGIGKRLRLAEP-----TTSK 387
Query: 396 DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIAN 455
S S + R ++K V+K+ SKS ++ +RSP
Sbjct: 388 SSGSASVSISSRYKRK--------WVTKAKSKSSEYTSI--SRSP--------------R 423
Query: 456 STPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
S P+ K+ + + RK KS K L+ K ++AS S K++W IT KDQRLHKL
Sbjct: 424 SAPMRIPSKNKSALKMRK---KSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKL 480
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------- 566
VF+E GLPDGTEV Y+A GQKLL+GYK G GI+C CCN VSPSQFE HAG
Sbjct: 481 VFEEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYA 540
Query: 567 --------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSS 606
R+Y KDNDDLC IC DGGNLL CDGCPRAFHKECASLSS
Sbjct: 541 YIYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSS 600
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
IP+GDWYCK+CQNMF+R++F++H+ NAV AGRV GVD +EQITKRCIRIV+N+E +LSGC
Sbjct: 601 IPRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGC 660
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
+LCRG DFSKSGFGPRTI+LCDQCE+EFHVGCLK HKMA L+ELP+GKWFC + C+RI+S
Sbjct: 661 VLCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHS 720
Query: 727 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRLLSGKAATPETRLLLSQAV 785
LQ LL++ EKLP L A+ + G + + D+DV WRL+SGK A+PETRLLLS+A+
Sbjct: 721 ALQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGKIASPETRLLLSEAI 780
Query: 786 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ- 844
AIFHD FDPIVD SGRDLIP+MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQY 840
Query: 845 --------EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 896
++AELPLVATS NHGKGYFQ LF+CIE+LL+FL+VK +VLPAAEEAESIWT
Sbjct: 841 CRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWT 900
Query: 897 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
+KFGF++I P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 901 EKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 941
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 890 bits (2300), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/764 (59%), Positives = 551/764 (72%), Gaps = 64/764 (8%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCAFCNGR 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
++F C RCKG FP CVGK GP LCN C +SK+ T+T G ++SR +
Sbjct: 121 RTFTCKRCKGIFPSICVGKIGP--LCNLCAESKESHPTLTI---------GSSIISRYCQ 169
Query: 395 N--DSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGL 452
N + IS+ N + + LL + + L+ S L
Sbjct: 170 NLPSLILISWIINLKTITSGQFLL------------------MLAHCSFRLSFLSPEQVL 211
Query: 453 IANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWN-ITPKDQR 511
S+ SQ RK +K ++ LI+KP + AS LS P K ++ I+P+DQR
Sbjct: 212 ALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVASVHLS-PRKRKYKKISPRDQR 270
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEV-SPSQFEAHAG---- 566
LH+LVF+E GLPDGTE+ YYA GQKLL GYK G GI+CHCCN EV SPS FEAHAG
Sbjct: 271 LHRLVFEEGGLPDGTELAYYARGQKLLGGYKRGFGILCHCCNCEVVSPSTFEAHAGWATR 330
Query: 567 -------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC 601
R+Y +DNDDLC ICADGG+LL CDGCPRAFHK C
Sbjct: 331 KKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCIICADGGDLLLCDGCPRAFHKGC 390
Query: 602 ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA 661
ASLS++P GDWYC++CQN F+R++F++H+ANA AGRVS +DS+EQITKRC RIVKN+EA
Sbjct: 391 ASLSTVPSGDWYCQHCQNTFQREKFVEHNANAFAAGRVSEIDSIEQITKRCFRIVKNVEA 450
Query: 662 ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 721
EL+GC LCRG DF +SGFGPRTI+LCDQCE+EFHVGCL+ HKMA+L+ELPKG WFCCMDC
Sbjct: 451 ELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMANLKELPKGNWFCCMDC 510
Query: 722 SRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLL 780
SRI+S LQ LL++ AEKLP+ LN I KK+ L + IDVRW LLSGK A+PE +LL
Sbjct: 511 SRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISNSIDVRWTLLSGKIASPENKLL 570
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILR 840
LS+A++IF +CFDPIVDS GRDLIP MVYG+N +GQ++GGMYCA+L VNS +VSAGILR
Sbjct: 571 LSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLIVNSCIVSAGILR 630
Query: 841 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 900
VFG+EVAELPLVAT +HGKGYFQLLF+CIEKLL+FL V+++VLPAAEEAESIW +KFG
Sbjct: 631 VFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWIEKFG 690
Query: 901 FKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 944
F+KI PE LS YRK C Q+V F+GTSMLQK VP C+I + S +S
Sbjct: 691 FQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAVPTCKIVNQSIES 734
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 54/118 (45%), Gaps = 11/118 (9%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMG-GIKFQA 252
E + IA+ S + S + + KKIS + + E GL DG + Y G K
Sbjct: 239 EPDLIAKPSKVASVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKL-- 296
Query: 253 SGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF--ENGKSLLEV 308
L G R GILC C CN C V+ PS FE HA R+ Y C NG SL ++
Sbjct: 297 --LGGYKRGFGILCHC--CN-CEVVSPSTFEAHAGWATRK-KPYACIYTSNGVSLHDL 348
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 861 bits (2224), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/923 (52%), Positives = 595/923 (64%), Gaps = 182/923 (19%)
Query: 45 KVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILES 104
KV+ VNG+IVY+R KRS S NG E
Sbjct: 21 KVSVVNGYIVYTRAKRSLDSC---------------------------------NGFSEH 47
Query: 105 VVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKKCLKR 164
++N V++ EN EC K +EV K+ R
Sbjct: 48 AELKDNAEVEVKTENG---------------------ECEKLKNESTEVVARTRKR--SR 84
Query: 165 PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK 224
SA++ KVE + +V +E+ ++ AL +P+ +ELKMSKKI +N+K
Sbjct: 85 RSALEAKVECCDQMVV------SETEQVVANGGSGINGALGAPRNKMELKMSKKIVVNRK 138
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
PMTV +LF+TG LDGVSVVYMGGIK +ASGLRG+IRDGGILCSC LCNG RVIPPS+FEI
Sbjct: 139 PMTVKKLFDTGFLDGVSVVYMGGIK-KASGLRGVIRDGGILCSCCLCNGRRVIPPSQFEI 197
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HACKQYRRA+QYIC ENGKSLL++LRACR L L+ T+Q+ + S EE+ F C RCKG
Sbjct: 198 HACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYFTCKRCKG 257
Query: 345 TFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQN 404
FP + V + GP +C SCV+ SR SE S N
Sbjct: 258 CFPSSFVERVGP--ICRSCVE------------------------SRKSEESS------N 285
Query: 405 NKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHK 464
N +V K + RSP RP +++N + + +
Sbjct: 286 N--------------VVGK------------RVRSP--------RPVVLSNPSSTSELSV 311
Query: 465 SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPD 524
SSQ +R RK K+T L+ F + S L DQRLHKLVF+E+GLPD
Sbjct: 312 SSQVKRHRK-----KRTKLV---FISISSVL-------------DQRLHKLVFEENGLPD 350
Query: 525 GTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------ 566
GTEV YYA GQKLLEG+K G GI+C CCN+E+SPSQFE HAG
Sbjct: 351 GTEVAYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVS 410
Query: 567 -----------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
R+Y KDNDDLC +C DGGNLL CDGCPRAFHKECA+LSSIP+GDWYC+
Sbjct: 411 LHELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSIPRGDWYCQ 470
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
+CQNMF+R++F+ H+ANAV AGRV GVD +EQI RCIRIVK++EA+LS C LCRG DFS
Sbjct: 471 FCQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDFS 530
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
+SGFGPRTI+LCDQCE+E+HVGCL+ HKMA L+ELP+G W CC DC+RI+S L+NLLV+
Sbjct: 531 RSGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVKG 590
Query: 736 AEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
AE+LPE L I KK LE + IDVRWRLL+GK A+PETR LL +AV+IFH+CF+P
Sbjct: 591 AERLPESLLGVIKKKQEEKGLEPI--IDVRWRLLNGKIASPETRPLLLEAVSIFHECFNP 648
Query: 795 IVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
IVD+ SGRDLIP+MVYGRN+RGQEFGGMYCA+L VNSSVVSAG+LR+FG +VAELPLVAT
Sbjct: 649 IVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVAT 708
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
S NHGKGYFQ LF+CIE+LL+FL VK++VLPAAEEAESIWTDKFGF K++P+ L+ YRK
Sbjct: 709 SNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSKMNPDELTNYRK 768
Query: 915 RCSQLVTFKGTSMLQKRVPACRI 937
C Q+V+FKGT+ML K VP+CR+
Sbjct: 769 NCHQMVSFKGTNMLHKMVPSCRV 791
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/750 (55%), Positives = 505/750 (67%), Gaps = 77/750 (10%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
MSKKI+L PMTV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+ CNG
Sbjct: 1 MSKKIALENVPMTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDAGILCSCAFCNGH 60
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S LP E
Sbjct: 61 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 120
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSE 394
++F C RCK + + P +S SS+
Sbjct: 121 RTFTCKRCKEQV--------------------------------LALEYFKPASLSTSSQ 148
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+++ RKK RK E D ++K S KS S+ + R +++ P L+
Sbjct: 149 DNT----------PRKKKRKPEEQDSITKPS-KSASVYLSSRKRKYKKIS-----PRLVC 192
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
P+ + K V I K + SP F +DQRLH+
Sbjct: 193 FFYPIDILFGLVMLS-PFPFLWLVKIFVFIRK-YAYLSPFCPFSGYQS-----QDQRLHR 245
Query: 515 LVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ-------------- 560
LVF+E GLPDGTE+ YYA GQ + Y + N S SQ
Sbjct: 246 LVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQKLLGGYAYIYTSNG 305
Query: 561 -------FEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
R+Y +DNDDLC ICADGGNLL CDGCPRAFHK CAS+ ++P GDWY
Sbjct: 306 VSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPRAFHKGCASIPTVPSGDWY 365
Query: 614 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 673
C+YCQN FER++ ++H+ANA AGR SG+DS+EQITKRC RIVKN+EAEL+GC LCRG D
Sbjct: 366 CQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKRCFRIVKNIEAELTGCALCRGYD 425
Query: 674 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
F +SGFGPRTI+LCDQCE+EFHVGCL+ HKM +L+ELPKG WFCCMDCSRI+S LQ LL+
Sbjct: 426 FMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKGNWFCCMDCSRIHSTLQKLLI 485
Query: 734 QEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 792
+ AEKLP+ LN I KK+ L ++IDVRW LLSGK A+PE +LLLS+A++IF +CF
Sbjct: 486 RGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSGKIASPENKLLLSRALSIFQECF 545
Query: 793 DPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
DPIVDS GRDLIP MVYG+N +GQ++GGMYCA+LT+NSS+VSAGILRVFG+EVAELPLV
Sbjct: 546 DPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTINSSIVSAGILRVFGEEVAELPLV 605
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
AT HGKGYFQLLF+CIEKLL+FL V+++VLPAAEEAESIWT+KFGF+KI PE L+ Y
Sbjct: 606 ATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWTEKFGFQKIKPEQLNKY 665
Query: 913 RKRCSQLVTFKGTSMLQKRVPACRIGSSST 942
RK C Q+V F+GTSMLQK VP CRI + T
Sbjct: 666 RKSCCQMVRFEGTSMLQKAVPTCRIVNQRT 695
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 788 bits (2035), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/756 (55%), Positives = 534/756 (70%), Gaps = 68/756 (8%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
C RCKG FP +C+ + GP+C+SCV+S K + + RIR+ P LVS+SS
Sbjct: 118 TCKRCKGRFPSSCMERV--GPICSSCVESSKSEESSKNVVSKRIRSPRPVLVSKSSCASE 175
Query: 398 MCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANST 457
M IS + +R RK+ + + + S S SV + K + S + +NS
Sbjct: 176 MSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILPRRKVTPKTKKKSLSVKLKTTSNSN 235
Query: 458 PVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVF 517
+ S Q + + KITKK L+ P SFP T D RLHKLVF
Sbjct: 236 CL-----SPQIKSEWKITKK-----LV---------PYSFP-------TCGDNRLHKLVF 269
Query: 518 DESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG----- 566
+E+GLPDG+E+ YYA G QKLLEG+K G GI+C CCN+E+SPSQFE HAG
Sbjct: 270 EENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRK 329
Query: 567 ------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECA 602
R+Y DNDDLC +C DGGNLL CDGCPRAFHKECA
Sbjct: 330 KPYAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGCPRAFHKECA 389
Query: 603 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 662
SLSSIP+GDWYC++CQNMF+R++F+ ++ NA AGRV GVD +EQITKRCIRIVK+++AE
Sbjct: 390 SLSSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAE 449
Query: 663 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 722
LS C LCRG DFSKSGFGPRTI+LCDQCE+E+HVGCL+ HKM L+ELPKG W CC DC+
Sbjct: 450 LSACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCT 509
Query: 723 RINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 781
RI+S L+N+LV+ AE+LP+ L I KK L+ ++DI+VRWRLLSGK A+PETR LL
Sbjct: 510 RIHSTLENVLVRGAERLPKSLLAVIKKKQEEKGLDPINDINVRWRLLSGKKASPETRPLL 569
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 841
+AV+IFH+CFDPIVD++SGRDLI +MVYG+++RGQEFGGMYCA+L VNSSVVSAG+LR+
Sbjct: 570 LEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLIVNSSVVSAGMLRI 629
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
FG ++AELPLVATS HGKGYFQ LF+CIE+LL+F++VK++VLPAAEEA+SIWTDKFGF
Sbjct: 630 FGTDIAELPLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAAEEAQSIWTDKFGF 689
Query: 902 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
KI P+ L+ YR+ C+Q VTF+GT+ML K VP CR+
Sbjct: 690 SKIKPDELANYRRNCNQFVTFQGTNMLHKMVPPCRV 725
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/974 (45%), Positives = 601/974 (61%), Gaps = 114/974 (11%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDD-------LLEDDVIDKRINSK----IHEGRI- 88
CKR K T+VNGFIVY+R ++++F+ + LLE+ + + SK + G I
Sbjct: 42 CKRIKTTQVNGFIVYTRTRKTKFTKLHEQGDENAGLLENRMSNHLEESKPTIGVTNGSIG 101
Query: 89 ------NKVVKNVLNENGILESVVEEE----------------NQLVQMTVENVIEETVK 126
N +KN E+ + VEE + LV + ++++ +
Sbjct: 102 ETNVSGNSCIKNTFVESPAGKIAVEERLVTGSLAESPAVETDSSSLVDVVIDDINFVELL 161
Query: 127 GKKAPICKEEPISKVEC-FPRKEGGSEV-----SNGLNKKCLKRPSAMKPKVEPVEVLVT 180
++ P+ E +S+ F K G++V S +++K KR + K + + ++
Sbjct: 162 HEEIPV---EILSEGSLDFEVKRLGTKVRTMGKSYSVSEK--KRHGSFKRTAQIYKSILR 216
Query: 181 QSEGFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKKPMTVTELFETGL 236
+ ++ VE ++E L ++ L K I + ++P TV ELFETG+
Sbjct: 217 MKK---VNNLVPENVEVLSEPDFGREGLDEQSHSVSLA-DKSILIRRRPETVRELFETGI 272
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDG+SVVYMG +K QA GLRGII+DGGILCSCS C+ VI SKFEIHACKQYRRASQY
Sbjct: 273 LDGLSVVYMGTVKSQAFGLRGIIKDGGILCSCSSCDWAHVISTSKFEIHACKQYRRASQY 332
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGP 356
ICFENGKSLL+VL R+ PL L+AT+ A+ +EK F C RCKG FP + +G G
Sbjct: 333 ICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKCFTCKRCKGAFPFSSLGHRGF 392
Query: 357 GPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
LC SC + + Q + P + S+ + + CI+ R +
Sbjct: 393 --LCMSCSEVETSQAS-------------PAAMWTSTSSPA-CIASPVKSRLK------- 429
Query: 417 EADLVSKSSSKSVSLRNLLKT---RSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRK 473
+++ S+S+S+ + + S +TR + R L+ + S + SSQ++ + K
Sbjct: 430 ----ITRKPSESMSISPVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSK 485
Query: 474 ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYAC 533
K + + K ++ S +S +S + +T KDQ LHKLVF+ GLP+GTE+GYYA
Sbjct: 486 FKKMLTQYSVTPKAVKSVSLSVSSKKRS-YRLTRKDQGLHKLVFERGGLPEGTELGYYAR 544
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------------------------- 566
GQKLL GYK G GI C+CC SEVSPS FEAHAG
Sbjct: 545 GQKLLGGYKMGAGIYCYCCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFS 604
Query: 567 --RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
R+Y DN+DLC ICADGGNLL CD CPRAFH EC SL SIP+G+W+CKYC+N F +
Sbjct: 605 QGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSE 664
Query: 625 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 684
+++ N+ G++ GVD V+Q RCIR+VKN+EAE +GC+LC G DF +SGFGPRTI
Sbjct: 665 IAGEYNVNSSAVGQLEGVDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTI 724
Query: 685 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 744
++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+ AE L + L
Sbjct: 725 IICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSL 784
Query: 745 NAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
I +K + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFDPIVD +SGR+
Sbjct: 785 GIIQRKQERTDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFDPIVDPLSGRN 844
Query: 804 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVAT + KGY
Sbjct: 845 LIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGY 904
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
FQLLF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y K C Q+V FK
Sbjct: 905 FQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFK 964
Query: 924 GTSMLQKRVPACRI 937
G SMLQK V A +I
Sbjct: 965 GASMLQKPVHAHQI 978
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 768 bits (1983), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/751 (52%), Positives = 506/751 (67%), Gaps = 55/751 (7%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C RCKG FP + +G G LC SC + + Q ++ T R+S +
Sbjct: 373 FTCKRCKGPFPFSSLGHRGF--LCKSCSEVETSQASLAAT--------------RTSTSA 416
Query: 397 SMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANS 456
CI+ R K TRK E+ +S S+ S ++TR + R L+ +
Sbjct: 417 PACITSPVKSR-LKITRKPSESTSISPVFMSSLG-------NSTRKITRKALRQALVGKA 468
Query: 457 TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 516
S + SSQ + + K K + + K ++ S +S +S + + KDQ LHKLV
Sbjct: 469 YLSASTNVSSQKKCRSKFKKMLTQHSVTPKALKSVSLSVSSKKRS-YRLARKDQGLHKLV 527
Query: 517 FDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------- 566
FD GLP+GTE+GYYA GQKLL GYK G GI C+CC EVSPS FEAHAG
Sbjct: 528 FDRGGLPEGTELGYYARGQKLLGGYKMGAGIYCYCCKCEVSPSLFEAHAGWASRRKPYFY 587
Query: 567 -------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSI 607
R+Y DN+DLC ICADGGNLL CD CPRAFH EC SL SI
Sbjct: 588 IYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSI 647
Query: 608 PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 667
P+G+W+CKYC+N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +GC+
Sbjct: 648 PRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCV 707
Query: 668 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 727
LC G DF +SGFGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS
Sbjct: 708 LCSGSDFCRSGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINST 767
Query: 728 LQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 786
LQ LL+ AEKL + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+A
Sbjct: 768 LQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALA 827
Query: 787 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 846
IFHDCFDPIVD +SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EV
Sbjct: 828 IFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREV 887
Query: 847 AELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP 906
AELPLVAT + KGYFQLLF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ P
Sbjct: 888 AELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAP 947
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
E LS Y K C Q+V FKG SMLQK V + +I
Sbjct: 948 EQLSKYIKICYQMVRFKGASMLQKPVDSHQI 978
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/515 (64%), Positives = 396/515 (76%), Gaps = 45/515 (8%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + N+S NKS IT KDQR
Sbjct: 1903 NNSPATSIHRTSERARLLKPIPVTKSSGSALY------NSSE-----NKSLGKITKKDQR 1951
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG----- 566
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHAG
Sbjct: 1952 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 2011
Query: 567 ------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECA 602
R+Y +DNDDLC+IC DGGNLL CDGCPRAFH+ CA
Sbjct: 2012 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 2071
Query: 603 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 662
SL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 2072 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 2130
Query: 663 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 722
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 2131 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 2190
Query: 723 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 781
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 2191 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 2250
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 841
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 2251 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 2310
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
FGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPAAEEAE IWT KFGF
Sbjct: 2311 FGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGF 2370
Query: 902 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 936
KKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2371 KKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2405
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 141/380 (37%), Positives = 190/380 (50%), Gaps = 63/380 (16%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1589 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1628
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPISKVECFPRKE--- 148
E ES +EE Q T EN V+E+ G PIC+EEP S+ + K+
Sbjct: 1629 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1684
Query: 149 GGS-------------EVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
GS E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1685 DGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1743
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1744 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1800
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1801 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1860
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1861 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1909
Query: 374 TYTTGIRIRASGPKLVSRSS 393
+ T R R P V++SS
Sbjct: 1910 IHRTSERARLLKPIPVTKSS 1929
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 691 bits (1782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/503 (65%), Positives = 398/503 (79%), Gaps = 31/503 (6%)
Query: 471 QRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY 530
+ I +S K +SK +A +S NK +W IT KDQRLHKLVF++ GLPDGTEV Y
Sbjct: 349 ETNIKSRSSKPATVSKSLNSALEGVSSENKCQWKITTKDQRLHKLVFEDGGLPDGTEVAY 408
Query: 531 YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------ 566
YA GQKLL GYK G GI+C CCN EVSPS FEAHAG
Sbjct: 409 YARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVSLHELAI 468
Query: 567 -----RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 621
R+Y +DNDDLC +CADGG+L+ CDGCPRAFHK CASLSSIP+G W+C++CQNMF
Sbjct: 469 SLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQFCQNMF 528
Query: 622 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 681
+R++F++H+ANAV AGR+SGVD +EQIT+RCIRIVKN+EAEL+GC+LCRG DFS+SGFGP
Sbjct: 529 QREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYDFSRSGFGP 588
Query: 682 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 741
RTI+LCDQC +EFHVGCL+ HK+A+L+ELPKGKWFCC DC RI+S L+ LL +EAE +P
Sbjct: 589 RTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLAREAEIIPN 648
Query: 742 FHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSIS 800
L + KK LETV++IDVRW+LL+GK+A+PET+LLLSQA+AIF +CFDPIVD+ +
Sbjct: 649 KLLEVVMKKNEEKGLETVNNIDVRWKLLTGKSASPETKLLLSQALAIFQECFDPIVDT-T 707
Query: 801 GRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 860
GRDLIP MVYG+N +GQ++GGMYCA+L VNS VVSA I+R+FGQEVAELPLVATS NHG
Sbjct: 708 GRDLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLVATSNGNHG 767
Query: 861 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 920
KGYFQLLF+ IEKLL++L+V SIVLPAAEEAESIWTDKFGF+KI P+ LS YRK C Q++
Sbjct: 768 KGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSKYRKSCCQIL 827
Query: 921 TFKGTSMLQKRVPACRIGSSSTD 943
TFKGTSMLQK VP CRI + +T+
Sbjct: 828 TFKGTSMLQKAVPPCRIVNQNTE 850
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 200/419 (47%), Positives = 253/419 (60%), Gaps = 57/419 (13%)
Query: 1 MKRELDYELAGSLDETSTQSLPQAGIQASDCVKAACENVRCKRFKVTKVNGFIVYSRVKR 60
MKRE+ G E+ +Q QA D + N CKRFKV VNGF VYSR+++
Sbjct: 1 MKREVGAFDGGIQFESESQ-------QAQD----SNNNNNCKRFKV--VNGFFVYSRLRK 47
Query: 61 SRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLVQMTVENV 120
++ S+ + H+ + K + I+++V E E V++V
Sbjct: 48 NKPSSRE---------------CHDDKDRKCQQ-------IIQTVSEVETVNKDPQVKDV 85
Query: 121 IEE-TVKGKKAPICKEEPISK-VECFPRKEGGSE-------------VSNGLNKKCLKRP 165
E ++ + PICK E S+ EGG+E SN L + L R
Sbjct: 86 SRELSLCNVQLPICKIESFSEPSRVILANEGGTEDTERKLAHVGTEGKSNKLRQ--LTR- 142
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S KVEPVEV V E +E +S ++VE IAEGSALT PKKNLELKMSKKI+L+ P
Sbjct: 143 SNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKIALDNIP 202
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
MTV ELFETGLL+GV VVYMGG K A LRG I+D GILC CS C GCRVIPPS+FEIH
Sbjct: 203 MTVKELFETGLLEGVPVVYMGGKK--AFCLRGTIKDVGILCYCSFCKGCRVIPPSQFEIH 260
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGT 345
A KQYRRA+QYICFENGKSLL+VL ACR+ PL L+AT+QSA+S LP+EK+F C RCKGT
Sbjct: 261 AIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKTFTCKRCKGT 320
Query: 346 FPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQN 404
+P VGK GPLC+SCV+SK+ G+ T I+ R+S P VS+S + +S +N
Sbjct: 321 YPTILVGKV--GPLCSSCVESKESNGSPACETNIKSRSSKPATVSKSLNSALEGVSSEN 377
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/515 (63%), Positives = 394/515 (76%), Gaps = 45/515 (8%)
Query: 455 NSTPVTSVHKSSQSQRQRK---ITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N++P TS+H++S+ R K +TK S + S NKS IT KDQR
Sbjct: 371 NNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSE-----------NKSLGKITKKDQR 419
Query: 512 LHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG----- 566
LH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C CC+ EVS SQFEAHAG
Sbjct: 420 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 479
Query: 567 ------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECA 602
R+Y +DNDDLC+IC DGGNLL CDGCPRAFH+ CA
Sbjct: 480 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 539
Query: 603 SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 662
SL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE
Sbjct: 540 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 598
Query: 663 LSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 722
+S C+LCRG DFSKSGFGPRTI+LCDQCE+EFH+GCL+ HKM DL+ELP GKWFCC++C
Sbjct: 599 VSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLECI 658
Query: 723 RINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 781
RI+S LQ L V+ EKLP+ LN IK K+ LE+++D +VRWRLLSGK A+PETR+LL
Sbjct: 659 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKLASPETRVLL 718
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 841
S+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F G+YCA++TVNS VVSAGILRV
Sbjct: 719 SEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGILRV 778
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
FGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL V+S VLPAAEEAE IWT KFGF
Sbjct: 779 FGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKFGF 838
Query: 902 KKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 936
KKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 839 KKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 873
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 134/369 (36%), Positives = 186/369 (50%), Gaps = 61/369 (16%)
Query: 47 TKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVV 106
+ +G I YSR KR + LE+ D+R + + E ES
Sbjct: 68 NRWDGVIQYSRNKRLK------RLEESKNDER--------------RTIAEEPKDDESTT 107
Query: 107 EEENQLVQMTVEN---VIEETVKGK-KAPICKEEPIS----------------KVECFPR 146
+EE Q T EN V+E+ G PIC+EEP S K++
Sbjct: 108 DEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESNDGSLKLQTAGL 163
Query: 147 KEGGSEVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSAL 204
+ E+ + +K KR SA+K K + VE L + F N ++ + +L
Sbjct: 164 IDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGVDEKTNGAVRSL 222
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
TSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G + + L+G I+ GI
Sbjct: 223 TSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDG--RKKGYRLQGTIKGNGI 279
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+ PL L+AT+
Sbjct: 280 LCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLEATI 339
Query: 325 QSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRAS 384
QSA+ S P ++S P K PL NSC+K T + T R R
Sbjct: 340 QSAIGSFPVKRS---------LPADEAAKM--DPLGNSCIKRNNSPATSIHRTSERARLL 388
Query: 385 GPKLVSRSS 393
P V++SS
Sbjct: 389 KPIPVTKSS 397
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/744 (49%), Positives = 466/744 (62%), Gaps = 90/744 (12%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I + +P TV +LFETGLLDG+SVVYMG +K QA LRGIIRDGGILCSCS C+ V
Sbjct: 253 KSILIRSRPETVRDLFETGLLDGLSVVYMGTVKSQAFPLRGIIRDGGILCSCSSCDWANV 312
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
I SKFEIHACKQYRRASQYICFENGKSLL+VL R+ PL L+AT+ A+ +EK
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C RCK F CV + R SE+
Sbjct: 373 FTCKRCKEGF----------------CVSHAR----------------------RPSEST 394
Query: 397 SMCISYQNN--KRERKKTRKLLEADLVSKSS-SKSVSLRNLLKTRSPWELTRNSSRPGLI 453
S+ + ++ RK TRK L LV K+ S S ++ + K RS ++
Sbjct: 395 SISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFK----------- 443
Query: 454 ANSTPVTSVHKSSQSQRQRKITKKSKKTV-LISKPFENASPPLSFPNKSRWNITPKDQRL 512
V +H + + KS + L+S P P KDQ L
Sbjct: 444 --KMLVFLLHNDVLMLAEPTLMIKSLILLHLVSLYVLKVDPTFCDPQGFE-----KDQGL 496
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLL-------EGYKNGLGIICHCCNSE---------V 556
HKLVFD GLP+GTE+GYYA GQ + E K L I ++ V
Sbjct: 497 HKLVFDRGGLPEGTELGYYARGQTYITVDRNCSEATKWALEYIVTVASASYFYIYTSNGV 556
Query: 557 SPSQFEA--HAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
S ++ GR+Y DN+DLC ICADGGNLL CD CPRAFH EC SL SIP+G+W+C
Sbjct: 557 SLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHC 616
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
KYC+N F + +++ N+ G++ GVD V+Q+ RCIR+VKN+EAE +G
Sbjct: 617 KYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNG--------- 667
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
SGFGPRTI++CDQCE+E+H+GCL + DL+ELPKG WFC MDC+RINS LQ LL+
Sbjct: 668 --SGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLG 725
Query: 735 EAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
AEKL + L I+ K N + ++SD+D+RWRL+SGK +PE+R+LLSQA+AIFHDCFD
Sbjct: 726 GAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKVTSPESRMLLSQALAIFHDCFD 785
Query: 794 PIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
PIVD +SG +LIP MVYG+ ++GQ++GG+ CA+LTVN++VVSAG+LRVFG+EVAELPLVA
Sbjct: 786 PIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVA 845
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + KGYFQLLF+CIEKLLS L V+SIV+PAAEEAE +W +KFGF+K+ PE LS Y
Sbjct: 846 TRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYI 905
Query: 914 KRCSQLVTFKGTSMLQKRVPACRI 937
K C Q+V FKG SMLQK V + +I
Sbjct: 906 KICYQMVRFKGASMLQKPVDSHQI 929
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 3/92 (3%)
Query: 41 CKRFKVTKVNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENG 100
CKR K T+VNGFIVY+R ++++F+ L E + + +++ + E + V + +
Sbjct: 38 CKRIKTTQVNGFIVYTRTRKTKFTK---LHEQEDENAGLSNHLEESKPTSGVTSGFGGDM 94
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPI 132
S V E N V+N + E+ GK I
Sbjct: 95 CRSSSVGETNVSGSSCVKNTLVESSSGKVVVI 126
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 303/477 (63%), Positives = 357/477 (74%), Gaps = 53/477 (11%)
Query: 490 NASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIIC 549
N SP S S +DQRLH+LVF+E GLPDGTEV YYA G+KLL+GYK G GI C
Sbjct: 1968 NNSPATSIHRTSE---RARDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFC 2024
Query: 550 HCCNSEVSPSQFEAHAG-----------------------------RQYPGKDNDDLCTI 580
CC+ EVS SQFEAHAG R+Y +DNDDLC+I
Sbjct: 2025 WCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSI 2084
Query: 581 CADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVS 640
C DGGNLL CDGCPRAFH+ CASL SIPQ DWYC+YCQNMF+R++F++H+ANAV AGRVS
Sbjct: 2085 CGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVS 2144
Query: 641 GVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLK 700
GVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGFGPRTI+LCDQ
Sbjct: 2145 GVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQ----------- 2192
Query: 701 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVS 759
ELP GKWFCC++C RI+S LQ L V+ EKLP+ LN IK K+ LE+++
Sbjct: 2193 --------ELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIA 2244
Query: 760 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 819
D +VRWRLLSGK A+PETR+LLS+AVAIFHD FDPI+DS++GRDLIP+MVYGRN+RGQ+F
Sbjct: 2245 DYNVRWRLLSGKLASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDF 2304
Query: 820 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 879
G+YCA++TVNS VVSAGILRVFGQEVAELPLVATS N G+GYFQ+LF+CIEKLL+FL
Sbjct: 2305 SGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLN 2364
Query: 880 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 936
V+S VLPAAEEAE IWT KFGFKKI P+ LS YRK Q+++F+GT ML+K VP R
Sbjct: 2365 VRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWR 2421
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 135/369 (36%), Positives = 182/369 (49%), Gaps = 63/369 (17%)
Query: 38 NVRCKRFKVTK--VNGFIVYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNV 95
N R + TK +G I YSR KR + LE+ D+R + +
Sbjct: 1654 NDSSDRIRETKNRWDGVIQYSRNKRLK------RLEESKNDER--------------RTI 1693
Query: 96 LNENGILESVVEEENQLVQMTVEN---VIEETVKGK-KAPICKEEPIS------------ 139
E ES +EE Q T EN V+E+ G PIC+EEP S
Sbjct: 1694 AEEPKDDESTTDEE----QKTDENDPVVVEKPTGGYLVGPICEEEPKSQSQKASIKDESN 1749
Query: 140 ----KVECFPRKEGGSEVSNGLNKKCLKR--PSAMKPKVEPVEVLVTQSEGFGNESMSLI 193
K++ + E+ + +K KR SA+K K + VE L + F N +
Sbjct: 1750 DGSLKLQTAXLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVESLESDY-NFCNSVAIGV 1808
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
+ + +LTSPKK L LKMSKKI+LNK P+T+ +L ETG+L+G V Y G K
Sbjct: 1809 DEKTNGAVRSLTSPKK-LGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTYDGRKK--GY 1865
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
L+G I+ GILCSCSLC G RV+ PS+FE+HACK YR A++YI +NGK+L +VL C+
Sbjct: 1866 RLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCK 1925
Query: 314 SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTM 373
PL L+AT+QSA+ S P K + P K PL NSC+K T
Sbjct: 1926 DAPLETLEATIQSAIGSFP---------VKRSLPADEAAKM--DPLGNSCIKRNNSPATS 1974
Query: 374 TYTTGIRIR 382
+ T R R
Sbjct: 1975 IHRTSERAR 1983
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 303/508 (59%), Positives = 382/508 (75%), Gaps = 37/508 (7%)
Query: 460 TSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDE 519
+S+H + R K++ K K + SK LS NKS+W I+ + QRLHKL+F+E
Sbjct: 240 SSIHAVGKRIRSSKLSVKLKTAPITSKC-------LSPQNKSQWRISKRYQRLHKLIFEE 292
Query: 520 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYP------ 570
GLP+G EV YYA GQKLLEG K GI+C CCN+E+SPSQFE HAG R+ P
Sbjct: 293 DGLPNGAEVAYYARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYT 352
Query: 571 --------------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQG 610
K ND +C +C DGGNLL CDGCPRAFHKECAS+SSIP+G
Sbjct: 353 SNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRG 412
Query: 611 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 670
+WYC+ CQ+ F R+R + H+A+AV AGRV GVD +EQI KRCIRIVK++ AE+ GC+LCR
Sbjct: 413 EWYCQICQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCR 472
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN 730
DFS+SGFGPRTI++CDQCE+E+HVGCL+ HKMA L+ELP+G WFCC DC+RI+S L+N
Sbjct: 473 SSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLEN 532
Query: 731 LLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 789
LL++ AE+LPE L+ I KK G LE +++IDVRW+LL+GK A+PETR LL +AV++FH
Sbjct: 533 LLIRVAERLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGKIASPETRPLLLEAVSMFH 592
Query: 790 DCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 849
+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYCA+L VNSSVVSAG++R+FG+++AEL
Sbjct: 593 ECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAEL 652
Query: 850 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 909
PLVAT N GKGYFQ LFACIE+LL+FL VK++VLPAAEEA SIWT+KFGF K+ P L
Sbjct: 653 PLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKPNQL 712
Query: 910 SIYRKRCSQLVTFKGTSMLQKRVPACRI 937
+ YR C Q++ FKGT+ML K VP CR+
Sbjct: 713 TNYRMNCHQIMAFKGTNMLHKTVPQCRV 740
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/197 (54%), Positives = 140/197 (71%), Gaps = 2/197 (1%)
Query: 217 KKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRV 276
K I ++KKP TV ELF+TGLLDGV VVY+G K + LRG I+DGGILCSCSLCNG RV
Sbjct: 87 KIIVVHKKPATVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
IPPS+FEIHAC Y+RA+QYIC ENGKS+LE++RACR+ PL L+AT+Q+ ++S PEEK
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEND 396
F C C+G FP + V + G LC SCV+S+K + + + G RIR+S + +++
Sbjct: 207 FTCKNCRGCFPSSNVERV--GLLCLSCVESRKSEKSSIHAVGKRIRSSKLSVKLKTAPIT 264
Query: 397 SMCISYQNNKRERKKTR 413
S C+S QN + R R
Sbjct: 265 SKCLSPQNKSQWRISKR 281
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 293/476 (61%), Positives = 368/476 (77%), Gaps = 30/476 (6%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
LS NKS+W I+ + QRLHKL+F+E GLP+G EV YYA GQKLLEG K GI+C CCN+
Sbjct: 276 LSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYYARGQKLLEGIKTRCGIVCRCCNT 335
Query: 555 EVSPSQFEAHAG---RQYP--------------------------GKDNDDLCTICADGG 585
EVSPSQFE HAG R+ P K ND +C +C DGG
Sbjct: 336 EVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGG 395
Query: 586 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 645
NLL CDGCPRAFHKECAS+SSIP+G+WYC+ CQ+ F R+R + ++A+AV AGRV GVD +
Sbjct: 396 NLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPI 455
Query: 646 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 705
EQI KRCIRIVK++ AE+ GC+LCR DFS+SGFGPRTI++CDQCE+E+HVGCL+ HK A
Sbjct: 456 EQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKA 515
Query: 706 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVR 764
L+ELP+G WFCC DC+ I+S L+NLL++ AE+LPE L+ I KK LE +++IDVR
Sbjct: 516 YLKELPEGDWFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVR 575
Query: 765 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYC 824
W+LL+GK A+PETR LL +AV++FH+CFDPIVD +GRDLIP+MVYGRNL+ Q+FGGMYC
Sbjct: 576 WKLLNGKIASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGMYC 635
Query: 825 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 884
A+L VNSSVVSAG++R+FG+++AELPLVAT N GKGYFQ LFACIE+LL+FL VK++V
Sbjct: 636 ALLIVNSSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKNLV 695
Query: 885 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 940
LPAAEEAESIWT+KFGF K+ + L+ YR C Q++ FKGT+ML K VP CR+ +S
Sbjct: 696 LPAAEEAESIWTEKFGFSKMKLDQLTNYRMNCHQIMAFKGTNMLHKTVPRCRVTNS 751
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/138 (63%), Positives = 108/138 (78%), Gaps = 1/138 (0%)
Query: 208 KKNLELKMSKKI-SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILC 266
K LK +KKI ++KKP+TV ELF+TGLLDGV VVY+G K + LRG I+DGGILC
Sbjct: 85 KTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYVGCKKDSTTELRGEIKDGGILC 144
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
SC LCNG RVIPPS+FEIHAC Y+RA+QYIC ENGKSLL+++RACR+ PL L+AT+Q+
Sbjct: 145 SCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDLMRACRAAPLHTLEATIQN 204
Query: 327 ALSSLPEEKSFACVRCKG 344
++S PEEK F C C+G
Sbjct: 205 FINSPPEEKYFTCKSCRG 222
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 293/498 (58%), Positives = 369/498 (74%), Gaps = 32/498 (6%)
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQK 536
++ K ++SK + + +S K IT KD RLHKLVF+E LPDGTEV YYA GQK
Sbjct: 473 RTPKPNVLSKSSDTITKSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQK 532
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG-----------------------------R 567
LL GYK G GI C CCNSEVSPSQFEAHAG R
Sbjct: 533 LLVGYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGR 592
Query: 568 QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 627
++ DNDDLC+ICADGG+LL CDGCPR+FH++C L IP G WYCKYCQN+F++++F+
Sbjct: 593 KFSLTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNLFQKEKFV 652
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
+H+ANAV AGRV+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++LC
Sbjct: 653 EHNANAVAAGRVAGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILC 712
Query: 688 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI 747
DQCE+EFHVGCLK++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L ++
Sbjct: 713 DQCEKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSV 772
Query: 748 -KKYAGNSLETVSDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
KK +++D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD I
Sbjct: 773 QKKIEDQGSASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFI 832
Query: 806 PSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQ 865
PSM+YGRN+RGQEFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+GYFQ
Sbjct: 833 PSMLYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQ 892
Query: 866 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGT 925
L+ACIE+ L FL VK++VLPAA+EAES+W +KFGF K+ PE + + KR Q++ F+GT
Sbjct: 893 SLYACIERFLGFLNVKNLVLPAADEAESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGT 951
Query: 926 SMLQKRVPACRIGSSSTD 943
SMLQK VP R+ +S+ +
Sbjct: 952 SMLQKEVPKYRVINSAAN 969
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 145/289 (50%), Gaps = 29/289 (10%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEPVEVLVTQSEGFGNESMSLI--EVEAIAE--- 200
+V+ L KK ++P SA+K VEP L S+ +M +I + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTS-LEHLSKCNTGVAMQVITNDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ------- 370
+ +QSA+ +++ C+ CKG P + G LC SC+ S+KPQ
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDSRKPQVSSSPSP 458
Query: 371 ---GTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
+ T + R P ++S+SS+ + +S + K + TRK L
Sbjct: 459 SPSPSPTPIVFSKDRTPKPNVLSKSSDTITKSVSTR-GKIHGRITRKDL 506
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 293/495 (59%), Positives = 367/495 (74%), Gaps = 32/495 (6%)
Query: 480 KTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLE 539
K ++SK + + +S K IT KD RLHKLVF+E LPDGTEV YYA GQKLL
Sbjct: 476 KPNVLSKSSDTITKSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQKLLV 535
Query: 540 GYKNGLGIICHCCNSEVSPSQFEAHAG-----------------------------RQYP 570
GYK G GI C CCNSEVSPSQFEAHAG R++
Sbjct: 536 GYKKGSGIFCSCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFS 595
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
DNDDLC+ICADGG+LL CDGCPR+FH++C L IP G WYCKYCQN+F++++F++H+
Sbjct: 596 LTDNDDLCSICADGGDLLCCDGCPRSFHRDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHN 655
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
ANAV AGRV+GVD +EQIT RCIRIVK +E E+ GC LCR DFSKSGFGPRT++LCDQC
Sbjct: 656 ANAVAAGRVAGVDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQC 715
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KK 749
E+EFHVGCLK++ M DL+ELP+GKWFCC +C+RI+S L+ L+V EKLPE L ++ KK
Sbjct: 716 EKEFHVGCLKENNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKK 775
Query: 750 YAGNSLETVSDIDVRWRLLSGKA-ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 808
+++D+++RWR+L+ K ++ ETR LLS+AV+IFHDCFDPIVDS SGRD IPSM
Sbjct: 776 IEDQGSASINDVEIRWRVLNWKMLSSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSM 835
Query: 809 VYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 868
+YGRN+RGQEFGG+YCA+LTVN SVVS GI R+FG EVAELPLVAT G+GYFQ L+
Sbjct: 836 LYGRNIRGQEFGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLY 895
Query: 869 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 928
ACIE+ L FL VK++VLPAA+EAES+W +KFGF K+ PE + + KR Q++ F+GTSML
Sbjct: 896 ACIERFLGFLNVKNLVLPAADEAESLWINKFGFSKLPPEEVMEF-KRHYQMMIFQGTSML 954
Query: 929 QKRVPACRIGSSSTD 943
QK VP R+ +S+ +
Sbjct: 955 QKEVPKYRVINSAAN 969
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 124/234 (52%), Gaps = 18/234 (7%)
Query: 152 EVSNGLNKKCLKRP------SAMKPKVEP--VEVLVTQSEGFGNESMSLIEVEAIAE--- 200
+V+ L KK ++P SA+K VEP +E L + G + ++ + E E
Sbjct: 223 DVNGQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVIT-NDTETKPEDIP 281
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRG 257
G T P K + K+ KK+S K P + +L +TG+L+G+ V Y+ G K +A +GL G
Sbjct: 282 GPLATPPVKIGKTKL-KKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSKIKALGETGLGG 340
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I GI+C C+ C G V+ P+ FE+HA +R +YI E G +L +++ AC++
Sbjct: 341 VISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLRDIMNACQNFSF 400
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQG 371
+ +QSA+ +++ C+ CKG P + G LC SC+ SKKPQ
Sbjct: 401 DQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDSKKPQA 452
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 336/785 (42%), Positives = 462/785 (58%), Gaps = 112/785 (14%)
Query: 194 EVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS 253
E EA AE S L +P + + S+ L K P + +L TG+L+G+ V+YM G K +
Sbjct: 286 ETEASAEASLLMTPPSSAKFSNSR---LKKFPSKLKDLLATGILEGLPVMYMKGAKVLFA 342
Query: 254 G---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLE 307
G L+G+I+D G+LC C +CNG V+ P+ FE+HA +R +YI +G K+L +
Sbjct: 343 GEKGLQGVIQDSGVLCFCKICNGVEVVTPTVFELHAGSANKRPPEYIYIHDGNCGKTLRD 402
Query: 308 VLRACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVK 365
V+ AC PL + +Q L +KS C+ C+G C G + +C+ C+
Sbjct: 403 VMNACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSKL--VCDLCLA 456
Query: 366 SKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSS 425
S PQ M +RK++ + +S
Sbjct: 457 SP-PQTAMA-------------------------------------SRKVISQPVQPRSP 478
Query: 426 SKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLIS 485
V ++L N +P + N P S+ ++ S T +
Sbjct: 479 EPVVIQKSL----------DNEVQPNSLDNEVPPNSL--------DNEVQPNSLDTGVQP 520
Query: 486 KPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
K F N + KS+ +T KD RLHKLVF+ LPDGTE+ YYA GQKLL GYK G
Sbjct: 521 KSFSNGMKHSASRGKSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGC 580
Query: 546 GIICHCCNSEVSPSQFEAHAG------------------------------RQYPGKDND 575
GI C CCN +VS SQFEAHAG R++ DND
Sbjct: 581 GIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDND 640
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV- 634
DLC IC DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++ R QH+ NA+
Sbjct: 641 DLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALA 700
Query: 635 EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREF 694
AGR++G D +E + KRCIR+VK +E + GC LC +FSKS FGPRT+++CDQCE+E+
Sbjct: 701 AAGRIAGPDILELMNKRCIRVVKTVEVDHGGCALCSRPNFSKS-FGPRTVIICDQCEKEY 759
Query: 695 HVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGN 753
HVGCLK+H M +L +LP+G WFC +CS I++ L +L+ + + +P+ L+ I KK+
Sbjct: 760 HVGCLKEHNMENLEKLPEGNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEK 819
Query: 754 SLETVSDIDVRWRLLSGKAATP-----ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 808
SLE + +DV+WR+++ K + ETR LLS+AVAIFH+ FDPIVDS SGRD IP+M
Sbjct: 820 SLEIGAGLDVKWRVMNWKLDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTM 879
Query: 809 VYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLF 868
++GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G+GYFQ LF
Sbjct: 880 LFGRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLF 939
Query: 869 ACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSML 928
+CIE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y K+ +++ F+GTS+L
Sbjct: 940 SCIETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKY-KKFYRMMIFQGTSVL 998
Query: 929 QKRVP 933
QK VP
Sbjct: 999 QKPVP 1003
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 363/902 (40%), Positives = 513/902 (56%), Gaps = 133/902 (14%)
Query: 98 ENGILESVVEEENQLVQMTV---ENVIEETVKGKKAPICKEEPISKVECFPRKEGG---S 151
E +L+ V+ EE +V T+ E ++ ET+K + E+P+ E + G +
Sbjct: 142 EPKVLDDVINEEEAIVAETLKEQEPIVPETLKEEVVDEMAEQPLCIEESEEKDSNGVALA 201
Query: 152 EVSNGLN-----KKCLKRP--------SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAI 198
V++G KK L+RP SA+K K E E G +S
Sbjct: 202 LVNDGAKGKKSMKKRLERPQSERRFTRSALKVKSEET----NDGEHVGVAGISDGVKRET 257
Query: 199 AEGSAL--TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF---QAS 253
G++L T+P +K S + L K P + +L TG+L+G+ V+YM G+K
Sbjct: 258 EAGASLVMTTPSS---VKFSNRGKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEK 314
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENG---KSLLEVLR 310
GL+G+I+D G+LC C +C G V+ P+ FE+HA +R +YI +G K+L +V+
Sbjct: 315 GLQGVIQDSGVLCFCKICKGVEVVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMN 374
Query: 311 ACR--SVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKK 368
AC PL + +Q L +KS C+ C+G C G + +C+SC+ S
Sbjct: 375 ACCCCDFPLESMDEAVQKLLGDFTMKKSSICLNCRGA----CKGVSRL--VCDSCLVSPA 428
Query: 369 PQGTMTYTTGI----RIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKS 424
Q + GI + R+ P ++ +S +N+ S N + K +
Sbjct: 429 -QTAVASNKGISQPVQPRSPEPVVIQKSLDNEVQPNSLHNEVQPNKLDTGM--------- 478
Query: 425 SSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLI 484
+ SL N ++ S NS +P +N H +S+ + Q ++T+K
Sbjct: 479 --QPNSLDNGMEPDS----LNNSMKPKSFSNGMK----HSASRGKSQGRLTRK------- 521
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
D RLHKLVF+ LPDGTE+ YYA GQKLL GYK G
Sbjct: 522 ------------------------DLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKG 557
Query: 545 LGIICHCCNSEVSPSQFEAHAG------------------------------RQYPGKDN 574
GI C CCN +VS SQFEAHAG R++ DN
Sbjct: 558 YGIFCTCCNEQVSASQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDN 617
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAV 634
DDLC IC DGG+LL CDGCPRAFH +C L IP G WYCKYCQN+F++ R QH+ NA+
Sbjct: 618 DDLCIICEDGGDLLCCDGCPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKDRHGQHEVNAL 677
Query: 635 -EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
AGR++G D +E + KRCIR+V+ LE + GC LC +FSKS FGP+T+++CDQCE+E
Sbjct: 678 AAAGRIAGPDILELMNKRCIRVVRTLEVDHGGCALCSRPNFSKS-FGPQTVIICDQCEKE 736
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAG 752
+HVGCLK H M +L ELP G WFC +CS+I++ L +L+ + + +P+ LN I KK+
Sbjct: 737 YHVGCLKDHNMENLEELPVGNWFCSGNCSQIHTALMDLVASKEKDVPDPLLNLIKKKHEE 796
Query: 753 NSLETVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 810
SL+ + +DV+WR+++ K + + ETR LLS+AVAIFH+ FDPIVDS SGRD IP+M++
Sbjct: 797 KSLDIGAGLDVKWRVINWKLDSDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPAMLF 856
Query: 811 GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFAC 870
GRN+RGQ+F G+YCA+LTVN +VSAG+ RVFG E+AELPLVAT+ + G+GYFQ LF+C
Sbjct: 857 GRNIRGQDFSGIYCAVLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQGQGYFQCLFSC 916
Query: 871 IEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
IE LL L VK++VLPAA+EAESIWT KFGF K+ + ++ Y K+ +++ F+GTS+LQK
Sbjct: 917 IETLLGSLNVKNLVLPAADEAESIWTGKFGFTKLPQDEINKY-KKFYRMMIFQGTSVLQK 975
Query: 931 RV 932
V
Sbjct: 976 PV 977
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 331/793 (41%), Positives = 461/793 (58%), Gaps = 108/793 (13%)
Query: 184 GFGNESMSLIEVEAIA---EGSALTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDG 239
G E L++V+ A E SP K + K L P + ++F+ G+L+G
Sbjct: 333 GNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEG 392
Query: 240 VSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
+ V Y+ G K + +G L+G+I+ G+LC CS C G +V+ P+ FE+HA +R +Y
Sbjct: 393 LIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIGIQVVSPAMFELHASSNNKRPPEY 452
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGP 356
I E+G +L +V+ AC+ PL L+ L+ + + +KS C+ C+G C T
Sbjct: 453 ILLESGFTLRDVMNACKENPLATLEEKLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKS 509
Query: 357 GPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNKRERKKTRKLL 416
+C SC++SK+P+ + N+ +
Sbjct: 510 LVVCKSCLESKEPE-------------------------------FHNSPSKAN------ 532
Query: 417 EADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITK 476
D ++ SS SV +++L+ R+ S P QS R+ + T+
Sbjct: 533 --DALNGSSRPSVDPKSILR--------RSKSSP---------------RQSNRREQPTR 567
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQK 536
KS + ++ + S S + S +T KD RLHKLVF++ LPDGTEVGY+ G+K
Sbjct: 568 KSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEK 627
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG-----------------------------R 567
+L GYK G GI C CCN VSPS FEAHAG +
Sbjct: 628 MLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQ 687
Query: 568 QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 627
++ +NDDLC+IC DGG L+ CD CPR++HK CASL S+P W CKYC NM ER++F+
Sbjct: 688 RFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSCKYCVNMVEREKFV 747
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILL 686
+ NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG F + GF RT+++
Sbjct: 748 DSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHSFCRLGFNARTVII 807
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL+V+ EKL LN
Sbjct: 808 CDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNF 867
Query: 747 IKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVAIFHDCFDPIVDSISG 801
++K + E D D+RWR+LSGK ++ +T++LL++A++I H+ FDPI +S +
Sbjct: 868 LRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALSILHERFDPISESGTK 927
Query: 802 RDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+AELPLVATSK G+
Sbjct: 928 GDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQ 987
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+ E + YRK S ++
Sbjct: 988 GYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI- 1046
Query: 922 FKGTSMLQKRVPA 934
F GTSML+K VPA
Sbjct: 1047 FHGTSMLRKSVPA 1059
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 272/460 (59%), Positives = 335/460 (72%), Gaps = 33/460 (7%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD LHKLVF E+GLP+GTEVGYY GQ+LL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 366 VTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQLLVGYKRGSGIFCTCCNSEVSPSQFEAH 425
Query: 565 AG-----------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPR 595
AG R+ DNDDLC+IC DGGNLL CDGCPR
Sbjct: 426 AGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLLCCDGCPR 485
Query: 596 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 655
FHKEC SL++IP+G W+CK+C NM ++++F++H+ANAV AGRV+GVD +EQITKRCIRI
Sbjct: 486 VFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRCIRI 545
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
V E+ GC LCR +FS+SGFGPRT++LCDQCE+EFHVGCL++H M DL+E+PKGKW
Sbjct: 546 VNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPKGKW 605
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSG-KAA 773
FCC DC RINS LQ L+V E+LP L IK KY N D D++WRL+ G +A+
Sbjct: 606 FCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGRRAS 665
Query: 774 TPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSV 833
+ E LLSQA++IFH+ FDPI D+ +GRDL+P MV+G++ R +FGGMYCAILT++S V
Sbjct: 666 SIEAGSLLSQALSIFHEQFDPIADA-AGRDLLPDMVHGKSTREWDFGGMYCAILTISSQV 724
Query: 834 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 893
VSA R+FG+EVAELPLVAT G+GYFQ LF+C+E LL L V+S+VLPAAE AES
Sbjct: 725 VSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAEGAES 784
Query: 894 IWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
IWT+KFGF K+ E + +R R Q+VTF+GT MLQK VP
Sbjct: 785 IWTNKFGFNKVTQEQRNNFR-RDYQMVTFQGTLMLQKLVP 823
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 91/167 (54%), Gaps = 10/167 (5%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKT 354
V+ A ++ L L ++ A+ +KS C+ CKG +G +
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS 341
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 277/468 (59%), Positives = 335/468 (71%), Gaps = 37/468 (7%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+T KD R HKLVF+E LPDGTEVGYY G+KLL GYK G GI C CCNSEVSPSQFEAH
Sbjct: 501 VTKKDIRTHKLVFEEEVLPDGTEVGYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAH 560
Query: 565 AG-----------------------------RQYPGKDNDDLCTICADGGNLLPCDGCPR 595
AG R++ K+NDDLC IC DGG LL CD CPR
Sbjct: 561 AGWASRRKPYLNIYTSNGVSLHELAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPR 620
Query: 596 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 655
AFH+EC SL SIP+G WYCKYC N FE+++F++ +ANA+ AGRV+GVD +EQIT+RCIRI
Sbjct: 621 AFHQECLSLPSIPRGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRI 680
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+HKM DL+ELPKGKW
Sbjct: 681 VKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKW 739
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 775
FCC C RI+S LQ L+++ EKLP+ LN IKK+ ++ E+ DVRWRLLS K +
Sbjct: 740 FCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKHEESASESGCSDDVRWRLLSKKTDSS 799
Query: 776 E-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEFGGMYCAILTV 829
+ T LLS AVAIFH+CFDPI S R D IPSMV G N++GQ+ GGMYCA+L V
Sbjct: 800 DVTEALLSDAVAIFHECFDPITVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLV 859
Query: 830 NSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAE 889
N VVS ++R+FGQE+AELP+VATS G+GYFQ LF CIEKLL FL VK++VLPAAE
Sbjct: 860 NHVVVSVAVVRIFGQELAELPIVATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAE 919
Query: 890 EAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
E SIW +KFGF I + L YR+R Q++ F+G MLQK VP CRI
Sbjct: 920 EVGSIWKNKFGFGAITQDELMEYRRR-YQIMVFQGALMLQKPVPKCRI 966
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 115/233 (49%), Gaps = 15/233 (6%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+E V++ + S+G ++V+ SA + N KM K P
Sbjct: 266 SALKPKIETVDI--SSSDG--------VKVDDRGSSSAAAATTTNTPTKMFSIDGSKKFP 315
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++R+ GILC C C G V+ P+ F
Sbjct: 316 TKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVTPAIF 375
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
+HA +R +YIC ENG +L +V+ AC++ L L ++ + P +KS C C
Sbjct: 376 VLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNFCWNC 435
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSEN 395
+G+ IT G LC+ C K Q T + R + P V SS N
Sbjct: 436 RGS--ITGAGSRKSKVLCSQCFGLKDFQAGSAPKTAKKERTAKPHSVPESSCN 486
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 268/475 (56%), Positives = 335/475 (70%), Gaps = 38/475 (8%)
Query: 499 NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSP 558
+KS+ +T KD R+HKLVF+E LPDGTEV YY+ GQKLL GYK G GI C CCN+EVSP
Sbjct: 563 SKSQGRLTVKDLRMHKLVFEEDVLPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSP 622
Query: 559 SQFEAHAG-----------------------------RQYPGKDNDDLCTICADGGNLLP 589
SQFEAHAG R++ NDDLC IC DGG+LL
Sbjct: 623 SQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLC 682
Query: 590 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 649
CD CPRA+HK+C +L IP G WYCK+C N F++++F++H+ANA+ AGRV+GVD ++QIT
Sbjct: 683 CDVCPRAYHKDCLALPEIPTGRWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQIT 742
Query: 650 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 709
+RCIRIVK ++A+ GC+ CRG DF K FGPRT+LLCDQCE+EFHVGCLK H M DL+E
Sbjct: 743 RRCIRIVKTMDADFGGCVFCRGHDFDKI-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKE 801
Query: 710 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETV--SDIDVRWRL 767
LPKG WFCC DC RI+S L+ L+++ E+L + LN I K + S+IDVRWRL
Sbjct: 802 LPKGNWFCCSDCCRIHSALEKLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRL 861
Query: 768 LSGKA-ATPETRLLLSQAVAIFHDCFDPIV----DSISGRDLIPSMVYGRNLRGQEFGGM 822
L+ K +T LLS+A+AI H+ F+PI+ S + RDLI SMV+G NL+GQEFGGM
Sbjct: 862 LNDKINPAGDTAALLSEALAILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGM 921
Query: 823 YCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKS 882
YCA+L +N +VVS I+R FG E+AELPLVATS GKGYFQ LF CIEKLL FL +K+
Sbjct: 922 YCAVLMINQAVVSCAIIRFFGLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKN 981
Query: 883 IVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
+VLPAAEEAESIW +KFGF+K+ E +RK Q++ F+GTSML K VP RI
Sbjct: 982 LVLPAAEEAESIWINKFGFRKLTHEEFLKFRKD-YQMMVFQGTSMLHKPVPKIRI 1035
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 91/180 (50%), Gaps = 5/180 (2%)
Query: 201 GSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRG 257
GS + LK+ K + K P + +L ++G+L+G V YM G K + +G L+G
Sbjct: 346 GSPSAASNSGTMLKVWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQG 405
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
+I ILC C C G V+ PS FE+HA +R +YI ENG +L +V+ AC++ L
Sbjct: 406 VISGSAILCFCRSCRGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASL 465
Query: 318 PMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTT 377
L L + + S C++C+G G++ LC+ C+ K Q ++ TT
Sbjct: 466 ETLDEALWLSTGCSSLKNSTFCLKCRGKLAEASTGRS--MTLCSQCMVLKDSQASIPATT 523
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 550 bits (1417), Expect = e-153, Method: Compositional matrix adjust.
Identities = 273/480 (56%), Positives = 340/480 (70%), Gaps = 43/480 (8%)
Query: 499 NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSP 558
+ S+ +T KD R+HKLVF+E LPDGTEVGYY+ G+KLL GYK G GI C CCN+EVSP
Sbjct: 471 SNSQGRLTKKDIRMHKLVFEEEVLPDGTEVGYYSQGKKLLVGYKKGFGIFCSCCNTEVSP 530
Query: 559 SQFEAHAG-----------------------------RQYPGKDNDDLCTICADGGNLLP 589
SQFEAHAG R++ K+NDDLC IC DGG LL
Sbjct: 531 SQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKCRRHSTKENDDLCQICRDGGKLLC 590
Query: 590 CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQIT 649
CD CPRAFH+EC SL SIP+G WYCKYC N FE+++F++ +ANA+ AGRV+G D +EQIT
Sbjct: 591 CDVCPRAFHQECLSLPSIPKGKWYCKYCLNTFEKEKFVERNANAIAAGRVAGTDPIEQIT 650
Query: 650 KRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR- 708
+RCIRIVK EAE+ GC+ CRG DF ++ FGPRT+++CDQCE+EFHVGCLK+H+M DL+
Sbjct: 651 RRCIRIVKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQCEKEFHVGCLKEHQMQDLKA 709
Query: 709 --ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRW 765
ELP GKWFCC C RI+S LQ L+++ EKLP+ LN I KK+ ++ E+ D+RW
Sbjct: 710 ICELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLNFIKKKHEESASESGGGDDIRW 769
Query: 766 RLLSGKAATPE-TRLLLSQAVAIFHDCFDPIVDSISGR-----DLIPSMVYGRNLRGQEF 819
RLLS K + T LLS+AVAIFH+ F PI S R D IPSMV G +++GQ+
Sbjct: 770 RLLSKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKRKRDDHDFIPSMVKGGDMKGQDL 829
Query: 820 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 879
GGMYCA+L VN VVSA ++R+FGQE+AELP+VATS + G+GYFQ LF CIEKLL FL
Sbjct: 830 GGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKSQGQGYFQTLFTCIEKLLGFLN 889
Query: 880 VKSIVLPAAEEAESIWTDKFGFKKI--DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
VK++VLPAAEE ESIWT+KFGF I D L YRK Q++ F+G+ MLQK VP CR+
Sbjct: 890 VKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKS-YQIMEFQGSLMLQKPVPKCRV 948
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 120/231 (51%), Gaps = 17/231 (7%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
SA+KPK+EP+++ + S+G + V AI T+P K + L K P
Sbjct: 244 SALKPKIEPLDI--SSSDGVKVDDTGSSSVAAIT-----TTPTKMFAID-----GLKKFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
+ +L ++G+L+G V Y+ G K + GL G++++ GILC C C G V+ P+ F
Sbjct: 292 TKLKDLLDSGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIF 351
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA +R +YI ENG +L +V+ AC++ L +L ++ ++ P +KS C+ C
Sbjct: 352 ELHAGSANKRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSC 411
Query: 343 KGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
+G+ IT G LC+ C++ K Q + T + R P V SS
Sbjct: 412 RGS--ITGAGTRKSKVLCSQCLELKDSQAILAPETDTKERTPRPSPVPESS 460
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 268/466 (57%), Positives = 329/466 (70%), Gaps = 33/466 (7%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
P+K +T KD R+HKLVF+ LPDGT + YY G+KLLEGYK G I C+CC SEVS
Sbjct: 581 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGKKLLEGYKKGGAIFCYCCQSEVS 640
Query: 558 PSQFEAHAG-----------------------------RQYPGKDNDDLCTICADGGNLL 588
PSQFEAHAG R+ +NDDLC+ICADGG+LL
Sbjct: 641 PSQFEAHAGCASRRKPYSHIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGDLL 700
Query: 589 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 648
CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D++EQI
Sbjct: 701 CCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQI 759
Query: 649 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 708
TK IRIV L AE+ C+LCR DFS SGFGP+T+++CDQCE+E+HV CL++H M DL+
Sbjct: 760 TKCSIRIVDTLHAEVGVCVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLK 819
Query: 709 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRL 767
ELPK KWFCC +C+ I+ LQ L+ + LP+ + I +K +LE S DV+WRL
Sbjct: 820 ELPKDKWFCCKECNSIHYALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSINDVKWRL 879
Query: 768 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR-DLIPSMVYGRNLRGQEFGGMYCAI 826
LSGK +T ETR+ LS AV+IFHD FDPI DS + R DLIP+MVYGRN + Q+FGGM CAI
Sbjct: 880 LSGKNSTEETRVWLSGAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLCAI 939
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
L VNS VVSAG++R+FG+EVAELPLVATS GKGYFQ LF IE LL L VK +VLP
Sbjct: 940 LMVNSLVVSAGVIRIFGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLVLP 999
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 932
AAEEAESIWT KFGF+ I PE L Y+ QL+ F+GT+MLQK+V
Sbjct: 1000 AAEEAESIWTKKFGFQHITPEELKHYKDN-YQLMIFQGTAMLQKQV 1044
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 105/176 (59%), Gaps = 6/176 (3%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLR 256
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y+ K + GL+
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQ 404
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
G+I+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 405 GVIQGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAP 464
Query: 317 LPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 465 ADALEAAIRNATGAGDARKSTFCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 518
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 294/688 (42%), Positives = 400/688 (58%), Gaps = 101/688 (14%)
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA +R +YI E+G +L +V+ AC+ PL L+ L+ + + ++ S
Sbjct: 2 FELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPILKKSS----- 56
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
LC SC GP + +++ +C S
Sbjct: 57 -----------------LCLSC--------------------QGPMIEPCDTKSLVVCKS 79
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+K D ++ SS SV +++L+ R+ S P
Sbjct: 80 CLESKEPEFHNSPSKANDALNGSSRPSVDPKSILR--------RSKSSP----------- 120
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
QS R+ + T+KS + ++ + S S + S +T KD RLHKLVF++
Sbjct: 121 ----RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDI 176
Query: 522 LPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------------- 566
LPDGTEVGY+ G+K+L GYK G GI C CCN VSPS FEAHAG
Sbjct: 177 LPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTN 236
Query: 567 --------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
+++ +NDDLC+IC DGG L+ CD CPR++HK CASL S+P W
Sbjct: 237 GVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERW 296
Query: 613 YCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-SGCLLCRG 671
CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RCIRIV + EL S C+LCRG
Sbjct: 297 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRG 356
Query: 672 CDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 731
F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+ KWFC + C IN+ L NL
Sbjct: 357 HSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNL 416
Query: 732 LVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRLLLSQAVA 786
+V+ EKL LN ++K + E D D+RWR+LSGK ++ +T++LL++A++
Sbjct: 417 IVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGKLTSSDDTKILLAKALS 476
Query: 787 IFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 846
I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L V+ +VS GI RVFG E+
Sbjct: 477 ILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSEL 536
Query: 847 AELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP 906
AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKFGF K+
Sbjct: 537 AELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTD 596
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
E + YRK S ++ F GTSML+K VPA
Sbjct: 597 EEVKEYRKDYSVMI-FHGTSMLRKSVPA 623
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 514 bits (1323), Expect = e-142, Method: Compositional matrix adjust.
Identities = 304/767 (39%), Positives = 425/767 (55%), Gaps = 145/767 (18%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASG---LRGIIRDGGILCSCSLCNGCRVI 277
L P + E+F G+L+G++V Y+ G K + +G L+G+I+ G+LC C C G +V+
Sbjct: 363 LRNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVV 422
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPE-EKS 336
+ +E+HA +R +YI E+G +L +V+ AC+ P L+ L+ + P +KS
Sbjct: 423 STAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVG--PNLKKS 480
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ--GTMTYTTGIRIRASGPKLVSRSSE 394
C+ C+G C T +C SC++SK+P+ + + G +S P + +S
Sbjct: 481 SLCLNCQGPMIEPC--DTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSIL 538
Query: 395 NDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIA 454
+ S Q+N++E+ TR S+ PG++
Sbjct: 539 SRSKSSPRQSNRQEQP---------------------------------TRKSTEPGVVP 565
Query: 455 NSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHK 514
+ S S +S Q K+T+ KD RLHK
Sbjct: 566 GTILSESKSSSIKSNSQGKLTR-------------------------------KDVRLHK 594
Query: 515 LVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG-------- 566
LVF++ LPDGTEVGY+ G EVSPS FEAHAG
Sbjct: 595 LVFEDDILPDGTEVGYFVAG--------------------EVSPSSFEAHAGCASRRKPF 634
Query: 567 ---------------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
+++ +NDDLC+IC DG CASLS
Sbjct: 635 QHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG---------------VCASLS 679
Query: 606 SIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL-S 664
S+P W CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RCIR+V + EL S
Sbjct: 680 SLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPS 739
Query: 665 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
C+LCRG F + GF RT+++CDQCE+EFHVGCLK+H +ADL+ELP+ KWFC +DC +I
Sbjct: 740 VCVLCRGHSFCRLGFNSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEKI 799
Query: 725 NSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLLSGK-AATPETRL 779
N+ L NL+++ EKL LN I+ + E+ D D+RWR+LSGK ++ ET++
Sbjct: 800 NTTLGNLIIRGEEKLTNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGKLTSSDETKI 859
Query: 780 LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGIL 839
LL++AV+I H+ FDPI ++ + DLIP+MVYGR +GQ+F GMYC +L V+ +VS GI
Sbjct: 860 LLAKAVSILHERFDPISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGIF 919
Query: 840 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 899
RVFG E+AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLPAA+EA+SIWTDKF
Sbjct: 920 RVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKF 979
Query: 900 GFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTE 946
GF K+ E + YRK S ++ F GTSML+K VPA S + S E
Sbjct: 980 GFTKMTDEEVKEYRKDYSVMI-FHGTSMLRKSVPAPSAPSKTEGSKE 1025
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 256/493 (51%), Positives = 332/493 (67%), Gaps = 40/493 (8%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 368 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 420
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYP--------------------- 570
++LL G+K G I+C CCNSEVSPSQFEAH+G R+ P
Sbjct: 421 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 480
Query: 571 GKD-----NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
G++ ND++C+IC DGG LL CDGCPR FHKEC SL +IP+G W+CK+C N ++ +
Sbjct: 481 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 540
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 541 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 600
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 601 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 660
Query: 746 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
IKK G S + D++WRLL G +A+ E LLSQA+++FH+ F+PI D+ GRD
Sbjct: 661 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFHEQFNPIADA-EGRD 719
Query: 804 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
L+ MV+ + EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+GY
Sbjct: 720 LLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGY 779
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
FQ L+ CIE+LL FL+V S+VLPAAE AES+W +KF F K++ E L+ + R Q++TF+
Sbjct: 780 FQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQ 838
Query: 924 GTSMLQKRVPACR 936
GTSMLQK VP R
Sbjct: 839 GTSMLQKPVPEYR 851
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 65 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 119
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 120 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 173
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 174 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 211
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 212 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 271
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 272 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 330
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 331 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 385
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 261/493 (52%), Positives = 324/493 (65%), Gaps = 65/493 (13%)
Query: 477 KSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ- 535
+S K IS+ E+ S S +KS +T KD LHKLVF E+GLP+GTEVGYY GQ
Sbjct: 370 RSPKPSTISRSAESVSKCSSSGSKSYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQV 429
Query: 536 --KLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------------------------- 566
+LL GYK G GI C CCNSEVSPSQFEAHAG
Sbjct: 430 VTQLLVGYKRGSGIXCTCCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLS 489
Query: 567 --RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
R+ DNDDLC+IC DGGNLL CDGCPR FHKEC SL++IP+G W+CK+C NM +++
Sbjct: 490 RGREISVSDNDDLCSICLDGGNLLCCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKE 549
Query: 625 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTI 684
+F++H+ANAV AGRV+GVD +EQITKRCIRIV E+ GC LCR +FS+SGFGPRT+
Sbjct: 550 KFVEHNANAVAAGRVAGVDPIEQITKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTV 609
Query: 685 LLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHL 744
+LCDQCE+EFHVGCL++H M DL+E+PKGKWFCC DC RINS LQ L+V E+LP L
Sbjct: 610 MLCDQCEKEFHVGCLREHDMDDLKEVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVL 669
Query: 745 NAIK-KYAGNSLETVSDIDVRWRLLSG-KAATPETRLLLSQAVAIFHDCFDPIVDSISGR 802
IK KY N D D++WRL+ G +A++ E LLSQA++IFH+ FDPI D+ +GR
Sbjct: 670 TTIKEKYGRNGSACSKDPDIKWRLICGRRASSIEAGSLLSQALSIFHEQFDPIADA-AGR 728
Query: 803 DLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKG 862
DL+P MV+G S VVSA R+FG+EVAELPLVAT G+G
Sbjct: 729 DLLPDMVHG-------------------SQVVSAAAFRIFGKEVAELPLVATRSDCQGQG 769
Query: 863 YFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE--LLSIYRKRCSQLV 920
YFQ LF+C+E LL L V+S+VLPAAE AESIWT+KFGF K+ E ++ ++ Q
Sbjct: 770 YFQTLFSCLEGLLGVLEVRSLVLPAAEGAESIWTNKFGFNKVTQEQYIMDLFGIAAEQ-- 827
Query: 921 TFKGTSMLQKRVP 933
QKR+P
Sbjct: 828 -------FQKRLP 833
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 114/210 (54%), Gaps = 15/210 (7%)
Query: 193 IEVEAIAEGSALTSPKKNLEL--KMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+E+ A+ G T K++ L ++ ++ P + EL +TG+L+ + V Y+ G +
Sbjct: 180 MEISAVNNGEENTGTKRSSGLVPRVPRRF-----PAKLKELLDTGILEDLPVQYIRGSRT 234
Query: 251 QASG---LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLE 307
+ SG LRG+I+ GILCSC+ C G +V+ P+ FE+HA +R +YI ENG SL
Sbjct: 235 RGSGESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRG 294
Query: 308 VLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
V+ A ++ L L ++ A+ +KS C+ CKG +G + LC SC++ K
Sbjct: 295 VMNAWKNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNS--KVLCLSCLQLK 352
Query: 368 KPQGTMTYTTG---IRIRASGPKLVSRSSE 394
+ Q + + TG +R+ P +SRS+E
Sbjct: 353 ESQASPSQVTGSSDSHLRSPKPSTISRSAE 382
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 237/481 (49%), Positives = 317/481 (65%), Gaps = 36/481 (7%)
Query: 487 PFENASPPLSFP----NKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYK 542
P +A P +F + S +T KD LHKLVF LP+GT+VGYY G++LL+GY
Sbjct: 259 PTSSARVPKNFSPGAKSTSAGRLTRKDHGLHKLVFLSGILPEGTDVGYYVGGKRLLDGYI 318
Query: 543 NGLGIICHCCNSEVSPSQFEAHAGR-----------------------------QYPGKD 573
GI CHCCN+ VSPSQFE HAGR + +
Sbjct: 319 KEPGIHCHCCNTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHELSVSLSRGRKTSDRQ 378
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
+DDLC+IC+DGG LL CD CPRAFH+EC L+++P+G W C+YC+ +R+ L ++ NA
Sbjct: 379 SDDLCSICSDGGELLLCDTCPRAFHRECVDLTAVPKGTWCCRYCETRQQRESSLAYNHNA 438
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
+ AGR+ G+DS+EQI R IRI E GC LC+ DF K F RT+LLCDQC RE
Sbjct: 439 IAAGRIDGIDSMEQIFTRSIRIATTPETGFGGCALCKLHDFGKKKFSARTVLLCDQCGRE 498
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+HVGCLK+H MADL LP+G W+C DC RI+ +++LL AE +P + IKK +
Sbjct: 499 YHVGCLKEHSMADLTALPEGAWYCSSDCVRISETMKDLLSGGAEPVPAMDADLIKKKRED 558
Query: 754 -SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 812
L D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ + +GRDLIP+MVYGR
Sbjct: 559 KGLNEDGDLDVRWRVLRDKSSE-DSKLVLSKAVAIFHESFDPIIQTTTGRDLIPAMVYGR 617
Query: 813 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 872
++R Q++ GMYCA+LTV ++VVSAG+ R+ G+E AELPLVATS+ N G GYFQ LF CIE
Sbjct: 618 SVRDQDYTGMYCAVLTVGNTVVSAGLFRIMGREAAELPLVATSRDNQGFGYFQALFGCIE 677
Query: 873 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 932
+LL+ L+VK VLPAA+EA SIWT +FGF KI + L + + ++ F+GTS L K +
Sbjct: 678 RLLASLKVKYFVLPAADEAVSIWTQRFGFSKISRDEL-LEHLKGARTTVFQGTSTLHKLI 736
Query: 933 P 933
P
Sbjct: 737 P 737
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 308/827 (37%), Positives = 431/827 (52%), Gaps = 174/827 (21%)
Query: 154 SNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGN-------ESMSLIEVEAIA---EGSA 203
S G++KK + + KP LV Q N E L++V+ A E
Sbjct: 297 SQGVDKKAVN-DTVDKPLRRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDG 355
Query: 204 LTSPKKNLELKMSK-KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDG 262
SP K + K L P + ++F+ G+L+G+ V Y+
Sbjct: 356 FQSPSVTTPNKRGRPKKFLRNFPAKLKDIFDCGILEGLIVYYV----------------- 398
Query: 263 GILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKA 322
G +V+ P+ FE+HA +R +YI E+G +L +V+ AC+ PL L+
Sbjct: 399 ---------RGAKVVSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEE 449
Query: 323 TLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIR 382
L+ + + +KS C+ C+G C T +C SC++SK+P+
Sbjct: 450 KLRVVVGPIL-KKSSLCLSCQGPMIEPC--DTKSLVVCKSCLESKEPE------------ 494
Query: 383 ASGPKLVSRSSENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWE 442
+ N+ + D ++ SS SV +++L+
Sbjct: 495 -------------------FHNSPSKAN--------DALNGSSRPSVDPKSILR------ 521
Query: 443 LTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSR 502
R+ S P QS R+ + T+KS + ++ + S S + S
Sbjct: 522 --RSKSSP---------------RQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSH 564
Query: 503 WNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFE 562
+T KD RLHKLVF++ LPDGTEVGY+ G EVSPS FE
Sbjct: 565 GKLTRKDLRLHKLVFEDDILPDGTEVGYFVAG--------------------EVSPSTFE 604
Query: 563 AHAG-----------------------------RQYPGKDNDDLCTICADGGNLLPCDGC 593
AHAG +++ +NDDLC+IC DG
Sbjct: 605 AHAGCASRRKPFQHIYTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDG--------- 655
Query: 594 PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCI 653
CASL S+P W CKYC NM ER++F+ + NA+ AGRV GVD++ +IT RCI
Sbjct: 656 ------VCASLPSLPSERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCI 709
Query: 654 RIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPK 712
RIV + EL S C+LCRG F + GF RT+++CDQCE+EFHVGCLK+ +ADL+ELP+
Sbjct: 710 RIVSSFVTELPSVCVLCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPE 769
Query: 713 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----DVRWRLL 768
KWFC + C IN+ L NL+V+ EKL LN ++K + E D D+RWR+L
Sbjct: 770 EKWFCSLGCEEINTTLGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVL 829
Query: 769 SGK-AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL 827
SGK ++ +T++LL++A++I H+ FDPI +S + DLIP+MVYGR + Q+F GMYC +L
Sbjct: 830 SGKLTSSDDTKILLAKALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTML 889
Query: 828 TVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 887
V+ +VS GI RVFG E+AELPLVATSK G+GYFQ LFACIE+LL FL VK IVLPA
Sbjct: 890 AVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPA 949
Query: 888 AEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
A+EA+SIWTDKFGF K+ E + YRK S ++ F GTSML+K VPA
Sbjct: 950 ADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMI-FHGTSMLRKSVPA 995
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 232/468 (49%), Positives = 303/468 (64%), Gaps = 47/468 (10%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDGTE+ YY QKLL GYK G GI+C CC+ E+SPSQFEAHAG
Sbjct: 402 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYKQGSGIVCSCCSREISPSQFEAHAGM 461
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C+IC DGG+LL C GCP+AFH
Sbjct: 462 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 521
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 655
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 522 TACLKFQSVPEGTWYCSSCND------------GPISSKKATATDPSGNARPIVIRLSRV 569
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ + DL+E+P+ KW
Sbjct: 570 VKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGLCDLKEIPQEKW 629
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 774
FCC DCSRI++ +QN + + +P L+ I +K + T + V WR+LSGK+
Sbjct: 630 FCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKGIFTDNGDIVEWRILSGKSRY 689
Query: 775 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 834
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 690 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 748
Query: 835 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 894
SA +LR+FGQ+VAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 749 SAALLRIFGQQVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 808
Query: 895 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 942
WT KFGF K+ + L Y+K QL FKGTSML+K+VP S ST
Sbjct: 809 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKTTSLSEST 855
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 72/123 (58%), Gaps = 3/123 (2%)
Query: 213 LKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN 272
+KM KKI P V +L ETG+L+G V Y+ + L+GII GG LC C+ C+
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYISTPPVRE--LQGIIHSGGYLCGCTTCS 216
Query: 273 GCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALSSL 331
+V+ +FE+HA + R + +I ENG+++ +++ ++ P +L+ +++ A S+L
Sbjct: 217 FSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAGSAL 276
Query: 332 PEE 334
EE
Sbjct: 277 NEE 279
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 231/468 (49%), Positives = 302/468 (64%), Gaps = 47/468 (10%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHAG
Sbjct: 372 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 431
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C+IC DGG+LL C GCP+AFH
Sbjct: 432 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 491
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 655
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 492 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 539
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 540 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 599
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 774
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 600 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 659
Query: 775 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 834
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 660 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 718
Query: 835 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 894
SA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 719 SAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 778
Query: 895 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 942
WT KFGF K+ + L Y+K QL FKGTSML+K+VP G S +
Sbjct: 779 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSES 825
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 160 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 216
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 217 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 276
Query: 330 SLPEE 334
+L EE
Sbjct: 277 ALNEE 281
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 454 bits (1168), Expect = e-124, Method: Compositional matrix adjust.
Identities = 229/455 (50%), Positives = 294/455 (64%), Gaps = 52/455 (11%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR 567
+D+ +HK+VF L +GT+VGYY G+ VSPSQFEAHAGR
Sbjct: 443 RDKGMHKVVFMSGVLQEGTDVGYYVGGK--------------------VSPSQFEAHAGR 482
Query: 568 -----------------------------QYPGKDNDDLCTICADGGNLLPCDGCPRAFH 598
+ + +DDLC+IC+DGG LL CD CPRAFH
Sbjct: 483 AARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTCPRAFH 542
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 658
+EC SLSS P+G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI
Sbjct: 543 RECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATT 602
Query: 659 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 718
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 603 PETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCS 662
Query: 719 MDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPET 777
DC RIN LQ+LL E +P L+ I KK +D+DVRWR+L K++ ++
Sbjct: 663 TDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDKSSD-DS 721
Query: 778 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 837
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYCA+LTVN++VVSAG
Sbjct: 722 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVNNTVVSAG 781
Query: 838 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 897
+ R+ G E+AELPLVATS+ + G GYFQ LF+CIE+LL+ L VK VLPAAEEAESIWT+
Sbjct: 782 LFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEEAESIWTE 841
Query: 898 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 932
+FGF KI + L Y K + F+GTS L K V
Sbjct: 842 RFGFTKISQDELREYLK-GGRTTVFQGTSNLHKLV 875
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 57/93 (61%), Gaps = 2/93 (2%)
Query: 275 RVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEE 334
+V+ P FE+HA + S YI ENG +L +VLRAC +V L ML++ ++ A+ P++
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 335 KSFACVRCKGTFPITCVGKTGPGPLCNSCVKSK 367
++F C CK +F C GK C+SC++SK
Sbjct: 406 RTFRCKGCKSSFSTLCSGKF--ALFCDSCLESK 436
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 227/458 (49%), Positives = 303/458 (66%), Gaps = 44/458 (9%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH--- 564
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAH
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 565 AGRQYPGK---------------------------DNDDLCTICADGGNLLPCDGCPRAF 597
AGR+ P + D+DD+C+IC +GG+LL C GCP+AF
Sbjct: 449 AGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAF 508
Query: 598 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 657
H C S+P+G WYC C + + A A + +++ I R R+VK
Sbjct: 509 HTACLKFQSMPEGTWYCSSCNDGPTSCKI----ATASDP-------NLKPIVIRLTRVVK 557
Query: 658 NLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
E+E+ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++++ DL+ +P+ KWFC
Sbjct: 558 APESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFC 617
Query: 718 CMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPE 776
C DCSRI+ VLQ+ + +P L+ I +KY + + V WR+LSGK+ PE
Sbjct: 618 CSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPE 677
Query: 777 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 836
LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA
Sbjct: 678 HLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSA 736
Query: 837 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 896
+LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++LPAAEEAESIWT
Sbjct: 737 ALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWT 796
Query: 897 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 797 NKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 833
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQAW--KGSF 300
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 451 bits (1160), Expect = e-123, Method: Compositional matrix adjust.
Identities = 235/470 (50%), Positives = 299/470 (63%), Gaps = 51/470 (10%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
L S I +D LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+S
Sbjct: 388 LQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDS 447
Query: 555 EVSPSQFEAHAG---RQYPGK--------------------------DNDDLCTICADGG 585
EVSPSQFEAHAG R+ P + D+DD+CT+C DGG
Sbjct: 448 EVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGG 507
Query: 586 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 645
+L+ CDGCPRAFH C L +P+GDW C C F R +
Sbjct: 508 DLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------A 550
Query: 646 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 705
I + R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ +
Sbjct: 551 RPIRIQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLC 610
Query: 706 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDV 763
DL+ELPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D+
Sbjct: 611 DLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DI 669
Query: 764 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMY 823
+W +LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMY
Sbjct: 670 QWCILSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMY 728
Query: 824 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 883
C +L S+VVSAG++RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++
Sbjct: 729 CVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTL 788
Query: 884 VLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
VLPAAEEAE+IWT+K GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 789 VLPAAEEAEAIWTNKLGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 837
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 69/121 (57%), Gaps = 2/121 (1%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 184 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 241
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 242 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 301
Query: 330 S 330
S
Sbjct: 302 S 302
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 451 bits (1159), Expect = e-123, Method: Compositional matrix adjust.
Identities = 231/470 (49%), Positives = 303/470 (64%), Gaps = 47/470 (10%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC+ E+SPSQFEAHAG
Sbjct: 77 RDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGM 136
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C+IC DGG+LL C GCP+AFH
Sbjct: 137 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 196
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 655
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 197 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 244
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 245 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 304
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 774
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 305 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 364
Query: 775 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 834
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 365 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 423
Query: 835 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 894
SA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 424 SAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 483
Query: 895 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 944
WT KFGF K+ + L Y+K QL FKGTSML+K+VP G S + +
Sbjct: 484 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSESTT 532
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 451 bits (1159), Expect = e-123, Method: Compositional matrix adjust.
Identities = 235/470 (50%), Positives = 299/470 (63%), Gaps = 51/470 (10%)
Query: 495 LSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNS 554
L S I +D LH+L+F +GLPDG E+ YY GQ++L GYK G GI+C C+S
Sbjct: 443 LQHKKSSEGAIKKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDS 502
Query: 555 EVSPSQFEAHAG---RQYPGK--------------------------DNDDLCTICADGG 585
EVSPSQFEAHAG R+ P + D+DD+CT+C DGG
Sbjct: 503 EVSPSQFEAHAGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGG 562
Query: 586 NLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 645
+L+ CDGCPRAFH C L +P+GDW C C F R +
Sbjct: 563 DLILCDGCPRAFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----------------A 605
Query: 646 EQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 705
I + R VK E+E+ GC++CR DFS S F RT++LCDQCE+EFHVGCL+ +
Sbjct: 606 RPIRIQLTRAVKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLC 665
Query: 706 DLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDV 763
DL+ELPK KWFCC DCSR++ LQNL + E +P + I K ++ +D D+
Sbjct: 666 DLKELPKDKWFCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAAD-DI 724
Query: 764 RWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMY 823
+W +LSGK+ E LLS+ AIF +CFDPIV S SGRDLIP MVYGRN+ GQEFGGMY
Sbjct: 725 QWCILSGKSCYKEHLPLLSRTTAIFRECFDPIVAS-SGRDLIPVMVYGRNISGQEFGGMY 783
Query: 824 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 883
C +L S+VVSAG++RVFGQEVAELP+VATSK + GKG+F+ LF+CIE+LLS L VK++
Sbjct: 784 CVVLLAKSTVVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTL 843
Query: 884 VLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
VLPAAEEAE+IWT+K GF+K+ E + Y + QL FKGTSML+K VP
Sbjct: 844 VLPAAEEAEAIWTNKLGFQKMSEERMLKYTREL-QLTIFKGTSMLEKEVP 892
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 69/121 (57%), Gaps = 2/121 (1%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ P V +L TG+LDG V Y+ + + L+G+IR+ G LC CS
Sbjct: 239 NMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE--LQGVIRESGYLCGCS 296
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN +V+ +FE HA + R + +I ENGK + +++ ++ PL L +++
Sbjct: 297 ACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIAG 356
Query: 330 S 330
S
Sbjct: 357 S 357
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 232/485 (47%), Positives = 310/485 (63%), Gaps = 47/485 (9%)
Query: 484 ISKPFENASPPLS-----FPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLL 538
+S P E +P S + +D LH+L+F +GLPDG E+ Y+ GQ++L
Sbjct: 364 LSHPVERPNPNFSNAVLQHKKTAEKGTKRRDNDLHRLLFMPNGLPDGAELAYFVKGQRIL 423
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYPGK----------------------- 572
G+K G GI+C CN E+SPSQFEAHAG R+ P +
Sbjct: 424 GGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNGLTLHDIAISLASGQKL 483
Query: 573 ---DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH 629
D+DD+C C +GG+L+ CD CPRA+H C L ++P+G W C C++ +
Sbjct: 484 TTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWSCPNCRDK------VGS 537
Query: 630 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 689
++ A+ G +S + I R R+VK E E+ GC++CR DFS + F RT+LLCDQ
Sbjct: 538 NSKAISGGSLS---FSKPIVFRLTRVVKAPEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQ 594
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN-AIK 748
CEREFHVGCL+ + DL+ELPK KWFCC +CS I+ LQN ++ A+ +P+ + I+
Sbjct: 595 CEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVLNGAQIIPDSLSDLIIR 654
Query: 749 KYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPS 807
K+ G L + DVRW++LSGK+ PE LS+A AIF +CFDPIV + SGRDLIP
Sbjct: 655 KHVGKGLLVDEALNDVRWQILSGKSRFPEDLPFLSRATAIFRECFDPIV-AKSGRDLIPV 713
Query: 808 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
MVYGRN+ GQEFGGMYC +L V S VVSAG+LR+FG+EVAELP+VATS+ + GKGYFQ+L
Sbjct: 714 MVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAELPIVATSREHQGKGYFQVL 773
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 927
F+CIE+LLS L V+++VLPAAE+AESIWT K GF+K+ E L Y + QL F GTSM
Sbjct: 774 FSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQLIKYMREV-QLTIFNGTSM 832
Query: 928 LQKRV 932
L+K V
Sbjct: 833 LEKVV 837
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E AE S + +ELKMSKK+ N P V +L TG+LDG V Y+ L
Sbjct: 199 EGSAESSRYSLGPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK--L 256
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG +C CS CN ++ +FE HA + R + +I ENG+ + V++ +S
Sbjct: 257 QGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSA 316
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
PL +L + S SF
Sbjct: 317 PLSILDEVIMEVAGSSVNMNSF 338
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 235/470 (50%), Positives = 302/470 (64%), Gaps = 58/470 (12%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQ-------KLLEGYKNGLGIICHCCNSEVS 557
I D LHKL F E+ LP+G+EV YY G+ +LL G+K G I+C CCNSEVS
Sbjct: 383 IMGGDVGLHKLAFGENDLPEGSEVSYYVRGEVGTMRSKRLLSGHKKGCRILCDCCNSEVS 442
Query: 558 PSQFEAHAG---RQYP---------------------GKD-----NDDLCTICADGGNLL 588
PSQFEAH+G R+ P G++ ND++C+IC DGG LL
Sbjct: 443 PSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLRGREPSINTNDEICSICLDGGTLL 502
Query: 589 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 648
CDGCPR FHKEC SL +IP+G W+CK+C N ++ +F++ +ANAV AGR+ GVD +EQI
Sbjct: 503 CCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGKFVERNANAVAAGRMGGVDPIEQI 562
Query: 649 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 708
KRCIRIVK+ E GC LCR +FS SGFGP T+++CDQCE+EFHVGCLK H + DL+
Sbjct: 563 RKRCIRIVKSQTDEAGGCALCRRHEFSTSGFGPHTVMICDQCEKEFHVGCLKAHNIDDLK 622
Query: 709 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVS-DIDVRWRL 767
+PKGKWFCC DC INS L+ ++V+ E+LP+ L IKK G S + D++WRL
Sbjct: 623 AVPKGKWFCCRDCKDINSSLRKIVVRREEELPDDVLRIIKKRYGRKGSVCSGNPDIKWRL 682
Query: 768 LSGKAATP-ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 826
L G+ A+ E LLSQA+++FH+ F+PI D+ GRDL+ MV+ + EFGGMYCAI
Sbjct: 683 LHGRXASATEAGSLLSQALSLFHEQFNPIADA-EGRDLLLDMVHSNSTGELEFGGMYCAI 741
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
LTV VVSA RV G+EVAELPLVAT G+ V S+VLP
Sbjct: 742 LTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQ------------------VNSLVLP 783
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 936
AAE AES+W +KF F K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 784 AAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 832
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 88/162 (54%), Gaps = 6/162 (3%)
Query: 220 SLNKKPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRV 276
+L P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V
Sbjct: 208 TLKGFPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNV 267
Query: 277 IPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKS 336
+ P+ FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +
Sbjct: 268 VTPNVFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKST 327
Query: 337 FACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTG 378
F C CKG+ I+ VG + LC SCV K+ + TG
Sbjct: 328 F-CFNCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTG 366
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/508 (47%), Positives = 309/508 (60%), Gaps = 56/508 (11%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPNKSRWN-------ITPKDQRLHKL 515
K S Q K+ KS T L+ P N P N +D LH+L
Sbjct: 357 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNSGVLQKRSADGCTKRRDNDLHRL 416
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYPGK 572
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHAG R+ P +
Sbjct: 417 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYR 476
Query: 573 --------------------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSS 606
D+DD+C +C DGG+L+ C+GCPRAFH C L
Sbjct: 477 HIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQC 536
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
+P W C C + + N GR S + V I R R+ K E E+ GC
Sbjct: 537 VPDSGWQCLNC---------IDNAGN----GRESSI--VRPIMIRLTRVDKTPEVEMGGC 581
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI +
Sbjct: 582 VVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYA 641
Query: 727 VLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQA 784
LQN + AE +P F I+K+ L T + D++WR+LSGK+ PE LLS+A
Sbjct: 642 ALQNSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRA 701
Query: 785 VAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VN VVSAG+LR+FG+
Sbjct: 702 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFGR 760
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
VAELPLVATS+ + GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT K GF+K+
Sbjct: 761 NVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKM 820
Query: 905 DPELLSIYRKRCSQLVTFKGTSMLQKRV 932
+ LS + + QL F TSML+K V
Sbjct: 821 SEDQLSKHLREV-QLTLFNKTSMLEKTV 847
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 227 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 281
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 282 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIK 341
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 342 NVAGSSVNEESF 353
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 227/456 (49%), Positives = 302/456 (66%), Gaps = 42/456 (9%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDG E+ Y+ GQ++L G+K G GI+C CN E+SPSQFEAHAG
Sbjct: 332 RDNDLHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGM 391
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C C +GG+L+ CD CPRA+H
Sbjct: 392 AARRQPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYH 451
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 658
C L ++P+G W C C++ + ++ A+ G +S + I R R+VK
Sbjct: 452 TGCLHLQNVPEGVWSCPNCRDK------VGSNSKAISGGSLS---FSKPIVFRLTRVVKA 502
Query: 659 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 718
E E+ GC++CR DFS + F RT+LLCDQCEREFHVGCL+ + DL+ELPK KWFCC
Sbjct: 503 PEYEIGGCVVCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCC 562
Query: 719 MDCSRINSVLQNLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDI-DVRWRLLSGKAATPE 776
+CS I+ LQN ++ A+ +P+ + I+K+ G L + DVRW++LSGK+ PE
Sbjct: 563 DECSNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALNDVRWQILSGKSRFPE 622
Query: 777 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 836
LS+A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L V S VVSA
Sbjct: 623 DLPFLSRATAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSA 681
Query: 837 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 896
G+LR+FG+EVAELP+VATS+ + GKGYFQ+LF+CIE+LLS L V+++VLPAAE+AESIWT
Sbjct: 682 GLLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWT 741
Query: 897 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 932
K GF+K+ E L Y + QL F GTSML+K V
Sbjct: 742 KKLGFRKMSEEQLIKYMREV-QLTIFNGTSMLEKVV 776
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 242/508 (47%), Positives = 309/508 (60%), Gaps = 56/508 (11%)
Query: 464 KSSQSQRQRKI-TKKSKKTVLISKPFENASPPLSFPNKSRWN-------ITPKDQRLHKL 515
K S Q K+ KS T L+ P N P N +D LH+L
Sbjct: 360 KESLLQSNGKVQAHKSYSTKLVGMPHTNIRPSSYTSNTGVLQKRSADGCTKRRDNDLHRL 419
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYPGK 572
+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHAG R+ P +
Sbjct: 420 LFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYR 479
Query: 573 --------------------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSS 606
D+DD+C +C DGG+L+ C+GCPRAFH C L
Sbjct: 480 HIYTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQC 539
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
+P W C C++ NA GR S + V I R R+ K E E+ GC
Sbjct: 540 VPDSGWQCLNCRD------------NAGN-GRESSI--VRPIMIRLTRVDKTPEFEMGGC 584
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
++CR DFS + F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC DC+RI
Sbjct: 585 VVCREHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYV 644
Query: 727 VLQNLLVQEAEKLP-EFHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPETRLLLSQA 784
LQN + AE +P I+K+ L T + D++WR+LSGK+ PE LLS+A
Sbjct: 645 ALQNSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMNDIQWRILSGKSRYPEHLPLLSRA 704
Query: 785 VAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSAG+LR+FG+
Sbjct: 705 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGR 763
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
VAELPLVATS+ + GKGYFQ+LF+CIE+LLS L V+ +VLPAA +AESIWT K GF+K+
Sbjct: 764 NVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKM 823
Query: 905 DPELLSIYRKRCSQLVTFKGTSMLQKRV 932
+ LS + + QL F TSML+K V
Sbjct: 824 SEDQLSKHLREV-QLTLFNKTSMLEKTV 850
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 76/132 (57%), Gaps = 8/132 (6%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGIL 265
+N+ELKMSKK+ N P V +L TG+LDG V Y+ G ++ Q GII GG L
Sbjct: 230 RNMELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYNPGKVELQ-----GIIDGGGYL 284
Query: 266 CSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQ 325
C CS+CN RV+ +FE HA + R + +I ENG+ + +++ ++ PL +L ++
Sbjct: 285 CGCSMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIK 344
Query: 326 SALSSLPEEKSF 337
+ S E+SF
Sbjct: 345 NVAGSSVNEESF 356
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 224/458 (48%), Positives = 291/458 (63%), Gaps = 51/458 (11%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDG E+ YY GQKLL GYK G GI+C CC+ E+SPSQFEAHAG
Sbjct: 458 RDNDLHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGM 517
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C +C DGG+L+ C+GCPRAFH
Sbjct: 518 AARRQPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFH 577
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN 658
C L S+P+ W+C C++ +R I R R+ K
Sbjct: 578 AACLGLHSVPESGWHCLNCEDNTGDER------------------GARPIMIRLTRVDKE 619
Query: 659 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC 718
E E+ GC++CR DFS F RT+++CDQCE+E+HVGCL+ + +L ELPK KWFCC
Sbjct: 620 PEYEVGGCVVCRANDFSVDKFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCC 679
Query: 719 MDCSRINSVLQNLLVQEAEKLPE-FHLNAIKKYAGNSLETVSDI-DVRWRLLSGKAATPE 776
DC+RI LQN + A+ +P I+K+ L T D+ D++WR+LSGK+ E
Sbjct: 680 DDCNRIYVALQNSVSAGADTIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAE 739
Query: 777 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSA 836
LLS+A AIF +CFDPIV +ISGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVSA
Sbjct: 740 HLPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSA 798
Query: 837 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 896
G+LR+FG+ +AELPLVATS+ + GKGYFQ LF+CIE+LLS L V+ +VLPAA +AESIWT
Sbjct: 799 GLLRIFGRNIAELPLVATSREHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWT 858
Query: 897 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
K GF K+ + L+ + K QL F TS+L+K V A
Sbjct: 859 KKLGFHKMSEDQLTKHLKEV-QLTLFNKTSVLEKMVQA 895
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 72/127 (56%), Gaps = 12/127 (9%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM---GGIKFQASGLRGIIRDGGILC 266
N+ELKMSKK+ N P V +L TG+LDG +V Y+ G ++ L GII DGG LC
Sbjct: 240 NMELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYNPGKVE-----LDGIIGDGGYLC 294
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR----ACRSVPLPMLKA 322
CS+C+ RV+ +FE HA + R + +I ENGK + ++ A S P ++K
Sbjct: 295 GCSMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKN 354
Query: 323 TLQSALS 329
S+++
Sbjct: 355 VAGSSIN 361
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 241/527 (45%), Positives = 307/527 (58%), Gaps = 57/527 (10%)
Query: 439 SPWELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFP 498
SP L SS+ + P +S S+ +RQ+ + + +P LS P
Sbjct: 351 SPHSLGSYSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRP-----SSLSHP 405
Query: 499 NKSRWNITPK-DQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
K+ T + D LH+L+F +GLPDG E+ YY GQK+L GYK G GI+C CC+ E+S
Sbjct: 406 KKTNEGGTRRRDNDLHRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCDREIS 465
Query: 558 PSQFEAHAG---RQYPGKD--------------------------NDDLCTICADGGNLL 588
PSQFEAHAG R+ P + +DD+C C DGG+L+
Sbjct: 466 PSQFEAHAGMAARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLI 525
Query: 589 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 648
C+ CPRAFH C L +P W+C C F + I
Sbjct: 526 FCESCPRAFHLVCLGLKYVPSDVWHCPNCNKFGHGGNFSR------------------SI 567
Query: 649 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 708
R R+VK E E+ GC+ CR DFS F RT++LCDQCEREFHVGCL+ + + DL+
Sbjct: 568 VIRLTRVVKTPEYEVGGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLK 627
Query: 709 ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI-DVRWR 766
E+PK WFC DC+RI LQN + + +P LN I K+A L D +WR
Sbjct: 628 EIPKDNWFCSNDCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDFQWR 687
Query: 767 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAI 826
+L GK+ E LLS A AIF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +
Sbjct: 688 ILMGKSRYQEDLSLLSAAAAIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCVL 746
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
L V + VVSAG+LR+FG++VAELPLVATS+ + GKGYFQ LF+CIE+LL L V +VLP
Sbjct: 747 LLVKNVVVSAGLLRIFGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLP 806
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
AAEEAESIWT +FGF+K+ E LS Y + QL FKGTSML+K VP
Sbjct: 807 AAEEAESIWTRRFGFRKMTEEQLSQYTREL-QLTIFKGTSMLEKEVP 852
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 70/128 (54%), Gaps = 4/128 (3%)
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
N+ELKMSKK+ N P V +L TG+LDG V Y+ + L GII GG LC C
Sbjct: 201 NMELKMSKKVLPNTFPSNVKKLLSTGILDGARVKYISPQR----ELYGIIDGGGYLCGCP 256
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CN RV+ +FE+HA + R + +I ENGK + +++ ++ PL + ++ A
Sbjct: 257 SCNFSRVLTAYEFELHAGAKTRHPNNHIYLENGKPICSIIQELKAAPLGAVDEVIKDAAG 316
Query: 330 SLPEEKSF 337
S E+ F
Sbjct: 317 SSINEEFF 324
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 229/484 (47%), Positives = 295/484 (60%), Gaps = 39/484 (8%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
SK F PL+ S +T KD LHKLVF L DGTEV YY GQ+ ++GY
Sbjct: 537 SKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKD 594
Query: 545 LGIICHCCNSEVSPSQFEAHAG---RQYP--------------------------GKDND 575
I C+ CN VSPS FEAHAG R+ P ++ D
Sbjct: 595 QRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKDMELSERETD 654
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 635
DLC C GG++ PC CPR+FH C LS +P +WYC C N+ ++++ L + NA
Sbjct: 655 DLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKALAENKNAKA 713
Query: 636 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 695
AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++LCDQCE+E+H
Sbjct: 714 AGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVILCDQCEKEYH 772
Query: 696 VGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNS 754
VGCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I KK+
Sbjct: 773 VGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDIIRKKHEMKG 832
Query: 755 LETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR-- 812
L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLIP MV GR
Sbjct: 833 LSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRP 892
Query: 813 --NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFAC 870
+ GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G GYFQ LF+C
Sbjct: 893 KDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGLGYFQALFSC 952
Query: 871 IEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
IE++L L++K +LPAA+EAE IW +KFGF KI E Y + L F GTS L K
Sbjct: 953 IERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLN-GAHLTIFHGTSNLYK 1011
Query: 931 RVPA 934
+P+
Sbjct: 1012 AIPS 1015
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 76/151 (50%), Gaps = 2/151 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYM-GGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P + EL TGLL+G+ V Y+ K Q + L+G+I I C C CNG + +
Sbjct: 369 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAVLKGVITGCNIRCFCLSCNGSKDVCS 428
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFAC 339
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C
Sbjct: 429 YFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNC 488
Query: 340 VRCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
+ C + G LC C++ K+ Q
Sbjct: 489 LNCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 518
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 230/493 (46%), Positives = 298/493 (60%), Gaps = 39/493 (7%)
Query: 476 KKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ 535
K + + SK F PL+ S +T KD LHKLVF L DGTEV YY GQ
Sbjct: 537 KSNSSLIPSSKDFLLKKTPLNTKGGSAGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQ 594
Query: 536 KLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYP---------------------- 570
+ ++GY I C+ CN VSPS FEAHAG R+ P
Sbjct: 595 RKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISKD 654
Query: 571 ----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 626
++ DDLC C GG++ PC CPR+FH C LS +P +WYC C N+ ++++
Sbjct: 655 MELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQKEKA 713
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 686
L + NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+ S F RT++L
Sbjct: 714 LAENKNAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCKQKDFNNSVFDERTVIL 772
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
CDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++
Sbjct: 773 CDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDIDI 832
Query: 747 I-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SGRDLI
Sbjct: 833 IRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSGRDLI 892
Query: 806 PSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
P MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK G
Sbjct: 893 PEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDLQGL 952
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
GYFQ LF+CIE++L L++K +LPAA+EAE IW +KFGF KI E Y + L
Sbjct: 953 GYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQSDAYLN-GAHLTI 1011
Query: 922 FKGTSMLQKRVPA 934
F GTS L K +P+
Sbjct: 1012 FHGTSNLYKAIPS 1024
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 2/150 (1%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V Y+ +A L+G+I I C C CNG + +
Sbjct: 380 LTKHPSNIRELLNTGLLEGMPVRYIIPSSKKAV-LKGVITGCNIRCFCLSCNGSKDVCSY 438
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + +I NG SL +VLRAC S PL L+ T++S++ + + C+
Sbjct: 439 FFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIRSSIDPIAKRSYVNCL 498
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C + G LC C++ K+ Q
Sbjct: 499 NCNEHLSSSQTEIFG-SFLCQRCLEPKQHQ 527
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 222/459 (48%), Positives = 280/459 (61%), Gaps = 51/459 (11%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH 564
+ +D LH+L+F +GLPDGTE+ YY GQK+L GYK G GI+C CC E+SPSQFE+H
Sbjct: 424 VRKRDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESH 483
Query: 565 AG---RQYPGKD--------------------------NDDLCTICADGGNLLPCDGCPR 595
AG R+ P + DD+C C DGG+L+ C CPR
Sbjct: 484 AGMSARRQPYRHIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPR 543
Query: 596 AFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRI 655
AFH C L P+G W+C C L H N I R R+
Sbjct: 544 AFHAACLDLHDTPEGAWHCPNCNK-------LGHGGNFARP-----------IVIRLTRV 585
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK E ++ GC +CR DFS F RT++LCDQCE+EFHVGCL++ + DL+E+PK W
Sbjct: 586 VKTPEYDVGGCAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNW 645
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI--KKYAGNSLETVSDIDVRWRLLSGKAA 773
FCC DC+ I L+N + + +P LN I K L + DV+W++L GK+
Sbjct: 646 FCCQDCNNIYVALRNSVSTGVQTIPVSLLNTINRKHVEKGLLVDEAAYDVQWQILMGKSR 705
Query: 774 TPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSV 833
E LLS A AIF +CFDPIV + +GRDLIP MVYGRN+ GQEFGGMYC +LTV V
Sbjct: 706 NREDLSLLSGAAAIFRECFDPIV-AKTGRDLIPVMVYGRNISGQEFGGMYCVLLTVRHVV 764
Query: 834 VSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAES 893
VSAG+LR+FG+EVAELPLVAT++ + GKGYFQ LF+CIE+LL L V+ +VLPAAEEAES
Sbjct: 765 VSAGLLRIFGREVAELPLVATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAES 824
Query: 894 IWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRV 932
IWT +FGF+K+ L Y R QL FKGTSML+K V
Sbjct: 825 IWTRRFGFRKMSEGQLLKY-TREFQLTIFKGTSMLEKEV 862
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 202 ERYVELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 258
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS C+ +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 259 GCSSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 318
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 319 VAGSSINEEFF 329
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 427 bits (1098), Expect = e-116, Method: Compositional matrix adjust.
Identities = 223/466 (47%), Positives = 293/466 (62%), Gaps = 62/466 (13%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDGTE+ YY QKLL+GYK G GI+C CC++++SPSQFEAHAG
Sbjct: 389 RDNDLHRLLFLPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGM 448
Query: 567 --RQYPGK---------------------------DNDDLCTICADGGNLLPCDGCPRAF 597
R+ P + D+DD+C+IC +GG+LL C GCP+AF
Sbjct: 449 AGRRQPYRRIHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAF 508
Query: 598 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 657
H C S+P+G WYC C + G S + T +
Sbjct: 509 HTACLKFQSMPEGTWYCSSCND---------------------GPTSCKIATASWLYTYF 547
Query: 658 NLEAEL----SGCLLC----RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRE 709
NL A + S L R DFS F RT++LCDQCE+E+HVGCL+++++ DL+
Sbjct: 548 NLNANILVLHSAYSLSPISDRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKG 607
Query: 710 LPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLL 768
+P+ KWFCC DCSRI+ VLQ+ + +P L+ I +KY + + V WR+L
Sbjct: 608 IPQDKWFCCSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRML 667
Query: 769 SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILT 828
SGK+ PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L
Sbjct: 668 SGKSRYPEHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLM 726
Query: 829 VNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAA 888
VNS VVSA +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++LPAA
Sbjct: 727 VNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAA 786
Query: 889 EEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
EEAESIWT+KFGF K+ L Y++ QL FKGTSML+K+VP+
Sbjct: 787 EEAESIWTNKFGFTKMTEHRLQRYQREV-QLTIFKGTSMLEKKVPS 831
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 4/151 (2%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EG L +KM KKI V +L TG+LDG V Y+ A L
Sbjct: 154 EHTWEGYPSNVASSTLGVKMLKKIDSTNFLSNVKKLLGTGILDGARVKYLS--TSAAREL 211
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C+ C+ +V+ +FE HA + + + +I ENG+ + V++ R
Sbjct: 212 QGIIHSGGYLCGCTACDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIA 271
Query: 316 PLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
P +L+ ++ S E+ F KG+F
Sbjct: 272 PPDVLEEVIRKVAGSALSEEGFQA--WKGSF 300
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/458 (48%), Positives = 295/458 (64%), Gaps = 40/458 (8%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAH--- 564
+D LH+L+F +GLPDGTE+ Y+ QKLL+GYK G GI+C CC++E+SPSQFEAH
Sbjct: 376 RDNDLHRLLFLPNGLPDGTELAYFVKSQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGM 435
Query: 565 AGRQYPGK---------------------------DNDDLCTICADGGNLLPCDGCPRAF 597
AGR+ P + D+DD+C+IC DGG+LL C GCP+AF
Sbjct: 436 AGRRQPYRHIHISSGLSLHDIAMSLADGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAF 495
Query: 598 HKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC-IRIV 656
H C S+P+G WYC C + + A A + S V S+ + IR++
Sbjct: 496 HTACLKFQSMPEGTWYCSSCNDGPTSCK----TATATDPNLKSIVGSIAIFSLSAHIRVL 551
Query: 657 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 716
+ A + R DFS F RT++LCDQCE+E+HVGCL+++ + DL+ +P+ KWF
Sbjct: 552 HS--AYCFSPISDRSLDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLCDLKGIPQDKWF 609
Query: 717 CCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATP 775
CC DCSRI++ LQ+ + +P L+ I +KY + + +V WR+LSGK+
Sbjct: 610 CCSDCSRIHTALQSSASCGPQTIPTVLLDTISRKYREKGICIDNGDNVEWRMLSGKSRYA 669
Query: 776 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVS 835
E LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VVS
Sbjct: 670 EHLPLLSRAATIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVS 728
Query: 836 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIW 895
A +LR+FGQ+VAELP+VATS+ G+GYFQ LFAC+E LLS L V++++LPAAEEAESIW
Sbjct: 729 AALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIW 788
Query: 896 TDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
T KFGF K+ L Y++ QL FKGTSML+K+VP
Sbjct: 789 TKKFGFTKMTEHQLQKYQREV-QLTIFKGTSMLEKKVP 825
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 70/142 (49%), Gaps = 2/142 (1%)
Query: 196 EAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGL 255
E EGS L +KM KI P V +L TG+LDG V Y+ A L
Sbjct: 147 EHTWEGSPSNVASSTLGVKMLDKIDSTNFPSNVKKLLATGILDGARVKYLS--ISPAREL 204
Query: 256 RGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSV 315
+GII GG LC C++C+ +V+ +FE HA + + + +I ENG+ + +++ R
Sbjct: 205 QGIIHSGGYLCGCTVCDFSKVLGAYEFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVA 264
Query: 316 PLPMLKATLQSALSSLPEEKSF 337
P +L+ ++ S E+ F
Sbjct: 265 PPDVLEEVIRKVAGSALSEEGF 286
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 222/467 (47%), Positives = 291/467 (62%), Gaps = 39/467 (8%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD RLHKLVF+ L DGTEV YY GQ+ ++GY I C+ CN VSPS
Sbjct: 386 SAGKVTNKDNRLHKLVFNV--LLDGTEVAYYVDGQRKVDGYIKDHRIYCNHCNRVVSPSA 443
Query: 561 FEAHAG---RQYP--------------------------GKDNDDLCTICADGGNLLPCD 591
FEAHAG R+ P ++ DDLC C GG++ PC
Sbjct: 444 FEAHAGEGSRRKPYDNIFTSNGVSLHELAMKISKDMELSERETDDLCRECGQGGDIFPCK 503
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 651
CPR+FH C LS +P +WYC C+N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 504 ICPRSFHPACVGLSKVP-AEWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKR 562
Query: 652 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 711
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP
Sbjct: 563 AIRIVP-ISDDLGGCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELP 621
Query: 712 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 770
+ +WFCC CS S L ++ A+ L + L I KK+ L + D++W+LLSG
Sbjct: 622 EEEWFCCSSCSETRSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSG 681
Query: 771 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 826
K AT + +LLS AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+
Sbjct: 682 KRATEDGSILLSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCAL 741
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
LTV S+VVSA +LRV G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLP
Sbjct: 742 LTVGSTVVSAALLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLP 801
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
AA EAE IW +KFGF +I PE L Y + L F GTS + K VP
Sbjct: 802 AAHEAEGIWMNKFGFSRISPEELEAYLNG-AHLTIFHGTSYMYKAVP 847
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 78/154 (50%), Gaps = 5/154 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TG+L+G+ V+Y+ +A L+G+I I C C CNG + I
Sbjct: 207 LTKHPGNIRELLNTGMLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGAKAISAY 265
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + C+
Sbjct: 266 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIRSSIDPVVKRSRINCL 325
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
C + + LC C++SK+PQ +T
Sbjct: 326 NCNELV----LPSSHENVLCQVCLESKQPQDPLT 355
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 217/468 (46%), Positives = 287/468 (61%), Gaps = 67/468 (14%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LH+L+F +GLPDGTE+ YY +++SPSQFEAHAG
Sbjct: 388 RDNDLHRLLFMPNGLPDGTELAYYV--------------------KTQISPSQFEAHAGM 427
Query: 567 --RQYPGK--------------------------DNDDLCTICADGGNLLPCDGCPRAFH 598
R+ P + D+DD+C+IC DGG+LL C GCP+AFH
Sbjct: 428 AARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFH 487
Query: 599 KECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD---SVEQITKRCIRI 655
C S+P+G WYC C + + + + + D + I R R+
Sbjct: 488 TACLKFQSMPEGTWYCSSCND------------GPISSKKATTTDPSGNARPIVIRLSRV 535
Query: 656 VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW 715
VK E+++ GC+ CR DFS F RT++LCDQCE+E+HVGCL+++ DL+E+P+ KW
Sbjct: 536 VKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKW 595
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAAT 774
FCC +CSRI++ +QN + + LP L+ I +K + T V WR+LSGK+
Sbjct: 596 FCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRY 655
Query: 775 PETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVV 834
PE LLS+A IF +CFDPIV + SGRDLIP MVYGRN+ GQEFGGMYC +L VNS VV
Sbjct: 656 PEHLPLLSRAAVIFRECFDPIV-AKSGRDLIPVMVYGRNISGQEFGGMYCLVLIVNSLVV 714
Query: 835 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 894
SA +LR+FGQEVAELP+VATS+ G+GYFQ L+AC+E LLS L V+++VLPAAEEAESI
Sbjct: 715 SAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESI 774
Query: 895 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSST 942
WT KFGF K+ + L Y+K QL FKGTSML+K+VP G S +
Sbjct: 775 WTKKFGFTKMSDQQLQEYQKEV-QLTIFKGTSMLEKKVPKATTGLSES 821
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 6/125 (4%)
Query: 213 LKMSKK--ISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+KM KK +SL+ P V +L ETG+L+G V Y+ + L GII GG LC C+
Sbjct: 151 VKMPKKKIVSLSY-PSNVKKLLETGILEGARVKYISTPPVRQ--LLGIIHSGGYLCGCTT 207
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS-ALS 329
CN +V+ +FE HA + R + +I EN +++ +++ ++ P +L+ +++ A S
Sbjct: 208 CNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGS 267
Query: 330 SLPEE 334
+L EE
Sbjct: 268 ALNEE 272
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 219/472 (46%), Positives = 291/472 (61%), Gaps = 39/472 (8%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQ+ ++GY I C+ C+ VSPS
Sbjct: 723 SSGKVTTKDTGLHKLVF--KVLLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCSRVVSPSA 780
Query: 561 FEAHAG---RQYP--------------------------GKDNDDLCTICADGGNLLPCD 591
FEAHAG R+ P ++ DDLC C GG++ PC
Sbjct: 781 FEAHAGEGSRRKPYDNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGLGGDIFPCK 840
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 651
CPR+FH C LS P +W+C C N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 841 MCPRSFHPACVRLSEFPS-EWFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKR 899
Query: 652 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 711
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP
Sbjct: 900 AIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELP 958
Query: 712 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 770
G+WFCC CS I S L ++ A+ L L I KK+ L +DID+RW+LL+G
Sbjct: 959 DGEWFCCSSCSEIRSCLDKMISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAG 1018
Query: 771 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 826
++AT + LLLS AV I H FDPI+++ +GRDLIP MV GR + GQ++ GMYCA+
Sbjct: 1019 RSATEDGSLLLSSAVPIIHQSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAV 1078
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
+T+ S+VVSA +LR+ G +VAELPLVATS G GYFQ+LF+C+E++L L++K +LP
Sbjct: 1079 ITLGSTVVSAALLRIMGGDVAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLP 1138
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIG 938
AA+EAE+IW KFGF +I E L Y + L F GTS L K VP+ G
Sbjct: 1139 AAQEAEAIWMKKFGFSRIPQEQLEAYLNG-AHLTVFHGTSNLYKAVPSPSPG 1189
Score = 82.8 bits (203), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 77/150 (51%), Gaps = 5/150 (3%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
K P V EL +TGLL+G+ V+Y+ +A ++G+I I C C CNG R +
Sbjct: 544 FTKHPGNVKELLQTGLLEGMPVMYIIPNSKKAV-VKGVITGCNIRCFCIKCNGSRALSTY 602
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA + +++I NG SL +VLRAC L L+ T +S++ + C+
Sbjct: 603 FFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPMVIRSRPNCL 662
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
C G P + LC+ C+ SK+PQ
Sbjct: 663 NCGGHLPSSETEHF----LCHCCLDSKQPQ 688
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 407 bits (1047), Expect = e-110, Method: Compositional matrix adjust.
Identities = 214/467 (45%), Positives = 285/467 (61%), Gaps = 38/467 (8%)
Query: 501 SRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQ 560
S +T KD LHKLVF L DGTEV YY GQK ++GY I C+ CN VSPS
Sbjct: 697 STGKVTTKDTGLHKLVF--KVLLDGTEVNYYVDGQKKIDGYIKDQRIYCNHCNKVVSPSA 754
Query: 561 FEAHAGR-----------------------------QYPGKDNDDLCTICADGGNLLPCD 591
FEAHAG Q ++ DDLC C GG++ PC
Sbjct: 755 FEAHAGEGSRRKPYDNIFTSNGVSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCR 814
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 651
CPR+FH C L +P +W+C C + ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 815 MCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKR 874
Query: 652 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 711
IRIV + +L GC LC+ DF+ + F RT++LCDQCE+E+HVGCL+ DL+ELP
Sbjct: 875 AIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELP 933
Query: 712 KGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSG 770
+G+WFCC CS I S L ++ + A L E ++ I KK+ L ++ ++RW+L++G
Sbjct: 934 EGEWFCCDSCSEIRSSLDKMISEGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAG 993
Query: 771 KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR----NLRGQEFGGMYCAI 826
++AT + LLS AV + H FDPI+++ +GRDLIP MV+GR + GQ++ GMYCA+
Sbjct: 994 RSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAV 1053
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
LTV S+VVSA +LRV G +VAELPLVATS G GYFQ+LF+CIE+LL L+VK +LP
Sbjct: 1054 LTVGSTVVSAALLRVMGGDVAELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLP 1113
Query: 887 AAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
AA EAE+IW KFGF KI + + Y L F GT L K +P
Sbjct: 1114 AAHEAEAIWMKKFGFSKIPQDQMEAYLNG-GHLTVFHGTLNLYKAIP 1159
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 103/209 (49%), Gaps = 21/209 (10%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEV-EAIAEGSALTSPK-KNLELKMSKKIS-LN 222
S +KPKVE + SL+ V E + + T P +E+KMSKK++ L+
Sbjct: 472 SLLKPKVE------------APPASSLVVVPEEPVDSTPETPPSVTKMEMKMSKKVAFLS 519
Query: 223 KKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKF 282
K P +L TGLL+G+ V+Y+ + L+G+I I C C C+G + I F
Sbjct: 520 KHPGNTRDLLSTGLLEGMPVMYIIP-NSKKPVLKGVIAGCNIRCFCVKCDGSKTITTYFF 578
Query: 283 EIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
E+HA + ++YI NG L +VLRAC S PL L T+QS + + C+ C
Sbjct: 579 ELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCLNC 638
Query: 343 KGTFPITCVGKTGPGPLCNS-CVKSKKPQ 370
G P +T LC+ C +S +PQ
Sbjct: 639 NGELP----SQTEEQFLCHDCCPESNQPQ 663
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 224/538 (41%), Positives = 299/538 (55%), Gaps = 101/538 (18%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 301 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 358
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR----- 567
HKL F LP+GT+VGYY G++LL+GY GI CHCCN+ VSPSQFEAHAGR
Sbjct: 359 HKLAFMSGVLPEGTDVGYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARRK 418
Query: 568 ------------------------QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
+ +DDLC+IC+DGG LL CD CPRAFH+EC
Sbjct: 419 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 478
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 663
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 479 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 538
Query: 664 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 696
GC LC DFSK F RT+LLCDQ
Sbjct: 539 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 592
Query: 697 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 755
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 593 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 639
Query: 756 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 815
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 640 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 694
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL
Sbjct: 695 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLL 739
Query: 876 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 740 ASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLK-GGRTTVFQGTSTLHKLVP 796
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 144 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 202
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 203 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 262
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 263 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 316
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 317 RRSKNESPGSKYCNS 331
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 197/412 (47%), Positives = 254/412 (61%), Gaps = 44/412 (10%)
Query: 535 QKLLEGYKNGLGIICHC--CN-SEVSPSQFEAHAGR------------------------ 567
+ +L+G G I C C CN S+VSPS FEAHAG
Sbjct: 326 KAVLKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMK 385
Query: 568 -----QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
Q ++ DDLC C GG++ PC CPR+FH C LS +P +WYC C N+ +
Sbjct: 386 ISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVP-SEWYCDNCSNLVQ 444
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
+++ L + NA AGR +GVDS+EQI KR IRIV NL EL DF+ S F R
Sbjct: 445 KEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELG------QKDFNNSVFDER 498
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
T++LCDQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ A L E
Sbjct: 499 TVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAES 558
Query: 743 HLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 801
++ I KK+ L ++ D+RWRLL+G+ A+ + LLLS AV I H FDPI++ SG
Sbjct: 559 DIDIIRKKHEMKGLSMDTNTDLRWRLLAGRKASEDGDLLLSAAVPIIHQSFDPIIEVQSG 618
Query: 802 RDLIPSMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 857
RDLIP MV GR + GQ++ GMYCA+LT+ +SVVSA +LRV G EVAELPLVATSK
Sbjct: 619 RDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKD 678
Query: 858 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 909
G GYFQ LF+CIE++L L++K +LPAA+EAE IW +KFGF KI E L
Sbjct: 679 LQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQL 730
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 49/119 (41%), Gaps = 22/119 (18%)
Query: 248 IKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICF-ENGKSLL 306
+K Q + L+G+I I C C CNG +V PS FE HA + RR F NG SL
Sbjct: 322 LKLQKAVLKGVIAGCNIRCFCLSCNGSKV-SPSAFEAHAGEGTRRKPYDNIFTSNGVSLH 380
Query: 307 EVLRACRSVPLPMLKATLQSALSSLPEE----------KSFACVRCKGTFPITCVGKTG 355
E+ +K + LS + F C C +F CVG +G
Sbjct: 381 EL----------SMKISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSG 429
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 369 bits (946), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 182/336 (54%), Positives = 233/336 (69%), Gaps = 4/336 (1%)
Query: 600 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 660 EAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM 719
E GC LC+ DFSK F RT+LLCDQC RE+HVGCLK+H MADL LP+G W+C
Sbjct: 64 ETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCST 123
Query: 720 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKY--AGNSLETVSDIDVRWRLLSGKAATPET 777
DC RIN LQ+LL E + L+ IKK + +D+DVRWR+L K+ + ++
Sbjct: 124 DCVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDKS-SDDS 182
Query: 778 RLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAG 837
+L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYGR+ R Q++ GMYC +LTVN+ VVSAG
Sbjct: 183 KLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSAG 242
Query: 838 ILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD 897
+ R+ G E+AELPLVATS+ G GYFQ LF+CIE+LLS L VK VLPAAEEAESIWT+
Sbjct: 243 LFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWTE 302
Query: 898 KFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
+FGF KI + L Y K + F+GTS L K V
Sbjct: 303 RFGFAKISQDELREYLKG-GRTTVFQGTSNLHKLVA 337
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 360 bits (923), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 213/538 (39%), Positives = 285/538 (52%), Gaps = 121/538 (22%)
Query: 455 NSTPVTSVHK--SSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRL 512
NST + + + +S ++R + + SK ++ +NA P + + R IT KD+ L
Sbjct: 300 NSTRTSKIGRNPTSSARRSKNESPGSKYCNSSARGSKNAFPGVKTTSTGR--ITRKDKGL 357
Query: 513 HKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR----- 567
HKL F LP+GT+VGYY G+ VSPSQFEAHAGR
Sbjct: 358 HKLAFMSGVLPEGTDVGYYVGGK--------------------VSPSQFEAHAGRAARRK 397
Query: 568 ------------------------QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
+ +DDLC+IC+DGG LL CD CPRAFH+EC
Sbjct: 398 PYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVG 457
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 663
++IP+G W C+YC+N +R+ L ++ NA+ AGR+ G+D +EQI R IRI
Sbjct: 458 FTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGF 517
Query: 664 SGCLLCRGC---------------------------DFSKSGFGPRTILLCDQCEREFHV 696
GC LC DFSK F RT+LLCDQ
Sbjct: 518 GGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCDQA------ 571
Query: 697 GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSL 755
LP+G W+C DC RI+ L++LL + AE + + IK KY +L
Sbjct: 572 -------------LPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKAL 618
Query: 756 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 815
D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLIP+MVYG
Sbjct: 619 NKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG---- 673
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
++VVSAG+ RV G E+AELPLVATS+ + G GYFQ LF CIE+LL
Sbjct: 674 ---------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIERLL 718
Query: 876 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GTS L K VP
Sbjct: 719 ASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLK-GGRTTVFQGTSTLHKLVP 775
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 211 LELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSL 270
+ELKMSKKIS + P + +L TGLL+G V Y+ K + + LRG+I+ GILCSCS
Sbjct: 143 MELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMR-KGKRAVLRGVIKRVGILCSCSS 201
Query: 271 CNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSS 330
C G V+ P FE+HA + S YI ENG +L ++LRAC L ML++ +Q+A+
Sbjct: 202 CKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAIQNAIGP 261
Query: 331 LPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVS 390
P++++F C CK +F GK LC+SC++SK Q + T T+ I P +
Sbjct: 262 APKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-TRTSKI---GRNPTSSA 315
Query: 391 RSSENDSMCISYQNN 405
R S+N+S Y N+
Sbjct: 316 RRSKNESPGSKYCNS 330
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 177/371 (47%), Positives = 243/371 (65%), Gaps = 7/371 (1%)
Query: 568 QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL 627
Q ++ DDLC C GG++ PC CPR+FH C L +P +W+C C + ++++ L
Sbjct: 16 QLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILVQKEKAL 75
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
+ NA AGR +GVDS+EQI KR IRIV + +L GC LC+ DF+ + F RT++LC
Sbjct: 76 AANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKDFNNAVFDERTVILC 134
Query: 688 DQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI 747
DQCE+E+HVGCL+ DL+ELP+G+WFCC CS I S L ++ A L E ++ I
Sbjct: 135 DQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISGGAHPLSESDVDII 194
Query: 748 -KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 806
KK+ L ++ ++RW+L++G++AT + LLS AV + H FDPI+++ +GRDLIP
Sbjct: 195 RKKHESKGLVMDANTEIRWQLVAGRSATEDGNSLLSSAVPVIHQSFDPIIEAHTGRDLIP 254
Query: 807 SMVYGR----NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKG 862
MV+GR + GQ++ GMYCA+LTV S+VVSA +LRV G +VAELPLVATS G G
Sbjct: 255 EMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVATSMDLQGLG 314
Query: 863 YFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTF 922
YFQ+LF+CIE+LL L+VK +LPAA EAE+IW KFGF KI + + Y L F
Sbjct: 315 YFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYLNG-GHLTVF 373
Query: 923 KGTSMLQKRVP 933
GT L K +P
Sbjct: 374 HGTLNLYKAIP 384
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 198/486 (40%), Positives = 277/486 (56%), Gaps = 49/486 (10%)
Query: 505 ITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKN--GLGIICHCC---------- 552
I +DQ LHKLVF E+ L DG VGY+ +K L+G N GI+C CC
Sbjct: 83 INYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQLQGEINIKQSGILCDCCKEVILEFFFC 142
Query: 553 -----NSEVSPSQFEAHAG---RQYPG----------------------KDNDDLCTICA 582
+ +VSPS+FEAHAG R+ P ++D+ C+ C
Sbjct: 143 QIKKLDEQVSPSKFEAHAGWASRRKPYFHIRTTDGVSLHQLAINHRISISNSDEHCSKCK 202
Query: 583 DGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGV 642
GNLL CDGC RAFH C + S P+ WYC+YC+N ++ + ++H N V ++
Sbjct: 203 QRGNLLCCDGCQRAFHLGCIPVESPPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIES 262
Query: 643 DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKH 702
D EQI K C VK+ E E S C LC F+ F P T+++CDQCE+++HVGCLK H
Sbjct: 263 DPSEQIAKICTLSVKHKEVEHSSCALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLKDH 322
Query: 703 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYAGNSLETVSDI 761
MA+L+++PK WFC +DC I+ L+N + + L + L+ IK K LET +
Sbjct: 323 NMANLKKVPKHYWFCGVDCYDIHMKLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGL 382
Query: 762 DVRWRLLSGKAATPE--TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-E 818
D++W++ + + + T LLS V IFH+ FD IV + + DLIP+MV GR ++ +
Sbjct: 383 DIKWKVFNRQLIVSKIITSSLLSDVVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKDKYY 442
Query: 819 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 878
FGGMYCA+L VN VVSAGI RVFG+EVAEL L+AT +G+F+ L +CIE +L L
Sbjct: 443 FGGMYCAVLIVNQVVVSAGIFRVFGKEVAELSLIATKAEYQKQGFFKCLLSCIENVLKEL 502
Query: 879 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGT---SMLQKRVPAC 935
+V+ +VLPAA EAES+W DKFGF + + L Y +R KG + L ++ C
Sbjct: 503 KVERLVLPAAHEAESMWIDKFGFTEPNQGLGRRYYRRSWSFHLNKGVEQNTHLTGKLEFC 562
Query: 936 RIGSSS 941
+ +SS
Sbjct: 563 QKCASS 568
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 343 bits (879), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 172/346 (49%), Positives = 227/346 (65%), Gaps = 38/346 (10%)
Query: 475 TKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACG 534
+ +S KT +SK + S K+ +T KD LHKL F E+ LP+G+EV YY G
Sbjct: 430 SDRSTKTTTVSKCSSSGS-------KNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRG 482
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYP--------------------- 570
++LL G+K G I+C CCNSEVSPSQFEAH+G R+ P
Sbjct: 483 ERLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 542
Query: 571 GKD-----NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
G++ ND++C+IC DGG LL CDGCPR FHKEC SL +IP+G W+CK+C N ++ +
Sbjct: 543 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 602
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
F++ +ANAV AGR+ GVD +EQI KRCIRIVKN E GC LCR +FS SGFGP T++
Sbjct: 603 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 662
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
+CDQCE+EFHVGCLK H + DL+ +PKGKWFCC DC INS L+ ++V++ E+LP+ L
Sbjct: 663 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 722
Query: 746 AIKKYAGNSLETVS-DIDVRWRLLSG-KAATPETRLLLSQAVAIFH 789
IKK G S + D++WRLL G +A+ E LLSQA+++FH
Sbjct: 723 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAGSLLSQALSLFH 768
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 164/357 (45%), Gaps = 49/357 (13%)
Query: 54 VYSRVKRSRFSNSDDLLEDDVIDKRINSKIHEGRINKVVKNVLNENGILESVVEEENQLV 113
+ R K S + + +E+ + R+ S + NKVV++ E G + + +
Sbjct: 127 IKKRQKSSSLDSQKNNVEERFPEDRVRSNDGKSMDNKVVRSGQGEQG-----NDSTDNPM 181
Query: 114 QMTVENVIEETVKGKKAPICKEE----PIS------KVECFPRKEGGSEVSNGLNKKCLK 163
Q++ +N E++ G P +EE P S K P +G + + N++ +
Sbjct: 182 QISRDN---ESMSG---PAEEEELDYLPTSTLREGVKTSRTPSVDGLKKAPSSQNQRRVS 235
Query: 164 RPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNK 223
R + +KPK +++ V + E + GS+ P +L
Sbjct: 236 RVT-LKPKANAMKISVVNNG----------EKNVVKMGSSALVPS-----------TLKG 273
Query: 224 KPMTVTELFETGLLDGVSVVYMGGIKFQA---SGLRGIIRDGGILCSCSLCNGCRVIPPS 280
P + EL +TG+L+ + V Y+ G++ + SGL G+I+ GILC C C G V+ P+
Sbjct: 274 FPTKLKELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPN 333
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE+HA +R +YI ENG +L V+ AC L L ++ A+ S ++ +F C
Sbjct: 334 VFELHAGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSSIKKSTF-CF 392
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDS 397
CKG+ I+ VG + LC SCV K+ + TG R++ VS+ S + S
Sbjct: 393 NCKGS--ISEVGTSDSLVLCESCVGLKESHASPAQPTGTSDRSTKTTTVSKCSSSGS 447
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 174/368 (47%), Positives = 229/368 (62%), Gaps = 41/368 (11%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 626
R + +DDLC+IC+DGG LL CD CPRAFH+EC ++IP+G W C+YC+N +R+
Sbjct: 234 RNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESS 293
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 686
L ++ NA+ AGR+ G+D +EQI R IRI GC LCR DFSK F RT+LL
Sbjct: 294 LAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLL 353
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
CDQ LP+G W+C DC RI+ L++LL + AE + +
Sbjct: 354 CDQ-------------------ALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEI 394
Query: 747 IK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
IK KY +L D+DVRWR+L K++ +++L+LS+AVAIFH+ FDPI+ +GRDLI
Sbjct: 395 IKRKYEQKALNKDGDLDVRWRVLKDKSSA-DSKLVLSKAVAIFHESFDPIIQIATGRDLI 453
Query: 806 PSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQ 865
P+MVYG ++VVSAG+ RV G E+AELPLVATS+ + G GYFQ
Sbjct: 454 PAMVYG-------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQ 494
Query: 866 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGT 925
LF CIE+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y K + F+GT
Sbjct: 495 ALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTTVFQGT 553
Query: 926 SMLQKRVP 933
S L K VP
Sbjct: 554 STLHKLVP 561
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 84/143 (58%), Gaps = 6/143 (4%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
LRG+I+ GILCSCS C G V+ P FE+HA + S YI ENG +L ++LRAC
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSD 134
Query: 315 VPLPMLKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMT 374
L ML++ +Q+A+ P++++F C CK +F GK LC+SC++SK Q + T
Sbjct: 135 ATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKF--ALLCDSCLESKGSQNS-T 191
Query: 375 YTTGIRIRASGPKLVSRSSENDS 397
T+ I P +R S+N+S
Sbjct: 192 RTSKI---GRNPTSSARRSKNES 211
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 340 bits (871), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 164/271 (60%), Positives = 205/271 (75%), Gaps = 2/271 (0%)
Query: 673 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL 732
DFSKSGF RT+++CDQCERE+HVGCLK+HKMADL+ELP+G+WFC DC RI+S LQ LL
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 733 VQEAEKLPEFHLNAIKKYAG-NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 791
++ A+ LP ++ I+K ++ D+RW+LLSGK A E+RLLLS+AVAIFH+
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAESRLLLSKAVAIFHES 122
Query: 792 FDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
FDPIVD+ +GRDLIP+MVYGR +R Q++GG+YCA+LTV SSVVSAGILRV G E+AELPL
Sbjct: 123 FDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAELPL 182
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 911
VATS+ + G+GYFQ LF+CIE+LL L+VK VLPAA+EAESIWT KFGF KI + L
Sbjct: 183 VATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDELHK 242
Query: 912 YRKRCSQLVTFKGTSMLQKRVPACRIGSSST 942
Y V F+GTS L K V + S T
Sbjct: 243 YLNGARTTV-FQGTSTLHKPVTVPHVSSRET 272
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 187/442 (42%), Positives = 245/442 (55%), Gaps = 74/442 (16%)
Query: 496 SFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSE 555
++ +KS IT KD+ LHKLVF E+ L DG VGY+ G+
Sbjct: 174 TYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGK-------------------- 213
Query: 556 VSPSQFEAHAGRQYPGK-------------------------DNDDLCTICADGGNLLPC 590
VSPS+FEAHAGR K D+D+ C+ C GNLL C
Sbjct: 214 VSPSKFEAHAGRASRRKPYSYIRTADGVSLHELANNRRISMSDSDERCSHCEQVGNLLWC 273
Query: 591 DGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 650
D C R+FH EC L S P+ YC+YC+N F + + ++H N V GR++ D EQIT+
Sbjct: 274 DRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKENDVATGRIAEGDPSEQITE 333
Query: 651 RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLREL 710
C K E + C LC DF+ + GPRT+++C QCE+EFHV CLK H MA+L EL
Sbjct: 334 VCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKEFHVECLKDHNMANLVEL 393
Query: 711 PKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSG 770
PK KWFC +DC I+ LQ L+ + +L +D++WRLL+
Sbjct: 394 PKDKWFCGIDCDDIHMKLQKLMARGEAEL--------------------GLDIKWRLLNT 433
Query: 771 KAATPETRL--LLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAIL 827
K P+ + L+S+A AIFH+ F I D + DLI +M+YG + GQ F GMYCA+L
Sbjct: 434 KLNNPKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLYGMEIEGQYSFEGMYCAVL 493
Query: 828 TVNSSVVSAGILRVFGQEVAELPLVATSK------INHGKGYFQLLFACIEKLLSFLRVK 881
+ AGI RV GQEVAELPLVAT+ I GYF+ LF+CIE +L L+VK
Sbjct: 494 YFKKVIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGYFRSLFSCIENMLRHLKVK 553
Query: 882 SIVLPAAEEAESIWTDKFGFKK 903
++VLPAA EAES+W DKFGF K
Sbjct: 554 TLVLPAAHEAESMWIDKFGFTK 575
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 306 bits (784), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 161/358 (44%), Positives = 206/358 (57%), Gaps = 50/358 (13%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYPGKD------------------ 573
QK+L GYK G GI+C CC E+SPSQFE+HAG R+ P +
Sbjct: 17 QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLAN 76
Query: 574 --------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
DD+C C DGG+L+ C CPRAFH C L P+G W+C C +
Sbjct: 77 GQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNKLGHGGN 136
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
F I R R+VK E ++ GC +CR DFS F RT++
Sbjct: 137 F------------------ARPIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTFDDRTVI 178
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
LCDQCE+EFHVGCL++ + DL+E+PK WFCC DC+ I L+N + + +P LN
Sbjct: 179 LCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTIPASLLN 238
Query: 746 AI--KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
I K L + DV+W++L GK+ E LLS A AIF +CFDPIV + +GRD
Sbjct: 239 IINRKHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIFRECFDPIV-AKTGRD 297
Query: 804 LIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
LIP MVYGRN+ GQEFGGMYC +LTV VVSAG+LR+FG+EVAELPLVAT++ + GK
Sbjct: 298 LIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVATNREHQGK 355
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 145/230 (63%), Positives = 172/230 (74%), Gaps = 4/230 (1%)
Query: 150 GSEVSNGLNKKCLKRPSAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKK 209
G SN K+ K S +K K++ VEV V E E++S ++VE IAEGSALT PKK
Sbjct: 10 GEPNSNNRPKRVTK--SKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKK 67
Query: 210 NLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCS 269
NLELKMSKKI+L+ P+TV ELFETGLL+GV VVYMGG KFQA GLRG I+D GILCSC+
Sbjct: 68 NLELKMSKKIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQAFGLRGTIKDVGILCSCA 127
Query: 270 LCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALS 329
CNG RVIPPS+FEIHA KQYRRA+QYICFENGKSLL+VL ACR+ PL L+ T+QSA+S
Sbjct: 128 FCNGRRVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAIS 187
Query: 330 SLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGI 379
LP E++F C RCKG FP CVGK GPLCN C +SK+ T+T + I
Sbjct: 188 GLPVERTFTCKRCKGIFPSICVGKI--GPLCNLCAESKESHPTLTIGSSI 235
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 277 bits (709), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/271 (52%), Positives = 180/271 (66%), Gaps = 6/271 (2%)
Query: 668 LCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSV 727
LCR DF+ + F RT++LCDQCE+E+HVGCLK +L+ELP+G+WFCC CS S
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 728 LQNLLVQEAEKLPEFHLNAIKK-YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 786
L ++ A+ L E L IKK + L + D++W+LLSGK T + +LLS AV
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRTTEDGSILLSAAVP 131
Query: 787 IFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGILRVF 842
IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA +LRV
Sbjct: 132 IFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLRVM 191
Query: 843 GQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFK 902
G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KFGF
Sbjct: 192 GGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFGFS 251
Query: 903 KIDPELLSIYRKRCSQLVTFKGTSMLQKRVP 933
+ PE L Y + L F GTS L K VP
Sbjct: 252 RTTPEELEAYLNG-AHLTIFHGTSYLYKAVP 281
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 272 bits (695), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 142/277 (51%), Positives = 183/277 (66%), Gaps = 12/277 (4%)
Query: 665 GCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
G L R DF+ + F RT++LCDQCE+E+HVGCL+ +L+ELP+G+WFCC CS
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 725 NSVLQNLLVQEAEKLPEFHLNAI-KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 783
S L ++ A+ L E L I KK+ L + D++W+LLSGK AT E +LLS
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKRATEEGSILLSA 174
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN----LRGQEFGGMYCAILTVNSSVVSAGIL 839
AV IFH FDPI ++++GRDLIP MV GR + GQ++ GMYCA+LTV S+VVSA ++
Sbjct: 175 AVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALM 234
Query: 840 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 899
RV G +VAELPLVATS+ G GYFQ LF+CIE++L L++K VLPAA EAE IW KF
Sbjct: 235 RVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMKKF 294
Query: 900 GFKKIDPELLSIYRKRC---SQLVTFKGTSMLQKRVP 933
GF +I PE L + C + L F GTS L K VP
Sbjct: 295 GFSRIPPEEL----EACLNGAHLTIFHGTSYLYKAVP 327
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 268 bits (686), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 158/447 (35%), Positives = 236/447 (52%), Gaps = 54/447 (12%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPS----QFEA 563
+D LHK +F GL +GTE+GYY Q L+G K G GI C CCN E S +FE
Sbjct: 487 RDAHLHKALFLPGGLENGTELGYYTKSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQ 546
Query: 564 HAG---RQYP---------GKDNDDLCTI-------------------------CADGGN 586
HAG R+ P G+ D+C C+ G
Sbjct: 547 HAGCEARRNPYGNILVLVDGRSLKDVCKDLTHKNKLGEQQNCEPLARDVNCCYECSSSGE 606
Query: 587 LLPCDGCPRAFHKECASLSSI-PQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSV 645
L C+GC A+ C + WYC+ C+N + + + V S+
Sbjct: 607 LKTCNGCEEAWCDNCTKGEEVDSDSKWYCRMCRNDTLK---VAQNGQKVSGKHQEESSSI 663
Query: 646 EQITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKH 702
+I +R R +++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK
Sbjct: 664 TEIDERG-RCIRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDS 722
Query: 703 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVS 759
M +L ELP+G+WFC C I+ +L L+ E L ++ + ++ +E
Sbjct: 723 GMDNLNELPEGEWFCQKGCKVIDEILTQLVAIGPESLSHSIISELPENRQQKSGVIEKAE 782
Query: 760 DID--VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ 817
I W++L GK ++P L++AV IF +C DPI D+ +G++LIP MV R +
Sbjct: 783 SISPSFEWQILCGKGSSPANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDY 842
Query: 818 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 877
+F G++C +L +N VVSA +L++FG+E AE+PLVATS + G+G+ + L IE+LL
Sbjct: 843 DFEGVFCVVLKLNGKVVSAALLQIFGREFAEVPLVATSLPHQGQGFCKALMTTIERLLGV 902
Query: 878 LRVKSIVLPAAEEAESIWTDKFGFKKI 904
L V+ +VLP A++ ES+W +KFGF ++
Sbjct: 903 LSVERLVLPTAKDTESLWVNKFGFSRV 929
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 69/177 (38%), Gaps = 29/177 (16%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C G ++
Sbjct: 286 LLKAPRNAKELMATRLMEGHHVRCSCRGIQ-----LTGMLKDMGVQCDCRNCRGSVIVSI 340
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRA------CRSVPLPMLKATL--------- 324
S FE H+ S I ENGK+L ++L A C L LK +
Sbjct: 341 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRALKMAIGDVQGVEKS 400
Query: 325 --QSALSSLPEEKSF---ACVRCKGTFPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
+ A PEE RC CV P C C K+KKP ++
Sbjct: 401 KSKCAKCGNPEEGDLIYCKGARCSVVAHSGCVEIANPHLGDWFCGKCEKTKKPHASV 457
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 268 bits (685), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 123/196 (62%), Positives = 157/196 (80%)
Query: 747 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 806
+KK LE + +IDVRWRLL+G+ A+PET+ LL +AV++FH+CFDPIVD +GRDLIP
Sbjct: 1 MKKQEERCLEPLREIDVRWRLLNGRVASPETKPLLLEAVSMFHECFDPIVDPATGRDLIP 60
Query: 807 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 866
+MV GRNLR Q+FGGMYCA+L VNSSV SA +LR+FG ++AELPL+AT N GKGYFQ
Sbjct: 61 AMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQT 120
Query: 867 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTS 926
LF+CIE+LLSFL VK++VLPAAEEAESIW KFGF K++P+ L+ YRK Q++ FKGT
Sbjct: 121 LFSCIERLLSFLSVKNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGTV 180
Query: 927 MLQKRVPACRIGSSST 942
ML K VP CR+ ++ +
Sbjct: 181 MLHKTVPQCRVTNTQS 196
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 168/475 (35%), Positives = 246/475 (51%), Gaps = 58/475 (12%)
Query: 506 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 565
+ +D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E+S S FE HA
Sbjct: 738 SARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEISCSAFEQHA 797
Query: 566 G---RQYP--------GKDNDDLCT-------------ICADGGNLLPCDGCPR----AF 597
G R+ P G+ D+C A G++ G A
Sbjct: 798 GCESRRNPYGSILLADGRSLKDMCKELAYQSKLGDRAHQVARTGDVKSSSGSEEQGVLAS 857
Query: 598 HKECASLSSIPQGDWY-----------CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE 646
+ C S I G + CK CQ E A+ RV ++
Sbjct: 858 SQRCESTWCINFGTRFSCQEADSGHPLCKICQKNVE-------GAHKTSKKRVDATANIP 910
Query: 647 QI--TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 704
T R +R+ + ++ SGC +C+ K GF T+L+CDQC RE+HVGCL++ +
Sbjct: 911 ATDDTGRNVRLFQAPDSS-SGCAICKKWTLKKCGFD-MTMLVCDQCGREYHVGCLRESGI 968
Query: 705 ADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETV--S 759
D ELP+ +W+C +C I VL L+ E L + +N + +++ +E S
Sbjct: 969 LD--ELPEAEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESRQHQQGIVEMAESS 1026
Query: 760 DIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEF 819
W++L G P L+QAV IF +C DPI D+ SG+++IP MVY R + +F
Sbjct: 1027 SPVFGWQILHGAGENPVNGRTLAQAVEIFTECSDPIKDAPSGQNMIPIMVYSRRFKDYDF 1086
Query: 820 GGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLR 879
G+YC +LT+N VVS +L++FG+EVAE+PL+ATS + +G+ + L IE+LL L
Sbjct: 1087 DGIYCVVLTLNEKVVSTALLQIFGREVAEVPLIATSVDHQDQGFCKALMTTIERLLGVLN 1146
Query: 880 VKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
V+ +VLPA++ AE +W ++FGF +++ L R LV F GT+ML K + A
Sbjct: 1147 VERLVLPASKNAEFVWVNRFGFSRMEDAQLKHIRSMMGLLV-FTGTTMLVKHIDA 1200
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 24/144 (16%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRA--- 311
L GI++D G++C+C +C G +V+ S FE H+ S I ENGK+L ++L A
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 312 ---CRSVPLPMLKATLQSALSSLPEEKSFAC----------------VRCKGTFPITCVG 352
C L LK + + +P +K AC +C + CVG
Sbjct: 630 SADCGGDILGALKHAI-GEIQGIP-KKEGACGKCGKREGGDFVSCKEPKCSAVYHAECVG 687
Query: 353 KTGPGPLCNSCVKSKKPQGTMTYT 376
P + C K +K Q M T
Sbjct: 688 LPSPHRVDWFCAKCEKAQVKMPKT 711
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 251 bits (642), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 156/458 (34%), Positives = 233/458 (50%), Gaps = 64/458 (13%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D LHK +F GL DGTE+GYY Q L+G K G E++ +FE HAG
Sbjct: 588 RDAHLHKALFLPGGLEDGTELGYYTKSQLKLKGVKRGEAF--KKVVVEINCYKFEQHAGC 645
Query: 567 --RQYP---------GKDNDDLCTI-------------------------CADGGNLLPC 590
R+ P G+ D+C C G L C
Sbjct: 646 EARRNPYGNILLVADGRSLKDVCKELAHKNKLGEKEKRVARAGKVNSCYECGTRGELKNC 705
Query: 591 DGCPRAFHKECA-SLSSIPQGDWYCKYCQN------MFERKRFLQHDANAVEAGRVSGVD 643
GC + C L + G WYC+ C+ E+KR +H + G+
Sbjct: 706 HGCVETWCNSCTKGLETDSDGKWYCRMCRQDTLNVAQIEQKRSNKH---------IEGMS 756
Query: 644 SVEQITKRCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLK 700
++ + +R R V++LE E+ GC +C+ + SK+GF TIL+CDQC RE+HV CLK
Sbjct: 757 NIAETDER-DRCVRHLEGHREVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLK 815
Query: 701 KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-----KKYAGNSL 755
+ DL ELP+G+WFC DC I+ +L L+ E L + ++ + ++
Sbjct: 816 DSGVDDLNELPEGEWFCQKDCKVIDEILTQLVANGPELLTDSIISELLESRQQQTGAKDK 875
Query: 756 ETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR 815
S W++L GK+ L++A+ IF +C DPI D+ +G++LIP MV R +
Sbjct: 876 AESSCPSFAWQILCGKSGNTANTQTLAEAINIFTECSDPIRDAKTGKNLIPLMVQSRRSK 935
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
+F G++C +L +N VVSA +L++FG E+AE+PLVATS + G+G+ + L IE+LL
Sbjct: 936 DHDFEGVFCIVLKLNEKVVSAALLQIFGGEIAEVPLVATSLTHQGQGFCKALMTTIERLL 995
Query: 876 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
L V+ +VLP A+ ESIW +KFGF ++ + S R
Sbjct: 996 GVLSVERLVLPTAKNTESIWINKFGFSRVPDDEGSFLR 1033
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 75/178 (42%), Gaps = 31/178 (17%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L K P EL T L++G V GI+ L G+++D G+ C C C ++
Sbjct: 387 LLKAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVRCDCRNCKSSVIVSI 441
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 442 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 500
Query: 335 KSFACVR-----------CKGT-----FPITCVGKTGP---GPLCNSCVKSKKPQGTM 373
C + CKG C+G + P C+ C K KKP T+
Sbjct: 501 WKVTCAKCWNSDEGDLIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATV 558
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 240 bits (612), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 151/482 (31%), Positives = 227/482 (47%), Gaps = 124/482 (25%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHK-------------------------EC-ASLSSIPQ 609
D C C D G+L C GCP A+H+ +C S S
Sbjct: 999 DQCVECGDSGDLQLCTGCPNAYHQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSF 1058
Query: 610 GDWYCKYCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEA-EL 663
G+++C CQ +R+R + + G + + +++ RC R+++ EA L
Sbjct: 1059 GEFFCPDCQEQRFGGTKDRRRSMTKRRSK---GAAKTLLTKDRMIGRCSRLLQVPEAIVL 1115
Query: 664 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG-------------------------- 697
GC+ C+ DF+K+GFGP+T LLCDQCERE+HVG
Sbjct: 1116 GGCVFCKSGDFAKTGFGPKTTLLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGE 1175
Query: 698 ----CLKKHKMAD------------------------LRELPKGKWFCCMDCSRINSVLQ 729
C+ ++ D +ELP+G+WFC DC I+S+L
Sbjct: 1176 VVFRCVATVRLRDCDRGCESQGIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILS 1235
Query: 730 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI------DVRWRLLSGKAATPETRLLLSQ 783
L+ E L + ++ + + LE D W+LL G+ P L++
Sbjct: 1236 LLVSNGPEPLADSIISKVLRTNQARLERSEDATESSCSGFEWQLLHGRGGDPSNGKALAE 1295
Query: 784 AVAIFH----------------------------DCFDPIVDSISGRDLIPSMVYGRNLR 815
AV IF +CFDPI D +SG DLIP MVY R+LR
Sbjct: 1296 AVQIFSVRNLSDPGFPVRTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLR 1355
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
Q+FGG+YC +L ++ VVS ++RVFG+++AELPL+AT+ + G+G+ + L IE+LL
Sbjct: 1356 DQDFGGIYCVVLKYDNRVVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLL 1415
Query: 876 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPAC 935
LRV+ + LPAAE AE IW +KFGF+++ + + + +V F G+ ML+K +P
Sbjct: 1416 GVLRVERLALPAAEGAEGIWLNKFGFRRMAEGQVKQFHSDLNMMV-FTGSFMLEKEIPPL 1474
Query: 936 RI 937
I
Sbjct: 1475 EI 1476
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 124/291 (42%), Gaps = 43/291 (14%)
Query: 58 VKRSRFSNSDDLLEDDVIDKR----------INSKIHEGRINKVVKNVL-------NENG 100
V SR + + L DD +D R N + E + ++ + V+ NG
Sbjct: 386 VAGSRNDATSNPLSDDGLDGREQDGNGGKGLTNGTVEEDKCTELFEPVVLLPEHSGGTNG 445
Query: 101 ILESVVEEENQLVQMTVENVIEETVKGKKAPICKEEPISKVECFPRKEGGSEVSNGLNKK 160
+ES V+ E+ E+ + G A SK FP+ G + + ++++
Sbjct: 446 FVESAVDSEDLSSDPAREDSLGNQELGGSAGAAVAS--SKDLEFPQHSGSEKRGDSMDRQ 503
Query: 161 ------------CLKRPSAMKPKVEPVEVLVTQSE---------GFGNESMSLIEVEAIA 199
R +K V P+++ + +G + S ++++
Sbjct: 504 DSDVATPSAATSAQSRSPGLKVGVVPLKIWLWAVGWLGGWAWLVSYGADGNSRLKLKGQG 563
Query: 200 EGSALTSPKKNLELK-MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGI 258
+ S SP N + K++ L + P + L ++GLLDG V YMG + L GI
Sbjct: 564 DVSMRQSPTLNGARGFVVKEVLLKEAPASAKLLLQSGLLDGHHVRYMG--RGGHIMLTGI 621
Query: 259 IRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVL 309
I++GG+LC CS C G +V+ S FE HA R S +I ENGK L ++L
Sbjct: 622 IQEGGVLCDCSSCKGVQVVNVSAFEKHAGSSARHPSDFIFLENGKCLKDIL 672
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 35/64 (54%), Gaps = 7/64 (10%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D TEVGYY GQK L G K G GI+
Sbjct: 784 ETASPPVLSRESSGAN-------LHKALFLPGGLEDDTEVGYYVKGQKSLAGVKKGAGIL 836
Query: 549 CHCC 552
C CC
Sbjct: 837 CSCC 840
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 233 bits (593), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 198/333 (59%), Gaps = 15/333 (4%)
Query: 609 QGDWYCKYC-QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL 667
+G WYC+ C Q+ + + Q + + G + ++ EQ RCIR + E+ GC
Sbjct: 983 EGRWYCRMCRQDSLKVAQNGQKGSEKIMEGMSNIAETNEQ--GRCIRHLDGPR-EVGGCA 1039
Query: 668 LCRGCDFSKSGF-GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
+C+ + SK+GF TIL+CDQC RE+HV CLK + DL ELP G+WFC DC I+
Sbjct: 1040 ICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDCKVIDE 1099
Query: 727 VLQNLLVQEAEKLPEFHLNAI---KKYAGNSLETVSDIDVR--WRLLSGKAATPETRLLL 781
+L L+ E L ++ + ++ ++ E + + R W++L GK ++P L
Sbjct: 1100 ILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSPADVQTL 1159
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRV 841
++A IF DC DPI D +G++LIP MV R + +F G++C +L +N VVSA +L++
Sbjct: 1160 AEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVVSAALLQI 1219
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
FG+E+AE+PL+ATS +HG+ + + L IE+LL L V+ +VLP A+ ES+W +KFGF
Sbjct: 1220 FGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESVWINKFGF 1279
Query: 902 KKIDPELLSIYRKRCS--QLVTFKGTSMLQKRV 932
++ + L + C+ +L TF GTSM+ K +
Sbjct: 1280 SRVQEDQL---KSICTTIRLTTFTGTSMVVKAI 1309
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 37/67 (55%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR 567
+D LHK +F GL DGTE+GYY Q L+G K G GI C CCN E Q + G
Sbjct: 779 RDAHLHKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNEETRHVQGGSRGGL 838
Query: 568 QYPGKDN 574
+P + +
Sbjct: 839 MFPSQQD 845
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 64/131 (48%), Gaps = 12/131 (9%)
Query: 221 LNKKPMTVTELFETGLLDGVSV-VYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPP 279
L + P EL T L++G V GI+ L G+++D G+ C+C C G ++
Sbjct: 563 LVRAPRNAKELMATRLMEGHFVRCSCRGIQ-----LTGMLKDMGVQCNCRNCKGSVIVSI 617
Query: 280 SKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EE 334
S FE H+ S I ENGK+L ++L A + +L+A L+ A+ + E+
Sbjct: 618 SAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQEAADCGDNILRA-LKMAIGDIQGVEK 676
Query: 335 KSFACVRCKGT 345
+ C +C+ +
Sbjct: 677 RKVTCAKCESS 687
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 99/133 (74%), Positives = 116/133 (87%)
Query: 808 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
MVYGRN+RGQEFGGMYCA+L VNSSVVSA +LR+FG +VAELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 927
F+CIE+LL+F+ VKS+VLPAAEEAESIWTDKFGF ++ P+ LS YRK +Q+VTFKGT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 928 LQKRVPACRIGSS 940
L K VP CRI S+
Sbjct: 121 LHKLVPPCRIISN 133
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 98/130 (75%), Positives = 116/130 (89%)
Query: 808 MVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
MVYGR++ GQEFGGMYCAIL VNS VVSA +LRVFGQ++AELPLVATS NHGKGYFQ L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 927
F+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I P+ LS YR+ C Q+VTFKGTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 928 LQKRVPACRI 937
LQK VP+CR+
Sbjct: 121 LQKTVPSCRV 130
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 159/555 (28%), Positives = 249/555 (44%), Gaps = 125/555 (22%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
+D L K +F GL DGTE+GYY Q L+G K G GI C CCN E+S +FE HAG
Sbjct: 714 RDAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCNKEISCYEFEQHAGC 773
Query: 567 --RQYP---------GKDNDDL------------------------CTICADGGNLLPCD 591
R+ P G+ D+ C C++ G L C
Sbjct: 774 EARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVCCYECSNSGELKRCH 833
Query: 592 GCPRAFHKECA-SLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITK 650
C A+ +C + + +G WYC+ C+ + + Q+ + + G+ ++ + +
Sbjct: 834 SCEEAWCDKCTKGMETDSEGRWYCRMCRQ--DSLKVAQNGHKGTDK-IIEGMSNIAETDE 890
Query: 651 RCIRIVKNLEA--ELSGCLLCRGCDFSKSGF-GPRTILLCDQCEREFHV----------- 696
+ R V++LE E+ GC +C+ + SK+GF TIL+CDQ +
Sbjct: 891 KG-RCVRHLEGPREVGGCAICKKWNLSKTGFVDGMTILVCDQVRSLNQMLPGTRITWKVN 949
Query: 697 -----GCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 751
L + +ELP+G+WFC DC I+ +L L+ PE N+I
Sbjct: 950 GFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANG----PELLSNSIISEL 1005
Query: 752 GNSLETVSDIDVR---------WRLLSGKAATPETRLLLSQAVAIF-------------- 788
S + S + V+ W++L G+ + L++A IF
Sbjct: 1006 LESRQQQSSVKVKLESSNPRFGWQILCGEGGSSANVQTLAEAANIFTSIDDINLPYLWLV 1065
Query: 789 ---------------------HDC-----FDPIVDSISGRDL----IPSMVYGRNLRGQE 818
C FDP S D+ + +V R + +
Sbjct: 1066 VGNYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNVALVVCSRRAKDHD 1125
Query: 819 FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFL 878
F G++C +L +N VVSA +L++FG+E+AE+PL+ATS + G+G+ + L IE+LL L
Sbjct: 1126 FEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCKALMTTIERLLGVL 1185
Query: 879 RVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS--QLVTFKGTSMLQKRVPAC- 935
V+ +VLP A+ ESIW +KFGF ++ + L ++ C+ +L+TF GT ML K +
Sbjct: 1186 SVERLVLPTAKNTESIWINKFGFSRVPEDQL---KRICTTIRLMTFTGTRMLGKAITPMT 1242
Query: 936 --RIGSSSTDSTECV 948
RI S D C+
Sbjct: 1243 LNRIQRQSRDGCVCI 1257
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 69/158 (43%), Gaps = 13/158 (8%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL T L++G + + L G+++D G+ C+C C G ++ S
Sbjct: 515 LLKAPRNAKELMATKLMEG----HFVRCSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSIS 570
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLP--EEK 335
FE H+ S I ENGK+L +VL A + +L+A L+ A+ + E+
Sbjct: 571 AFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQEAADCGDNILRA-LKMAIGDIQGVEKS 629
Query: 336 SFACVRCKGTFP---ITCVGKTGPGPLCNSCVKSKKPQ 370
C C G+ I C G + CV S PQ
Sbjct: 630 KVTCAECGGSEEGDLIYCKGARCSVVSHSRCVGSANPQ 667
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 149/444 (33%), Positives = 206/444 (46%), Gaps = 79/444 (17%)
Query: 513 HKLVF--DE-SGLPDGTEVGYYAC-GQKLLEGY-------KNGLGIICHCCNSEVSPSQF 561
HK +F DE GL DG V Y G++LL+G GI+C CCN +S SQF
Sbjct: 445 HKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCACCNGVISCSQF 504
Query: 562 EAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 621
EAHAGR D++ T L C AS + P C
Sbjct: 505 EAHAGRGSRRAPYDNIFTAAGVSLRKLAC--------LMPASEAESPISHRPAALC--AV 554
Query: 622 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 681
+R L+ + + V A GC+LC+ DF + GFG
Sbjct: 555 ADRRALEPE----------------------LVTVSGEAALHGGCVLCKVPDFLRGGFGE 592
Query: 682 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 741
RT+++CDQCERE+H+GCL +H A L ELP+GK + L ++L+
Sbjct: 593 RTMIICDQCEREYHIGCLAEHGRAHLTELPEGK-----------ASLYDILLT------- 634
Query: 742 FHLNA-------IKKYAGNSLETVSDIDV------RWRLLSGKAATPETRLLLSQAVAIF 788
HLN K A E VS + V W++L GK T T L A I
Sbjct: 635 LHLNGEWHCSPECKGIATRMRERVSSVPVPLQGEYSWQVLRGKDGTHATTWALKAAQEIL 694
Query: 789 HDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
+ FDPI+D ++G DL+ +MVY + L ++ GMY A+L V + + RVFG+++AE
Sbjct: 695 TESFDPILDLVTGADLMMAMVYAQELGDWDYTGMYTAVLRRRGKAVCSAVFRVFGRQLAE 754
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
+PLVAT +G+ ++L A E L V+S+ LPAA+ W FGF I PE
Sbjct: 755 VPLVATRLGARRQGHARVLMAAFEDYFRSLGVQSLCLPAAQSTVETWIHGFGFAAITPEE 814
Query: 909 LSIYRKRCSQL--VTFKGTSMLQK 930
+ CS+L + F GT +LQK
Sbjct: 815 QAAT---CSELRVLIFPGTELLQK 835
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 55/119 (46%), Gaps = 35/119 (29%)
Query: 534 GQKLLEGYKNGLG-IICHC--CNSEVSP----SQFEAHAG--RQYPGKD----------- 573
G LL G G I C C C ++ +P S+FE HAG + PG+
Sbjct: 251 GDLLLNGSITEEGQIACPCKQCRAKKTPGVSCSEFEEHAGSRERRPGESIYLTNLSISLK 310
Query: 574 ------NDD---------LCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
ND+ C +C DGG+LL CDGCP A H CA L +P+GDW+C C
Sbjct: 311 EFCALVNDEGRSADRHGSACGLCMDGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDAC 369
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 192 bits (489), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 202/723 (27%), Positives = 303/723 (41%), Gaps = 104/723 (14%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTS 461
+R+ R+ + D + + + S S G +ST TS
Sbjct: 281 ------KRRSGRRFRQGDTGTSTPTFS-----------------GSPGKGGFGHSTD-TS 316
Query: 462 VHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESG 521
K ++ + ++ K T +P N S +KS+ + T +D LH L+F E G
Sbjct: 317 EKKGTEETHRLSLSSPVKIT---QRPLRNCS----IDSKSKESKT-RDTTLHPLIFKEDG 368
Query: 522 LPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---RQYPGKDNDDL 577
L D T + Y G+ L +GYK G IIC+CCN E SPS FE HAG R+ P + L
Sbjct: 369 LADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNIYTL 428
Query: 578 CTICADGGNLLPCDGC-PRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 636
+ L D P F + AS+SS+ H+ +
Sbjct: 429 EGLSLHKLALQLQDHLNPNGF--DNASVSSVSD------------------YHNLTSSGC 468
Query: 637 GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHV 696
GR S + + R ++ E C C + P TI+ C+QCER H+
Sbjct: 469 GREPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHI 526
Query: 697 GCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKY 750
C KK + L+E + CC +C + + L+ ++K
Sbjct: 527 KCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLEE---------------GLEKC 571
Query: 751 AGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY 810
G + ++ WRLLSG A+ + +L + Q + IF D F D S D+I MV
Sbjct: 572 VGITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVN 629
Query: 811 GRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 869
G+N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL
Sbjct: 630 GKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLK 689
Query: 870 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQ 929
IE L V + P E IW++K GF + E + LV FK ++Q
Sbjct: 690 SIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTILSAEEKESMLES-HPLVMFKNLVLVQ 748
Query: 930 KRV 932
K +
Sbjct: 749 KSL 751
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 209/740 (28%), Positives = 311/740 (42%), Gaps = 103/740 (13%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 115 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 170
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 171 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 228
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 229 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 280
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP----GLIA 454
+ + R ++ S S K S K + + N+ P G+ +
Sbjct: 281 KRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDGVKS 340
Query: 455 NS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRWNITPK 508
+S T VT+ H S + + +S P + PL S +KS+ + T +
Sbjct: 341 DSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKSKESKT-R 390
Query: 509 DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS FE HAG
Sbjct: 391 DTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGM 450
Query: 567 --RQYPGKDNDDLCTICADGGNLLPCDGC-PRAFHKECASLSSIPQGDWYCKYCQNMFER 623
R+ P + L + L D P F + AS+SS+ D+
Sbjct: 451 GRRRQPYHNIYTLEGLSLHKLALQLQDHLNPNGF--DNASVSSV--SDY----------- 495
Query: 624 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 683
H+ + GR S + + R ++ E C C + P T
Sbjct: 496 -----HNLTSSGCGREPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDT 548
Query: 684 ILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
I+ C+QCER H+ C KK + L+E + CC +C + + L+
Sbjct: 549 IIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLEE------- 601
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
++K G + ++ WRLLSG A+ + +L + Q + IF D F D
Sbjct: 602 --------GLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTD 653
Query: 798 SISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
S D+I MV G+N + ++F GMYCA+LT ++ VVSA IL+V +++AEL L+AT
Sbjct: 654 EHS--DIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAILKVRIEQIAELVLIATRS 711
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK-- 914
KGYF LL IE L V + P E IW++K GF +LS K
Sbjct: 712 ECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFT-----ILSAEEKES 766
Query: 915 --RCSQLVTFKGTSMLQKRV 932
LV FK ++QK +
Sbjct: 767 MLESHPLVMFKNLVLVQKSL 786
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 183 bits (464), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 140/450 (31%), Positives = 198/450 (44%), Gaps = 84/450 (18%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG 566
+D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE HAG
Sbjct: 170 RDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFEKHAG 229
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRF 626
GK I G L ++ +
Sbjct: 230 M---GKRRQPYQNIYTSQGLTL-----------------------------HDVALQLHH 257
Query: 627 LQHDANAVEAGRVSGVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRGCDFS 675
L ++N VS +T + + + L+ + C C +
Sbjct: 258 LNLNSNGFSNASVSSFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYFCGYG 317
Query: 676 KSGFG---PRTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINS 726
+ G P I+ C+QCER HV C KK + L++ + CC +C +
Sbjct: 318 HTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQSLRV 377
Query: 727 VLQNLLVQEAEKLPEF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAV 785
L+ + EK E L I+ ++ WRLLS A+ + +L LSQ +
Sbjct: 378 RLEGM-----EKCEEIAFLGRIRS------------NICWRLLSSADASRDVKLYLSQVI 420
Query: 786 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQ 844
IF D F ++S I MVYG+N G+ +F GMYC +LT ++ VVSA IL+V +
Sbjct: 421 DIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILKVRVE 477
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
AEL L+AT KGYF+LL IE L V ++ P E IW+DK GF
Sbjct: 478 HAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLGFT-- 535
Query: 905 DPELLSIYRKR----CSQLVTFKGTSMLQK 930
+LS K+ LV FK ++QK
Sbjct: 536 ---ILSADEKKSMLESHPLVMFKNLVLVQK 562
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 182 bits (463), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 155/514 (30%), Positives = 221/514 (42%), Gaps = 88/514 (17%)
Query: 447 SSRPGLIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRW 503
SS G+ S + K ++ R + T S P +A PL S + SR
Sbjct: 76 SSDKGISGLSAGTSKEKKGTEETRSAQNTGDLGLISSSSSPVTSAQRPLPSSSVGSNSRE 135
Query: 504 NITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFE 562
+ +D LH L+F E+GLPD T + Y G+ LL+GYK G GI+C+CCN EVSPS+FE
Sbjct: 136 S-KKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGAGIVCNCCNQEVSPSEFE 194
Query: 563 AHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
HAG GK I G L ++
Sbjct: 195 KHAGM---GKRRQPYQNIYTSQGLTL-----------------------------HDVAL 222
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKR-----------CIRIVKNLEAELSGCLLCRG 671
+ L ++N VS +T + + + L+ + C
Sbjct: 223 QLHHLNLNSNGFSNASVSSFSDYPNLTSSGCGKEPSVSGPIVPLKRTLQERVVQTESCYF 282
Query: 672 CDFSKSGFG---PRTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCS 722
C + + G P I+ C+QCER HV C KK + L++ + CC +C
Sbjct: 283 CGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEILKDYLCFHFLCCQECQ 342
Query: 723 RINSVLQNLLVQEAEKLPEF-HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 781
+ L+ + EK E L I+ ++ WRLLS A+ + +L L
Sbjct: 343 SLRVRLEGM-----EKCEEIAFLGRIRS------------NICWRLLSSADASRDVKLYL 385
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILR 840
SQ + IF D F ++S I MVYG+N G+ +F GMYC +LT ++ VVSA IL+
Sbjct: 386 SQVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMYCVVLTASTHVVSAAILK 442
Query: 841 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 900
V + AEL L+AT KGYF+LL IE L V ++ P E IW+DK G
Sbjct: 443 VRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLLMAPVDPEMAQIWSDKLG 502
Query: 901 FKKIDPELLSIYRKR----CSQLVTFKGTSMLQK 930
F +LS K+ LV FK ++QK
Sbjct: 503 FT-----ILSADEKKSMLESHPLVMFKNLVLVQK 531
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 124/444 (27%), Positives = 187/444 (42%), Gaps = 110/444 (24%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------- 566
Q LLEG GI+C CC S S FEAH+G
Sbjct: 17 QVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRACASIFNNKGESLLDLQVQAWEL 76
Query: 567 -----------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+ P +NDD C +C DGG L+ CD CP +H C L +P+G+W+C
Sbjct: 77 LDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCP 136
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C+ C +C G +++
Sbjct: 137 SCR-----------------------------------------------CAICGGSEYN 149
Query: 676 KSG--FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
G F T+LLCDQCERE+HV CL MA + P WFC C +I L+ L V
Sbjct: 150 ADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFEGLRKL-V 208
Query: 734 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
+ + E + + + L + S ++ + E R L+ A+ + +CF
Sbjct: 209 GISNTIGEGLSWTLLRSGEDDLPSASSMN--------REQMAEHRSKLAVALGVMQECFL 260
Query: 794 PIVDSISGRDLIPSMVYGRN---LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 850
P+VD + DL+ ++Y R + F G Y +L + V+S +R+ G +AE+P
Sbjct: 261 PMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMP 320
Query: 851 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP---- 906
L+ T + +G + L IE LL L ++S VLPA E W + FGF+++ P
Sbjct: 321 LIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRL 380
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQK 930
EL+ + +V+F G ++LQK
Sbjct: 381 ELVKL------SVVSFPGVTLLQK 398
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 179 bits (455), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 123/444 (27%), Positives = 187/444 (42%), Gaps = 110/444 (24%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------- 566
Q LLEG GI+C CC S S FEAH+G
Sbjct: 42 QVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSGTSQRRACASIFNNKGESLLDLQVQAWEL 101
Query: 567 -----------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+ P +NDD C +C DGG L+ CD CP +H C L +P+G+W+C
Sbjct: 102 LDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCP 161
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C+ C +C G +++
Sbjct: 162 SCR-----------------------------------------------CAICGGSEYN 174
Query: 676 KSG--FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
G F T+LLCDQCERE+HV CL MA + P WFC C +I L+ L V
Sbjct: 175 ADGSSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFQGLRKL-V 233
Query: 734 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
+ + E + + + L + + ++ + E R L+ A+ + +CF
Sbjct: 234 GISNNIGEGLSWTLLRSGEDDLPSANSMN--------REQMAEHRSKLAVALGVMQECFL 285
Query: 794 PIVDSISGRDLIPSMVYGRN---LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 850
P+VD + DL+ ++Y R + F G Y +L + V+S +R+ G +AE+P
Sbjct: 286 PMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMP 345
Query: 851 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP---- 906
L+ T + +G + L IE LL L ++S VLPA E W + FGF+++ P
Sbjct: 346 LIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRL 405
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQK 930
EL+ + +V+F G ++LQK
Sbjct: 406 ELVKL------SVVSFPGVTLLQK 423
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 97/207 (46%), Positives = 119/207 (57%), Gaps = 55/207 (26%)
Query: 498 PNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVS 557
P+K +T KD R+HKLVF+ LPDGT + YY G+ VS
Sbjct: 576 PSKVHGRLTRKDLRMHKLVFEGDVLPDGTALAYYVRGK--------------------VS 615
Query: 558 PSQFEAHAG---RQYPG-------------------------KDND------DLCTICAD 583
PSQFEAHAG R+ PG KD DLC+ICAD
Sbjct: 616 PSQFEAHAGCASRRKPGWYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICAD 675
Query: 584 GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVD 643
GG+LL CD CPRAFH EC SL +IP+G WYCKYC+NMF +++F ANA+ AGRV+G+D
Sbjct: 676 GGDLLCCDNCPRAFHTECVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGID 734
Query: 644 SVEQITKRCIRIVKNLEAELSGCLLCR 670
++EQITKR IRIV L AE+ C+LCR
Sbjct: 735 ALEQITKRSIRIVDTLHAEVGVCVLCR 761
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/173 (41%), Positives = 103/173 (59%), Gaps = 5/173 (2%)
Query: 200 EGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGII 259
E SA+ + K LE+KMSKK++L K P + L TGLL+G+ V Y G GL+G+I
Sbjct: 346 EASAIGTTSK-LEMKMSKKVALVKIPTKLKGLLATGLLEGLPVRYARG--RPEKGLQGVI 402
Query: 260 RDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPM 319
+ GILC C C G +V+ P++FE+HA +R +YI +NGK+L +VL AC+ P
Sbjct: 403 QGSGILCFCQNCGGTKVVTPNQFEMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADA 462
Query: 320 LKATLQSALSSLPEEKSFACVRCKGTFPITCVGKTGPGPLCNSCVKSKKPQGT 372
L+A +++A + KS C+ CK + P G+ P C+SC+ SKK Q T
Sbjct: 463 LEAAIRNATGAGDARKSTVCLNCKASLPEASFGR--PRLQCDSCMTSKKSQTT 513
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 181/441 (41%), Gaps = 102/441 (23%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG--------------------------- 566
G +L EG GI C CCN + FE HAG
Sbjct: 728 GPRLKEGAVTKDGIKCRCCNELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVELMG 787
Query: 567 -------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
Y ++D +C+IC +GG +L CD CP +FH C L S P+G WY
Sbjct: 788 ENKPKESLHVRLKTNYSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWY 847
Query: 614 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 673
C C+ C +C D
Sbjct: 848 CPSCR-----------------------------------------------CSICDSSD 860
Query: 674 FS--KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNL 731
+ + F +TI+ CDQCERE+HVGC++ +K L P+G WFC CS I LQ L
Sbjct: 861 YDPDTNKFTEKTIMYCDQCEREYHVGCMR-NKGDQLTCCPEGCWFCSRGCSEIFQHLQGL 919
Query: 732 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDC 791
+ + E I ++ + D E L A+ + H+C
Sbjct: 920 IGKSIPTPVEGLSCTILRFDRENASQHGDF--------YNEIIAEQYGKLCIALDVLHEC 971
Query: 792 FDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAEL 849
F I++ + RDL +V+ R LR F G Y IL + ++S G RV G++ AEL
Sbjct: 972 FVTIIEPSTRRDLSEDIVFNRESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAEL 1031
Query: 850 PLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELL 909
PL+ T +G +LL +EKLLS L V+ +VLPA + WT FGF+ +
Sbjct: 1032 PLIGTRVQYRRQGMCRLLMNELEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAM--SFS 1089
Query: 910 SIYRKRCSQLVTFKGTSMLQK 930
+ S +++F+GT++ QK
Sbjct: 1090 DRFELAESSILSFQGTTICQK 1110
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 95/280 (33%), Positives = 145/280 (51%), Gaps = 30/280 (10%)
Query: 664 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 723
+ C+LC DF + GF +T+L+CDQCE+E+H+GCL++HKM D++ +P+G+WFC +C R
Sbjct: 1510 AACVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVR 1569
Query: 724 INSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 783
I L + +P GN RW++L GK +T LS
Sbjct: 1570 IRDALGEDVAAGEVLMP-----------GNPA-------YRWQILRGKNGRQQTWHALST 1611
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAIL-------TVNSSVVSA 836
+ I + FDPI+D+ SG DL+P+MV +F GMY +L V A
Sbjct: 1612 VLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVLA 1671
Query: 837 GILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT 896
+ RV G +AE+PLVAT +G+ + L + L L V++IVLPA +A+ W
Sbjct: 1672 ALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLGLGVRAIVLPATADAQPAWR 1731
Query: 897 DKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---RVP 933
+ F+ +D + R ++V F T++L + RVP
Sbjct: 1732 -QLQFQDLDEPSTRVARSE-HRMVIFPHTTVLARPLIRVP 1769
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 129/437 (29%), Positives = 183/437 (41%), Gaps = 100/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ---------------------------- 568
LLEG GI C CC+ V+ +F AHAG Q
Sbjct: 950 LLEGLITRDGINCSCCSKVVTVLEFVAHAGGQLSKPYRNILVDGLDNDLLHCLIIAWDKQ 1009
Query: 569 --------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+P G D NDD C IC DGGNL+ CDGCP FH C L +P DW C
Sbjct: 1010 SDSERQAFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWRCT 1069
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C + L HDA D+ E
Sbjct: 1070 NCSCKLCHE-HLNHDA----------PDNAE----------------------------- 1089
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
P + C QCE+++H C + + G FC C + LQNLL E
Sbjct: 1090 ---IDP--LHSCSQCEKKYHPSCSPETEKLSSVSSQAGNHFCQQSCRLLFEELQNLLAVE 1144
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PE+ I+ ++ ETV D+D R E ++ A+++ +CF PI
Sbjct: 1145 KDLGPEYACRIIQCIHEDAPETVLDLDGRV----------ECNSKIAVALSLMDECFLPI 1194
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y IL +VSA +R+ G ++AE+P +
Sbjct: 1195 VDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKLAEMPFIG 1254
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L+V+ +++PA E WT KFGF + E+
Sbjct: 1255 TRHMYRRQGMCRRLLDGIEMILSSLKVEKLIIPAINELVDTWTSKFGFSPL--EVSDKQE 1312
Query: 914 KRCSQLVTFKGTSMLQK 930
+ ++ F GT +LQK
Sbjct: 1313 VKSINMLVFPGTGLLQK 1329
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 182/437 (41%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
LLEG GI C CC+ + +F AHAG
Sbjct: 724 LLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQ 783
Query: 567 ----RQ--YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
RQ +P NDD C IC DGGNL+ CDGCP FH C L ++P DW C
Sbjct: 784 SDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCA 843
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F ++ Q +A ++ VDS
Sbjct: 844 KCSCKFCQEHSRQ------DAQDIAEVDS------------------------------- 866
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H GC + FC C + L+NLL +
Sbjct: 867 -------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 919
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PEF I++ N ETV +D R E ++ A+++ +CF PI
Sbjct: 920 KDLEPEFSCRIIQRIHENVPETVVALDERV----------ECNSKIAVALSLMDECFLPI 969
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y +L +++A +R+ G ++AE+P +
Sbjct: 970 VDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIG 1029
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L V+ +++PA E WT KFGF +D +
Sbjct: 1030 TRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQE 1087
Query: 914 KRCSQLVTFKGTSMLQK 930
+ + ++ F GT +LQK
Sbjct: 1088 VKSTSMLVFPGTGLLQK 1104
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 182/437 (41%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
LLEG GI C CC+ + +F AHAG
Sbjct: 879 LLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQ 938
Query: 567 ----RQ--YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
RQ +P NDD C IC DGGNL+ CDGCP FH C L ++P DW C
Sbjct: 939 SDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCA 998
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F ++ Q +A ++ VDS
Sbjct: 999 KCSCKFCQEHSRQ------DAQDIAEVDS------------------------------- 1021
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H GC + FC C + L+NLL +
Sbjct: 1022 -------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 1074
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PEF I++ N ETV +D R E ++ A+++ +CF PI
Sbjct: 1075 KDLEPEFSCRIIQRIHENVPETVVALDERV----------ECNSKIAVALSLMDECFLPI 1124
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y +L +++A +R+ G ++AE+P +
Sbjct: 1125 VDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIG 1184
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L V+ +++PA E WT KFGF +D +
Sbjct: 1185 TRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQE 1242
Query: 914 KRCSQLVTFKGTSMLQK 930
+ + ++ F GT +LQK
Sbjct: 1243 VKSTSMLVFPGTGLLQK 1259
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 140/448 (31%), Positives = 203/448 (45%), Gaps = 44/448 (9%)
Query: 493 PPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHC 551
P S KS+ + T +D LH L+F E GL D T + Y G+ L +GYK G GIIC+C
Sbjct: 91 PNYSIGTKSKESKT-RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNC 149
Query: 552 CNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD 611
C+ E +PS FE HAG G+ I G+ L HK L +
Sbjct: 150 CSQEFAPSHFEEHAGM---GRRRQPYHNIYTPEGSTL---------HKLALQLQDHLNSN 197
Query: 612 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 671
+ + F L +A GR S + + R ++ E C C
Sbjct: 198 GFDNASVSSFSDYPNL---TSASGCGRQPSTTSGPIVPLK--RTLQGRVVETESCYFCGY 252
Query: 672 CDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRIN 725
+ P I+ C+QCER HV C KK + L+E ++ CC +C
Sbjct: 253 GHTTIGNIDPDMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEYVCFRFLCCQECQ--- 309
Query: 726 SVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAV 785
+L++ L + EK E +++ N + WRLLSG A+ + +L + Q +
Sbjct: 310 -LLRDRLEEGLEKCEEIAF--LRRIRSN---------ICWRLLSGMDASRDVKLFMPQVI 357
Query: 786 AIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
IF D F D S D+ MV +N + ++F GMYCA+LT ++ VVSA IL+V +
Sbjct: 358 DIFKDAFVESTDEHS--DIFSDMVNCKNGDQEKDFRGMYCALLTASTHVVSAAILKVRME 415
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
++AEL L+AT + KGYF LL IE L V + P E IW++K GF +
Sbjct: 416 QIAELVLIATRRECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEKLGFTIL 475
Query: 905 DPELLSIYRKRCSQLVTFKGTSMLQKRV 932
E + LV FK ++QK +
Sbjct: 476 SAEEKESVLES-HPLVMFKNLVLVQKSL 502
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 182/437 (41%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
LLEG GI C CC+ + +F AHAG
Sbjct: 860 LLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQ 919
Query: 567 ----RQ--YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
RQ +P NDD C IC DGGNL+ CDGCP FH C L ++P DW C
Sbjct: 920 SDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCA 979
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F ++ Q +A ++ VDS
Sbjct: 980 KCSCKFCQEHSRQ------DAQDIAEVDS------------------------------- 1002
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H GC + FC C + L+NLL +
Sbjct: 1003 -------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 1055
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PEF I++ N ETV +D R E ++ A+++ +CF PI
Sbjct: 1056 KDLEPEFSCRIIQRIHENVPETVVALDERV----------ECNSKIAVALSLMDECFLPI 1105
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y +L +++A +R+ G ++AE+P +
Sbjct: 1106 VDQRTGINLIRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIG 1165
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L V+ +++PA E WT KFGF +D +
Sbjct: 1166 TRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQE 1223
Query: 914 KRCSQLVTFKGTSMLQK 930
+ + ++ F GT +LQK
Sbjct: 1224 VKSTSMLVFPGTGLLQK 1240
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 182/437 (41%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
LLEG GI C CC+ + +F AHAG
Sbjct: 332 LLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQ 391
Query: 567 ----RQ--YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
RQ +P NDD C IC DGGNL+ CDGCP FH C L ++P DW C
Sbjct: 392 SDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCA 451
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F ++ Q +A ++ VDS
Sbjct: 452 KCSCKFCQEHSRQ------DAQDIAEVDS------------------------------- 474
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H GC + FC C + L+NLL +
Sbjct: 475 -------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 527
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PEF I++ N ETV +D R E ++ A+++ +CF PI
Sbjct: 528 KDLEPEFSCRIIQRIHENVPETVVALDER----------VECNSKIAVALSLMDECFLPI 577
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y +L +++A +R+ G ++AE+P +
Sbjct: 578 VDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIG 637
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L V+ +++PA E WT KFGF +D +
Sbjct: 638 TRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQE 695
Query: 914 KRCSQLVTFKGTSMLQK 930
+ + ++ F GT +LQK
Sbjct: 696 VKSTSMLVFPGTGLLQK 712
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 127/444 (28%), Positives = 183/444 (41%), Gaps = 106/444 (23%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP----------------------- 570
G + EG GI C CCN + FE HAG P
Sbjct: 595 GPGIKEGSIRRDGIKCMCCNEIFTVENFEVHAGSSTPLPSAHMFLKDGMSLSQCLVEFMG 654
Query: 571 ---------------GK----DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD 611
GK ++D +C++C DGG+LL CD CP ++H +C L +IP+G+
Sbjct: 655 GNKPRDPHPLHARLKGKNSDLESDSICSVCHDGGDLLLCDNCPSSYHHDCVGLEAIPEGN 714
Query: 612 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 671
WYC C+ C +C
Sbjct: 715 WYCPSCR-----------------------------------------------CSICNL 727
Query: 672 CDFS--KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQ 729
D+ S F +TI+ CDQCERE+HVGC + L P+G WFC CS + LQ
Sbjct: 728 SDYDPDTSQFTEKTIVYCDQCEREYHVGCTRNSD-NQLICRPEGCWFCSRGCSNVFQHLQ 786
Query: 730 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 789
L+ + E I K+ + D D + L AV I H
Sbjct: 787 ELIGKSVPTPIEGVSWTILKFCSGNGSDHGDYD--------DEIMADHYGKLCVAVGILH 838
Query: 790 DCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 847
+CF I++ + D+ +V+ R LR F G Y +L +S G R+ GQ+ A
Sbjct: 839 ECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFA 898
Query: 848 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DP 906
ELPL+ TS +G +LL +EKLL L V+ ++LPA E WT FGF + +
Sbjct: 899 ELPLIGTSSPYRRQGMCRLLINELEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNS 958
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQK 930
+ L + + +++F+GT+M QK
Sbjct: 959 DRLEL---AGNSILSFQGTTMCQK 979
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 129/442 (29%), Positives = 184/442 (41%), Gaps = 100/442 (22%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ----------------------- 568
+C + LLEG GI C CC+ + + +F AHAG Q
Sbjct: 891 SCSKVLLEGLVTRDGIQCRCCSKDFALLEFVAHAGGQVSKPYRNVLVDGLDKDLLHCLIS 950
Query: 569 -------------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQG 610
+P G D NDD C IC DGGNL+ CDGCP FH C L +P
Sbjct: 951 AWDKQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSD 1010
Query: 611 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 670
DW C C +F Q +N +A ++ VDS LC
Sbjct: 1011 DWRCANCCC-----KFCQEHSND-DAPDIAEVDS-----------------------LC- 1040
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN 730
C QCE +H C + + G FC C + LQN
Sbjct: 1041 ---------------TCSQCEENYHPVCSPETENPSSVPSQAGDLFCQQSCRLLFEELQN 1085
Query: 731 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 790
LL + + PEF IK + ET +D R E ++ A+++ +
Sbjct: 1086 LLAVKKDLEPEFACRIIKCIHEDVPETALALDERV----------ECNSKIAVALSLMDE 1135
Query: 791 CFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
CF PI+D +G +LI ++VY G N +F G Y IL +VSA +R+ G + AE
Sbjct: 1136 CFLPIIDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKCAE 1195
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
+P + T + +G + L IE +LS L+V+ +++PA E WT KFGF + E+
Sbjct: 1196 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLKVQKLIIPAISELVDTWTSKFGFSPL--EV 1253
Query: 909 LSIYRKRCSQLVTFKGTSMLQK 930
+ ++ F GT +LQK
Sbjct: 1254 SEKQEVKSISMLVFPGTGLLQK 1275
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 182/437 (41%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
LLEG GI C CC+ + +F AHAG
Sbjct: 195 LLEGVTTRDGINCRCCSKVFTVLEFVAHAGGPVSKPYRNVLVDGLDTDLLHCLINAWDKQ 254
Query: 567 ----RQ--YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
RQ +P NDD C IC DGGNL+ CDGCP FH C L ++P DW C
Sbjct: 255 SDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWRCA 314
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F ++ Q +A ++ VDS
Sbjct: 315 KCSCKFCQEHSRQ------DAQDIAEVDS------------------------------- 337
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H GC + FC C + L+NLL +
Sbjct: 338 -------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLAVK 390
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PEF I++ N ETV +D R E ++ A+++ +CF PI
Sbjct: 391 KDLEPEFSCRIIQRIHENVPETVVALDER----------VECNSKIAVALSLMDECFLPI 440
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y +L +++A +R+ G ++AE+P +
Sbjct: 441 VDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAEMPFIG 500
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T + +G + L IE +LS L V+ +++PA E WT KFGF +D +
Sbjct: 501 TRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLD--VSEKQE 558
Query: 914 KRCSQLVTFKGTSMLQK 930
+ + ++ F GT +LQK
Sbjct: 559 VKSTSMLVFPGTGLLQK 575
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 117/453 (25%), Positives = 188/453 (41%), Gaps = 104/453 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGR--------------------QYPGKD--- 573
+LEG+ GI C CC+ ++ S+FE HAG Q D
Sbjct: 607 MLEGWITRDGIHCGCCSKILAVSKFEIHAGSKLRQPFQNIFLNSGVSLLQCQIDAWDKQK 666
Query: 574 -----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
NDD C IC DGG+L+ CDGCP FH+ C + P GDW+C
Sbjct: 667 GAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRMFPLGDWHCPN 726
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F C +++++ +
Sbjct: 727 CTCKF------------------------------CKAVIEDVTQTV------------- 743
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
G T C CE+++H C+ K + FC C ++ ++ + +
Sbjct: 744 ---GANT---CKMCEKKYHKSCMPKANVTPADTTEPITSFCGKKCKALSEGVKKYVGVKH 797
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
E F + + + NS D+ LSG E L+ A+ + +CF PI+
Sbjct: 798 ELEAGFSWSLVHRECTNS-----DLS-----LSGHPHIVENNSKLALALTVMDECFLPII 847
Query: 797 DSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D SG +++ +++Y G N FGG Y A+L +V++ +R G +AE+P + T
Sbjct: 848 DRRSGVNIVQNVLYNCGSNFNRLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGT 907
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
+ +G + LF+ +E L L+VK +++PA + +W KFGF++++ L
Sbjct: 908 RHVYRHQGMCRRLFSVVESALQHLKVKLLIIPATADFSHVWISKFGFRQVEDSLKK--EM 965
Query: 915 RCSQLVTFKGTSMLQKRVPACRIGSSSTDSTEC 947
R L+TF G +LQK + A R S+ D T+C
Sbjct: 966 RSMNLLTFPGIDVLQKELLAPRHTESAVD-TDC 997
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/437 (29%), Positives = 184/437 (42%), Gaps = 99/437 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY--PGKD--------------------- 573
LLEG+ + GI C CC+ + S+FE HAG + P ++
Sbjct: 171 LLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLCEPSQNIILETGISLLQCQLDSWNKQE 230
Query: 574 -----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
NDD C IC DGG+L+ CDGCP FH+ C + P GDW+C Y
Sbjct: 231 ESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQKFPSGDWHCIY 290
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F G SG + +Q+ NL+ S
Sbjct: 291 CSCKF--------------CGMFSG--NTDQMN-------YNLDVNDSA----------- 316
Query: 677 SGFGPRTILLCDQCEREFHVGCLK-KHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
+L C CE ++H C + + + D P FC C + LQ LL +
Sbjct: 317 -------LLTCQLCEEKYHHMCTQGEDSILDDSSSPS---FCGKTCRELFEQLQMLLGVK 366
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
E F +++ E DI L+G E L+ A++I +CF PI
Sbjct: 367 HELEDGFSWTLVQR-----TEVGFDIS-----LNGIPQKVECNSKLAVALSIMDECFLPI 416
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD SG +LI +++Y G N + G + AIL ++SA +R+ G ++AE+P +
Sbjct: 417 VDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIG 476
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T I +G + L IE L L V+ +V+PA E WT FGFK + E+ S
Sbjct: 477 TRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPL--EVSSRKE 534
Query: 914 KRCSQLVTFKGTSMLQK 930
R ++ F GT MLQK
Sbjct: 535 MRNMNMLVFHGTDMLQK 551
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 176/391 (45%), Gaps = 85/391 (21%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
++ D +C++C D G LL CD CP AFH C L + P+GDW C C+
Sbjct: 438 EEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR------------- 484
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTILLCD 688
C +C G D + GF +TI+ C+
Sbjct: 485 ----------------------------------CGVCGGSDLDDDTAEGFTDKTIIYCE 510
Query: 689 QCEREFHVGCLKKHKMAD------LRELPKGK--------WFCCMDCSRINSVLQNLLV- 733
QCERE+HVGC+++ + R L + + W C +C + LQ L+
Sbjct: 511 QCEREYHVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPWLCSPECGEVFQHLQALVAS 570
Query: 734 QEAEKLPEFHLNAI-------KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 786
A +P + A ++Y + TV+ I RW+ AA L A+
Sbjct: 571 SRARSIPHYSRGAYHSAPCGRRRY----MSTVTRI-TRWQHEEEDAADHGQ---LCAALD 622
Query: 787 IFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
+ H+CFD +V+ + DL +V+ + LR F G Y L +++ G LRVFG
Sbjct: 623 VLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGELINVGTLRVFGN 682
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
+VAELPLV T + +G +LL +EK+L + V+ +VLPA E +WT GF +
Sbjct: 683 QVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELMPMWTASLGFHAM 742
Query: 905 D-PELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
+++ + + +++FKGT+M QK + A
Sbjct: 743 TRSDVMEMAVEHA--ILSFKGTTMCQKTLLA 771
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 162 bits (411), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 122/435 (28%), Positives = 182/435 (41%), Gaps = 96/435 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------- 566
LEG GI C CC+ ++ S+FE HAG
Sbjct: 563 LEGRLTRDGIHCSCCDEVITISKFEMHAGSRVGQPLENIYVHTGSSLLQCLLESWNKQNE 622
Query: 567 ---RQYPGKD------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+ Y D NDD C IC DGG+L+ CD CP FH+ C + P G W+C YC
Sbjct: 623 PQCKGYNFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC 682
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
+ +V G Q+T + + EA + +LC+
Sbjct: 683 ------------------SCKVCG-----QVTIGLHPMDDHHEA--AADVLCK------- 710
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
CD CE ++H C++ + + + FC C ++ LQ LL +
Sbjct: 711 ---------CDLCEEKYHPICVQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQD 759
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
F I++ SD+D L + A + L+ A+ + +CF P++D
Sbjct: 760 MKEGFSWTLIRR---------SDVDSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVID 810
Query: 798 SISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 855
SG +LI +++Y G N F G Y AIL + V+ A LR+ G E+AE+P + T
Sbjct: 811 HRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTR 870
Query: 856 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR 915
+ +G + + IE +LS L V+ +V+PA E W FGFK +D + R R
Sbjct: 871 YMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDE--TTKQRMR 928
Query: 916 CSQLVTFKGTSMLQK 930
L+ F G MLQK
Sbjct: 929 KMSLLVFPGVEMLQK 943
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 122/435 (28%), Positives = 182/435 (41%), Gaps = 96/435 (22%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------- 566
LEG GI C CC+ ++ S+FE HAG
Sbjct: 590 LEGRLTRDGIHCSCCDEVITISKFEMHAGSRVGQPLENIYVHTGSSLLQCLLESWNKQNE 649
Query: 567 ---RQYPGKD------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+ Y D NDD C IC DGG+L+ CD CP FH+ C + P G W+C YC
Sbjct: 650 PQCKGYNFVDVDVEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC 709
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
+ +V G Q+T + + EA + +LC+
Sbjct: 710 ------------------SCKVCG-----QVTIGLHPMDDHHEA--AADVLCK------- 737
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
CD CE ++H C++ + + + FC C ++ LQ LL +
Sbjct: 738 ---------CDLCEEKYHPICVQMNNASG--DDVNNPLFCGKKCQMLHERLQRLLGVRQD 786
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
F I++ SD+D L + A + L+ A+ + +CF P++D
Sbjct: 787 MKEGFSWTLIRR---------SDVDSDVSLCNEVAQKIKCNSELAVALFVMDECFLPVID 837
Query: 798 SISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 855
SG +LI +++Y G N F G Y AIL + V+ A LR+ G E+AE+P + T
Sbjct: 838 HRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFIGTR 897
Query: 856 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR 915
+ +G + + IE +LS L V+ +V+PA E W FGFK +D + R R
Sbjct: 898 YMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDE--TTKQRMR 955
Query: 916 CSQLVTFKGTSMLQK 930
L+ F G MLQK
Sbjct: 956 KMSLLVFPGVEMLQK 970
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 181/438 (41%), Gaps = 101/438 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ---------------------------- 568
LLEG+ GI C CCN S +F HAG +
Sbjct: 922 LLEGFATRDGINCSCCNEVYSVLEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNMQ 981
Query: 569 --------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+P G D NDD C IC DGGNL+ CDGCP FH C L +P W C
Sbjct: 982 SDVERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWCCA 1041
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F +H ++ E D+ + D+S
Sbjct: 1042 NCSCKF----CHEHSSDGAE-------DTAD-------------------------VDYS 1065
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
+ C QCE ++H C + G FC C + LQNLL +
Sbjct: 1066 --------LHTCSQCEEQYHEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVK 1117
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PE+ +++ + E V +D R E ++ A+++ +CF PI
Sbjct: 1118 KDLEPEYSCRVVQRIHEDVPEEVLALDKRV----------ECNSRIAVALSLMDECFLPI 1167
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
+D +G +LI ++VY G N +F G Y IL +++A +R+ G ++AE+P +
Sbjct: 1168 IDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIG 1227
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIY 912
T + +G + L IE +LS L V+ +++PA E WT KFGF + D E +
Sbjct: 1228 TRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEKQEV- 1286
Query: 913 RKRCSQLVTFKGTSMLQK 930
K S LV F GT +LQK
Sbjct: 1287 -KSISMLV-FPGTGLLQK 1302
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 181/438 (41%), Gaps = 101/438 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ---------------------------- 568
LLEG GI C CC+ +S +F AHAG +
Sbjct: 923 LLEGIVTRDGIDCSCCSKVLSVLEFVAHAGSEVNTPYRNILVDGQDIDLLHCLINAWNMQ 982
Query: 569 --------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+P G D NDD C IC DGGNL+ CDGCP FH C L +P W C
Sbjct: 983 SDAEKQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWCCA 1042
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F +H + E + VDS
Sbjct: 1043 NCSCKF----CHEHSNDGAED--TADVDS------------------------------- 1065
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
++ C QCE ++H C ++ G FC C + LQNLL +
Sbjct: 1066 -------SLHTCSQCEEQYHEACSPENDSITNLSSQTGNLFCQQSCRLLFEELQNLLAVK 1118
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
+ PE+ +++ + E V +D R E ++ A+++ +CF PI
Sbjct: 1119 KDLEPEYSCRVVQRIHEDVPEEVLPLDTRV----------ECNSKIAVALSLMDECFLPI 1168
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD +G +LI ++VY G N +F G Y IL +++A +R+ G ++AE+P +
Sbjct: 1169 VDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEMPFIG 1228
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIY 912
T + +G + L IE +LS L V+ +++PA E WT +FGF + D E +
Sbjct: 1229 TRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSEKEEV- 1287
Query: 913 RKRCSQLVTFKGTSMLQK 930
K S LV F GT +LQK
Sbjct: 1288 -KSISMLV-FPGTGLLQK 1303
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 161 bits (408), Expect = 1e-36, Method: Composition-based stats.
Identities = 101/303 (33%), Positives = 159/303 (52%), Gaps = 31/303 (10%)
Query: 643 DSVEQITKRCIRIVKNLEAEL-SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKK 701
DSV + + + L A+L S C+LC +F + GF +T+L+CDQCE+E+H+GCL+K
Sbjct: 1762 DSVTLTGRDEEHVSEALAADLASSCVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRK 1821
Query: 702 HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI 761
HKM D++ +P+G+WFC +C RI +L L +E E +GN
Sbjct: 1822 HKMVDMQAVPEGEWFCSDECVRIRELLTKSL-EEGE----------TTMSGNPA------ 1864
Query: 762 DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGG 821
RW+ + G+ T T L + I + FDPI+D+ SG DL+P MV+ + +F G
Sbjct: 1865 -YRWQFIRGRDGTKATARALKTVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQG 1923
Query: 822 MYCAILTVNSS-------VVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 874
MY +L + V AG++RV G +AE+PLVAT +G+ + L +
Sbjct: 1924 MYSILLRYRGADKEARGRPVLAGLVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHR 1983
Query: 875 LSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK---R 931
L L V+++VLPA +A W + F +D + + R +++ F TS++ + R
Sbjct: 1984 LIALGVRAMVLPATADALPAWR-QLAFMDLDEGSVRVARGE-HRMIIFPHTSVVVRQLIR 2041
Query: 932 VPA 934
VP
Sbjct: 2042 VPG 2044
Score = 44.3 bits (103), Expect = 0.32, Method: Composition-based stats.
Identities = 23/54 (42%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 522 LPDGTEVGYYACGQKLLEGY------KNGLGIICHCCNSEVSPSQFEAHAGRQY 569
L DG V Y GQ+LL G G GI+C CC+ +S S FE+HAG ++
Sbjct: 1535 LQDGERVHYTIQGQRLLSGTVVIVQRTAGSGILCDCCSKVISASAFESHAGHKH 1588
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 182/467 (38%), Gaps = 124/467 (26%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHA-------------------GRQYP--------- 570
+G+ G GI C CCN ++ ++F AHA GR
Sbjct: 421 DGFITGEGIRCTCCNKTLTVAEFAAHATARRGSDRREAWARVFLKDGRSLSQCLVELMRR 480
Query: 571 -------------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD 611
+ D +C+IC DGG+LL CD CP AFH C L + P+GD
Sbjct: 481 DVAVVAARNGDVRVKEKCSDPEGDSVCSICNDGGDLLLCDNCPSAFHHACVGLQATPEGD 540
Query: 612 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRG 671
W+C C+ C +C G
Sbjct: 541 WFCPSCR-----------------------------------------------CGVCGG 553
Query: 672 CDFSKS-----GFGPRTILLCDQCEREFHVGCLKKHKMADL-----------RELPKGKW 715
DF + GF +TI+ CDQCERE+HVGC+++ + E + W
Sbjct: 554 SDFDATAAGGGGFTDKTIIYCDQCEREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWPW 613
Query: 716 FCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATP 775
C +C + LQ L E+ G SL + R + +
Sbjct: 614 LCSPECGEVFRHLQGLAAVARERSIPIPTTVPTTVEGVSLSILRRRRRRPISMVATGSGC 673
Query: 776 ETRLL---------LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYC 824
+ L A+ + H+CF +++ + DL +V+ R LR F G Y
Sbjct: 674 QEEEEEEDAAEHGQLCSALDVLHECFVTLIEPRTQTDLTADIVFNRESELRRLNFRGYYV 733
Query: 825 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 884
L +++ G LRV G EVAELPLV T + +G LL +EK+L + V+ +V
Sbjct: 734 VGLEKAGELITVGTLRVLGTEVAELPLVGTRFAHRRQGMCHLLVTELEKVLRQVGVRRLV 793
Query: 885 LPAAEEAESIWTDKFGFKKID-PELLSIYRKRCSQLVTFKGTSMLQK 930
LPA E +WT GF + +++ I + +++F+GT+M K
Sbjct: 794 LPAVPELLPMWTASLGFHPMTRSDVMEIAAEH--AILSFQGTTMCHK 838
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 175/403 (43%), Gaps = 97/403 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--RQYPG-------------------------------- 571
GI+C+CC + S FEAHAG R P
Sbjct: 418 GIVCNCCRVNFTVSGFEAHAGCTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEANS 477
Query: 572 -KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY-CQNMFERKRFLQH 629
+ND++C+IC GG+L+ CD CP AFH C L +P GDW+C C + R + Q
Sbjct: 478 EANNDNVCSICGFGGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTCCCKICYRPKCKQE 537
Query: 630 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 689
A+ E L+C Q
Sbjct: 538 CADGNE----------------------------------------------NNFLVCVQ 551
Query: 690 CEREFHVGCLKKHKMADLR---ELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
CE++FH GC+K + + K WFC + C + L+ LL + + +
Sbjct: 552 CEQKFHFGCVKTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINWTL 611
Query: 747 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 806
+K VS D S + + + +L + A+ + ++ F+P +D++SGR+LI
Sbjct: 612 LK--------NVSSDDDGGDFTSNEFSQEKHKL--NAALGVLYEGFNPTIDALSGRELIK 661
Query: 807 SMVYGRNLRGQE--FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 864
+V+ R+ + F G Y IL V+S +R+FGQ+VAE+ VAT + + G+G
Sbjct: 662 DLVFSRDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMC 721
Query: 865 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE 907
+LL +E+ L+ L V +VL ++E+A + WT FGF ++ E
Sbjct: 722 RLLMDELEEQLTRLGVGRLVLHSSEDAINTWTKSFGFARMTSE 764
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 123/454 (27%), Positives = 177/454 (38%), Gaps = 104/454 (22%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKD-------------------------------- 573
GI CC SP FEAHAG + D
Sbjct: 279 GIKNSCCQEVFSPRGFEAHAGSSFHQSDANIFLEDEGSLLEGQRQMVHRITGKSFTKESS 338
Query: 574 -----------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
NDD+C++C GG+L+ CD CP FH+ C L +P+GDW+C C
Sbjct: 339 HGKKSNGDQCNNDDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSC----- 393
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
C RI C F +
Sbjct: 394 -----------------------------CCRI-------------CGENRFDEYSEEDN 411
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
C QCE ++HVGCL+K + L P G FC C +I LL + +P
Sbjct: 412 FKFSCHQCELQYHVGCLRKQRHVKLETYPDGTRFCSTQCEKI---FLGLLKLLGKPIPVG 468
Query: 743 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 802
N ++ D+DV A E L+ A+ + H+CF+PI + +GR
Sbjct: 469 VDNLTWTLLKPTISEWFDMDV-----PDNKALTEVYSKLNIALNVMHECFEPIKEPHTGR 523
Query: 803 DLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 860
DL+ +++ G +L+ F G Y +L N ++S +RV G++VAE+PLV T
Sbjct: 524 DLVEDVIFCRGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEKVAEVPLVGTRSQYRR 583
Query: 861 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQL 919
G ++L IEK L L V+ + LPAA W FGF K+ D E L+
Sbjct: 584 LGMCRILINEIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMTDSERLTFLD---YTF 640
Query: 920 VTFKGTSMLQKRVPACRIGSSSTDSTECVSGVEV 953
+ F+ T M QK + SS + C + + +
Sbjct: 641 LDFQDTVMCQKLLMKIPSTKSSQSTVNCAASLWI 674
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 117/446 (26%), Positives = 182/446 (40%), Gaps = 99/446 (22%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP---------------- 570
+V Y + +LEG+ GI C CC+ ++ S+FE HAG + P
Sbjct: 622 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQNIYLESGVSLLQ 681
Query: 571 ------------------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSS 606
G NDD C IC DGG+L+ CDGCP FH+ C +
Sbjct: 682 CQIDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQM 741
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
+P G+W C C F G SG + + S C
Sbjct: 742 LPPGEWRCMNCTCKF--------------CGIASGTSEKD---------------DASVC 772
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINS 726
+L +C+ CE+++H C K+ FC +C ++
Sbjct: 773 VL----------------HICNLCEKKYHDSCTKEMDTLPNNINSSSLSFCGKECKELSE 816
Query: 727 VLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVA 786
L+ L + E F + I + T D + R +S + E L+ +
Sbjct: 817 HLKKYLGTKHELESGFSWSLIHR-------TDDDSEAACRGISQRV---ECNSKLAITLT 866
Query: 787 IFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
+ +CF P++D SG +LI +++Y G N + G Y AIL +++A +R G
Sbjct: 867 VMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGT 926
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
++AE+P + T I +G + LF+ IE L L+V+ +V+PA E + WT FGF +
Sbjct: 927 QIAEMPFIGTRHIYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAEVTNTWTTVFGFTHL 986
Query: 905 DPELLSIYRKRCSQLVTFKGTSMLQK 930
D L + ++ F G MLQK
Sbjct: 987 DKSLRQ--EMKSLNMMVFPGIDMLQK 1010
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 159 bits (402), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 126/437 (28%), Positives = 181/437 (41%), Gaps = 101/437 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGR----------------------------- 567
+LEG+ GI C CC+ ++ S+FE HAG
Sbjct: 626 MLEGWITRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIFLESGLSLLQCQRDAWNRQE 685
Query: 568 ----------QYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+ G D NDD C IC DGG+L+ CDGCP FH+ C + P GDW+C
Sbjct: 686 ESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPN 745
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C + GV S++ +C+G + S
Sbjct: 746 CTCKY------------------CGVASID---------------------ICQGDNTSV 766
Query: 677 SGFGPRTILLCDQCEREFHVGC-LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
S I C CE++FH C L+ L FC C + LQ L +
Sbjct: 767 S-----EISTCILCEKKFHESCNLEMDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVK 819
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
E F + I++ T D DV R LS + E+ L+ A+ + +CF PI
Sbjct: 820 HELDAGFSWSLIRR-------TSEDSDVSVRGLSQRI---ESNSKLAVALTVMDECFLPI 869
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD SG +LI +++Y G N + G Y AIL ++SA +R G ++AE+P +
Sbjct: 870 VDRRSGINLIHNVLYNCGSNFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIG 929
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T I +G + LF IE L +V+ +++PA E W FGF ++P L
Sbjct: 930 TRHIYRRQGMCRRLFCAIESALRVFKVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQ--E 987
Query: 914 KRCSQLVTFKGTSMLQK 930
R ++ F GT MLQK
Sbjct: 988 MRLMNMLVFPGTDMLQK 1004
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 126/437 (28%), Positives = 181/437 (41%), Gaps = 101/437 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGR----------------------------- 567
+LEG+ GI C CC+ ++ S+FE HAG
Sbjct: 508 MLEGWITRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIFLESGLSLLQCQRDAWNRQE 567
Query: 568 ----------QYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+ G D NDD C IC DGG+L+ CDGCP FH+ C + P GDW+C
Sbjct: 568 ESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDILIPPPGDWHCPN 627
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C + GV S++ +C+G + S
Sbjct: 628 C------------------TCKYCGVASID---------------------ICQGDNTSV 648
Query: 677 SGFGPRTILLCDQCEREFHVGC-LKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
S I C CE++FH C L+ L FC C + LQ L +
Sbjct: 649 S-----EISTCILCEKKFHESCNLEMDTPVHSSGLVTS--FCGKSCRELFESLQKNLGVK 701
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
E F + I++ T D DV R LS + E+ L+ A+ + +CF PI
Sbjct: 702 HELDAGFSWSLIRR-------TSEDSDVSVRGLSQRI---ESNSKLAVALTVMDECFLPI 751
Query: 796 VDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVA 853
VD SG +LI +++Y G N + G Y AIL ++SA +R G ++AE+P +
Sbjct: 752 VDRRSGINLIHNVLYNCGSNFYRLNYSGFYTAILERGDEIISAATIRFHGTKLAEMPFIG 811
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYR 913
T I +G + LF IE L +V+ +++PA E W FGF ++P L
Sbjct: 812 TRHIYRRQGMCRRLFCAIESALRVFKVEKLIIPAIAELMHTWNVIFGFSPLEPSLKQ--E 869
Query: 914 KRCSQLVTFKGTSMLQK 930
R ++ F GT MLQK
Sbjct: 870 MRLMNMLVFPGTDMLQK 886
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/446 (27%), Positives = 177/446 (39%), Gaps = 109/446 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQY--------------------------PGK------- 572
GI C CC S S+FEAHAG Y GK
Sbjct: 594 GIKCSCCQEVFSLSRFEAHAGSSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKESF 653
Query: 573 ----------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
+ND +C++C GG+L+ CD CP +FHK C L GDW+C C
Sbjct: 654 SRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSC----- 708
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
C C +C F
Sbjct: 709 -----------------------------C-------------CGICGENKFDGGSEQDN 726
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPE 741
+ C QCER++HVGCL+K L P G WFC C +I LQ LL + +
Sbjct: 727 VVFSCYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDN 786
Query: 742 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 801
+K LE ID L A E L+ A+ + H+CF+P+ + +
Sbjct: 787 LTWTLLKPIRSKGLE----ID-----LPDIEALTEVYSKLNIALGVMHECFEPVKEPHTR 837
Query: 802 RDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
RD++ +++ G +L F G Y +L N ++S +RV+G++VAE+PL+ T
Sbjct: 838 RDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQYR 897
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQ 918
G +L +EK L L V+ +VLPA + WT FGF K+ D E L
Sbjct: 898 RLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRFLD---YS 954
Query: 919 LVTFKGTSMLQK---RVPACRIGSSS 941
+ F+ T M QK ++P + S+
Sbjct: 955 FLDFQDTVMCQKLLMKIPLAKSNQST 980
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/438 (27%), Positives = 179/438 (40%), Gaps = 98/438 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------RQY 569
+LEG+ GI C CC+ ++ S+FE HAG RQ
Sbjct: 9 MLEGWVTRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIYLESGASLLDCQIEAWNRQE 68
Query: 570 P------------GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
P G D NDD C +C DGG+L+ CDGCP FH+ C + +P GDW+C
Sbjct: 69 PVKRLGFQAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPN 128
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F GV S + +F +
Sbjct: 129 CSCKF------------------CGVASDK--------------------------NFQR 144
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
+L C C +++H C+++ + FC C + LQ L +
Sbjct: 145 DDTTVSKLLTCSLCVKKYHKSCMQEINTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKH 204
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
E F + I + T +D D L G E L+ ++++ +CF PIV
Sbjct: 205 ELEAGFSWSLIHR-------TDADSDTS---LQGLPQRVECNSKLAVSLSVMDECFLPIV 254
Query: 797 DSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D SG +LI +++Y G N FGG Y IL ++SA +R G +AE+P + T
Sbjct: 255 DRRSGINLIQNVLYNCGSNFNRLNFGGFYALILERGDEIISAASIRFHGTRLAEMPFIGT 314
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
+ +G + LF IE L L+V+ +++PA E WT+ FGF +D L +
Sbjct: 315 RHMYRRQGMCRRLFYAIESTLCSLKVEKLIIPAISELMHTWTEVFGFTTLDESLKQELKS 374
Query: 915 RCSQLVTFKGTSMLQKRV 932
++ F G MLQK++
Sbjct: 375 M--NMLVFPGIDMLQKQL 390
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/442 (27%), Positives = 181/442 (40%), Gaps = 108/442 (24%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ---------------------------- 568
LLEG+ GI C CC+ +S +F HAG +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNMQ 986
Query: 569 --------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+P G D NDD C IC DGGNL+ CDGCP FH C L ++P W C
Sbjct: 987 SDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCS 1046
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F H+ ++ +A + VDS
Sbjct: 1047 NCSCKF------CHEHSSDDAEDTADVDS------------------------------- 1069
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKK-HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
++ C QCE + C +A G FC C + LQNLL
Sbjct: 1070 -------SLHTCSQCEEQCTEACSPDIDSIATNLSSQTGNLFCQQSCRLLFEELQNLLAV 1122
Query: 735 EAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
+ + PE+ +++ E V +D R E ++ A+++ +CF P
Sbjct: 1123 KKDLEPEYSCRVVQRIHEEVPEEVLALDKRV----------ECNSKIAVALSLMDECFLP 1172
Query: 795 IVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
IVD +G +LI ++VY G N +F G Y IL +++A +R+ G ++AE+P +
Sbjct: 1173 IVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRIHGTKLAEMPFI 1232
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----EL 908
T + +G + L IE +LS L ++ +++PA E WT KFGF +D E+
Sbjct: 1233 GTRNMYRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEV 1292
Query: 909 LSIYRKRCSQLVTFKGTSMLQK 930
S+ ++ F GT +LQK
Sbjct: 1293 KSV------SMLVFPGTGLLQK 1308
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 188/731 (25%), Positives = 295/731 (40%), Gaps = 129/731 (17%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHAC 287
V L TGLL+G V Y K G I G C CS C ++ +FE H+
Sbjct: 246 VRGLLSTGLLEGFRVTY----KKNEVERIGRINGQGYSCGCSECGYRNIMNACEFEQHSG 301
Query: 288 KQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTFP 347
+ + +I ++G SL V++ + L ML + +S P + + K +F
Sbjct: 302 ESSNNQNNHIFLDSGISLYMVIQGLKYTKLDMLGDVIGKVISLPPNMIQYE--KWKASFQ 359
Query: 348 IT-CVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCISYQNNK 406
+ P + C + + T ++ S+ N S +++ + +
Sbjct: 360 LEKDYFDDAPS---DPCSTQSSQESNIALTDSLK----------DSTSNASSILNWSSFR 406
Query: 407 RERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVHKSS 466
R + K R +T +P L+R+ + I++ + TS+ KS
Sbjct: 407 RRSDRQFK-----------------RGGTETSTPI-LSRSPEKE--ISDLSTSTSM-KSE 445
Query: 467 QSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPK--DQRLHKLVFDESGLPD 524
++ + + +I P P + + S+ PK D LH ++F E GLPD
Sbjct: 446 ETPSENTAGLLTTDVTVIQDP----PPDHNVDSNSKDLGQPKVRDNTLHPMLFKEGGLPD 501
Query: 525 GTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHAG---RQYPGKDNDDLC 578
T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H G R+ P +
Sbjct: 502 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQPYR-----S 556
Query: 579 TICADGGNLLPC-----DGCPRAFH-KECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+DG L DG + E +L+S ++ + KR LQ
Sbjct: 557 IYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKEYSTTSRPIIVPLKRTLQERVL 616
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
VE+ C +CR I+ C+QCER
Sbjct: 617 TVES-----------------------------CYMCRKPHTVLGVISVDMIVFCNQCER 647
Query: 693 EFHVGC----LKKHK--MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
HV C L+K K + L E + + CC C + + L
Sbjct: 648 ALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLH---------------EG 692
Query: 747 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 806
+KK + ++ W+LL+G + + Q + IF D F + D I
Sbjct: 693 LKKREDIAFLRRIRYNICWQLLNGTNMRSDVQ---HQVIEIFKDAFAET--APQDIDDIR 747
Query: 807 SMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQ 865
+MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT KGYF
Sbjct: 748 NMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKKGYFS 807
Query: 866 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CSQLVT 921
LL + IE L V+ + P E IW++K G+ +LS +K LV
Sbjct: 808 LLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAHPLVM 862
Query: 922 FKGTSMLQKRV 932
F S++QK +
Sbjct: 863 FANLSLVQKSL 873
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 153/341 (44%), Gaps = 78/341 (22%)
Query: 493 PPLSFPNKS-------RWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGL 545
P L FP + + +D LH+L+F +GLPDGTE+ YY GQK+L GYK G
Sbjct: 407 PSLYFPGSATKQKKTAESGVRKRDNDLHRLLFMPNGLPDGTELAYYVKGQKILGGYKQGN 466
Query: 546 GIICHCCNSEVSPSQFEAHAG---RQYPGKD--------------------------NDD 576
GI+C CC E+SPSQFE+HAG R+ P + DD
Sbjct: 467 GIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDIAISLANGQNITTGIGDD 526
Query: 577 LCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEA 636
+C C DGG+L+ W+ + + + +Q D
Sbjct: 527 MCAECGDGGDLM-----------------------WHVWIYRILLKVLGIVQID------ 557
Query: 637 GRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR------GCDFSKSGFGP-RTILLCDQ 689
G + R R+ K E + C++CR DF+ + ++L
Sbjct: 558 ----GGNFARPTVIRLTRVGKIPEYNVGDCVVCRLNLLKFLIDFTLANSRKCLNVMLSFS 613
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAI-- 747
++EFHVGCL++ + DL E+P+ WFCC DC+ I L+N + +K+P LN I
Sbjct: 614 AKKEFHVGCLRESGLCDLEEIPEDNWFCCQDCNNIYVALRNSVSTGVQKIPASLLNIINR 673
Query: 748 KKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIF 788
K L + DV+W++L GK+ E LLS A AIF
Sbjct: 674 KHVEKGLLVDEAAYDVQWQILMGKSRNREDLSLLSGAAAIF 714
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 70/131 (53%), Gaps = 4/131 (3%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQAS-GLRGIIRDGGILC 266
++ +EL MSKK+ N P V +L TG+LD V Y I F + L GII GG LC
Sbjct: 204 ERYMELNMSKKVVPNNYPTNVKKLLATGILDRARVKY---ICFSSERELDGIIDGGGYLC 260
Query: 267 SCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQS 326
CS CN +V+ +FE HA + R + +I ENGK + +++ ++ PL M+ ++
Sbjct: 261 GCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKD 320
Query: 327 ALSSLPEEKSF 337
S E+ F
Sbjct: 321 VAGSSINEEFF 331
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/70 (52%), Positives = 47/70 (67%), Gaps = 4/70 (5%)
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
F LF+CIE+LL L V+ +VLPA AE+IWT +FGF+K+ L Y + QL FK
Sbjct: 736 FNPLFSCIERLLCSLNVEQLVLPA---AETIWTRRFGFRKMSEGQLLKYTREF-QLTIFK 791
Query: 924 GTSMLQKRVP 933
GTSML+K VP
Sbjct: 792 GTSMLEKEVP 801
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/441 (26%), Positives = 176/441 (39%), Gaps = 104/441 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------RQ- 568
+LEG+ GI C CC+ ++ S+FE HAG RQ
Sbjct: 683 MLEGWVTRDGIHCGCCSKILTVSKFEIHAGSKLRQPFQNIYLDSGVSLLECQIDAWNRQE 742
Query: 569 -----------YPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G D NDD C IC DGG+L+ CDGCP FH+ C + +P GDW+C
Sbjct: 743 SIERIGFHSVNTDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIMMLPPGDWHCPN 802
Query: 617 CQNMF---ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 673
C F + F+Q D V
Sbjct: 803 CTCKFCGIASEDFVQEDGTNVSE------------------------------------- 825
Query: 674 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
+L C C +++H CL+ + FC C + LQ L
Sbjct: 826 ----------LLTCSLCAKKYHKSCLQDVDAPCIDFNNSTPCFCGKTCRELFEQLQKYLG 875
Query: 734 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
+ E F + + + DID+ L G E L+ A+++ +CF
Sbjct: 876 IKHELESGFSWSLVHRM---------DIDLDMSL-QGLPQRVECNSKLAVALSVMDECFL 925
Query: 794 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
PIVD SG ++I +++Y G N + G Y AIL ++SA +R G ++AE+P
Sbjct: 926 PIVDRRSGINIIQNVLYNCGSNFNRLNYSGFYAAILERGDEIISAASIRFHGTQLAEMPF 985
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 911
+ T + +G + LF+ IE L L+V+ +++PA E WT FGF + L
Sbjct: 986 IGTRHVYRRQGMCRRLFSAIESALCSLKVQKLIIPAISELTHTWTGVFGFTTLSDSLKQE 1045
Query: 912 YRKRCSQLVTFKGTSMLQKRV 932
+ ++ F G MLQK++
Sbjct: 1046 LKSM--NMLVFPGIDMLQKQL 1064
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 125/439 (28%), Positives = 176/439 (40%), Gaps = 99/439 (22%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------R 567
+ +L+G GI C CC + S FEAHAG R
Sbjct: 7 RTVLKGKITTDGIQCDCCGETFAISDFEAHAGSKSCQPLKNIFLENGPSLLHCQLESWHR 66
Query: 568 Q------------YPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
Q G+D NDD C IC DGGNL+ CD CP FH+ C + +P G W C
Sbjct: 67 QDESDRKGFHFVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNC 126
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
YC F G G C
Sbjct: 127 TYCSCKF--------------CGMAGG----------------------------DACQM 144
Query: 675 SKSGFGPRTILL-CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
++ R LL C CE ++H C+ + + FC C ++ LQ LL
Sbjct: 145 DENDAAARPALLTCCLCEEKYHHSCIPAEDT--INDYHSSLSFCGKKCQELHDKLQALLG 202
Query: 734 QEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
+ E F ++++ + SDI LSG E ++ A+ I +CF
Sbjct: 203 VKHEMEEGFAWTVVRRF-----DVGSDI-----TLSGMHRKVECNSKVAVALHIMDECFL 252
Query: 794 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
P+ D SG +LI ++VY G N + G AIL V+SA +R+ G ++AE+P
Sbjct: 253 PMPDHRSGVNLIRNIVYNFGSNFNRLNYCGFLTAILERGDEVISAASIRIHGNQLAEMPF 312
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 911
+ T + +G + L IE L L V+ +V+PA E WT FGFK++ E LS
Sbjct: 313 IGTRHMYRRQGMCRRLLGAIETALCSLNVEKLVIPAISELRETWTSVFGFKQL--EGLSK 370
Query: 912 YRKRCSQLVTFKGTSMLQK 930
+ R ++V F G MLQK
Sbjct: 371 QKMRYMKMVAFPGVDMLQK 389
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/432 (26%), Positives = 179/432 (41%), Gaps = 108/432 (25%)
Query: 546 GIICHCCNSEVSPSQFEAHAGR-------------------------------------Q 568
GI C+CC + + FE+HAG +
Sbjct: 550 GIECNCCQKIFTLTGFESHAGSTNHRPAANIILEDGRSLLDCQRKKKPRIKMQRVTREAK 609
Query: 569 YPGKDN------DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
+ G+ N D +C++C DGG+L+ CD CP FHK C L IP+G+W+C C
Sbjct: 610 WKGRQNQHQGETDYICSVCHDGGDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGIC 669
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
+ +++ + R
Sbjct: 670 GENKFKYNVQEPKDSR-------------------------------------------- 685
Query: 683 TILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 741
+L CDQCER++H+GCL+ + L R+ PK WFC C I LQ LL + P+
Sbjct: 686 -LLSCDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVGPD 744
Query: 742 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 801
+ K+ + D++ +GK + L AV + H+CF+P ++ +G
Sbjct: 745 NLTWTLWKFMDSD---SCDVEAP----TGKHSK------LDLAVEVIHECFEPATETYTG 791
Query: 802 RDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
RD+ +++ R NL F G Y +L N +++ +RVFG +VAE+PLV T +
Sbjct: 792 RDIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGTRFLFR 851
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQ 918
G ++L +EK L L V+ ++LPA W + FGF K+ D E +
Sbjct: 852 RLGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYLD---HT 908
Query: 919 LVTFKGTSMLQK 930
+ F GT QK
Sbjct: 909 FLDFPGTIKCQK 920
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 121/454 (26%), Positives = 184/454 (40%), Gaps = 102/454 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP-------------------------- 570
+L+G G GI C CC+ + S FE HAG +
Sbjct: 487 VLDGEITGNGIHCGCCDKIFTISDFELHAGSKLADPLKNIYVGEGTSLLQCLLDSWNKQD 546
Query: 571 -------------GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G+D NDD C +C DGG+L+ CDGCP FH+ C + P GDW+C Y
Sbjct: 547 ESERKGFHFVDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHQGCLDIKKFPSGDWHCIY 606
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F G VSG S Q IV L
Sbjct: 607 CCCKF--------------CGSVSG--SSNQRDDNDELIVSKL----------------- 633
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKH--KMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
L C CE ++H C++ + D R++ +FC C ++ L+ LL
Sbjct: 634 --------LTCQLCEEKYHRSCIEANDANTDDSRDV----FFCGNRCQELSERLEMLLGV 681
Query: 735 EAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
+ E + I++ DV + K E L+ AV+I +CF P
Sbjct: 682 KHEMEDGYSWTFIRRS-----------DVGFDASQIKPQMVECNSKLAVAVSIMDECFMP 730
Query: 795 IVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+D SG +LI S++Y R N + G AIL ++SA +R+ G ++AE+P +
Sbjct: 731 YIDHRSGINLIHSILYNRGSNFNRLNYSGFVTAILERGDEIISAASIRIRGNQLAEMPFI 790
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
T + +G + L +E L L V+ +V+PA E WT FGF+ ++ I
Sbjct: 791 GTRYMYRRQGMCRRLLNAVEWGLGSLNVELLVIPAISELRETWTSVFGFESLESTSKQIL 850
Query: 913 RKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTE 946
+ L+ F MLQK++ ++ + + +E
Sbjct: 851 HNK--NLLVFPHVDMLQKKISKHKLAGQNLNPSE 882
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/439 (27%), Positives = 174/439 (39%), Gaps = 106/439 (24%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
L +G GI C CC + + S+F+ HAG
Sbjct: 905 LKDGNITASGIQCQCCGTTFTMSKFKCHAGLRQEVPSLNLFLDTGKSYSLCQLQAWSIEQ 964
Query: 567 --RQYPGKD--------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
R+ KD NDD C C DGG L+ CD CP ++H+ C S IP G+WYC
Sbjct: 965 KVRKEHAKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSS 1024
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C +C SK
Sbjct: 1025 CL-----------------------------------------------CDICGEVIDSK 1037
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
L C QCER++HV C+ E G WFC C I ++ +
Sbjct: 1038 ELVTSLPALDCSQCERQYHVKCVSAK--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPD 1095
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
+ ++ + T +I A E + L A +I +CF PI+
Sbjct: 1096 HMDDDLCFTVLRNNGDKKVRTAEEI----------ALMAECNMKLMIATSIMEECFLPIL 1145
Query: 797 DSISGRDLIPSMVYG-RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 855
D +G D+IPS++Y R+ + G Y +L + S+VS +R+ G +AE+PLVATS
Sbjct: 1146 DPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATS 1205
Query: 856 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR 915
K N +G + L IE++L L+V+ ++L A WT FGF++ID +KR
Sbjct: 1206 KENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDES----DKKR 1261
Query: 916 CSQ--LVTFKGTSMLQKRV 932
S+ L GT +L+K +
Sbjct: 1262 LSKVRLAAVPGTVLLKKDL 1280
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 122/447 (27%), Positives = 179/447 (40%), Gaps = 114/447 (25%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQY--------------------------PGK------- 572
GI C CC S S+FEAHAG Y GK
Sbjct: 849 GIKCSCCQEVFSLSRFEAHAGSSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKESF 908
Query: 573 ----------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
+ND +C++C GG+L+ CD CP +FHK C L ++P+GDW+C C
Sbjct: 909 SRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSC----- 963
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
C C +C F
Sbjct: 964 -----------------------------C-------------CGICGENKFDGGSEQDN 981
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE-KLPE 741
+ C QCER+ CL+K L P G WFC C +I LQ LL + +
Sbjct: 982 VVFSCYQCERQC---CLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVDN 1038
Query: 742 FHLNAIKKYAGNSLET-VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSIS 800
+K LE + DI+ A E L+ A+ + H+CF+P+ + +
Sbjct: 1039 LTWTLLKPIRSKGLEIDLPDIE----------ALTEVYSKLNIALGVMHECFEPVKEPHT 1088
Query: 801 GRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 858
RD++ +++ G +L F G Y +L N ++S +RV+G++VAE+PL+ T
Sbjct: 1089 RRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLIGTRFQY 1148
Query: 859 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCS 917
G +L +EK L L V+ +VLPA + WT FGF K+ D E L R
Sbjct: 1149 RRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERL---RFLDY 1205
Query: 918 QLVTFKGTSMLQK---RVPACRIGSSS 941
+ F+ T M QK ++P + S+
Sbjct: 1206 SFLDFQDTVMCQKLLMKIPLAKSNQST 1232
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 101/139 (72%), Gaps = 4/139 (2%)
Query: 218 KISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVI 277
K+S +K +T+ +++ L V+ ++ G SGLRG+IRD GILCSC LC G RVI
Sbjct: 2 KVSFSK---IITKKWKSHLEVWVAKRHLHGW-LLVSGLRGVIRDEGILCSCCLCEGRRVI 57
Query: 278 PPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
PS+FEIHACKQYRRA +YICFENGKSLL++LRACR PL L+AT+Q+ + S PEEK F
Sbjct: 58 SPSQFEIHACKQYRRAVEYICFENGKSLLDLLRACRGAPLHDLEATIQNIVCSPPEEKYF 117
Query: 338 ACVRCKGTFPITCVGKTGP 356
C RCKG FP +C+ + GP
Sbjct: 118 TCKRCKGRFPSSCMERVGP 136
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 164/378 (43%), Gaps = 71/378 (18%)
Query: 564 HAGRQYPGK--------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
H R+ GK +ND +C++C GG L+ CD CP +FHK C L IP GDW+C
Sbjct: 718 HKTRETTGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCP 777
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C +R ++ G D V Q+
Sbjct: 778 SCCCGICGQR------------KIDGDDEVGQL--------------------------- 798
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
L C QCE ++HV CL+ + AD+ G WFC DC +I L LL +
Sbjct: 799 ---------LPCIQCEHKYHVRCLE-NGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEP 848
Query: 736 AE-KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
+ +K +S E S S E+ L+ A+++ H+CF+P
Sbjct: 849 VSVGVDNLTWTLVKFINPDSCEHDS---------SKSDLLAESYSKLNLAISVMHECFEP 899
Query: 795 IVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+ +S++ RDL+ +++ R L F G Y +L N ++S +RV+G++VAE+PLV
Sbjct: 900 LKESLTNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPLV 959
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
T +G +L +EK L L V+ +VLPA WT FGF K+ S +
Sbjct: 960 GTRLQYRRRGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQF 1019
Query: 913 RKRCSQLVTFKGTSMLQK 930
+ F+G M QK
Sbjct: 1020 LDY--TFLDFQGAIMCQK 1035
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 123/436 (28%), Positives = 180/436 (41%), Gaps = 103/436 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAGRQY----------PGK---------------- 572
+GY GI+C CC S F+ HAG + GK
Sbjct: 955 DGYVTRDGIVCKCCTELFSVCNFKIHAGFKLNRPCRNLFMESGKSFTLCQLQAWSTEYKV 1014
Query: 573 --------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
NDD C +C DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 1015 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC- 1073
Query: 619 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 678
+ C +VK+ EA S
Sbjct: 1074 -----------------------------TCRICGDLVKDREA--------------SSS 1090
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
F L C QCE ++H+ CLK+ ++E+ FC +C I S LQ LL
Sbjct: 1091 F---LALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLL------ 1138
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 798
F + + L + D D + A E L+ A+ I +CF +VD
Sbjct: 1139 --GFVNHIADGFTWTLLRCIHD-DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDP 1195
Query: 799 ISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
+G D+IP ++Y R + F G Y +L + ++VS +RV G VAE+PL+AT +
Sbjct: 1196 RTGIDMIPHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYE 1255
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRC 916
KG +LL IEK+L ++V+ IV+ A WT FGFK ++ + + +K
Sbjct: 1256 KFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-- 1313
Query: 917 SQLVTFKGTSMLQKRV 932
L+ F GT +L+K +
Sbjct: 1314 INLMVFPGTILLKKSL 1329
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 123/436 (28%), Positives = 180/436 (41%), Gaps = 103/436 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAGRQY----------PGK---------------- 572
+GY GI+C CC S F+ HAG + GK
Sbjct: 794 DGYVTRDGIVCKCCTELFSVCNFKIHAGFKLNRPCRNLFMESGKSFTLCQLQAWSTEYKV 853
Query: 573 --------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
NDD C +C DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 854 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC- 912
Query: 619 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 678
+ C +VK+ EA S
Sbjct: 913 -----------------------------TCRICGDLVKDREA--------------SSS 929
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
F L C QCE ++H+ CLK+ ++E+ FC +C I S LQ LL
Sbjct: 930 F---LALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLL------ 977
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 798
F + + L + D D + A E L+ A+ I +CF +VD
Sbjct: 978 --GFVNHIADGFTWTLLRCIHD-DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDP 1034
Query: 799 ISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
+G D+IP ++Y R + F G Y +L + ++VS +RV G VAE+PL+AT +
Sbjct: 1035 RTGIDMIPHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYE 1094
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRC 916
KG +LL IEK+L ++V+ IV+ A WT FGFK ++ + + +K
Sbjct: 1095 KFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-- 1152
Query: 917 SQLVTFKGTSMLQKRV 932
L+ F GT +L+K +
Sbjct: 1153 INLMVFPGTILLKKSL 1168
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 156/592 (26%), Positives = 233/592 (39%), Gaps = 140/592 (23%)
Query: 393 SENDSMCISYQNNKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPW-ELTRNSSRPG 451
SEN S +Y + ER ++ L KS+ K + + W EL + S R
Sbjct: 418 SENKSEGGAYTDTSEERIRS----SIKLGGKSTKKG-------RNGADWDELHKKSKRSL 466
Query: 452 LIANSTPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQR 511
N+ P S S RK K + T+L+ + +P ++ N + K
Sbjct: 467 YYNNARP--SCGSDSHYLHGRKTKKIGRCTLLVRSSKDKKNPAINGFNP----YSGKRTL 520
Query: 512 LHKLVFDESGLPDGTEVGYYA---CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ 568
L L+ ESG+ + Y + +LEG+ GI C CC+ ++ S+FE HAG +
Sbjct: 521 LSWLI--ESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSK 578
Query: 569 Y------------------------PGKDNDDL----------------CTICADGGNLL 588
KD +L C IC DGG+L+
Sbjct: 579 SCQPFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLI 638
Query: 589 PCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQI 648
CDGCP +H+ C + +P GDW+C C
Sbjct: 639 CCDGCPSTYHQNCLGMQVLPSGDWHCPNC------------------------------- 667
Query: 649 TKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR----TILLCDQCEREFHVGCL--KKH 702
C+ CD + + G ++L C CER +H CL + H
Sbjct: 668 -------------------TCKFCDAAVASGGKDGNFISLLSCGMCERRYHQLCLNDEAH 708
Query: 703 KMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID 762
K ++ FC C + LQ L + E + + I + ++T SD +
Sbjct: 709 K---VQSFGSASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHR-----VDTDSDTN 760
Query: 763 VRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFG 820
+ A E L+ +AI +CF PIVD SG DLI +++Y G N +
Sbjct: 761 SQM-----SAQRIENNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYT 815
Query: 821 GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRV 880
G Y AIL ++SA LR G ++AE+P + T I +G + LF IE + L+V
Sbjct: 816 GFYTAILERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKV 875
Query: 881 KSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT--FKGTSMLQK 930
+ +V+PA + WT FGF +D + RK L T F G MLQK
Sbjct: 876 EKLVIPAIPDFLHAWTGNFGFTPLDDSV----RKEMRSLNTLVFPGIDMLQK 923
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 123/436 (28%), Positives = 180/436 (41%), Gaps = 103/436 (23%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAGRQY----------PGK---------------- 572
+GY GI+C CC S F+ HAG + GK
Sbjct: 891 DGYVTRDGIVCKCCTELFSVCNFKIHAGFKLNRPCRNLFMESGKSFTLCQLQAWSTEYKV 950
Query: 573 --------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
NDD C +C DGG L+ CD CP FH+ C S +P+G+WYC C
Sbjct: 951 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNC- 1009
Query: 619 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 678
+ C +VK+ EA S
Sbjct: 1010 -----------------------------TCRICGDLVKDREA--------------SSS 1026
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
F L C QCE ++H+ CLK+ ++E+ FC +C I S LQ LL
Sbjct: 1027 F---LALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLL------ 1074
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 798
F + + L + D D + A E L+ A+ I +CF +VD
Sbjct: 1075 --GFVNHIADGFTWTLLRCIHD-DQKVHSSQKLALKAECNSKLAVALTIMEECFLSMVDP 1131
Query: 799 ISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
+G D+IP ++Y R + F G Y +L + ++VS +RV G VAE+PL+AT +
Sbjct: 1132 RTGIDMIPHVLYNRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAEMPLIATYE 1191
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRC 916
KG +LL IEK+L ++V+ IV+ A WT FGFK ++ + + +K
Sbjct: 1192 KFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDEKASLKK-- 1249
Query: 917 SQLVTFKGTSMLQKRV 932
L+ F GT +L+K +
Sbjct: 1250 INLMVFPGTILLKKSL 1265
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 178/444 (40%), Gaps = 117/444 (26%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY------------------------PGK 572
+LEG+ GI C CC+ ++ S+FE HAG + K
Sbjct: 547 MLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVRAWNMQK 606
Query: 573 DNDDL----------------CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
D +L C IC DGG+L+ CDGCP +H+ C + +P GDW+C
Sbjct: 607 DATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPN 666
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C+ CD +
Sbjct: 667 C--------------------------------------------------TCKFCDAAV 676
Query: 677 SGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQN 730
+ G ++L C CER +H CL + HK ++ FC C + LQ
Sbjct: 677 ASGGKDGNSISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFCGPKCLELFEKLQK 733
Query: 731 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 790
L + E + + I + ++T SD + + A E L+ +AI +
Sbjct: 734 YLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIENNSKLAVGLAIMDE 783
Query: 791 CFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
CF PIVD SG DLI +++Y G N + G Y AIL ++SA LR G ++AE
Sbjct: 784 CFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAE 843
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
+P + T I +G + LF IE + L+V+ +V+PA + WT FGF +D +
Sbjct: 844 MPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV 903
Query: 909 LSIYRKRCSQLVT--FKGTSMLQK 930
RK L T F G MLQK
Sbjct: 904 ----RKEMRSLNTLVFPGIDMLQK 923
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 120/449 (26%), Positives = 174/449 (38%), Gaps = 127/449 (28%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP----------GK---------------- 572
+G GI+C+CC +S S F AHAGR +P GK
Sbjct: 882 DGMVTWEGIVCNCCKKNLSVSDFMAHAGRSHPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 573 --------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
++DD C C DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 619 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 678
C +C G K
Sbjct: 1002 -----------------------------------------------CQVCGGPFSEKEV 1014
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
I C QC +H C+++ K+ L + WFC C I
Sbjct: 1015 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEI-------------- 1059
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLL----SGK--------AATPETRLLLSQAVA 786
++ + G + + D D+ W +L G+ A E + L+ A+
Sbjct: 1060 -----FIGLRSHVGT--DNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALT 1112
Query: 787 IFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQ 844
+ +CF +VD +G D+IP ++Y G N ++ G Y IL ++ +RV G
Sbjct: 1113 LLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGT 1172
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
+ AELP +ATS +G ++L + IEK+L VK +VL A E S W FGFK I
Sbjct: 1173 KAAELPFIATSVDFRRQGMCRILMSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPI 1232
Query: 905 -DPELLSIYRKRCSQLVTFKGTSMLQKRV 932
D E ++ L+ F GTS+L KR+
Sbjct: 1233 EDAERKQLHN---VNLMLFPGTSLLTKRL 1258
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/447 (26%), Positives = 182/447 (40%), Gaps = 113/447 (25%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
+ EG GI C CC + S F HAG
Sbjct: 707 MFEGRITREGIRCTCCQNLYGLSGFANHAGGSSNCRPSACIFLKDGRSLLDCMMEVMQDH 766
Query: 567 -----RQYPGKD-----NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+ P D ND++C++C GG L+ CD CP A+HK C +L IP GDW+C
Sbjct: 767 RTREITEKPHNDLFEGENDNICSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPS 826
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C+ C +C +
Sbjct: 827 CR-----------------------------------------------CGICGQNKIEE 839
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
+ G L C QCE ++HV CL+ + D R K WFC +C R+ + LQNLL +
Sbjct: 840 TEDG--HFLTCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLGKPV 896
Query: 737 EKLPEFHLNAIKKYAGNSLETV----SDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 792
+ + KY + V SD+ V E LS A+++ H+CF
Sbjct: 897 LVGADNLTWTLVKYVNSETCGVGGAESDLVV------------ENYSKLSVALSVMHECF 944
Query: 793 DPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 850
+P+ + S RD++ +++ + L F G Y +L N ++S +R+FG+++AE+P
Sbjct: 945 EPLHNPFSSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVP 1004
Query: 851 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 910
LV T G ++L +EK L L V+ +VLPA WT+ FGF+++ S
Sbjct: 1005 LVGTRFQYRRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERS 1064
Query: 911 IYRKRCSQLVTFKGTSMLQK---RVPA 934
+ + F+GT M QK R P+
Sbjct: 1065 QFLD--YSFLDFQGTVMCQKLLTRFPS 1089
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/454 (27%), Positives = 178/454 (39%), Gaps = 108/454 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
L +G GI C CC + + S+F+ HAG
Sbjct: 894 LKDGNITTSGIRCRCCGTTFTMSKFKCHAGLRQEVPSLNLFLGTGKSYSLCLLQAWSIEQ 953
Query: 567 --RQYPGKD--------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
R+ KD NDD C C DGG L+ CD CP ++H+ C IP G+WYC
Sbjct: 954 KVRKERVKDTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLPCQDIPDGNWYCSS 1013
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C +C SK
Sbjct: 1014 CL-----------------------------------------------CNICGEVITSK 1026
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
L C QCER++HV C+ E G WFC C +I + ++ +
Sbjct: 1027 ELRTSLPALECSQCERQYHVKCVSAK--VSCNEDGPGTWFCGRKCQQIYMIFRSRVGVPD 1084
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
+ ++ + T +I A E + L A++I +CF PI+
Sbjct: 1085 HVDNDLSCTILRNNGDKKVRTAGEI----------ALMAECNMKLMIALSIMEECFLPIL 1134
Query: 797 DSISGRDLIPSMVYGRNLRGQEFG--GMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D +G D+IPS++Y F G Y +L + S+VS +R+ G VAE+PLVAT
Sbjct: 1135 DPRTGIDIIPSILYNWRSDFIHFNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMPLVAT 1194
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
S N +G + L IE++L L+V+ ++L A WT FGF++ID +K
Sbjct: 1195 STENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLVETWTSTFGFREIDDS----DKK 1250
Query: 915 RCS--QLVTFKGTSMLQKRVPACRIGSSSTDSTE 946
R S +L GT +L+K + C G TD E
Sbjct: 1251 RLSMVRLAAVPGTVLLKKNLCECS-GVEDTDVAE 1283
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/438 (27%), Positives = 179/438 (40%), Gaps = 105/438 (23%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY------------------------PGK 572
+LEG+ GI C CC+ ++ S+FE HAG + K
Sbjct: 530 MLEGWITREGIHCDCCSKILTVSRFEIHAGSKACQPFQNIYLESGASLLQCQVRAWNMQK 589
Query: 573 DNDDL----------------CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
D ++ C IC DGG+L+ CDGCP +H+ C + +P GDW+C
Sbjct: 590 DATNVGLHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSGDWHCPN 649
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F DA G+ G L
Sbjct: 650 CTCKF-------CDAAVASGGK-------------------------DGNFL-------- 669
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
++L C CER +H CL ++ FC C + LQ L +
Sbjct: 670 ------SLLSCSMCERRYHQLCLSDEAQK-VQSFGSASSFCGPKCLELFEKLQKYLGVKN 722
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
E + + I + ++T SDI+ + LS + ++L + +AI +CF PIV
Sbjct: 723 EIEGGYSWSLIHR-----VDTDSDINSQ---LSAQRIENNSKLAV--GLAIMDECFLPIV 772
Query: 797 DSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D SG +LI +++Y G N + G Y AIL ++SA LR G ++AE+P + T
Sbjct: 773 DRRSGVNLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGTQLAEMPFIGT 832
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
I +G + LF IE + L+V+ +V+PA + WT FGF +D + RK
Sbjct: 833 RHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV----RK 888
Query: 915 RCSQLVT--FKGTSMLQK 930
L T F G MLQK
Sbjct: 889 EMRSLNTLVFPGIDMLQK 906
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 68/93 (73%), Positives = 83/93 (89%)
Query: 845 EVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
++AELPLVATS NHGKGYFQ LF+CIE+LL+FL+VK +VLPAAEEAESIWT+KFGF++I
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 905 DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
P+ LS YR+ C Q+VTFKGTSMLQK VP+CR+
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRV 228
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 163/375 (43%), Gaps = 96/375 (25%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+NDD C +C DGG L+ CD CP FH +C L ++P+GDW+C C
Sbjct: 126 ENDDTCAVCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC--------------- 170
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT---ILLCDQ 689
C C +S + P IL CDQ
Sbjct: 171 -----------------------------------CCASC--GRSLYDPTIQTEILYCDQ 193
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAI 747
CERE+H C+ M + FC C +I L+ L+ V + + + + L
Sbjct: 194 CEREYHSNCVPGSAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRS 250
Query: 748 KKY----AGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
+ Y + LE+V+D++ R L+ A+ + +CF P++D S D
Sbjct: 251 EHYDQSEENSKLESVADLNTR----------------LALALTVIQECFRPMIDPRSNID 294
Query: 804 LIPSMVYGRNLRGQE----FGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
++ ++Y R RG++ F G Y +L ++S +RV G AE+P + T
Sbjct: 295 MVSHILYNR--RGEDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYR 352
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKR 915
+G + L I+++L L V+++VLPA E WT FGF+K+ +L+ +
Sbjct: 353 KQGMCRRLINVIQQVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLTAAQGIQLMEL---- 408
Query: 916 CSQLVTFKGTSMLQK 930
+VTF G+S+LQK
Sbjct: 409 --NIVTFPGSSVLQK 421
Score = 40.0 bits (92), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 178/444 (40%), Gaps = 117/444 (26%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY------------------------PGK 572
+LEG+ GI C CC+ ++ S+FE HAG + K
Sbjct: 9 MLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVRAWNMQK 68
Query: 573 DNDDL----------------CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
D +L C IC DGG+L+ CDGCP +H+ C + +P GDW+C
Sbjct: 69 DATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPN 128
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C+ CD +
Sbjct: 129 CT--------------------------------------------------CKFCDAAV 138
Query: 677 SGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQN 730
+ G ++L C CER +H CL + HK ++ FC C + LQ
Sbjct: 139 ASGGKDGNFISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFCGPKCLELFEKLQK 195
Query: 731 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 790
L + E + + I + ++T SD + + A E L+ +AI +
Sbjct: 196 YLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIENNSKLAVGLAIMDE 245
Query: 791 CFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
CF PIVD SG DLI +++Y G N + G Y AIL ++SA LR G ++AE
Sbjct: 246 CFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAE 305
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
+P + T I +G + LF IE + L+V+ +V+PA + WT FGF +D +
Sbjct: 306 MPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV 365
Query: 909 LSIYRKRCSQLVT--FKGTSMLQK 930
RK L T F G MLQK
Sbjct: 366 ----RKEMRSLNTLVFPGIDMLQK 385
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 178/444 (40%), Gaps = 117/444 (26%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY------------------------PGK 572
+LEG+ GI C CC+ ++ S+FE HAG + K
Sbjct: 29 MLEGWITREGIHCDCCSKILTVSRFEIHAGSKSCQPFQNIYLESGASLLQCQVRAWNMQK 88
Query: 573 DNDDL----------------CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
D +L C IC DGG+L+ CDGCP +H+ C + +P GDW+C
Sbjct: 89 DATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSGDWHCPN 148
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C+ CD +
Sbjct: 149 CT--------------------------------------------------CKFCDAAV 158
Query: 677 SGFGPR----TILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQN 730
+ G ++L C CER +H CL + HK ++ FC C + LQ
Sbjct: 159 ASGGKDGNFISLLSCGMCERRYHQLCLNDEAHK---VQSFGSASSFCGPKCLELFEKLQK 215
Query: 731 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 790
L + E + + I + ++T SD + + A E L+ +AI +
Sbjct: 216 YLGVKTEIEGGYSWSLIHR-----VDTDSDTNSQM-----SAQRIENNSKLAVGLAIMDE 265
Query: 791 CFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
CF PIVD SG DLI +++Y G N + G Y AIL ++SA LR G ++AE
Sbjct: 266 CFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQLAE 325
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
+P + T I +G + LF IE + L+V+ +V+PA + WT FGF +D +
Sbjct: 326 MPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLDDSV 385
Query: 909 LSIYRKRCSQLVT--FKGTSMLQK 930
RK L T F G MLQK
Sbjct: 386 ----RKEMRSLNTLVFPGIDMLQK 405
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 131/480 (27%), Positives = 196/480 (40%), Gaps = 129/480 (26%)
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY-------------- 569
DG+ V Y G G GIIC+CC+ +S S+F++HAG ++
Sbjct: 863 DGSVVKY---------GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGRP 913
Query: 570 --------------------------PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
NDD C IC DGG L+ CD CP FH C S
Sbjct: 914 FMLCQLQAWSTEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLS 973
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 663
+ +P+G+WYC C T R + N E E+
Sbjct: 974 IQELPEGNWYCLNC-------------------------------TCRICGDLVNFE-EI 1001
Query: 664 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK----WFCCM 719
S L C QCE+++H CLK+ R++ G WFC
Sbjct: 1002 SS---------------SSDALKCFQCEQKYHGQCLKQ------RDIDSGVESHIWFCSG 1040
Query: 720 DCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVSDIDVRWRLLSGK--AATPE 776
C +I + LQ+ L ++A G S + I ++LS A E
Sbjct: 1041 SCQKIYAALQS------------QLGLTNQFANGFSWTLLRCIHYDQKILSTARLAMMAE 1088
Query: 777 TRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVV 834
L A+ I +CF +VD +G D+IP +VY + +F G Y IL + ++
Sbjct: 1089 CNSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLL 1148
Query: 835 SAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESI 894
+RV G E+AE+PL+AT +G + L IE++L +VK +V+ A
Sbjct: 1149 CVASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVET 1208
Query: 895 WTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTECV-SGVEV 953
WT+ FGF ++ E K L+ F GT +L+K A + +T++T + SGV++
Sbjct: 1209 WTEGFGFVTVENEEKQSLHKF--NLMVFPGTVLLKK---ALYVSGQTTETTVGIHSGVQL 1263
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 130/476 (27%), Positives = 194/476 (40%), Gaps = 121/476 (25%)
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY-------------- 569
DG+ V Y G G GIIC+CC+ +S S+F++HAG ++
Sbjct: 863 DGSVVKY---------GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGRP 913
Query: 570 --------------------------PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
NDD C IC DGG L+ CD CP FH C S
Sbjct: 914 FMLCQLQAWSTEYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLS 973
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 663
+ +P+G+WYC C T R + N E E+
Sbjct: 974 IQELPEGNWYCLNC-------------------------------TCRICGDLVNFE-EI 1001
Query: 664 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 723
S L C QCE+++H CLK+ + E WFC C +
Sbjct: 1002 SS---------------SSDALKCFQCEQKYHGQCLKQRDINSGVE--SHIWFCSGSCQK 1044
Query: 724 INSVLQNLLVQEAEKLPEFHLNAIKKYA-GNSLETVSDIDVRWRLLSGK--AATPETRLL 780
I + LQ+ L ++A G S + I ++LS A E
Sbjct: 1045 IYAALQS------------QLGLTNQFANGFSWTLLRCIHYDQKILSTARLAMMAECNSR 1092
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGI 838
L A+ I +CF +VD +G D+IP +VY + +F G Y IL + ++
Sbjct: 1093 LVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLCVAS 1152
Query: 839 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 898
+RV G E+AE+PL+AT +G + L IE++L +VK +V+ A WT+
Sbjct: 1153 IRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETWTEG 1212
Query: 899 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDSTECV-SGVEV 953
FGF ++ E K L+ F GT +L+K A + +T++T + SGV++
Sbjct: 1213 FGFVTVENEEKQSLHKF--NLMVFPGTVLLKK---ALYVSGQTTETTVGIHSGVQL 1263
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 159/362 (43%), Gaps = 66/362 (18%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
NDD C C DGG L+ CD CP ++H+ C S IP G+WYC C
Sbjct: 8 NDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCLCDI------------ 55
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
G V +DS E +T +L A L C QCER+
Sbjct: 56 --CGEV--IDSKELVT--------SLPA-----------------------LDCSQCERQ 80
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+HV C+ E G WFC C I ++ + + ++
Sbjct: 81 YHVKCVSAK--VPCNEDGSGTWFCGRKCHEIYMTFRSRVGVPDHMDDDLCFTVLRNNGDK 138
Query: 754 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-R 812
+ T +I A E + L A +I +CF PI+D +G D+IPS++Y R
Sbjct: 139 KVRTAEEI----------ALMAECNMKLMIATSIMEECFLPILDPRTGIDIIPSILYNWR 188
Query: 813 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 872
+ + G Y +L + S+VS +R+ G +AE+PLVATSK N +G + L IE
Sbjct: 189 SDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATSKENRQQGMCRRLMDYIE 248
Query: 873 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ--LVTFKGTSMLQK 930
++L L+V+ ++L A WT FGF++ID +KR S+ L GT +L+K
Sbjct: 249 EMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDES----DKKRLSKVRLAAVPGTVLLKK 304
Query: 931 RV 932
+
Sbjct: 305 DL 306
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 122/442 (27%), Positives = 174/442 (39%), Gaps = 109/442 (24%)
Query: 539 EGYKNGL----GIICHCCNSEVSPSQFEAHAGRQYP----------GK------------ 572
E K+GL G++C+CC +S S F AHAG +P GK
Sbjct: 902 EVIKDGLVTWEGVVCNCCKKTLSVSGFMAHAGFSHPQSSLGLFLESGKSYTLCQVEAWSA 961
Query: 573 ------------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
++DD C C DGG LL CD CP +H C S +P+G WYC
Sbjct: 962 EFMSRRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHPACLSAKELPEGSWYC 1021
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
C C +C G
Sbjct: 1022 HNCT-----------------------------------------------CQICGGPVS 1034
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
K I C QC +H C+++ K+ L + WFC C I L++ +
Sbjct: 1035 EKEVSTFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHVGT 1093
Query: 735 EAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
E E + ++ G L +V I A E + L+ A+ + +CF
Sbjct: 1094 ENILDSELSWSILRCNNDGQKLHSVQKI----------ACLAECNMKLAVALTLLEECFI 1143
Query: 794 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
+VD +G D+IP ++Y G N ++ G Y IL ++ +RV G + AELP
Sbjct: 1144 RMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHGTKAAELPF 1203
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLS 910
+ATS +G ++L IEK+L VK +VL A E S W FGFK I D E
Sbjct: 1204 IATSVDYRRQGMCRILMNIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKPIEDAERKQ 1263
Query: 911 IYRKRCSQLVTFKGTSMLQKRV 932
++ L+ F GTS+L KR+
Sbjct: 1264 LHN---VNLMLFPGTSLLTKRL 1282
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 147/338 (43%), Gaps = 64/338 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+ND +C++C GG L+ CD CP +FHK C L +P GDW+C C
Sbjct: 428 ENDHICSVCHYGGELILCDQCPSSFHKSCLGLMDVPDGDWFCSSC--------------- 472
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C +I G L R D S G +L C QCER
Sbjct: 473 -------------------CCKIC--------GQCLKRDSDLSMEDDG---VLDCTQCER 502
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP----EFHLNAIK 748
++HV CL + L PK WFC C +I L LL +K+P +K
Sbjct: 503 KYHVVCLGNKREECLEYFPKEHWFCSKRCQQIFLGLHELL---GKKIPVGLHNLTWTLLK 559
Query: 749 KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 808
N SDI+ A E +L+ A+ + H+ FDP+ + + RDL+ +
Sbjct: 560 SIQFNDQCEASDIE----------ALSENYSMLNIALDMMHEFFDPVEEPHTKRDLLKDV 609
Query: 809 VYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQL 866
++ + L F G Y +L + +S +RV+G++VAE+PLV T G +
Sbjct: 610 IFSKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGEKVAEIPLVGTRFQYRRLGMCCI 669
Query: 867 LFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
L +EK L L V+ ++LPA A + W FGF K+
Sbjct: 670 LMNVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKL 707
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/449 (25%), Positives = 176/449 (39%), Gaps = 108/449 (24%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
LLEG GI C+CC+ S FE HAG R P
Sbjct: 572 LLEGIITKEGIRCNCCDEVFSVLDFEVHAGGNRNQPFKSLYLEGGNSLLQCLHESMNKQS 631
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G NDD C IC DGG+L+ CDGCP FH+ C + P G WYC
Sbjct: 632 ESQLKGYHFVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYN 691
Query: 617 CQNMF-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F E+ +H+ + + + LS C LC
Sbjct: 692 CSCKFCEKDEAAKHETSTLPS--------------------------LSSCRLC------ 719
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
E ++H C+ + FC C + LQ L +
Sbjct: 720 ---------------EEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGV 761
Query: 736 AEKLPE-FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
LPE F + ++++ S V+D D+ ++ ++ A ++ +CF P
Sbjct: 762 KHPLPEGFSWSFLRRFELPS--EVADCDISEKIAYNAK--------MAVAFSVMDECFSP 811
Query: 795 IVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+VD SG +L+ ++VY G N +F A+L +++ +R+ G ++AE+P +
Sbjct: 812 LVDHRSGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFI 871
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSI 911
T + +G + L IE L L+V +V+PA E WT FGF + D E +I
Sbjct: 872 GTRYMYRRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTI 931
Query: 912 YRKRCSQLVTFKGTSMLQKRVPACRIGSS 940
+ L+ F G ML K + +I S
Sbjct: 932 ---KNLNLLVFPGVDMLGKSLVKEKITDS 957
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 178/445 (40%), Gaps = 110/445 (24%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
L +G GI C CC+ + S F+ HAG ++ P
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGLRQEIPSLNLFLGSGKSYTLCQLQAWSIEH 1025
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+NDD C +C DGG L+ CD CP ++H++C IP G WYC
Sbjct: 1026 KARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY- 1084
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
RC+ C +C K
Sbjct: 1085 ----------------------------------RCL------------CDICGEVINLK 1098
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
L C QCER++H C+ + + P WFC C +I L++ +
Sbjct: 1099 ELRSSLPALECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPI 1157
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
+ F ++ + T +DI A E + L A++I +CF PI+
Sbjct: 1158 HTIDGFSCTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPII 1207
Query: 797 DSISGRDLIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
D+ +G D+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+
Sbjct: 1208 DARTGIDIIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLI 1265
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
AT N +G + L IE++L L+V+ ++L A WT FGF ID +
Sbjct: 1266 ATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLD 1321
Query: 913 RKRCS--QLVTFKGTSMLQKRVPAC 935
RK S +LV+ GT +L++ + C
Sbjct: 1322 RKNLSRLRLVSVPGTVLLKRNLYEC 1346
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 134/448 (29%), Positives = 193/448 (43%), Gaps = 82/448 (18%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAH 564
+D LH ++F E GLPD T + Y G+ L +GYK G GIIC CC+ EV +PSQFE H
Sbjct: 584 RDNTLHPMLFKEGGLPDYTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKH 643
Query: 565 AG---RQYPGKDNDDLCTICADGGNLLPC-----DGCPRAFH-KECASLSSIPQGDWYCK 615
G R+ P + +DG L DG + E +L+S ++
Sbjct: 644 VGMGRRRQPYR-----SIYTSDGLTLHELALKLQDGLSSNVNIDELPTLTSGSGKEYSTT 698
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
+ KR LQ RV V+S C +CR
Sbjct: 699 SRPIIVPLKRTLQE--------RVLTVES---------------------CYMCRKPHTV 729
Query: 676 KSGFGPRTILLCDQCEREFHVGC----LKKHK--MADLRELPKGKWFCCMDCSRINSVLQ 729
I+ C+QCER HV C L+K K + L E + + CC C + + L
Sbjct: 730 LGVISVDMIVFCNQCERALHVKCYNNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLRASLH 789
Query: 730 NLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFH 789
+KK + ++ W+LL+G + + Q + IF
Sbjct: 790 E---------------GLKKREDIAFLRRIRYNICWQLLNGTNMRSDVQ---HQVIEIFK 831
Query: 790 DCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
D F D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAE
Sbjct: 832 DAFAETAPQ--DIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAE 889
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
L L+AT KGYF LL + IE L V+ + P E IW++K G+ +
Sbjct: 890 LVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----I 944
Query: 909 LSIYRKR----CSQLVTFKGTSMLQKRV 932
LS +K LV F S++QK +
Sbjct: 945 LSDEQKHSMLMAHPLVMFANLSLVQKSL 972
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC----RVIPPSKFEIHACKQYR 291
LL GV V Y + + L G + GG C+C GC +V+ +FE HA +
Sbjct: 31 LLQGVPVTYR--FEKHNAKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGVTSK 88
Query: 292 RASQYICFENGKSLLEVLRACRSVP 316
+ +I NG+SL E+ R VP
Sbjct: 89 NQNGHIFLRNGRSLYELFHKLREVP 113
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 159/362 (43%), Gaps = 68/362 (18%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
NDD C +C DGG L+ CD CP ++H C IP G WYC C+
Sbjct: 1000 NDDTCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR--------------- 1044
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
C +C SK P C QCER+
Sbjct: 1045 --------------------------------CDVCGEVVSSKEPRTPLHAFECSQCERQ 1072
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+H+ C+ + + P G WFC C +I + L++ + +P+ HL+ ++
Sbjct: 1073 YHIKCISGKVLCNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPD-HLD--DGFSCT 1123
Query: 754 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG-- 811
L D VR + + A E + L A++I +CF PI D +G D++P ++Y
Sbjct: 1124 ILHNNGDQKVR--MAADIALLAECNMKLIIALSILEECFLPIFDPRTGMDIMPLILYNWR 1181
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
N ++ G Y +L + S++S +R+ G VAE+PL+AT N +G + + I
Sbjct: 1182 SNFVHLDYKGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGMCRRIVDYI 1241
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLVTFKGTSML 928
E++L L+V+ ++L A WT FGF+ I D + LS R L + GT +L
Sbjct: 1242 EQMLKSLKVEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKLSKIR-----LASVPGTVLL 1296
Query: 929 QK 930
+K
Sbjct: 1297 KK 1298
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 178/445 (40%), Gaps = 110/445 (24%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
L +G GI C CC+ + S F+ HAG ++ P
Sbjct: 915 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGLRQEIPSLNLFLGSGKSYTLCQLQAWSIEH 974
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+NDD C +C DGG L+ CD CP ++H++C IP G WYC
Sbjct: 975 KARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY- 1033
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
RC+ C +C K
Sbjct: 1034 ----------------------------------RCL------------CDICGEVINLK 1047
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
L C QCER++H C+ + + P WFC C +I L++ +
Sbjct: 1048 ELRSSLPALECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPI 1106
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
+ F ++ + T +DI A E + L A++I +CF PI+
Sbjct: 1107 HTIDGFSCTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPII 1156
Query: 797 DSISGRDLIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
D+ +G D+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+
Sbjct: 1157 DARTGIDIIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLI 1214
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
AT N +G + L IE++L L+V+ ++L A WT FGF ID +
Sbjct: 1215 ATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLD 1270
Query: 913 RKRCS--QLVTFKGTSMLQKRVPAC 935
RK S +LV+ GT +L++ + C
Sbjct: 1271 RKNLSRLRLVSVPGTVLLKRNLYEC 1295
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 144 bits (363), Expect = 2e-31, Method: Composition-based stats.
Identities = 74/119 (62%), Positives = 89/119 (74%), Gaps = 1/119 (0%)
Query: 818 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 877
EFGGMYCAILTV VVSA RV G+EVAELPLVAT G+GYFQ L+ CIE+LL F
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 878 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACR 936
L+V S+VLPAAE AES+W +KF F K++ E L+ + R Q++TF+GTSMLQK VP R
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELN-HLCRDFQMMTFQGTSMLQKPVPEYR 134
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1179
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 120/450 (26%), Positives = 180/450 (40%), Gaps = 100/450 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
LLEG GI C+CC+ S FE HAG R P
Sbjct: 572 LLEGIITKEGIRCNCCDEVFSVLDFEVHAGGNRNQPFKSLYLEGGNSLLQCLHESMNKQS 631
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G NDD C IC DGG+L+ CDGCP FH+ C + P G WYC
Sbjct: 632 ESQLKGYHFVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYN 691
Query: 617 CQNMF-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDF 674
C F E+ +H+ + + + LS C LC C
Sbjct: 692 CSCKFCEKDEAAKHETSTLPS--------------------------LSSCRLCEEKC-- 723
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
S P T+ C + G + + D FC C + LQ L +
Sbjct: 724 --SKHYPHTLADHQACINQ--DGTVPGERSTD--------SFCGKYCQELFEELQ-LFIG 770
Query: 735 EAEKLPE-FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
LPE F + ++++ S V+D D+ ++ ++ A ++ +CF
Sbjct: 771 VKHPLPEGFSWSFLRRFELPS--EVADCDISEKIAYNAK--------MAVAFSVMDECFS 820
Query: 794 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
P+VD SG +L+ ++VY G N +F A+L +++ +R+ G ++AE+P
Sbjct: 821 PLVDHRSGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPF 880
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLS 910
+ T + +G + L IE L L+V +V+PA E WT FGF + D E +
Sbjct: 881 IGTRYMYRRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKT 940
Query: 911 IYRKRCSQLVTFKGTSMLQKRVPACRIGSS 940
I + L+ F G ML K + +I S
Sbjct: 941 I---KNLNLLVFPGVDMLGKSLVKEKITDS 967
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 72/113 (63%), Positives = 88/113 (77%)
Query: 825 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 884
I+ N++VV+AG+LRVFG+EVAELPLVAT + KGYFQLLF+CIEKLLS L V SIV
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIV 138
Query: 885 LPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPACRI 937
+PAAEE E +W +KFGF+K+ PE LS Y K C Q+V FKG SMLQK V + +I
Sbjct: 139 VPAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQI 191
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 28/35 (80%)
Query: 707 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPE 741
L+ELPKG WFC MDC+RINS LQ LL+ AEKL +
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSD 75
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 142 bits (358), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 180/427 (42%), Gaps = 104/427 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAG---------------------------RQYPGK------ 572
G++C CCN VS S+F+ HAG +Y +
Sbjct: 657 GVVCTCCNKTVSLSEFKNHAGFNQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWRL 716
Query: 573 -------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
NDD C +C DGG L+ CD CP FH+ C S+ +P+G WYC C
Sbjct: 717 EKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWI---- 772
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
C +V + AE R DF
Sbjct: 773 --------------------------CSELVSD-NAE-------RSQDFK---------- 788
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
C QC ++H CL+ ++ R+L +FC +C ++ + L + + +
Sbjct: 789 -CSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWS 845
Query: 746 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
+K + + + + R L+ KA E L+ A++I + F +VD +G D+I
Sbjct: 846 ILKCFQEDGMVHSA------RRLALKA---ECNSKLAVALSIMEESFLSMVDPRTGIDMI 896
Query: 806 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
P ++Y G +F G Y ++ + ++S +RV G +AE+PLVAT +G
Sbjct: 897 PHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGM 956
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
++L A IE++L L+V+ +V+ A WT+ FGFK +D E ++ L+ F
Sbjct: 957 CRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFP 1014
Query: 924 GTSMLQK 930
GT++L+K
Sbjct: 1015 GTTLLKK 1021
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 142 bits (358), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 133/565 (23%), Positives = 216/565 (38%), Gaps = 143/565 (25%)
Query: 432 RNLLKTRSPWELTRNSSRPGLIAN---STPVTSVHKSSQSQRQRK--ITKKSKKTVLISK 486
+N++K R+ E+ +PG+ N S+ ++ S+++ Q K K + +L+
Sbjct: 549 KNIVKRRTNKEICSAEKKPGVSRNRSRSSKDILANRGSRNKHQNKEDRVKDRRCGLLVRG 608
Query: 487 PFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQ--KLLEGYKNG 544
+A + W T ++ + D + + +V Y + ++L G
Sbjct: 609 STHDAEDNMDGYIPYEWKRT-----VYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITR 663
Query: 545 LGIICHCCNSEVSPSQFEAHAGRQYP---------------------------------- 570
GI C CC+ ++ +FE HAG +
Sbjct: 664 EGIFCGCCSKILTVGKFELHAGSKEKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGF 723
Query: 571 -----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
G+D +DD C IC DGG+L+ CD C FH +C + +P GDWYC+ C
Sbjct: 724 YKIDKGEDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSC------- 775
Query: 625 RFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF--SKSGFGPR 682
LCR C F K P
Sbjct: 776 -------------------------------------------LCRFCGFPQEKPSSSPE 792
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLP 740
+L C QC R++H C +P FC C +I L LL
Sbjct: 793 LLLSCLQCSRKYHQTCSSGTGTDFDCTIPGTSIDCFCSPGCRKIYKRLNKLL-------- 844
Query: 741 EFHLNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQ-------AVAIFH 789
IK + + W L+ + +A P+ + ++Q A +
Sbjct: 845 -----GIKNHM--------EAGFSWSLVHCFPNDQAMPPKNKEKMAQCNSKIALAFTVLD 891
Query: 790 DCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 847
+CF P +D SG ++I ++ Y G + +F G Y IL V+SA +R+ G ++A
Sbjct: 892 ECFQPHIDERSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVISAASVRIHGTDLA 951
Query: 848 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE 907
E+P + T + +G + L IE L L V+ +V+ A E E+ WT FGFK + P
Sbjct: 952 EMPFIGTRGMYRHQGMLRRLLNGIESALCSLNVQKLVVSAVTEMENTWTTVFGFKPVQPS 1011
Query: 908 LLSIYRKRCSQLVTFKGTSMLQKRV 932
R + L+ GT +L+KR+
Sbjct: 1012 --KKQRIKSLNLLIMNGTGLLEKRL 1034
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 109/451 (24%), Positives = 177/451 (39%), Gaps = 109/451 (24%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND----------- 575
+V Y + +LEG+ GI C CC+ ++ S+FE HAG + P +
Sbjct: 637 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQNIYLESGVSLLQ 696
Query: 576 -----------------------------DLCTICADGGNLLPCDGCPRAFHKECASLSS 606
D C IC DGG+L+ CDGCP FH+ C +
Sbjct: 697 CQIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQM 756
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC 666
+P G+W+C C F C
Sbjct: 757 LPLGEWHCPNCTCKF--------------------------------------------C 772
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKG-----KWFCCMDC 721
+ G + K + +C+ CE+++H C K ++ LP FC +C
Sbjct: 773 GIASG-NSEKDDASVYVLQICNLCEKKYHDSCTK-----EMDNLPNNINTSSLSFCGKEC 826
Query: 722 SRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLL 781
++ L+ L + E F + I + +S I R E L
Sbjct: 827 KELSEHLKKYLGTKHELEAGFSWSLIHRIDEDSEAACRGISQRV----------ECNSKL 876
Query: 782 SQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGIL 839
+ A+ + +CF P++D SG +LI +++Y G N + G Y A L ++++ +
Sbjct: 877 AIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTATLERGDEIIASASI 936
Query: 840 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 899
R G ++AE+P + T + +G + LF+ IE L L+V+ +V+PA E + WT F
Sbjct: 937 RFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIPAIAELTNTWTTVF 996
Query: 900 GFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
GF +D L + ++ F G ML K
Sbjct: 997 GFTHLDESLRQ--EMKSLNMMVFPGIDMLMK 1025
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/429 (25%), Positives = 181/429 (42%), Gaps = 104/429 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAG---------------------------RQYPGK------ 572
G++C CCN VS S+F+ HAG +Y +
Sbjct: 613 GVVCTCCNKTVSLSEFKNHAGFNQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWRL 672
Query: 573 -------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
NDD C +C DGG L+ CD CP FH+ C S+ +P+G WYC C
Sbjct: 673 EKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWI---- 728
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
C +V + AE R DF
Sbjct: 729 --------------------------CSELVSD-NAE-------RSQDFK---------- 744
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
C QC ++H CL+ ++ R+L +FC +C ++ + L + + +
Sbjct: 745 -CSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSWS 801
Query: 746 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
+K + + + + R L+ KA E L+ A++I + F +VD +G D+I
Sbjct: 802 ILKCFQEDGMVHSA------RRLALKA---ECNSKLAVALSIMEESFLSMVDPRTGIDMI 852
Query: 806 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
P ++Y G +F G Y ++ + ++S +RV G +AE+PLVAT +G
Sbjct: 853 PHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYRRQGM 912
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
++L A IE++L L+V+ +V+ A WT+ FGFK +D E ++ L+ F
Sbjct: 913 CRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFP 970
Query: 924 GTSMLQKRV 932
GT++L+K +
Sbjct: 971 GTTLLKKTL 979
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 159/563 (28%), Positives = 236/563 (41%), Gaps = 120/563 (21%)
Query: 421 VSKSSSKSVSLRNLLKTRSP-WELTRNSSRPGLIANSTPVTSVHKSSQSQRQRKITKKSK 479
+ +S+S +S N R P W+ R + +T ++ +S + T S
Sbjct: 394 MKESTSNGISNLNWSAFRRPRWQYKRGGT-------ATSTQTLSRSPEKGISGLSTGTSM 446
Query: 480 KTVLISKPFENASPPL--------------SFPNKSRWNITPK--DQRLHKLVFDESGLP 523
K P EN + PL S KS+ + T K D LH+LVF E G+P
Sbjct: 447 KINTEETPSENTAGPLHSEVTIVQEPPRGHSVGPKSKESRTSKVRDNSLHQLVFKEGGVP 506
Query: 524 DGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEV--SPSQFEAHAG---RQYPGKDNDDL 577
+ T + Y G+ L +GYK G I+C CC+ EV +PS FE HAG R+ P ++
Sbjct: 507 ELTILTYKLKHGEVLKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGKRRQPYRN---- 562
Query: 578 CTICADGGNL----LPCDGCPRAFHKECASLSSIPQGD------------WYCKYCQNMF 621
+G L L G + S ++ P GD Y ++
Sbjct: 563 -IYTPEGLTLHELALKLQG---GLNSNGNSSANFPGGDEPPNLSSGSSRESSTTYRPSIV 618
Query: 622 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 681
KR LQ A+ E+ C LC +
Sbjct: 619 PLKRTLQQIADKTES-----------------------------CRLCGDACTTIGTISE 649
Query: 682 RTILLCDQCEREFHVGC----LKKHK--MADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
I+ C+QCER HV C L+K K + L E + +FCC C + + L +L +
Sbjct: 650 DMIVFCNQCERPCHVKCYNNGLQKQKGPLNVLAEYMQFHFFCCQKCQLLRASLHEVL-NK 708
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPI 795
EK+ + + Y V W++L+G + + Q + IF F
Sbjct: 709 REKIRQ-----KRSY------------VFWQILNGMNPGINVQKYIHQVIEIFKVAFPKT 751
Query: 796 VDSISGRDLIPSMVYGRNLRGQ-EFGGMYCAILTVNSS-VVSAGILRVFGQEVAELPLVA 853
S G +I MV +++ G+ +F GMYCA+LT +S VVSA +L+V +EVAEL +VA
Sbjct: 752 AASDFG--VIQDMVNAKDVGGEKDFRGMYCAVLTTSSKLVVSAAVLKVRTEEVAELVIVA 809
Query: 854 TSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPE----LL 909
T KGYF LL IE L + V+ + E ESIW+ K GF + E LL
Sbjct: 810 TCNQFRKKGYFTLLLRQIEAHLKAMNVRLLTALVDPEMESIWSKKLGFTILSGEEKETLL 869
Query: 910 SIYRKRCSQLVTFKGTSMLQKRV 932
+ LV F+ +++QK +
Sbjct: 870 EAH-----PLVMFEDLTLMQKSL 887
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 114/435 (26%), Positives = 169/435 (38%), Gaps = 114/435 (26%)
Query: 546 GIICHCCNSEVSPSQFEAHAG---------------------------RQYPGK------ 572
GIIC CC+ ++ S+F+ HAG +Y +
Sbjct: 729 GIICTCCDKVLTLSEFKFHAGFTVNRPCLNIFMESGEPFTLCLLQAWSAEYKARRSQNQA 788
Query: 573 -------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
NDD C +C +GG L+ CD CP FH C S IP GDWYC C
Sbjct: 789 VHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------- 841
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
C +C K L
Sbjct: 842 ----------------------------------------CRICGNLVIDKDTSDAHDSL 861
Query: 686 LCDQCEREFHVGCLKKHKMADLRELP-KGKWFCCMDCSRINSVLQN---LLVQEAEKLPE 741
C QCE ++H CL+ D +E+ WFC C + S LQ L+ Q A+ +
Sbjct: 862 QCSQCEHKYHEKCLEDR---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGISW 918
Query: 742 FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISG 801
L I + + + +W L T L+ A+ I +CF + D +G
Sbjct: 919 TLLRCI--HDDQKVHSA-----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTG 966
Query: 802 RDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
+IP ++Y G F G Y +L ++S +RV G VAE+PL+AT
Sbjct: 967 IHMIPQVLYNWGSEFARLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQYR 1026
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ- 918
+G +LL + IE++L +V+ +V+ A + WT FGF +D I R+R ++
Sbjct: 1027 RQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVD----DIERQRLNKI 1082
Query: 919 -LVTFKGTSMLQKRV 932
L+ F GT +L K +
Sbjct: 1083 NLMVFPGTVLLVKSL 1097
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 124/468 (26%), Positives = 184/468 (39%), Gaps = 103/468 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
LLEG GI C+CC+ S FE HAG R P
Sbjct: 669 LLEGIITKEGIRCNCCDEVFSVLDFEVHAGGKRNQPFKSLYLEGGNSLLQCLHDFMNKQS 728
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G NDD C IC DGG+L+ CDGCP FH+ C + P G WYC
Sbjct: 729 ESQHKGYHFVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCN 788
Query: 617 CQNMF-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDF 674
C F E+ HD +A+ + LS C LC C
Sbjct: 789 CSCKFCEKVEAAIHDTSALHS--------------------------LSSCRLCEEKC-- 820
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
S P T+ C + G + + D FC C + LQ LL+
Sbjct: 821 --SNHYPHTLADHQACINQ--DGTVPGERSTDS--------FCGKYCQELFEELQ-LLIG 867
Query: 735 EAEKLPE-FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
LPE F + ++++ S V+D D+ ++ ++ A ++ +CF
Sbjct: 868 VKHPLPEGFSWSFLRRFELPS--EVADCDISEKIAYNAK--------MAVAFSVMDECFS 917
Query: 794 PIVDSISGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
P+VD SG +L+ ++VY N F A+L +++ +R+ G ++AE+P
Sbjct: 918 PLVDHRSGVNLLQNIVYNFWSNFHRLNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPF 977
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSI 911
+ T + +G + L IE L L+V +V+PA E WT FGF ++
Sbjct: 978 IGTRYMYRRQGMCRRLMDGIESALGSLKVAKLVIPAVPELIDTWTSGFGFTPVNESEKKT 1037
Query: 912 YRKRCSQLVTFKGTSMLQK-----RVPACRIGSSSTDSTECVSGVEVG 954
+ L+ F G ML K ++ + SS+ DS + VE G
Sbjct: 1038 IKNL--NLLVFPGVDMLGKSLVKEQITDSIVSSSNVDSCLKLRNVEEG 1083
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 134/297 (45%), Gaps = 87/297 (29%)
Query: 489 ENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGII 548
E ASPP+ S N LHK +F GL D EVGYY GQK L G K G GI+
Sbjct: 840 EAASPPVPSRESSGAN-------LHKALFLPGGLEDDIEVGYYVKGQKFLAGLKRGAGIL 892
Query: 549 CHCCNS------------------------EVSPSQFEAHAG---RQYP--------GK- 572
C CC +S S FE HAG R+ P G+
Sbjct: 893 CSCCQQVARNGMSSILMDSVVCGLTYEGERMISCSLFEQHAGWGSRRNPYTSIYLADGRS 952
Query: 573 ------------------------DNDDLCTICADGGNLLPCDGCPRAFHKEC-ASLSSI 607
++ D C C D G+L C CP A+H++C + S
Sbjct: 953 LHDAAQSLVVEQTVKQEGNTPAKIEHLDQCVECGDRGDLQLCTRCPNAYHQDCLGKVDSY 1012
Query: 608 PQGDWYCKYCQNMF-----ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAE 662
G+++C CQ +R+R + N G + S +++T RC R+++ EA
Sbjct: 1013 SSGEFFCPDCQEQRYGGTKDRRRSM---VNRRSKGAAKTLLSKDRVTGRCTRLLQVPEAV 1069
Query: 663 -LSGCLLCRGCDFSKSGFGPRTILLCDQ----------CEREFHVGCLKKHKMADLR 708
L GC+ C+ DF+K+GFGP+T LLCDQ CERE+HVGCLKKH + DL+
Sbjct: 1070 VLGGCVFCKSGDFAKTGFGPKTTLLCDQVSVGDMKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 48/76 (63%), Gaps = 1/76 (1%)
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
G+ + L IE+LL LRV+ + LPAAE AE IW ++FGF+++ E + + + +V
Sbjct: 1477 GHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMV- 1535
Query: 922 FKGTSMLQKRVPACRI 937
F G+SML+K +P I
Sbjct: 1536 FTGSSMLEKDIPPLEI 1551
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 707 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDI----- 761
+ELP+G+WFC DC I+S+L L+ E L + ++ + K LE D
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 762 -DVRWRLLSGKAATPETRLLLSQAVAIF 788
W+LL G+ P L++ V IF
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKALAEVVQIF 1289
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 56/146 (38%), Gaps = 53/146 (36%)
Query: 215 MSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGC 274
M K++ L + P + L ++GLLDG V Y+G + L GII++GG+LC CS C G
Sbjct: 577 MVKEVLLKEAPASAKLLLQSGLLDGHHVRYLG--RGGHIMLTGIIQEGGVLCDCSSCKGV 634
Query: 275 ---------------------------------------------------RVIPPSKFE 283
+V+ S FE
Sbjct: 635 QVTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFE 694
Query: 284 IHACKQYRRASQYICFENGKSLLEVL 309
HA R S +I ENGK L ++L
Sbjct: 695 KHAGSSARHPSDFIFLENGKCLKDIL 720
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/434 (26%), Positives = 167/434 (38%), Gaps = 112/434 (25%)
Query: 546 GIICHCCNSEVSPSQFEAHAG---------------------------RQYPGKD----- 573
GIIC CC ++ S+F+ HAG +Y +
Sbjct: 729 GIICICCGKVLTLSEFKFHAGFTLNRPCLNIFMESGEPFTLCLLQAWSTEYKARKSQNQA 788
Query: 574 --------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
NDD C +C +GG L+ CD CP FH C S IP GDWYC C
Sbjct: 789 VHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------- 841
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
C +C K L
Sbjct: 842 ----------------------------------------CRICGNLVIDKDTLDAHDSL 861
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQN---LLVQEAEKLPEF 742
C QCE ++H CL+ + L WFC C + S LQ+ L+ Q A+ +
Sbjct: 862 QCSQCEHKYHEKCLEDRDKQEGAILDT--WFCGQSCQEVYSGLQSQVGLVNQVADGISWT 919
Query: 743 HLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGR 802
L I + + + +W L T L+ A+ I +CF + D +G
Sbjct: 920 LLRCI--HDDQKVHSA-----QWFALKAVCNTK-----LAVALTIMEECFVSMFDPRTGI 967
Query: 803 DLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHG 860
LIP ++Y G F G Y +L + ++S +RV G VAE+PL+AT
Sbjct: 968 HLIPQVLYNWGSEFARLNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYRR 1027
Query: 861 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQ-- 918
+G +LL IE++L +V+ +V+ A + WT FGF +D I R+R ++
Sbjct: 1028 QGMCRLLVTAIEQVLISFKVEKLVISAIPDLVETWTKGFGFIPVD----DIERQRLNKIN 1083
Query: 919 LVTFKGTSMLQKRV 932
L+ F GT +L K +
Sbjct: 1084 LMVFPGTVLLVKSL 1097
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/436 (27%), Positives = 175/436 (40%), Gaps = 99/436 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------RQ- 568
+LEG+ GI C CC+ ++ S+FE HAG RQ
Sbjct: 421 MLEGWITRDGIHCRCCSKILTVSKFEIHAGSKLRQPFQNIVLDSGVSLLQCQVDAWNRQE 480
Query: 569 -------YP-----GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+P NDD C IC DGG+L+ CDGCP FH+ C ++ +P GDW+C
Sbjct: 481 ESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPN 540
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F G G ++ + T +EL C LC
Sbjct: 541 CTCKF--------------CGMADGSNAEDDTTV----------SELVTCSLC------- 569
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
E+++H C++ FC C + LQ + +
Sbjct: 570 --------------EKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQ 615
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
E F + I + + SD VR G E+ L+ A+ + +CF IV
Sbjct: 616 ELEAGFSWSLIHR-----TDPGSDTSVR-----GFPQRVESNSKLAIALTVMDECFLSIV 665
Query: 797 DSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D S +LI +++Y R N + G Y AIL ++ A +R+ G ++AE+P + T
Sbjct: 666 DRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGT 725
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
I +G + LF IE L L+V+ +++PA E WT FGF + E
Sbjct: 726 RHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPL--EESHKQEL 783
Query: 915 RCSQLVTFKGTSMLQK 930
R ++ F GT MLQK
Sbjct: 784 RSLNMLVFPGTDMLQK 799
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/436 (26%), Positives = 174/436 (39%), Gaps = 99/436 (22%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG---------------------------RQY 569
+LEG+ GI C CC+ ++ S+FE HAG RQ
Sbjct: 589 MLEGWITRDGIHCRCCSKILTVSKFEIHAGSKLRQPFQNIVLDSGVSLLQCQVDAWNRQE 648
Query: 570 PGKDN-------------DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
+ + DD C IC DGG+L+ CDGCP FH+ C ++ +P GDW+C
Sbjct: 649 ESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGDWHCPN 708
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C F G G ++ + T +EL C LC
Sbjct: 709 CTCKF--------------CGMADGSNAEDDTTV----------SELVTCSLC------- 737
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
E+++H C++ FC C + LQ + +
Sbjct: 738 --------------EKKYHTSCIQGVDAVLSDTNNPSTSFCGQGCRELFEHLQKFIGVKQ 783
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
E F + I + + SD VR G E+ L+ A+ + +CF IV
Sbjct: 784 ELEAGFSWSLIHR-----TDPGSDTSVR-----GFPQRVESNSKLAIALTVMDECFLSIV 833
Query: 797 DSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D S +LI +++Y R N + G Y AIL ++ A +R+ G ++AE+P + T
Sbjct: 834 DRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIHGTQLAEMPFIGT 893
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
I +G + LF IE L L+V+ +++PA E WT FGF + E
Sbjct: 894 RHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFNPL--EESHKQEL 951
Query: 915 RCSQLVTFKGTSMLQK 930
R ++ F GT MLQK
Sbjct: 952 RSLNMLVFPGTDMLQK 967
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 156/376 (41%), Gaps = 84/376 (22%)
Query: 569 YPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 628
Y +ND +C+IC GG L+ CD CP +FH+ C L +P+GDW+C C + L
Sbjct: 893 YSKDENDYICSICHFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICGQNKLS 952
Query: 629 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 688
AN V+ GP L C
Sbjct: 953 EHANIVD--------------------------------------------GP--FLTCY 966
Query: 689 QCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 748
QCE ++HV CL+ K K WFC C +I LQ LL
Sbjct: 967 QCECKYHVQCLRGTK--KFGSCSKPHWFCNKHCKQIYWGLQKLL---------------- 1008
Query: 749 KYAGNSLETVSDIDVRWRLLSGKAATP------------ETRLLLSQAVAIFHDCFDPIV 796
G S+ D ++ W LL ++ E + L+ A+ + H+CF+P+
Sbjct: 1009 ---GKSIPVGGD-NLTWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHECFEPVR 1064
Query: 797 DSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
+ + RD++ +++ R L+ F G Y +L N +++ +RV+G++VAE+PLV T
Sbjct: 1065 EQHTRRDIVEDVIFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAEVPLVGT 1124
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRK 914
G +L +E+ L L V+ +VLPA WT FGF K+ S +
Sbjct: 1125 RFQYRRLGMCHILMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSERSEFLN 1184
Query: 915 RCSQLVTFKGTSMLQK 930
+ F+ T M QK
Sbjct: 1185 Y--TFLNFQETVMCQK 1198
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 160/378 (42%), Gaps = 89/378 (23%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+D+D +C++C D G LL CD CP FH C L S PQGDW+C C
Sbjct: 408 EDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPACT------------- 454
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF-------SKSGFGP-RT 683
C +C D + GF R
Sbjct: 455 ----------------------------------CAICGSSDLDDPPATTTTQGFSSDRM 480
Query: 684 ILLCDQCEREFHVGCLKK------HKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
++ C+QC RE+HVGC+++ + AD +G W C CS+I L+ L V +A
Sbjct: 481 VISCEQCRREYHVGCMRERDNGLWYPEAD----GEGPWLCSEACSKIYLRLEELAVVQAP 536
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
+G SL V R + + E L A+ + +CF +++
Sbjct: 537 --------CRSVASGLSL-------VVLRRGAARDGEEEEHAKLCMALDVLRECFVTLIE 581
Query: 798 SISGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 855
+ DL +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 582 PRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTR 641
Query: 856 KINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRK 914
+G +LL I+KLL + V+ +VLPA E + WT FGF+ E+ R+
Sbjct: 642 FARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFR----EMGQADRQ 697
Query: 915 RCSQ--LVTFKGTSMLQK 930
+ ++ F+GT M K
Sbjct: 698 DVAHHAILRFQGTIMCHK 715
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 145/352 (41%), Gaps = 90/352 (25%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFL 627
+ND C++C GG L+ CDGCP AFH C L +P GDW+ C C F +
Sbjct: 701 ENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT-- 758
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
NA E +S C
Sbjct: 759 --STNAKEEKFIS----------------------------------------------C 770
Query: 688 DQCEREFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
QCE ++H CL+ D L ++ KWFC DC I +L +L+ + E
Sbjct: 771 KQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE--------- 821
Query: 747 IKKYAGNSLETVSDIDVRWRLL------------SGKAATPETRLLLSQAVAIFHDCFDP 794
VS + WRL+ S A E +LS A+ + H+ F+P
Sbjct: 822 -----------VSVEKLTWRLVQSLEPNMYGDDASKIEAAAENHCILSVALDVMHELFEP 870
Query: 795 IVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+ GRDL +++ R + F G Y +L N+ +VS +R+ G++VAE+P +
Sbjct: 871 VKRPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFI 930
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
T + +G ++L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 931 GTRFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 982
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 132/262 (50%), Gaps = 26/262 (9%)
Query: 654 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC-------LKKHKMAD 706
R ++ + C CR D P TI C+QCER HV C +KK +
Sbjct: 63 RTLQERVVQTESCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 707 LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWR 766
L+E ++ CC +C + + L+ V++ E++ L I+ ++ WR
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG--VEKGEEIA--FLRQIRS------------NICWR 166
Query: 767 LLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQ-EFGGMYCA 825
LLS A+ + +L +SQA+ IF D F D+ S D+ MVYG+N G+ +F GMYC
Sbjct: 167 LLSKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGMYCV 224
Query: 826 ILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVL 885
+LT ++ VVSA IL+V ++ AEL L+AT KGYF+LL IE L V ++
Sbjct: 225 VLTASTHVVSAAILKVRVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSLLMA 284
Query: 886 PAAEEAESIWTDKFGFKKIDPE 907
P E IW++K GF + E
Sbjct: 285 PVDPEMAQIWSEKLGFTILSAE 306
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/438 (26%), Positives = 172/438 (39%), Gaps = 97/438 (22%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ-----------------------YPG 571
+ +L+G GI C CC + FE+HAG + +
Sbjct: 7 RTVLKGKITTDGIQCDCCGETFAILDFESHAGSKSCQPLKNICLENGHSLLQCQLESWNK 66
Query: 572 KDNDD-----------------LCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
+D D C IC DGGNL+ CD CP FH+ C + P G W C
Sbjct: 67 QDESDRKGFHFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNC 126
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
YC F ++G D+ + D
Sbjct: 127 TYCSCKF---------------CGMAGGDTCQM-------------------------DE 146
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
+ + P +L C CE ++H C+ + + FC C + LQ LL
Sbjct: 147 NDTAAQP-ALLACCLCEEKYHHSCILAENTVN--DGYSSVSFCGKKCQELYDKLQALLGV 203
Query: 735 EAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
+ E F ++++ + SDI LSG E ++ A+ I +CF P
Sbjct: 204 KHEMEEGFAWTLVRRF-----DVGSDIS-----LSGMHRKVECNSKVAVALHIMDECFLP 253
Query: 795 IVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+ D SG +LI ++VY G N + G AIL ++SA +R+ G +AE+P +
Sbjct: 254 MPDHRSGVNLIRNIVYNFGSNFNRLNYSGFLTAILERGDEIISAASIRIHGNHLAEMPFI 313
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
T + +G + L + IE L L V+ +V+PA E WT FGFK ++ S
Sbjct: 314 GTRHMYRRQGMCRRLLSAIETALCSLNVEKLVIPAISELRETWTSVFGFKPLEGS--SKQ 371
Query: 913 RKRCSQLVTFKGTSMLQK 930
+ R ++V F G MLQK
Sbjct: 372 KMRNMKMVAFPGIDMLQK 389
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 162/385 (42%), Gaps = 79/385 (20%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
+DD C +CADGG LL CD CP FH C ++ +P+G W C YC
Sbjct: 511 SDDACGVCADGGELLCCDSCPSTFHPACLAMK-VPEGLWACHYC---------------- 553
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
RC+ + N + LS C C +
Sbjct: 554 -----------------RCVLCMANDDQGLS---------------------RCQHCTLK 575
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+H C + +++ R +C C ++++ L +++ F +K
Sbjct: 576 YHEIC--RPSLSNGR---GNGAYCSETCKKVSAQLSDMIGITNHTEDGFSWALLKIQKDE 630
Query: 754 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--G 811
+ + + DV E + L+ A+ + ++CF+P+ D + D++ VY G
Sbjct: 631 PVSSQNSPDVL-----------ECNVKLAVALGVLNECFNPVKDRRTKIDMLHQAVYSLG 679
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
+ + G Y +L N ++SA +LR+ G +VAE+P T +G + L +
Sbjct: 680 SEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRKQGMMRRLVNAV 739
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 931
E++L+ ++V+ +V+PA W F FK +DPEL R+R LV GT++LQK
Sbjct: 740 EQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPELKEEIRRR--SLVVITGTTLLQKP 797
Query: 932 VPACRIGSSS----TDSTECVSGVE 952
V A SS T++ SG E
Sbjct: 798 VVAAPPSPSSLHKQTEAAAAKSGAE 822
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 117/444 (26%), Positives = 169/444 (38%), Gaps = 113/444 (25%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP----------GKD------------- 573
L +G+ GI+C CC+ +S S F+AHA P GK
Sbjct: 577 LKDGWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSLNLCLQSGKSFTLCQIEAWNAEY 636
Query: 574 -----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
NDD C C DGG LL CD CP +H+ C S+ +P WYC
Sbjct: 637 MDRRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHN 696
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C +C GC ++
Sbjct: 697 CI-----------------------------------------------CRIC-GCPVTE 708
Query: 677 ---SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLV 733
S F I+ C QC H C++ A E+ +WFC C I L +
Sbjct: 709 KEISSFS--AIIKCLQCGAAHHDTCVEMGATA-FEEMDSDEWFCGTHCKEIYLGLHGCVG 765
Query: 734 QEAEKLPEFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 792
E+ ++ G + +V I A E L+ A+ + +CF
Sbjct: 766 VESSLGDGLSWTILRCNSGGQKMHSVQKI----------AHAIECNSKLAVALTLMEECF 815
Query: 793 DPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 850
+VD+ +G ++IP ++Y G + G Y IL ++ A +RV G + AELP
Sbjct: 816 AQMVDTRTGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEEILCAASIRVHGMKAAELP 875
Query: 851 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 910
+AT + + KG + L IE++L VK +VL A E S W FGFK I+
Sbjct: 876 FIATCREHRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELVSTWVSGFGFKPIE----E 931
Query: 911 IYRKRCS--QLVTFKGTSMLQKRV 932
RK+ L+ F GTS+L K +
Sbjct: 932 YERKQLDTINLMLFPGTSLLIKSL 955
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 149/360 (41%), Gaps = 86/360 (23%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
ND +C++C GG L+ CD CP +FHK C + +P GDW+C C
Sbjct: 698 NDHVCSVCHYGGELILCDHCPSSFHKRCLGMKDVPDGDWFCPSC---------------- 741
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
++ G + +++ TK I V N C QCE +
Sbjct: 742 --CCKICGQNKLKKDTKDFIDGVLN----------------------------CTQCEHQ 771
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+H+ CL ++ PK FC C + + + KL F +
Sbjct: 772 YHIMCLSNSWTDKWKDHPKENSFCSKKC-------EVYMQSDQHKLDAFDDETL------ 818
Query: 754 SLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--G 811
+ET S + + A+ + H+CF+PI + +GRDL+ +++ G
Sbjct: 819 -VETYSKLKI--------------------ALDVVHECFEPIEEPRTGRDLMKDVIFSNG 857
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
L F G Y +L N +VS +R+ G +VAE+PLV T G ++L +
Sbjct: 858 SELNRLNFQGFYTILLEKNDELVSVATVRIHGDKVAEIPLVGTRFQFRQLGMCRILMDVL 917
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 930
EK L L V+ +VLPA + WT FGF K+ D E L + F+ T M QK
Sbjct: 918 EKKLMELGVQRLVLPAVPGVLNTWTGSFGFSKMTDSERLQFVD---YTFLDFQDTVMCQK 974
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 117/483 (24%), Positives = 178/483 (36%), Gaps = 144/483 (29%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ----------------------------- 568
LEG G GI C CCN V+ S F AHAG +
Sbjct: 840 LEGKITGDGIHCGCCNEIVTISDFGAHAGSKQSDPLKNIYTEEETSLLQCLLDSWNKQDE 899
Query: 569 ----------YPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
G+D NDD C +C DGG+L+ CDGCP FHK C + P GDW+C YC
Sbjct: 900 SELKSFHFFDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAYC 959
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
F R+ G S + +
Sbjct: 960 CCKF---------------CRLVGGSSNQSVV--------------------------ND 978
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
F +L C CE +FH+ C++ + + K FC C ++ L+ LL + E
Sbjct: 979 EFTMPALLTCHLCEEKFHISCVEAN--GGKTDDSKDALFCGNKCQELSERLEMLLGVKHE 1036
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
F + I++ SD+ L + + ++L + A++I ++CF P +D
Sbjct: 1037 IEDGFSWSFIRR---------SDVGCDLSLTNPQLVECNSKLAV--ALSIMNECFMPYID 1085
Query: 798 SISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATS 855
SG +L+ S++Y G N + ++ G IL ++ +RV G +AE+P + T
Sbjct: 1086 HRSGTNLLRSILYNCGSNFKRLDYSGFITVILERGDEIICVASIRVHGNRLAEMPYIGTR 1145
Query: 856 KINHGKGYFQLLFACIE------------------------------------------- 872
+ +G + L IE
Sbjct: 1146 YMYRRQGMCRRLLNAIESEAVYGLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLRGNDW 1205
Query: 873 ---KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQ 929
K LS L V+ +V+PA E WT FGF+ + I L+ F +LQ
Sbjct: 1206 GGPKALSSLDVELLVIPAISELRETWTSVFGFEPLKQTSKQITNNM--NLLVFPHVDLLQ 1263
Query: 930 KRV 932
K++
Sbjct: 1264 KKI 1266
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/438 (24%), Positives = 162/438 (36%), Gaps = 111/438 (25%)
Query: 542 KNGL----GIICHCCNSEVSPSQFEAHAG------------------------------- 566
K+GL GI+C CCN +S + F+ HAG
Sbjct: 788 KDGLIKKEGIMCKCCNMVLSVTNFKNHAGFKQSRPCLNVFMKSGKPFTLCQLQAWSAEYK 847
Query: 567 ----------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
R +NDD C +C DGG L+ CD CP FH+ C S +P+G WYC
Sbjct: 848 TRKSRTIKVVRTADDDENDDSCGLCGDGGELICCDNCPSTFHQACLSTEELPEGSWYCPN 907
Query: 617 CQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
C C +C K
Sbjct: 908 CT-----------------------------------------------CWICGELVNDK 920
Query: 677 SGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEA 736
C QCE ++H C K + + WFC C + LQ+ +
Sbjct: 921 EDINSSNAFKCSQCEHKYHDSCWKNKTIG--KGGASDTWFCGGSCQAVYFGLQSRVGI-- 976
Query: 737 EKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIV 796
+N I +L D + A E L+ A+ I +CF +V
Sbjct: 977 -------INHIADGVCWTLLKCIHEDQKVHSAQRLALKAECNSKLAVALTIMEECFQSMV 1029
Query: 797 DSISGRDLIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
D +G D+IP ++Y N R + F G Y +L + ++S +R+ G VAE+PL+
Sbjct: 1030 DPRTGIDMIPHVLY--NWRSEFARLNFHGFYTVVLEKDDVLLSVASIRIHGATVAEMPLI 1087
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIY 912
AT +G + L IE++L +V+ +V+ A + WT+ FGF + +
Sbjct: 1088 ATCSNYRRQGMCRRLMTAIEEMLISFKVEKLVVSAIPDLVETWTEGFGFTPMSNDEKQSL 1147
Query: 913 RKRCSQLVTFKGTSMLQK 930
K L+ F GT +L+K
Sbjct: 1148 NK--INLMVFPGTILLKK 1163
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 108/452 (23%), Positives = 178/452 (39%), Gaps = 118/452 (26%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG------------------------------ 566
L++G G+ C CCN ++ FE HAG
Sbjct: 129 LMKGRARREGVRCECCNEVMTVWDFETHAGSVLQRPYEHIHVARSNSSLLQCQFEVWQSN 188
Query: 567 -------------RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
R +DD C ICADGG+L+ C+ C H +C L IPQGDW
Sbjct: 189 VEVERRTFNEIVPRNGASDKHDDACLICADGGDLICCEKCWSTSHLKCMGLERIPQGDWI 248
Query: 614 CKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD 673
C YC + K C + K+L+
Sbjct: 249 CPYC------------------------------VCKHCNKNDKDLQT------------ 266
Query: 674 FSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC-MDCSRINSVLQNLL 732
C QC++++H CL +K DL G+ C C + LQ+L+
Sbjct: 267 -------------CVQCDKKYHCQCLVSNKELDLN--ASGETLACDSHCGEVYEKLQSLV 311
Query: 733 VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCF 792
+ E F +++ ++L+ D+ + E ++ A + +CF
Sbjct: 312 GVKHELEGGFCWTLLQRMEPDNLD-FKDLHL----------ITECNSKIALAWEVLDECF 360
Query: 793 DPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELP 850
I+D + +++ S+ Y R NL F G Y AIL N ++SA +RV G ++AE+P
Sbjct: 361 TTIIDRHTQINVVQSVAYSRGSNLNRINFRGFYTAILEKNDDIISAATIRVHGTDLAEMP 420
Query: 851 LVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLS 910
+ T + G ++L +E + S + V+ +++P+ +E +W K GF I+ +
Sbjct: 421 FIGTRHLYRQNGMSRMLLVTLESIFSVMGVEHLIIPSVQELTEMWEGKCGFSPIED---A 477
Query: 911 IYRKRCS-QLVTFKGTSMLQKRVPACRIGSSS 941
+ +K + +TF LQK + + SSS
Sbjct: 478 VSQKITNWNTLTFPSAVRLQKALLSTPASSSS 509
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 168/445 (37%), Gaps = 135/445 (30%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--------------------------------------- 566
GI+C+CC +S S F+AHAG
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLGLFLQSGKSYTLCQVEAWSAELMSRRSDAYG 1026
Query: 567 RQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
R+ D NDD C C DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 1027 RKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYCHNC-------- 1078
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
+ C V E E+S FS IL
Sbjct: 1079 ----------------------TCRSCGNPVN--EKEVSS--------FSD-------IL 1099
Query: 686 LCDQCEREFHVGCLKKHKMADLRELP-KGK----WFCCMDCSRINSVLQNLLVQEAEKLP 740
C QC +H C+ D LP GK WFC C I L + +
Sbjct: 1100 KCLQCGDAYHNTCI------DRVMLPSDGKRSDTWFCGRYCKEIFMGLHSQV-------- 1145
Query: 741 EFHLNAIKKYAGNSLETVSDIDVRW----------RLLSGK--AATPETRLLLSQAVAIF 788
+E V + D+ W RL S + E L+ A+ +
Sbjct: 1146 -------------GVENVINNDLSWTILRCNSDGQRLHSAQKIGLMTECNTKLAVALTLL 1192
Query: 789 HDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEV 846
+CF +VD +G D+IP ++Y G N ++ G Y IL ++ +R+ G +
Sbjct: 1193 EECFIRMVDPRTGVDMIPHVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIRLHGTKA 1252
Query: 847 AELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP 906
AELP +ATS +G + L IEK+L V+ +VL A E + W FGFK I+
Sbjct: 1253 AELPFIATSVDYRRQGMCRRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFGFKPIED 1312
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQKR 931
+ + R L+ F G S+L KR
Sbjct: 1313 D--EKKQLRNVNLMLFPGASLLTKR 1335
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/428 (24%), Positives = 175/428 (40%), Gaps = 124/428 (28%)
Query: 534 GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG----RQY-------------------- 569
G+++L+G G GI C CCN+ ++ S+F+ HAG R Y
Sbjct: 179 GEQILQGVLTGDGIWCSCCNTVITVSEFQLHAGDEPNRPYQRIFISETGLSLLTCQAEAW 238
Query: 570 ---------------PGKD----NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQG 610
P +D DD C +CADGGNL+ CD CP +H C + PQG
Sbjct: 239 NQQGIPELQGYHLIEPREDVSDKYDDACVVCADGGNLICCDKCPSTYHISCLQMEDEPQG 298
Query: 611 DWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 670
+W C C C C
Sbjct: 299 EWRCPAC-----------------------------------------------ACKFCH 311
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHK--MADLR-ELPKGKW-FCCMDCSRINS 726
F S + C QC++++H C ++++ + DL + P FC CS+I
Sbjct: 312 THAFDIS------VFTCSQCDKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYE 365
Query: 727 VLQNLL---VQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQ 783
L+ L+ + E L L + AG LE D R L ++
Sbjct: 366 KLERLVGVRNELDEGLTWTLLRRMDPEAGVYLEESYD-----RTLCNSK--------IAV 412
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRV 841
AVA+ +CF+P++D + +++ S++Y G N F G Y AIL +S +R+
Sbjct: 413 AVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVASMRI 472
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
G ++AE+P +AT + ++ L C + L++ V+ +V+P+ E+ W + +GF
Sbjct: 473 HGNKLAEMPFIAT------RPSYRRLGMCHKLLVAIESVQYLVIPSIEQRVRRWEESYGF 526
Query: 902 KKIDPELL 909
+ I+ +++
Sbjct: 527 QAIENKVM 534
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 162/364 (44%), Gaps = 72/364 (19%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
ND +C++C GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 354 NDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC---------------- 397
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
K C ++ + + ++S F S + C QCE+
Sbjct: 398 --------------CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQN 429
Query: 694 FHVGCLKKHKMADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 751
H+GC+K ++ + + + + WFC C I+ LQNLL ++ +P
Sbjct: 430 VHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------V 476
Query: 752 GNSLETVSDIDVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 809
G++ E ++ W L+ + R L++A+ + H F P+ D I+ DLI +
Sbjct: 477 GDARENLT-----WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVF 531
Query: 810 YGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
+ + F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L
Sbjct: 532 LSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRL 591
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTS 926
+E L + VK + LPA EA + WT FGF K+ D + L + + + F+ T
Sbjct: 592 LNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTV 648
Query: 927 MLQK 930
QK
Sbjct: 649 RCQK 652
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 162/364 (44%), Gaps = 72/364 (19%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
ND +C++C GG L+ CD CP AFH C + IP G+WYC C
Sbjct: 331 NDCICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSC---------------- 374
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
K C ++ + + ++S F S + C QCE+
Sbjct: 375 --------------CCKICGQVTYDFDDQVSS--------FDTS------FVRCVQCEQN 406
Query: 694 FHVGCLKKHKMADL--RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 751
H+GC+K ++ + + + + WFC C I+ LQNLL ++ +P
Sbjct: 407 VHIGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLLWKQ---IP----------V 453
Query: 752 GNSLETVSDIDVRWRLLSG--KAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 809
G++ E ++ W L+ + R L++A+ + H F P+ D I+ DLI +
Sbjct: 454 GDARENLT-----WTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVF 508
Query: 810 YGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
+ + F G Y AIL ++VV+ +RV+G EVAE+PLVAT G + L
Sbjct: 509 LSKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRL 568
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTS 926
+E L + VK + LPA EA + WT FGF K+ D + L + + + F+ T
Sbjct: 569 LNELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK---YTFLGFQHTV 625
Query: 927 MLQK 930
QK
Sbjct: 626 RCQK 629
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/340 (26%), Positives = 137/340 (40%), Gaps = 92/340 (27%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY-----CKYCQNMFERKRFL 627
+ND C++C GG L+ CDGCP AFH C L +P GDW+ C C F +
Sbjct: 692 ENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQFFLKTT-- 749
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
NA E +S C
Sbjct: 750 --STNAKEEKFIS----------------------------------------------C 761
Query: 688 DQCEREFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNA 746
QCE ++H CL+ D L ++ KWFC DC S+ N+ +A K+
Sbjct: 762 KQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDCE--ESLEPNMYGDDASKI------- 812
Query: 747 IKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIP 806
A E +LS A+ + H+ F+P+ GRDL
Sbjct: 813 -------------------------EAAAENHCILSVALDVMHELFEPVKRPHGGRDLAE 847
Query: 807 SMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYF 864
+++ R + F G Y +L N+ +VS +R+ G++VAE+P + T + +G
Sbjct: 848 DVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGTRFQHRQRGMC 907
Query: 865 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
++L +EK+L L V+ +VLPA + W + FGF K+
Sbjct: 908 RVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 947
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 157/368 (42%), Gaps = 79/368 (21%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
+DD C +C DGG LL CD CP FH C ++ +PQG W C YC
Sbjct: 452 SDDACGVCGDGGELLCCDSCPSTFHPACLAMK-VPQGWWACHYC---------------- 494
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
RC+ + N + LS C C +
Sbjct: 495 -----------------RCVLCMANDDQGLS---------------------TCQHCSLK 516
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYA 751
+H C ++ +++ R + +C C ++++ L +++ E + L I+K
Sbjct: 517 YHEVC-RRPSLSNGRGIGA---YCSETCKKVSARLSDMVGVTNHTEDGFSWALLKIQKD- 571
Query: 752 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY- 810
E VS D AA E + L+ A+ + ++CF+P D + D++ VY
Sbjct: 572 ----EAVSSQDT--------AAVLECNVKLAVALGVLNECFNPAKDRRTKIDMLHQAVYS 619
Query: 811 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 869
G + + G Y +L + ++A +LR+ G +VAE+P AT +G + L
Sbjct: 620 LGSEFKRVSYEGFYTMVLDKDGETIAAALLRIHGTKVAEMPFAATLPAYRKQGMMRRLVN 679
Query: 870 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK-IDPELLSIYRKRCSQLVTFKGTSML 928
+E++L+ ++V +V+PA WT F F+ +DPE R+R LV GT++L
Sbjct: 680 AVEQVLASVQVDKLVIPAIAALVDTWTRSFSFRPLLDPESREEIRRR--SLVVIAGTTLL 737
Query: 929 QKRVPACR 936
K V A R
Sbjct: 738 HKPVAAAR 745
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 1/65 (1%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR 567
K + + D L DG +V Y K++ G G+ C CC++ V FEAHAGR
Sbjct: 303 KRHTILAWLIDAGFLSDGEKVFYVPVDGKVVSGAVTRTGVHCGCCDAVVPLPSFEAHAGR 362
Query: 568 QYPGK 572
PG+
Sbjct: 363 D-PGR 366
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 110/452 (24%), Positives = 168/452 (37%), Gaps = 106/452 (23%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--------------------------------------- 566
GI+C CCN +S ++F++HAG
Sbjct: 150 GIMCKCCNMVLSVTKFKSHAGFKLNRPCSNLFMESGKPFTLCQLQAWSAEYKSRKSGTQV 209
Query: 567 -RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
R NDD C +C DGG L+ CD CP FH+ C +P+G WYC C
Sbjct: 210 VRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPEGSWYCPNCT------- 262
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
C +C K
Sbjct: 263 ----------------------------------------CWICGDLVNDKEASSSVGAY 282
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
C QCE ++H C + + + L WFC C + S L + + N
Sbjct: 283 KCLQCEHKYHGACQQGKQTHE--GLVSDAWFCSGSCQEVYSGLHSRVGIN---------N 331
Query: 746 AIKKYAGNSLETVSDIDVRWRLLSGK--AATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
I G + I ++LS + A E L+ A+ I +CF +VD +G D
Sbjct: 332 PIAD--GFCWTLLRCIHEDQKVLSAQRLALKAECNSKLAVALTIMEECFQSMVDPRTGID 389
Query: 804 LIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
+IP +Y G + F G Y +L + +VSA +RV G VAE+PL+AT +
Sbjct: 390 MIPHALYNWGSDFARLNFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSNYRRQ 449
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
G + L IE++L +V+ +V+ A + WT FGF + + K +
Sbjct: 450 GMCRHLMTAIEEMLISYKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSLNK--INFMV 507
Query: 922 FKGTSMLQKRVPACRIGSSSTDSTECVSGVEV 953
F GT +L+K++ + + +D + EV
Sbjct: 508 FPGTILLKKQLYKTKEADTQSDWGDAAPLTEV 539
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/288 (31%), Positives = 135/288 (46%), Gaps = 33/288 (11%)
Query: 654 RIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADL 707
R ++ E C C + P TI+ C+QCER H+ C KK + L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 708 RELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRL 767
+E + CC +C + + L+ ++K G + ++ WRL
Sbjct: 130 KEYMCFHFLCCQECQSLRARLEE---------------GLEKCVGITFLRRIRSNICWRL 174
Query: 768 LSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMYCAI 826
LSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMYCA+
Sbjct: 175 LSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCAL 232
Query: 827 LTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLP 886
LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V + P
Sbjct: 233 LTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAP 292
Query: 887 AAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 930
E IW++K GF + E +L + LV FK ++QK
Sbjct: 293 VDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 335
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 147/374 (39%), Gaps = 63/374 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+NDD C C DGG LL CD CP +H+ C S +P+G WY C N R N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GN 54
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
+ VS + IL C QC
Sbjct: 55 PLSEKEVSTFSA---------------------------------------ILKCLQCGD 75
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYA 751
+H C+ + +M + WFC C I L N + E E + +K
Sbjct: 76 SYHDTCIDQ-EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTD 134
Query: 752 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY- 810
G L + I A E L+ A+ I +CF +VD +G D+IP ++Y
Sbjct: 135 GQKLHSSKKI----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVLYN 184
Query: 811 -GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 869
G N ++ G Y IL ++ +RV G + AELP +ATS +G + L
Sbjct: 185 KGSNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMD 244
Query: 870 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQ 929
IE +L V+++VL A E + W FGFK I+ R L+ F GTS+L
Sbjct: 245 TIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLT 302
Query: 930 KRVPACRIGSSSTD 943
KR+ S D
Sbjct: 303 KRLDGITAAKSEED 316
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/427 (25%), Positives = 164/427 (38%), Gaps = 115/427 (26%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQ------------------------------------- 568
GI C CCN + ++FEAHAG +
Sbjct: 800 GIQCDCCNKTFTSAEFEAHAGGKSCQPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFHF 859
Query: 569 --YPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
G+D NDD C IC DGG+L+ CD CP FH+ C + P G W+C YC F
Sbjct: 860 IDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKF---- 915
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
G V G C R + ++ ++
Sbjct: 916 ----------CGMVGG--------NTCQR-----DGNMAAV--------------SHALV 938
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLN 745
C CE ++H C ++ + + P FC +C + LQ L + E F
Sbjct: 939 TCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQELYERLQMLFGVKQELEAGFSWT 996
Query: 746 AIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
++++ + SDI V SG + + ++ A+ I +CF P+VD SG +LI
Sbjct: 997 FVRRF-----DVSSDISV-----SGMSWKVDCNSKVAVALQIMDECFVPMVDHKSGVNLI 1046
Query: 806 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
++VY G N + G + A+L +++A +R F +P+ S ++ G
Sbjct: 1047 RNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF----YSMPVSFHSSLSMG--- 1099
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFK 923
L L V +V+PA E WT FGFK ++ I R ++ F
Sbjct: 1100 -----------LCSLNVGKLVIPAISELTGTWTSVFGFKHLEGSDKQIMRNM--NMMVFP 1146
Query: 924 GTSMLQK 930
G MLQK
Sbjct: 1147 GVDMLQK 1153
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 180/451 (39%), Gaps = 126/451 (27%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYPGKD---------------------- 573
LEG GI C CC+ ++ ++FE HAG Q P ++
Sbjct: 232 LEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVDAWKKQS 291
Query: 574 ----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+DD C IC DGG+LL CD CP FH C + +P GDW+C+ C
Sbjct: 292 QSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLGIK-MPSGDWHCRSC 350
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
F S ++IT AEL CL
Sbjct: 351 ICRF--------------------CGSTQEITTS--------SAELLSCL---------- 372
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQE 735
QC R++H C ++ FC C +I L+ LL +
Sbjct: 373 -----------QCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK 421
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK-AATPETRLLL-------SQAVAI 787
NAI+ AG S V R K AA P+ + L + A ++
Sbjct: 422 ---------NAIE--AGFSWSLV-------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSV 463
Query: 788 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
+CF P +D SG ++I +++Y G + F Y IL V+SA +R+ G +
Sbjct: 464 MDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTD 523
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
+AE+P + T I +G L IE LS L V+ +V+PA E ++ WT FGFK ++
Sbjct: 524 LAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVE 583
Query: 906 PELLSIYRKRCSQL--VTFKGTSMLQKRVPA 934
P R++ L + GT +L+KR+ A
Sbjct: 584 PS----KRQKIKSLNILIIHGTGLLEKRLLA 610
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 179/451 (39%), Gaps = 126/451 (27%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYPGKD---------------------- 573
LEG GI C CC+ ++ ++FE HAG Q P ++
Sbjct: 655 LEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVDAWKKQS 714
Query: 574 ----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+DD C IC DGG+LL CD CP FH C + +P GDW+C C
Sbjct: 715 QSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLGI-KMPSGDWHCSSC 773
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
F S ++IT AEL CL
Sbjct: 774 ICRF--------------------CGSTQEITTSS--------AELLSCL---------- 795
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQE 735
QC R++H C ++ FC C +I L+ LL +
Sbjct: 796 -----------QCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK 844
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK-AATPETRLLL-------SQAVAI 787
NAI+ AG S V R K AA P+ + L + A ++
Sbjct: 845 ---------NAIE--AGFSWSLV-------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSV 886
Query: 788 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
+CF P +D SG ++I +++Y G + F Y IL V+SA +R+ G +
Sbjct: 887 MDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTD 946
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
+AE+P + T I +G L IE LS L V+ +V+PA E ++ WT FGFK ++
Sbjct: 947 LAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVE 1006
Query: 906 PELLSIYRKRCSQL--VTFKGTSMLQKRVPA 934
P R++ L + GT +L+KR+ A
Sbjct: 1007 PS----KRQKIKSLNILIIHGTGLLEKRLLA 1033
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 179/451 (39%), Gaps = 126/451 (27%)
Query: 538 LEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYPGKD---------------------- 573
LEG GI C CC+ ++ ++FE HAG Q P ++
Sbjct: 654 LEGRITRDGIHCGCCSKILTVAKFELHAGSKEQQPYENIFLEDGGATLSQCLVDAWKKQS 713
Query: 574 ----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+DD C IC DGG+LL CD CP FH C + +P GDW+C C
Sbjct: 714 QSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCPSTFHLACLGI-KMPSGDWHCSSC 772
Query: 618 QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
F S ++IT AEL CL
Sbjct: 773 ICRF--------------------CGSTQEITTSS--------AELLSCL---------- 794
Query: 678 GFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQE 735
QC R++H C ++ FC C +I L+ LL +
Sbjct: 795 -----------QCSRKYHQVCAPGTMKDSVKAESNSSTDCFCSPGCRKIYKHLRKLLGVK 843
Query: 736 AEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGK-AATPETRLLL-------SQAVAI 787
NAI+ AG S V R K AA P+ + L + A ++
Sbjct: 844 ---------NAIE--AGFSWSLV-------RCFPDKLAAPPKGKAHLIHCNSKTAVAFSV 885
Query: 788 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
+CF P +D SG ++I +++Y G + F Y IL V+SA +R+ G +
Sbjct: 886 MDECFLPRIDERSGINIIHNVIYNCGSDFNRLNFSKFYTFILERGDEVISAAAVRIHGTD 945
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
+AE+P + T I +G L IE LS L V+ +V+PA E ++ WT FGFK ++
Sbjct: 946 LAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVFGFKPVE 1005
Query: 906 PELLSIYRKRCSQL--VTFKGTSMLQKRVPA 934
P R++ L + GT +L+KR+ A
Sbjct: 1006 PS----KRQKIKSLNILIIHGTGLLEKRLLA 1032
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/431 (24%), Positives = 166/431 (38%), Gaps = 106/431 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--RQYPGKD------------------------------ 573
GI C CC+ ++ ++FE HAG Q P +
Sbjct: 780 GICCDCCSKILTVAKFELHAGSKEQQPYANIFLEDGGLSLFQCLLNAWDKQAQNEKKGFY 839
Query: 574 --------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
+DD C IC DGG+LL CD C FH C + +P GDW+C+
Sbjct: 840 KIDPADDPDDDTCGICGDGGDLLCCDRCTSTFHVACLGIE-MPSGDWFCR---------- 888
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
CI C C + S P +L
Sbjct: 889 -------------------------NCI------------CKFCGSAEERTSS--PAELL 909
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQNLLVQEAEKLPEFH 743
C QC R++H C + + + P FC C++I L+ LL + + F
Sbjct: 910 SCLQCSRKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTKIYKRLKRLLGLKNDLEAGFS 969
Query: 744 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
+ ++ +A D KA + A ++ +CF P +D SG +
Sbjct: 970 WSLVRCFA----------DTEATSTKKKAQLVHCNSKTALAFSVLDECFLPRIDERSGIN 1019
Query: 804 LIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
+I ++VY G + F G Y IL V+SA +R+ G + AE+P + T + +
Sbjct: 1020 IIHNVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVRIHGTDFAEMPFIGTRGMYRHQ 1079
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
G L IE L L V+ +V+PA E ++ W+ FGFK + P + + L+
Sbjct: 1080 GMCHRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFGFKPVGP--TKKQKIKSVNLLI 1137
Query: 922 FKGTSMLQKRV 932
GT +L+KR+
Sbjct: 1138 IHGTGLLEKRL 1148
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 125/456 (27%), Positives = 169/456 (37%), Gaps = 114/456 (25%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYP----------GK----------------------- 572
GI+C+CC S S F+ H G P GK
Sbjct: 956 GILCNCCTKTFSISDFKVHGGCSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDASG 1015
Query: 573 -------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
+NDD C C DGG LL CD CP +H+ C S +P+G W YC
Sbjct: 1016 RKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSW---YC-------- 1064
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
H+ G E E+S FS IL
Sbjct: 1065 ---HNCTCRSCGNPLS------------------EKEVST--------FS-------AIL 1088
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGK-----WFCCMDCSRINSVLQNLLVQEAEKLP 740
C QC +H C+ D LP G WFC C I L N + E
Sbjct: 1089 KCLQCGDSYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDN 1142
Query: 741 EFHLNAIK-KYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 799
E + +K G L + I A E L+ A+ I +CF +VD
Sbjct: 1143 ELSWSILKCNTDGQKLHSSKKI----------AHMTECNTKLAVALTILEECFVRMVDPR 1192
Query: 800 SGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
+G D+IP ++ N ++ G Y IL ++ +RV G + AELP +ATS
Sbjct: 1193 TGVDMIPHVL--SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYR 1250
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL 919
+G + L IE +L V+++VL A E + W FGFK I+ + R L
Sbjct: 1251 RQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDN--EKKQLRNVNL 1308
Query: 920 VTFKGTSMLQKRVPACRIGSSSTD-STECVSGVEVG 954
+ F GTS+L KR+ S D VSG+ G
Sbjct: 1309 MLFPGTSLLTKRLDGITAAKSEEDKDAYNVSGLPNG 1344
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 142/335 (42%), Gaps = 56/335 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+ND +C++C GG L+ CDGCP AFH C L +P GDW+C+ C + FL+ +
Sbjct: 701 ENDVICSVCHYGGKLILCDGCPSAFHANCLGLEEVPDGDWFCESCCCGACGQFFLKATSK 760
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
+ + + C+ C+
Sbjct: 761 YAKEEKF---------------------------ISCKQCEL------------------ 775
Query: 693 EFHVGCLKKHKMAD-LRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYA 751
++H CL+ D L KWFC DC I L L+ K E + +
Sbjct: 776 KYHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELI----GKPREVGVEKLTWRL 831
Query: 752 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 811
S E D +++ A E LS A+ + H+ F+P+ GRDL +++
Sbjct: 832 VQSFEPNMYGDDAYKI----EAVAENHCKLSVALDVMHELFEPVKRPHGGRDLAEDVIFS 887
Query: 812 R--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFA 869
R + F G Y +L N +V+ +R+ G++VAE+P + T + +G ++L
Sbjct: 888 RWSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFIGTRFQHRQRGMCRVLIN 947
Query: 870 CIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
+EK+L L V+ +VLPA + W + FGF K+
Sbjct: 948 ELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKM 982
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 161/615 (26%), Positives = 245/615 (39%), Gaps = 93/615 (15%)
Query: 228 VTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCN-GCRVIPPSKFEIHA 286
V L TGLL+G V YM K + + G I G C CS CN ++ +FE H
Sbjct: 69 VRGLLSTGLLEGFRVTYM---KDEVEEV-GRINGQGYSCGCSKCNYNSNIMNACEFEEHY 124
Query: 287 CKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKGTF 346
+ + +I + G SL V+ A + L ML ++ + P + + K +F
Sbjct: 125 GQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYN--KWKASF 182
Query: 347 P-----ITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSSENDSMCIS 401
+ V G C + G M Y+ ++ S VS S N + S
Sbjct: 183 QKRKDYLDAVASDG----CLTQSSQGLAAGEMIYSLRDYLKDS----VSNSISNLNWSAS 234
Query: 402 YQNNKRERKKTRKLLEADLVSKSSSKS---VSLRNLLKTRSPWELTRNSSRP----GLIA 454
+ + R ++ S S K S K + + N+ P G+ +
Sbjct: 235 KRRSGRRFRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHSENTGDPLSIDGVKS 294
Query: 455 NS---TPVTSVHKSSQSQRQRKITKKSKKTVLISKPFENASPPL---SFPNKSRWNITPK 508
+S T VT+ H S + + +S P + PL S +KS+ + T +
Sbjct: 295 DSPLPTAVTTNHSKHDS---------TNLGLSLSSPVKITQRPLRNCSIDSKSKESKT-R 344
Query: 509 DQRLHKLVFDESGLPDGTEVGY-YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAG- 566
D LH L+F E GL D T + Y G+ L +GYK G IIC+CCN E SPS FE HAG
Sbjct: 345 DTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGM 404
Query: 567 --RQYPGKDNDDLCTICADGGNLLPCDGC-PRAFHKECASLSSIPQGDWYCKYCQNMFER 623
R+ P + L + L D P F + AS+SS+ D+
Sbjct: 405 GRRRQPYHNIYTLEGLSLHKLALQLQDHLNPNGF--DNASVSSV--SDY----------- 449
Query: 624 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRT 683
H+ + GR S + + R ++ E C C + P T
Sbjct: 450 -----HNLTSSGCGREPSTTSGPIVPLK--RTLQERVVETESCYFCGYGHTTIGNINPDT 502
Query: 684 ILLCDQCEREFHVGCL------KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAE 737
I+ C+QCER H+ C KK + L+E + CC +C + + L+
Sbjct: 503 IIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQSLRARLE-------- 554
Query: 738 KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVD 797
++K G + ++ WRLLSG A+ + +L + Q + IF D F D
Sbjct: 555 -------EGLEKCVGITFLRRIRSNICWRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTD 607
Query: 798 SISGRDLIPSMVYGR 812
S D+I MV G+
Sbjct: 608 EHS--DIISDMVNGK 620
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 110/225 (48%), Gaps = 15/225 (6%)
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
F ++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 211 FADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILV 266
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 798
IK + E D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 267 GQNLTCTLIKPMQYQA-EDREDYDL--------AAMAENYSKLSVALEVMHECFDPVKDP 317
Query: 799 ISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 318 KTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRF 377
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
+ G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 378 QHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 30/40 (75%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
ND +C+IC GG+L+ CD CP +FH C ++ S+P+GDW+
Sbjct: 151 NDYVCSICHYGGDLICCDRCPSSFHAACLNIESVPEGDWF 190
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 150/384 (39%), Gaps = 64/384 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+NDD C C DGG LL CD CP +H+ C S +P+G WY C N R N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWY---CHNCTCRSC-----GN 54
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
+ VS + IL C QC
Sbjct: 55 PLSEKEVSTFSA---------------------------------------ILKCLQCGD 75
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYA 751
+H C+ + +M + WFC C I L N + E E + +K
Sbjct: 76 SYHDTCIDQ-EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTD 134
Query: 752 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 811
G L + I A E L+ A+ I +CF +VD +G D+IP ++
Sbjct: 135 GQKLHSSKKI----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL-- 182
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
N ++ G Y IL ++ +RV G + AELP +ATS +G + L I
Sbjct: 183 SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTI 242
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 931
E +L V+++VL A E + W FGFK I+ R L+ F GTS+L KR
Sbjct: 243 EMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKR 300
Query: 932 VPACRIGSSSTD-STECVSGVEVG 954
+ S D VSG+ G
Sbjct: 301 LDGITAAKSEEDKDAYNVSGLPNG 324
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 111/225 (49%), Gaps = 15/225 (6%)
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
F ++L C QCER+FH C K+ M E WFCC C + LQ LL +
Sbjct: 211 FADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQLLGKPILV 266
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDS 798
IK + + V D D+ AA E LS A+ + H+CFDP+ D
Sbjct: 267 GHNLTCTLIKPMQYQAEDRV-DYDL--------AAMAENYSKLSVALEVMHECFDPVKDP 317
Query: 799 ISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSK 856
+ RDL+ +++ G NL F G Y +L N +++ +LR++G +VAE+PL+ T
Sbjct: 318 KTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDKVAEMPLIGTRF 377
Query: 857 INHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
+ G ++L IEK L L V+ +VLPA+ + WT FGF
Sbjct: 378 QHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGF 422
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 29/40 (72%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
ND +C+IC GG+L+ CD CP +FH C ++ +P+GDW+
Sbjct: 151 NDYVCSICHYGGDLICCDRCPSSFHATCLNIERVPEGDWF 190
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 152/362 (41%), Gaps = 89/362 (24%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
++D C IC +GG L+ C+ CP FH EC SL +P+ W+
Sbjct: 267 NSDQCCGICNEGGELVCCETCPLTFHMECVSLLEVPKDAWF------------------- 307
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C R CL C + P C+QCER
Sbjct: 308 -------------------CFR-----------CLCCHCGE-------PLRTQPCEQCER 330
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG 752
FH GC +A G +F C +S N+ + AE +
Sbjct: 331 CFHPGCCDDAILA-------GDFFFC------SSGCWNIFQRLAEMVA------------ 365
Query: 753 NSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR 812
++ + ++ W LL + LL++A+ + FDP++D + D + +MV+ R
Sbjct: 366 -TVNPLGRSELSWSLLRRGRCDDK---LLAEALQVISSRFDPVLDCWTQLDYLDAMVFSR 421
Query: 813 NLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFAC 870
+ +F G Y A+L + VV +LR+ G +AE+P +AT G+G + LF
Sbjct: 422 SHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGICRSLFTA 481
Query: 871 IEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
+E++L+ L V+ +VL AA++ E +W + F F +D +L + R LV G LQK
Sbjct: 482 VEEMLARLGVEMMVLLAAKDTEKMWKNSFEFHAMDRKLKA--RTVALGLVALNGAGFLQK 539
Query: 931 RV 932
V
Sbjct: 540 SV 541
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 148/384 (38%), Gaps = 64/384 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+NDD C C DGG LL CD CP +H+ C S +P+G W YC H+
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSW---YC-----------HNCT 48
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
G V + IL C QC
Sbjct: 49 CRSCGNPLSEKEVSTFS---------------------------------AILKCLQCGD 75
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIK-KYA 751
+H C+ + +M + WFC C I L N + E E + +K
Sbjct: 76 SYHDTCIDQ-EMLPCGDKQSNIWFCGRYCKEIFIGLHNHVGIENFLDNELSWSILKCNTD 134
Query: 752 GNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYG 811
G L + I A E L+ A+ I +CF +VD +G D+IP ++
Sbjct: 135 GRKLHSSKKI----------AHMTECNTKLAVALTILEECFVRMVDPRTGVDMIPHVL-- 182
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
N ++ G Y IL ++ +RV G + AELP +ATS +G + L I
Sbjct: 183 SNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIATSVDYRRQGMCRRLMDTI 242
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 931
E +L V+++VL A E + W FGFK I+ R L+ F GTS+L KR
Sbjct: 243 EMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN--VNLMLFPGTSLLTKR 300
Query: 932 VPACRIGSSSTD-STECVSGVEVG 954
+ S D VSG+ G
Sbjct: 301 LDGITAAKSEEDKDAYNVSGLPNG 324
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 161/379 (42%), Gaps = 87/379 (22%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
N D+C +C GG+LL CDGCP AFH C LSS+P+ D + F
Sbjct: 253 NCDVCCVCHWGGDLLLCDGCPSAFHHTCLGLSSLPEEDLW------------FCPCCCCD 300
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
+ S V+S +L C C+ R
Sbjct: 301 ICGSMESPVNS-----------------KLMACEQCQ---------------------RR 322
Query: 694 FHVGCLKKHK-MADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG 752
FH+ CLK+ + R WFC C+R++S L+NL+ G
Sbjct: 323 FHLKCLKEEPGIVSCR-----GWFCSSQCNRVSSALENLI-------------------G 358
Query: 753 NSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSM 808
+ ++ D+ W L+ G+ E L AV I H F+P D SGRDL+ +
Sbjct: 359 CKIAVGNNGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTKDVFSGRDLVEEL 418
Query: 809 VYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
++ ++ G G G Y ++ ++ +RV ++V E+PLVAT G ++L
Sbjct: 419 IFRKDRTG--VGRGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMCRVL 475
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCSQLVTFKGTS 926
+EK +S + V +VLPAA+E S WT +FGF ++ E L + + ++ F GT
Sbjct: 476 VDELEKQMSQMGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVK---HGMLDFVGTV 532
Query: 927 MLQKRVPACRIGSSSTDST 945
M K + R + S + +
Sbjct: 533 MCHKFLVKERAENDSAEES 551
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/469 (23%), Positives = 174/469 (37%), Gaps = 126/469 (26%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
LLEG GI C+CC+ S FE HAG R P
Sbjct: 419 LLEGIITKEGIRCNCCDEVFSVLDFEVHAGGNRNQPFKSLYLEGGNSLLQCLHESMNKQS 478
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G NDD C IC DGG+L+ CDGCP FH+ C + P G WYC
Sbjct: 479 ESQLKGYHFVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYN 538
Query: 617 CQNMF-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C F E+ +H+ + + + LS C L
Sbjct: 539 CSCKFCEKDEAAKHETSTLPS--------------------------LSSCRL------- 565
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQE 735
CE ++H C+ + FC C + LQ L +
Sbjct: 566 --------------CEEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQ-LFIGV 608
Query: 736 AEKLPE-FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDP 794
LPE F + ++++ S V+D D+ ++ ++ A ++ +CF P
Sbjct: 609 KHPLPEGFSWSFLRRFELPS--EVADCDISEKIAYNAK--------MAVAFSVMDECFSP 658
Query: 795 IVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLV 852
+VD SG +L+ ++VY G N +F A+L +++ +R+ G ++AE+P +
Sbjct: 659 LVDHRSGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFI 718
Query: 853 ATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT---------------- 896
T + +G + L IE +++ + L +E +W
Sbjct: 719 GTRYMYRRQGMCRRLMDGIESFVAYF--SQMFLAISEVLLDVWQFCCYPACFGDGPFCFF 776
Query: 897 DKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 944
FGF + D E +I + L+ F G ML K + +I S S
Sbjct: 777 SGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKEKITDSVVSS 822
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 114/466 (24%), Positives = 177/466 (37%), Gaps = 118/466 (25%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
LLEG GI C+CC+ S FE HAG R P
Sbjct: 572 LLEGIITKEGIRCNCCDEVFSVLDFEVHAGGNRNQPFKSLYLEGGNSLLQCLHESMNKQS 631
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKY 616
G NDD C IC DGG+L+ CDGCP FH+ C + P G WYC
Sbjct: 632 ESQLKGYHFVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYN 691
Query: 617 CQNMF-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDF 674
C F E+ +H+ + + + LS C LC C
Sbjct: 692 CSCKFCEKDEAAKHETSTLPS--------------------------LSSCRLCEEKC-- 723
Query: 675 SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQ 734
S P T+ C + G + + D FC C + LQ L +
Sbjct: 724 --SKHYPHTLADHQACINQ--DGTVPGERSTDS--------FCGKYCQELFEELQ-LFIG 770
Query: 735 EAEKLPE-FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFD 793
LPE F + ++++ S V+D D+ ++ ++ A ++ +CF
Sbjct: 771 VKHPLPEGFSWSFLRRFELPS--EVADCDISEKIAYNAK--------MAVAFSVMDECFS 820
Query: 794 PIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPL 851
P+VD SG +L+ ++VY G N +F A+L +++ +R+ G ++AE+P
Sbjct: 821 PLVDHRSGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPF 880
Query: 852 VATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWT--------------- 896
+ T + +G + L IE +++ + L +E +W
Sbjct: 881 IGTRYMYRRQGMCRRLMDGIESFVAYF--SQMFLAISEVLLDVWQFCCYPACFGDGPFCF 938
Query: 897 -DKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSS 940
FGF + D E +I + L+ F G ML K + +I S
Sbjct: 939 FSGFGFAPVNDSEKKTI---KNLNLLVFPGVDMLGKSLVKEKITDS 981
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 146/377 (38%), Gaps = 92/377 (24%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANA 633
+D +C+ C GG+L+ CD CP FH C L +P +W+C C
Sbjct: 142 SDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC---------------- 185
Query: 634 VEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCERE 693
C C LC D S S C QC R
Sbjct: 186 ------------------C-------------CELCGKGDSSTSTNA------CLQCARA 208
Query: 694 FHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGN 753
+HV CL K + P + FC C + + L LL G
Sbjct: 209 YHVHCLTKDGCLLPTDYP-SENFCSKSCYELCAQLHQLL-------------------GI 248
Query: 754 SLETVSDIDVRWRLLSGKAAT--------PETRLLLSQAVAIFHDCFDPIVDSISGRDLI 805
S T D + W L T + Q + + H+CF + + + +D++
Sbjct: 249 SNPTSVD-GLTWTLTRSSKDVYNFPGMPRSSTHVKSFQILRVMHECFRSVKEPHTQKDMV 307
Query: 806 PSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGY 863
++Y G + F G Y +L +VS LR+ G + AE+PLVAT +G
Sbjct: 308 TDLIYNSGSKFKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVATPFNFRRQGM 367
Query: 864 FQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI---DPELLSIYRKRCSQLV 920
+LL + KLL+ RV+ ++LPA + +W FGF ++ + + LS Y V
Sbjct: 368 CRLLMQEVLKLLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQLSGY-----SFV 422
Query: 921 TFKGTSMLQKRVPACRI 937
F+GT MLQ + + RI
Sbjct: 423 GFQGTMMLQNVLTSSRI 439
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 114 bits (285), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/70 (74%), Positives = 63/70 (90%), Gaps = 1/70 (1%)
Query: 620 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 679
MF+R++F++H+ANAV AGRVSGVD +EQITKRCIRIV N EAE+S C+LCRG DFSKSGF
Sbjct: 1 MFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGF 59
Query: 680 GPRTILLCDQ 689
GPR I++CDQ
Sbjct: 60 GPRMIIMCDQ 69
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 557
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 131/268 (48%), Gaps = 35/268 (13%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 743
++ C+QC+R FH+ CLK+ D + WFC C+R+ S L+NLL
Sbjct: 315 LMACEQCQRRFHLTCLKE----DSCIVSSRGWFCSSQCNRVFSALENLL----------- 359
Query: 744 LNAIKKYAGNSLETVSDIDVRWRLL----SGKAATPETRLLLSQAVAIFHDCFDPIVDSI 799
G+ + +D D+ W L+ G+ E L AV I H F+P D
Sbjct: 360 --------GSKIAVGNDGDLVWTLMRAPNEGEHYDDEQISKLESAVEILHQGFEPTNDVF 411
Query: 800 SGRDLIPSMVYGRNLRGQEFG-GMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKIN 858
SGRDL+ ++Y ++ G G G Y ++ + ++ +RV ++V E+PLVAT
Sbjct: 412 SGRDLVEELIYRKDRTG--VGRGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSY 468
Query: 859 HGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID-PELLSIYRKRCS 917
G ++L +EK +S + V +VLPAA+E + WT++FGF ++ E L + +
Sbjct: 469 RRSGMCRVLMDELEKQMSQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVKH--- 525
Query: 918 QLVTFKGTSMLQKRVPACRIGSSSTDST 945
++ F GT M K + R + S + +
Sbjct: 526 GMLDFVGTIMCHKFLQKERAENDSAEES 553
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 29/41 (70%), Gaps = 1/41 (2%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD-WY 613
N D+C +C GG+LL CDGCP AFH C LSS+P+ D W+
Sbjct: 255 NCDVCCVCHWGGDLLLCDGCPSAFHHACLGLSSLPEEDLWF 295
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 113/433 (26%), Positives = 178/433 (41%), Gaps = 73/433 (16%)
Query: 527 EVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGN 586
+V Y + +LEG+ GI C CC+ ++ S+FE HAG + P + G +
Sbjct: 621 KVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAGSKLPQPYQN---IYLESGVS 677
Query: 587 LLPC--DGCPRAFHKECASLSSI------PQGDWYCKYCQNMFERKRFLQHDANAVEAGR 638
LL C D R H E S+ P D C C + G
Sbjct: 678 LLQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDD-TCGICG----------------DGGD 720
Query: 639 V---SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS-----KSGFGPRTILLCDQC 690
+ G S + I+++ E C C+ C + K + C C
Sbjct: 721 LICCDGCPSTFHQSCLDIQMLPPGEWHCPNCT-CKFCGIASETSDKDDASVNVLRTCILC 779
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKY 750
E+++H C K ++ LP + INS + +E ++L E+ +KKY
Sbjct: 780 EKKYHDSCTK-----EMDTLP----------NNINSSSLSFCGKECKELSEY----LKKY 820
Query: 751 AGNSLETVSDIDVRWRLLS-----------GKAATPETRLLLSQAVAIFHDCFDPIVDSI 799
G E + W L+ G E L+ A+ + +CF P++D
Sbjct: 821 LGTKHEL--EAGFSWCLIHRSDEDSEAACRGLTQRVECNSKLAIALTVMDECFLPVIDRR 878
Query: 800 SGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 857
SG +LI +++Y G N + G Y AIL +++A +R G ++AE+P + T I
Sbjct: 879 SGINLIRNILYNSGSNFSRLSYSGFYTAILERGDEIIAAASIRFHGTKIAEMPFIGTRHI 938
Query: 858 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS 917
+G + LF+ IE L L+V+ +V+PA E WT FGF +D L +
Sbjct: 939 YRRQGMCRRLFSAIELALCSLKVEKLVIPAVAELTHTWTTVFGFTYLDESLRQ--EMKSL 996
Query: 918 QLVTFKGTSMLQK 930
++ F G MLQK
Sbjct: 997 NMMVFPGIDMLQK 1009
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 126/478 (26%), Positives = 200/478 (41%), Gaps = 87/478 (18%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG------QKLLEGYKNGLGIICHCCNSEVSPSQF 561
K + + D + DG V Y G +K++ G G+ C CC+ V F
Sbjct: 367 KKHTILTWLIDGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVHCSCCDGVVPLPVF 426
Query: 562 EAHAGRQY----PGKDNDDLCTICADGGNLLPCDGCP------RAFHKECASLSSIPQGD 611
EAHAG + PG+ + G +LL C R FH + +++ Q +
Sbjct: 427 EAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFHAQAKVRAALEQEE 486
Query: 612 WYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GCLLC 669
C ++R L A ++ G V VE+I + +K E + S C +C
Sbjct: 487 DKCSQA-----KRRLL---AKHLKKGVV-----VERIMSPRMEKIKAGEKDSSDDACGVC 533
Query: 670 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRIN 725
+ G +L CD C FH CL ++P+G W C C+ C N
Sbjct: 534 -----ADGG----ELLCCDSCTSTFHPECLAI-------KVPEGSWSCHYCRCVLCMS-N 576
Query: 726 SVLQNLLV--QEAEKLPE----FHLNA--IKKYAGNS-------LETVSDI------DVR 764
LQ L Q A K E N I Y G + L V+ +
Sbjct: 577 DDLQGLSTCQQCARKYHESCRPLPGNGCDIGTYCGETCKKLFSQLAQVTGVTNPTGDGFW 636
Query: 765 WRLLSGKAATP----------ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GR 812
W LL + P E + L+ A+ +F++CF+P+ D + D++ VY G
Sbjct: 637 WALLRIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQAVYSLGS 696
Query: 813 NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIE 872
+ + G Y +L + +VSA +LR+ G +VAE+P T +G + L + +E
Sbjct: 697 QFKRLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRRLVSAVE 756
Query: 873 KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
++L+ ++V+ +V+PA + W F F+ +DP+L KR S LV GT++L K
Sbjct: 757 QVLASVQVEKLVIPAIDSLVDTWKRSFFFRPVDPQLREEL-KRLS-LVVITGTTLLHK 812
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 97/172 (56%), Gaps = 3/172 (1%)
Query: 222 NKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSK 281
NK P + EL TG+L+G V Y+ K + + LRG+I+ GILCSCS C G +V+ P
Sbjct: 257 NKIPTNLRELLATGMLEGQPVKYIM-RKGKRAVLRGVIKRIGILCSCSSCKGRKVVSPYY 315
Query: 282 FEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVR 341
FE+HA + S YI ENG +L +VLRAC + L ML+ ++ A+ P+E+ F C
Sbjct: 316 FEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIFRCKS 375
Query: 342 CKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYTTGIRIRASGPKLVSRSS 393
CK +F GK C+SC++SK + ++ + R + S K+ +S
Sbjct: 376 CKSSFSTLRSGKF--ALFCDSCLESKGAKNNISSSKVGRSQTSSAKVYKSAS 425
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 59/83 (71%), Gaps = 2/83 (2%)
Query: 485 SKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNG 544
+K +++ASP + S +T KD+ +HK+VF LP+GT+VGYY G++LL+GY
Sbjct: 418 AKVYKSASP--GAKSSSVGRLTRKDKGMHKVVFMSGILPEGTDVGYYVGGKRLLDGYIKE 475
Query: 545 LGIICHCCNSEVSPSQFEAHAGR 567
LGI CHCC++ VSPSQFE HAGR
Sbjct: 476 LGIYCHCCSTVVSPSQFEGHAGR 498
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 128/273 (46%), Gaps = 46/273 (16%)
Query: 673 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADL-RELPKGKWFCCMDCSRINSVLQNL 731
D + GF IL CDQC R+FHV C + + L R+ WFC C + S LQ+L
Sbjct: 40 DVQQDGF----ILSCDQCPRKFHVACARSRGLIKLERKGTCYSWFCSDKCEYVFSGLQHL 95
Query: 732 LVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLL---------LS 782
L G S+ +D ++ W LL K P+ L L
Sbjct: 96 L-------------------GKSVPVGTD-NLTWTLL--KRVEPDCFDLEVLSANNSKLK 133
Query: 783 QAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 840
A+ + H+CF+P D+ +G+DL+ +++ G NL F G Y +L N+ + + +R
Sbjct: 134 LALEVMHECFEPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVR 193
Query: 841 VFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFG 900
VFG +VAE+P VAT G ++L +E+ L L V+ +VLPAA W FG
Sbjct: 194 VFGDKVAEVPFVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFG 253
Query: 901 FKKI---DPELLSIYRKRCSQLVTFKGTSMLQK 930
F + D + S Y ++ F+GT + QK
Sbjct: 254 FSVMTYSDKKAHSDY-----PILFFQGTVLCQK 281
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 117/481 (24%), Positives = 192/481 (39%), Gaps = 90/481 (18%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYYACG-------QKLLEGYKNGLGIICHCCNSEVSPSQ 560
K + + D L DG V YY G +K++ G G+ C+CC++ V
Sbjct: 277 KKHTILTWLIDGGFLSDGETV-YYVPGDSGGAGKEKIVSGAVTRAGVHCNCCDAVVPLPV 335
Query: 561 FEAHAGRQYPGKDNDDLCT-----ICADGGNLLPC------DGCPRAFHKECASLSSIPQ 609
FE HAGR PG + G +LL + R FH + +++ Q
Sbjct: 336 FEVHAGR-VPGTGQQQQQVAWEKLLLVSGDSLLQSMQEAWQNEKVRTFHAQAKVRAALEQ 394
Query: 610 GDWYCKYCQNMFERKRFL-QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELS--GC 666
+ +N ++R L +H V VE+I + +K E + S C
Sbjct: 395 -----EEEKNSQAKRRLLAKHQKKGV---------VVERIMSPRMEKIKAGEKDSSDDAC 440
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--------- 717
+C D + +L CD C FH CL E+P G W C
Sbjct: 441 GVC--ADGGE-------LLCCDFCTSTFHPECLAI-------EVPDGSWSCHYCRCTLCM 484
Query: 718 ---------CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDID-----V 763
C +C+ L+ + + KK + E + ++
Sbjct: 485 SNDDQDLSTCQECACKYHESCRPLLGNGRDIGAYCGEICKKLSAKLSEVIGVMNSTEDGF 544
Query: 764 RWRLL----------SGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--G 811
W LL G A E + L+ A+ + + CF+P+ D + D++ VY G
Sbjct: 545 SWSLLRIHEDEPASSQGMPAVLERNVKLAVALGVLNQCFNPVKDRRTKIDMLHQAVYSLG 604
Query: 812 RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACI 871
+ + G Y IL + +VS +LR+ G++VAE+P T +G + + +
Sbjct: 605 SQFKRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGTLPAYRKQGMMHRVVSAV 664
Query: 872 EKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKR 931
E++L+ ++V+++++PA W F F+ +DP+L KR S LV GT+ML K
Sbjct: 665 EQVLASVQVETLIIPAIASMVDTWKRSFSFRPVDPQLREEL-KRLS-LVVITGTTMLHKP 722
Query: 932 V 932
V
Sbjct: 723 V 723
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 120/470 (25%), Positives = 192/470 (40%), Gaps = 109/470 (23%)
Query: 512 LHKLVFDESGLPDGTEVGYY-ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP 570
+ + D L +G V Y ++ G + GI+C CCN S + F+ HAG +
Sbjct: 3 IFSWLIDGEILSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEV- 61
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
L T+ DG ++L C +L I Q C + L D
Sbjct: 62 -HRTAALLTL-EDGRSVLECQK---------QALKKIEQAK-----CDEPANGQ--LTVD 103
Query: 631 ANAVEAGRVSG----VDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILL 686
A++A + VD VE ++ C +C G G + ++
Sbjct: 104 ETALKAMELKESELVVDDVE------------MDENDDTCAVC--------GDGGQ-LVC 142
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSR--INSVLQN-LLVQEAEKL 739
CD C FH+ CL+ L +P+G WFC C C R + +Q +L + +
Sbjct: 143 CDHCPSTFHLKCLR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHSNCV 196
Query: 740 P-----------EF-------HLNAIKKYAGNSLETVSDIDVRWRLLSGK---------- 771
P +F ++K G + V D+ W LL +
Sbjct: 197 PGCAMKYESSDNQFCSRKCFKIFRGLRKLVGR-VNKVDDM-YSWTLLRSEHYDQSAENSK 254
Query: 772 ---AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQE----FGGMYC 824
A TRL L A+ + +CF P++D S D++ ++Y R RG++ F G Y
Sbjct: 255 LESVADLNTRLAL--ALTVIQECFRPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYT 310
Query: 825 AILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIV 884
+L ++S +RV G AE+P + T +G + L I+++L L V+++V
Sbjct: 311 VVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLV 370
Query: 885 LPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLVTFKGTSMLQK 930
LPA E WT FGF+K+ +L+ + +VTF G+S+LQK
Sbjct: 371 LPAIAEFIETWTSAFGFQKLTAAQGIQLMEL------NIVTFPGSSVLQK 414
Score = 40.0 bits (92), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
L +G +V Y+ Q + G+I GILC C CN V + F++HA + R +
Sbjct: 13 LSEGAAVSYVNKDSNQVAS--GVISRDGILCKC--CN--EVFSMTSFQVHAGDEVHRTAA 66
Query: 296 YICFENGKSLLE 307
+ E+G+S+LE
Sbjct: 67 LLTLEDGRSVLE 78
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 105 bits (263), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 110/410 (26%), Positives = 177/410 (43%), Gaps = 104/410 (25%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
I+C CC+ + + FE+HAG R P + DG +LL C +E S
Sbjct: 37 AIVCDCCHVTFTITGFESHAGCTRHRPSTS-----ILLEDGRSLLDC-------QREALS 84
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK-NLEAE 662
SS +G+ + V + V++ K+ +VK N EA+
Sbjct: 85 -SSDHKGN-------------------HSVVNENQKKNHSIVKENRKKNHCVVKENSEAK 124
Query: 663 LSG-CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 721
C +C GFG + LCD+C FH+GCL L +P G+WFC C
Sbjct: 125 NDNVCSIC--------GFGG-DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCC 169
Query: 722 SRI---------------NSVLQNLLVQEAEKLPEFHLNAIKKYA--GNSLET------- 757
+I N++L + VQ +K +H +K N +E
Sbjct: 170 CKICYRPKCKQECKDHKDNNIL--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVC 224
Query: 758 ----------------VSDIDVRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIV 796
V+D ++ W L+ ++ + L+ A+ + ++ F+P
Sbjct: 225 GNMFLCLKKLLGKPIKVAD-NLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTF 283
Query: 797 DSISGRDLIPSMVYGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVAT 854
D++SGR+LI +V+ R F G Y IL V+S +R++GQ+VAE+ VAT
Sbjct: 284 DALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVAT 343
Query: 855 SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
+ +G LL IEK L+ L V+ ++L ++E+A +IWT FGF ++
Sbjct: 344 KEQYRRQGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWTKSFGFARM 393
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 111/221 (50%), Gaps = 16/221 (7%)
Query: 713 GKWFCCMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKA 772
G FC C + LQNLL + + PE+ +++ + E V +D R
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHEDVPEEVLALDKRV------- 76
Query: 773 ATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVN 830
E ++ A+++ +CF PI+D +G +LI ++VY G N +F G Y IL
Sbjct: 77 ---ECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERG 133
Query: 831 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 890
+++A +R+ G ++AE+P + T + +G + L IE +LS L V+ +++PA E
Sbjct: 134 DEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITE 193
Query: 891 AESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 930
WT KFGF + D E + K S LV F GT +LQK
Sbjct: 194 LVDTWTSKFGFSPLEDSEKQEV--KSISMLV-FPGTGLLQK 231
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 93/171 (54%), Gaps = 12/171 (7%)
Query: 765 WRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN-LRGQEFGGMY 823
WRLLSG A+ + +L + Q + IF D F D S D+I MV G+N + ++F GMY
Sbjct: 40 WRLLSGMDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 97
Query: 824 CAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSI 883
CA+LT ++ VVSA IL+V +++AEL L+AT KGYF LL IE L V +
Sbjct: 98 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 157
Query: 884 VLPAAEEAESIWTDKFGFKKIDPE----LLSIYRKRCSQLVTFKGTSMLQK 930
P E IW++K GF + E +L + LV FK ++QK
Sbjct: 158 TAPVDPEMAQIWSEKLGFTILSAEEKESMLESH-----PLVMFKNLVLVQK 203
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 174/407 (42%), Gaps = 98/407 (24%)
Query: 546 GIICHCCNSEVSPSQFEAHAG--RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS 603
I+C CC+ + + FE+HAG R P + DG +LL C +E S
Sbjct: 37 AIVCDCCHVTFTITGFESHAGCTRHRPSTS-----ILLEDGRSLLDC-------QREALS 84
Query: 604 LSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAEL 663
SS +G+ + V + V++ K+ +VK ++E
Sbjct: 85 -SSDHKGN-------------------HSVVNENQKKNHSIVKENRKKNHCVVKE-KSEA 123
Query: 664 SGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSR 723
+ +C C GFG + LCD+C FH+GCL L +P G+WFC C +
Sbjct: 124 NNDNVCSIC-----GFGG-DLALCDRCPSAFHLGCL------GLNRVPIGEWFCPTCCCK 171
Query: 724 I---------------NSVLQNLLVQEAEKLPEFHLNAIKKYA---------------GN 753
I N++L + VQ +K +H +K GN
Sbjct: 172 ICYRPKCKQECKDHKDNNIL--VCVQCEQK---YHFGCVKAVGIEFNHMENWFCSVVCGN 226
Query: 754 SLETVSDI---------DVRWRLLSGKAATPETRL-----LLSQAVAIFHDCFDPIVDSI 799
+ + ++ W L+ ++ + L+ A+ + ++ F+P D++
Sbjct: 227 MFLCLKKLLGKPIKVADNLTWTLVKNVSSVDDKEFNQKESKLNMALGVLYEGFNPTFDAL 286
Query: 800 SGRDLIPSMVYGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 857
SGR+LI +V+ R F G Y IL V+S +R++GQ+VAE+ VAT +
Sbjct: 287 SGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISVATVRIYGQKVAEVVFVATKEQ 346
Query: 858 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI 904
+G LL IEK L+ L V+ ++L ++E+A + WT FGF ++
Sbjct: 347 YRRQGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWTRSFGFARM 393
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 107/249 (42%), Gaps = 35/249 (14%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 743
++ CD C FH CL L+ L K + LVQ E +
Sbjct: 298 LICCDGCPSTFHQSCLDIQLFEQLQMLLGVK-------HELEDGFSWTLVQRTEVGFDIS 350
Query: 744 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
LN I + E L+ A++I +CF PIVD SG +
Sbjct: 351 LNGIPQKV------------------------ECNSKLAVALSIMDECFLPIVDQRSGIN 386
Query: 804 LIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
LI +++Y G N + G + AIL ++SA +R+ G ++AE+P + T I +
Sbjct: 387 LIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIYRRQ 446
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVT 921
G + L IE L L V+ +V+PA E WT FGFK + E+ S R ++
Sbjct: 447 GMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPL--EVSSRKEMRNMNMLV 504
Query: 922 FKGTSMLQK 930
F GT MLQK
Sbjct: 505 FHGTDMLQK 513
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 47/113 (41%), Gaps = 40/113 (35%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY--PGKD--------------------- 573
LLEG+ + GI C CC+ + S+FE HAG + P ++
Sbjct: 208 LLEGWISRDGIRCGCCSEIFTISKFEIHAGMKLCEPSQNIILETGISLLQCQLDSWNKQE 267
Query: 574 -----------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQ 609
NDD C IC DGG+L+ CDGCP FH+ C + Q
Sbjct: 268 ESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIQLFEQ 320
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 142/387 (36%), Gaps = 107/387 (27%)
Query: 535 QKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP------------------------ 570
+++L G GI C CC+ ++ ++FE HAG +
Sbjct: 674 REMLAGKITREGIFCGCCSKILTIAKFELHAGSKEKKPYANIFLEGGKVSLLQCLLDAWE 733
Query: 571 ---------------GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
G D +DD C IC DGG+L+ CD C FH +C + +P GDWYC
Sbjct: 734 KHTQSENKGFYKIDKGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYC 792
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
+ C LCR C F
Sbjct: 793 RSC--------------------------------------------------LCRFCGF 802
Query: 675 --SKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKW--FCCMDCSRINSVLQN 730
K P +L C QC R++H C +P FC C +I L
Sbjct: 803 PQEKPSSSPELLLSCLQCSRKYHQTCSSGTGTDSGCTMPGTSIDCFCSPGCRKIYKRLNK 862
Query: 731 LLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHD 790
LL + F + + +A + + + K A ++ L A + +
Sbjct: 863 LLGIKNHMEAGFSWSLVHCFANDQAMPNKNKE--------KLAQCNSKTAL--AFTVLDE 912
Query: 791 CFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAE 848
CF P +D SG ++I ++ Y G + +F G Y IL V++A +R+ G ++AE
Sbjct: 913 CFQPHIDDRSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVIAAASVRIHGTDLAE 972
Query: 849 LPLVATSKINHGKGYFQLLFACIEKLL 875
+P + T + +G + L IE ++
Sbjct: 973 MPFIGTRGMYRHQGMCRRLLNGIESVI 999
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 66/99 (66%), Gaps = 2/99 (2%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
++ DDLC C GG++ PC CPR+FH C LS +P +WYC C+N+ ++++ L +
Sbjct: 12 RETDDLCRECGQGGDIFPCKICPRSFHPACVGLSKVPS-EWYCDNCRNLVQKEKALAENK 70
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR 670
NA AGR +GVDS+EQI KR IRIV + +L GC LC+
Sbjct: 71 NAKAAGRQAGVDSIEQIMKRAIRIVP-ISDDLGGCALCK 108
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 129/270 (47%), Gaps = 33/270 (12%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM------------DCSRINSVLQNL 731
+LLCD+C FH C+ L+ P+G W C + D + + +
Sbjct: 576 LLLCDKCPSAFHHACVG------LQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 732 LVQEAEKLPE----FHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 787
+ EA +P L+ +++ + TV+ I RW+ AA L A+ +
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRR--RYMSTVTRI-TRWQHEEEDAADHGQ---LCAALDV 683
Query: 788 FHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
H+CFD +V+ + DL +V+ + L F G Y L +++ G LRVFG +
Sbjct: 684 LHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVFGNQ 743
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
VAELPLV T + +G +LL +EK+L + V+ +VLPA E +WT GF +
Sbjct: 744 VAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFHAMT 803
Query: 906 -PELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
+++ + + +++FKGT+M QK + A
Sbjct: 804 RSDVMEMAVEHA--ILSFKGTTMCQKTLLA 831
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
++ D +C++C DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 561 EEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR 607
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 112/479 (23%), Positives = 193/479 (40%), Gaps = 101/479 (21%)
Query: 516 VFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPG 571
+ D L D +V Y +K++ G GI C CCN+ V + FE HA + PG
Sbjct: 378 LIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFETHARCERPG 437
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFL---- 627
+ + LL G P LS Q W + M R++ +
Sbjct: 438 QPWE----------KLLLMSGKP---------LSKCMQEAWAQERVTAMRAREKAMASLE 478
Query: 628 QHDANAVEAGR----------VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKS 677
Q + +A R + GV V + R ++ KN + C C +
Sbjct: 479 QEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDACGVC--ADG 535
Query: 678 GFGPRTILLCDQCEREFHVGCL------KKHKMADLRELP----KGKW------------ 715
G +L CD C FH CL K + D ++L + W
Sbjct: 536 G----QLLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISAD 591
Query: 716 ----------------FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKKYAGNSLET 757
+C C +++S L +++ + E + L I+K + L T
Sbjct: 592 HQYCRPLQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK---DELVT 648
Query: 758 VSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLR 815
D+ V +L E+ + L+ A+ + ++CF+P+ D + D++ VY G +
Sbjct: 649 SEDMPV---IL-------ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFK 698
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
+ G Y +L + ++S +LR G+++AE+P T +G + L +EK+L
Sbjct: 699 RVNYEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVL 758
Query: 876 SFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQKRVPA 934
+ L+V+++V+PA + W F F+ + E+ +K LV GT++LQK + A
Sbjct: 759 ASLQVENLVIPAVADLVETWKRSFSFRPMQAEVRDEAKKLS--LVAITGTTLLQKPISA 815
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 113/487 (23%), Positives = 195/487 (40%), Gaps = 101/487 (20%)
Query: 508 KDQRLHKLVFDESGLPDGTEVGYY----ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEA 563
K + + D L D +V Y +K++ G GI C CCN+ V + FE
Sbjct: 372 KKHTVLTWLIDTGFLKDKAKVFYVPGDAGAAEKVISGMVTKTGIRCRCCNTVVPVAVFET 431
Query: 564 HAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFER 623
HA + PG+ + LL G P LS Q W + M R
Sbjct: 432 HARCERPGQPWE----------KLLLMSGKP---------LSKCMQEAWAQERVTAMRAR 472
Query: 624 KRFL----QHDANAVEAGR----------VSGVDSVEQITKRCIRIVKNLEAELSGCLLC 669
++ + Q + +A R + GV V + R ++ KN + C
Sbjct: 473 EKAMASLEQEKEKSSQAKRKLAKTKKMQLLDGVVVVSTSSPRH-QVKKNGGGKDCSDDAC 531
Query: 670 RGCDFSKSGFGPRTILLCDQCEREFHVGCL------KKHKMADLRELP----KGKW---- 715
C + G +L CD C FH CL K + D ++L + W
Sbjct: 532 GVC--ADGG----QLLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTA 585
Query: 716 ------------------------FCCMDCSRINSVLQNLL--VQEAEKLPEFHLNAIKK 749
+C C +++S L +++ + E + L I+K
Sbjct: 586 PGAAISADHQYCRPLQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQK 645
Query: 750 YAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 809
+ L T D+ V +L E+ + L+ A+ + ++CF+P+ D + D++ V
Sbjct: 646 ---DELVTSEDMPV---IL-------ESNVKLAVALGVLNECFNPVQDRRTKIDMLHQAV 692
Query: 810 Y--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
Y G + + G Y +L + ++S +LR G+++AE+P T +G + L
Sbjct: 693 YSLGSEFKRVNYEGFYTMVLEKDGEIISVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRL 752
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSM 927
+EK+L+ L+V+++V+PA + W F F+ + E+ +K LV GT++
Sbjct: 753 VKAVEKVLASLQVENLVIPAVADLVETWKRSFSFRPMQAEVRDEAKKLS--LVAITGTTL 810
Query: 928 LQKRVPA 934
LQK + A
Sbjct: 811 LQKPISA 817
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 128/293 (43%), Gaps = 55/293 (18%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCC--------------------MDCSR 723
++ CD C FH CL ++ LP+G W+C CS+
Sbjct: 726 LICCDNCPSTFHQACLS------MQVLPEGSWYCSSCSCQICSELVSDNGERSQDFKCSQ 779
Query: 724 INSVLQNLLVQEAEK----LPEFHL---NAIKKYAGNSLETVSDID-----VRWRLL--- 768
+ +Q K PE + N K Y G S V I+ + W +L
Sbjct: 780 CAHKYHGICLQGISKRRKLFPETYFCGKNCEKVYTGLS-SRVGVINPNADGLSWSILKCF 838
Query: 769 --SGK-------AATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQ 817
GK A E L+ A++I + F +VD +G D+IP ++Y G N
Sbjct: 839 QEDGKVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTGIDMIPHVLYNWGSNFARL 898
Query: 818 EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 877
+F G Y +L + ++S +RV G VAE+PLVAT +G ++L A IE++L
Sbjct: 899 DFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVATCSKYRRQGMCRILVAAIEEMLMS 958
Query: 878 LRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
L+V+ +V+ A WT+ FGFK +D E ++ L+ F GT +L K
Sbjct: 959 LKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR--INLMVFPGTILLMK 1009
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 45/108 (41%), Gaps = 40/108 (37%)
Query: 546 GIICHCCNSEVSPSQFEAHAG-------------------------------------RQ 568
G++C CCN VS S+F+ HAG R
Sbjct: 645 GVVCTCCNRTVSLSEFKNHAGFNQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWRS 704
Query: 569 YPGKD---NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWY 613
D NDD C +C DGG L+ CD CP FH+ C S+ +P+G WY
Sbjct: 705 EEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWY 752
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 123/356 (34%), Gaps = 123/356 (34%)
Query: 539 EGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP----------GK---------------- 572
+G GI+C+CC +S S F AHAGR +P GK
Sbjct: 882 DGMVTWEGIVCNCCKKNLSVSDFMAHAGRSHPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 573 --------------DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
++DD C C DGG LL CD CP +H+ C S +P+G WYC C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 619 NMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSG 678
C +C G K
Sbjct: 1002 -----------------------------------------------CQVCGGPFSEKEV 1014
Query: 679 FGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEK 738
I C QC +H C+++ K+ L + WFC C I
Sbjct: 1015 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEI-------------- 1059
Query: 739 LPEFHLNAIKKYAGNSLETVSDIDVRWRLL----SGK--------AATPETRLLLSQAVA 786
++ + G + + D D+ W +L G+ A E + L+ A+
Sbjct: 1060 -----FIGLRSHVGT--DNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVALT 1112
Query: 787 IFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR 840
+ +CF +VD +G D+IP ++Y G N ++ G Y IL ++ +R
Sbjct: 1113 LLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIR 1168
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 170/401 (42%), Gaps = 73/401 (18%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY--PGKDNDDLCTICADGGNLLPCDGCP 594
L EG G++C CC+ + S FEAH G + P + D L D
Sbjct: 2 LKEGMVTSDGLVCSCCDELFNLSGFEAHTGSKLRRPAAN-----IFVGDEAQLSIADCQN 56
Query: 595 RAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIR 654
AF E +L S+P + R++F + ++ SG+ +V +
Sbjct: 57 VAFKME--TLESLPG--------LPVARRRKFDSY----CQSDEDSGLTTVSSGSDVDYE 102
Query: 655 IVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGK 714
N + + C G ++ C+ C FH+ C+ L E+PK
Sbjct: 103 AAANSD---------QCCGICNEG---GELVCCETCPLTFHMECVS------LLEVPKDA 144
Query: 715 WFC--CMDCSRINSVLQNLLVQEAEKLPEFH--------------------LNAIKKYAG 752
WFC C+ C L+ ++ E+ FH N ++ A
Sbjct: 145 WFCFRCL-CCHCGEPLRTQPCEQCERC--FHPGCCDDAILAGDFFFCSSGCWNLFQRLA- 200
Query: 753 NSLETVSDI---DVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMV 809
+ TV+ + ++ W LL + LL++A+ + FDP++D + D + +MV
Sbjct: 201 EMVATVNPLGRSELSWSLLRRGRCDDK---LLAEALQLISSRFDPVLDCWTQLDYLDAMV 257
Query: 810 YGRNLRGQ--EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLL 867
+ R+ +F G Y A+L + VV +LR+ +AE+P +AT G+G + L
Sbjct: 258 FSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGICRSL 317
Query: 868 FACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPEL 908
F +E++L+ L V+++ L AA++ E +W + F F +D +L
Sbjct: 318 FTAVEEMLARLGVETMALLAAKDTEKMWKNSFEFHAVDRKL 358
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 3/136 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
S + + KK +LK KK+ N P V L TG+LDGV V Y I + LRG+I+
Sbjct: 259 SGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKY---IAWSREELRGVIKG 315
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LC C CN +VI +FE HA + + + +I FENGK++ +++ RS+P ML
Sbjct: 316 SGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQELRSIPQNMLF 375
Query: 322 ATLQSALSSLPEEKSF 337
+Q+ S +KSF
Sbjct: 376 EVIQTITGSPINQKSF 391
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 3/147 (2%)
Query: 191 SLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKF 250
+LI I + T KK E K+SKK+ N P V L TG+LDGV V Y I +
Sbjct: 429 ALISTAQITASGSETVSKKKEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKY---IAW 485
Query: 251 QASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLR 310
LRGII+ G LC C CN +VI +FE HA + + + +I FENGK++ +++
Sbjct: 486 SREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQ 545
Query: 311 ACRSVPLPMLKATLQSALSSLPEEKSF 337
+S P L +Q+ S +KSF
Sbjct: 546 ELKSTPQNSLFDVIQTITGSPINQKSF 572
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 72/130 (55%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y I LRGII+ G LC
Sbjct: 306 KNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---ISLSRKELRGIIKGSGYLCG 362
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +ICFENGK++ ++++ R+ P ML +Q+
Sbjct: 363 CQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPESMLFDAIQTV 422
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 423 FGAPINQKSF 432
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 116/258 (44%), Gaps = 43/258 (16%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 743
++ CD C +H CL +LR SR+ + + F
Sbjct: 1056 LICCDNCPASYHQDCLPCQIYMNLR-------------SRVGIPIHTI--------DGFS 1094
Query: 744 LNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSISGRD 803
++ + T +DI A E + L A++I +CF PI+D+ +G D
Sbjct: 1095 CTVLRNNGDQRVSTAADI----------AILAECNMKLVIALSIMEECFLPIIDARTGID 1144
Query: 804 LIPSMVYGRNLRGQ----EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINH 859
+IP ++Y N R ++ G Y +L + ++S +R+ G VAE+PL+AT N
Sbjct: 1145 IIPPILY--NWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1202
Query: 860 GKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCS-- 917
+G + L IE++L L+V+ ++L A WT FGF ID + RK S
Sbjct: 1203 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPID----DLDRKNLSRL 1258
Query: 918 QLVTFKGTSMLQKRVPAC 935
+LV+ GT +L++ + C
Sbjct: 1259 RLVSVPGTVLLKRNLYEC 1276
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 41/105 (39%), Gaps = 40/105 (38%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAG--RQYP------------------------ 570
L +G GI C CC+ + S F+ HAG ++ P
Sbjct: 966 LKDGNITKKGIRCRCCDMVFTMSMFKYHAGLRQEIPSLNLFLGSGKSYTLCQLQAWSIEH 1025
Query: 571 --------------GKDNDDLCTICADGGNLLPCDGCPRAFHKEC 601
+NDD C +C DGG L+ CD CP ++H++C
Sbjct: 1026 KARKERAKCTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDC 1070
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 92.8 bits (229), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 202 SALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRD 261
+ + S KN ELK SKKI N P V L TGLLDGV+V Y+ + + LRG I+
Sbjct: 401 TTIDSASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREKT--LRGTIKG 458
Query: 262 GGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLK 321
G LCSC +C G +V+ +FE HA + + + +I FENGK++ V++ ++ P ML
Sbjct: 459 TGYLCSCKVC-GNKVLNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLF 517
Query: 322 ATLQSALSSLPEEKSFACVRC 342
+++ S +K+F +
Sbjct: 518 NAIETVTGSAINQKNFLSWKA 538
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +KK + N P V L TG+LDGV V Y + LRG
Sbjct: 302 VAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKY---VSVSREELRG 358
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 359 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 418
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 419 SLLFDTIQTVFGAPINQKAF 438
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 4/152 (2%)
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGR--NLRGQEFGGMYCAILTVNSSVVSAGI 838
L A+++ H+CF+P+ +S+S RDL+ +++ R L F G Y +L N ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 839 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 898
+RV+G++VAE+PLV T G +L +EK L L V+ +VLPA WT
Sbjct: 1039 VRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRS 1098
Query: 899 FGFKKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
FGF K+ S + + F+G M QK
Sbjct: 1099 FGFAKMTNLERSQFLD--YTFLDFQGAIMCQK 1128
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 68/163 (41%), Gaps = 52/163 (31%)
Query: 565 AGRQYPG---KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMF 621
+G+ + G +ND +C++C GG L+ CD CP +FHK C L IP GDW+C C
Sbjct: 628 SGKSFSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGI 687
Query: 622 ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP 681
+R + D D VEQ
Sbjct: 688 CGQRKIDRD------------DEVEQ---------------------------------- 701
Query: 682 RTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
+L C QCE ++HV CL ++ AD+ G WFC DC ++
Sbjct: 702 --LLPCIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKL 741
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 86/172 (50%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYGQETGSSQTSSGVVSEQQVAKPSLEPVPKNKAETKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVTY---VSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 222 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 281
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 282 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 338
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 339 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 390
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENGK++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 3/140 (2%)
Query: 198 IAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRG 257
+A+ + T K ELK +K + N P V L TG+LDGV V Y I LRG
Sbjct: 341 VAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKY---ISVSREELRG 397
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L T+Q+ + +K+F
Sbjct: 458 SLLFDTIQTVFGAPIHQKAF 477
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ELK ++K + N P V L TG+LDGV V Y + LRGII+ G LC
Sbjct: 243 KNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKY---VSLSREELRGIIKGSGYLCG 299
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P ML +Q+
Sbjct: 300 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFDVIQTV 359
Query: 328 LSSLPEEKSF 337
+ +KSF
Sbjct: 360 FGAPINQKSF 369
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 82/159 (51%), Gaps = 3/159 (1%)
Query: 179 VTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLD 238
V Q E +++ + +A+ + + E+K +K + N P V L TG+LD
Sbjct: 251 VQQKEFDASDAHATASNTRVAKSKTESVSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLD 310
Query: 239 GVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYIC 298
GV V Y I LRG+I+ G LCSC CN +V+ +FE HA + + + +I
Sbjct: 311 GVPVKY---IALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIY 367
Query: 299 FENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
FENGK++ ++++ RS P ML +Q+ + +KSF
Sbjct: 368 FENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSF 406
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 88/172 (51%), Gaps = 3/172 (1%)
Query: 166 SAMKPKVEPVEVLVTQSEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKKP 225
S ++ + ++++ Q G S ++ + +A+ S + PK E K SKK + P
Sbjct: 232 SYVQDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFP 291
Query: 226 MTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIH 285
V L TG+LDGV V Y + LRG+I+ G LC C C+ +V+ FE H
Sbjct: 292 SNVRSLISTGMLDGVPVKY---VSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERH 348
Query: 286 ACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
A + + + +I FENG+++ ++++ R+ P +L +Q+ S +K+F
Sbjct: 349 AGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAF 400
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 72/133 (54%), Gaps = 3/133 (2%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGI 264
++ K E KMSKK + N P V L TG+LDGV V Y + L GII+ G
Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGY 388
Query: 265 LCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATL 324
LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +
Sbjct: 389 LCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAI 448
Query: 325 QSALSSLPEEKSF 337
Q+ S +KSF
Sbjct: 449 QTVTGSPINQKSF 461
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 89.7 bits (221), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K E KMSKK + N P V L TG+LDGV V Y + L GII+ G LC
Sbjct: 265 KNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY---VSLSREELHGIIKGSGYLCG 321
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P +L +Q+
Sbjct: 322 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 381
Query: 328 LSSLPEEKSF 337
S +KSF
Sbjct: 382 TGSPINQKSF 391
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 428 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 485
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 486 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVT 545
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 546 GSTINQKNF 554
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 74/140 (52%), Gaps = 7/140 (5%)
Query: 205 TSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYM-------GGIKFQASGLRG 257
++ K E KMSKK + N P V L TG+LDGV V Y+ G I L G
Sbjct: 443 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHG 502
Query: 258 IIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPL 317
II+ G LC C CN +V+ +FE HA + + + +I FENGK++ ++++ RS P
Sbjct: 503 IIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 562
Query: 318 PMLKATLQSALSSLPEEKSF 337
+L +Q+ S +KSF
Sbjct: 563 SLLFBAIQTVTGSPINQKSF 582
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 80/144 (55%), Gaps = 6/144 (4%)
Query: 790 DCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVA 847
+CF PI+D +G +LI ++VY G N +F G Y IL +++A +R+ G ++A
Sbjct: 3 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 62
Query: 848 ELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DP 906
E+P + T + +G + L IE +LS L V+ +++PA E WT KFGF + D
Sbjct: 63 EMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDS 122
Query: 907 ELLSIYRKRCSQLVTFKGTSMLQK 930
E + K S LV F GT +LQK
Sbjct: 123 EKQEV--KSISMLV-FPGTGLLQK 143
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 82/150 (54%), Gaps = 6/150 (4%)
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 841
AV I H+CF I++ + D+ +V+ R LR F G Y +L +VS G R+
Sbjct: 11 AVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGTFRI 70
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
GQ+ AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 71 CGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGF 130
Query: 902 KKI-DPELLSIYRKRCSQLVTFKGTSMLQK 930
+ + E L + + +++F+GT+M QK
Sbjct: 131 TVMSNSERLELAG---NSILSFQGTTMCQK 157
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 417 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 474
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 475 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 534
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 535 QEMLFDAIQNVTGSPINQKNF 555
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 197 AIAEGSALTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLR 256
AI + + K+ E +MSKK+ N P V L TG+LDGV V Y+ + L+
Sbjct: 396 AIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSR--EKNLK 453
Query: 257 GIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP 316
GII+ G LCSC CN + + +FE HA + + + +I FENGK++ V++ ++ P
Sbjct: 454 GIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTP 513
Query: 317 LPMLKATLQSALSSLPEEKSF 337
ML +Q+ S +K+F
Sbjct: 514 QEMLFDAIQNVTGSPINQKNF 534
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 72/134 (53%), Gaps = 2/134 (1%)
Query: 204 LTSPKKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGG 263
+ S KN ELK SKK+ N P V L TGLLDGV V Y+ + + L GII+ G
Sbjct: 432 IDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT--LEGIIKGTG 489
Query: 264 ILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKAT 323
LC C C + + +FE HA + + + +I FENGK++ V++ ++ P +L
Sbjct: 490 YLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGVLFNA 549
Query: 324 LQSALSSLPEEKSF 337
+Q+ S +K+F
Sbjct: 550 IQTVTGSHINQKNF 563
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 86.3 bits (212), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 119/269 (44%), Gaps = 36/269 (13%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKL 739
+L+CD+C FH C+ L P+G WFC C C + L
Sbjct: 423 LLMCDRCPSMFHHACVG------LESTPQGDWFCPACTCAICG-------------SSDL 463
Query: 740 PEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAIFHDCFDPIVDSI 799
+ + + +S R G+ E L A+ + +CF +++
Sbjct: 464 DDPPATTTTQGFSSDRMVISCEQCRRESRDGEE---EEHAKLCMALDVLRECFVTLIEPR 520
Query: 800 SGRDLIPSMVYG--RNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKI 857
+ DL +V+ LR +F G Y L +++ LRV+G+EVAE+PLV T
Sbjct: 521 TQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGTRFA 580
Query: 858 NHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTD-KFGFKKIDPELLSIYRKRC 916
+G +LL I+KLL + V+ +VLPA E + WT FG + E+ R+
Sbjct: 581 RRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIR----EMGQADRQDV 636
Query: 917 SQ--LVTFKGTSMLQKRVPA-CRIGSSST 942
+ ++ F+GT M K++P ++G ++T
Sbjct: 637 AHHAILRFQGTIMCHKQLPPQPQLGHTTT 665
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+D+D +C++C D G LL CD CP FH C L S PQGDW+C C
Sbjct: 408 EDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCPAC 453
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 85.9 bits (211), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 82/156 (52%), Gaps = 4/156 (2%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P + EL TGLL+G+ V+Y+ +A L+G+I I C C CNG + +
Sbjct: 403 LTKHPGNIRELLNTGLLEGMPVMYIIPHSKKAV-LKGVITGCNIRCFCLSCNGSKAVSAY 461
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACV 340
FE HA + + YI NG SL +VLRA PL L+ T++S++ + + S C+
Sbjct: 462 YFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCL 521
Query: 341 RCKGTFPITCVGKTGPGPLCNSCVKSKKPQGTMTYT 376
C P+ ++ LC C++SK+PQ +T +
Sbjct: 522 NCNE--PVLPSSQSE-NVLCQVCLESKQPQDPLTAS 554
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 70/139 (50%), Gaps = 17/139 (12%)
Query: 532 ACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCD 591
+ G++ ++GY I C+ CN VS ++ R + L + G N PC
Sbjct: 583 SAGKRKVDGYIKDQRIYCNHCNRVVSLFSHLSYFFRL--AHQHLKLMRVRDQGAN--PCV 638
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKR 651
G L +P +WYC C N+ ++++ L + NA AGR +GVDS+EQI KR
Sbjct: 639 G-----------LRKVP-SEWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKR 686
Query: 652 CIRIVKNLEAELSGCLLCR 670
IRIV + +L GC LC+
Sbjct: 687 AIRIVP-ISDDLGGCALCK 704
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 83/161 (51%), Gaps = 8/161 (4%)
Query: 182 SEGFGNESMSLIEVEAIAEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGL 236
SE G + + A S +T K ++ LK + + K+ P V L TG+
Sbjct: 279 SEADGVPELESSSFDVPASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGM 338
Query: 237 LDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQY 296
LDGV V Y+ + + LRGII+ G LC C CN +++ +FE HA + + + +
Sbjct: 339 LDGVPVKYVSVTREE---LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNH 395
Query: 297 ICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
I FENGK++ ++++ RS P +L T+Q+ + +KSF
Sbjct: 396 IYFENGKTIYQIVQELRSTPESLLFDTIQTIFGAPINQKSF 436
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK N P V L TG+ DGV V Y+ + ++ L+GII+ G LCSC
Sbjct: 429 KNKEPKTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKS--LKGIIKGTGYLCSC 486
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ ML +Q+
Sbjct: 487 DNCNQSKALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVT 546
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 547 GSTINQKNF 555
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 78/144 (54%), Gaps = 8/144 (5%)
Query: 199 AEGSALTSPKKNLELKMSKKISLNKK-----PMTVTELFETGLLDGVSVVYMGGIKFQAS 253
A S +T K ++ LK + + K+ P V L TG+LDGV V Y+ + +
Sbjct: 302 ASSSQITKQKPDITLKNRPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE-- 359
Query: 254 GLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACR 313
LRGII+ G LC C CN +++ +FE HA + + + +I FENGK++ ++++ R
Sbjct: 360 -LRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELR 418
Query: 314 SVPLPMLKATLQSALSSLPEEKSF 337
S P +L T+Q+ + +KSF
Sbjct: 419 STPESLLFDTIQTIFGAPINQKSF 442
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN E K +KK S N P V L TG+ DG+ V Y + L+G+I+ G LCSC
Sbjct: 386 KNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYC--TWSREKNLQGVIKGTGYLCSC 443
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
+C G + + +FE HA + + + +I FENGKS+ V++ ++ P ML +Q+
Sbjct: 444 DICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQEMLFDAIQTVT 503
Query: 329 SSLPEEKSF 337
+ +++F
Sbjct: 504 GATINQRNF 512
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 84.0 bits (206), Expect = 4e-13, Method: Composition-based stats.
Identities = 42/73 (57%), Positives = 51/73 (69%), Gaps = 1/73 (1%)
Query: 861 KGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQLV 920
+GYFQ LF CIE+LL+ L+VK VLPAA+EAESIWT +FGF KI + L Y K +
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKG-GRTT 61
Query: 921 TFKGTSMLQKRVP 933
F+GTS L K VP
Sbjct: 62 VFQGTSTLHKLVP 74
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 71/129 (55%), Gaps = 2/129 (1%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
KN + + SKK++ N P V L TG+LDGV V Y+ + L+G+I+ G LC C
Sbjct: 339 KNKDGRPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSR--EKNLKGLIKGAGYLCGC 396
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN + + +FE HA + + + +I FENGK++ V++ ++ P ML +Q+
Sbjct: 397 QECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEMLFEAIQTVT 456
Query: 329 SSLPEEKSF 337
S +K+F
Sbjct: 457 GSPINQKNF 465
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 82.4 bits (202), Expect = 1e-12, Method: Composition-based stats.
Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%)
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGI 838
L A + H+CF +V+ + DL +V+ R LR F G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 839 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDK 898
LRV+G++VAELPLV T + +G LL +E LL V+ +VLPA E WT
Sbjct: 69 LRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQTWTGS 128
Query: 899 FGFKKI-DPELLSIYRKRCSQLVTFKGTSMLQK 930
FGF+ + + L I + ++ F+GT+M K
Sbjct: 129 FGFQVMTQSQKLDIAQH---TIMCFQGTTMCHK 158
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 81.6 bits (200), Expect = 2e-12, Method: Composition-based stats.
Identities = 52/135 (38%), Positives = 75/135 (55%), Gaps = 10/135 (7%)
Query: 803 DLIPSMVYGRNLRGQ-EFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGK 861
D I +MV ++ G+ +F G+YCA+LT ++ VVSA IL+V +EVAEL L+AT K
Sbjct: 104 DDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRTEEVAELVLIATHNECRKK 163
Query: 862 GYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKR----CS 917
GYF LL + IE L V+ + P E IW++K G+ +LS +K
Sbjct: 164 GYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYT-----ILSDEQKHSMLMAH 218
Query: 918 QLVTFKGTSMLQKRV 932
LV F S++QK +
Sbjct: 219 PLVMFANLSLVQKSL 233
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 1/134 (0%)
Query: 209 KNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSC 268
K+ + K +KK S N P V L TG+ DGV+V Y + + L+GII+ G LC C
Sbjct: 367 KSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR-EVRNLKGIIKGTGYLCGC 425
Query: 269 SLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSAL 328
CN RV+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+
Sbjct: 426 GNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVT 485
Query: 329 SSLPEEKSFACVRC 342
S K+F +
Sbjct: 486 GSDINHKNFNTWKA 499
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 3/130 (2%)
Query: 208 KKNLELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCS 267
K ++K ++K S N P V L TG+LDGV V Y+ + + LRGII+ LC
Sbjct: 293 KNKQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE---LRGIIKGTTYLCG 349
Query: 268 CSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSA 327
C CN + + +FE HA + + + +I FENGK++ ++++ RS P L T+Q+
Sbjct: 350 CQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQTI 409
Query: 328 LSSLPEEKSF 337
+ +K+F
Sbjct: 410 FGAPINQKAF 419
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 105/250 (42%), Gaps = 63/250 (25%)
Query: 728 LQNLLVQEAEKLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRLLLSQAVAI 787
LQNLL + + PE+ +++ E V +D R E ++ A+++
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEVPEEVLALDKR----------VECNSKIAVALSL 1291
Query: 788 FHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGILR----- 840
+CF PIVD +G +LI ++VY G N +F G Y IL +++A +R
Sbjct: 1292 MDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLKEKN 1351
Query: 841 ------------------------------------VFGQEVAELPLVATSKINHGKGYF 864
+ G ++AE+P + T + +G
Sbjct: 1352 ILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQGMC 1411
Query: 865 QLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDP----ELLSIYRKRCSQLV 920
+ L IE +LS L ++ +++PA E WT KFGF +D E+ S+ ++
Sbjct: 1412 RRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKSV------SML 1465
Query: 921 TFKGTSMLQK 930
F GT +LQK
Sbjct: 1466 VFPGTGLLQK 1475
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 61/149 (40%), Gaps = 47/149 (31%)
Query: 537 LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQ---------------------------- 568
LLEG+ GI C CC+ +S +F HAG +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAGSEVNKPYRNILVDGLDIDLLHCLINAWNMQ 986
Query: 569 --------YP----GKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
+P G D NDD C IC DGGNL+ CDGCP FH C L ++P W C
Sbjct: 987 SDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWCCS 1046
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDS 644
C F H+ ++ +A + VDS
Sbjct: 1047 NCSCKF------CHEHSSDDAEDTADVDS 1069
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 103/254 (40%), Gaps = 38/254 (14%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQC-EREFHVGCLKKHKMADLR-ELPKGK--WFCCMDC 721
C C G D G R ++LC C H GC + L E+ G +FC +C
Sbjct: 16 CTHCGGGDVEPEGR--RVLVLCSACFAAGTHTGCHEDVTGEPLSSEITHGDGLYFCGKEC 73
Query: 722 SRINSVLQNLLVQEAE--KLPEFHLNAIKKYAGNSLETVSDIDVRWRLLSGKAATPETRL 779
R L+ + + PE + + Y + S ++ R+
Sbjct: 74 QRSYEALEAATGRRSRIRDEPEQYTFELVHYKQDDRTVRSAVETAMRM------------ 121
Query: 780 LLSQAVAIFHDCFDPIVDSISGRDLI---------PSMVYGRNLRGQEFGGMYCAILTVN 830
F F P++ +GRDL+ P G F AIL +
Sbjct: 122 --------FRTSFAPLIME-NGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFRLAILRMG 172
Query: 831 SSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEE 890
++++A LRVFG + AE+P V+T + + G+ + L +E LL V +V+P+ E
Sbjct: 173 GTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCLVIPSINE 232
Query: 891 AESIWTDKFGFKKI 904
+WT+KFGF KI
Sbjct: 233 LLPMWTNKFGFAKI 246
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 73/165 (44%), Gaps = 21/165 (12%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 376 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCE-----KEGIQWE 430
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + I++ +R+ E E + CR C K G +L CD
Sbjct: 431 AKDEDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVC---KDG---GELLCCDT 484
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 732
C +H+ CL L E+P G+W C C C I +Q +L
Sbjct: 485 CTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 524
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 566 GRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G + D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 458 GAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 511
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 28/46 (60%), Positives = 33/46 (71%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
+ N D C IC DGG+LL CD CPR+FH +C L SIP+ DWYCK C
Sbjct: 118 RQNRDHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRC 163
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + Q + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQRN-LKGMIKGTGYLCGCGNC 444
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 445 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 504
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 505 INHKNFNTWKA 515
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 74.7 bits (182), Expect = 2e-10, Method: Composition-based stats.
Identities = 36/97 (37%), Positives = 46/97 (47%), Gaps = 29/97 (29%)
Query: 547 IICHCCNSEVSPSQFEAHAG---RQYPGKD--------------------------NDDL 577
I+C CC E+SPSQFE+HAG R+ P + DD+
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC 614
C DGG+L+ C CPRAFH C L P+G W+C
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 69/164 (42%), Gaps = 28/164 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-----KEGVQWE 413
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + +I V + E E CR C K G +L CD C
Sbjct: 414 AK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC---KDGG---ELLCCDAC 457
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 458 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKIQKILT 497
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 73.9 bits (180), Expect = 4e-10, Method: Composition-based stats.
Identities = 34/71 (47%), Positives = 46/71 (64%)
Query: 600 ECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
+C LSS +G W C+YC+N +R+ L ++ NA+ AGRV GVD++EQI R IRI L
Sbjct: 4 KCVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTL 63
Query: 660 EAELSGCLLCR 670
E GC LC+
Sbjct: 64 ETGFGGCALCK 74
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 72/165 (43%), Gaps = 31/165 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 364 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCE-----KEGIQWE 418
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A R G D E ++E + CR C K G +L CD C
Sbjct: 419 A------REDGSDGEEDNGD-----AGDMEEDDHHMEFCRVC---KDG---GELLCCDSC 461
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+H+ CL L E+P G+W C CM C + +Q +L
Sbjct: 462 PSSYHIHCLN----PPLPEIPNGEWICPRCM-CPPMKGKVQKILT 501
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 66/131 (50%), Gaps = 2/131 (1%)
Query: 212 ELKMSKKISLNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLC 271
+ K +KK S N P V L TG+ DGV+V Y + L+G+I+ G LC C C
Sbjct: 386 DTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSR--ERNLKGMIKGTGYLCGCGNC 443
Query: 272 NGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSL 331
+V+ +FE HA + + + +I FENGK++ V++ ++ P L +Q+ S
Sbjct: 444 KLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAIQNVTGSD 503
Query: 332 PEEKSFACVRC 342
K+F +
Sbjct: 504 INHKNFNTWKA 514
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 71/164 (43%), Gaps = 19/164 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCE-----KEGIQWE 443
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A E I++ + + E E + CR C K G +L CD
Sbjct: 444 AKDEEFEDFEEDSEDRVISEVSLGVPTGAEEEDDDHMEFCRVC---KDG---GELLCCDT 497
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
C +H+ CL L E+P G+W C C I +Q +L
Sbjct: 498 CTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKGRVQKIL 537
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 566 GRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G + D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 471 GAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 524
>gi|154309635|ref|XP_001554151.1| hypothetical protein BC1G_07288 [Botryotinia fuckeliana B05.10]
Length = 765
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 84/196 (42%), Gaps = 53/196 (27%)
Query: 577 LCTICADGGNLLP-----CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+C IC ++ P CDGCP H++C S+ IP+GDW+CK CQ +
Sbjct: 293 VCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARAAE 352
Query: 632 NAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
N GV DS ++I+ C +C+G D K P I+LC+ C
Sbjct: 353 N-------DGVTDSDDEIS----------------CAVCQGLDSEK----PNEIILCENC 385
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP---------- 740
+ H C ++ + P+G+W C C ++ V +LL +E + P
Sbjct: 386 DYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLGPISNEVPSIEG 436
Query: 741 -EFHLNAIKKYAGNSL 755
E HL +++ N L
Sbjct: 437 FETHLKTMQRVLLNRL 452
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 51/93 (54%), Gaps = 11/93 (11%)
Query: 506 TPKDQRLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHA 565
+ +D LHK +F GL DGTE+GYYA Q +L+G K G GI C CCN E++ S FE HA
Sbjct: 507 SARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITCSAFERHA 566
Query: 566 ---GRQYP--------GKDNDDLCTICADGGNL 587
RQ P G+ D+C AD L
Sbjct: 567 RCEARQNPYGSILLADGRSLKDMCKELADQSKL 599
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 12/157 (7%)
Query: 221 LNKKPMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPS 280
L K P EL TGLL+G V + L GI +D G++C+C +C G +V+ S
Sbjct: 308 LTKPPRNAKELMATGLLEGHYVH----CSCRGEQLTGIFQDMGVVCNCRICKGTQVVSIS 363
Query: 281 KFEIHACKQYRRASQYICFENGKSLLEVLRACRSVP---LPMLKATLQSALSSLPE-EKS 336
FE H+ S I ENGK+L ++L A + +L+A LQ A+ + K
Sbjct: 364 AFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNILRA-LQHAIGEIQGISKE 422
Query: 337 FACVRC---KGTFPITCVGKTGPGPLCNSCVKSKKPQ 370
CV+C +G I+C G CV K P
Sbjct: 423 MTCVKCGKHEGGEFISCKGAKCSAAYHAECVGVKSPH 459
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 60/156 (38%), Gaps = 52/156 (33%)
Query: 520 SGLPDGTEVGYYACGQKLLEGYKNGLGIICHC--CNSE--VSPSQFEAHAGR--QYP--- 570
+GL +G V + +C + L G +G++C+C C VS S FEAH+G +P
Sbjct: 321 TGLLEGHYV-HCSCRGEQLTGIFQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSDN 379
Query: 571 -----GKDNDDL---------------------------------CTICA--DGGNLLPC 590
GK+ D+ C C +GG + C
Sbjct: 380 IYLENGKNLRDILSAGQESADCGDNILRALQHAIGEIQGISKEMTCVKCGKHEGGEFISC 439
Query: 591 DG--CPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
G C A+H EC + S DW+C C+ RK
Sbjct: 440 KGAKCSAAYHAECVGVKSPHLEDWFCAKCEKTQARK 475
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 69/164 (42%), Gaps = 28/164 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-----KEGVQWE 412
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + +I V + E E CR C K G +L CD C
Sbjct: 413 AK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCRVC---KDG---GELLCCDAC 456
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 457 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKIQKILT 496
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 71/164 (43%), Gaps = 19/164 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 272 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCE-----KEGIQWE 326
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A E I++ + + E E + CR C K G +L CD
Sbjct: 327 AKDEEFEDFEEDSEDRVISEVSLGVPMGAEEEDDDHMEFCRVC---KDGG---ELLCCDT 380
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
C +H+ CL L E+P G+W C C I +Q +L
Sbjct: 381 CTSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKGRVQRIL 420
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 566 GRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G + D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 354 GAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 407
>gi|347838360|emb|CCD52932.1| hypothetical protein [Botryotinia fuckeliana]
Length = 886
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 84/196 (42%), Gaps = 53/196 (27%)
Query: 577 LCTICADGGNLLP-----CDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+C IC ++ P CDGCP H++C S+ IP+GDW+CK CQ +
Sbjct: 293 VCEICTKPDSVAPNKILFCDGCPLIVHQKCYSVPKIPEGDWFCKKCQKARVAAEAARAAE 352
Query: 632 NAVEAGRVSGV-DSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
N GV DS ++I+ C +C+G D K P I+LC+ C
Sbjct: 353 N-------DGVTDSDDEIS----------------CAVCQGLDSEK----PNEIILCENC 385
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLP---------- 740
+ H C ++ + P+G+W C C ++ V +LL +E + P
Sbjct: 386 DYAVHQSC------GNIPKKPRGEWLCET-C--VSDVDHDLLDREIDLGPISNEVPSIEG 436
Query: 741 -EFHLNAIKKYAGNSL 755
E HL +++ N L
Sbjct: 437 FETHLKTMQRVLLNRL 452
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 38/65 (58%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
+S V P+ E H G + P D C++C GG LL CDGCPRAFH C L IP+ +W
Sbjct: 1170 SSPVRPAHVENHTGPKNPDDQWDVDCSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEW 1229
Query: 613 YCKYC 617
+C C
Sbjct: 1230 FCNEC 1234
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRF 626
C IC DGG LL CD CPRAFH C +S IP +WYCK C +R+R
Sbjct: 161 CNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMCSECLDRRRL 211
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Query: 565 AGRQYPGKDN-DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
AGR D ++ C IC++GG L+ CDGCP FH C L +P+G +C C
Sbjct: 1048 AGRTRESVDEWEEDCYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQ 618
GC R FH +CA L ++P DWYCK C+
Sbjct: 1263 GCDRVFHLKCAKLDAVPADDWYCKKCR 1289
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 76/165 (46%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + + P+G W C +C+ K +Q +
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCE-----KEGIQWE 396
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 397 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 440
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 441 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 481
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 2/118 (1%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 448 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 505
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRC 342
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F +
Sbjct: 506 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQTWKA 563
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 45/84 (53%), Gaps = 9/84 (10%)
Query: 569 YPGK------DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ-N 619
YP K NDD C +C DGG L+ CDGCPRAFH C L+SIP+G W C+ CQ N
Sbjct: 280 YPQKMTFAVEHNDDECAVCKDGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLCQSN 339
Query: 620 MFERKRFLQHDANAVEAGRVSGVD 643
+ + + A E S VD
Sbjct: 340 RLKDRTYTHVQPPATETSSGSAVD 363
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 419
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 420 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 463
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 464 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 504
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 416 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 459
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 460 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 69/141 (48%), Gaps = 13/141 (9%)
Query: 225 PMTVTELFETGLLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEI 284
P+ V L TG+ DGV V Y+ + ++ +RG+I+ G LCSC CN + +FE
Sbjct: 473 PLNVKSLLSTGMFDGVPVKYVSWTREKS--VRGVIKGSGYLCSCKDCNSSNCLNAYEFER 530
Query: 285 HACKQYRRASQYICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSFACVRCKG 344
HA + + + +I FENGK++ V++ ++ P L +Q+ +K+F K
Sbjct: 531 HANCKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTGCPINQKNFQT--WKA 588
Query: 345 TFPITCV---------GKTGP 356
++ V G TGP
Sbjct: 589 SYQAATVELQRIYGKDGATGP 609
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 96/219 (43%), Gaps = 31/219 (14%)
Query: 684 ILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKL-- 739
++ CD C FH CL + H M D L + + C + I ++ V + K+
Sbjct: 650 LVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGIAPIVHANSVGQLLKMLL 708
Query: 740 -PEFHLNA----------IKKYAGNSLETVSDIDVRWRLLSGKAATPETRL--------- 779
P + A +KKY G E + W L+ + A + L
Sbjct: 709 RPRMQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRECADSDLFLGEHPHIVEN 766
Query: 780 --LLSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVS 835
L+ A+ + +CF PIVD SG +++ +++Y G N FGG Y A+L VV+
Sbjct: 767 NSKLALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVA 826
Query: 836 AGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKL 874
+ +R G +AE+P + T + +G + LF+ IE +
Sbjct: 827 SASIRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESV 865
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 104/260 (40%), Gaps = 64/260 (24%)
Query: 410 KKTRKLLEADLVSKSSSKSVSL------RNLLKTRSPW--ELTRNSSRPGLIANSTPV-- 459
+KT++ +E D+ S+ S S S+ RN L ++ + +R+SSR +++P+
Sbjct: 423 RKTKRKIEKDMKSEEQSASDSVGKATFARNFLAIKNEVGNDDSRDSSRGTTSKSASPLHH 482
Query: 460 ---TSVHKSSQSQRQRKITKKSKKTVLISKPF--ENASPPLSFPNKSRWNITPKDQRLHK 514
S SS K +K + T+ + +P +N+ P+ + Q +
Sbjct: 483 QTEKSTGSSSHHVDGGKSSKHVRSTLSVRRPVRGDNSEGDGFVPSSEK-------QTILA 535
Query: 515 LVFDESGLPDGTEVGYYAC--GQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGR----- 567
+ D L +V Y +LEG+ GI C CC+ +S S+FE HAG
Sbjct: 536 WLIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHAGSKLRQP 595
Query: 568 ---------------QYPGKD--------------------NDDLCTICADGGNLLPCDG 592
Q D NDD C IC DGG+L+ CDG
Sbjct: 596 FQNIFLNTGVSLFQCQIDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDG 655
Query: 593 CPRAFHKECASLSSIPQGDW 612
CP FH+ C + DW
Sbjct: 656 CPSTFHQRCLDIRGHLMPDW 675
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 350 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 404
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 405 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 448
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 449 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 489
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 419
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 420 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 463
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 464 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 504
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 355 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 409
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 410 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 453
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 454 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 494
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 416 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 459
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 460 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 303 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 357
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 358 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 401
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 402 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 442
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 416 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 459
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 460 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 420
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 421 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 464
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 465 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 505
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 412
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 413 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 456
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 457 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 497
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 76/166 (45%), Gaps = 31/166 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 416 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 459
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C C + +Q +L+
Sbjct: 460 CPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKILI 500
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 72/165 (43%), Gaps = 31/165 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ M + + D
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWE-ARED 409
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A+ E +G ++ E + + C +C K G +L CD C
Sbjct: 410 ASEGEEDNEAGGEAEED------------DHHMEFCRVC------KDGG---ELLCCDSC 448
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C C + +Q +L
Sbjct: 449 PSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKILT 488
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 72/165 (43%), Gaps = 31/165 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ M + + D
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWE-ARED 409
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A+ E +G ++ E + + C +C K G +L CD C
Sbjct: 410 ASEGEEDNEAGGEAEED------------DHHMEFCRVC------KDGG---ELLCCDSC 448
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C C + +Q +L
Sbjct: 449 PSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKILT 488
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 74/165 (44%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 416
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E +V + E E + CR C K G +L CD
Sbjct: 417 AKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRVC---KDG---GELLCCDA 460
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 501
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 74/165 (44%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 423
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E +V + E E + CR C K G +L CD
Sbjct: 424 AKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRVC---KDG---GELLCCDA 467
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 508
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 74/165 (44%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 423
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E +V + E E + CR C K G +L CD
Sbjct: 424 AKEDNS---EGEEILED-------VVGDAEEEDDHHMEFCRVC---KDG---GELLCCDA 467
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
C +H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 CPSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 508
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 65/133 (48%), Gaps = 10/133 (7%)
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLR---------GQEFGGMYCAILTVNS 831
+ QA+ +F F P++ +GRDL+ + G G F G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMD-NGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRG 279
Query: 832 SVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEA 891
+VV+A LRVFG+ AELP VAT + G + L +E LL V +V+P+ +
Sbjct: 280 AVVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPL 339
Query: 892 ESIWTDKFGFKKI 904
+W KFGF +
Sbjct: 340 LPMWAAKFGFTPL 352
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 37/57 (64%), Gaps = 2/57 (3%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQ 628
NDD CT+C DGG L+ CDGCPRAFH C LSSIP G W C++C+ +K Q
Sbjct: 259 NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCRGHRVKKEKAQ 315
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 72/168 (42%), Gaps = 37/168 (22%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM---FERKRFL 627
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ M +E +
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAR--- 408
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
DA+ E +G ++ E + + C +C+ +L C
Sbjct: 409 -EDASEGEEDNEAGGEAEED------------DHHMEFCRVCKDGG---------ELLCC 446
Query: 688 DQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
D C +H+ CL L E+P G+W C C C + +Q +L
Sbjct: 447 DSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKILT 489
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 68/115 (59%), Gaps = 4/115 (3%)
Query: 776 ETRLLLSQAVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRG--QEFGGMYCAILTVNSSV 833
E R+ LS A+ + H C++P+ DS +G D++P ++ G L G ++ GM+ A+L +
Sbjct: 420 ELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGPAA 479
Query: 834 VSAGILRVFGQEVAELPLVAT-SKINHGKGYFQLLFACIEKLLSFLRVKSIVLPA 887
V+ + R FG ++AE+P++A ++ G +LL A +E+LL K+I PA
Sbjct: 480 VAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPA 533
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
DNDDLC IC GG+L+ C+ CP FH C L++ P+GD++C C+
Sbjct: 117 DNDDLCHICGLGGDLMCCETCPGVFHAACLGLAAPPEGDYHCPLCR 162
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 68/148 (45%), Gaps = 28/148 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L E+P G+W C
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 69/162 (42%), Gaps = 29/162 (17%)
Query: 558 PSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCK 615
P EA G Y D+ D C +C GG ++ CD CPRA+H C + P+G W C
Sbjct: 364 PPLSEAEDGDGYE-TDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCP 422
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
+C+ K +Q +A R D+ E + +E + CR C
Sbjct: 423 HCE-----KEGIQWEA------REDVSDAEEDNGE-----TGEMEEDDHHMEFCRVC--- 463
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
K G +L CD C +H+ CL L E+P G+W C
Sbjct: 464 KDG---GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWIC 498
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 458 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 501
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 33/149 (22%)
Query: 571 GKDNDDL--CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQ 628
G ++D C +C DGG+L+ CD CP++FH+ C +L+ IP GDW C C L
Sbjct: 451 GNESDHFWYCEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTG-----EGLP 505
Query: 629 HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 688
D ++ + + E + ++ K RG D ++LCD
Sbjct: 506 EDGDSSNSAQEEEEGEEETEHDQFCKVCK------------RGGD----------VILCD 543
Query: 689 QCEREFHVGCLKKHKMADLRELPKGKWFC 717
C FH+ CL L E+P+G W C
Sbjct: 544 FCSCVFHLRCLN----PPLGEVPEGDWKC 568
Score = 46.2 bits (108), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
+D C +C GG+++ CD C FH C + L +P+GDW C C+
Sbjct: 526 HDQFCKVCKRGGDVILCDFCSCVFHLRCLNPPLGEVPEGDWKCPRCK 572
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 68/148 (45%), Gaps = 28/148 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G + +E++ +LE E + CR C K G +L CD
Sbjct: 423 AKEDNS---EGEEILEEVG-------GDLEEEDDHHMEFCRVC---KDG---GELLCCDT 466
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L E+P G+W C
Sbjct: 467 CPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGGDA------EEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 67/148 (45%), Gaps = 28/148 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 417
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + G +++E V + E E + CR C K G +L CD
Sbjct: 418 AKEDNS---EGEETMED-------AVGDAEEEDDHHMEFCRVC---KDG---GELLCCDA 461
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L E+P G+W C
Sbjct: 462 CPSSYHIHCLN----PPLPEIPNGEWLC 485
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 69/164 (42%), Gaps = 28/164 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCE-----KEGVQWE 413
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + ++ V + E E CR C K G +L CD C
Sbjct: 414 AK----------EDNSELDDDLDDAVGDPEEEDHHMEFCRVC---KDG---GELLCCDVC 457
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 458 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPLKGKIQKILT 497
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 396
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 397 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 441
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 442 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 481
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 403
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 404 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 448
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 449 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 488
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 419 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 473
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 474 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 518
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 519 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 558
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 416 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 416 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 222 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 276
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 277 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 321
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 322 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 361
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 416 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 416
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 417 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 461
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 462 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 501
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 426
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 427 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 471
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 472 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 511
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 443
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 444 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 488
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 489 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 528
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 18/148 (12%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHCE-----KEGIQWE 432
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
A + I++ + + E + CR C K G +L CD
Sbjct: 433 AKDEDFEDFEEDSEDRVISEVSSGVPAGGDDEDDDHMEFCRVC---KDG---GELLCCDT 486
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L E+P G+W C
Sbjct: 487 CTSSYHIHCLN----PPLPEIPNGEWLC 510
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 31/62 (50%), Gaps = 6/62 (9%)
Query: 562 EAHAGRQYPGKDNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCK 615
E +G G D DD C +C DGG LL CD C ++H C + L IP G+W C
Sbjct: 452 EVSSGVPAGGDDEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCP 511
Query: 616 YC 617
C
Sbjct: 512 RC 513
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 416 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 396
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 397 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 441
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 442 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 481
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 415
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 416 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 461 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 500
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 24/147 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 302 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 356
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A +S + ++ +R + + + + C +C+ +L CD C
Sbjct: 357 AR----DDLSDGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG---------ELLCCDTC 403
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 404 PSSYHIHCLN----PPLPEIPNGEWIC 426
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 554 SEVSPSQFEAHAGRQYPGKDNDD-----LCTICADGGNLLPCDGCPRAFHKECAS--LSS 606
++S + E R+ G + +D C +C DGG LL CD CP ++H C + L
Sbjct: 359 DDLSDGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPE 418
Query: 607 IPQGDWYCKYCQ 618
IP G+W C C+
Sbjct: 419 IPNGEWICPRCK 430
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 72/164 (43%), Gaps = 27/164 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L+
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPALKGKVQKILI 507
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 68/163 (41%), Gaps = 21/163 (12%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 385 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCE-----KEGIQWE 439
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + + + + + + + C +C+ +L CD C
Sbjct: 440 AKEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFCRVCKDGG---------ELLCCDSC 490
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C I +Q +L
Sbjct: 491 PSSYHIHCLN----PPLPEIPNGEWLCPRCTCPPIKGRVQKIL 529
>gi|156045475|ref|XP_001589293.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980]
gi|154694321|gb|EDN94059.1| hypothetical protein SS1G_09927 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 883
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 39/152 (25%)
Query: 572 KDNDDLCTICADGGNLLP-----CDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFERKR 625
++ D +C IC + P CDGCP A H++C S+ IP GDW+CK CQ N +
Sbjct: 285 EEEDPVCEICTKPDSKRPNLILFCDGCPLAVHQKCYSVPKIPDGDWFCKKCQRNRVAAEA 344
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTIL 685
++ +A+ DS ++I C +CRG D K P I+
Sbjct: 345 ARANENDALN-------DSDDEIK----------------CAVCRGLDSKK----PNEII 377
Query: 686 LCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
LC+ C+ H C D+ + P+ +W C
Sbjct: 378 LCENCDYAVHQTC------GDIPKKPREEWLC 403
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 76/176 (43%), Gaps = 33/176 (18%)
Query: 562 EAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
EA G Y D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+
Sbjct: 244 EAEDGDGYE-TDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCE- 301
Query: 620 MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 679
K +Q +A E G D+ + V +E + CR C K G
Sbjct: 302 ----KEGIQWEAR--EEGSEGDEDNGD---------VGEMEDD-HHMEFCRVC---KDG- 341
Query: 680 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+L CD C +H+ CL L E+P G+W C C C + +Q +L
Sbjct: 342 --GELLCCDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKILT 390
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 104/245 (42%), Gaps = 49/245 (20%)
Query: 410 KKTRKLLEADL------VSKSSSKSVSLRNLLKTRSPW--ELTRNSSRPGLIANSTPVT- 460
+KT+ +E D+ S S K+ RN L ++ + +R+SS+ + +P+
Sbjct: 437 RKTKSKIEKDMKRELHSASDSDGKATFARNFLAIKNEVGNDDSRDSSQGTTSKSESPLHH 496
Query: 461 SVHKSSQSQRQR----KITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLV 516
KS+ S R K +K + T+L+ + N P ++ L
Sbjct: 497 QTEKSTGSSSHRVDGGKSSKHGRSTLLVRRSVRG-------DNSESDGFVPSSEKRTVLA 549
Query: 517 F-DESGLPDGTEVGYYACGQK---LLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQY--P 570
+ +SG +E Y ++ +LEG+ GI C CC+ ++ S+FE HAG + P
Sbjct: 550 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAGSKLRQP 609
Query: 571 GKD-----------------------NDDLCTICADGGNLLPCDGCPRAFHKECASLSSI 607
++ NDD C IC DGG+L+ CDGCP FH+ C +
Sbjct: 610 FQNIFLNSGGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDGCPSTFHQRCLDIRGH 669
Query: 608 PQGDW 612
DW
Sbjct: 670 LMPDW 674
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 97/249 (38%), Gaps = 51/249 (20%)
Query: 684 ILLCDQCEREFHVGCL--KKHKMADLRELPKGKWFCCMDCSRI------NSVLQ------ 729
++ CD C FH CL + H M D L + + C + I NSV Q
Sbjct: 649 LVCCDGCPSTFHQRCLDIRGHLMPDWIFL-RFNYRCFLLVIGIAPIVHANSVRQLLKMLL 707
Query: 730 NLLVQEAEKLPEFHLN-AIKKYAGNSLETVSDIDVRWRL-----------LSGKAATPET 777
L VQ K +L+ +KKY G E + W L LSG E
Sbjct: 708 RLWVQIPAKCVRKNLSEGVKKYVGVKHEL--EAGFSWSLVHRECTNSDLSLSGHPHIVEN 765
Query: 778 RLLLSQAVAIFHDCFDPIVDSISG-------RDLIPSMVYG---------------RNLR 815
L+ A+ + +CF PI+D SG R+ + +G N
Sbjct: 766 NSKLALALTVMDECFLPIIDRRSGHCKKFCLRNFTTVIFFGISLCWFVCLYIAFRRSNFN 825
Query: 816 GQEFGGMYCAILTVNSSVVSAGILRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLL 875
FGG Y A+L +V++ +R G +AE+P + T + +G + LF+ +E +
Sbjct: 826 RLNFGGFYTALLERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESVS 885
Query: 876 SFLRVKSIV 884
S V +
Sbjct: 886 STADVAKLT 894
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 73/165 (44%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 287 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 341
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 342 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 386
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C C + +Q +L+
Sbjct: 387 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKILI 426
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 73/165 (44%), Gaps = 29/165 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 113 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 167
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 168 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 212
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C C + +Q +L+
Sbjct: 213 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKILI 252
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVYGRN--LRGQEFGGMYCAILTVNSSVVSAGILRV 841
A+ + H+ F I++ + RDL +V+ R LR F G Y + V
Sbjct: 11 ALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL--------------V 56
Query: 842 FGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGF 901
G++ AELPL+ T +G +LL +EKLLS L V+ ++LP + WT FGF
Sbjct: 57 CGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGSFGF 116
Query: 902 KKIDPELLSIYRKRCSQLVTFKGTSMLQK 930
++ ++ + +++F+GT+M QK
Sbjct: 117 TEMSYS--DRFQYAANIILSFQGTTMCQK 143
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 72/179 (40%), Gaps = 42/179 (23%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ------------- 618
NDD C+ C DGG+LL CD C ++FH C + L IP+GDWYC C+
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSCKYKKSKTNVTKSPS 124
Query: 619 -NMFERKRFLQHDANAVE---------------AGRVSGVDSVEQITKRCIRI----VKN 658
+ K + + + E + S VD++ + ++ KN
Sbjct: 125 TTIINNKTYFKESEQSPEEMSPPYLPISSSPIGSTMSSLVDNLSSVNPSTFQLPQEYTKN 184
Query: 659 LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
+ S L C C+ S S G IL C++C +H C+ + + W C
Sbjct: 185 VNRNSSKKLNCLVCEES-SNSG--DILQCNKCNAAYHSTCVDSSSLGNKT----SAWLC 236
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 65/147 (44%), Gaps = 26/147 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A + G + +E++ E + CR C K G +L CD C
Sbjct: 423 AKEDNS---EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTC 467
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 468 PSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 5/53 (9%)
Query: 570 PGKDND---DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P +++D + C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 441 PEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 71/164 (43%), Gaps = 30/164 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 410 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCE-----KEGIQWE 464
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E G D+ + V +E + CR C K G +L CD C
Sbjct: 465 AR--EEGSEGEDDNGD---------VGEMEDD-HHMEFCRVC---KDG---GELLCCDSC 506
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 507 PSSYHIHCLN----PPLPEIPNGEWICPRCTCPSLKGKVQRILT 546
>gi|297805014|ref|XP_002870391.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316227|gb|EFH46650.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 351
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 107/279 (38%), Gaps = 81/279 (29%)
Query: 688 DQCEREFHVGCLKKHKMADLRELPKGKWFC----CMDCSRINSVLQNLLVQEAEKLPEFH 743
D+ ER+ H+ ++ P G W C C C + E+ +
Sbjct: 132 DKVERDCHL---------HIQMFPHGDWHCPNCTCKFCRAV-----------VEECSQTL 171
Query: 744 LNAIKKYAG--NSLE------------TVSDIDVRWRLLSGKAATPETRLLLSQA----V 785
+KKY G + LE T SD +RW TP QA +
Sbjct: 172 FEGVKKYVGVKHELEARFSWSLVHRECTDSDFILRW--------TPSYCGKQFQAGHSSL 223
Query: 786 AIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFGQE 845
+ +CF PI+D SG G YC + L+ G
Sbjct: 224 TVMDECFLPIIDRRSG-------------------GKYC----------TKCPLQFHGNR 254
Query: 846 VAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKID 905
+AE+ + T + +G + LF+ +E L L+V+ +V+PA + +W KFGFK ++
Sbjct: 255 LAEMQFIGTRHVYRHQGMCRRLFSVVESTLQNLKVELLVIPATADLSHVWISKFGFKYVE 314
Query: 906 PELLSIYRKRCSQLVTFKGTSMLQKRVPACRIGSSSTDS 944
L R L+ F G +LQK + A R S+ D+
Sbjct: 315 DSLKK--ELRSMNLLAFPGIDVLQKELLAPRHAKSAADT 351
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 57/102 (55%), Gaps = 2/102 (1%)
Query: 236 LLDGVSVVYMGGIKFQASGLRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQ 295
+LDGV V Y+ + + LRGII+ G LC C CN +VI +FE HA + + +
Sbjct: 1 MLDGVPVKYIAWSREKE--LRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKHPNN 58
Query: 296 YICFENGKSLLEVLRACRSVPLPMLKATLQSALSSLPEEKSF 337
+I FENGK++ +++ +S P L +Q+ S +KSF
Sbjct: 59 HIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSF 100
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 66.2 bits (160), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 83/194 (42%), Gaps = 36/194 (18%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ--NMFERKRFLQHD 630
C +C N +L CDGC R +H C LS IPQGDW+C C + RKR
Sbjct: 978 CRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPTQLSPRKR----T 1033
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
VE D+ E++ + + + C +C P ++LCD C
Sbjct: 1034 KAPVEVSSEEEDDN-EKVDEDGDEDEEEEDLNQEVCNICE---------SPGELILCDFC 1083
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIK 748
+ FH+ C+ DL+ LP+G W C C+ + N L K+ N I+
Sbjct: 1084 PKSFHLDCI------DLKRLPRGTWKCPPCVLGKKKNKRGSPPLT----KVKVRSRNNIR 1133
Query: 749 KYAGNSLETVSDID 762
KY L TV+D+D
Sbjct: 1134 KY---DLATVTDVD 1144
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
N ++C IC G L+ CD CP++FH +C L +P+G W C C
Sbjct: 1064 NQEVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPC 1107
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 67/164 (40%), Gaps = 36/164 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 379
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E E R+ K+ G LLC CD C
Sbjct: 380 AKDDEEEEEEAPGEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 417
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 732
+H+ CL L E+P G+W C CM C + +Q +L
Sbjct: 418 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 456
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 63/147 (42%), Gaps = 26/147 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ + + +
Sbjct: 257 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 316
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G D E+ ++E CR C K G +L CD C
Sbjct: 317 SEGEEILEEVGGDPEEEDD-------HHME-------FCRVC---KDG---GELLCCDPC 356
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 357 PSSYHIHCLN----PPLPEIPNGEWLC 379
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C L IP G+W C +C
Sbjct: 339 EFCRVCKDGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 66/147 (44%), Gaps = 24/147 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 437
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A +S + ++ +R + + + + C +C+ +L CD C
Sbjct: 438 AK----DELSEGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGG---------ELLCCDTC 484
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 485 PSSYHIHCLN----PPLPEIPNGEWIC 507
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
+ C +C DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 467 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 511
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 67/164 (40%), Gaps = 36/164 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 403
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E E R+ K+ G LLC CD C
Sbjct: 404 AKDDEEEEDEVAGEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 441
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 732
+H+ CL L E+P G+W C CM C + +Q +L
Sbjct: 442 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 480
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 67/164 (40%), Gaps = 36/164 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHCE-----KEGIQWE 381
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E E R+ K+ G LLC CD C
Sbjct: 382 AKGEEEEEEEAAGEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 419
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 732
+H+ CL L E+P G+W C CM C + +Q +L
Sbjct: 420 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 458
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 24/147 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 384 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWE 438
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A +S + + +R + + + + C +C+ +L CD C
Sbjct: 439 AR----DDLSEAEGEDDDDRRDEGMEEEDDHHIEFCRVCKDGG---------ELLCCDTC 485
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 486 PSSYHIHCLN----PPLPEIPNGEWIC 508
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
+ C +C DGG LL CD CP ++H C + L IP G+W C C+
Sbjct: 468 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRCK 512
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 68/164 (41%), Gaps = 36/164 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 502 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 556
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E V E R+ K+ G LLC CD C
Sbjct: 557 AKDEEEDEEEPVGEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 594
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLL 732
+H+ CL L E+P G+W C CM C + +Q +L
Sbjct: 595 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 633
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYCQNMFERKRF 626
C IC DGG LL CD CPRAFH C +S IP +WYCK C +R+R
Sbjct: 161 CNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMCTECLDRRRL 211
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 77/191 (40%), Gaps = 40/191 (20%)
Query: 560 QFEAHAGRQYPGKDNDDL----CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
QFE + P +++ D C IC +GG LL CDGCP FH C L IP+G +C
Sbjct: 1095 QFEDAEDDEGPTRESVDEWEEDCYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCH 1154
Query: 616 YCQN----MFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK----NLEAELSGCL 667
C +F ++ A + R S + + ++ ++ K + E+E SG
Sbjct: 1155 ECDTTVKPVFPVNGAKKNGKAASKRPRRSNSPTSRRRPRKQAKLGKAKSDDSESEDSGAE 1214
Query: 668 LCRGCDFSKS---------------------GFGPRTILLCDQCEREFHVGCLKKHKMAD 706
+ S G G +L CD C R FHV C+
Sbjct: 1215 SDSAVSTTSSARPASRPKAPEDQWDVDCSVCGLGGE-LLCCDGCPRAFHVNCIG------ 1267
Query: 707 LRELPKGKWFC 717
L E+P +WFC
Sbjct: 1268 LAEIPDTEWFC 1278
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDL-CTICADGGNLLPCDGCPRAFHKECASLSSIPQGD 611
+S VS + A R +D D+ C++C GG LL CDGCPRAFH C L+ IP +
Sbjct: 1216 DSAVSTTSSARPASRPKAPEDQWDVDCSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTE 1275
Query: 612 WYCKYC 617
W+C C
Sbjct: 1276 WFCNEC 1281
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 19/30 (63%)
Query: 592 GCPRAFHKECASLSSIPQGDWYCKYCQNMF 621
GC R FH +CA L ++P DWYCK C+
Sbjct: 1310 GCDRVFHLKCAKLDAVPADDWYCKKCRTKL 1339
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 59/147 (40%), Gaps = 37/147 (25%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P G W C +C+N L +D
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEEPPGGKWSCPHCEND------LVND 390
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+AV + + + + C LCR +L CD C
Sbjct: 391 NDAVTSKEAAPAKA----------------GNMEFCRLCRDGG---------ELLCCDSC 425
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL + L +P+G W C
Sbjct: 426 PSSYHRYCL----IPPLTTIPEGDWHC 448
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N + C +C DGG LL CD CP ++H+ C L++IP+GDW+C C
Sbjct: 406 NMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 97/234 (41%), Gaps = 53/234 (22%)
Query: 506 TPKDQ-RLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCN--SEVSPSQFE 562
TPKD R+ VF SG + T A G+++ + ++G + +++S + F
Sbjct: 832 TPKDPVRMMHTVFFFSGPDEQTADCRRANGKQITK--EDGPTRLLERTRVIAQISRTDFV 889
Query: 563 AHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS---SIPQGDWYCKYC-- 617
A A DND+ C+ C + G++L CDGCPR+FH EC +L+ +P DWYC C
Sbjct: 890 ADA------TDNDEFCSACGNAGDVLCCDGCPRSFHFECVNLTQSEDLPD-DWYCSECIM 942
Query: 618 -------------------------QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRC 652
F + +Q+ V+AG G D E + K
Sbjct: 943 RRFPSRVPIHKGAFAPALNALEKSIPRAFSLPKHIQNRFEGVKAG--PGGDYEEIVGKTV 1000
Query: 653 IRIVKNLEA-------ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
R E + + +LC C KS R IL C C +H+ CL
Sbjct: 1001 KRRTGFDETPDLFKQRDENQPVLCHAC--QKSSNDTRAILPCSLCSYYWHLDCL 1052
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 6/98 (6%)
Query: 840 RVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKF 899
R+ G ++AE+P + T I +G L IE LS L V+ +V+PA E ++ WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 900 GFKKIDPELLSIYRKRCSQL--VTFKGTSMLQKRVPAC 935
GFK ++P R++ L + GT +L+KR+ A
Sbjct: 61 GFKPVEPS----KRQKIKSLNILIIHGTGLLEKRLLAT 94
>gi|189239425|ref|XP_001814901.1| PREDICTED: similar to Toutatis [Tribolium castaneum]
Length = 2075
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 66/162 (40%), Gaps = 54/162 (33%)
Query: 561 FEAHAGRQYPGKDNDDLCTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCK 615
F R+Y K ++ C C G N LL CDGC + +H C + +IP+GDWYC
Sbjct: 1776 FNKMRARKYNSKLSN--CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCH 1833
Query: 616 YCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFS 675
C N +R C++C G S
Sbjct: 1834 ECMNKATGER---------------------------------------NCIVC-GKKSS 1853
Query: 676 KSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
SG ++LC+ C R +H C+ H + + ++P+GKW+C
Sbjct: 1854 TSG---TRLILCELCPRAYHTDCI--HPI--MHKVPRGKWYC 1888
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 25/102 (24%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E G CH C ++ + G+ N C +C
Sbjct: 1807 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKAT-------------GERN---CIVCG 1849
Query: 583 -----DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G L+ C+ CPRA+H +C + +P+G WYC C
Sbjct: 1850 KKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSKC 1891
Score = 43.1 bits (100), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1790 NCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1837
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 564 HAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF 621
H+ P +DN+D C +C DGG L+ CDGCPRAFH C L +P G W C C
Sbjct: 254 HSQDPAPYQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAEL 313
Query: 622 ERKR 625
R R
Sbjct: 314 GRLR 317
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 63/126 (50%), Gaps = 16/126 (12%)
Query: 818 EFGGMYCAILTVNSSVVSAGI------------LRVFGQEVAELPLVATSKINHGKGYFQ 865
+F G Y L + V A LR+ G +VAE+PLVAT+ +G Q
Sbjct: 62 DFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYRRQGMCQ 121
Query: 866 LLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKKI-DPELLSIYRKRCSQLVTFKG 924
+L +EK+LS L V+ +VLPA E +W FGF ++ E L + R + F+G
Sbjct: 122 VLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLR---FPFLGFQG 178
Query: 925 TSMLQK 930
T+M QK
Sbjct: 179 TTMFQK 184
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
NDD C +C DGG L+ CDGCPRAFH C + L SIP G W C+ C+
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERCR 202
>gi|270010529|gb|EFA06977.1| hypothetical protein TcasGA2_TC009937 [Tribolium castaneum]
Length = 2221
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 59/145 (40%), Gaps = 52/145 (35%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC + +H C + +IP+GDWYC C N +R
Sbjct: 1937 CQFCHSGDNEDKLLLCDGCDKGYHTYCFKPKMENIPEGDWYCHECMNKATGER------- 1989
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C++C G S SG ++LC+ C R
Sbjct: 1990 --------------------------------NCIVC-GKKSSTSG---TRLILCELCPR 2013
Query: 693 EFHVGCLKKHKMADLRELPKGKWFC 717
+H C+ H + + ++P+GKW+C
Sbjct: 2014 AYHTDCI--HPI--MHKVPRGKWYC 2034
Score = 46.2 bits (108), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 45/102 (44%), Gaps = 25/102 (24%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E G CH C ++ + G+ N C +C
Sbjct: 1953 DGCDKGYHTYCFKPKMENIPEG-DWYCHECMNKAT-------------GERN---CIVCG 1995
Query: 583 -----DGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G L+ C+ CPRA+H +C + +P+G WYC C
Sbjct: 1996 KKSSTSGTRLILCELCPRAYHTDCIHPIMHKVPRGKWYCSKC 2037
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
++ C F SG +LLCD C++ +H C K + +P+G W+C CM+
Sbjct: 1932 IMKANCQFCHSGDNEDKLLLCDGCDKGYHTYCFK----PKMENIPEGDWYCHECMN 1983
>gi|320167629|gb|EFW44528.1| hypothetical protein CAOG_02553 [Capsaspora owczarzaki ATCC 30864]
Length = 1716
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 565 AGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASL--SSIPQGDWYCKYC 617
A R P N+D C +C+ G LL CDGCPR +H C L +S+PQGDW+C C
Sbjct: 417 ARRAGPTDLNNDSCEVCSQTGELLCCDGCPRVYHATCLKLDTASLPQGDWFCPTC 471
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 547 IICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASL 604
++ +++ S F + + P ++N+D C C GN+L CD CPR++H +C +
Sbjct: 1407 LLPAAADTDGSDGDFAGRSSVKKPQRENEDHCNACKARGNVLCCDYCPRSYHLKCLKPPM 1466
Query: 605 SSIPQGDWYCKYCQN 619
S P+GDW C C++
Sbjct: 1467 SKPPRGDWKCPICKS 1481
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
G+ NDD C +C DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 278 GQKNDDECAVCRDGGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRC 326
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC-QNMFERKRFLQ 628
K+N+D C C DGG L+ CDGCPRAFH C L +P G W C C +N+ E + L+
Sbjct: 244 KENEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSCVENVTEPGQLLE 303
Query: 629 HD 630
D
Sbjct: 304 AD 305
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 58/147 (39%), Gaps = 36/147 (24%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 413 DHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHCEG----------- 461
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + +T++ R + + C +C+ +L CD C
Sbjct: 462 ---------EGI-TAATVTEKAGRNAADDDEHSEFCRICKDGG---------ELLCCDSC 502
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P G W C
Sbjct: 503 TSAYHTFCLN----PPLSEIPDGDWKC 525
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 41/91 (45%), Gaps = 3/91 (3%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLL 588
+ C LE G HC ++ + AGR D + + C IC DGG LL
Sbjct: 438 HLVCFDPELEEAPEGRWSCPHCEGEGITAATVTEKAGRNAADDDEHSEFCRICKDGGELL 497
Query: 589 PCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
CD C A+H C + LS IP GDW C C
Sbjct: 498 CCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
Length = 209
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 573 DNDDLCTICADGGNLLPCD-GCPRAFHKECASLSSIPQGD-WYCKYCQNMFERKR 625
+ND++C IC DGG+LL CD GC R +H C +L+S+P+G+ W C YC E+ R
Sbjct: 53 ENDEICYICGDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLCPYCARQKEKAR 107
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 62/147 (42%), Gaps = 34/147 (23%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 273 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 327
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ G + + + C R+ K+ G LLC CD C
Sbjct: 328 PKEEDEEEEEGGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 364
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 365 PSSYHLHCLN----PPLPEIPNGEWLC 387
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS + P+ F A G + N C +C++GG+LL C+ CP AFH+EC ++
Sbjct: 1109 GTVILASNSMICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1167
Query: 606 SIPQGDWYCKYCQ 618
+P+G WYC C+
Sbjct: 1168 DMPEGSWYCNDCK 1180
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG L+ C GCP+ +H +C SL+ P G W C + Q
Sbjct: 1545 KEREDECFSCGDGGQLVSCKRPGCPKVYHADCLSLTRRPAGKWECPWHQ 1593
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 575 DDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+++C IC G LL C+ C AFH EC LS +P+G + C C
Sbjct: 974 ENVCQICEKPGELLLCEAQCCGAFHLECLGLSEMPKGKFICTEC 1017
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 48/123 (39%), Gaps = 50/123 (40%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
++ D +C++C DGG LL CD CP AFH C L + P+GDW C C+
Sbjct: 462 EEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR------------- 508
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF---SKSGFGPRTILLCD 688
C +C G D + GF +TI+ C+
Sbjct: 509 ----------------------------------CGVCGGSDLDDDTAEGFTDKTIIYCE 534
Query: 689 QCE 691
QCE
Sbjct: 535 QCE 537
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/44 (56%), Positives = 30/44 (68%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
DD C IC +GGNL+ CD CPR H C LS IP+GD+YC C+
Sbjct: 1759 DDYCLICFEGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDCE 1802
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 47/80 (58%), Gaps = 4/80 (5%)
Query: 556 VSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 613
V P + ++ + N+D C +C +GG LL CD CP+ FH C +L++ P G+WY
Sbjct: 678 VQPGEQQSEKTPEADEDPNEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTASPSGEWY 737
Query: 614 CKYCQNMFERKRFLQHDANA 633
C +C+++ + +Q++ NA
Sbjct: 738 CTFCRDLNSPE--MQYNVNA 755
>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 588
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 509 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 567
Query: 613 YCKYCQ 618
YC C+
Sbjct: 568 YCNDCK 573
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1498 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1556
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1557 YCNDCK 1562
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1928 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1976
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 625
+DN+D C +C DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 222 QDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 277
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 625
+DN+D C +C DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 211 QDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 266
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 60/147 (40%), Gaps = 33/147 (22%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 332 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 386
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + E R+ K+ G LLC CD C
Sbjct: 387 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 424
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 425 PSSYHLHCLN----PPLPEIPNGEWLC 447
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 60/147 (40%), Gaps = 33/147 (22%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 371
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + E R+ K+ G LLC CD C
Sbjct: 372 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 409
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 410 PSSYHLHCLN----PPLPEIPNGEWLC 432
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/56 (48%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKR 625
+DN+D C +C DGG L+ CDGCPRAFH C L +P G W C C R R
Sbjct: 235 QDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLR 290
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 34/147 (23%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 353 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCE-----KEGIQWE 407
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ + + + C R+ K+ G LLC CD C
Sbjct: 408 PKDDDEEDEDLCEEADDHMEFC-RVCKD-----GGELLC-----------------CDTC 444
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 445 PSSYHIHCLN----PPLPEIPNGEWLC 467
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 648 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 706
Query: 613 YCKYCQ 618
YC C+
Sbjct: 707 YCNDCK 712
>gi|400599137|gb|EJP66841.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
2860]
Length = 1226
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS---SIPQGDWYCKYCQNMFERKRF 626
P DND+ C+ C + G++L CDGCPR+FH EC +L+ +P DWYC C R
Sbjct: 761 PATDNDEFCSACGNAGDVLCCDGCPRSFHFECVNLAQSEDLPD-DWYCNECIVRRFPSRV 819
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDF 674
H + S ++++E+ R + K ++ G D+
Sbjct: 820 PIH-----KGAFASALNNLEKSIPRAFSLPKRIQNRFEGVKAGPDGDY 862
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 72/157 (45%), Gaps = 38/157 (24%)
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE------------RKR 625
C +C +GG ++ CDGC R FH C ++ +P+G YCK+C R++
Sbjct: 1080 CYVCREGGQVVSCDGCQRVFHLSCLNIRRMPRGKLYCKHCSEGDTKGAEEKSVGGDGRRQ 1139
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG-----CLLCRGCDFSKSGFG 680
L+ A+ GR V+ ++I R + LE+ G C +C+ +G
Sbjct: 1140 SLRLSAD----GRHDDVEENDEI--RMKSSNRELESGAVGPWDVECFICK-------LYG 1186
Query: 681 PRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
+L CD C + FH+ C+ ++ P+ +WFC
Sbjct: 1187 E--LLGCDGCPKAFHLACI------GIKSWPQEEWFC 1215
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 29/61 (47%), Gaps = 21/61 (34%)
Query: 578 CTICADGGNLLPCDGCPRAFH-----------------KECASL----SSIPQGDWYCKY 616
C +C DGG LL CD CPRAFH ASL IP+ +WYCK+
Sbjct: 136 CNLCKDGGELLCCDRCPRAFHLKWYVGCFPSAVVAHQASRYASLGLQKEEIPESEWYCKF 195
Query: 617 C 617
C
Sbjct: 196 C 196
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 66/163 (40%), Gaps = 34/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHCE-----KEGIQWE 398
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ G + E R+ K+ G LLC CD C
Sbjct: 399 PKDDDEEDEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 436
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 475
>gi|210075923|ref|XP_503882.2| YALI0E12991p [Yarrowia lipolytica]
gi|199426911|emb|CAG79475.2| YALI0E12991p [Yarrowia lipolytica CLIB122]
Length = 584
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 78/200 (39%), Gaps = 44/200 (22%)
Query: 557 SPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASL----SSIPQGDW 612
+P + A P KDNDD C C G L C+ CP++FH C+ S+P G W
Sbjct: 189 TPPRVAAPPSEDDPTKDNDDFCDACKGLGRFLCCEACPKSFHFACSDPPYDDESLPDGQW 248
Query: 613 YCKYCQ--------------------------NMFERKRFLQHDANAVEAGRVSGV-DSV 645
+CK C+ F + ++ V G D
Sbjct: 249 FCKECKARRFPPEQAPRGIFSQLLNRISRTNPKEFRLPKSIREYFEGVSTGPFGEYSDDH 308
Query: 646 EQITK--------RCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVG 697
E+ K R + IV+NL+ L C C K+G R I CD C +H+
Sbjct: 309 EKTEKDITLALDDREVGIVRNLDKN-GKPLECFKC--GKNGLNGRGITSCDYCPTAWHLD 365
Query: 698 CLKKHKMADLRELPKGKWFC 717
C+ +A+++ L + KW C
Sbjct: 366 CIDP-PLANVKMLGR-KWKC 383
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1329 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1387
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1388 YCNDCK 1393
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1415 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1473
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1474 YCNDCK 1479
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1845 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1893
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1687 NSIICPNHFAPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1745
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1746 YCNDCK 1751
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2117 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2165
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFAPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 33/52 (63%), Gaps = 4/52 (7%)
Query: 568 QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
QY + NDD C++C DGG L+ CDGCPR+FH C L+ IP G W C C
Sbjct: 24 QY--QSNDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDAC 73
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 71/171 (41%), Gaps = 31/171 (18%)
Query: 565 AGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
AG Y D+ D C +C GG ++ CD CPRA+H C L PQG W C +C+
Sbjct: 392 AGEGY-ETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELERAPQGKWSCPHCEK--- 447
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPR 682
V+ + + KR R ++ E CR C K G
Sbjct: 448 ---------EGVQWEAKELEEEEMEEPKRERREEEDDHME-----FCRVC---KDG---G 487
Query: 683 TILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+L CD C +H+ CL L ++P G+W C C ++ +Q +L
Sbjct: 488 ELLCCDACVSSYHIHCLN----PPLPDIPHGEWLCPRCTCPQLKGKVQKIL 534
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1685 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1743
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1744 YCNDCK 1749
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2115 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1380 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1438
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1439 YCNDCK 1444
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1810 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1858
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1419 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1477
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1478 YCNDCK 1483
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1849 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1897
>gi|442623365|ref|NP_001260899.1| toutatis, isoform G [Drosophila melanogaster]
gi|440214304|gb|AGB93432.1| toutatis, isoform G [Drosophila melanogaster]
Length = 3094
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 54/145 (37%), Gaps = 52/145 (35%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2604 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------- 2656
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C++C G S G ++ CD C R
Sbjct: 2657 --------------------------------KCIVCGGHRPSPVG----KMIYCDLCPR 2680
Query: 693 EFHVGCLKKHKMADLRELPKGKWFC 717
+H C + L ++P+GKW+C
Sbjct: 2681 AYHADCY----IPPLLKVPRGKWYC 2701
Score = 44.7 bits (104), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 25/102 (24%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + ++ +G C N N+ C +C
Sbjct: 2620 DGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKAT-----------------NERKCIVCG 2662
Query: 583 DG-----GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2663 GHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2704
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 670 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2602 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2645
>gi|414887990|tpg|DAA64004.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1679
Score = 62.8 bits (151), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 13/122 (10%)
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICAD 583
DG + G+ + G + K G G V SQF + G N D C +C
Sbjct: 378 DGNDEGFRSTG---VRRKKRGRGSSAKVA---VGSSQFPEGSAEVDDG--NSDECYLCGM 429
Query: 584 GGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 641
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 430 DGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILG 486
Query: 642 VD 643
D
Sbjct: 487 TD 488
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1417 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1475
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1476 YCNDCK 1481
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1847 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1684 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1742
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1743 YCNDCK 1748
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2114 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1684 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1742
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1743 YCNDCK 1748
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2114 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1685 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1743
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1744 YCNDCK 1749
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2115 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1684 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1742
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1743 YCNDCK 1748
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2114 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2162
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1417 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1475
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1476 YCNDCK 1481
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1847 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1895
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1377 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1435
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1436 YCNDCK 1441
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1796 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1855
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1416 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1474
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1475 YCNDCK 1480
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1846 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1316 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1374
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1375 YCNDCK 1380
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1735 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1794
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1329 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1387
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1388 YCNDCK 1393
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1748 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1796
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
N+D C +C +GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 920 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 968
>gi|442623363|ref|NP_001260898.1| toutatis, isoform F [Drosophila melanogaster]
gi|440214303|gb|AGB93431.1| toutatis, isoform F [Drosophila melanogaster]
Length = 3058
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 54/145 (37%), Gaps = 52/145 (35%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 2568 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------- 2620
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C++C G S G ++ CD C R
Sbjct: 2621 --------------------------------KCIVCGGHRPSPVG----KMIYCDLCPR 2644
Query: 693 EFHVGCLKKHKMADLRELPKGKWFC 717
+H C + L ++P+GKW+C
Sbjct: 2645 AYHADCY----IPPLLKVPRGKWYC 2665
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 25/102 (24%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + ++ +G C N N+ C +C
Sbjct: 2584 DGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKAT-----------------NERKCIVCG 2626
Query: 583 DG-----GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 2627 GHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 2668
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 4/48 (8%)
Query: 670 RGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
+ C F SG +LLCD C++ +H C K + +P G W+C
Sbjct: 2566 QNCQFCTSGENEDKLLLCDGCDKGYHTYCFK----PKMDNIPDGDWYC 2609
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1582 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1640
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1641 YCNDCK 1646
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2012 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1401 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1459
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1460 YCNDCK 1465
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1831 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1879
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1416 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1474
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1475 YCNDCK 1480
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1846 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1894
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1134 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1192
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1193 YCNDCK 1198
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 63/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 732 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 780
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + + E E CR C K G +L CD C
Sbjct: 781 ---------EGIQWEPKDDEDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 825
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 826 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 864
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 479 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 527
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 528 ---------EGIQWEPKEDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 572
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 573 PSSYHLHCLN----PPLAEIPNGEWLCPRCTCPPLKGKVQRIL 611
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1583 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1641
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1642 YCNDCK 1647
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2013 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1685 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1743
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1744 YCNDCK 1749
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2115 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2163
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1420 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1478
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1479 YCNDCK 1484
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1850 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1898
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2116 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1582 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1640
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1641 YCNDCK 1646
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2012 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2060
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
N+D C +C +GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 815 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 863
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 78/175 (44%), Gaps = 43/175 (24%)
Query: 556 VSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYC- 614
V+ ++ E + + P +++D C +C + L+ C CPR+FH C + P+ DW C
Sbjct: 975 VTSAEDEEESTGEVP--ESNDFCEVCLNDEQLISCGSCPRSFHLICIQMKRAPRRDWRCL 1032
Query: 615 -------KYCQNMFERKRFLQHDANAVEAGRVSGVD-SVEQITKRCIRIVKNLEAELSGC 666
KY Q + + K+ ++ + A EA + D S+ Q C++ G
Sbjct: 1033 ACTAGVKKYKQELKDLKKIIE-EKEAFEAKDSNEEDFSINQ----CLKC---------GE 1078
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDC 721
LL RG + C C R++H+ C ADL + PKG W+C C
Sbjct: 1079 LLSRGH------------IECIGCGRKYHLAC------ADLTKRPKGDWYCKKRC 1115
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 10/52 (19%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
C +CRG P ++ C+ C+ FH+ C+K LRE+P+G+WFC
Sbjct: 916 CKVCRG------KATPDRMIRCETCDLVFHLPCIK----PALREIPRGEWFC 957
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2107 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1689 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1747
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1748 YCNDCK 1753
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2119 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2167
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1583 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1641
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1642 YCNDCK 1647
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2002 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 2/49 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
N+D C +C +GG LL CD CP+ FH C +L S P GDW C +C+N+
Sbjct: 933 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNL 981
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1583 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1641
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1642 YCNDCK 1647
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2002 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2061
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1686 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1744
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1745 YCNDCK 1750
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2105 HGKRRSQGEVTKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2164
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 492
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 493 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 537
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 538 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 576
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 375
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 376 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 420
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 421 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 459
>gi|414887991|tpg|DAA64005.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
gi|414887992|tpg|DAA64006.1| TPA: hypothetical protein ZEAMMB73_302261 [Zea mays]
Length = 1712
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 56/122 (45%), Gaps = 13/122 (10%)
Query: 524 DGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICAD 583
DG + G+ + G + K G G V SQF + G N D C +C
Sbjct: 378 DGNDEGFRSTG---VRRKKRGRGSSAKVA---VGSSQFPEGSAEVDDG--NSDECYLCGM 429
Query: 584 GGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSG 641
GNLL CDGCP AFH +C + +P+GDWYC C RK ++ AN + + G
Sbjct: 430 DGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWYCPEC---LIRKDGSRNIANPMRGAEILG 486
Query: 642 VD 643
D
Sbjct: 487 TD 488
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 389
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 434
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 473
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1375 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1433
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1434 YCNDCK 1439
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1794 HGKRRSQGEVTKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1853
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1580 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1638
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1639 YCNDCK 1644
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPG---KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
H R+ G K+ +D C C DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1999 HGKRRSQGEVTKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2058
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1379 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1437
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1438 YCNDCK 1443
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1809 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 1857
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 373
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 374 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 418
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 419 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 457
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 389
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 434
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1689 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1747
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1748 YCNDCK 1753
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2119 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2167
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 67 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 115
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 116 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 160
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 161 PSSYHLHCLN----PPLPEIPNGEWLC 183
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 143 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 186
>gi|324499809|gb|ADY39928.1| Chromodomain-helicase-DNA-binding protein 3 [Ascaris suum]
Length = 1844
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 41/147 (27%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+A+H C + P+G W C C
Sbjct: 254 DHQDYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEAPEGHWSCPSC------------- 300
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
EA + D E+ ++ N+E C +C+ + +L CD C
Sbjct: 301 ----EAAGIPQKDEEEE-----KKVATNMEY----CRVCKDVGW---------LLCCDTC 338
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H C+ L E+P+G+W C
Sbjct: 339 PSSYHAYCMN----PPLTEVPEGEWSC 361
Score = 47.8 bits (112), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N + C +C D G LL CD CP ++H C + L+ +P+G+W C C
Sbjct: 319 NMEYCRVCKDVGWLLCCDTCPSSYHAYCMNPPLTEVPEGEWSCPRC 364
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 389
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 434
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 460 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 518
Query: 613 YCKYCQ 618
YC C+
Sbjct: 519 YCNDCK 524
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 890 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 938
>gi|344299817|gb|EGW30170.1| hypothetical protein SPAPADRAFT_144654 [Spathaspora passalidarum
NRRL Y-27907]
Length = 604
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 64/167 (38%), Gaps = 41/167 (24%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC------------ 617
KDNDD C+ C + G + C+ CP++FH C + P+ DW+C+ C
Sbjct: 192 KDNDDFCSSCGNPGIFICCENCPKSFHFTCCDPPIEQPPEDDWFCRECIAKLHPEKIKTY 251
Query: 618 -----------------QNMFERKRFLQHDA--------NAVEAGRVSGVDSVEQITKRC 652
+F+ + L+ D N + + D +
Sbjct: 252 NDIGVFGQLLNQLEVKNPKVFQLPKHLREDTFIGVTTGDNGDYSDDTTKPDVPSSKNSQD 311
Query: 653 IRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ I +L + LC C +SG RT++ CD C +H+ CL
Sbjct: 312 VDIESSLYDKAGNPYLCHKC--GESGMNNRTLMHCDYCPLIYHIDCL 356
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 339 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 387
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 388 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 432
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 471
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 389
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 434
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|358059464|dbj|GAA94870.1| hypothetical protein E5Q_01524 [Mixia osmundae IAM 14324]
Length = 1418
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 80/198 (40%), Gaps = 55/198 (27%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQNMFERKRFLQH 629
N+D C C G+ L C+ CPR+FH C LS +P+ WYC C +F R++ +
Sbjct: 712 NNDFCDACKGKGHFLCCEACPRSFHFSCLDPPLELSDLPENSWYC--CTCLFSRRKTVV- 768
Query: 630 DANAVEAG-------RVSGVDSVEQITKRCIR----------------IVKNLEAELSG- 665
A EAG +V+ + E IR +V++ A+ +G
Sbjct: 769 -AIPSEAGPFTTLMTKVAKSNVTEFSLPASIRTFCKDVASRENGDFMDLVEHKPAKKTGV 827
Query: 666 -----------------CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLR 708
C C+G S S + IL CD C++ +H+ CL MA +
Sbjct: 828 EERDPYKLKSKSGASVLCFRCKGAASSPS----KPILSCDFCDQHWHLDCLDP-PMASM- 881
Query: 709 ELPKGKWFCCMDCSRINS 726
P +W C + + +
Sbjct: 882 PAPTKRWMCPTHPAHVQA 899
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 812 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 860
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 861 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 905
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 906 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 944
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 391
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 436
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 475
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1688 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1746
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1747 YCNDCK 1752
Score = 47.8 bits (112), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2118 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2166
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 400
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 401 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 445
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 446 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 484
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 58 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 116
Query: 613 YCKYCQ 618
YC C+
Sbjct: 117 YCNDCK 122
Score = 47.0 bits (110), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 488 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 536
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 477
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 522
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 561
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 1677 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 1735
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1736 YCNDCK 1741
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2107 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 2155
>gi|357510881|ref|XP_003625729.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
gi|355500744|gb|AES81947.1| hypothetical protein MTR_7g102630 [Medicago truncatula]
Length = 290
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 50/75 (66%), Gaps = 5/75 (6%)
Query: 184 GFGNESMSLIEVEAIAEGS----ALTSPKKNLELKMSKKISLNKK-PMTVTELFETGLLD 238
G G E+++ ++ E A S AL + +ELK SKKI+++KK P T+ ELF TGLLD
Sbjct: 203 GSGEETVTKLDQEGAAVESEIDGALAVRRNKMELKTSKKIAVDKKRPTTMKELFRTGLLD 262
Query: 239 GVSVVYMGGIKFQAS 253
GVSVVY+ GIK + S
Sbjct: 263 GVSVVYVSGIKKEES 277
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 315 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 363
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 364 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 408
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 409 PSSYHLHCLN----PPLPEIPNGEWLC 431
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 391 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 434
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 33/47 (70%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNM 620
NDD CT+C G+LL CDGCP +FH++C ++ +P+G W C C+ +
Sbjct: 1052 NDDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPECKTV 1098
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 426
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 427 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 471
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/90 (38%), Positives = 44/90 (48%), Gaps = 12/90 (13%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLP 589
+Y+CGQ E K + + + P AH NDD C C DGG L+
Sbjct: 217 FYSCGQS--EETKRASKAVESIFHHKGEPLTDGAHV--------NDDECAACKDGGELIC 266
Query: 590 CDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
CDGCP+AFH C L+SIP G W C +C
Sbjct: 267 CDGCPQAFHLTCLDPPLTSIPSGPWQCDWC 296
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 54/89 (60%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 944
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + V G+ V+Q ++C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 63/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 416
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + + E E CR C K G +L CD C
Sbjct: 417 ---------EGIQWEPKDDEDDEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 461
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 462 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 500
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P + N+D C +C DGG LL CDGCPRAFH C S L IP G W C C
Sbjct: 289 PHQKNEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 338
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 71/148 (47%), Gaps = 31/148 (20%)
Query: 784 AVAIFHDCFDPIVDSISGRDLIPSMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVFG 843
AV I H+CF I++ + D+ +V+ R + GQ+F
Sbjct: 774 AVDILHECFVTIIEPRTQSDISEDIVFNREICGQKF------------------------ 809
Query: 844 QEVAELPLVATSKINHGKGYFQLLFACIEKLLSFLRVKSIVLPAAEEAESIWTDKFGFKK 903
AELPL+ T + +G +LL +EKLL L V+ ++LPA E WT FGF
Sbjct: 810 ---AELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFGFTV 866
Query: 904 I-DPELLSIYRKRCSQLVTFKGTSMLQK 930
+ + E L + + +++F+GT+M QK
Sbjct: 867 MSNSERLEL---AGNSILSFQGTTMCQK 891
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 591 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 639
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 640 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 684
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 685 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 723
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 326 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 374
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 375 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 419
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 420 PSSYHLHCLN----PPLPEIPNGEWLC 442
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 402 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 445
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 382
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 383 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 427
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 428 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 466
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 54/89 (60%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 944
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + V G+ V+Q ++C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 426
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 427 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 471
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 510
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL CD CP AFH+EC ++ IP+G+W
Sbjct: 263 NSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNW 321
Query: 613 YCKYCQ 618
YC C+
Sbjct: 322 YCNDCK 327
Score = 46.6 bits (109), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C D G L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 695 KEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPWHQ 743
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 391
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 436
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 411 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 459
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 460 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 504
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 505 PSSYHLHCLN----PPLPEIPNGEWLC 527
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 487 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 530
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 66/163 (40%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 413
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E + D E+ + ++ E CR C K G +L CD C
Sbjct: 414 ----EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC---KDG---GELLCCDAC 458
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 459 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 497
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 477
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 522
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 62/151 (41%), Gaps = 27/151 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRAFH C L P+G W C +C+
Sbjct: 363 DHQDYCEVCQQGGEIMLCDTCPRAFHLVCLDPELEEAPEGSWSCPHCEK----------- 411
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKSGFGPRTILL 686
V A R + + +++ I K N E + C C K G ++
Sbjct: 412 EGVVAASRSTTPATGGDMSQNPQNIRKSAQPNEEEKDEHQEFCNEC---KDG---GDLIC 465
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
C +C +H CL L E+P+G W C
Sbjct: 466 CAKCPVSYHPECL----YPPLSEIPEGPWLC 492
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
++ + C C DGG+L+ C CP ++H EC LS IP+G W C C
Sbjct: 449 EHQEFCNECKDGGDLICCAKCPVSYHPECLYPPLSEIPEGPWLCPRC 495
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 567 RQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFER 623
R P +D N+D C +C +GG LL CD CP+ FH C +L+ P G+W+C +C+++
Sbjct: 701 RAEPEEDPNEDWCAVCQNGGELLCCDKCPKVFHLACHIPTLNESPSGEWFCSFCRDLVSP 760
Query: 624 KRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
+ +++D ++ +A V++ ++C R++ L
Sbjct: 761 E--MEYDCDSKDAPISEKFPPVDR--RKCERLLLRL 792
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 51 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 99
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 100 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 144
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 145 PSSYHLHCLN----PPLPEIPNGEWLC 167
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 127 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 170
>gi|170592228|ref|XP_001900871.1| CHD4 protein [Brugia malayi]
gi|158591738|gb|EDP30342.1| CHD4 protein, putative [Brugia malayi]
Length = 1846
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 42/147 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+A+H C + P+G W C C++
Sbjct: 254 DHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTCES----------- 302
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+G ++ K +I N+E CR C + G+ +L CD C
Sbjct: 303 ---------TGATKDDEEEK---KITTNME-------YCRTC--KEGGW----LLCCDTC 337
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P+G W C
Sbjct: 338 PSSYHAYCLN----PSLTEIPEGDWSC 360
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N + C C +GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 318 NMEYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 363
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS + P+ F A G + N C +C++GG+LL C+ CP AFH+EC ++
Sbjct: 1381 GSVVLASNSIICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1439
Query: 606 SIPQGDWYCKYCQ 618
+P+G WYC C+
Sbjct: 1440 EMPEGSWYCNDCK 1452
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1818 KEREDECFSCGDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1866
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 85/223 (38%), Gaps = 50/223 (22%)
Query: 405 NKRERKKTRKLLEADLVSKSSSKSVSLRNLLKTRSPWELTRNSSRPGLIANSTPVTSVH- 463
+KR+RK T+KLLE++ + + + + W + + P +S + H
Sbjct: 1107 SKRQRKPTKKLLESNDLDTA---------FMPKKEEWTPPKKGTGPSESDSSELYSPAHF 1157
Query: 464 --------KSSQSQRQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKL 515
K + QR+RK + S + K N R TP+ + L
Sbjct: 1158 PDLGEASEKLLEKQRKRKRPRHSSAAIHSKKEK----------NGERLGETPRSEG-EAL 1206
Query: 516 VFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND 575
V S P G G+ EG +N G+ PS + R +
Sbjct: 1207 VHGTSTSPKG---GHE-------EGSENDHGV----------PSSKKIQGERGGGAALKE 1246
Query: 576 DLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
++C IC G LL C+ C AFH +C LS +P G + C C
Sbjct: 1247 NVCQICEKPGELLLCEAQCCGAFHLQCLGLSEMPTGKFICNEC 1289
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 62/163 (38%), Gaps = 33/163 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 321 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 369
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 370 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 414
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 415 PSSYHLHCLN----PPLPEIPNGEWLCPRCTCPPLKGKVQRIL 453
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS + P+ F A G + N C +C++GG+LL C+ CP AFH+EC ++
Sbjct: 1393 GSVVLASNSIICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1451
Query: 606 SIPQGDWYCKYCQ 618
+P+G WYC C+
Sbjct: 1452 EMPEGSWYCNDCK 1464
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1830 KEREDECFSCGDGGQLVSCKKAGCPKVYHADCLNLTKRPAGKWECPWHQ 1878
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 56/148 (37%), Gaps = 46/148 (31%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQH 629
+++ D C +C GG ++ CD CPRA+H C L P+G W C YCQ
Sbjct: 372 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQ----------- 420
Query: 630 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 689
E + D Q RI K+ G LLC CD
Sbjct: 421 ----AEGNQEQEDDDEHQ---EFCRICKD-----GGELLC-----------------CDS 451
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H CL L E+P G+W C
Sbjct: 452 CPSAYHRFCLN----PPLEEVPDGEWKC 475
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
++ + C IC DGG LL CD CP A+H+ C + L +P G+W C C
Sbjct: 432 EHQEFCRICKDGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRC 478
>gi|291230097|ref|XP_002735005.1| PREDICTED: tripartite motif-containing 28 protein-like
[Saccoglossus kowalevskii]
Length = 995
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 59/113 (52%), Gaps = 8/113 (7%)
Query: 553 NSEVSP----SQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSS 606
N E+SP S E ++ + P + N+D C C +GG+LL CD CP+ FH +C SL++
Sbjct: 767 NYELSPAPGCSSSEVYSRQGIPVEFNEDYCACCQNGGDLLCCDTCPKVFHLQCHIPSLTA 826
Query: 607 IPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
P+ W C CQ++ + + + + + SG+ Q K C RI+ L
Sbjct: 827 TPKETWICGLCQDLCKEIQGISENDEHGKRKASSGLSEAHQ--KICERILLEL 877
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 58/103 (56%), Gaps = 13/103 (12%)
Query: 563 AHAGRQYPGKDND---DLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
A G + KD+D D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C
Sbjct: 732 ARTGGEGTNKDDDPNEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFC 791
Query: 618 QNMFERKRFLQHDANAVEAGR----VSGVDSVEQITKRCIRIV 656
+++ K +++D + ++ + G+ V+Q ++C R++
Sbjct: 792 RDL--SKPEVEYDCDNLQHSKKGKTAQGLSPVDQ--RKCERLL 830
>gi|302915931|ref|XP_003051776.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
gi|256732715|gb|EEU46063.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
Length = 1194
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 67/171 (39%), Gaps = 55/171 (32%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD----WYCKYCQNMFER--KRF 626
DND+ C+ C G+++ CDGCPR+FH EC + +P D WYC C +F+R R
Sbjct: 754 DNDEDCSACGAAGDVVCCDGCPRSFHFECVGM--VPSEDLPDEWYCNEC--LFKRYPSRV 809
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGC-------------------- 666
H A ++++E+ R + K L+ G
Sbjct: 810 PVHKGVFGPA-----LNNLEKSIPRAFSLPKKLQTRFEGVKAGPDGEYEEVTTAKTTKRK 864
Query: 667 ------------------LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+LC GC K+ R I+ C C R +H+ CL
Sbjct: 865 NGYEEVPDFFRQRDDGQPVLCHGC--QKAATDVRAIIPCSVCPRYWHIDCL 913
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 556 VSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWY 613
V P + ++ + N+D C +C +GG L+ CD CP+ FH C SL++ P G+WY
Sbjct: 678 VQPGEQQSEKTPEADEDPNEDWCAVCQNGGELICCDKCPKVFHLSCHVPSLTASPSGEWY 737
Query: 614 CKYCQNMFERKRFLQHDANA 633
C C+++ + +Q++ NA
Sbjct: 738 CTLCRDLNSPE--MQYNVNA 755
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 60/147 (40%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 365
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E + D E+ + ++ E CR C K G +L CD C
Sbjct: 366 ----EGIQWEPKDEEEEEEEGGCEEEEDDHME-----FCRVC---KDG---GELLCCDAC 410
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 411 PSSYHLHCLN----PPLPEIPNGEWLC 433
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 28/47 (59%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 390 DHMEFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 436
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 392 NEDWCAVCQNGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 451
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 452 LQHSKKGKTAQGLSPVDQ-----RKCERLL 476
>gi|348690302|gb|EGZ30116.1| hypothetical protein PHYSODRAFT_474458 [Phytophthora sojae]
Length = 239
Score = 61.6 bits (148), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 566 GRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 619
R P D + + C +C GG LL CDGC RA+H C SL +P+GDW+C YC++
Sbjct: 181 ARMRPAADQHREYCEMCLKGGKLLCCDGCERAYHLNCVRPSLLDVPEGDWFCPYCRD 237
Score = 39.7 bits (91), Expect = 8.1, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 25/41 (60%), Gaps = 6/41 (14%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMDCS 722
+L CD CER +H+ C++ L ++P+G WFC C D S
Sbjct: 203 LLCCDGCERAYHLNCVR----PSLLDVPEGDWFCPYCRDAS 239
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 30/147 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 391
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ G + E+ + CR C K G +L CD C
Sbjct: 392 PKEDDEDEEDGAEEEEEEEDDHME-------------FCRVC---KDG---GELLCCDTC 432
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLC 455
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 32/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEK----------- 389
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ + E E CR C K G +L CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCRVC---KDG---GELLCCDAC 434
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 64/154 (41%), Gaps = 44/154 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C C GG LL CD CPRA+H C +S+ P+GDW C +C
Sbjct: 249 ENNDFCEECKQGGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHC------------- 295
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+E G +I K + V + C +C+ + +LLCD C
Sbjct: 296 ---IEHG--------PEIVKEEPQKVND-----DFCKICKETE---------NLLLCDTC 330
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
FH C+ L ++P+ + + C C +
Sbjct: 331 VCAFHAYCMD----PPLTQVPQEETWNCPRCELV 360
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGD-WYCKYCQ 618
++ P K NDD C IC + NLL CD C AFH C L+ +PQ + W C C+
Sbjct: 304 KEEPQKVNDDFCKICKETENLLLCDTCVCAFHAYCMDPPLTQVPQEETWNCPRCE 358
>gi|449547717|gb|EMD38685.1| hypothetical protein CERSUDRAFT_113863 [Ceriporiopsis subvermispora
B]
Length = 906
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 83/218 (38%), Gaps = 65/218 (29%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKEC----ASLSSIPQGD--WYCKYCQN---- 619
PG N+D C+ C G+L+ CDGCPRAFH C + S +P+GD WYC C N
Sbjct: 210 PGLPNEDHCSSCRSLGSLVYCDGCPRAFHLWCLDPPMAASDLPEGDERWYCPACTNQQKP 269
Query: 620 ---MFERKRF----LQHDANAVEA------------------GRVSGVDSVEQITKRCIR 654
+ + +F L+H A + A R + VD+ E R R
Sbjct: 270 PPKISAKLKFIAPLLEHLATIIPAEYSLPNEIKTHFKDVATGPRGAYVDTSEIKAPRLNR 329
Query: 655 IVK-------NLEAELSGCLLCRGC---------------------DFSKSGFGPRTILL 686
+ + L+ +LC C D + PR I+
Sbjct: 330 LGQVEDRDPYRLKDRNGDPVLCFQCGTSALPPAVAATSPAAKRTKRDHNTFHDNPRAIIT 389
Query: 687 CDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
CD C +H+ CL +A + K KW C ++
Sbjct: 390 CDYCHLHWHLDCLDP-PLACMPPWSK-KWMCPNHADQV 425
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 54/90 (60%), Gaps = 7/90 (7%)
Query: 570 PGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRF 626
P +D N+D C +C +GG LL CD CP+ FH C +L P G+W+C +C+++ +
Sbjct: 700 PEEDPNEDWCAVCQNGGELLCCDRCPKVFHLSCHIPALHEPPSGEWFCSFCRDLVSPE-- 757
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
++++ N+ ++ G +++ ++C R++
Sbjct: 758 MEYNCNSNDSPVSDGFPPIDR--RKCERLL 785
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL CD CP+ FH C +L++ P G+W C +C+++
Sbjct: 667 HVGETRKEDDPNEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL 726
Query: 621 FERKRFLQHDANA----VEAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + + KR C R++
Sbjct: 727 --SKPEVEYDCDARSHSSEKRKADGLVKLTPVDKRKCERLL 765
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 62/147 (42%), Gaps = 23/147 (15%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCE-----KEGIQWE 427
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A R + E+ E + CR C K G +L CD C
Sbjct: 428 A------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC---KDG---GELLCCDTC 475
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 476 PSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|393911013|gb|EJD76123.1| LET-418 protein [Loa loa]
Length = 1755
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 42/147 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+A+H C + P+G W C C++
Sbjct: 168 DHQDYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES----------- 216
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+G ++ K ++ N+E CR C + G+ +L CD C
Sbjct: 217 ---------TGAPKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTC 251
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P+G W C
Sbjct: 252 PSSYHAYCLN----PSLTEIPEGDWSC 274
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 6/71 (8%)
Query: 549 CHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSS 606
C C S +P + E + N + C C +GG LL CD CP ++H C SL+
Sbjct: 211 CPTCESTGAPKEDE----EEKKVTTNMEYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTE 266
Query: 607 IPQGDWYCKYC 617
IP+GDW C C
Sbjct: 267 IPEGDWSCPRC 277
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P + N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 303 PHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSC 352
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 62/163 (38%), Gaps = 30/163 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 436 DHQDYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHC------------- 482
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E V E+ E E CR C K G +L CD C
Sbjct: 483 ----EKEGVQWEPKEEEDEYEGEMDDAEKEEEDDHMEYCRVC---KDG---GELLCCDAC 532
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L E+P G+W C C + +Q +L
Sbjct: 533 ISSYHIHCLN----PPLPEIPNGEWLCPRCTCPMLKGRVQKIL 571
>gi|332022570|gb|EGI62872.1| Bromodomain adjacent to zinc finger domain protein 2B [Acromyrmex
echinatior]
Length = 2202
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 56/146 (38%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1919 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 1971
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C ++G + ++LC+ C R
Sbjct: 1972 --------------------------------NCLVC----GKRAG---KNLVLCELCPR 1992
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 1993 AYHTDC---HNPV-MPKMPRGKWYCS 2014
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 1935 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 1977
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 1978 KRAGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2024
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 659 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 716
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1904 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 1959
Query: 717 C--CMD 720
C CM+
Sbjct: 1960 CHECMN 1965
>gi|312077956|ref|XP_003141528.1| CHromoDomain protein family member [Loa loa]
Length = 1696
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 42/147 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+A+H C + P+G W C C++
Sbjct: 109 DHQDYCEVCQQGGEIILCDTCPKAYHMVCLDPDMEEPPEGRWSCPTCES----------- 157
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+G ++ K ++ N+E CR C + G+ +L CD C
Sbjct: 158 ---------TGAPKEDEEEK---KVTTNME-------YCRTC--KEGGW----LLCCDTC 192
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P+G W C
Sbjct: 193 PSSYHAYCLN----PSLTEIPEGDWSC 215
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 6/71 (8%)
Query: 549 CHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSS 606
C C S +P + E + N + C C +GG LL CD CP ++H C SL+
Sbjct: 152 CPTCESTGAPKEDE----EEKKVTTNMEYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTE 207
Query: 607 IPQGDWYCKYC 617
IP+GDW C C
Sbjct: 208 IPEGDWSCPRC 218
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 627
N+D C C DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 250 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 309
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 310 LQHSKKGKTAQGLSPVDQ-----RKCERLL 334
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 627
N+D C C DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 385 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 440
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 31/47 (65%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
G DD+C+IC GG+++ CD CP +H +C L SIP G+W+C C
Sbjct: 173 GWAPDDVCSICGLGGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPAC 219
>gi|307169034|gb|EFN61879.1| Bromodomain adjacent to zinc finger domain protein 2B [Camponotus
floridanus]
Length = 2352
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 56/146 (38%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2069 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 2121
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C ++G + ++LC+ C R
Sbjct: 2122 --------------------------------NCLVC----GKRAG---KNLVLCELCPR 2142
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 2143 AYHTDC---HNPV-MPKMPRGKWYCS 2164
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 2085 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 2127
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 2128 KRAGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2174
Score = 44.3 bits (103), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 647 QITKRCIRI-VKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMA 705
++ RC+ + N +L C F SG +LLCD C+R +H C +
Sbjct: 2043 KLRNRCVSLKATNQYKQLLTTSQTSNCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----P 2098
Query: 706 DLRELPKGKWFC--CMD 720
+ +P G W+C CM+
Sbjct: 2099 KMENIPDGDWYCHECMN 2115
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 62/147 (42%), Gaps = 23/147 (15%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C + P+G W C +C+ K +Q +
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHCE-----KEGIQWE 427
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A R + E+ E + CR C K G +L CD C
Sbjct: 428 A------REESSEGEEENDDGRRDDGDVEEEDDHHMEFCRVC---KDG---GELLCCDTC 475
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 476 PSSYHLHCLN----PPLPDIPNGEWIC 498
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRC 501
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N+D C C DGG LL CDGCPRAFH C LS IP G W C C
Sbjct: 432 NEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNC 477
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 739 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 798
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 799 LQHSKKGKTAQGLSPVDQ-----RKCERLL 823
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 441 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 500
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 501 LQHSKKGKTAQGLSPVDQ-----RKCERLL 525
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 970 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 1029
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 1030 LQHSKKGKTAQGLSPVDQ-----RKCERLL 1054
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 2/62 (3%)
Query: 558 PSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCK 615
P + + + P ++D C +C G LL CDGCPR +H +C + L+ +P+GDW+C
Sbjct: 701 PRRGRTTSAQDSPTTSHEDYCIVCQQSGELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCP 760
Query: 616 YC 617
C
Sbjct: 761 AC 762
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 969
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 970 LQHSKKGKTAQGLSPVDQ-----RKCERLL 994
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 890 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 949
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 950 LQHSKKGKTAQGLSPVDQ-----RKCERLL 974
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 18/97 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+N E
Sbjct: 919 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCE 978
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
R+ H NA G+D +Q K+C ++V +L
Sbjct: 979 NTRYA-HSYNA-----QYGLDDYDQ--KKCEKLVLSL 1007
>gi|255544948|ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis]
gi|223547443|gb|EEF48938.1| hypothetical protein RCOM_1578820 [Ricinus communis]
Length = 1915
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 41/72 (56%), Gaps = 4/72 (5%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYCQNMFERKRFLQHDA 631
N D C +C GNL+ CDGCP A+H +C ++ S+P+GDW+C C +R +
Sbjct: 730 NSDECCLCKMDGNLICCDGCPAAYHSKCVGVANDSLPEGDWFCPEC--AIDRHKPWMKTR 787
Query: 632 NAVEAGRVSGVD 643
N++ + GVD
Sbjct: 788 NSLRGAELLGVD 799
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 862
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 863 LQHSKKGKTAQGLSPVDQ-----RKCERLL 887
>gi|15292405|gb|AAK93471.1| LP06732p [Drosophila melanogaster]
gi|220947368|gb|ACL86227.1| tou-PB [synthetic construct]
gi|220956830|gb|ACL90958.1| tou-PB [synthetic construct]
Length = 683
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 54/145 (37%), Gaps = 52/145 (35%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC + +H C + +IP GDWYC C N +R
Sbjct: 193 CQFCTSGENEDKLLLCDGCDKGYHTYCFKPKMDNIPDGDWYCYECVNKATNER------- 245
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
C++C G S G ++ CD C R
Sbjct: 246 --------------------------------KCIVCGGHRPSPVG----KMIYCDLCPR 269
Query: 693 EFHVGCLKKHKMADLRELPKGKWFC 717
+H C + L ++P+GKW+C
Sbjct: 270 AYHADCY----IPPLLKVPRGKWYC 290
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 7/51 (13%)
Query: 574 NDDLCTICADG-----GNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N+ C +C G ++ CD CPRA+H +C L +P+G WYC C
Sbjct: 243 NERKCIVCGGHRPSPVGKMIYCDLCPRAYHADCYIPPLLKVPRGKWYCHGC 293
>gi|380023668|ref|XP_003695637.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis florea]
Length = 2272
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 55/146 (37%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1989 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 2041
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C G K+ ++LC+ C R
Sbjct: 2042 --------------------------------NCLVC-GKRVGKN------LVLCELCPR 2062
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 2063 AYHTDC---HNPV-MPKMPRGKWYCS 2084
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 2005 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 2047
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 2048 KRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2094
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 659 LEAELSG--CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWF 716
LEA ++ ++ C F SG +LLCD C+R +H C + + +P G W+
Sbjct: 1974 LEASIAWDKSIMKANCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWY 2029
Query: 717 C--CMD 720
C CM+
Sbjct: 2030 CHECMN 2035
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 18/97 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+N E
Sbjct: 920 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPVGEWVCTLCRNPMKPEVEYDCE 979
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
R+ H NA G+D +Q K+C ++V +L
Sbjct: 980 NTRYA-HSYNA-----QYGLDDYDQ--KKCEKLVLSL 1008
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 758 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 817
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 818 LQHSKKGKTAQGLSPVDQ-----RKCERLL 842
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 771 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 830
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 831 LQHSKKGKTAQGLSPVDQ-----RKCERLL 855
>gi|340709835|ref|XP_003393506.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Bombus terrestris]
Length = 2263
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 55/146 (37%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 1980 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 2032
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C G K+ ++LC+ C R
Sbjct: 2033 --------------------------------NCLVC-GKRVGKN------LVLCELCPR 2053
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 2054 AYHTDC---HNPV-MPKMPRGKWYCS 2075
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 1996 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 2038
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 2039 KRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2085
Score = 43.1 bits (100), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 1979 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2026
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 69/186 (37%), Gaps = 53/186 (28%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC---QNMFERK---- 624
N D C C +GG L+ CD CP +FH EC L +P GDW+C+ C FE +
Sbjct: 26 NHDNCDSCGEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQDTPFEPQSMEM 85
Query: 625 ----RFLQH-----------------------------DANAVEAGRVSGVDSVEQITKR 651
R LQH DA+A G + DS + + R
Sbjct: 86 TIMGRLLQHLEGRNTVAFSLPLGVIKAVEPEDYADGQDDADA--PGESNLFDSDQSDSDR 143
Query: 652 CIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELP 711
+R K S C +C S+ C +C +H+ CL +A P
Sbjct: 144 GLRQGKRKRRHDSYCSVCSLPSPSRDDLA-----QCTRCPHSYHLWCLDPPLLAK----P 194
Query: 712 KGKWFC 717
KW C
Sbjct: 195 TVKWLC 200
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 849 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDC 906
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ + + V G+ V+Q+ +C R++
Sbjct: 907 DNSQHSKKGKTVQGLSPVDQM--KCERLL 933
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 862
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 863 LQHSKKGKTAQGLSPVDQ-----RKCERLL 887
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 56/101 (55%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 689 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL 748
Query: 621 FERKRFLQHDANA----VEAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A +E + G+ + I KR C R++
Sbjct: 749 --SKPEVEYDCDAPTHNLEKKKTEGLVKLTPIDKRKCERLL 787
>gi|350407087|ref|XP_003487980.1| PREDICTED: hypothetical protein LOC100749908 [Bombus impatiens]
Length = 2303
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 55/146 (37%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2020 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 2072
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C G K+ ++LC+ C R
Sbjct: 2073 --------------------------------NCLVC-GKRVGKN------LVLCELCPR 2093
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 2094 AYHTDC---HNPV-MPKMPRGKWYCS 2115
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 2036 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 2078
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 2079 KRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2125
Score = 43.1 bits (100), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2019 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2066
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 962 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 1021
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 1022 LQHSKKGKTAQGLSPVDQ-----RKCERLL 1046
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 332 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 377
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 800 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 859
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 860 LQHSKKGKTAQGLSPVDQ-----RKCERLL 884
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 815 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 874
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 875 LQHSKKGKTAQGLSPVDQ-----RKCERLL 899
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 969
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 970 LQHSKKGKTAQGLSPVDQ-----RKCERLL 994
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + PS F G + N C +C++GG+LL C+ CP AFH+EC ++ +PQG W
Sbjct: 520 NSFLCPSHFTPRKGCKNHEHINVSWCFVCSEGGSLLCCESCPAAFHRECLNI-EMPQGSW 578
Query: 613 YCKYCQNMFERKRF 626
+C C+ +R RF
Sbjct: 579 FCNDCK-AGKRPRF 591
Score = 47.4 bits (111), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
+Q K+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 945 KQEVTKEREDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 998
Score = 47.4 bits (111), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 33/53 (62%), Gaps = 1/53 (1%)
Query: 566 GRQYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
G+ P +++C +C G+LL CDG C AFH +C LS+ P+G ++C+ C
Sbjct: 370 GKIGPTSLKENVCQVCERTGDLLACDGHCYGAFHPQCIGLSAAPKGKFFCREC 422
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 31/147 (21%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCE-----KEGIQWE 397
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E+ ++E CR C K G +L CD C
Sbjct: 398 PKDDDD-------EEEEGGCEEEEEDDHME-------FCRVC---KDG---GELLCCDAC 437
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L E+P G+W C
Sbjct: 438 PSSYHLHCLN----PPLPEIPNGEWLC 460
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 50/88 (56%), Gaps = 6/88 (6%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL CD CP+ FH C +L+ P G+W+C +C+++ + +++D
Sbjct: 713 NEDWCAVCQNGGELLCCDKCPKVFHLSCHIPALNESPSGEWFCSFCRDLLNPE--MEYDC 770
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNL 659
N + VE+ ++C R++ L
Sbjct: 771 NRQDRPPSEKFPLVER--RKCERLLLRL 796
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 901 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 958
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 959 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 985
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 577
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 578 LQHSKKGKTAQGLSPVDQ-----RKCERLL 602
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 956
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 577
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 578 LQHSKKGKTAQGLSPVDQ-----RKCERLL 602
>gi|301093217|ref|XP_002997457.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110713|gb|EEY68765.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 248
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 25/57 (43%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 566 GRQYPGKDND-DLCTICADGGNLLPCDGCPRAFHKECA--SLSSIPQGDWYCKYCQN 619
+Q P D + C IC GG LL CDGC RA+H C +L +P+GDW+C +C++
Sbjct: 177 AQQRPAADQHLEFCGICLKGGKLLCCDGCERAYHLNCVRPALLDVPEGDWFCSHCRD 233
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 832 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 891
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 892 LQHSKKGKTAQGLSPVDQ-----RKCERLL 916
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 57/149 (38%), Gaps = 46/149 (30%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C C GG L+ CD CPRA+H C A++ P+GDW C +C ++H
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHG 303
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
V+ D C +C+ + +LLCD C
Sbjct: 304 PEIVKEEPAKQNDDF--------------------CKICKETE---------NLLLCDNC 334
Query: 691 EREFHVGCLKKHKMADLRELP--KGKWFC 717
FH C+ L ELP W C
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWAC 359
Score = 47.0 bits (110), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGD--WYCKYCQ 618
++ P K NDD C IC + NLL CD C +FH C L +P D W C C+
Sbjct: 308 KEEPAKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRCE 363
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 753 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 812
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 813 LQHSKKGKTAQGLSPVDQ-----RKCERLL 837
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 903 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 960
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 961 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 987
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 710 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 767
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 768 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 794
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 945
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 946 LQHSKKGKTAQGLSPVDQ-----RKCERLL 970
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 60.8 bits (146), Expect = 4e-06, Method: Composition-based stats.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 1017 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 1076
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 1077 LQHSKKGKTAQGLSPVDQ-----RKCERLL 1101
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 57/149 (38%), Gaps = 46/149 (30%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C C GG L+ CD CPRA+H C A++ P+GDW C +C ++H
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHC---------MEHG 303
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
V+ D C +C+ + +LLCD C
Sbjct: 304 PEIVKEEPAKQNDDF--------------------CKICKETE---------NLLLCDNC 334
Query: 691 EREFHVGCLKKHKMADLRELP--KGKWFC 717
FH C+ L ELP W C
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWAC 359
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 4/56 (7%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGD--WYCKYCQ 618
++ P K NDD C IC + NLL CD C +FH C L +P D W C C+
Sbjct: 308 KEEPAKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRCE 363
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 65/158 (41%), Gaps = 36/158 (22%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ +
Sbjct: 327 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHCE---------KEG 377
Query: 631 ANAVEAGRVSGVDSVEQ---ITKRCIRIVKNL-------EAELSGCLLCR-GCDFSKSGF 679
+V G DS + + + +IV + + C CR G D
Sbjct: 378 ITSVSKGNKECEDSGSEPAVTSDKDGKIVSAVHPTSPEKDEHQEFCTECRDGGD------ 431
Query: 680 GPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
++ CD C +H+ CL + L +P+G W C
Sbjct: 432 ----LICCDNCPASYHIACL----IPPLANIPEGVWLC 461
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 570 PGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
P KD + + CT C DGG+L+ CD CP ++H C L++IP+G W C C
Sbjct: 414 PEKDEHQEFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRC 464
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 956
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSC 344
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 849 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDC 906
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ + + V G+ V+Q+ +C R++
Sbjct: 907 DNSQHSKKGKTVQGLSPVDQM--KCERLL 933
>gi|328792710|ref|XP_623473.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
[Apis mellifera]
Length = 2293
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 55/146 (37%), Gaps = 55/146 (37%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + +IP GDWYC C N +R
Sbjct: 2010 CQFCHSGDNEDKLLLCDGCDRGYHTYCFRPKMENIPDGDWYCHECMNKATGER------- 2062
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
CL+C G K+ ++LC+ C R
Sbjct: 2063 --------------------------------NCLVC-GKRVGKN------LVLCELCPR 2083
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCC 718
+H C H + ++P+GKW+C
Sbjct: 2084 AYHTDC---HNPV-MPKMPRGKWYCS 2105
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 2026 DGCDRGYHTYCFRPKMENIPDG-DWYCHECMNKAT-------------GERN---CLVCG 2068
Query: 583 D--GGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
G NL+ C+ CPRA+H +C + + +P+G WYC C + +KR
Sbjct: 2069 KRVGKNLVLCELCPRAYHTDCHNPVMPKMPRGKWYCSNCHSKQPKKR 2115
Score = 43.1 bits (100), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 6/52 (11%)
Query: 671 GCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C F SG +LLCD C+R +H C + + +P G W+C CM+
Sbjct: 2009 NCQFCHSGDNEDKLLLCDGCDRGYHTYCFR----PKMENIPDGDWYCHECMN 2056
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 11/94 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 669 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 726
Query: 632 NAV----EAGRVSGVDSVEQITKRC---IRIVKN 658
+A E + G+ + I KR +I+KN
Sbjct: 727 DAPSHNSEKKKTEGLVKLTPIDKRVPDYYKIIKN 760
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSC 338
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 849 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--NKPEVEYDC 906
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ + + V G+ V+Q+ +C R++
Sbjct: 907 DNSQHSKKGKTVQGLSPVDQM--KCERLL 933
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 729 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 786
Query: 632 NA----VEAGRVSGVDSVEQITKR-CIRIV 656
+A + ++ G+ + I KR C R++
Sbjct: 787 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 816
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 54/147 (36%), Gaps = 47/147 (31%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++ D C +C GG ++ CD CPRA+H C L P+G W C +C+N
Sbjct: 374 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G D Q RI K+ G LLC CD C
Sbjct: 423 -----EGPAEQDDDEHQ---EFCRICKD-----GGELLC-----------------CDSC 452
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P G W C
Sbjct: 453 PSAYHTHCLN----PPLVEIPDGDWKC 475
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 32/60 (53%), Gaps = 6/60 (10%)
Query: 564 HAGRQYPGKDNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
H + P + +DD C IC DGG LL CD CP A+H C + L IP GDW C C
Sbjct: 419 HCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 478
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 847 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 904
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 905 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 931
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 34/62 (54%), Gaps = 9/62 (14%)
Query: 558 PSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCK 615
PS + H + N+D C +C DGG L+ CDGCPRAFH C S L IP G W C
Sbjct: 371 PSDLQLH-------QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCS 423
Query: 616 YC 617
C
Sbjct: 424 SC 425
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 54/147 (36%), Gaps = 47/147 (31%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++ D C +C GG ++ CD CPRA+H C L P+G W C +C+N
Sbjct: 371 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 419
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G D Q RI K+ G LLC CD C
Sbjct: 420 -----EGPAEQDDDEHQ---EFCRICKD-----GGELLC-----------------CDSC 449
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P G W C
Sbjct: 450 PSAYHTHCLN----PPLVEIPDGDWKC 472
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 32/60 (53%), Gaps = 6/60 (10%)
Query: 564 HAGRQYPGKDNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
H + P + +DD C IC DGG LL CD CP A+H C + L IP GDW C C
Sbjct: 416 HCENEGPAEQDDDEHQEFCRICKDGGELLCCDSCPSAYHTHCLNPPLVEIPDGDWKCPRC 475
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ + +
Sbjct: 368 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 427
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 428 LQHSKKGKTAQGLSPVDQ-----RKCERLL 452
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-----QNMFERK 624
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C Q+M R
Sbjct: 424 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQATVQDMRPRA 483
Query: 625 RFLQHDANAVE 635
+ VE
Sbjct: 484 EEPRPQEPPVE 494
>gi|297817898|ref|XP_002876832.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322670|gb|EFH53091.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 386
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 106/265 (40%), Gaps = 70/265 (26%)
Query: 707 LRELPKGKWFC--CMDCSRINSVLQNLLVQEAEKLPEFHLNAIKKYAG--NSLE------ 756
++ P G W C C C +V++++ K +KKY G + LE
Sbjct: 136 IKMFPHGDWHCPNCT-CKFCRAVVEDVSQTVGAKCL---FEGVKKYVGVKHELEARFSWS 191
Query: 757 ------TVSDIDVRWRLLSGKAATPETRLLLSQA----VAIFHDCFDPIVDSISGRDLIP 806
T SD +RW TP QA + + +CF PI+D SG
Sbjct: 192 LVHRECTDSDFILRW--------TPSYCGKQFQAGHSSLTVMDECFLPIIDRRSG----- 238
Query: 807 SMVYGRNLRGQEFGGMYCAILTVNSSVVSAGILRVF-GQEVAELPLVATSKINHGKGYFQ 865
G YC + L++F G +AE+ + T + +G +
Sbjct: 239 --------------GKYC----------TKCPLQLFHGNRLAEMQFIGTRHVYRHQGMCR 274
Query: 866 LLFACIE------KLLSFLRVKSIVLPAAEEAESIWTDKFGFKKIDPELLSIYRKRCSQL 919
LF+ +E K L L+V+ +V+PA + +W KFGFK ++ L R L
Sbjct: 275 RLFSVVESMSFDVKTLQNLKVELLVIPATADLSHVWISKFGFKYVEDSLKK--ELRSMNL 332
Query: 920 VTFKGTSMLQKRVPACRIGSSSTDS 944
+ F G +LQK + A R S+ D+
Sbjct: 333 LAFPGIDVLQKELLAPRHAKSAADT 357
>gi|242051184|ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
gi|241926713|gb|EER99857.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
Length = 1688
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 556 VSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWY 613
V SQF + G N D C +C GNLL CDGCP AFH +C + +P+GDWY
Sbjct: 409 VDSSQFPEGSAEMDDG--NSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGDWY 466
Query: 614 CKYC 617
C C
Sbjct: 467 CPEC 470
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFL 627
+ N+D C C DGG LL CDGCPRAFH C LS IP G W C C +R L
Sbjct: 301 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRDL 358
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS + P+ F A G + N C +C++GG+LL C+ CP AFH+EC ++
Sbjct: 407 GSVVLASNSIICPNHFTARRGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 465
Query: 606 SIPQGDWYCKYCQ 618
+P+G WYC C+
Sbjct: 466 EMPEGSWYCNDCK 478
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPC--DGCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG L+ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 844 KEREDECFSCGDGGQLVSCKKSGCPKVYHADCLNLTKRPAGKWECPWHQ 892
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|402590896|gb|EJW84826.1| chromodomain-helicase-DNA-binding protein 4 [Wuchereria bancrofti]
Length = 1519
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 60/147 (40%), Gaps = 42/147 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+A+H C + P+G W C C
Sbjct: 168 DHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPDMEEPPEGRWSCPTC------------- 214
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E+ + D E+ +I N+E CR C + G+ +L CD C
Sbjct: 215 ----ESTGAAKDDEEEK------KITTNME-------YCRTC--KEGGW----LLCCDTC 251
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P+G W C
Sbjct: 252 PSSYHAYCLN----PSLTEIPEGDWSC 274
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N + C C +GG LL CD CP ++H C SL+ IP+GDW C C
Sbjct: 232 NMEYCRTCKEGGWLLCCDTCPSSYHAYCLNPSLTEIPEGDWSCPRC 277
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 52/89 (58%), Gaps = 9/89 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 893 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 950
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ VEQ K C R++
Sbjct: 951 DNLQHSKKGKTAQGLFEVEQKIK-CERLL 978
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 250 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 297
>gi|449269479|gb|EMC80242.1| Transcription intermediary factor 1-alpha, partial [Columba livia]
Length = 914
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 37/61 (60%), Gaps = 3/61 (4%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+HAG D N+D C +C +GG LL C+ CP+ FH C SL S P G+W C +C++
Sbjct: 686 SHAGEGRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLRSFPSGEWICTFCRD 745
Query: 620 M 620
+
Sbjct: 746 L 746
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 255 LRGIIRDGGILCSCSLCNGCRVIPPSKFEIHACKQYRRASQYICFENGKSLLEVLRACRS 314
L GII GG LC CS CN +V+ +FE HA + R + +I ENGK + +++ ++
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 315 VPLPMLKATLQSALSSLPEEKSF 337
PL M+ ++ S E+ F
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFF 89
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 340
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
NDD C +C DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 286 NDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCC 331
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 52/127 (40%), Gaps = 27/127 (21%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C LQ
Sbjct: 297 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--- 344
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 691
GRV S ++++ L AE G C G +L C C
Sbjct: 345 -----GRVQQNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCA 391
Query: 692 REFHVGC 698
FH C
Sbjct: 392 AAFHWRC 398
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 52/127 (40%), Gaps = 27/127 (21%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C LQ
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--- 339
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 691
GRV S ++++ L AE G C G +L C C
Sbjct: 340 -----GRVQQNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCA 386
Query: 692 REFHVGC 698
FH C
Sbjct: 387 AAFHWRC 393
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 52/127 (40%), Gaps = 27/127 (21%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C LQ
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--- 343
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 691
GRV S ++++ L AE G C G +L C C
Sbjct: 344 -----GRVQQNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCA 390
Query: 692 REFHVGC 698
FH C
Sbjct: 391 AAFHWRC 397
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 52/127 (40%), Gaps = 27/127 (21%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C LQ
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC---------LQ--- 340
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 691
GRV S ++++ L AE G C G +L C C
Sbjct: 341 -----GRVQQNLSQPEVSR-----PPELPAETPGPAPSARCSVCGDG---TEVLRCAHCA 387
Query: 692 REFHVGC 698
FH C
Sbjct: 388 AAFHWRC 394
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+++ C +C+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 37 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 82
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 295 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 79/209 (37%), Gaps = 45/209 (21%)
Query: 564 HAGRQYPGK-DNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ 618
HA + PG+ N D C C DGG L+ CD CP +FH +C LS IP G+W C C+
Sbjct: 46 HAYFKRPGRGHNRDSCDACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACR 105
Query: 619 NMFERKRFL---QHDANAVEAGRVS----------------------GVDSVEQIT---- 649
+++ + + NA+E ++ G D ++ ++
Sbjct: 106 CAMKKENSIGNKRKKKNALEVLALAASLVNPKEFELPRELQIPITFPGTDKIDPVSFKRG 165
Query: 650 --KRCIRIVKNLEAELSGC---LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKM 704
I + G L R C + ++ CD C FH CL
Sbjct: 166 KHHNSNNINGKIHYHEYGSITPLPARLCFVCRKSCRKAPLIACDYCPLYFHQDCLD---- 221
Query: 705 ADLRELPKGKWFCCMDCSRINSVLQNLLV 733
L P G+W C + + QNLL
Sbjct: 222 PPLTAFPSGRWMCPNHLNHF--IDQNLLT 248
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 771 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 830
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 831 WICTFCRDL--SKPEVEYDCDAPSHISEKKKTEGLVKLTPIDKRKCERLL 878
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 771 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 830
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 831 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 878
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 268 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 325
Query: 632 NA----VEAGRVSGVDSVEQITKR-CIRIV 656
+A + ++ G+ + I KR C R++
Sbjct: 326 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 355
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 771 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 830
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 831 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 878
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 771 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 830
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 831 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 878
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 119 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 176
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 177 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 203
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 710 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 769
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 770 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 817
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 782 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 841
Query: 621 FERKRFLQHDANA----VEAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 842 --SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 880
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 716 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 775
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 776 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 823
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P + N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 405 PHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSC 454
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 725 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 784
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 785 --SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 823
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQHD 630
N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++ + + D
Sbjct: 819 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSRPEVEYDCDD 878
Query: 631 ANAVEAGRVSGVDSVEQITKR-CIRIV 656
+ E ++ G S+ I +R C RI+
Sbjct: 879 PSLAEKRKLGGAQSMAPIDQRKCERIL 905
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 130 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 189
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 190 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 237
>gi|255731658|ref|XP_002550753.1| predicted protein [Candida tropicalis MYA-3404]
gi|240131762|gb|EER31321.1| predicted protein [Candida tropicalis MYA-3404]
Length = 507
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 20/144 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC----------QNMF 621
NDD C+ C G L+ C+ CP++FH C L P+ +WYC+ C + +F
Sbjct: 201 NDDFCSSCELPGILICCESCPKSFHFTCCDPPLEEAPEDEWYCQECYSKIHPPILTEGIF 260
Query: 622 ERKRFLQHDANAVEAG---RVSGVDSVEQITKRCIRIVKNLEA---ELSGCLLCRGCDFS 675
+ F N E + + Q+ +R +LEA E LC C
Sbjct: 261 GKLLFELEKTNPKEFKLPVKFQLPKNESQLQERIKDKNWDLEALYDENDKPYLCHCC--G 318
Query: 676 KSGFGPRTILLCDQCEREFHVGCL 699
SG RT++ CD C +H+ CL
Sbjct: 319 GSGLNRRTLIHCDYCPLVYHIDCL 342
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 805 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 864
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 865 WICTFCRDL--SKPEVEYDCDAPSHISEKKKTEGLVKLTPIDKRKCERLL 912
>gi|322707103|gb|EFY98682.1| Essential subunit of the histone deacetylase Rpd3S complex
[Metarhizium anisopliae ARSEF 23]
Length = 1190
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 42/167 (25%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECASL---SSIPQGDWYCKYC--------- 617
P DND+ C+ C + G+++ CDGCPR+FH EC + +P +WYC C
Sbjct: 751 PNADNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDDLPD-EWYCNECLIRRYPSRV 809
Query: 618 ------------------QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
F + +Q+ V+AG + D E ++ R R
Sbjct: 810 PIHKGIFGSALNNLEKSIPRAFSLPKRVQNRFEGVKAG--ADGDYEEVVSNRTARKRNGT 867
Query: 660 EA-------ELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ E +LC C K R+I+ C C +H+ CL
Sbjct: 868 DEPDFFKQREDGQAVLCHSC--QKPATQIRSIIPCSVCSFYWHIDCL 912
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 684 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 743
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 744 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 791
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 688 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 747
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 748 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 795
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 764 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 823
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 824 --SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 862
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 805 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 864
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 865 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 912
>gi|118384512|ref|XP_001025404.1| PHD-finger family protein [Tetrahymena thermophila]
gi|89307171|gb|EAS05159.1| PHD-finger family protein [Tetrahymena thermophila SB210]
Length = 1453
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%)
Query: 568 QYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
Q +D D+ C +C G +L CD CP FH +C L S+P GDW C CQ ++R
Sbjct: 1384 QNKNQDWDEQCKVCGQHGEVLMCDTCPSVFHLKCIGLKSLPDGDWSCLECQQKLLKQR 1441
>gi|47220585|emb|CAG05611.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 60.1 bits (144), Expect = 6e-06, Method: Composition-based stats.
Identities = 23/47 (48%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQ 618
+DD C +C +GG LL CD CP+ +H C LS PQGDW C C+
Sbjct: 1 SDDFCAVCLNGGELLCCDRCPKVYHLSCHLPPLSGFPQGDWVCTLCR 47
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS V P+ F G + N C +C +GG+LL C+ CP AFH+EC ++ +P+G W
Sbjct: 1743 NSIVCPNHFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSW 1801
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1802 YCNDCK 1807
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C IC DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2172 KEREDECFICGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2220
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 568 QYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYCQN 619
Q P +++C +C G LL C+G C AFH C SL+ P+G + C C++
Sbjct: 1594 QSPASMKENICQVCEKTGELLLCEGQCCGAFHLPCISLAEAPKGKFVCPECKS 1646
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 805 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 864
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 865 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 912
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 805 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 864
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 865 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 912
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 805 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 864
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 865 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 912
>gi|357474041|ref|XP_003607305.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
gi|355508360|gb|AES89502.1| Chromodomain helicase DNA binding protein [Medicago truncatula]
Length = 1573
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLS--SIPQGDWYCKYC 617
N D C +C GNL+ CDGCP AFH C ++ S+P+GDWYC C
Sbjct: 795 NSDECCLCKMDGNLICCDGCPAAFHSRCVGIASDSLPEGDWYCPEC 840
>gi|302793688|ref|XP_002978609.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
gi|300153958|gb|EFJ20595.1| hypothetical protein SELMODRAFT_443911 [Selaginella moellendorffii]
Length = 1349
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N D C +C GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 471 NSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS V P+ F G + N C +C +GG+LL C+ CP AFH+EC ++
Sbjct: 1561 GSVVLSSNSIVCPNHFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI- 1619
Query: 606 SIPQGDWYCKYCQ 618
+P+G WYC C+
Sbjct: 1620 EMPKGSWYCNDCK 1632
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 571 GKDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1996 AKEREDECFSCGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2045
Score = 40.4 bits (93), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 542 KNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKE 600
K+ G+ C+ V P++ + R P + +C +C G LL C+G C AFH
Sbjct: 1398 KDDFGL----CDDSVLPAR-KIIGDRGGPASMKETICQVCEKTGELLLCEGQCCGAFHLA 1452
Query: 601 CASLSSIPQGDWYCKYCQN 619
C SL+ P+G + C C++
Sbjct: 1453 CISLADAPKGKFVCPECKS 1471
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|302774224|ref|XP_002970529.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
gi|300162045|gb|EFJ28659.1| hypothetical protein SELMODRAFT_441144 [Selaginella moellendorffii]
Length = 1340
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
N D C +C GNL+ CDGCP A+H C S S++P+GDWYC C
Sbjct: 471 NSDDCVLCGMDGNLICCDGCPAAYHSRCVGVSKSTLPEGDWYCPEC 516
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 706 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 765
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 766 --SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 804
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+++ C +C+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+++ C +C+DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 79
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 767 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 826
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 827 WICTFCRDL--SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 874
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 340
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 176 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 235
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 236 --SKPEVEYDCDAPSHNSEKKKTEGLVKLTPIDKRKCERLL 274
>gi|17562600|ref|NP_504523.1| Protein LET-418 [Caenorhabditis elegans]
gi|403399446|sp|G5EBZ4.1|LE418_CAEEL RecName: Full=Protein let-418; AltName: Full=Lethal protein 418
gi|11095333|gb|AAG29838.1| LET-418 [Caenorhabditis elegans]
gi|351020697|emb|CCD62685.1| Protein LET-418 [Caenorhabditis elegans]
Length = 1829
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 59/154 (38%), Gaps = 44/154 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C C G LL CD CPRA+H C ++ P+GDW C +C ++H
Sbjct: 254 ENNDYCEECKQDGELLLCDTCPRAYHTVCIDENMEEPPEGDWSCAHC---------IEHG 304
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
V+ D C +C+ + +LLCD C
Sbjct: 305 PEVVKEEPAKQNDEF--------------------CKICKETE---------NLLLCDSC 335
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCMDCSRI 724
FH C+ L E+PK + + C C +
Sbjct: 336 VCSFHAYCID----PPLTEVPKEETWSCPRCETV 365
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGD-WYCKYCQNM 620
++ P K ND+ C IC + NLL CD C +FH C L+ +P+ + W C C+ +
Sbjct: 309 KEEPAKQNDEFCKICKETENLLLCDSCVCSFHAYCIDPPLTEVPKEETWSCPRCETV 365
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
NDD C +C DGG L+ CDGCPRAFH C L+ IP G W C C
Sbjct: 253 NDDECAVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWRCGCC 298
>gi|291225093|ref|XP_002732536.1| PREDICTED: bromodomain adjacent to zinc finger domain, 1B-like
[Saccoglossus kowalevskii]
Length = 1438
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 63/151 (41%), Gaps = 27/151 (17%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C IC G+ LL CD C + FH C +LS +P+GDW C C+ R+ D
Sbjct: 1092 CKICRKKGDEDKLLLCDECNQPFHLYCLRPALSYVPKGDWMCPACKPSVARRNSRGRDYA 1151
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
+ G S D ++ + E+E +C CD + ++ C +C
Sbjct: 1152 ELNGG--SDSDEYDE--------TDSDESEAEHDEMCCMCD------DDQELVYCSRCPA 1195
Query: 693 EFHVGCLKKHKMADLRELPKGKWFC--CMDC 721
+H C LR P+GKW C C +C
Sbjct: 1196 AYHRECHD----PPLRNFPRGKWVCSACTNC 1222
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQN 619
++D++C +C D L+ C CP A+H+EC L + P+G W C C N
Sbjct: 1173 EHDEMCCMCDDDQELVYCSRCPAAYHRECHDPPLRNFPRGKWVCSACTN 1221
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 10/57 (17%)
Query: 661 AELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
AE + C +CR K G + +LLCD+C + FH+ CL+ L +PKG W C
Sbjct: 1087 AENAKCKICR-----KKGDEDK-LLLCDECNQPFHLYCLR----PALSYVPKGDWMC 1133
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 790 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDC 847
Query: 632 NAV----EAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 848 DATGHSSEKKKSDGLGRLMPIDKRKCERLL 877
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 27/47 (57%)
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
C +C GG +L CD CPR FH C L IP+G W C C + F R+
Sbjct: 824 CKVCGQGGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQ 870
Score = 43.9 bits (102), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 13/90 (14%)
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCE 691
N +++ ++ +E+I + ++ V+ + E C+ C G G + +LLCD C
Sbjct: 787 NTIQSQQLQPSLIIEEIYRDQVKKVQIQDGENIWEEQCKVC-----GQGGK-VLLCDTCP 840
Query: 692 REFHVGCLKKHKMADLRELPKGKWFCCMDC 721
R FH CLK L+E+PKGKW CM C
Sbjct: 841 RVFHPRCLK------LKEIPKGKW-SCMIC 863
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 73 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDC 130
Query: 632 NA----VEAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 131 DAPSHNSEKKKTEGLVKLTPIDKRKCERLL 160
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 89/196 (45%), Gaps = 27/196 (13%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 547 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDALASLENHVKTEPTDM 599
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + S A G KD+D D C +C +GG+
Sbjct: 600 NESCKQSGLSSLVNGKSPV-------RSLMHRSARIGGDGSNKDDDPNEDWCAVCQNGGD 652
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR----VS 640
LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ + V
Sbjct: 653 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNLQHSKKGKTVQ 710
Query: 641 GVDSVEQITKRCIRIV 656
G+ V+Q ++C R++
Sbjct: 711 GLSPVDQ--RKCERLL 724
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 824 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDC 881
Query: 632 NAV----EAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 882 DATGHSSEKKKSDGLGRLMPIDKRKCERLL 911
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 272 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGC 319
>gi|334187637|ref|NP_568273.2| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
gi|332004422|gb|AED91805.1| PHD-finger and DNA binding domain-containing protein [Arabidopsis
thaliana]
Length = 1602
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERK 624
N D C C G+LL CDGCP A+H +C L+S +P+GDWYC C F+R+
Sbjct: 607 NSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--AFDRR 657
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 566 GRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
G P + N+D C +C DGG L+ CDGCPRAFH C L IP G W C C
Sbjct: 69 GDPQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSC 122
>gi|14586370|emb|CAC42901.1| putative protein [Arabidopsis thaliana]
Length = 1595
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERK 624
N D C C G+LL CDGCP A+H +C L+S +P+GDWYC C F+R+
Sbjct: 607 NSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--AFDRR 657
>gi|297807283|ref|XP_002871525.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
gi|297317362|gb|EFH47784.1| hypothetical protein ARALYDRAFT_488087 [Arabidopsis lyrata subsp.
lyrata]
Length = 1581
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 34/53 (64%), Gaps = 4/53 (7%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERK 624
N D C C G+LL CDGCP A+H +C L+S +P+GDWYC C F+R+
Sbjct: 604 NSDDCCFCKMDGSLLCCDGCPAAYHSKCVGLASHLLPEGDWYCPEC--AFDRR 654
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 58/148 (39%), Gaps = 30/148 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC------------- 246
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
E V E+ R K E + CR C K G +L CD
Sbjct: 247 ----EKEGVQWEAKEEEEDYEEERGGKERRREEDDHMEYCRVC---KDG---GELLCCDA 296
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L ++P G+W C
Sbjct: 297 CISSYHIHCLN----PPLPDIPNGEWLC 320
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 277 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 299 NEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSC 344
>gi|326669387|ref|XP_003199000.1| PREDICTED: hypothetical protein LOC566660 [Danio rerio]
Length = 1062
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 562 EAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
E + P +N+D C +C GG+LL CD CP+ FH C L S P GDW C C++
Sbjct: 816 EHQPSEESPEIENEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLHSFPVGDWICTLCRD 875
Query: 620 M 620
+
Sbjct: 876 V 876
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 300 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCC 347
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 800 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 859
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 860 WICTFCRDL--SKPEVEYDCDAPSHNSEKRKTEGLVKLTPIDKRKCERLL 907
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 54/92 (58%), Gaps = 10/92 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P GDW C +C+++ K +++D
Sbjct: 6 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDC 63
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIVKNL 659
+ ++ + G+ V+Q ++C R++ L
Sbjct: 64 DNLQHSKKGKTAQGLSPVDQ--RKCERLLLYL 93
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 53/89 (59%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 706 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--SKPEVEYDC 763
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ ++ + G+ V+Q ++C R++
Sbjct: 764 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 790
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 259
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + + + C R+ K+ G LLC CD C
Sbjct: 260 EEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 296
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 297 ISSYHIHCLN----PPLPDIPNGEWLC 319
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 276 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
taurus]
gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
Length = 1016
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 780 HAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 839
>gi|410919013|ref|XP_003972979.1| PREDICTED: transcription intermediary factor 1-alpha-like [Takifugu
rubripes]
Length = 1006
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 37/57 (64%), Gaps = 3/57 (5%)
Query: 567 RQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
R P +D N+D C +C +GG LL CD CP+ FH C +L + P G+W+C +C+++
Sbjct: 682 RAAPEEDPNEDWCAVCQNGGELLCCDKCPKVFHLTCHIPTLIASPSGEWFCSFCRDL 738
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 430 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 489
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + + + C R+ K+ G LLC CD C
Sbjct: 490 EEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 526
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 527 ISSYHIHCLN----PPLPDIPNGEWLC 549
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 506 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 552
>gi|340522049|gb|EGR52282.1| predicted protein [Trichoderma reesei QM6a]
Length = 1069
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/169 (23%), Positives = 65/169 (38%), Gaps = 48/169 (28%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECASL---SSIPQGDWYCKYCQNMFERKRFL 627
G DND+ C+ C + G+++ CDGCPR+FH EC + +P +WYC C +RF
Sbjct: 630 GSDNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWYCNECL----VRRFP 684
Query: 628 QHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSG---------------------- 665
+ S ++++E+ R + K ++ G
Sbjct: 685 SR-VPVYKGAFASALNALEKSIPRAFSLPKKIQTRFEGVKAGADGDYEEVTTVKAKKRSG 743
Query: 666 ---------------CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+LC C K+ R I+ C C +H+ CL
Sbjct: 744 YDELPDFFKQREDGEAVLCHAC--QKAATEIRAIIPCSACPLHWHIDCL 790
>gi|327272268|ref|XP_003220907.1| PREDICTED: transcription intermediary factor 1-alpha-like [Anolis
carolinensis]
Length = 1038
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
AH G D N+D C +C +GG LL CD CP+ FH C +L + P G+W C +C++
Sbjct: 800 AHVGDSRKEDDPNEDWCAVCQNGGELLCCDRCPKVFHLSCHVPTLRNFPSGEWICTFCRD 859
Query: 620 M 620
+
Sbjct: 860 L 860
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 765 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 824
Query: 612 WYCKYCQNMFERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D +A E + G+ + I KR C R++
Sbjct: 825 WICTFCRDL--SKPEVEYDCDAPSHNSEKRKTEGLVKLTPIDKRKCERLL 872
>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
garnettii]
Length = 1014
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 59/110 (53%), Gaps = 11/110 (10%)
Query: 556 VSPSQFEA-HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGD 611
+ PSQ H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+
Sbjct: 769 LDPSQKSPLHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGE 828
Query: 612 WYCKYCQNMFERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
W C +C+++ K +++D ++ E + G+ + I KR C R++
Sbjct: 829 WICTFCRDL--SKPEVEYDCDTSSHNSEKKKTEGLAKLTPIDKRKCERLL 876
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 50/123 (40%), Gaps = 3/123 (2%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC-QNMFERKRFLQHD 630
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C Q ++ +
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQGRIQQNLSQPEE 352
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ +E + ++ C L C F + P T L C C
Sbjct: 353 SRPLEPSAETPGPTLSARCGVCGDSTDVLRCAHCAAAFHWRCHFPMAAVRPGTNLRCKSC 412
Query: 691 ERE 693
E
Sbjct: 413 SAE 415
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 55/101 (54%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 708 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 767
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G+ + I KR C R++
Sbjct: 768 --SKPEVEYDCDAPIHNSEKRKTEGLVKLTPIDKRKCERLL 806
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 66/163 (40%), Gaps = 30/163 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E + + + C R+ K+ G LLC CD C
Sbjct: 437 EEYEEGEEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 473
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 474 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 512
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 64/164 (39%), Gaps = 31/164 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C
Sbjct: 454 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC------------- 500
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCL-LCRGCDFSKSGFGPRTILLCDQ 689
E V E+ R K E + CR C K G +L CD
Sbjct: 501 ----EKEGVQWEAKEEEEDYEEERGGKERRREEDDHMEYCRVC---KDG---GELLCCDA 550
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
C +H+ CL L ++P G+W C C + +Q +L
Sbjct: 551 CISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 590
>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Bos taurus]
Length = 1047
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 811 HAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 870
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G + NS + P+ F G + N C +C++GG+LL C+ CP AFH+EC ++
Sbjct: 1497 GAVTLASNSIICPNHFTPRRGCKNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI- 1555
Query: 606 SIPQGDWYCKYCQ 618
+P+G W+C C+
Sbjct: 1556 DMPEGSWFCNDCK 1568
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+++D C C DGG L+ C GCP+ +H EC L+ P G W C + Q
Sbjct: 1931 KEHEDECFSCGDGGQLVSCKKPGCPKVYHAECLKLTRRPAGKWECPWHQ 1979
>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
taurus]
gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
Length = 1050
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 814 HAGEMRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 873
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 3 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 60
Query: 632 NA----VEAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 61 DAPSHNSEKKKTEGLVKLTPIDKRKCERLL 90
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 43/84 (51%), Gaps = 6/84 (7%)
Query: 540 GYKNGLGIICHCC-----NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCP 594
Y G G + C N+ V + F A G+ + N C +C+ GG+LL C+ CP
Sbjct: 792 AYHGGDGCLAAGCSVIASNNIVCTNHFTARKGKSHHAHVNVSWCFVCSKGGSLLCCESCP 851
Query: 595 RAFHKECASLSSIPQGDWYCKYCQ 618
AFH +C ++ +P G WYC C+
Sbjct: 852 AAFHPDCLNI-EMPDGSWYCNDCR 874
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1237 KESEDDCFRCGDGGQLVLCDRKCCTKAYHLSCLDLVKRPFGKWECPW 1283
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 577 LCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+C +C GNLL C+G C AFH C LS P+G + C C
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSCLGLSRRPEGKFICSEC 711
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C +GG+LL C+ CP AFH+EC ++ +P+G W
Sbjct: 1691 NSIICPNHFTPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSW 1749
Query: 613 YCKYCQ 618
YC C+
Sbjct: 1750 YCNDCK 1755
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 573 DNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2119 EREDECFSCGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 2166
Score = 43.5 bits (101), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 552 CNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQG 610
C+ +SPS+ + R P +++C +C G LL C+G C AFH C SL+ P+G
Sbjct: 1527 CDDTLSPSR-KIIGDRGGPASMKENICQVCEKTGELLLCEGQCCGAFHLACISLAEAPKG 1585
Query: 611 DWYCKYCQN 619
+ C C++
Sbjct: 1586 KFICPECKS 1594
>gi|313247391|emb|CBY15642.1| unnamed protein product [Oikopleura dioica]
Length = 1498
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
++++LC C GG LL C+ CPR +H +C + + IP GDW+C YC
Sbjct: 146 EHNELCQFCKSGGELLACESCPRVYHPKCLNPPQTEIPDGDWFCPYC 192
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 35/64 (54%), Gaps = 6/64 (9%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKE--CASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C DGG L+ CDGCPRAFH C L IP G W C C + R LQ
Sbjct: 303 NEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCL----QGRALQDTP 358
Query: 632 NAVE 635
+A E
Sbjct: 359 HAEE 362
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 893 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL--SKPEVEYDC 950
Query: 632 NAV----EAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 951 DAPSHNSEKKKTEGLVKLTPIDKRKCERLL 980
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 240 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCC 287
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 394 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 453
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + + + C R+ K+ G LLC CD C
Sbjct: 454 EEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 490
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 491 ISSYHIHCLN----PPLPDIPNGEWLC 513
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 470 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 516
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 996 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 1053
Query: 632 NAV----EAGRVSGVDSVEQITKR-CIRIV 656
+A E + G+ + I KR C R++
Sbjct: 1054 DAPSHNSEKKKTEGLVKLTPIDKRKCERLL 1083
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 55/148 (37%), Gaps = 47/148 (31%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQH 629
+++ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 348 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWSCPHCE----------- 396
Query: 630 DANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 689
E + D + + C + G LLC CD
Sbjct: 397 ----AEGTQEQDDDEHNEFCRLC---------KDGGELLC-----------------CDS 426
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L E+P G W C
Sbjct: 427 CTSAYHIFCLN----PPLSEIPDGDWKC 450
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 2/58 (3%)
Query: 562 EAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
EA ++ ++++ C +C DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 396 EAEGTQEQDDDEHNEFCRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
mutus]
Length = 940
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 704 HAGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 763
>gi|238502471|ref|XP_002382469.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
gi|220691279|gb|EED47627.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
Length = 875
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 70/163 (42%), Gaps = 37/163 (22%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ-------- 618
G++N+D C C G LL CDGC +FH C + ++ P+GDW+C C
Sbjct: 467 GEENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTL 526
Query: 619 ----NMFERKRF-----LQHDANAVEAGRVSGVDSV---EQITKRCIR----------IV 656
+ K F L+ V+ G D V +I R R ++
Sbjct: 527 LVAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLL 586
Query: 657 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ ++A+ +LC GC ++ G R I+ CD C FH+ C+
Sbjct: 587 RTVDAK-GKLILCHGC--GRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|317147911|ref|XP_001822376.2| PHD finger domain protein [Aspergillus oryzae RIB40]
Length = 857
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 70/163 (42%), Gaps = 37/163 (22%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ-------- 618
G++N+D C C G LL CDGC +FH C + ++ P+GDW+C C
Sbjct: 467 GEENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTL 526
Query: 619 ----NMFERKRF-----LQHDANAVEAGRVSGVDSV---EQITKRCIR----------IV 656
+ K F L+ V+ G D V +I R R ++
Sbjct: 527 LVAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLL 586
Query: 657 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ ++A+ +LC GC ++ G R I+ CD C FH+ C+
Sbjct: 587 RTVDAK-GKLILCHGC--GRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|83771111|dbj|BAE61243.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 730
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 70/163 (42%), Gaps = 37/163 (22%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ-------- 618
G++N+D C C G LL CDGC +FH C + ++ P+GDW+C C
Sbjct: 340 GEENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTL 399
Query: 619 ----NMFERKRF-----LQHDANAVEAGRVSGVDSV---EQITKRCIR----------IV 656
+ K F L+ V+ G D V +I R R ++
Sbjct: 400 LVAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLL 459
Query: 657 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ ++A+ +LC GC ++ G R I+ CD C FH+ C+
Sbjct: 460 RTVDAK-GKLILCHGC--GRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
[Desmodus rotundus]
Length = 951
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 715 HVGEARKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 774
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A E + G + I KR C R++
Sbjct: 775 --SKPEVEYDCDAPSHNSEKRKTEGPGKLAPIDKRKCERLL 813
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
>gi|410973162|ref|XP_003993024.1| PREDICTED: tripartite motif-containing protein 66 [Felis catus]
Length = 1211
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ FH C +L S P GDW C C+++ +
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLSFPGGDWVCTLCRSLTQ 1014
>gi|391871058|gb|EIT80224.1| PHD Zn-finger protein [Aspergillus oryzae 3.042]
Length = 730
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 70/163 (42%), Gaps = 37/163 (22%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYCQ-------- 618
G++N+D C C G LL CDGC +FH C + ++ P+GDW+C C
Sbjct: 340 GEENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTL 399
Query: 619 ----NMFERKRF-----LQHDANAVEAGRVSGVDSV---EQITKRCIR----------IV 656
+ K F L+ V+ G D V +I R R ++
Sbjct: 400 LVAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLL 459
Query: 657 KNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ ++A+ +LC GC ++ G R I+ CD C FH+ C+
Sbjct: 460 RTVDAK-GKLILCHGC--GRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS V + F A G+ + N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 813 NSIVCTNHFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 871
Query: 613 YCKYCQ 618
YC C+
Sbjct: 872 YCNDCR 877
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1240 KESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLGLVKRPFGKWECPW 1286
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 52/133 (39%), Gaps = 6/133 (4%)
Query: 571 GKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQH 629
G + +C +C G+LL C+G C RAFH C LS P G + C C + +
Sbjct: 667 GTKKEYVCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKFICSECTSGVHTCFVCKE 726
Query: 630 DANAVEAGRVSGVDSVEQ---ITKRCIRIVKN--LEAELSGCLLCRGCDFSKSGFGPRTI 684
V+ VS + K + + +N L CL C + S +
Sbjct: 727 RKADVKRCVVSHCGKFYHEACVKKFHLTVFENRGFRCPLHSCLSCHVSNPSHPRISKGKM 786
Query: 685 LLCDQCEREFHVG 697
+ C +C +H G
Sbjct: 787 MRCVRCPVAYHAG 799
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 295 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS V + F A G+ + N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 793 NSIVCTNHFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 851
Query: 613 YCKYCQ 618
YC C+
Sbjct: 852 YCNDCR 857
Score = 43.9 bits (102), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 43/99 (43%), Gaps = 24/99 (24%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDW----------------Y 613
K+++D C C DGG L+ CD C +A+H C L P G W +
Sbjct: 1220 KESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLGLVKRPFGKWECPWHHCDVCGKPSVSF 1279
Query: 614 CKYCQNMFERKRFLQHDANAVEAGR------VSGVDSVE 646
C +C N F ++ N+ G+ V GVDSVE
Sbjct: 1280 CHFCPNSFCKEHQDGTVLNSTLNGQLCCSEHVLGVDSVE 1318
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
Query: 554 SEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDW 612
+EVS S ++ G G + +C +C G+LL C+G C RAFH C LS P G +
Sbjct: 633 TEVSISSKKSERGT---GTKKEYVCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKF 689
Query: 613 YCKYC 617
C C
Sbjct: 690 VCSEC 694
>gi|334821760|gb|AEG90850.1| chromodomain-helicase-DNA-binding protein 5 [Apostichopus
japonicus]
Length = 110
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 42/148 (28%), Positives = 58/148 (39%), Gaps = 43/148 (29%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L + P+G W C C
Sbjct: 2 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEAAPEGKWSCPQC------------- 48
Query: 631 ANAVEAGRV-SGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQ 689
EAG VEQ + + + G LLC CDQ
Sbjct: 49 ----EAGAAPPSAHVVEQEQDEHMEFCRVRQD--GGELLC-----------------CDQ 85
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL L+++P G+W C
Sbjct: 86 CPSSYHIFCLN----PPLKKIPDGEWLC 109
Score = 43.1 bits (100), Expect = 0.78, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 549 CHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSS 606
C C + +P AH Q ++ + C + DGG LL CD CP ++H C + L
Sbjct: 45 CPQCEAGAAPPS--AHVVEQ-EQDEHMEFCRVRQDGGELLCCDQCPSSYHIFCLNPPLKK 101
Query: 607 IPQGDWYC 614
IP G+W C
Sbjct: 102 IPDGEWLC 109
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CDGCPRA+H C L P+G W C C
Sbjct: 342 DHQDYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTCVK----------- 390
Query: 631 ANAVEAG-RVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCD 688
N ++ R + D + +N++ + C C+ G D +L+CD
Sbjct: 391 -NGIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDGGD----------LLICD 439
Query: 689 QCEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL + ++P+G+W C
Sbjct: 440 TCPHSYHLNCLN----PPVEKVPEGEWSC 464
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS V + F A G+ + N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 814 NSIVCTNHFTAMKGKSHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 872
Query: 613 YCKYCQ 618
YC C+
Sbjct: 873 YCNDCR 878
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 44/99 (44%), Gaps = 24/99 (24%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDW----------------Y 613
K+++D C C DGG L+ CD C +A+H C L+ P G W +
Sbjct: 1241 KESEDDCFRCGDGGQLVLCDRKSCTKAYHLSCLDLAKRPFGKWECPWHHCDVCGKPSVSF 1300
Query: 614 CKYCQNMFERKRFLQHDANAVEAGR------VSGVDSVE 646
C +C N F ++ N+ G+ V GVDSVE
Sbjct: 1301 CHFCPNSFCKEHQDGTVLNSTLNGQLCCSEHVLGVDSVE 1339
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 4/65 (6%)
Query: 554 SEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDW 612
+EVS S ++ G G + +C +C G+LL C+G C RAFH C LS P G +
Sbjct: 654 TEVSISSKKSERGT---GTKKEYVCQLCEKTGDLLLCEGLCYRAFHVSCLGLSGRPAGKF 710
Query: 613 YCKYC 617
C C
Sbjct: 711 VCSEC 715
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLS----PPLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSC 338
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 295 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 342
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 310 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 362
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 292 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 337
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 305 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 357
>gi|301627494|ref|XP_002942909.1| PREDICTED: tripartite motif-containing protein 66-like [Xenopus
(Silurana) tropicalis]
Length = 1207
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 51/97 (52%), Gaps = 14/97 (14%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG +L CD CP+ FH C +L S P G+W C C+N+ K +++D
Sbjct: 964 ENEDFCAVCLNGGEMLCCDRCPKVFHLSCHVPALLSFPVGEWLCTLCRNL--TKPEVEYD 1021
Query: 631 --------ANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
N +E +D +Q ++C +V +L
Sbjct: 1022 CDNVRYCLENKMEIAAFPNLDDYDQ--RKCETLVLSL 1056
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 296 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 341
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 309 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 361
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C++GG+LL C+ CP AFH+EC ++ +PQG W
Sbjct: 1389 NSFLCPNHFTPRKGCKNHEHINVSWCFVCSEGGSLLCCEACPAAFHRECLNM-EMPQGSW 1447
Query: 613 YCKYCQ 618
+C C+
Sbjct: 1448 FCNDCK 1453
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 1819 KEREDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 1867
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 575 DDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+ +C +C G+LL C+G C AFH +C LS+ P+G + C+ C
Sbjct: 1247 ESVCQVCERTGDLLVCEGHCYGAFHLQCIGLSAPPKGKFLCREC 1290
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 84/197 (42%), Gaps = 29/197 (14%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 698 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDTLTSLENHVKTEPADM 750
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + S A G KD+D D C +C +GG+
Sbjct: 751 NESCKQSGLNSLVNGKSPV-------RSLMHRSARIGGDGSSKDDDPNEDWCAVCQNGGD 803
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVEAGRV 639
LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH A +
Sbjct: 804 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKTAQGL 863
Query: 640 SGVDSVEQITKRCIRIV 656
S VD ++C R++
Sbjct: 864 SPVDQ-----RKCERLL 875
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 291 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCC 338
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEF 742
++ CD C R FH+ CL L+E+P G W C C V QNL E + PE
Sbjct: 306 LICCDGCPRAFHLACLSP----PLQEIPSGLWRC--SCCLQGRVQQNLSQPEVSRPPEL 358
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+++ C +C DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
Length = 878
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 654 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--SKPEVEYDC 711
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ + + G+ V+Q ++C R++
Sbjct: 712 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 738
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+++ C +C DGG+LL CD CP +H+ C S L SIP+GDW C C
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRC 80
>gi|219112141|ref|XP_002177822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410707|gb|EEC50636.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 462
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 57/140 (40%), Gaps = 17/140 (12%)
Query: 488 FENASPPLSFPNKSRWNITPKDQRLHKLVF----DESGLPDGTEVGYYACGQKLLEGYK- 542
FE A + N S WN + +R +F D + E+ C L ++
Sbjct: 262 FEEAVKNMECDNGSDWNPDMR-RRFRDEIFRSRKDMDAVHGAMEIEMKTCLAYYLGTFRM 320
Query: 543 -NGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC 601
N ++ C E G+ D C IC DGG+LL CDGC +H +C
Sbjct: 321 SNDYRLLKTVCEEERQEGYSATEHGK--------DACAICGDGGSLLICDGCEGEYHMDC 372
Query: 602 A--SLSSIPQGDWYCKYCQN 619
SL+ IP+G W C C N
Sbjct: 373 VQPSLAEIPEGHWECDDCVN 392
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 304 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCC 351
>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Ovis aries]
Length = 1273
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C S+ +P G W
Sbjct: 753 NSIICTNHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLSI-DMPDGSW 811
Query: 613 YCKYCQ 618
+C C+
Sbjct: 812 FCNDCR 817
>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
Length = 1138
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ + +
Sbjct: 897 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDIGKPEVEYDCDN 956
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 957 LQHSKKGKTAQGLSPVDQ-----RKCERLL 981
>gi|307109592|gb|EFN57830.1| hypothetical protein CHLNCDRAFT_143249 [Chlorella variabilis]
Length = 295
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 578 CTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
C C +GG L+ CDGC A+H++CA L ++P+ DW+C C
Sbjct: 195 CQQCGEGGELVCCDGCTAAYHEQCAGLEAVPETDWFCPMC 234
>gi|224059262|ref|XP_002299795.1| predicted protein [Populus trichocarpa]
gi|222847053|gb|EEE84600.1| predicted protein [Populus trichocarpa]
Length = 89
Score = 58.9 bits (141), Expect = 1e-05, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 617
N D C +C GNL+ CDGCP A+H +C +++ +P+GDWYC C
Sbjct: 5 NSDECCLCKMDGNLICCDGCPAAYHAKCVGVANNYLPEGDWYCPEC 50
>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
Length = 912
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ K +++D
Sbjct: 671 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDL--SKPEVEYDC 728
Query: 632 NAVEAGR----VSGVDSVEQITKRCIRIV 656
+ + + G+ V+Q ++C R++
Sbjct: 729 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 755
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 6/84 (7%)
Query: 540 GYKNGLGIICHCC-----NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCP 594
Y G G + C N+ + + F A G+ + N C +C+ GG+LL C+ CP
Sbjct: 792 AYHGGDGCLAAGCSVIASNNIICTNHFTARKGKSHHAHVNVSWCFVCSKGGSLLCCESCP 851
Query: 595 RAFHKECASLSSIPQGDWYCKYCQ 618
AFH +C ++ +P G WYC C+
Sbjct: 852 AAFHPDCLNI-EMPDGSWYCNDCR 874
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1237 KESEDDCFRCGDGGQLVLCDRKCCTKAYHLSCLDLVKRPFGKWECPW 1283
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 577 LCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+C +C GNLL C+G C AFH C LS P+G + C C
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSCLGLSRRPEGKFICSEC 711
>gi|327264985|ref|XP_003217289.1| PREDICTED: tripartite motif-containing protein 66-like [Anolis
carolinensis]
Length = 1265
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 13/97 (13%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C N+ E
Sbjct: 908 ENEDFCAVCLNGGELLCCDYCPKVFHLLCHVPALLSFPVGEWVCTLCSNVEKPEVEYDCE 967
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
RF + + A + G++ +Q K+C ++V +L
Sbjct: 968 NARF-SYSNTLMPAREMYGLNDYDQ--KKCEKLVLSL 1001
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 434
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 435 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 472
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 473 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPMLKGRVQKIL 511
>gi|348522650|ref|XP_003448837.1| PREDICTED: PHD finger protein 21B-like [Oreochromis niloticus]
Length = 626
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 34/66 (51%), Gaps = 2/66 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++DD C +C + G L C CPRAFH C L + P+G WYC CQ K +
Sbjct: 434 EHDDHCAVCKEDGELQQCHNCPRAFHPTCLHPPLKTPPRGPWYCPKCQKKVLNKENMSWP 493
Query: 631 ANAVEA 636
N V++
Sbjct: 494 QNFVQS 499
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 30/163 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 503
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G + + + C R+ K+ G LLC CD C
Sbjct: 504 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 540
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 541 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 579
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 61/134 (45%), Gaps = 20/134 (14%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C C DGG+LL C+ C AFH C +SS+P+GDW+C C+ K +H
Sbjct: 88 NNDFCDACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSCEQNKNPKP--KHSK 145
Query: 632 NAVEAGRVSGVDSVEQITKRCIRIVK----NLEAELSGCLLCRGCDFSKSGFGPRTILLC 687
+ + S DS++ + C + + N + S C +C G D + +L C
Sbjct: 146 SILS----SLFDSLDTLNPSCFTLPEEYLLNNSFKQSFCNVCDGDDSMED------MLHC 195
Query: 688 DQ--CEREFHVGCL 699
C H CL
Sbjct: 196 SHSKCRISVHTYCL 209
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 7/91 (7%)
Query: 528 VGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNL 587
V Y+A Y G + NS + P+ F G + N C +C++GG+L
Sbjct: 1349 VAYHA------NDYCMAAGSVPLANNSFLCPNHFTPRKGCKNHEHINVSWCFVCSEGGSL 1402
Query: 588 LPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
L C+ CP AFH+EC ++ +PQG W+C C+
Sbjct: 1403 LCCESCPAAFHRECLNI-EMPQGSWFCNDCR 1432
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG ++ C GCP+ +H +C +LS P G W C + Q
Sbjct: 1797 KEREDECFYCGDGGQIVSCKKPGCPKVYHADCLNLSKRPAGRWECPWHQ 1845
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 395 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 454
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 455 EDYEEDGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 492
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 493 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 531
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
G N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 1 GSKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 49
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G + + + C R+ K+ G LLC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLC 453
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 410 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 456
>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
partial [Ornithorhynchus anatinus]
Length = 951
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 6/84 (7%)
Query: 540 GYKNGLGIICHCC-----NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCP 594
Y G G + C N+ + + F A G+ + N C +C+ GG+LL C+ CP
Sbjct: 599 AYHGGDGCMAAGCAVIASNNIICTNHFTARKGKSHHAHVNVSWCFVCSKGGSLLCCESCP 658
Query: 595 RAFHKECASLSSIPQGDWYCKYCQ 618
AFH +C ++ +P G WYC C+
Sbjct: 659 AAFHPDCLNI-EMPDGSWYCNDCR 681
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 336 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 395
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G + + + C R+ K+ G LLC CD C
Sbjct: 396 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 432
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 433 ISSYHIHCLN----PPLPDIPNGEWLC 455
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 412 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 458
>gi|357116142|ref|XP_003559843.1| PREDICTED: uncharacterized protein LOC100822072 [Brachypodium
distachyon]
Length = 1679
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDA 631
N D C +C GNLL CDGCP AFH +C + +P+G+WYC C + +R ++ A
Sbjct: 421 NSDECYLCGMDGNLLCCDGCPAAFHSKCVGVVEDLLPEGEWYCPEC--LMQRNNGSRNMA 478
Query: 632 NAVEAGRVSGVD 643
V G+D
Sbjct: 479 KLGRGAEVLGID 490
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 63/149 (42%), Gaps = 25/149 (16%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +
Sbjct: 48 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGSWSCPHCE-----KEGISMG 102
Query: 631 ANAVEAGRVSGVD-SVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCD 688
+ G+ +G + ++ K+ + C C G D ++ C+
Sbjct: 103 SQV--EGKATGTKMAPDKSAKQVAAASPEKDEHQEFCTECHDGGD----------LICCE 150
Query: 689 QCEREFHVGCLKKHKMADLRELPKGKWFC 717
C +H+ CL + L +P+G W C
Sbjct: 151 NCPVSYHLDCL----IPPLTNIPEGVWLC 175
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 55/110 (50%), Gaps = 4/110 (3%)
Query: 511 RLHKLVFDESGLPDGTEVGYYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYP 570
R + LV + L + E G ++C EG G + ++++P + P
Sbjct: 70 RAYHLVCLDPELEEAPE-GSWSCPHCEKEGISMGSQVEGKATGTKMAPDKSAKQVAAASP 128
Query: 571 GKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
KD + + CT C DGG+L+ C+ CP ++H +C L++IP+G W C C
Sbjct: 129 EKDEHQEFCTECHDGGDLICCENCPVSYHLDCLIPPLTNIPEGVWLCPRC 178
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%)
Query: 546 GIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLS 605
G I N+ V F+ +++ N C +C+ GG+L+ C+ CP AFH +C
Sbjct: 1057 GSIVLASNAIVCSKHFQPVKTQKHHSHVNVSWCFMCSKGGSLMCCESCPAAFHPDCIGYD 1116
Query: 606 SIPQGDWYCKYCQN 619
IP G WYC+ C N
Sbjct: 1117 EIPDGSWYCRDCTN 1130
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 573 DNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
+++D C C +GG L+ CD CP+A+H +C LS P G W C +
Sbjct: 1498 EHEDDCFRCGEGGELVMCDKKTCPKAYHVKCLGLSKRPYGKWECPW 1543
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 62/147 (42%), Gaps = 29/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 333 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 392
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G + + + C R+ K+ G LLC CD C
Sbjct: 393 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 429
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 430 ISSYHIHCLN----PPLPDIPNGEWLC 452
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 409 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 455
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 68/163 (41%), Gaps = 30/163 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G + + + C R+ K+ G LLC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 469
>gi|363728093|ref|XP_416340.3| PREDICTED: transcription intermediary factor 1-alpha [Gallus
gallus]
Length = 939
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+H G D N+D C +C +GG LL C+ CP+ FH C SL S P G+W C +C++
Sbjct: 702 SHTGEGRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRD 761
Query: 620 M 620
+
Sbjct: 762 L 762
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 48/87 (55%), Gaps = 4/87 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P F G + N C +C++GG+LL C+ CP AFH+EC ++ +PQG W
Sbjct: 185 NSFLCPRHFAPRKGCKNHEHINVSWCFVCSEGGSLLCCESCPAAFHQECLNM-EMPQGSW 243
Query: 613 YCKYCQNMFERKRFLQHDANAVEAGRV 639
+C C+ +R RF D V+ GR
Sbjct: 244 FCNDCK-AGKRPRF--KDILWVKWGRF 267
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 6/47 (12%)
Query: 577 LCT-----ICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
LCT +C G+LL CDG C AFH +C LS+ P+G ++C+ C
Sbjct: 41 LCTRTILQVCERTGDLLVCDGHCYGAFHPQCIGLSAAPEGRFFCREC 87
>gi|296084643|emb|CBI25766.3| unnamed protein product [Vitis vinifera]
Length = 126
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 781 LSQAVAIFHDCFDPIVDSISGRDLIPSMVY--GRNLRGQEFGGMYCAILTVNSSVVSAGI 838
++ AVA+ +CF+P++D + +++ S++Y G N F G Y AIL +S
Sbjct: 22 IAVAVAVMEECFEPVIDRHTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVAS 81
Query: 839 LRVFGQEVAELPLVATSKINHGKGYFQLLFACIEKLLSF 877
+R+ G ++AE+P +AT +G L IE + S
Sbjct: 82 MRIHGNKLAEMPFIATRPSYRRQGMCHKLLVAIESVSSL 120
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 4 AQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 52
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 67/165 (40%), Gaps = 43/165 (26%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C ++
Sbjct: 151 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHC---------VRSS 201
Query: 631 ANAV--EAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCD 688
N E G V D +++E CR C K G +L CD
Sbjct: 202 VNIAFEENGPVEQDDD------------EHME-------FCRVC---KDG---GELLCCD 236
Query: 689 QCEREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
C +HV CL ++ +P G W C C + + +Q +L
Sbjct: 237 TCTSAYHVFCL----APNVANVPDGIWHCPRCSCQPLKARVQKIL 277
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 3/90 (3%)
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLP 589
+ C + LE G HC S V+ + FE + + ++ + C +C DGG LL
Sbjct: 176 HLVCLEPELEQAPEGKWSCPHCVRSSVNIA-FEENGPVEQDDDEHMEFCRVCKDGGELLC 234
Query: 590 CDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
CD C A+H C +++++P G W+C C
Sbjct: 235 CDTCTSAYHVFCLAPNVANVPDGIWHCPRC 264
>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
harrisii]
Length = 962
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ + +
Sbjct: 721 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPEVEYDCDN 780
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S +D ++C R++
Sbjct: 781 LQHSKKGKTAQGLSPMDQ-----RKCERLL 805
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 84/197 (42%), Gaps = 29/197 (14%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 792 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDTLTSLENHVKTEPADM 844
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + S A G KD+D D C +C +GG+
Sbjct: 845 NESCKQSGLNSLVNGKSPV-------RSLMHRSARIGGDGSSKDDDPNEDWCAVCQNGGD 897
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVEAGRV 639
LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH A +
Sbjct: 898 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKTAQGL 957
Query: 640 SGVDSVEQITKRCIRIV 656
S VD ++C R++
Sbjct: 958 SPVDQ-----RKCERLL 969
>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
Length = 3661
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 73/168 (43%), Gaps = 27/168 (16%)
Query: 574 NDDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGD-WYCKYCQNMFER-----KRF 626
N D+C C DGG + CDG C R+FH C + P+ D W C C N ++ K+
Sbjct: 2736 NMDVCARCEDGGVTIMCDGPCQRSFHPACLGMDDNPEEDPWMCNRCMNKVQKCLECGKKG 2795
Query: 627 LQHDANAVEAGRVSGVDSVEQIT-------KRCIRIVKNLEAELS--GCL-----LCRGC 672
+ D++ A ++ G S Q++ K C+ + S G C C
Sbjct: 2796 SEMDSHN-RAVKIPGGVSRCQLSSCGRYYHKECLDKITPNRTSYSKEGNFKCPQHFCIDC 2854
Query: 673 DFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMD 720
+ + GPRT++ C +C + CLK R + KGKW C D
Sbjct: 2855 GKTSTNLGPRTLVKCLRCAKARCPDCLKT-----ARYVKKGKWMVCSD 2897
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 64/132 (48%), Gaps = 17/132 (12%)
Query: 684 ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCSRINSVLQNLLVQEAEKLPEFH 743
+L+CD C+ E+H+ CL+ L +PKG+W C + C+ + Q L + E
Sbjct: 551 MLVCDTCDAEYHLKCLR------LSSVPKGQWLCPI-CTVMLRKGQTLFSHQT----EVE 599
Query: 744 LNAIKKYAGNSLETVSDID--VRWRLLSGKAATPETRLLLSQAVAI--FHDCFD--PIVD 797
+ + ++E V ++ ++W LS + T ETR L+ AI FH D P+
Sbjct: 600 KAKLSQMPQPTVEVVDELKYLIKWSGLSYQFCTWETREELNNDGAIDRFHKLNDHPPLSP 659
Query: 798 SISGRDLIPSMV 809
+S +L+ ++
Sbjct: 660 PMSEEELMRTLA 671
Score = 44.3 bits (103), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 6/74 (8%)
Query: 587 LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGRVSGVDSVE 646
+L CD C +H +C LSS+P+G W C C M + + L VE ++S
Sbjct: 551 MLVCDTCDAEYHLKCLRLSSVPKGQWLCPICTVMLRKGQTLFSHQTEVEKAKLS------ 604
Query: 647 QITKRCIRIVKNLE 660
Q+ + + +V L+
Sbjct: 605 QMPQPTVEVVDELK 618
>gi|334331581|ref|XP_001379177.2| PREDICTED: tripartite motif-containing protein 66-like [Monodelphis
domestica]
Length = 1162
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 51/93 (54%), Gaps = 13/93 (13%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ K +++D
Sbjct: 911 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSL--TKPEVEYD 968
Query: 631 ANAVEAGR-------VSGVDSVEQITKRCIRIV 656
+ G +SG+ +Q K+C ++V
Sbjct: 969 CDNTRYGHTNPGGRLLSGLSDYDQ--KKCEKLV 999
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 84/197 (42%), Gaps = 29/197 (14%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 792 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDTLTSLENHVKTEPADM 844
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + S A G KD+D D C +C +GG+
Sbjct: 845 NESCKQSGLNSLVNGKSPV-------RSLMHRSARIGGDGSSKDDDPNEDWCAVCQNGGD 897
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RFLQHDANAVEAGRV 639
LL C+ CP+ FH C +L S P GDW C +C+++ + + LQH A +
Sbjct: 898 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDNLQHSKKGKTAQGL 957
Query: 640 SGVDSVEQITKRCIRIV 656
S VD ++C R++
Sbjct: 958 SPVDQ-----RKCERLL 969
>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Otolemur garnettii]
Length = 1371
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS V + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIVCTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1242 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1288
>gi|358387067|gb|EHK24662.1| hypothetical protein TRIVIDRAFT_112582, partial [Trichoderma virens
Gv29-8]
Length = 1105
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 66/165 (40%), Gaps = 41/165 (24%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKECASL---SSIPQGDWYCKYC---------- 617
G DND+ C+ C + G+++ CDGCPR+FH EC + +P +WYC C
Sbjct: 669 GSDNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWYCNECLVRRYPSRVP 727
Query: 618 -----------------QNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKN-- 658
F + LQ+ V+AG + D E +K+ R +
Sbjct: 728 VYKGAFASALNALEKSIPRAFSLPKKLQNRFEGVKAG--ADGDYEEVTSKQKKRAGYDEL 785
Query: 659 ----LEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCL 699
+ E +LC C K R I+ C C +H+ CL
Sbjct: 786 PDFFKQREDGEAVLCHAC--QKPATEIRAIIPCSACPFHWHIDCL 828
>gi|380791865|gb|AFE67808.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 949
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
[Monodelphis domestica]
Length = 1143
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ + +
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPEVEYDCDN 961
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S +D ++C R++
Sbjct: 962 LQHSKKGKTAQGLSPMDQ-----RKCERLL 986
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + D
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEED 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 437 DDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|13992535|emb|CAC38114.1| KIAA0298 protein [Mus musculus]
gi|148684989|gb|EDL16936.1| tripartite motif-containing 66, isoform CRA_b [Mus musculus]
Length = 1209
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 961 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1018
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1019 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1050
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 423 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 460
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 461 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 499
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 471 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 530
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 531 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 568
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 569 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 607
>gi|402894298|ref|XP_003910303.1| PREDICTED: tripartite motif-containing protein 66, partial [Papio
anubis]
Length = 1041
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++ +
Sbjct: 961 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQ 1012
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 528 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 587
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 588 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 625
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 626 ISSYHIHCLN----PPLPDIPNGEWLC 648
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 605 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 651
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + D
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEED 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 437 DDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|449670407|ref|XP_004207258.1| PREDICTED: methyl-CpG-binding domain-containing protein 9-like
[Hydra magnipapillata]
Length = 491
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 66/143 (46%), Gaps = 10/143 (6%)
Query: 576 DLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 635
D+ + G NLL D +K A+ + +G C YCQ+ ER RF QH+ +
Sbjct: 156 DMYFVVVFGDNLLSLDH--NDIYKTGATYDN-DKGILLCDYCQSTVERNRFGQHEELLI- 211
Query: 636 AGRVSGVDSVEQITKRCIRIVKNLEAELSG-CLLCRGCDFSKSGFGPRTILLCDQCEREF 694
+ G + +V+ + ++ S C+ C+ C + P T+L CD C++ +
Sbjct: 212 -CKDCGNKAHPSCLSYSAELVEQIRSDGSWQCIDCKACIICEGTGDPDTLLFCDACDKGY 270
Query: 695 HVGCLKKHKMADLRELPKGKWFC 717
H+ C + L ++P GKW C
Sbjct: 271 HMNCHE----PKLTQMPSGKWAC 289
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
C IC G+ LL CD C + +H C L+ +P G W C CQ
Sbjct: 248 CIICEGTGDPDTLLFCDACDKGYHMNCHEPKLTQMPSGKWACANCQ 293
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 433
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 434 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDTC 471
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 472 ISSYHIHCLN----PPLPDIPNGEWLC 494
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 451 DHMEYCRVCKDGGELLCCDTCISSYHIHCLNPPLPDIPNGEWLCPRC 497
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 392 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 446
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 447 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 489
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 490 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 528
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 59/147 (40%), Gaps = 36/147 (24%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+++LC C G L+ C CP +H EC LS + Q W+C C
Sbjct: 1422 DHNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLC------------- 1468
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
V +G DS E++ + E+ +C C ++ CD C
Sbjct: 1469 ---VMDRTTNGADSEEEMGSN--------DGEIEHEDVCSRCRHGGE------LICCDTC 1511
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+ FH+ C K LR++PKG W C
Sbjct: 1512 PKAFHMECCK----PVLRKVPKGHWEC 1534
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQ 618
+++D+C+ C GG L+ CD CP+AFH EC L +P+G W C+ C+
Sbjct: 1491 EHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENCK 1538
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%), Gaps = 8/54 (14%)
Query: 669 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFCCMDCS 722
CR C + G P +LLCD C R H+ CLK L+++PKG+WF C DC+
Sbjct: 1274 CRMC---RRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDCA 1319
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
C +C GGN +L CD C R H C L +P+G+W+CK C
Sbjct: 1274 CRMCRRGGNPEAMLLCDSCNRGHHMFCLKPPLKKVPKGEWFCKDC 1318
>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Monodelphis domestica]
Length = 1027
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++
Sbjct: 791 HAGESRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDL 850
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
+D C C GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 865 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQ 914
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 442
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 443 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 480
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 481 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 519
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
[Monodelphis domestica]
Length = 1126
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ + +
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPEVEYDCDN 961
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S +D ++C R++
Sbjct: 962 LQHSKKGKTAQGLSPMDQ-----RKCERLL 986
>gi|17569817|ref|NP_510140.1| Protein CHD-3 [Caenorhabditis elegans]
gi|6165993|sp|Q22516.2|CHD3_CAEEL RecName: Full=Chromodomain-helicase-DNA-binding protein 3 homolog;
Short=CHD-3
gi|3879819|emb|CAA91810.1| Protein CHD-3 [Caenorhabditis elegans]
gi|11095331|gb|AAG29837.1| CHD-3 [Caenorhabditis elegans]
Length = 1787
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 42/142 (29%)
Query: 578 CTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVE 635
C +C G L+ CD C RA+H C ++ P+GDW C +C+ +H + +
Sbjct: 268 CEVCNQDGELMLCDTCTRAYHVACIDENMEQPPEGDWSCPHCE---------EHGPDVL- 317
Query: 636 AGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFH 695
VE+ + +A + C +C+ ILLCD C +H
Sbjct: 318 --------IVEE---------EPAKANMDYCRICKETS---------NILLCDTCPSSYH 351
Query: 696 VGCLKKHKMADLRELPKGKWFC 717
C+ L E+P+G+W C
Sbjct: 352 AYCID----PPLTEIPEGEWSC 369
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P K N D C IC + N+L CD CP ++H C L+ IP+G+W C C
Sbjct: 323 PAKANMDYCRICKETSNILLCDTCPSSYHAYCIDPPLTEIPEGEWSCPRC 372
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 697 NSIICTGHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 755
Query: 613 YCKYCQ 618
+C C+
Sbjct: 756 FCNDCR 761
>gi|426245702|ref|XP_004016644.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66 [Ovis aries]
Length = 1286
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 17/97 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ E
Sbjct: 1038 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEMEYDCE 1097
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
RF Q A G+ +Q K+C ++V +L
Sbjct: 1098 NARFSQPGVRA-----PPGLSIYDQ--KKCEKLVLSL 1127
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|326912229|ref|XP_003202456.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Meleagris gallopavo]
Length = 1051
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+H G D N+D C +C +GG LL C+ CP+ FH C SL S P G+W C +C++
Sbjct: 814 SHTGEGRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRD 873
Query: 620 M 620
+
Sbjct: 874 L 874
>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
[Ornithorhynchus anatinus]
Length = 1050
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ FH C +L S P G+W C +C+++ + +
Sbjct: 809 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPEVEYDCDN 868
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S +D ++C R++
Sbjct: 869 LQHSKKGKTAQGLSPMDQ-----RKCERLL 893
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 422 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 481
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 482 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 519
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 520 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 558
>gi|47195997|emb|CAF91487.1| unnamed protein product [Tetraodon nigroviridis]
Length = 138
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+++D C +C + G L PC CPRAFH C L + P+G WYC CQ
Sbjct: 87 EHEDQCAVCEEDGELQPCRSCPRAFHPSCLHPPLKTPPRGPWYCPKCQK 135
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|45861756|gb|AAS78676.1| transcriptional intermediary factor 1 delta [Mus musculus]
Length = 1344
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1154 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1185
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 428
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 429 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDTC 471
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 472 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 510
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 429
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 430 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 472
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 473 ISSYHIHCLN----PPLPDIPNGEWLC 495
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 452 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|284004881|ref|NP_001164383.1| tripartite motif-containing protein 66 isoform 1 [Mus musculus]
Length = 1344
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1154 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1185
>gi|296480218|tpg|DAA22333.1| TPA: hCG23889-like [Bos taurus]
Length = 1213
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 17/97 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ E
Sbjct: 965 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEVEYDCE 1024
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
RF Q A G+ +Q K+C ++V +L
Sbjct: 1025 NARFSQPGVRA-----PPGLSIYDQ--KKCEKLVLSL 1054
>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + P+ F G + N C +C +GG+LL C+ CP AFH+EC ++ +P+G W
Sbjct: 186 NSIICPNHFLPRRGVKNHEHVNVSWCFVCTEGGSLLCCESCPAAFHRECLNI-EMPKGSW 244
Query: 613 YCKYCQ 618
YC C+
Sbjct: 245 YCNDCK 250
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 573 DNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 614 EREDECFSCGDGGQMVSCKKPGCPKVYHADCLNLTKRPAGRWECPWHQ 661
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 435 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 494
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 495 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 532
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 533 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 571
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 436 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 495
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 496 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 533
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 534 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 572
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
+ N+D C +C DGG L+ CDGCPRAFH C S L IP G W C C
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSC 143
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 371 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 425
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 426 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 468
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 469 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 507
>gi|359479239|ref|XP_002274937.2| PREDICTED: uncharacterized protein LOC100260139 [Vitis vinifera]
Length = 1976
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDA 631
N D C +C GNL+ CDGCP A+H C ++S +P GDWYC C ++ +
Sbjct: 592 NSDECCLCKMDGNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC--AIDKDKPWMKQR 649
Query: 632 NAVEAGRVSGVD 643
++ + GVD
Sbjct: 650 KSLRGAELLGVD 661
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|187950891|gb|AAI38033.1| Tripartite motif-containing 66 [Mus musculus]
gi|219519295|gb|AAI45107.1| Tripartite motif-containing 66 [Mus musculus]
Length = 1242
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1052 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1083
>gi|45861758|gb|AAS78677.1| transcriptional intermediary factor 1 delta short isoform [Mus
musculus]
Length = 1242
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1052 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1083
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C S+ +P G W
Sbjct: 809 NSIICTNHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLSI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 KPSEDECFRCGDGGQLVLCDRKSCTKAYHLPCLGLGKRPFGKWECPW 1282
>gi|74184798|dbj|BAE27995.1| unnamed protein product [Mus musculus]
Length = 1233
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1052 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1083
>gi|359072874|ref|XP_002693153.2| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1215
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 17/97 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ E
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEVEYDCE 1026
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
RF Q A G+ +Q K+C ++V +L
Sbjct: 1027 NARFSQPGVRA-----PPGLSIYDQ--KKCEKLVLSL 1056
>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
[Monodelphis domestica]
Length = 1060
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++
Sbjct: 824 HAGESRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDL 883
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
+D C C GG ++ CD CP+ FH +C +L +PQG W C C FER+
Sbjct: 860 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQ 909
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 89/196 (45%), Gaps = 27/196 (13%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 576 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDTLTGLENHVKTEPTDM 628
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + + + A G KD+D D C +C +GG+
Sbjct: 629 NESCKQSGLSSLVNGKSPVRNLMHR-------SARIGGDGNSKDDDPNEDWCAVCQNGGD 681
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR----VS 640
LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 682 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKGKTAQ 739
Query: 641 GVDSVEQITKRCIRIV 656
G+ V+Q ++C R++
Sbjct: 740 GLSPVDQ--RKCERLL 753
>gi|284004883|ref|NP_001164384.1| tripartite motif-containing protein 66 isoform 3 [Mus musculus]
Length = 1233
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1052 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1083
>gi|284004879|ref|NP_862901.3| tripartite motif-containing protein 66 isoform 2 [Mus musculus]
gi|342187123|sp|Q924W6.3|TRI66_MOUSE RecName: Full=Tripartite motif-containing protein 66; AltName:
Full=Transcriptional intermediary factor 1 delta
Length = 1242
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 631 ANAVEAGR-----VSGVDSVEQITKRCIRIVKNL 659
G + G+ +Q K+C ++V +L
Sbjct: 1052 CENARYGHPGVRVLPGLSMYDQ--KKCEKLVLSL 1083
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLC 497
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 454 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 500
>gi|440906647|gb|ELR56880.1| Tripartite motif-containing protein 66, partial [Bos grunniens mutus]
Length = 1230
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 49/97 (50%), Gaps = 17/97 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ E
Sbjct: 982 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEVEYDCE 1041
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
RF Q A G+ +Q K+C ++V +L
Sbjct: 1042 NARFSQPGVRA-----PPGLSIYDQ--KKCEKLVLSL 1071
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 411
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 412 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 449
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 450 ISSYHIHCLN----PPLPDIPNGEWLC 472
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 429 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 475
>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Monodelphis domestica]
Length = 1061
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
HAG D N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++
Sbjct: 825 HAGESRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDL 884
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|296083821|emb|CBI24209.3| unnamed protein product [Vitis vinifera]
Length = 1805
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 617
N D C +C GNL+ CDGCP A+H C ++S +P GDWYC C
Sbjct: 578 NSDECCLCKMDGNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC 623
>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
Length = 1016
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L+ P G+W C +C+++
Sbjct: 780 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDL 839
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A + + G+ + I KR C R++
Sbjct: 840 --SKPEVEYDCDAPSHNSDKRKSEGLTKLTPIDKRKCERLL 878
>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
Length = 1050
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L+ P G+W C +C+++
Sbjct: 814 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDL 873
Query: 621 FERKRFLQHDANAV----EAGRVSGVDSVEQITKR-CIRIV 656
K +++D +A + + G+ + I KR C R++
Sbjct: 874 --SKPEVEYDCDAPSHNSDKRKSEGLTKLTPIDKRKCERLL 912
>gi|242009521|ref|XP_002425532.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
gi|212509407|gb|EEB12794.1| bromodomain-containing protein, putative [Pediculus humanus corporis]
Length = 1963
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 54/146 (36%), Gaps = 57/146 (39%)
Query: 578 CTICADGGN---LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
C C G N LL CDGC R +H C + SIP GDWYC C+N ++
Sbjct: 1684 CQFCQSGDNEDKLLLCDGCDRGYHMYCFKPKMESIPDGDWYCHECKNKSNGEK------- 1736
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGP-RTILLCDQCE 691
C++C G P + ++C+ C
Sbjct: 1737 --------------------------------NCIVC--------GKRPIKNYVICEHCP 1756
Query: 692 REFHVGCLKKHKMADLRELPKGKWFC 717
R +H+ CL L ++P+ KW C
Sbjct: 1757 RIYHIECLN----PPLSKVPRAKWNC 1778
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%)
Query: 667 LLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
++ C F +SG +LLCD C+R +H+ C K + +P G W+C
Sbjct: 1679 IMKANCQFCQSGDNEDKLLLCDGCDRGYHMYCFK----PKMESIPDGDWYC 1725
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 22/107 (20%)
Query: 524 DGTEVGYYA-CGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA 582
DG + GY+ C + +E +G CH C ++ + G+ N C +C
Sbjct: 1700 DGCDRGYHMYCFKPKMESIPDG-DWYCHECKNKSN-------------GEKN---CIVCG 1742
Query: 583 DG--GNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYCQNMFERKR 625
N + C+ CPR +H EC + LS +P+ W C C + +K+
Sbjct: 1743 KRPIKNYVICEHCPRIYHIECLNPPLSKVPRAKWNCVTCAIKYPKKK 1789
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 60/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 209 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 268
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 269 EDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 306
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 307 ISSYHIHCLN----PPLPDIPNGEWLC 329
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 286 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 388 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 447
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 448 EDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 485
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 524
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 89/196 (45%), Gaps = 27/196 (13%)
Query: 470 RQRKITKKSKKTVLISKPFENASPPLSFPNKSRWNITPKDQRLHKLVFDESGLPDGTEVG 529
+Q K + ++S P + +PPLS + ++ + L L E+ +
Sbjct: 674 KQEKTEDGRRSACMLSSPESSLTPPLS----TNLHL---ESELDTLTGLENHVKTEPTDM 726
Query: 530 YYACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDND---DLCTICADGGN 586
+C Q L NG + + + A G KD+D D C +C +GG+
Sbjct: 727 NESCKQSGLSSLVNGKSPVRNLMHR-------SARIGGDGNSKDDDPNEDWCAVCQNGGD 779
Query: 587 LLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR----VS 640
LL C+ CP+ FH C +L S P GDW C +C+++ K +++D + ++ +
Sbjct: 780 LLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDI--GKPEVEYDCDNMQHSKKGKTAQ 837
Query: 641 GVDSVEQITKRCIRIV 656
G+ V+Q ++C R++
Sbjct: 838 GLSPVDQ--RKCERLL 851
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 252 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 306
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 307 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 349
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 350 ISSYHIHCLN----PPLPDIPNGEWLC 372
Score = 48.5 bits (114), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 329 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 375
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 432 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|147864569|emb|CAN78969.1| hypothetical protein VITISV_022739 [Vitis vinifera]
Length = 1318
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 617
N D C +C GNL+ CDGCP A+H C ++S +P GDWYC C
Sbjct: 576 NSDECCLCKMDGNLICCDGCPAAYHSRCVGVASDLLPDGDWYCPEC 621
>gi|403254198|ref|XP_003919864.1| PREDICTED: tripartite motif-containing protein 66 [Saimiri
boliviensis boliviensis]
Length = 1207
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 56/94 (59%), Gaps = 11/94 (11%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++ + + +++D
Sbjct: 959 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1016
Query: 631 ---ANAVEAG-RVS-GVDSVEQITKRCIRIVKNL 659
A + G R S G+ + +Q K+C ++V +L
Sbjct: 1017 CENARYNQPGMRASPGLSTYDQ--KKCEKLVLSL 1048
>gi|355697937|gb|EHH28485.1| hypothetical protein EGK_18929, partial [Macaca mulatta]
Length = 1236
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 988 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1037
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 427
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 428 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 470
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 471 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 509
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|350588054|ref|XP_003129432.3| PREDICTED: tripartite motif-containing protein 66 [Sus scrofa]
Length = 1250
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 17/97 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMF--------E 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ E
Sbjct: 998 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEMEYDCE 1057
Query: 623 RKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNL 659
R+ Q A A G+ +Q K+C ++V +L
Sbjct: 1058 NARYDQPGARA-----PPGLSLCDQ--KKCEKLVLSL 1087
>gi|297268466|ref|XP_001097566.2| PREDICTED: tripartite motif-containing protein 66 [Macaca mulatta]
Length = 1326
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 961 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1010
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 66/163 (40%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 403
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
+ E G G E R+ K+ G LLC CD C
Sbjct: 404 EDYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 441
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 442 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 480
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|432864384|ref|XP_004070295.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like, partial
[Oryzias latipes]
Length = 1042
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 564 HAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H GR+ N+D C +C +GG+LL CD CP+ FH +C ++ P+GD+ C +C+++
Sbjct: 795 HGGRE--DDPNEDWCAVCINGGDLLCCDNCPKVFHMKCHVPTIKIFPKGDFLCTFCRSL 851
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 283 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 342
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 343 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 380
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 381 ISSYHIHCLN----PPLPDIPNGEWLC 403
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 360 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 406
>gi|355752346|gb|EHH56466.1| hypothetical protein EGM_05877, partial [Macaca fascicularis]
Length = 1236
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 988 ENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1037
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Gorilla gorilla gorilla]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Saimiri boliviensis boliviensis]
Length = 1368
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Callithrix jacchus]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein
trithorax-5; AltName: Full=Wolf-Hirschhorn syndrome
candidate 1 protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|326912468|ref|XP_003202572.1| PREDICTED: PHD finger protein 21B-like, partial [Meleagris
gallopavo]
Length = 445
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+D+ CT C G NL PC CPRA+H C L + P+G W C CQ ++ L+ D
Sbjct: 263 HDEYCTACKRGVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQ-----QKVLKKDD 317
Query: 632 NAVEAGRVSGVDSVEQITKRCIR 654
N G ++ V S +T + ++
Sbjct: 318 NVPWTGTLAIVHSY--VTHKTVK 338
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
+D C C GG ++ CD CP+ FH +C L +P+G W C C + FER+
Sbjct: 844 EDRCNKCGQGGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNFERQ 893
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
6 [Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
7 [Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
8 [Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C S+ +P G W
Sbjct: 742 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLSI-EMPDGSW 800
Query: 613 YCKYCQ 618
+C C+
Sbjct: 801 FCNDCR 806
Score = 40.4 bits (93), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1169 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1215
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + D
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEED 431
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 432 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 469
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 470 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 508
>gi|268557732|ref|XP_002636856.1| C. briggsae CBR-LET-418 protein [Caenorhabditis briggsae]
Length = 1849
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 61/149 (40%), Gaps = 43/149 (28%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKRFLQHDAN 632
+N+D C C GG L+ CD CPRA+H C P+GDW C +C ++H
Sbjct: 253 ENNDFCETCKVGGELVLCDTCPRAYHTGCMD-EDPPEGDWSCPHC---------IEHGPE 302
Query: 633 AVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCER 692
++ E+ TK+ C +C+ + +LLCD C
Sbjct: 303 VIK----------EEPTKQNDDF----------CKICKETE---------NLLLCDSCVC 333
Query: 693 EFHVGCLKKHKMADLRELPKGKWFCCMDC 721
FH C+ L ++P+ + + C C
Sbjct: 334 AFHAYCID----PPLTQVPQEETWACPRC 358
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 567 RQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGD-WYCKYCQ 618
++ P K NDD C IC + NLL CD C AFH C L+ +PQ + W C C+
Sbjct: 305 KEEPTKQNDDFCKICKETENLLLCDSCVCAFHAYCIDPPLTQVPQEETWACPRCE 359
>gi|358415619|ref|XP_599031.6| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1322
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ +
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQ 1018
>gi|348558808|ref|XP_003465208.1| PREDICTED: PHD finger protein 21A isoform 1 [Cavia porcellus]
Length = 635
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|338727101|ref|XP_001500723.2| PREDICTED: tripartite motif-containing protein 66 [Equus caballus]
Length = 1211
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ +
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWACTLCRSLTQ 1014
>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
Length = 1017
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 781 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDL 840
Query: 621 FERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
K + +D ++ E + G+ + I KR C R++
Sbjct: 841 --SKPEVDYDCDVPSHHSEKRKSEGLTKLTPIDKRKCERLL 879
>gi|125812233|ref|XP_001337239.1| PREDICTED: autoimmune regulator-like [Danio rerio]
Length = 640
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 15/89 (16%)
Query: 531 YACGQKLLEGYKNGLGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPC 590
+ C +K+L+ + ++C C P ++NDD+C C G +L C
Sbjct: 437 FLCKKKILQIH----SLLCECVKCSTDPEDLME--------QNNDDVCYACHCGVDLRCC 484
Query: 591 DGCPRAFHKECASLSSIPQ--GDWYCKYC 617
DGCPRAFH +C L ++P+ G+W C +C
Sbjct: 485 DGCPRAFHSDC-HLPAVPEGSGEWICTFC 512
>gi|426245365|ref|XP_004016482.1| PREDICTED: PHD finger protein 21A isoform 1 [Ovis aries]
gi|426245367|ref|XP_004016483.1| PREDICTED: PHD finger protein 21A isoform 2 [Ovis aries]
Length = 636
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 418 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 477
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 478 GMWICPRCQDQMLKK 492
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|356544359|ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max]
Length = 1702
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYC 617
N D C +C G+L+ CDGCP AFH C ++S +P+GDWYC C
Sbjct: 674 NSDECCLCKMDGSLICCDGCPAAFHSRCVGIASDHLPEGDWYCPEC 719
>gi|384494855|gb|EIE85346.1| hypothetical protein RO3G_10056 [Rhizopus delemar RA 99-880]
Length = 1060
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/54 (50%), Positives = 32/54 (59%), Gaps = 5/54 (9%)
Query: 569 YPGKDNDDLCTIC---ADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
Y DN+D C IC D NLL CDGC R +H C + LSS+P+ DWYC C
Sbjct: 216 YINDDNNDTCEICHKTQDEENLLLCDGCNRGYHLYCLTPPLSSVPKTDWYCLQC 269
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus
norvegicus]
Length = 1346
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C S+ +P G W
Sbjct: 790 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLSI-EMPDGSW 848
Query: 613 YCKYCQ 618
+C C+
Sbjct: 849 FCNDCR 854
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1217 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1263
>gi|432103232|gb|ELK30472.1| Tripartite motif-containing protein 66 [Myotis davidii]
Length = 1324
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P GDW C C+++ +
Sbjct: 1101 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQ 1152
>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase Trim24; AltName: Full=Tripartite motif-containing
protein 24
gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
Length = 1051
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 815 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDL 874
Query: 621 FERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
K + +D ++ E + G+ + I KR C R++
Sbjct: 875 --SKPEVDYDCDVPSHHSEKRKSEGLTKLTPIDKRKCERLL 913
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 59/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 426
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 427 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 464
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 465 ISSYHIHCLN----PPLPDIPNGEWLC 487
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 444 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|350580137|ref|XP_003353943.2| PREDICTED: PHD finger protein 21A, partial [Sus scrofa]
Length = 517
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 299 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 358
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 359 GMWICPRCQDQMLKK 373
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 325 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 368
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
++ C +C DGG+LL CD CP +H+ C + L SIP+GDW C C
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRC 52
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 67/163 (41%), Gaps = 30/163 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 448
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G + + + C R+ K+ G LLC CD C
Sbjct: 449 EEYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 485
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 524
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 60/147 (40%), Gaps = 28/147 (19%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ K +Q +
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCE-----KEGVQWE 398
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
A E + + + + C +C+ +L CD C
Sbjct: 399 AKEEE--------EEYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 441
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL L ++P G+W C
Sbjct: 442 ISSYHIHCLN----PPLPDIPNGEWLC 464
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD C ++H C + L IP G+W C C
Sbjct: 421 DHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 467
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 304 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 362
Query: 613 YCKYCQ 618
+C C+
Sbjct: 363 FCNDCR 368
>gi|397494634|ref|XP_003818179.1| PREDICTED: tripartite motif-containing protein 66 [Pan paniscus]
Length = 1216
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 968 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1017
>gi|296479678|tpg|DAA21793.1| TPA: PHD finger protein 21A isoform 1 [Bos taurus]
Length = 635
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 417 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 476
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 477 GMWICPRCQDQMLKK 491
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 443 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 486
>gi|410973631|ref|XP_003993251.1| PREDICTED: PHD finger protein 21A isoform 2 [Felis catus]
gi|410973633|ref|XP_003993252.1| PREDICTED: PHD finger protein 21A isoform 3 [Felis catus]
Length = 633
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 418 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 477
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 478 GMWICPRCQDQMLKK 492
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
+S + S F A G+++ N C +C+ GG+LL C+ CP AFH +C S+ +P G W
Sbjct: 744 HSIICTSHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLSI-EMPDGSW 802
Query: 613 YCKYCQ 618
+C C+
Sbjct: 803 FCNECR 808
Score = 42.7 bits (99), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1171 KESEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPW 1217
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 457 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 515
Query: 613 YCKYCQ 618
+C C+
Sbjct: 516 FCNDCR 521
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 884 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 930
>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
Length = 1615
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 43/73 (58%), Gaps = 5/73 (6%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 1381 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDL 1440
Query: 621 FERKRFLQHDANA 633
K +++D +A
Sbjct: 1441 --SKPEVEYDCDA 1451
>gi|111226798|ref|XP_642746.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|90970774|gb|EAL68857.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 914
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
NDD C+ C DGG+LL CD C +FH C L+ IP GDW+C C
Sbjct: 60 NDDYCSGCGDGGDLLCCDNCQASFHLICLNPPLNEIPSGDWFCDSC 105
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 40/66 (60%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
N+ + P+ F G + N C +C++GG+LL C+ CP AFH+EC ++ +PQG W
Sbjct: 1608 NTFLCPNHFTPRKGCKNHEHINVSWCFVCSEGGSLLCCEACPAAFHRECLNI-EMPQGSW 1666
Query: 613 YCKYCQ 618
+C C+
Sbjct: 1667 FCNDCK 1672
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 2038 KEREDECFSCGDGGQIVSCKKPGCPKVYHADCLNLAKRPAGRWECPWHQ 2086
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Query: 575 DDLCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+++C +C G+LL C+G C AFH +C LS P+G ++C+ C
Sbjct: 1467 ENVCQVCERPGDLLVCEGHCYGAFHLQCVGLSVAPKGKFFCQEC 1510
>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
Length = 607
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L+ P G+W C +C+++ K +++D
Sbjct: 382 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDL--SKPEVEYDC 439
Query: 632 NAV----EAGRVSGVDSVEQITKR-CIRIV 656
+A + + G+ + I KR C R++
Sbjct: 440 DAPSHNSDKRKSEGLTKLTPIDKRKCERLL 469
>gi|345787769|ref|XP_542497.3| PREDICTED: tripartite motif-containing protein 66 [Canis lupus
familiaris]
Length = 1207
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ FH C +L P GDW C C+++ +
Sbjct: 959 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSLTQ 1010
>gi|297688829|ref|XP_002821872.1| PREDICTED: PHD finger protein 21A isoform 2 [Pongo abelii]
gi|297688831|ref|XP_002821873.1| PREDICTED: PHD finger protein 21A isoform 3 [Pongo abelii]
Length = 634
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 55/147 (37%), Gaps = 47/147 (31%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 323 EHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHCEG----------- 371
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G+ E E CR C K G +L CD C
Sbjct: 372 ---------EGIKEQE---------------EDDHMEFCRVC---KDG---GELLCCDTC 401
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+HV CL ++ +P G+W C
Sbjct: 402 PSAYHVHCLN----PPMKMIPDGEWHC 424
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+ + C +C DGG LL CD CP A+H C + + IP G+W+C C
Sbjct: 381 DHMEFCRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRC 427
>gi|156546910|ref|NP_057705.3| PHD finger protein 21A isoform b [Homo sapiens]
gi|397473591|ref|XP_003808291.1| PREDICTED: PHD finger protein 21A [Pan paniscus]
gi|426368103|ref|XP_004051052.1| PREDICTED: PHD finger protein 21A isoform 1 [Gorilla gorilla
gorilla]
gi|426368105|ref|XP_004051053.1| PREDICTED: PHD finger protein 21A isoform 2 [Gorilla gorilla
gorilla]
gi|119588418|gb|EAW68012.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588419|gb|EAW68013.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588421|gb|EAW68015.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|119588423|gb|EAW68017.1| PHD finger protein 21A, isoform CRA_a [Homo sapiens]
gi|168270650|dbj|BAG10118.1| PHD finger protein 21A [synthetic construct]
Length = 634
Score = 57.4 bits (137), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|12697937|dbj|BAB21787.1| KIAA1696 protein [Homo sapiens]
Length = 635
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 420 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 479
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 480 GMWICPRCQDQMLKK 494
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 446 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 489
>gi|441646789|ref|XP_003254588.2| PREDICTED: tripartite motif-containing protein 66 [Nomascus
leucogenys]
Length = 1223
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 975 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1024
>gi|426367401|ref|XP_004050721.1| PREDICTED: tripartite motif-containing protein 66 [Gorilla gorilla
gorilla]
Length = 1085
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++ +
Sbjct: 926 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQ 977
>gi|47220602|emb|CAG05628.1| unnamed protein product [Tetraodon nigroviridis]
Length = 715
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D CT+C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 559 GDIHEDFCTVCRRSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPKCQDQILKK 614
>gi|13509324|emb|CAC35389.1| KIAA0298 protein [Homo sapiens]
Length = 1214
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 966 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1015
>gi|380813476|gb|AFE78612.1| PHD finger protein 21A isoform b [Macaca mulatta]
Length = 633
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 418 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 477
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 478 GMWICPRCQDQMLKK 492
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|348579257|ref|XP_003475397.1| PREDICTED: transcription intermediary factor 1-alpha-like [Cavia
porcellus]
Length = 1165
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K +++D
Sbjct: 940 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDL--SKPEVEYDC 997
Query: 632 NA 633
+A
Sbjct: 998 DA 999
>gi|119589026|gb|EAW68620.1| hCG23889, isoform CRA_a [Homo sapiens]
Length = 1214
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 966 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1015
>gi|388454500|ref|NP_001252601.1| PHD finger protein 21A [Macaca mulatta]
gi|387539684|gb|AFJ70469.1| PHD finger protein 21A isoform b [Macaca mulatta]
Length = 634
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|73982505|ref|XP_861897.1| PREDICTED: PHD finger protein 21A isoform 5 [Canis lupus
familiaris]
Length = 633
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 418 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 477
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 478 GMWICPRCQDQMLKK 492
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 487
>gi|10435111|dbj|BAB14492.1| unnamed protein product [Homo sapiens]
gi|20135652|gb|AAM09095.1| BRAF35/HDAC2 complex 80 kDa protein [Homo sapiens]
Length = 634
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|403254657|ref|XP_003920077.1| PREDICTED: PHD finger protein 21A [Saimiri boliviensis boliviensis]
Length = 634
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 488
>gi|395815595|ref|XP_003781311.1| PREDICTED: PHD finger protein 21A [Otolemur garnettii]
Length = 634
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 419 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKAIPK 478
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 479 GMWICPRCQDQMLKK 493
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 445 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKAIPKGMWICPRCQD 488
>gi|395543452|ref|XP_003773631.1| PREDICTED: tripartite motif-containing protein 66 [Sarcophilus
harrisii]
Length = 1496
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 53/93 (56%), Gaps = 13/93 (13%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ K +++D
Sbjct: 1258 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSL--TKPEVEYD 1315
Query: 631 ------ANAVEAGRV-SGVDSVEQITKRCIRIV 656
++A GRV G+ +Q K+C ++V
Sbjct: 1316 CDNTRYSHAGLGGRVLPGLSEYDQ--KKCEKLV 1346
>gi|149725049|ref|XP_001489490.1| PREDICTED: PHD finger protein 21A isoform 1 [Equus caballus]
gi|338712004|ref|XP_003362634.1| PREDICTED: PHD finger protein 21A [Equus caballus]
Length = 629
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 414 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 473
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 474 GMWICPRCQDQMLKK 488
Score = 41.2 bits (95), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 440 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 483
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 575 DDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERK 624
+D C C GG ++ CD CP+ FH +C L IP+G W C C + FER+
Sbjct: 844 EDQCKECGKGGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNFERQ 893
>gi|209977097|ref|NP_055633.1| tripartite motif-containing protein 66 [Homo sapiens]
gi|269849740|sp|O15016.4|TRI66_HUMAN RecName: Full=Tripartite motif-containing protein 66
Length = 1216
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 968 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1017
>gi|198433831|ref|XP_002121767.1| PREDICTED: similar to zinc finger, MYND-type containing 8 [Ciona
intestinalis]
Length = 1878
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 30/49 (61%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFE 622
ND C +C G +L C+ CPR FH +C + S P+GDW+C C+ + E
Sbjct: 258 NDFYCWVCHKEGEVLCCELCPRVFHAKCLRMQSEPEGDWFCPECEKITE 306
>gi|410044930|ref|XP_003313014.2| PREDICTED: tripartite motif-containing protein 66 [Pan troglodytes]
Length = 1037
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 789 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 838
>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
Length = 981
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 54/102 (52%), Gaps = 10/102 (9%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C++
Sbjct: 744 VHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRD 803
Query: 620 MFERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
+ K + +D ++ E + G+ + I KR C R++
Sbjct: 804 L--SKPEVDYDCDVPSHHSEKRKSEGLTKLTPIDKRKCERLL 843
>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
Length = 764
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 9/90 (10%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD- 630
N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++ K + +D
Sbjct: 539 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDL--SKPEVDYDC 596
Query: 631 ---ANAVEAGRVSGVDSVEQITKR-CIRIV 656
++ E + G+ + I KR C R++
Sbjct: 597 DVPSHHSEKRKSEGLTKLTPIDKRKCERLL 626
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 69/170 (40%), Gaps = 28/170 (16%)
Query: 565 AGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC----ASLSSIPQGDWYCKYCQNM 620
+GR + N+D C C +GG+LL C+ CP AFH C IP G+W CK C+
Sbjct: 38 SGRSF----NNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIPAGEWICKKCKAE 93
Query: 621 F-ERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGF 679
+ + R + + V D+V+Q R+ K ++ + L +G
Sbjct: 94 YQDNTRPIAKQSELRSPFAVLVKDNVKQNPSE-FRLPKEMQYHIP--LPGSKKRKKSNGH 150
Query: 680 GPRT------------ILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
G T ++ CD C FH+ C+ L P G W C
Sbjct: 151 GSATKGGRRGNSFLKPLIHCDYCPLSFHLDCMD----PPLTTTPSGLWMC 196
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Equus caballus]
Length = 1365
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTGHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
K ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 KKSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
Length = 981
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 54/102 (52%), Gaps = 10/102 (9%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C++
Sbjct: 744 VHVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCQVPTLTNFPSGEWICTFCRD 803
Query: 620 MFERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
+ K + +D ++ E + G+ + I KR C R++
Sbjct: 804 L--SKPEVDYDCDVPSHHSEKRKSEGLTKLTPIDKRKCERLL 843
>gi|301773852|ref|XP_002922354.1| PREDICTED: PHD finger protein 21A-like, partial [Ailuropoda
melanoleuca]
Length = 582
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 367 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 426
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 427 GMWICPRCQDQMLKK 441
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 393 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 436
>gi|239609511|gb|EEQ86498.1| PHD finger domain-containing protein [Ajellomyces dermatitidis
ER-3]
gi|327350258|gb|EGE79115.1| PHD finger domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 1153
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Query: 564 HAGRQYPGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
HAG PG++ D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 395 HAG---PGEEQDSKCAICDDGDCENANAIVFCDGCDLAVHQECYGVPYIPEGQWLCRKCQ 451
>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
harrisii]
Length = 880
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 35/60 (58%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++
Sbjct: 644 HGGESRKDDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDL 703
>gi|261196576|ref|XP_002624691.1| PHD finger domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595936|gb|EEQ78517.1| PHD finger domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 1153
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Query: 564 HAGRQYPGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
HAG PG++ D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 395 HAG---PGEEQDSKCAICDDGDCENANAIVFCDGCDLAVHQECYGVPYIPEGQWLCRKCQ 451
>gi|348553320|ref|XP_003462475.1| PREDICTED: tripartite motif-containing protein 66-like [Cavia
porcellus]
Length = 1392
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ FH C +L S P G+W C C+++
Sbjct: 1144 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSL 1193
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 570 PGKDNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P ++DD C IC DGG LL CD CP A+H C + L IP GDW C C
Sbjct: 505 PADEDDDEHQEFCRICKDGGELLCCDNCPSAYHTFCLNPPLDDIPDGDWRCPRC 558
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 60/164 (36%), Gaps = 48/164 (29%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++ D C +C GG ++ CD CP+A+H C L P+G W C C
Sbjct: 454 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 500
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
EA + D E + RI K+ G LLC CD C
Sbjct: 501 ----EAEGPADEDDDEH--QEFCRICKD-----GGELLC-----------------CDNC 532
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
+H CL L ++P G W C C + +Q +L
Sbjct: 533 PSAYHTFCLN----PPLDDIPDGDWRCPRCSCPPLPDKVQKILT 572
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 808 NSIICTGHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSW 866
Query: 613 YCKYCQ 618
+C C+
Sbjct: 867 FCNDCR 872
Score = 43.1 bits (100), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1235 KESEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPW 1281
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 808 NSIICTGHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSW 866
Query: 613 YCKYCQ 618
+C C+
Sbjct: 867 FCNDCR 872
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1235 KESEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPW 1281
>gi|224096253|ref|XP_002191494.1| PREDICTED: transcription intermediary factor 1-alpha [Taeniopygia
guttata]
Length = 1061
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 36/61 (59%), Gaps = 3/61 (4%)
Query: 563 AHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+H G D N+D C +C +GG LL C+ CP+ FH C +L S P G+W C +C++
Sbjct: 824 SHTGEGRKEDDPNEDWCAVCQNGGKLLCCEKCPKVFHLSCHVPTLMSFPSGEWICTFCRD 883
Query: 620 M 620
+
Sbjct: 884 L 884
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 159 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 217
Query: 613 YCKYCQ 618
+C C+
Sbjct: 218 FCNDCR 223
>gi|432852260|ref|XP_004067159.1| PREDICTED: uncharacterized protein LOC101164387 [Oryzias latipes]
Length = 1310
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 3/88 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-RFLQH 629
+++D C +C GG LL CD CP+ FH C L S P GDW C C++ + + ++
Sbjct: 1104 ESEDFCAVCLIGGELLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDAIQPEVQYNCE 1163
Query: 630 DANAVEAGRVSGVDSVEQITKRCIRIVK 657
+ A A + G+ + +Q + +I+K
Sbjct: 1164 NERASGANPLHGLSACDQRARHYYQIIK 1191
>gi|5834582|emb|CAB55313.1| rfg7 protein [Homo sapiens]
Length = 1052
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 12/90 (13%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK-----RF 626
N+D C +C +GG+LL C+ CP+ H C +L S P GDW C +C+++ + +
Sbjct: 811 NEDWCAVCQNGGDLLCCEKCPKVSHLTCHVPTLLSFPSGDWICTFCRDIGKPEVEYDCDN 870
Query: 627 LQHDANAVEAGRVSGVDSVEQITKRCIRIV 656
LQH A +S VD ++C R++
Sbjct: 871 LQHSKKGKTAQGLSPVDQ-----RKCERLL 895
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 807 NSIICTGHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-DMPDGSW 865
Query: 613 YCKYCQ 618
+C C+
Sbjct: 866 FCNDCR 871
Score = 43.1 bits (100), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKY 616
K+++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1234 KESEDECFRCGDGGQLVLCDRKSCTKAYHLSCLGLGKRPFGKWECPW 1280
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 157 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 215
Query: 613 YCKYCQ 618
+C C+
Sbjct: 216 FCNDCR 221
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 157 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 215
Query: 613 YCKYCQ 618
+C C+
Sbjct: 216 FCNDCR 221
>gi|298707919|emb|CBJ30305.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1534
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYC 617
+D++DLC +C+ GG+LL CD C FH +C L +P GDW C++C
Sbjct: 107 EDHNDLCEVCSHGGDLLCCDTCSLVFHTKCHRPELKEVPAGDWNCQFC 154
Score = 43.5 bits (101), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 669 CRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC 717
CR C + K P LLC+ C+ E+H CL L E+PKGKW+C
Sbjct: 352 CRKC-YGKQD--PELALLCEHCDDEYHTYCLDP----PLTEVPKGKWYC 393
>gi|449502294|ref|XP_004174498.1| PREDICTED: PHD finger protein 21A isoform 4 [Taeniopygia guttata]
Length = 632
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 418 NSTMHPGTRKRANEDHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 477
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 478 GMWICPKCQDQMLKK 492
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 444 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 487
>gi|397610251|gb|EJK60736.1| hypothetical protein THAOC_18861, partial [Thalassiosira oceanica]
Length = 578
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 6/47 (12%)
Query: 577 LCTICADGGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 617
+C IC GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 441 VCEICRTGGDLIVCDGGDNEGGCGRAFHLECINLRTLPKGDWICKDC 487
>gi|195402015|ref|XP_002059606.1| GJ14859 [Drosophila virilis]
gi|194147313|gb|EDW63028.1| GJ14859 [Drosophila virilis]
Length = 884
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 33/60 (55%), Gaps = 5/60 (8%)
Query: 564 HAGRQYPGK-DNDDLCTICADGGNLLPCDGCPRAFHKECA----SLSSIPQGDWYCKYCQ 618
H + PG+ N DLC C +GGNLL CD CP +FH +C S IP G W C C+
Sbjct: 41 HPYYRRPGRGHNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSCR 100
>gi|346703214|emb|CBX25313.1| hypothetical_protein [Oryza brachyantha]
Length = 1891
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQ 609
N E +P+ A + +D N D C IC G L+ CDGCP A+H C + +PQ
Sbjct: 509 NLETAPNGNNPEAASAHASQDGNSDDCRICGMDGTLVCCDGCPWAYHSRCIGQNKAFLPQ 568
Query: 610 GDWYCKYC 617
GDW+C C
Sbjct: 569 GDWFCPEC 576
>gi|348558810|ref|XP_003465209.1| PREDICTED: PHD finger protein 21A isoform 2 [Cavia porcellus]
Length = 550
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 334 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 393
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 394 GMWICPRCQDQMLKK 408
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 360 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 403
>gi|149635024|ref|XP_001507820.1| PREDICTED: PHD finger protein 21A [Ornithorhynchus anatinus]
Length = 638
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 422 NSTMHPGTRKRANEDHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 481
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 482 GMWICPKCQDQMLKK 496
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 448 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPKCQD 491
>gi|336270508|ref|XP_003350013.1| hypothetical protein SMAC_00903 [Sordaria macrospora k-hell]
gi|380095404|emb|CCC06877.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 990
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 6/53 (11%)
Query: 571 GKDNDD-LCTICAD-----GGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYC 617
G D DD +C IC++ G ++ CDGC +A H++C + +P+GDWYCK C
Sbjct: 425 GSDGDDEVCVICSNPDTRKGNQIVFCDGCDKAVHQKCYGIPRLPRGDWYCKEC 477
>gi|358398564|gb|EHK47915.1| hypothetical protein TRIATDRAFT_129087 [Trichoderma atroviride IMI
206040]
Length = 1065
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 4/64 (6%)
Query: 557 SPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASL---SSIPQGDWY 613
+PS ++ + G DND+ C+ C + G+++ CDGCPR+FH EC + +P +WY
Sbjct: 615 TPSATTSNGAPRDQGSDNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWY 673
Query: 614 CKYC 617
C C
Sbjct: 674 CNEC 677
>gi|281352213|gb|EFB27797.1| hypothetical protein PANDA_013886 [Ailuropoda melanoleuca]
Length = 640
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 437 EEYEEEGEDEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 474
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|449456180|ref|XP_004145828.1| PREDICTED: uncharacterized protein LOC101215849 [Cucumis sativus]
gi|449510841|ref|XP_004163779.1| PREDICTED: uncharacterized LOC101215849 [Cucumis sativus]
Length = 1719
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 37/77 (48%), Gaps = 4/77 (5%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSS--IPQGDWYCKYCQNMFERKRFLQHDA 631
N D C +C G+LL CDGCP A+H C + IPQG WYC C +
Sbjct: 427 NSDECRLCGMDGSLLCCDGCPSAYHLRCIGMVKVLIPQGPWYCPECS--INKSEPTITKG 484
Query: 632 NAVEAGRVSGVDSVEQI 648
+A+ V G+D E I
Sbjct: 485 SALRGAEVFGIDPYEHI 501
>gi|7582284|gb|AAF64262.1|AF208848_1 BM-006 [Homo sapiens]
Length = 328
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 121 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPK 180
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 181 GMWICPRCQDQMLKK 195
>gi|293346153|ref|XP_002726281.1| PREDICTED: PHD finger protein 21A isoform 3 [Rattus norvegicus]
gi|392346534|ref|XP_003749579.1| PREDICTED: PHD finger protein 21A [Rattus norvegicus]
Length = 562
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 348 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 407
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 408 GMWICPRCQDQMLKK 422
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 374 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 417
>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
Length = 516
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 54/101 (53%), Gaps = 10/101 (9%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G D N+D C +C +GG LL C+ CP+ FH C +L++ P G+W C +C+++
Sbjct: 280 HVGETRKEDDPNEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDL 339
Query: 621 FERKRFLQHD----ANAVEAGRVSGVDSVEQITKR-CIRIV 656
K + +D ++ E + G+ + I KR C R++
Sbjct: 340 --SKPEVDYDCDVPSHHSEKRKSEGLTKLTPIDKRKCERLL 378
>gi|149022665|gb|EDL79559.1| similar to PHF21A protein (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 563
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 349 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 408
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 409 GMWICPRCQDQMLKK 423
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 375 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 418
>gi|397574031|gb|EJK48991.1| hypothetical protein THAOC_32170 [Thalassiosira oceanica]
Length = 884
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 6/47 (12%)
Query: 577 LCTICADGGNLLPCDG------CPRAFHKECASLSSIPQGDWYCKYC 617
+C IC GG+L+ CDG C RAFH EC +L ++P+GDW CK C
Sbjct: 747 VCEICRTGGDLIVCDGGDNEGGCGRAFHLECINLRTLPEGDWICKDC 793
>gi|28971718|dbj|BAC65327.1| PFTF1 [Mus musculus]
Length = 556
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 342 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 401
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 402 GMWICPRCQDQMLKK 416
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|354495678|ref|XP_003509956.1| PREDICTED: tripartite motif-containing protein 66-like [Cricetulus
griseus]
Length = 1384
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ +
Sbjct: 1136 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQ 1187
>gi|148695644|gb|EDL27591.1| PHD finger protein 21A, isoform CRA_a [Mus musculus]
Length = 557
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 343 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 402
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 403 GMWICPRCQDQMLKK 417
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 369 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 412
>gi|157841174|ref|NP_620094.2| PHD finger protein 21A isoform 1 [Mus musculus]
Length = 556
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 342 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 401
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 402 GMWICPRCQDQMLKK 416
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 67/156 (42%), Gaps = 27/156 (17%)
Query: 581 CADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDANAVEAGR 638
+ GG ++ CD CPRA+H C + P+G W C +C+ K +Q +A +
Sbjct: 287 SSQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCE-----KEGIQWEAKEDNS-- 339
Query: 639 VSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQCEREFHVGC 698
G + +E++ E + CR C K G +L CD C +H+ C
Sbjct: 340 -EGEEILEEVGG------DPEEEDDHHMEFCRVC---KDG---GELLCCDTCPSSYHIHC 386
Query: 699 LKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLLV 733
L L E+P G+W C C + +Q +L+
Sbjct: 387 LN----PPLPEIPNGEWLCPRCTCPSLKGKVQKILI 418
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 5/53 (9%)
Query: 570 PGKDND---DLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P +++D + C +C DGG LL CD CP ++H C + L IP G+W C C
Sbjct: 352 PEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 404
>gi|323456424|gb|EGB12291.1| hypothetical protein AURANDRAFT_61296 [Aureococcus anophagefferens]
Length = 757
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 28/46 (60%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN 619
N+D C IC GG LL CD C RAFH C L+ +P+G W C C+
Sbjct: 697 NEDHCFICRSGGELLCCDTCERAFHVTCCGLADVPEGAWSCYVCET 742
Score = 43.1 bits (100), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 35/86 (40%), Gaps = 15/86 (17%)
Query: 540 GYKNG-LGIICHCCNSEVSPSQFEAHAGRQYPGKDNDDLCTICA---DGGNLLPCDG--- 592
G NG +G C V F R YP C +C D G+LL CDG
Sbjct: 460 GLSNGAIGQACLGRVDHVEGRYFRFIGDRVYP-------CAVCGTDHDAGSLLLCDGRDG 512
Query: 593 -CPRAFHKECASLSSIPQGDWYCKYC 617
C H C L ++P GDW+C C
Sbjct: 513 RCVSTAHTHCIGLDAVPDGDWFCPQC 538
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 570 PGKDNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P ++DD C +C DGG +L CD CP A+H C + L IP GDW C C
Sbjct: 427 PADEDDDEHQEFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRC 480
Score = 53.1 bits (126), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 54/147 (36%), Gaps = 47/147 (31%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
++ D C +C GG ++ CD CP+A+H C L P+G W C C
Sbjct: 376 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 422
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
EA + D E + R+ K+ G +LC CD C
Sbjct: 423 ----EAEGPADEDDDEH--QEFCRVCKD-----GGEMLC-----------------CDSC 454
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L ++P G W C
Sbjct: 455 PSAYHTWCL----TPPLDDIPDGDWRC 477
>gi|17512445|gb|AAH19181.1| PHD finger protein 21A [Mus musculus]
Length = 556
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 553 NSEVSPSQFEAHAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
NS + P + +P D ++D C++C G LL CD C R +H +C L +IP+
Sbjct: 342 NSTMHPGTRKRANEEHWPKGDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPK 401
Query: 610 GDWYCKYCQNMFERK 624
G W C CQ+ +K
Sbjct: 402 GMWICPRCQDQMLKK 416
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 368 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 411
>gi|351697354|gb|EHB00273.1| PHD finger protein 21A, partial [Heterocephalus glaber]
Length = 664
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C+IC G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 468 GDIHEDFCSICRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 523
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 475 CSICR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 518
>gi|363728201|ref|XP_425507.3| PREDICTED: PHD finger protein 21B [Gallus gallus]
Length = 579
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+D+ CT C G NL PC CPRA+H C L + P+G W C CQ ++ L+ D
Sbjct: 397 HDEYCTACKRGVNLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQ-----QKVLKKDD 451
Query: 632 NAVEAGRVSGVDS 644
N G ++ V S
Sbjct: 452 NVPWTGTLAIVHS 464
>gi|114682470|ref|XP_001163886.1| PREDICTED: protein kinase C-binding protein 1 isoform 3 [Pan
troglodytes]
Length = 1105
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 40/73 (54%), Gaps = 7/73 (9%)
Query: 564 HAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ-NMFE 622
H GR ND C +C G +L C+ CPR +H +C L+S P+GDW+C C+ + F+
Sbjct: 58 HDGR------NDFYCWVCHREGQVLCCELCPRVYHAKCLRLTSEPEGDWFCPECETDAFQ 111
Query: 623 RKRFLQHDANAVE 635
+ L+ + E
Sbjct: 112 KPVPLEQHPDYAE 124
>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Danio rerio]
Length = 773
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 556 VSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCK 615
V P+ F G + N C +C++GG+LL C+ CP AFH+EC ++ +P+G WYC
Sbjct: 141 VCPNHFTPRKGCRNHEHVNVSWCFVCSEGGSLLCCESCPAAFHRECLNI-DMPEGSWYCN 199
Query: 616 YCQ 618
C+
Sbjct: 200 DCR 202
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 572 KDNDDLCTICADGGNLLPCD--GCPRAFHKECASLSSIPQGDWYCKYCQ 618
K+ +D C C DGG ++ C GCP+ +H +C +L+ P G W C + Q
Sbjct: 563 KEREDECFSCGDGGQIVSCKRPGCPKVYHADCLNLTKRPAGRWECPWHQ 611
>gi|391331672|ref|XP_003740267.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Metaseiulus occidentalis]
Length = 1925
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 65/166 (39%), Gaps = 31/166 (18%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD C RA+H C L P+G W C +C+ HD
Sbjct: 331 DHQDYCEVCQQGGEVILCDTCVRAYHLVCLEPELEEPPEGRWSCPHCEGEGISAEGAAHD 390
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCR-GCDFSKSGFGPRTILLCDQ 689
+ R G K+ + + C C+ G D +L CDQ
Sbjct: 391 SQEAAGPRSKGGSK-----------TKDGDEHMEYCRTCKEGGD----------LLHCDQ 429
Query: 690 CEREFHVGCLKKHKMADLRELPKGKWFC--CMDCSRINSVLQNLLV 733
C +H C+ L P GKW C C C ++ +Q +L
Sbjct: 430 CPGSYHFECV----FPPLNRPPSGKWTCPRCA-CEQLKGRVQRILA 470
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 14/83 (16%)
Query: 549 CHCCNSEVSPSQFEAHAGRQYPG---------KDNDD---LCTICADGGNLLPCDGCPRA 596
C C E ++ AH ++ G KD D+ C C +GG+LL CD CP +
Sbjct: 374 CPHCEGEGISAEGAAHDSQEAAGPRSKGGSKTKDGDEHMEYCRTCKEGGDLLHCDQCPGS 433
Query: 597 FHKECA--SLSSIPQGDWYCKYC 617
+H EC L+ P G W C C
Sbjct: 434 YHFECVFPPLNRPPSGKWTCPRC 456
>gi|340374938|ref|XP_003385994.1| PREDICTED: hypothetical protein LOC100632791 [Amphimedon
queenslandica]
Length = 1196
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 32/54 (59%), Gaps = 2/54 (3%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGD--WYCKYCQNMFERKR 625
ND C +C +GG +L CD CPR FH +C+ L+ P D W C C+N+ + +
Sbjct: 131 NDSYCWVCHEGGEVLCCDKCPRVFHLQCSGLAKAPDDDEEWICPVCKNISRKPK 184
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 312
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 313 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 350
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|260791426|ref|XP_002590730.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
gi|229275926|gb|EEN46741.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
Length = 1073
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 47/103 (45%), Gaps = 4/103 (3%)
Query: 559 SQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASL----SSIPQGDWYC 614
S+ EA R+ N D C C +GG+LL CD CP AFH +C +P+G+W C
Sbjct: 34 SRREAREPRRTGKSFNHDCCDSCKEGGDLLCCDRCPAAFHLQCCDPPLCEEDLPEGEWLC 93
Query: 615 KYCQNMFERKRFLQHDANAVEAGRVSGVDSVEQITKRCIRIVK 657
C + + D A A S S +Q R +IV+
Sbjct: 94 HRCMVLEQFPELDDRDETASNASVASSTASYKQRNLRDKKIVR 136
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 29/163 (17%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+ + + +
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 312
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E G G E R+ K+ G LLC CD C
Sbjct: 313 EEYEEEGEEEGEKEEEDDHMEYCRVCKD-----GGELLC-----------------CDAC 350
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFCCM-DCSRINSVLQNLL 732
+H+ CL L ++P G+W C C + +Q +L
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Cricetulus griseus]
Length = 1365
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|365989444|ref|XP_003671552.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
gi|343770325|emb|CCD26309.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
Length = 866
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 4/57 (7%)
Query: 565 AGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKEC----ASLSSIPQGDWYCKYC 617
A +Q ++NDD C+ C G+ L CD CPR+FH C +++P+GDW C +C
Sbjct: 374 AEQQKAEQENDDYCSACLQAGSFLCCDTCPRSFHFLCLDPPLDPNNLPEGDWSCHHC 430
>gi|258576351|ref|XP_002542357.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902623|gb|EEP77024.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1138
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Query: 564 HAGRQYPGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
HAG PG++ D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 399 HAG---PGEEQDSKCAICDDGDCENSNAIVFCDGCDLAVHQECYGVPYIPEGQWLCRKCQ 455
>gi|344249575|gb|EGW05679.1| Tripartite motif-containing protein 66 [Cricetulus griseus]
Length = 1297
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ +
Sbjct: 937 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQ 988
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 186 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 244
Query: 613 YCKYCQ 618
+C C+
Sbjct: 245 FCNDCR 250
>gi|345322044|ref|XP_001508010.2| PREDICTED: transcription intermediary factor 1-alpha
[Ornithorhynchus anatinus]
Length = 800
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 564 HAGRQYPGKD-NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
H G +D N+D C +C +GG LL C+ CP+ FH C +L + P G+W C +C+++
Sbjct: 564 HGGESRKEEDPNEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDL 623
>gi|47230454|emb|CAF99647.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2153
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
G +++D C +C GG+LL CD CP+ FH C L S P GDW C C+ +
Sbjct: 1875 GMESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCREV 1926
>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
Length = 584
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 28 NSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCCESCPAAFHPDCLNI-EMPDGSW 86
Query: 613 YCKYCQ 618
+C C+
Sbjct: 87 FCNDCR 92
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus
griseus]
Length = 1344
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 788 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-DMPDGSW 846
Query: 613 YCKYCQ 618
+C C+
Sbjct: 847 FCNDCR 852
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1215 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1261
>gi|238482371|ref|XP_002372424.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
gi|220700474|gb|EED56812.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
Length = 1184
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 5/54 (9%)
Query: 570 PGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
PG+D D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 418 PGEDQDSKCAICDDGDCENSNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRKCQ 471
>gi|449273319|gb|EMC82842.1| PHD finger protein 21B, partial [Columba livia]
Length = 446
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHDA 631
+D+ CT C G NL PC CPRA+H C L + P+G W C CQ ++ L+ D
Sbjct: 264 HDEHCTSCKRGINLQPCGTCPRAYHLNCLDPPLKTAPKGVWVCPKCQ-----QKVLKKDD 318
Query: 632 NAVEAGRVSGVDS 644
N G ++ V S
Sbjct: 319 NVPWTGTLAIVHS 331
>gi|169765768|ref|XP_001817355.1| PHD finger domain protein [Aspergillus oryzae RIB40]
gi|83765210|dbj|BAE55353.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391864581|gb|EIT73876.1| PHD finger protein [Aspergillus oryzae 3.042]
Length = 1184
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 5/54 (9%)
Query: 570 PGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
PG+D D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 418 PGEDQDSKCAICDDGDCENSNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRKCQ 471
>gi|241998816|ref|XP_002434051.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
gi|215495810|gb|EEC05451.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
Length = 907
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 574 NDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
N+D C++C DGG LL C CPR +H +C SLS+ P DW C C ++ +
Sbjct: 693 NEDWCSVCHDGGELLCCGSCPRVYHLQCHVPSLSATPSEDWTCLLCLDILK 743
>gi|410907814|ref|XP_003967386.1| PREDICTED: PHD finger protein 21A-like [Takifugu rubripes]
Length = 794
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D CT+C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 427 GDIHEDFCTVCRRSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPKCQDQILKK 482
>gi|405960336|gb|EKC26267.1| E3 ubiquitin-protein ligase TRIM33 [Crassostrea gigas]
Length = 899
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 6/70 (8%)
Query: 556 VSPSQFEAHAGRQYP-GKDND---DLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQ 609
V S G+Q P KD+D D C +C +GG+LL CD CP+ +H +C L P
Sbjct: 650 VHTSHLSVSHGKQMPENKDDDPNEDYCAVCQNGGDLLCCDKCPKVYHLKCHIPELKEFPS 709
Query: 610 GDWYCKYCQN 619
+W C C N
Sbjct: 710 DEWQCTMCTN 719
>gi|390367174|ref|XP_003731194.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Strongylocentrotus purpuratus]
Length = 2202
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 57/147 (38%), Gaps = 46/147 (31%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CP+AFH C L + P+G W C C
Sbjct: 346 DHQDYCEVCQQGGEIILCDTCPKAFHLVCLDPELETAPEGKWSCPNC------------- 392
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
E + + ++ + C R+ + G LLC C+QC
Sbjct: 393 ----EGEGIPEPEPADEHMEFC-RVCHD-----GGELLC-----------------CEQC 425
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H+ CL LR++P W C
Sbjct: 426 PSSYHIFCLN----PPLRKIPDDDWVC 448
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
P ++ + C +C DGG LL C+ CP ++H C + L IP DW C C
Sbjct: 402 PADEHMEFCRVCHDGGELLCCEQCPSSYHIFCLNPPLRKIPDDDWVCPRC 451
>gi|330925213|ref|XP_003300958.1| hypothetical protein PTT_12344 [Pyrenophora teres f. teres 0-1]
gi|311324662|gb|EFQ90949.1| hypothetical protein PTT_12344 [Pyrenophora teres f. teres 0-1]
Length = 1124
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 64/160 (40%), Gaps = 43/160 (26%)
Query: 571 GKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQNMFERKR 625
G++ D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ +
Sbjct: 410 GEEQDTKCAICDDGDCENTNAIVFCDGCDLAVHQECYGVPFIPEGQWLCRRCQLV----- 464
Query: 626 FLQHDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCD--FSKSGFGPRT 683
GR + V +EL GC+ C D F ++
Sbjct: 465 -----------GRGTPV------------------SELPGCIFCPNIDGAFKQTTAMKWA 495
Query: 684 ILLCDQCEREFHVG-CLKKHKMADLRELPKGKW-FCCMDC 721
LLC E +G + + D+ ++PK +W C C
Sbjct: 496 HLLCAMWIPEVSLGNATFQEPVQDVEKVPKTRWKLSCYIC 535
>gi|149068363|gb|EDM17915.1| rCG40452, isoform CRA_b [Rattus norvegicus]
Length = 1210
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 55/95 (57%), Gaps = 13/95 (13%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ + + +++D
Sbjct: 962 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1019
Query: 631 ANAVEAGRVS--GVDSVEQIT----KRCIRIVKNL 659
E R S GV ++ ++ K+C ++V +L
Sbjct: 1020 C---ENARYSHPGVRALPGLSMYDQKKCEKLVLSL 1051
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 810 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSW 868
Query: 613 YCKYCQ 618
+C C+
Sbjct: 869 FCNDCR 874
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1237 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1283
>gi|170284946|gb|AAI61082.1| whsc1 protein [Xenopus (Silurana) tropicalis]
Length = 1028
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 559 SQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
S F A+ + + N C +C++GG+LL C+ CP AFH +C ++ +P G W+C C+
Sbjct: 759 SHFAANKAKSHHAHINVSWCFVCSNGGSLLCCESCPAAFHPDCLNI-EMPDGSWFCNDCR 817
Query: 619 NMFERKRF 626
+ ++ RF
Sbjct: 818 -LGKKPRF 824
Score = 39.3 bits (90), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 577 LCTICADGGNLLPCDG-CPRAFHKECASLSSIPQGDWYCKYC 617
+C +C G+L+ C+G C AFH C LS+ P G + CK C
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSCIGLSTRPAGKYLCKEC 654
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 742 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSW 800
Query: 613 YCKYCQ 618
+C C+
Sbjct: 801 FCNDCR 806
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1169 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1215
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 809 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSW 867
Query: 613 YCKYCQ 618
+C C+
Sbjct: 868 FCNDCR 873
Score = 40.4 bits (93), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1236 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1282
>gi|67541010|ref|XP_664279.1| hypothetical protein AN6675.2 [Aspergillus nidulans FGSC A4]
gi|40738428|gb|EAA57618.1| hypothetical protein AN6675.2 [Aspergillus nidulans FGSC A4]
gi|259480259|tpe|CBF71226.1| TPA: PHD finger domain protein, putative (AFU_orthologue;
AFUA_7G05250) [Aspergillus nidulans FGSC A4]
Length = 1173
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 5/54 (9%)
Query: 570 PGKDNDDLCTICADG-----GNLLPCDGCPRAFHKECASLSSIPQGDWYCKYCQ 618
PG+D D C IC DG ++ CDGC A H+EC + IP+G W C+ CQ
Sbjct: 424 PGEDQDTKCAICDDGDCENANAIVFCDGCDLAVHQECYGVPFIPEGQWLCRKCQ 477
>gi|432863937|ref|XP_004070196.1| PREDICTED: PHD finger protein 21A-like [Oryzias latipes]
Length = 805
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D CT+C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 489 GDIHEDFCTVCRRSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPKCQDQILKK 544
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 553 NSEVSPSQFEAHAGRQYPGKDNDDLCTICADGGNLLPCDGCPRAFHKECASLSSIPQGDW 612
NS + F A G+++ N C +C+ GG+LL C+ CP AFH +C ++ +P G W
Sbjct: 790 NSIICTGHFTARKGKRHHTHVNVSWCFVCSKGGSLLCCEACPAAFHPDCLNI-EMPDGSW 848
Query: 613 YCKYCQ 618
+C C+
Sbjct: 849 FCNDCR 854
Score = 40.4 bits (93), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 572 KDNDDLCTICADGGNLLPCDG--CPRAFHKECASLSSIPQGDWYCKY 616
+ ++D C C DGG L+ CD C +A+H C L P G W C +
Sbjct: 1217 RQSEDECFRCGDGGQLVLCDRKFCTKAYHLSCLGLGKRPFGKWECPW 1263
>gi|348558812|ref|XP_003465210.1| PREDICTED: PHD finger protein 21A isoform 3 [Cavia porcellus]
Length = 682
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 485 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 540
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 492 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 535
>gi|156848686|ref|XP_001647224.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
gi|156117909|gb|EDO19366.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
Length = 682
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKECAS----LSSIPQGDWYCKYC 617
P DNDD C+ C G+ L CD CPR+FH C + +P+GDW C +C
Sbjct: 269 PEFDNDDFCSTCLQTGSFLCCDTCPRSFHFLCLNPPLDPDQLPEGDWSCPHC 320
>gi|426245369|ref|XP_004016484.1| PREDICTED: PHD finger protein 21A isoform 3 [Ovis aries]
Length = 682
Score = 56.2 bits (134), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 483 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 538
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 490 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 533
>gi|307199491|gb|EFN80104.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Harpegnathos saltator]
Length = 1948
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 56/147 (38%), Gaps = 45/147 (30%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERKRFLQHD 630
D+ D C +C GG ++ CD CPRA+H C L P+G W C +C+
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCEG----------- 414
Query: 631 ANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSKSGFGPRTILLCDQC 690
G D ++ + C R+ K+ G LLC CD C
Sbjct: 415 -----EGITGAADDDDEHMEFC-RVCKD-----GGELLC-----------------CDSC 446
Query: 691 EREFHVGCLKKHKMADLRELPKGKWFC 717
+H CL L E+P G W C
Sbjct: 447 TSAYHTHCLN----PPLSEIPDGDWKC 469
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 29/51 (56%), Gaps = 6/51 (11%)
Query: 573 DNDD----LCTICADGGNLLPCDGCPRAFHKECAS--LSSIPQGDWYCKYC 617
D+DD C +C DGG LL CD C A+H C + LS IP GDW C C
Sbjct: 422 DDDDEHMEFCRVCKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 472
>gi|62645406|ref|XP_342468.2| PREDICTED: PHD finger protein 21A isoform 2 [Rattus norvegicus]
gi|109470259|ref|XP_001070000.1| PREDICTED: PHD finger protein 21A isoform 1 [Rattus norvegicus]
Length = 694
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 499 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPRCQDQMLKK 554
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 506 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 549
>gi|260817522|ref|XP_002603635.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
gi|229288956|gb|EEN59646.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
Length = 190
Score = 56.2 bits (134), Expect = 8e-05, Method: Composition-based stats.
Identities = 21/55 (38%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 570 PGKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
P N+D C +C +GG+LL CD CP+ +H C ++ ++P GD+ C C+ + E
Sbjct: 9 PEDPNEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLCEELPE 63
>gi|444522352|gb|ELV13370.1| Tripartite motif-containing protein 66 [Tupaia chinensis]
Length = 1183
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++
Sbjct: 943 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSL 992
>gi|47214709|emb|CAG01062.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1036
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 67/147 (45%), Gaps = 26/147 (17%)
Query: 573 DNDDLCTICADGGN-----LLPCDGCPRAFHKECASLSSIPQGDWYCKYCQN---MFERK 624
D D +C IC DG +L CD C A H+EC + IP+G W C++C + +++
Sbjct: 214 DEDAVCCICMDGDGADSNVILFCDSCNIAVHQECYGVPYIPEGQWLCRHCLQVRLLPQQR 273
Query: 625 RFLQ--------HDANAVEAGRVSGVDSVEQITKRCIRIVKNLEAELSGCLLCRGCDFSK 676
R L+ H A A+ V D+V +R + +L+ C LCR +
Sbjct: 274 RSLKKTDDGRWGHVACALWVPEVGFSDTVFIEPIDGVRNIPPARWKLT-CYLCR-----E 327
Query: 677 SGFGPRTILLCDQ--CEREFHVGCLKK 701
G G + CD+ C FHV C +K
Sbjct: 328 KGAG--ACIQCDKVNCYTAFHVSCAQK 352
>gi|395815210|ref|XP_003781127.1| PREDICTED: tripartite motif-containing protein 66 [Otolemur
garnettii]
Length = 1211
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFE 622
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++ +
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQ 1014
>gi|354469852|ref|XP_003497336.1| PREDICTED: LOW QUALITY PROTEIN: PHD finger protein 21A-like
[Cricetulus griseus]
Length = 690
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 495 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 550
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 502 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 545
>gi|281341697|gb|EFB17281.1| hypothetical protein PANDA_004268 [Ailuropoda melanoleuca]
Length = 1217
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+N+D C +C +GG LL CD CP+ FH C +L P GDW C C++
Sbjct: 988 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRS 1036
>gi|90167365|sp|Q6ZPK0.2|PF21A_MOUSE RecName: Full=PHD finger protein 21A; AltName: Full=BHC80a;
AltName: Full=BRAF35-HDAC complex protein BHC80;
Short=mBHC80
Length = 659
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 464 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPRCQDQMLKK 519
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 471 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 514
>gi|291384596|ref|XP_002708841.1| PREDICTED: tripartite motif-containing 66 [Oryctolagus cuniculus]
Length = 1211
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNM 620
+N+D C +C +GG LL CD CP+ +H C +L S P G+W C C+++
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSL 1012
>gi|157841176|ref|NP_001103160.1| PHD finger protein 21A isoform 2 [Mus musculus]
Length = 688
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 493 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPRCQDQMLKK 548
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 500 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 543
>gi|329664170|ref|NP_001193126.1| PHD finger protein 21A [Bos taurus]
gi|296479679|tpg|DAA21794.1| TPA: PHD finger protein 21A isoform 2 [Bos taurus]
Length = 681
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 482 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 537
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 489 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 532
>gi|37360512|dbj|BAC98234.1| mKIAA1696 protein [Mus musculus]
Length = 660
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 465 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPRCQDQMLKK 520
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL+ L+ +PKG W C C D
Sbjct: 472 CSVCR-----KSG----QLLMCDTCSRVYHLDCLE----PPLKTIPKGMWICPRCQD 515
>gi|301761576|ref|XP_002916215.1| PREDICTED: tripartite motif-containing protein 66-like [Ailuropoda
melanoleuca]
Length = 1398
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 573 DNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQN 619
+N+D C +C +GG LL CD CP+ FH C +L P GDW C C++
Sbjct: 1150 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRS 1198
>gi|291384911|ref|XP_002708905.1| PREDICTED: BRAF35/HDAC2 complex [Oryctolagus cuniculus]
Length = 682
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 571 GKDNDDLCTICADGGNLLPCDGCPRAFHKEC--ASLSSIPQGDWYCKYCQNMFERK 624
G ++D C++C G LL CD C R +H +C L +IP+G W C CQ+ +K
Sbjct: 484 GDIHEDFCSVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKK 539
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 29/57 (50%), Gaps = 15/57 (26%)
Query: 666 CLLCRGCDFSKSGFGPRTILLCDQCEREFHVGCLKKHKMADLRELPKGKWFC--CMD 720
C +CR KSG +L+CD C R +H+ CL L+ +PKG W C C D
Sbjct: 491 CSVCR-----KSG----QLLMCDTCSRVYHLDCLD----PPLKTIPKGMWICPRCQD 534
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,280,113,655
Number of Sequences: 23463169
Number of extensions: 593696353
Number of successful extensions: 1529387
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2365
Number of HSP's successfully gapped in prelim test: 3252
Number of HSP's that attempted gapping in prelim test: 1508508
Number of HSP's gapped (non-prelim): 18259
length of query: 954
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 802
effective length of database: 8,792,793,679
effective search space: 7051820530558
effective search space used: 7051820530558
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)