BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001394
(1087 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 923 bits (2386), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 537/1121 (47%), Positives = 697/1121 (62%), Gaps = 113/1121 (10%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANG D+ + +V +++R GLKREF FA K SEI SLGRTRA + +S GCV P
Sbjct: 1 MANGKDNNKDMLV-AEVRPGLKREFAFAFKALSEISRSLGRTRASRTRSG--GGCV-SPA 56
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERP----IREKESKD 116
K K + +K +K ++ E+K EE V SDV ++ + E + E S
Sbjct: 57 SNNKKKRLKGKKDQK-------ARDLEEKEEENV-SDVVELGSGDEEATIGGLIESVSVS 108
Query: 117 DSENMGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKY 176
D+E G G G ++ V+E S EE EK+NE +EVL N
Sbjct: 109 DTEING-GNNGEIVEVKENGAGSMCLEE-------------TEKRNEHEEVLKN------ 148
Query: 177 GELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENE---HSEVDREKAEND 233
+ EE ++ + VE+E +NE ++ + E D K E+
Sbjct: 149 ---------DQCEERRSRSLPYSMKVEDESKNESDRTCEEIVSGSVPILMEEDSRKLEDV 199
Query: 234 LIGE---VKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRV---QMEM 287
I E +NE EEV+ D+ R A ++ CEE G + + +
Sbjct: 200 TIKEEIPKRNEPEEVLG------NDDLKRYADGNDQ--CEERISGSSPNSMNIDNFENQN 251
Query: 288 DEEKKNDIERELVENGVLESSM---------VGKHSSTLCNGESNVAKSVAVDGNDEGKT 338
E KN++E+ N +LES S L N E S+ + ND K
Sbjct: 252 GEHSKNEMEKVTAMNELLESKSDMNNVNEEGTSMSSVILMNSEGGAIDSLPI--NDSTK- 308
Query: 339 VNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQE---TVMK 395
V E+P+RRFTRSLL+ K+E+ + KD + A D G P T++K
Sbjct: 309 --VEKEKPMRRFTRSLLKPKMEIGQEYAVKD-------SSSAADDAGSPSAASNSGTMLK 359
Query: 396 PRK--VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDC 453
K +KF +KLK+ L+SGILEG V Y+RGSK +G G + L+GV+ GS I CFC C
Sbjct: 360 VWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSC 419
Query: 454 KGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS 513
+GN+VVTP++FE+HAGS+NKRPPEYIYLENG TLRD+MN CK++ LETL++A+ + G S
Sbjct: 420 RGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCS 479
Query: 514 SMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSV 573
S+K + FCL CR + A M LC C+ LK+SQA P +
Sbjct: 480 SLKNSTFCLKCRGKLAEASTGRSMTLCSQCMVLKDSQASI----------------PATT 523
Query: 574 ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMK--SSSVKSHGKITRKDLRMHKLVFEE 631
+ ++ A S +P+ + SS S +K +S KS G++T KDLRMHKLVFEE
Sbjct: 524 DTDKGYAESDVCAYRIVLTPKSHPVSKSSDSVLKCSTSRSKSQGRLTVKDLRMHKLVFEE 583
Query: 632 GGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYT 691
L DG EV Y+ RG+K LVGYKKGFGI C+CCN+EVSPSQFEAHAGWASRRKP+ HIYT
Sbjct: 584 DVLPDGTEVAYYSRGQKLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYT 643
Query: 692 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
SNGVSLHEL+I LS R FS+ +NDDLC IC DGGDLLCCD CPRA+H DC++LP IP+G
Sbjct: 644 SNGVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLCCDVCPRAYHKDCLALPEIPTG 703
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
W+C++C+N FQKEKFVE+NANA AAGR+ GVDP Q+ RCIRIV+T D + GGCV CR
Sbjct: 704 RWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVKTMDADFGGCVFCR 763
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
G DF K FG RTV+LCDQCE+E+HVGCLKDH MEDL+ELPKG W CC+DC RI+ AL+K
Sbjct: 764 GHDFDKI-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCCSDCCRIHSALEK 822
Query: 872 LVDRGEEKLPETSLDVIKKK-HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVS 930
LV RGEE+L ++SL++I KK E+ + + DVRWR+L K A D T ALLS+A++
Sbjct: 823 LVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAGD-TAALLSEALA 881
Query: 931 IFHDRFDPII----ESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 986
I H++F+PI+ S + DLI +MV+G + +GQ++ GMYCA+L +NQ VVS I R F
Sbjct: 882 ILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGMYCAVLMINQAVVSCAIIRFF 941
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G ELAELPLVATS+ QG+GYFQ+LF CIEKLLGFLN+K LVLP+A EA++IW NKFGF
Sbjct: 942 GLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAEEAESIWINKFGFR 1001
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
+T EE K+R DY +M+FQGTSML KPVPK RIVG+S G
Sbjct: 1002 KLTHEEFLKFRKDYQMMVFQGTSMLHKPVPKIRIVGRSEGG 1042
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 909 bits (2348), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/927 (52%), Positives = 616/927 (66%), Gaps = 90/927 (9%)
Query: 199 VVGVEEERRNECNQVLTN-----------VEENEHSEVDREKAENDLIGEVKNEFEEVVA 247
V+ +EE R +L N EE + +EV+ E+ N L+ + +
Sbjct: 81 VIDLEEARVESLAPLLNNYGDGEIVEVKEFEEAKENEVECEEKNNGLVPVLMDGVMAESG 140
Query: 248 VVEEE---KKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIER------- 297
V+E + + E D+V CEE G + +DE+ K + R
Sbjct: 141 VIENKGGGEVKEGDKVHA------CEEGSSG--------LVLIDEDSKPTVNRVLESKSG 186
Query: 298 -ELVENGVLESSMVGKHSSTLCNGESNVAKS----VAVDGNDEGKTVNVVVERPLRRFTR 352
EL ++ E G S ++ N E + V V+G+ + K V E+P RRFTR
Sbjct: 187 CELKKDDACEEGTSGLSSVSVKNDEGGYVNASFQPVVVNGDSKCK---VEEEKPFRRFTR 243
Query: 353 SLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLE 412
S L+ K+E S S DG K V + + V T M ++KF +KLK+ L+
Sbjct: 244 SALKPKIEPLDIS-SSDGVK---VDDTGSSSVAAITTTPTKMFAIDGLKKFPTKLKDLLD 299
Query: 413 SGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSN 472
SGILEG V Y+RG KV+GPG GL GVVK SGI CFCDDCKG +VVTP +FELHAGS+N
Sbjct: 300 SGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIFELHAGSAN 359
Query: 473 KRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAG 532
KRPPEYI+LENG TLRD+MN CK+S L+ L++A+R+ +G + KK+NFCL+CR S + AG
Sbjct: 360 KRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSCRGSITGAG 419
Query: 533 VEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRS 592
+ +LC C+ELK+SQA + P+ TDT R+
Sbjct: 420 TRKSKVLCSQCLELKDSQA-------------ILAPE----------------TDTKERT 450
Query: 593 PEPNSAQTSSHSKMKSSSVKSH--GKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFL 650
P P+ SS + +KSS +S+ G++T+KD+RMHKLVFEE L DG EVGY+ +G+K L
Sbjct: 451 PRPSPVPESSSALLKSSPSRSNSQGRLTKKDIRMHKLVFEEEVLPDGTEVGYYSQGKKLL 510
Query: 651 VGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPF 710
VGYKKGFGI C+CCN+EVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHEL+I LS R
Sbjct: 511 VGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKCRRH 570
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 770
S+KENDDLC IC DGG LLCCD CPRAFH +C+SLP IP G W+C+YC+NTF+KEKFVE
Sbjct: 571 STKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPKGKWYCKYCLNTFEKEKFVER 630
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
NANA AAGR+ G DP Q+ RCIRIV+T + E+GGCV CRG DF ++ FG RTVI+CDQ
Sbjct: 631 NANAIAAGRVAGTDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT-FGPRTVIICDQ 689
Query: 831 CEREYHVGCLKDHGMEDLQ---ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDV 887
CE+E+HVGCLK+H M+DL+ ELP GKW CC C+RI+ ALQKLV RGEEKLP++SL+
Sbjct: 690 CEKEFHVGCLKEHQMQDLKAICELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLNF 749
Query: 888 IKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL- 946
IKKKHEES S++ D+RWR+L KK D SD T +LLS+AV+IFH+RF PI SK
Sbjct: 750 IKKKHEESASESGGGDDIRWRLL-SKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKRK 808
Query: 947 ----DLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
D IP+MV G +GQD GMYCA+L VN VVSA + RIFGQELAELP+VATS+
Sbjct: 809 RDDHDFIPSMVKGGDMKGQDLGGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSKS 868
Query: 1003 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN--KYRNDY 1060
QGQGYFQ+LF CIEKLLGFLNVK LVLP+A E ++IWTNKFGFS +T++E +YR Y
Sbjct: 869 QGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKSY 928
Query: 1061 PLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
+M FQG+ MLQKPVPKCR+VGKS G
Sbjct: 929 QIMEFQGSLMLQKPVPKCRVVGKSEGG 955
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%), Gaps = 2/40 (5%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLG 40
MANGTD+++ V +K+R G KREFEFA + SEICGSLG
Sbjct: 1 MANGTDAKDAAV--AKVRPGHKREFEFAFRAHSEICGSLG 38
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 880 bits (2274), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/903 (51%), Positives = 612/903 (67%), Gaps = 58/903 (6%)
Query: 203 EEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAM 262
EEE +++ ++++ E S+VD G+ + E++ A+ EE K+E +
Sbjct: 106 EEEAKSDIVDLISD--EEPKSQVDES------TGDTGTKDEKLDAIRIEESKEE----LL 153
Query: 263 DVEEV---KCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVE-NGVLESSMVGKHSSTLC 318
D E+ + ++ + E +V +EE K + E E + + VGK+ S+
Sbjct: 154 DSEDPSSHRTVDLAIHSELVDVKVDPSYEEESKETLRNESEELSTCADLGKVGKNVSS-- 211
Query: 319 NGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGSLSKDGGK 372
+N +KS+ +D N G+ + ++P +RFTRS L+Q VE L+K +
Sbjct: 212 EEAANGSKSI-IDVN--GQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQV 268
Query: 373 RSDVTEVANDGVGGPVKQETV----MKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGS 427
++ TE + + GP+ V K +KV +KF +KLK+ L++GILEG+ V YIRGS
Sbjct: 269 ITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKKFPAKLKDLLDTGILEGLRVRYIRGS 328
Query: 428 KVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTL 487
K+K G +GL GV+ GSGI CFC++CKG +VV+P +FELHAGSSNKRPPEYIYLE G TL
Sbjct: 329 KIKALGETGLGGVISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTL 388
Query: 488 RDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC--RVSFSNAGVEELMLLCKSCVE 545
RDIMN C++ + E+ ++ +G S +K+ CLNC R+ S+ G+ MLLC SC++
Sbjct: 389 RDIMNACQNFSFDQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMD 446
Query: 546 LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK 605
++ PQ S + + +R+P+PN SS +
Sbjct: 447 SRK-------------------PQVSSSPSPSPSPSPTPIVFSKDRTPKPNVLSKSSDTI 487
Query: 606 MKSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTC 663
KS S K HG+ITRKDLR+HKLVFEE L DG EV Y+ RG+K LVGYKKG GI C+C
Sbjct: 488 TKSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQKLLVGYKKGSGIFCSC 547
Query: 664 CNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICM 723
CNSEVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHELSI LS R FS +NDDLC IC
Sbjct: 548 CNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICA 607
Query: 724 DGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGV 783
DGGDLLCCD CPR+FH DCV LP IP+G W+C+YC N FQKEKFVE+NANA AAGR+ GV
Sbjct: 608 DGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGV 667
Query: 784 DPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH 843
DP Q+ +RCIRIV+T + E+GGC LCR DF KS FG RTVILCDQCE+E+HVGCLK++
Sbjct: 668 DPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKEN 727
Query: 844 GMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDF 903
MEDL+ELP+GKW CC +C RI+ AL+KLV G EKLPE+ L ++KK E+ GS + D
Sbjct: 728 NMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDV 787
Query: 904 DVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDY 963
++RWRVL K + +SD TR+LLSKAVSIFHD FDPI++SAS D IP+M+YGR+ RGQ++
Sbjct: 788 EIRWRVLNWKML-SSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEF 846
Query: 964 HGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLN 1023
G+YCA+LTVN+ VVS GIFRIFG E+AELPLVAT + QGQGYFQSL+ CIE+ LGFLN
Sbjct: 847 GGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLN 906
Query: 1024 VKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGK 1083
VK LVLP+A EA+++W NKFGFS + EE +++ Y +MIFQGTSMLQK VPK R++
Sbjct: 907 VKNLVLPAADEAESLWINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPKYRVINS 966
Query: 1084 SVD 1086
+ +
Sbjct: 967 AAN 969
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANGT +E FVVLS++R GLKREF FALKVQS ICGSLGRTR+ K+ + + P
Sbjct: 1 MANGTAPDE-FVVLSRVRTGLKREFAFALKVQSTICGSLGRTRSTKLHNAIPESPT--PK 57
Query: 61 EVKKLKTY--------ESRKKRKRQEQSVVVKETEDKR----------EEEVKSDVFDVI 102
+K L T + Q +S V E E + EEE KSD+ D+I
Sbjct: 58 RLKGLGTMEAKEGEEEDEESDEAAQLRSCEVGEVEKVKIMEDMADSMSEEEAKSDIVDLI 117
Query: 103 NERERPIREKESKDDS 118
++ E + ES D+
Sbjct: 118 SDEEPKSQVDESTGDT 133
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 874 bits (2259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/902 (51%), Positives = 606/902 (67%), Gaps = 56/902 (6%)
Query: 203 EEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAM 262
EEE +++ ++++ E S+VD G+ + E++ A+ EE K+E +
Sbjct: 106 EEEAKSDIVDLISD--EEPKSQVDES------TGDTGTKDEKLDAIRIEESKEE----LL 153
Query: 263 DVEEV---KCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGKHSSTLCN 319
D E+ + ++ + E +V +EE K + E E + + +GK + +
Sbjct: 154 DSEDPSSHRTVDLAIHSELVDVKVDPSYEEESKETLRNESEE--LSTCADLGKAGKNVSS 211
Query: 320 GESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGSLSKDGGKR 373
E+ +D N G+ + ++P +RFTRS L+Q VE L+K +
Sbjct: 212 EEAANGSKSIIDVN--GQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVI 269
Query: 374 SDVTEVANDGVGGPVKQETV----MKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGSK 428
++ TE + + GP+ V K +KV +KF +KLK+ L++GILEG+ V YIRGSK
Sbjct: 270 TNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSK 329
Query: 429 VKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLR 488
+K G +GL GV+ GSGI CFC++CKG +VV+P +FELHAGSSNKRPPEYIYLE G TLR
Sbjct: 330 IKALGETGLGGVISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLR 389
Query: 489 DIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC--RVSFSNAGVEELMLLCKSCVEL 546
DIMN C++ + E+ ++ +G S +K+ CLNC R+ S+ G+ MLLC SC++
Sbjct: 390 DIMNACQNFSFDQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDS 447
Query: 547 KESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKM 606
K+ QA LSH E D +P+PN SS +
Sbjct: 448 KKPQAIDLL---SLSHYYMKE----------------FWADHLIITPKPNVLSKSSDTIT 488
Query: 607 KSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCC 664
KS S K HG+ITRKDLR+HKLVFEE L DG EV Y+ RG+K LVGYKKG GI C+CC
Sbjct: 489 KSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQKLLVGYKKGSGIFCSCC 548
Query: 665 NSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMD 724
NSEVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHELSI LS R FS +NDDLC IC D
Sbjct: 549 NSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICAD 608
Query: 725 GGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVD 784
GGDLLCCD CPR+FH DCV L IP+G W+C+YC N FQKEKFVE+NANA AAGR+ GVD
Sbjct: 609 GGDLLCCDGCPRSFHRDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGVD 668
Query: 785 PFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHG 844
P Q+ +RCIRIV+T + E+GGC LCR DF KS FG RTVILCDQCE+E+HVGCLK++
Sbjct: 669 PIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKENN 728
Query: 845 MEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFD 904
MEDL+ELP+GKW CC +C RI+ AL+KLV G EKLPE+ L ++KK E+ GS + D +
Sbjct: 729 MEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDVE 788
Query: 905 VRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYH 964
+RWRVL K + +SD TR+LLSKAVSIFHD FDPI++SAS D IP+M+YGR+ RGQ++
Sbjct: 789 IRWRVLNWKML-SSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEFG 847
Query: 965 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1024
G+YCA+LTVN+ VVS GIFRIFG E+AELPLVAT + QGQGYFQSL+ CIE+ LGFLNV
Sbjct: 848 GIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLNV 907
Query: 1025 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1084
K LVLP+A EA+++W NKFGFS + EE +++ Y +MIFQGTSMLQK VPK R++ +
Sbjct: 908 KNLVLPAADEAESLWINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPKYRVINSA 967
Query: 1085 VD 1086
+
Sbjct: 968 AN 969
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANGT +E FVVLS++R GLKREF FALKVQS ICGSLGRTR+ K+ + + P
Sbjct: 1 MANGTAPDE-FVVLSRVRTGLKREFAFALKVQSTICGSLGRTRSTKLHNAIPESPT--PK 57
Query: 61 EVKKLKTY--------ESRKKRKRQEQSVVVKETEDKR----------EEEVKSDVFDVI 102
+K L T + Q +S V E E + EEE KSD+ D+I
Sbjct: 58 RLKGLGTMEAKEGEEEDEESDEAAQLRSCEVGEVEKVKIMEDMADSMSEEEAKSDIVDLI 117
Query: 103 NERERPIREKESKDDS 118
++ E + ES D+
Sbjct: 118 SDEEPKSQVDESTGDT 133
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 872 bits (2254), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/812 (57%), Positives = 565/812 (69%), Gaps = 56/812 (6%)
Query: 290 EKKNDIERELVENGVLESSMVGKHSSTLCNGE----SNVAKSVAVDGNDEGKTVNVVVER 345
E+KND E L ++ E G S + NGE +N V VDG+ + K V E+
Sbjct: 204 ERKNDCE--LKKDDAREEGTSGLSSVLVKNGEGGDVNNSLHPVVVDGDIKCK---VEAEK 258
Query: 346 PLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVM----- 400
P RRFTRS L+ K+E S SD +V + G T P K+
Sbjct: 259 PFRRFTRSALKPKIETVDIS-------SSDGVKVDDRGSSSAAAATTTNTPTKMFSIDGS 311
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
+KF +KLK+ L+SGILEG V Y+RG+KV+GPG GL G+V+ SGI CFCDDCKG +VVT
Sbjct: 312 KKFPTKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVT 371
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
PA+F LHAGSSNKRPPEYI LENG TL D+MN CK+S L+TL++A+R+ G S KK+NF
Sbjct: 372 PAIFVLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNF 431
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPA 580
C NCR S + AG + +LC C LK+ QAGSA P + E +P SV
Sbjct: 432 CWNCRGSITGAGSRKSKVLCSQCFGLKDFQAGSA----PKTAKKERTAKPHSV------- 480
Query: 581 PSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEV 640
PE + + K S KS G++T+KD+R HKLVFEE L DG EV
Sbjct: 481 ------------PE----SSCNLLKSSLSGSKSQGRVTKKDIRTHKLVFEEEVLPDGTEV 524
Query: 641 GYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
GY+ +G+K L GYKKGFGI C+CCNSEVSPSQFEAHAGWASRRKP+ +IYTSNGVSLHEL
Sbjct: 525 GYYCQGKKLLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIYTSNGVSLHEL 584
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
+I LS R S KENDDLC IC DGG LLCCD CPRAFH +C+SLP IP G W+C+YC+N
Sbjct: 585 AISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPRGKWYCKYCLN 644
Query: 761 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 820
TF+KEKFVE NANA AAGR+ GVDP Q+ RCIRIV+T + E+GGCV CRG DF ++ F
Sbjct: 645 TFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT-F 703
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
G RTVI+CDQCE+E+HVGCLK+H M+DL+ELPKGKW CC C+RI+ ALQKLV RGEEKL
Sbjct: 704 GPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCCTGCERIHSALQKLVIRGEEKL 763
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
P++SL+ I KKHEES S++ DVRWR+L KK D+SD T ALLS AV+IFH+ FDPI
Sbjct: 764 PDSSLNFI-KKHEESASESGCSDDVRWRLL-SKKTDSSDVTEALLSDAVAIFHECFDPIT 821
Query: 941 ESASKL-----DLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPL 995
SK D IP+MV G + +GQD GMYCA+L VN VVVS + RIFGQELAELP+
Sbjct: 822 VDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLVNHVVVSVAVVRIFGQELAELPI 881
Query: 996 VATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNK 1055
VATS+ QGQGYFQ+LF CIEKLLGFLNVK LVLP+A E +IW NKFGF +T++E +
Sbjct: 882 VATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVGSIWKNKFGFGAITQDELME 941
Query: 1056 YRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
YR Y +M+FQG MLQKPVPKCRIVGKS G
Sbjct: 942 YRRRYQIMVFQGALMLQKPVPKCRIVGKSEGG 973
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 9/75 (12%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVL 57
MAN TD+++ ++K+R G KREFEFA + +SEI G LGRTR+ +V S N S
Sbjct: 1 MANSTDAKD--AAMAKVRPGHKREFEFAFRARSEIRGYLGRTRSSRVFSSPGNNGSNSYN 58
Query: 58 GPPEVKKLKTYESRK 72
G KKLK Y +K
Sbjct: 59 G----KKLKGYGIKK 69
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 843 bits (2178), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/1126 (46%), Positives = 671/1126 (59%), Gaps = 171/1126 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG-SLGRTRARKVQSNVDSGCVLGP 59
MA G DS+E FVVLS++R GLKREF FA+K QSEICG SLGRTRA K + V P
Sbjct: 1 MAKGADSDE-FVVLSRVRTGLKREFAFAMKAQSEICGGSLGRTRASKNR-------VEAP 52
Query: 60 PEVKKLKTYESRKKRKRQEQSVVVKETEDK-REEEVKSDVFDVINERERPIREKESKDDS 118
++ +RK+ ++ E+ K +ED EEEVKSDV D+ DD+
Sbjct: 53 -----VQPASARKRSRKSEEP---KTSEDAMSEEEVKSDVVDL------------QSDDN 92
Query: 119 ENMGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYGE 178
++G E A+ EE + E + E + EE+ D+V+ N EE
Sbjct: 93 NHVGESESAAMQVCEEEPKMLEPKPEPV---------ISEEEPKVLDDVI-NEEE----- 137
Query: 179 LDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLIGE- 237
+ E K E EP+V E EV E AE L E
Sbjct: 138 ------AVVAETLKGE--EPIVA-----------------ETLKEEVVDEMAEQPLCKEE 172
Query: 238 ----VKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKN 293
V +E E EE +K SD +A E+ CEE E E G V E D
Sbjct: 173 SEKGVSDEMAEQPLCEEESEKGVSDEMA---EQPLCEE-----EPEKGGVSEEKDS---- 220
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPL--RRFT 351
NGV + L N + + +GN + + + +E P RRFT
Sbjct: 221 --------NGV---------ALALVNDDGD-------EGNKKKRRMKKRLEMPQSERRFT 256
Query: 352 RSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET----VMKP-------RKVM 400
RS L+ K E + + V + +DGV G + +M P +
Sbjct: 257 RSALKVKSE------ETNDVEHVGVAGIDDDGVKGETEASAEASLLMTPPSSAKFSNSRL 310
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
+KF SKLK+ L +GILEG+ VMY++G+KV G GL+GV++ SG+ CFC C G +VVT
Sbjct: 311 KKFPSKLKDLLATGILEGLPVMYMKGAKVLFAGEKGLQGVIQDSGVLCFCKICNGVEVVT 370
Query: 461 PAVFELHAGSSNKRPPEYIYLENG---KTLRDIMNVCK--DSPLETLEKAVRMVLGSSSM 515
P VFELHAGS+NKRPPEYIY+ +G KTLRD+MN C D PLE++++AV+ +LG +M
Sbjct: 371 PTVFELHAGSANKRPPEYIYIHDGNCGKTLRDVMNACCCCDFPLESMDEAVQKLLGDFTM 430
Query: 516 KKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE--IKEP----------LSHS 563
KK++ CLNCR + GV +L +C C+ A ++ I +P + S
Sbjct: 431 KKSSICLNCRGACK--GVSKL--VCDLCLASPPQTAMASRKVISQPVQPRSPEPVVIQKS 486
Query: 564 SEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK-----MKSSSV--KSHGK 616
+ E QP S++ E P ++ + +PNS T K MK S+ KS G+
Sbjct: 487 LDNEVQPNSLDNEVPP-------NSLDNEVQPNSLDTGVQPKSFSNGMKHSASRGKSQGR 539
Query: 617 ITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAH 676
+TRKDLR+HKLVFE L DG E+ Y+ G+K LVGYKKG GI CTCCN +VS SQFEAH
Sbjct: 540 LTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGCGIFCTCCNEQVSASQFEAH 599
Query: 677 AGWASRRKPFQHIYTSNGVSLHELSIKLSLE-RPFSSKENDDLCGICMDGGDLLCCDSCP 735
AGWASRRKP+ HIYTSNG+SLHELSI LS + R FS+ +NDDLC IC DGGDLLCCD CP
Sbjct: 600 AGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLCCDGCP 659
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA-RAAGRIEGVDPFAQMVSRCI 794
RAFHIDCV LP IPSG+W+C+YC N FQK++ ++ NA AAGRI G D M RCI
Sbjct: 660 RAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKRCI 719
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 854
R+V+T + + GGC LC +F KS FG RTVI+CDQCE+EYHVGCLK+H ME+L++LP+G
Sbjct: 720 RVVKTVEVDHGGCALCSRPNFSKS-FGPRTVIICDQCEKEYHVGCLKEHNMENLEKLPEG 778
Query: 855 KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK 914
W C +C I+ AL LV E+ +P+ L +IKKKHEE + DV+WRV+ K
Sbjct: 779 NWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEKSLEIGAGLDVKWRVMNWKL 838
Query: 915 VDASDG---TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAIL 971
SD TR LLSKAV+IFH+RFDPI++S S D IP M++GR+ RGQD+ G+YCA+L
Sbjct: 839 DSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTMLFGRNIRGQDFSGIYCAVL 898
Query: 972 TVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPS 1031
TVN +VSAG+FR+FG E+AELPLVAT+ D QGQGYFQ LF CIE LLG LNVK LVLP+
Sbjct: 899 TVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLVLPA 958
Query: 1032 ASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
A EA++IWT KFGF+ + ++E NKY+ Y +MIFQGTS+LQKPVP+
Sbjct: 959 ADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPVPE 1004
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/1110 (45%), Positives = 658/1110 (59%), Gaps = 101/1110 (9%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MAN T++ E VV+S IR G+KREF +K Q+E S+G+ R + Q+ +G P
Sbjct: 1 MANRTEAGE--VVISSIRTGMKREFAMMMKAQAECGISIGQKRVTRSQNGALNGRASISP 58
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDSEN 120
+ +T S KR+++E +VV + NE+ E SKD+ E
Sbjct: 59 Q-NGSRTKSSTVKRRKKEANVVTADQS-----------LSHSNEKLDKTEELRSKDEGEK 106
Query: 121 MGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG--- 177
VE++ + SE E +D VV G E++KN V + E +
Sbjct: 107 -----------VEKLPLTSEWEEPKSD----VVDGASEDEKNG-TAVDSEIGEQETACVM 150
Query: 178 ---ELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDL 234
EL+ +GGS + + + V V+ E N L + E+ L
Sbjct: 151 EREELNSLGGSQKEKLINGDVAAKPVDVDNSGNGEENNGLVDCGGLFQKEL--------L 202
Query: 235 I-GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYE-PGRVQMEMDEEKK 292
I G+V + +EV D+SD V C G+ KE E G V M K
Sbjct: 203 INGDVAEKPKEV---------DKSDNDKQSDSLVNCGG-GVEKERELNGDVTEAMMNVVK 252
Query: 293 NDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGND--------EGKTVNVVVE 344
D E +L + ++E +M S C +S K+ G+ E V+ E
Sbjct: 253 TDNEEQL--DSLMEPTMEELPMSGDCKTKSEAIKADCAYGSALATVPESVENSKVSTSSE 310
Query: 345 RPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFY 404
+PLRRFTRS L+ K + S ++D + +G K E M + + K
Sbjct: 311 KPLRRFTRSCLKTKQQAMSASPAEDTKAEDALESDEASAIGTTSKLEMKMSKKVALVKIP 370
Query: 405 SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVF 464
+KLK L +G+LEG+ V Y+R +K +G GL+GV++GSGI CFC +C G +VVTP F
Sbjct: 371 TKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQGVIQGSGILCFCQNCGGTKVVTPNQF 430
Query: 465 ELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC 524
E+HAGSSNKRPPEYIYL+NGKTLRD++ CKD+P + LE A+R G+ +K+ FCLNC
Sbjct: 431 EMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEAAIRNATGAGDARKSTFCLNC 490
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEP-LSHSSEME---------------- 567
+ S A L C SC+ K+SQ +++ + S ++E
Sbjct: 491 KASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRDGQLEFIFLLNYYWADDLYKL 550
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P L+ SP+ + L +S E S+ T SK+ HG++TRKDLRMHKL
Sbjct: 551 GLPDLRGLQWSPSSNSVL-----KSTERMSSGTCPPSKV-------HGRLTRKDLRMHKL 598
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
VFE L DG + Y+VRG+K L GYKKG I C CC SEVSPSQFEAHAG ASRRKP+
Sbjct: 599 VFEGDVLPDGTALAYYVRGKKLLEGYKKGGAIFCYCCQSEVSPSQFEAHAGCASRRKPYS 658
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 747
HIYTSNGVSLHELSIKLS+ER SS ENDDLC IC DGGDLLCCD+CPRAFH +CVSLP
Sbjct: 659 HIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGDLLCCDNCPRAFHTECVSLPN 718
Query: 748 IPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGC 807
IP GTW+C+YC N F KEKF + +ANA AAGR+ G+D Q+ IRIV T E+G C
Sbjct: 719 IPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQITKCSIRIVDTLHAEVGVC 777
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 867
VLCR DF S FG +TVI+CDQCE+EYHV CL++H M+DL+ELPK KW CC +C I+
Sbjct: 778 VLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLKELPKDKWFCCKECNSIHY 837
Query: 868 ALQKLVDRGEEKLPETSLDVIKKK-HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
ALQKLV GE+ LP++ + +I +K ++ DN+++ DV+WR+L GK ++++ TR LS
Sbjct: 838 ALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSIN-DVKWRLLSGK--NSTEETRVWLS 894
Query: 927 KAVSIFHDRFDPIIESA-SKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 985
AVSIFHD FDPI +S+ S+LDLIP MVYGR+ + QD+ GM CAIL VN +VVSAG+ RI
Sbjct: 895 GAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLCAILMVNSLVVSAGVIRI 954
Query: 986 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1045
FG+E+AELPLVATS DCQG+GYFQSLF IE LL L VK LVLP+A EA++IWT KFGF
Sbjct: 955 FGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLVLPAAEEAESIWTKKFGF 1014
Query: 1046 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+T EE Y+++Y LMIFQGT+MLQK V
Sbjct: 1015 QHITPEELKHYKDNYQLMIFQGTAMLQKQV 1044
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/815 (53%), Positives = 565/815 (69%), Gaps = 27/815 (3%)
Query: 295 IERELVENGVLESSMVGKHSSTLCNGESNV----AKSVAVDGNDEG---KTVNVVVERPL 347
+E EL+E +E + S+ NG S ++S + G+ +G K VN V++PL
Sbjct: 254 VEEELLEQNNVEICLGLPSRSSQMNGHSLCLGLPSRSFQMSGHSQGVDKKAVNDTVDKPL 313
Query: 348 RRFTRSLLQQKVELAKGSL--SKDGGKRSDVTEVANDGVGGPVKQETVM------KPRKV 399
RRFTRSL++Q+ + +L + + DV AND + +V +P+K
Sbjct: 314 RRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRGRPKKF 373
Query: 400 MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
+R F +KLK+ + GILEG+ V Y+RG+KV+ G GL+GV+KGSG+ CFC C G QVV
Sbjct: 374 LRNFPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIGIQVV 433
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
+PA+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +KK++
Sbjct: 434 SPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LKKSS 492
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV--EL 575
CL+C+ + +++CKSC+E KE + ++ K + L+ SS P S+
Sbjct: 493 LCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSILRRS 552
Query: 576 EESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGG 633
+ SP S + +S EP + S SK S SHGK+TRKDLR+HKLVFE+
Sbjct: 553 KSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDI 612
Query: 634 LEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSN 693
L DG EVGYFV GEK LVGYKKGFGI C+CCN VSPS FEAHAG ASRRKPFQHIYT+N
Sbjct: 613 LPDGTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTN 672
Query: 694 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 753
GVSLHELS+ LS+++ FS ENDDLC IC DGG+L+CCD+CPR++H C SLP +PS W
Sbjct: 673 GVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERW 732
Query: 754 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCRG 812
C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CVLCRG
Sbjct: 733 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRG 792
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 872
FC+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN L L
Sbjct: 793 HSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNL 852
Query: 873 VDRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 929
+ RGEEKL L+ ++KK +EE+ D D+RWRVL G K+ +SD T+ LL+KA+
Sbjct: 853 IVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSG-KLTSSDDTKILLAKAL 911
Query: 930 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
SI H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+FG E
Sbjct: 912 SILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSE 971
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
LAELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ MT
Sbjct: 972 LAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMT 1031
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1084
+EE +YR DY +MIF GTSML+K VP V K+
Sbjct: 1032 DEEVKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 1066
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 800 bits (2067), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/886 (49%), Positives = 576/886 (65%), Gaps = 75/886 (8%)
Query: 234 LIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKN 293
LI E + + E +V+ EE+ D V + E + E + KE EP + E K
Sbjct: 123 LISEEQPKMLEPESVISEEEPKVLDDVINEEEAIVAETL---KEQEP-----IVPETLKE 174
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPL--RRFT 351
++ E+ E + K S NG VA ++ DG K++ +ERP RRFT
Sbjct: 175 EVVDEMAEQPLCIEESEEKDS----NG---VALALVNDGAKGKKSMKKRLERPQSERRFT 227
Query: 352 RSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET-------VMKPRKV----- 399
RS L+ K E + +D V G+ VK+ET + P V
Sbjct: 228 RSALKVKSE-----------ETNDGEHVGVAGISDGVKRETEAGASLVMTTPSSVKFSNR 276
Query: 400 --MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++KF +KL++ L +GILEG+ VMY++G KV G GL+GV++ SG+ CFC CKG +
Sbjct: 277 GKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEKGLQGVIQDSGVLCFCKICKGVE 336
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENG---KTLRDIMNVCK--DSPLETLEKAVRMVLGS 512
VVTP VFELHAGS+NKRPPEYIY+ +G KTLRD+MN C D PLE++++AV+ +LG
Sbjct: 337 VVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMNACCCCDFPLESMDEAVQKLLGD 396
Query: 513 SSMKKANFCLNCRVSFSNAGVEELMLLCKSCV--ELKESQAGSAEIKEPL---------- 560
+MKK++ CLNCR + GV L +C SC+ + + A + I +P+
Sbjct: 397 FTMKKSSICLNCRGACK--GVSRL--VCDSCLVSPAQTAVASNKGISQPVQPRSPEPVVI 452
Query: 561 --SHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK-----MKSSSV-- 611
S +E++P E++ + +G ++ + EP+S S K MK S+
Sbjct: 453 QKSLDNEVQPNSLHNEVQPNKLDTGMQPNSLDNGMEPDSLNNSMKPKSFSNGMKHSASRG 512
Query: 612 KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPS 671
KS G++TRKDLR+HKLVFE L DG E+ Y+ G+K LVGYKKG+GI CTCCN +VS S
Sbjct: 513 KSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQKLLVGYKKGYGIFCTCCNEQVSAS 572
Query: 672 QFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE-RPFSSKENDDLCGICMDGGDLLC 730
QFEAHAGWASRRKP+ HIYTSNG+SLHELSI LS + R FS+ +NDDLC IC DGGDLLC
Sbjct: 573 QFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLC 632
Query: 731 CDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA-RAAGRIEGVDPFAQM 789
CD CPRAFHIDCV LP IPSGTW+C+YC N FQK++ ++ NA AAGRI G D M
Sbjct: 633 CDGCPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELM 692
Query: 790 VSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQ 849
RCIR+V+T + + GGC LC +F KS FG +TVI+CDQCE+EYHVGCLKDH ME+L+
Sbjct: 693 NKRCIRVVRTLEVDHGGCALCSRPNFSKS-FGPQTVIICDQCEKEYHVGCLKDHNMENLE 751
Query: 850 ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
ELP G W C +C +I+ AL LV E+ +P+ L++IKKKHEE D DV+WRV
Sbjct: 752 ELPVGNWFCSGNCSQIHTALMDLVASKEKDVPDPLLNLIKKKHEEKSLDIGAGLDVKWRV 811
Query: 910 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 969
+ K S TR LLSKAV+IFH+RFDPI++S S D IPAM++GR+ RGQD+ G+YCA
Sbjct: 812 INWKLDSDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPAMLFGRNIRGQDFSGIYCA 871
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
+LTVN +VSAG+FR+FG E+AELPLVAT+ D QGQGYFQ LF CIE LLG LNVK LVL
Sbjct: 872 VLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLVL 931
Query: 1030 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
P+A EA++IWT KFGF+ + ++E NKY+ Y +MIFQGTS+LQKPV
Sbjct: 932 PAADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPV 977
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 79/146 (54%), Gaps = 22/146 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEIC-GSLGRTRARK--VQSNVDSGCVL 57
MA G DS+E FVVLS++R GLKREF FA+K QSEIC GSLGRTRA K V++ V
Sbjct: 1 MAKGADSDE-FVVLSRVRTGLKREFAFAMKAQSEICGGSLGRTRASKNRVEAPVQ----- 54
Query: 58 GPPEVKKLKTYESRKKRKRQEQSVVVKETEDK-REEEVKSDVFDVINERERPIREKESKD 116
PP RKR +S K ED EEEVKSDV D+ ++ E +
Sbjct: 55 -PPSA-----------RKRSRKSEEAKTLEDAMSEEEVKSDVVDLQSDDEPKNNNHVGES 102
Query: 117 DSENMGVGERGALMNVEEVKVVSERR 142
+S M V E +V ++SE +
Sbjct: 103 ESAAMQVCEEEPKSDVVLETLISEEQ 128
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 792 bits (2045), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/761 (52%), Positives = 513/761 (67%), Gaps = 63/761 (8%)
Query: 334 DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
DE K +++ +E L +RFTRS L+ K + + SL D + V ++ G V+ +
Sbjct: 1697 DESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SLESDYNFCNSVAIGVDEKTNGAVR--S 1753
Query: 393 VMKPRKVMRKFYSKL---------KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKG 443
+ P+K+ K K+ ++ LE+G+LEG V Y + KG L+G +KG
Sbjct: 1754 LTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTY--DGRKKG---YRLQGTIKG 1808
Query: 444 SGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE 503
+GI C C CKG++VV P+ FELHA S + +YIYL+NGK L D+++VCKD+PLETLE
Sbjct: 1809 NGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLE 1868
Query: 504 KAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHS 563
++ +GS +K++ P +
Sbjct: 1869 ATIQSAIGSFPVKRS---------------------------------------LPADEA 1889
Query: 564 SEMEPQPPS-VELEESPAPSGELTDTSNRSPEPNSAQTSSHSKM-KSSSVKSHGKITRKD 621
++M+P S ++ SPA S T R +P SS S + SS KS GKIT+KD
Sbjct: 1890 AKMDPLGNSCIKRNNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSENKSLGKITKKD 1949
Query: 622 LRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWAS 681
R+H+LVFEEGGL DG EV Y+ G+K L GYKKGFGI C CC+ EVS SQFEAHAGWAS
Sbjct: 1950 QRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWAS 2009
Query: 682 RRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHID 741
R+KP+ +IYTSNGVSLHEL+I LS R +S+++NDDLC IC DGG+LL CD CPRAFH
Sbjct: 2010 RKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRV 2069
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
C SLP IP W+CRYC N FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P+
Sbjct: 2070 CASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPE 2128
Query: 802 TELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 861
E+ CVLCRG DF KS FG RT+ILCDQCE+E+H+GCL+DH M+DL+ELP GKW CC +
Sbjct: 2129 AEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCLE 2188
Query: 862 CKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGT 921
C RI+ ALQKL RGEEKLP++ L+VIK+KHE G ++ D++VRWR+L GK AS T
Sbjct: 2189 CIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKL--ASPET 2246
Query: 922 RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAG 981
R LLS+AV+IFHDRFDPII+S + DLIPAMVYGR+ RGQD+ G+YCA++TVN VVSAG
Sbjct: 2247 RVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAG 2306
Query: 982 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1041
I R+FGQE+AELPLVATS D QG+GYFQ LF CIEKLL FLNV++ VLP+A EA+ IWT
Sbjct: 2307 ILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTK 2366
Query: 1042 KFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIV 1081
KFGF +T ++ ++YR + MI FQGT ML+K VP+ R +
Sbjct: 2367 KFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWRRI 2407
Score = 39.7 bits (91), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 13 VLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVLGPPEVKKLKT 67
V+S G+KRE FAL+V S+ CG +GRTR+ K Q+ ++ V KKLKT
Sbjct: 1525 VVSGFWKGMKRELAFALEVHSQ-CGLIGRTRSTKAQTQNETINCNGVAWNESNKKLKT 1581
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/907 (47%), Positives = 562/907 (61%), Gaps = 101/907 (11%)
Query: 210 CNQVLTNVEENEHSEVDREKAEND---LIGEVKNEFEEVVAV--------VEEEKKDESD 258
CN V N + ++ E ND I E KN ++ V+ +EE K DE
Sbjct: 35 CNGVAWNESNKKLKTLNDELEGNDSSDRIRETKNRWDGVIQYSRNKRLKRLEESKNDERR 94
Query: 259 RVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGKHSSTLC 318
+A EP + DEE+K D +V +VG
Sbjct: 95 TIAE----------------EPKDDESTTDEEQKTDENDPVVVEKPTGGYLVGPICEEEP 138
Query: 319 NGESNVA--KSVAVDGN---------DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSL 366
+S A K + DG+ DE K +++ +E L +RFTRS L+ K + + SL
Sbjct: 139 KSQSQKASIKDESNDGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SL 197
Query: 367 SKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL---------KNFLESGILE 417
D + V ++ G V+ ++ P+K+ K K+ ++ LE+G+LE
Sbjct: 198 ESDYNFCNSVAIGVDEKTNGAVR--SLTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLE 255
Query: 418 GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPE 477
G V Y + KG L+G +KG+GI C C CKG++VV P+ FELHA S + +
Sbjct: 256 GYPVTY--DGRKKG---YRLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAK 310
Query: 478 YIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELM 537
YIYL+NGK L D+++VCKD+PLETLE ++ +GS +K++
Sbjct: 311 YIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGSFPVKRS------------------- 351
Query: 538 LLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPS-VELEESPAPSGELTDTSNRSPEPN 596
P +++M+P S ++ SPA S T R +P
Sbjct: 352 --------------------LPADEAAKMDPLGNSCIKRNNSPATSIHRTSERARLLKPI 391
Query: 597 SAQTSSHSKMKSSS-VKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKK 655
SS S + +SS KS GKIT+KD R+H+LVFEEGGL DG EV Y+ G+K L GYKK
Sbjct: 392 PVTKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKK 451
Query: 656 GFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKEN 715
GFGI C CC+ EVS SQFEAHAGWASR+KP+ +IYTSNGVSLHEL+I LS R +S+++N
Sbjct: 452 GFGIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDN 511
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANAR 775
DDLC IC DGG+LL CD CPRAFH C SLP IP W+CRYC N FQ+EKFVE+NANA
Sbjct: 512 DDLCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAV 571
Query: 776 AAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREY 835
AAGR+ GVDP Q+ RCIRIV P+ E+ CVLCRG DF KS FG RT+ILCDQCE+E+
Sbjct: 572 AAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEF 630
Query: 836 HVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 895
H+GCL+DH M+DL+ELP GKW CC +C RI+ ALQKL RGEEKLP++ L+VIK+KHE
Sbjct: 631 HIGCLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERK 690
Query: 896 GSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG 955
G ++ D++VRWR+L GK AS TR LLS+AV+IFHDRFDPII+S + DLIPAMVYG
Sbjct: 691 GLESIADYNVRWRLLSGKL--ASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYG 748
Query: 956 RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCI 1015
R+ RGQD+ G+YCA++TVN VVSAGI R+FGQE+AELPLVATS D QG+GYFQ LF CI
Sbjct: 749 RNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSCI 808
Query: 1016 EKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKP 1074
EKLL FLNV++ VLP+A EA+ IWT KFGF +T ++ ++YR + MI FQGT ML+K
Sbjct: 809 EKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKG 868
Query: 1075 VPKCRIV 1081
VP+ R +
Sbjct: 869 VPEWRRI 875
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 133/558 (23%), Positives = 211/558 (37%), Gaps = 150/558 (26%)
Query: 21 LKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVLGPPEVKKLKT---------- 67
+KRE FAL+V S+ CG +GRTR+ K Q+ ++ V KKLKT
Sbjct: 1 MKRELAFALEVHSQ-CGLIGRTRSTKAQTQNETINCNGVAWNESNKKLKTLNDELEGNDS 59
Query: 68 ----------------YESRKKRKRQEQS------VVVKETED---KREEEVKSDVFD-V 101
Y K+ KR E+S + +E +D +EE K+D D V
Sbjct: 60 SDRIRETKNRWDGVIQYSRNKRLKRLEESKNDERRTIAEEPKDDESTTDEEQKTDENDPV 119
Query: 102 INERER------PIREKESKDDSENMGVGERGALMNVEEVKVVSERREEGNDEFGKV-VI 154
+ E+ PI E+E K S+ + ++E ND K+
Sbjct: 120 VVEKPTGGYLVGPICEEEPKSQSQKASI------------------KDESNDGSLKLQTA 161
Query: 155 GVEEEKKNECDEVLMNVEENKYGELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVL 214
G+ +E K E+ + +EE R + E V E N CN V
Sbjct: 162 GLIDESK----EIDIAMEEKL---------PKRFTRSALKSKEDTVESLESDYNFCNSVA 208
Query: 215 TNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGL 274
V+E + V R +G ++ KK ++V + + ++ E G+
Sbjct: 209 IGVDEKTNGAV-RSLTSPKKLG------------LKMSKKIALNKVPLTIRDLL--ETGM 253
Query: 275 GKEYEPGRVQMEMDEEKKN-DIERELVENGVLESSMVGKHSSTL---------CNGESNV 324
+ Y + D KK ++ + NG+L S + K S + C +
Sbjct: 254 LEGYP-----VTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHA 308
Query: 325 AKSVAVD-GNDEGKTVNVVVERPLRRFTRSLLQQKVEL-AKGSLSKDGGKRSDVTEVA-- 380
AK + +D G + ++V + PL ++ K SL D + D +
Sbjct: 309 AKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGSFPVKRSLPADEAAKMDPLGNSCI 368
Query: 381 --NDGVGGPVKQET----VMKPRKVMRKFYSKLKN---------------------FLES 413
N+ + + + ++KP V + S L N F E
Sbjct: 369 KRNNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEG 428
Query: 414 GILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG-SSN 472
G+ +G V Y G K L G KG GI C+C C+ V+ + FE HAG +S
Sbjct: 429 GLPDGTEVAYYAGGK------KLLDGYKKGFGIFCWCCHCE----VSASQFEAHAGWASR 478
Query: 473 KRPPEYIYLENGKTLRDI 490
K+P YIY NG +L ++
Sbjct: 479 KKPYSYIYTSNGVSLHEL 496
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/1106 (44%), Positives = 650/1106 (58%), Gaps = 124/1106 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
MA GT + E FVV R G KRE +F LK QSEICG SLGRTR + N +S V
Sbjct: 1 MAKGTATGE-FVV----RTGCKRELQFVLKSQSEICGGESLGRTRGSR-NLNGESRSV-- 52
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDS 118
EVKK R +S + K K EE + SD V + E E K D
Sbjct: 53 --EVKK-------SVRGSGLKSGIKKMRFSKDEEVLMSDTVGV------GLEEDEGKSD- 96
Query: 119 ENMGVGERGALMNVEEVK-VVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG 177
+++VEE K V + +E N E G N D V+ N E+
Sbjct: 97 ----------VVDVEETKGFVKDEGQEENVEIG-----------NGDDNVVTN--EDHLL 133
Query: 178 ELDGMGGSARTEEEK--NECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLI 235
E G TE EK + G+ V+ + E C L+ V E S + K E L+
Sbjct: 134 EQPEKGSLEVTEAEKVCSIGGDSVI--DREIVVACPAGLS-VLEKMASRSCKVKLERGLV 190
Query: 236 -----GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEE 290
+ +V + E DE D V + + + E+ +G G V EM+EE
Sbjct: 191 YAKPCKRLTRSMLKVEGIKSEVNVDE-DHVNPEKDAIDSEDNCVGVS---GSVACEMEEE 246
Query: 291 KKNDIERELVENGVLESSMVGKHSSTLCNG-ESNVAKSVAVDGNDEGKTVNVVVERPLRR 349
E+ SS + HS LC G S + + + K VN VV +PLRR
Sbjct: 247 LHEQNHAEICLGLPSRSSQMSGHS--LCLGLPSRSFQMSGLSQRVDKKAVNDVVAKPLRR 304
Query: 350 FTRSLLQQKVELAKGSLSKDGGKRSDVTEV---ANDGVGGPVKQETVM------KPRKVM 400
FTRSL++Q+ + +L + + +D+ EV AND +V +P+K +
Sbjct: 305 FTRSLVKQESDSDNPNLGNNT-ELADLGEVDMHANDVEMDDFHSPSVTTHNRRGRPKKFL 363
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
R F +KLK GILEG++V Y+RG+KV+ G GL+GV+KGSG+ CFC CKG QVV+
Sbjct: 364 RNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVVS 423
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
A++E+HA S+NKRPPEYI LE+G TLRD+MN CK++P TLE+ +R+V+G + +KK++
Sbjct: 424 TAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVGPN-LKKSSL 482
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKE--PLSHSSEMEPQPPSV--ELE 576
CLNC+ + +++CKSC+E KE + ++ K L+ SS P S+ +
Sbjct: 483 CLNCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSILSRSK 542
Query: 577 ESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGL 634
SP S + +S EP + S SK S S GK+TRKD+R+HKLVFE+ L
Sbjct: 543 SSPRQSNRQEQPTRKSTEPGVVPGTILSESKSSSIKSNSQGKLTRKDVRLHKLVFEDDIL 602
Query: 635 EDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG 694
DG EVGYFV GE VSPS FEAHAG ASRRKPFQHIYT+NG
Sbjct: 603 PDGTEVGYFVAGE--------------------VSPSSFEAHAGCASRRKPFQHIYTTNG 642
Query: 695 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
VSLHELS+ LS+++ FS ENDDLC IC DG C SL +PS W
Sbjct: 643 VSLHELSVALSMDQRFSIHENDDLCSICRDGV---------------CASLSSLPSERWS 687
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCRGR 813
C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIR+V + TEL CVLCRG
Sbjct: 688 CKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPSVCVLCRGH 747
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 873
FC+ F RTVI+CDQCE+E+HVGCLK+H + DL+ELP+ KW C DC++IN L L+
Sbjct: 748 SFCRLGFNSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEKINTTLGNLI 807
Query: 874 DRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVS 930
RGEEKL L+ I+ K +EES D+ D+RWRVL GK + +SD T+ LL+KAVS
Sbjct: 808 IRGEEKLTNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGK-LTSSDETKILLAKAVS 866
Query: 931 IFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
I H+RFDPI E+ ++ DLIPAMVYGR +GQD+ GMYC +L V++V+VS GIFR+FG EL
Sbjct: 867 ILHERFDPISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGIFRVFGSEL 926
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ MT+
Sbjct: 927 AELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTD 986
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPVP 1076
EE +YR DY +MIF GTSML+K VP
Sbjct: 987 EEVKEYRKDYSVMIFHGTSMLRKSVP 1012
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/683 (54%), Positives = 486/683 (71%), Gaps = 53/683 (7%)
Query: 375 DVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
+++ V N K+ + + PR V R+F +KLK L++GILE + V YIRGS+ +G G
Sbjct: 181 EISAVNNGEENTGTKRSSGLVPR-VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGE 239
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
SGLRGV+KGSGI C C+ CKG +VVTP +FELHAGSSNKRPPEYIYLENG +LR +MN
Sbjct: 240 SGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAW 299
Query: 495 KDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSA 554
K++ L++L++A+R+ +G S +KK+ FCLNC+ S AG+ +LC SC++LKESQA
Sbjct: 300 KNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNSKVLCLSCLQLKESQA--- 356
Query: 555 EIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS- 613
SP+ +D+ RSP+P++ S+ S K SS S
Sbjct: 357 -----------------------SPSQVTGSSDSHLRSPKPSTISRSAESVSKCSSSGSK 393
Query: 614 -HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE---KFLVGYKKGFGILCTCCNSEVS 669
+G++T+KDL +HKLVF E GL +G EVGY+VRG+ + LVGYK+G GI CTCCNSEVS
Sbjct: 394 SYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQVVTQLLVGYKRGSGIXCTCCNSEVS 453
Query: 670 PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLL 729
PSQFEAHAGWASRRKP+ HIYTSNGVSLHE SI LS R S +NDDLC IC+DGG+LL
Sbjct: 454 PSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLL 513
Query: 730 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM 789
CCD CPR FH +CVSL IP G W C++C N QKEKFVE+NANA AAGR+ GVDP Q+
Sbjct: 514 CCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQI 573
Query: 790 VSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQ 849
RCIRIV T E+GGC LCR +F +S FG RTV+LCDQCE+E+HVGCL++H M+DL+
Sbjct: 574 TKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLK 633
Query: 850 ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
E+PKGKW CC DCKRIN +LQKLV GEE+LP L IK+K+ +GS + D D++WR+
Sbjct: 634 EVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRL 693
Query: 910 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 969
+ G++ + + +LLS+A+SIFH++FDPI ++A + DL+P MV+G
Sbjct: 694 ICGRRASSIEAG-SLLSQALSIFHEQFDPIADAAGR-DLLPDMVHGSQ------------ 739
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
VVSA FRIFG+E+AELPLVAT +DCQGQGYFQ+LF C+E LLG L V++LVL
Sbjct: 740 -------VVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVL 792
Query: 1030 PSASEAQAIWTNKFGFSMMTEEE 1052
P+A A++IWTNKFGF+ +T+E+
Sbjct: 793 PAAEGAESIWTNKFGFNKVTQEQ 815
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/45 (73%), Positives = 36/45 (80%), Gaps = 1/45 (2%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRAR 45
MA GTDSEE FVVLS++R G KREF FA+K QS I GSLGRTR R
Sbjct: 1 MAKGTDSEE-FVVLSRVRPGCKREFAFAVKAQSAIAGSLGRTRTR 44
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/736 (51%), Positives = 497/736 (67%), Gaps = 50/736 (6%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 170 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSSALVPSTLKGFPTKL 216
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 217 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 276
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 277 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 335
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A A QP T
Sbjct: 336 ISEVGTSDSLVLCESCVGLKESHASPA--------------QP---------------TG 366
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
TS+RS +T++ SK SS K++G++T+KD+ +HKL F E L +G+EV Y+VRGE
Sbjct: 367 TSDRS-----TKTTTVSKCSSSGSKNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRGE 421
Query: 648 KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE 707
+ L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHELS+ L
Sbjct: 422 RLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLRG 481
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 767
R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+NT QK KF
Sbjct: 482 REPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGKF 541
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
VE NANA AAGR+ GVDP Q+ RCIRIV+ E GGC LCR +F S FG TV++
Sbjct: 542 VERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVMI 601
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDV 887
CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+LP+ L +
Sbjct: 602 CDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLRI 661
Query: 888 IKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLD 947
IKK++ GS + + D++WR+L G++ A++ +LLS+A+S+FH++F+PI ++ + D
Sbjct: 662 IKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAG-SLLSQALSLFHEQFNPIADAEGR-D 719
Query: 948 LIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1007
L+ MV+ S ++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +DCQGQGY
Sbjct: 720 LLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGY 779
Query: 1008 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQG 1067
FQ+L+ CIE+LL FL V +LVLP+A A+++W NKF F M +EE N D+ +M FQG
Sbjct: 780 FQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQG 839
Query: 1068 TSMLQKPVPKCRIVGK 1083
TSMLQKPVP+ R + +
Sbjct: 840 TSMLQKPVPEYRRISQ 855
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 1 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 54
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKR--EEEVKSD 97
P + L+ S KKR++ K ++R E+ V+S+
Sbjct: 55 PSRNRVLEN--SIKKRQKSSSLDSQKNNVEERFPEDRVRSN 93
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 728 bits (1880), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/814 (49%), Positives = 509/814 (62%), Gaps = 74/814 (9%)
Query: 324 VAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGS-----LSKDGGK 372
+AK + V+GN G + +RFTRS L KVE LA GS +S GG
Sbjct: 140 IAKDIKVEGNLPGWEI--------KRFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGG- 190
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGP 432
+ +E N K E M + + K ++ E+G+LEG+ V+Y+ VK
Sbjct: 191 --ETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGLLEGVPVIYM---GVKKA 245
Query: 433 GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
GLRG +K SGI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D++
Sbjct: 246 DDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQYICLENGKSLLDLLK 305
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAG 552
CK S +TLE V+ ++ SS +K C +C+ F ++ V ++ LC SC E K S+
Sbjct: 306 ACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSS-VGQVGPLCPSCEESKRSKWM 363
Query: 553 SAEIKEPLSH-SSEMEPQPPSVE------------------LEESPAPSGELTDTSNRSP 593
P S + P+ + ++ + S E T S RSP
Sbjct: 364 LTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKSKSSEYTSIS-RSP 422
Query: 594 EPNSAQTSSHSK----MKSSSVK--------------------SHGKITRKDLRMHKLVF 629
+ S +K M+ S+K + KIT KD R+HKLVF
Sbjct: 423 RSAPMRIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKLVF 482
Query: 630 EEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
EE GL DG EV YF RG+K L GYKKG GILC CCN VSPSQFE HAGW+SR+KP+ +I
Sbjct: 483 EEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAYI 542
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 749
YTSNGVSLHEL+I LS R +S+K+NDDLC IC+DGG+LL CD CPRAFH +C SL P
Sbjct: 543 YTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSSTP 602
Query: 750 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 809
G W+C++C N FQ+EKFVE+N NA AAGR+ GVDP Q+ RCIRIV+ +T+L GCVL
Sbjct: 603 RGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGCVL 662
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 869
CRG DF KS FG RT+ILCDQCE+E+HVGCLKDH M L+ELP+GKW C C RI+ AL
Sbjct: 663 CRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHSAL 722
Query: 870 QKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 929
QKL+ RG EKLP + L + +K E+ SD VD DV WR++ GK AS TR LLS+A+
Sbjct: 723 QKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGK--IASPETRLLLSEAI 780
Query: 930 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
+IFHDRFDPI++ S DLIPAMVYGR GQ++ GMYCAIL VN VVSA + R+FGQ+
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQD 840
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
+AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IWT KFGF +
Sbjct: 841 IAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIK 900
Query: 1050 EEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1082
++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 901 PDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 934
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/751 (49%), Positives = 491/751 (65%), Gaps = 48/751 (6%)
Query: 347 LRRFTRSLLQQKVELAK------GSLSKDGGKRSDVTEVANDGVGGPVKQ--ETVMKPRK 398
LR+ TRS KVE + ++ + + DV +A P K+ E M +
Sbjct: 137 LRQLTRSNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKI 196
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
+ +K E+G+LEG+ V+Y+ G K LRG +K GI C+C CKG +V
Sbjct: 197 ALDNIPMTVKELFETGLLEGVPVVYMGGKKA-----FCLRGTIKDVGILCYCSFCKGCRV 251
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
+ P+ FE+HA +R +YI ENGK+L D++N C++SPL++LE ++ + +K
Sbjct: 252 IPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKT 311
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEES 578
C C+ ++ V ++ LC SCVE KES P+ E
Sbjct: 312 FTCKRCKGTYPTILVGKVGPLCSSCVESKESNGS------------------PACE---- 349
Query: 579 PAPSGELTDTSNRSPEPNSAQTSSHSKMK--SSSVKSHGKITRKDLRMHKLVFEEGGLED 636
T+ +RS +P + S +S ++ SS K KIT KD R+HKLVFE+GGL D
Sbjct: 350 -------TNIKSRSSKPATVSKSLNSALEGVSSENKCQWKITTKDQRLHKLVFEDGGLPD 402
Query: 637 GAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS 696
G EV Y+ RG+K L+GYK+GFGILC CCN EVSPS FEAHAGWA+R+KP+ +IYTSNGVS
Sbjct: 403 GTEVAYYARGQKLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGVS 462
Query: 697 LHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 756
LHEL+I LS R +S+++NDDLC +C DGG L+ CD CPRAFH C SL IP G W C+
Sbjct: 463 LHELAISLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFCQ 522
Query: 757 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFC 816
+C N FQ+EKFVE+NANA AAGRI GVDP Q+ RCIRIV+ + EL GCVLCRG DF
Sbjct: 523 FCQNMFQREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYDFS 582
Query: 817 KSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRG 876
+S FG RT+ILCDQC +E+HVGCL+ H + +L+ELPKGKW CC DC RI+ AL+KL+ R
Sbjct: 583 RSGFGPRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLARE 642
Query: 877 EEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRF 936
E +P L+V+ KK+EE G + + DVRW++L GK AS T+ LLS+A++IF + F
Sbjct: 643 AEIIPNKLLEVVMKKNEEKGLETVNNIDVRWKLLTGK--SASPETKLLLSQALAIFQECF 700
Query: 937 DPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
DPI+++ + DLIP MVYG++ +GQDY GMYCA+L VN VVSA I RIFGQE+AELPLV
Sbjct: 701 DPIVDTTGR-DLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPLV 759
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
ATSN G+GYFQ LF IEKLL +L V ++VLP+A EA++IWT+KFGF + ++ +KY
Sbjct: 760 ATSNGNHGKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSKY 819
Query: 1057 RND-YPLMIFQGTSMLQKPVPKCRIVGKSVD 1086
R ++ F+GTSMLQK VP CRIV ++ +
Sbjct: 820 RKSCCQILTFKGTSMLQKAVPPCRIVNQNTE 850
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/632 (57%), Positives = 460/632 (72%), Gaps = 12/632 (1%)
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +KK++ CL
Sbjct: 1 MFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LKKSSLCL 59
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV--ELEES 578
+C+ + +++CKSC+E KE + ++ K + L+ SS P S+ + S
Sbjct: 60 SCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSILRRSKSS 119
Query: 579 PAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
P S + +S EP + S SK S SHGK+TRKDLR+HKLVFE+ L D
Sbjct: 120 PRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILPD 179
Query: 637 GAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS 696
G EVGYFV GEK LVGYKKGFGI C+CCN VSPS FEAHAG ASRRKPFQHIYT+NGVS
Sbjct: 180 GTEVGYFVAGEKMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGVS 239
Query: 697 LHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 756
LHELS+ LS+++ FS ENDDLC IC DGG+L+CCD+CPR++H C SLP +PS W C+
Sbjct: 240 LHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSCK 299
Query: 757 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCRGRDF 815
YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CVLCRG F
Sbjct: 300 YCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHSF 359
Query: 816 CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR 875
C+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN L L+ R
Sbjct: 360 CRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIVR 419
Query: 876 GEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 932
GEEKL L+ ++KK +EE+ D D+RWRVL G K+ +SD T+ LL+KA+SI
Sbjct: 420 GEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSG-KLTSSDDTKILLAKALSIL 478
Query: 933 HDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+FG ELAE
Sbjct: 479 HERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAE 538
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
LPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ MT+EE
Sbjct: 539 LPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEE 598
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1084
+YR DY +MIF GTSML+K VP V K+
Sbjct: 599 VKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 630
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 14/96 (14%)
Query: 398 KVMRKFYSKLKNFLESGIL-EGMSVMY-IRGSKVKGPGVSGLRGVVKGSGISCFCDDCKG 455
K+ RK K E IL +G V Y + G K+ L G KG GI C C
Sbjct: 159 KLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEKM-------LVGYKKGFGIHCSC----C 207
Query: 456 NQVVTPAVFELHAGSSNKRPP-EYIYLENGKTLRDI 490
N+VV+P+ FE HAG +++R P ++IY NG +L ++
Sbjct: 208 NKVVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHEL 243
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 723 bits (1866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/759 (49%), Positives = 483/759 (63%), Gaps = 108/759 (14%)
Query: 334 DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
DE K +++ +E L +RFTRS L+ K + + SL D + V ++ G V+ +
Sbjct: 1762 DESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SLESDYNFCNSVAIGVDEKTNGAVR--S 1818
Query: 393 VMKPRKVMRKFYSKL---------KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKG 443
+ P+K+ K K+ ++ LE+G+LEG V Y + KG L+G +KG
Sbjct: 1819 LTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTY--DGRKKG---YRLQGTIKG 1873
Query: 444 SGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE 503
+GI C C CKG++VV P+ FELHA S + +YIYL+NGK L D+++VCKD+PLETLE
Sbjct: 1874 NGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLE 1933
Query: 504 KAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHS 563
++ +GS +K++ P +
Sbjct: 1934 ATIQSAIGSFPVKRS---------------------------------------LPADEA 1954
Query: 564 SEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLR 623
++M+P S ++ + +P+ + TS R+ +D R
Sbjct: 1955 AKMDPLGNSC-IKRNNSPATSIHRTSERA---------------------------RDQR 1986
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+H+LVFEEGGL DG EV Y+ G+K L GYKKGFGI C CC+ EVS SQFEAHAGWASR+
Sbjct: 1987 LHRLVFEEGGLPDGTEVAYYAGGKKLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASRK 2046
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
KP+ +IYTSNGVSLHEL+I LS R +S+++NDDLC IC DGG+LL CD CPRAFH C
Sbjct: 2047 KPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVCA 2106
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
SLP IP W+CRYC N FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P+ E
Sbjct: 2107 SLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEAE 2165
Query: 804 LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
+ CVLCRG DF KS FG RT+ILCD QELP GKW CC +C
Sbjct: 2166 VSACVLCRGYDFSKSGFGPRTIILCD-------------------QELPSGKWFCCLECI 2206
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 923
RI+ ALQKL RGEEKLP++ L+VIK+KHE G ++ D++VRWR+L GK AS TR
Sbjct: 2207 RIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKL--ASPETRV 2264
Query: 924 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIF 983
LLS+AV+IFHDRFDPII+S + DLIPAMVYGR+ RGQD+ G+YCA++TVN VVSAGI
Sbjct: 2265 LLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGIL 2324
Query: 984 RIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKF 1043
R+FGQE+AELPLVATS D QG+GYFQ LF CIEKLL FLNV++ VLP+A EA+ IWT KF
Sbjct: 2325 RVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKKF 2384
Query: 1044 GFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIV 1081
GF +T ++ ++YR + MI FQGT ML+K VP+ R +
Sbjct: 2385 GFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWRRI 2423
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101203549
[Cucumis sativus]
Length = 946
Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/823 (48%), Positives = 510/823 (61%), Gaps = 83/823 (10%)
Query: 324 VAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGS-----LSKDGGK 372
+AK + V+GN G + +RFTRS L KVE LA GS +S GG
Sbjct: 140 IAKDIKVEGNLPGWEI--------KRFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGG- 190
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGP 432
+ +E N K E M + + K ++ E+G+LEG+ V+Y+ VK
Sbjct: 191 --ETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGLLEGVPVIYM---GVKKA 245
Query: 433 GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
GLRG +K SGI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D++
Sbjct: 246 DDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQYICLENGKSLLDLLK 305
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAG 552
CK S +TLE V+ ++ SS +K C +C+ F ++ V ++ LC SC E K S+
Sbjct: 306 ACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSS-VGQVGPLCPSCEESKRSKWM 363
Query: 553 SAEIKEPLSH-SSEMEPQPPSVE------------------LEESPAPSGELTDTSNRSP 593
P S + P+ + ++ + S E T S RSP
Sbjct: 364 LTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKSKSSEYTSIS-RSP 422
Query: 594 EPNSAQTSSHSK----MKSSSVK--------------------SHGKITRKDLRMHKLVF 629
+ S +K M+ S+K + KIT KD R+HKLVF
Sbjct: 423 RSAPMRIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKLVF 482
Query: 630 EEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
EE GL DG EV YF RG+K L GYKKG GILC CCN VSPSQFE HAGW+SR+KP+ +I
Sbjct: 483 EEDGLPDGTEVAYFARGQKLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAYI 542
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 749
YTSNGVSLHEL+I LS R +S+K+NDDLC IC+DGG+LL CD CPRAFH +C SL IP
Sbjct: 543 YTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSSIP 602
Query: 750 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 809
G W+C++C N FQ+EKFVE+N NA AAGR+ GVDP Q+ RCIRIV+ +T+L GCVL
Sbjct: 603 RGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGCVL 662
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 869
CRG DF KS FG RT+ILCDQCE+E+HVGCLKDH M L+ELP+GKW C C RI+ AL
Sbjct: 663 CRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHSAL 722
Query: 870 QKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 929
QKL+ RG EKLP + L + +K E+ SD VD DV WR++ GK AS TR LLS+A+
Sbjct: 723 QKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGK--IASPETRLLLSEAI 780
Query: 930 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ- 988
+IFHDRFDPI++ S DLIPAMVYGR GQ++ GMYCAIL VN VVSA + R+FGQ
Sbjct: 781 AIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQY 840
Query: 989 --------ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1040
++AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IWT
Sbjct: 841 CRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWT 900
Query: 1041 NKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1082
KFGF + ++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 901 EKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 943
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/818 (48%), Positives = 516/818 (63%), Gaps = 97/818 (11%)
Query: 295 IERELVENGVLESSMVGKHSSTLCNGESNV----AKSVAVDGNDEG---KTVNVVVERPL 347
+E EL+E +E + S+ NG S ++S + G+ +G K VN V++PL
Sbjct: 254 VEEELLEQNNVEICLGLPSRSSQMNGHSLCLGLPSRSFQMSGHSQGVDKKAVNDTVDKPL 313
Query: 348 RRFTRSLLQQKVELAKGSL--SKDGGKRSDVTEVAND---------GVGGPVKQETVMKP 396
RRFTRSL++Q+ + +L + + DV AND V P K+ +P
Sbjct: 314 RRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRG---RP 370
Query: 397 RKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
+K +R F +KLK+ + GILEG+ V Y+RG+KV
Sbjct: 371 KKFLRNFPAKLKDIFDCGILEGLIVYYVRGAKV--------------------------- 403
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
V+PA+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +K
Sbjct: 404 --VSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LK 460
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV- 573
K++ CL+C+ + +++CKSC+E KE + ++ K + L+ SS P S+
Sbjct: 461 KSSLCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSIL 520
Query: 574 -ELEESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+ SP S + +S EP + S SK S SHGK+TRKDLR+HKLVFE
Sbjct: 521 RRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFE 580
Query: 631 EGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY 690
+ L DG EVGYFV GE VSPS FEAHAG ASRRKPFQHIY
Sbjct: 581 DDILPDGTEVGYFVAGE--------------------VSPSTFEAHAGCASRRKPFQHIY 620
Query: 691 TSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS 750
T+NGVSLHELS+ LS+++ FS ENDDLC IC DG C SLP +PS
Sbjct: 621 TTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGV---------------CASLPSLPS 665
Query: 751 GTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVL 809
W C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CVL
Sbjct: 666 ERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVL 725
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 869
CRG FC+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN L
Sbjct: 726 CRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTL 785
Query: 870 QKLVDRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
L+ RGEEKL L+ ++KK +EE+ D D+RWRVL GK + +SD T+ LL+
Sbjct: 786 GNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGK-LTSSDDTKILLA 844
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 986
KA+SI H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+F
Sbjct: 845 KALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVF 904
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G ELAELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+
Sbjct: 905 GSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFT 964
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1084
MT+EE +YR DY +MIF GTSML+K VP V K+
Sbjct: 965 KMTDEEVKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 1002
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 704 bits (1817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/744 (50%), Positives = 500/744 (67%), Gaps = 30/744 (4%)
Query: 342 VVERPLRRFTRSLLQQKVELA-KGSLSKDGGKRSDVTEVANDGVGGPV-KQETVMKPRKV 399
VV R +R RS L+ KVE + +S+ ++ N +G P K E M + V
Sbjct: 75 VVARTRKRSRRSALEAKVECCDQMVVSETEQVVANGGSGINGALGAPRNKMELKMSKKIV 134
Query: 400 MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
+ + +K ++G L+G+SV+Y+ G K SGLRGV++ GI C C C G +V+
Sbjct: 135 VNRKPMTVKKLFDTGFLDGVSVVYMGGIK----KASGLRGVIRDGGILCSCCLCNGRRVI 190
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
P+ FE+HA +R +YI LENGK+L D++ C+ + L TLE V+ + S ++
Sbjct: 191 PPSQFEIHACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYF 250
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
C C+ F ++ VE + +C+SCVE ++S+ S + S P+P
Sbjct: 251 TCKRCKGCFPSSFVERVGPICRSCVESRKSEESSNNVVGKRVRS----PRPVV------- 299
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSV-KSHGKITRKDLRMHKLVFEEGGLEDGA 638
L++ S+ S S+Q H K ++ V S + D R+HKLVFEE GL DG
Sbjct: 300 -----LSNPSSTSELSVSSQVKRHRKKRTKLVFISISSVL--DQRLHKLVFEENGLPDGT 352
Query: 639 EVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 698
EV Y+ RG+K L G+K G GI+C CCN+E+SPSQFE HAGWASR+KP+ +IYTSNGVSLH
Sbjct: 353 EVAYYARGQKLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVSLH 412
Query: 699 ELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
EL+I LS +R +S+K+NDDLC +C DGG+LL CD CPRAFH +C +L IP G W+C++C
Sbjct: 413 ELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSIPRGDWYCQFC 472
Query: 759 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 818
N FQ+EKFV +NANA AAGR+EGVDP Q+ +RCIRIV+ + +L C LCRG DF +S
Sbjct: 473 QNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDFSRS 532
Query: 819 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
FG RT+ILCDQCE+EYHVGCL+DH M L+ELP+G WLCC DC RI+ L+ L+ +G E
Sbjct: 533 GFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVKGAE 592
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
+LPE+ L VIKKK EE G + + DVRWR+L GK AS TR LL +AVSIFH+ F+P
Sbjct: 593 RLPESLLGVIKKKQEEKGLEPII--DVRWRLLNGK--IASPETRPLLLEAVSIFHECFNP 648
Query: 939 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
I+++AS DLIPAMVYGR+ RGQ++ GMYCA+L VN VVSAG+ RIFG ++AELPLVAT
Sbjct: 649 IVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVAT 708
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR- 1057
SN G+GYFQ+LF CIE+LL FLNVK LVLP+A EA++IWT+KFGFS M +E YR
Sbjct: 709 SNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSKMNPDELTNYRK 768
Query: 1058 NDYPLMIFQGTSMLQKPVPKCRIV 1081
N + ++ F+GT+ML K VP CR++
Sbjct: 769 NCHQMVSFKGTNMLHKMVPSCRVI 792
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/725 (49%), Positives = 471/725 (64%), Gaps = 50/725 (6%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K E+G+LEG+ V+Y+ G K + GLRG +K GI C C C G +V+ P+ FE+
Sbjct: 14 VKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDVGILCSCAFCNGRRVIPPSQFEI 70
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N C+ +PL++LE ++ + +++ C C+
Sbjct: 71 HAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVERTFTCKRCKG 130
Query: 527 SFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVE--LEESPAPSGE 584
F + V ++ LC C E KES +S + P + + SG+
Sbjct: 131 IFPSICVGKIGPLCNLCAESKESHPTLTIGSSIISRYCQNLPSLILISWIINLKTITSGQ 190
Query: 585 ----LTDTSNR----SPE---------PNSAQTSSH--------------------SKMK 607
L S R SPE P S T S SK+
Sbjct: 191 FLLMLAHCSFRLSFLSPEQVLALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVA 250
Query: 608 SSSV----KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTC 663
S + + + KI+ +D R+H+LVFEEGGL DG E+ Y+ RG+K L GYK+GFGILC C
Sbjct: 251 SVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKLLGGYKRGFGILCHC 310
Query: 664 CNSEV-SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGIC 722
CN EV SPS FEAHAGWA+R+KP+ IYTSNGVSLH+L+I LS R +SS++NDDLC IC
Sbjct: 311 CNCEVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCIIC 370
Query: 723 MDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 782
DGGDLL CD CPRAFH C SL +PSG W+C++C NTFQ+EKFVE+NANA AAGR+
Sbjct: 371 ADGGDLLLCDGCPRAFHKGCASLSTVPSGDWYCQHCQNTFQREKFVEHNANAFAAGRVSE 430
Query: 783 VDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKD 842
+D Q+ RC RIV+ + EL GC LCRG DF +S FG RT+ILCDQCE+E+HVGCL+
Sbjct: 431 IDSIEQITKRCFRIVKNVEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRS 490
Query: 843 HGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVD 902
H M +L+ELPKG W CC DC RI+ LQKL+ RG EKLP++ L+ IKKKHEE G + +
Sbjct: 491 HKMANLKELPKGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISNS 550
Query: 903 FDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQD 962
DVRW +L GK AS + LLS+A+SIF + FDPI++S DLIP MVYG++ +GQD
Sbjct: 551 IDVRWTLLSGK--IASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQD 608
Query: 963 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1022
Y GMYCA+L VN +VSAGI R+FG+E+AELPLVAT N G+GYFQ LF CIEKLL FL
Sbjct: 609 YGGMYCAVLIVNSCIVSAGILRVFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAFL 668
Query: 1023 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIV 1081
NV+ LVLP+A EA++IW KFGF + E+ +KYR + M+ F+GTSMLQK VP C+IV
Sbjct: 669 NVQNLVLPAAEEAESIWIEKFGFQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAVPTCKIV 728
Query: 1082 GKSVD 1086
+S++
Sbjct: 729 NQSIE 733
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 25/121 (20%)
Query: 382 DGVGGPVKQETV-MKPRKVMRKFYSKLKN---------FLESGILEGMSVMYI-RGSKVK 430
D + P K +V + PRK RK Y K+ F E G+ +G + Y RG K+
Sbjct: 241 DLIAKPSKVASVHLSPRK--RK-YKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKLL 297
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG-SSNKRPPEYIYLENGKTLRD 489
G G +G GI C C +C +VV+P+ FE HAG ++ K+P IY NG +L D
Sbjct: 298 G-------GYKRGFGILCHCCNC---EVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHD 347
Query: 490 I 490
+
Sbjct: 348 L 348
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 686 bits (1769), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/688 (49%), Positives = 460/688 (66%), Gaps = 33/688 (4%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
V+ K + +K ++G+L+G+ V+Y+ K + LRG +K GI C C C G +V
Sbjct: 90 VVHKKPATVKELFQTGLLDGVPVVYV---GCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
+ P+ FE+HA + KR +YI LENGK++ ++M C+ +PL TLE ++ + S +K
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEES 578
C NCR F ++ VE + LLC SCVE ++S+ S SS++ SV+L+ +
Sbjct: 207 FTCKNCRGCFPSSNVERVGLLCLSCVESRKSEKSSIHAVGKRIRSSKL-----SVKLKTA 261
Query: 579 PAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGA 638
P S K S KS +I+++ R+HKL+FEE GL +GA
Sbjct: 262 PITS----------------------KCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGA 299
Query: 639 EVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 698
EV Y+ RG+K L G K GI+C CCN+E+SPSQFE HAGWASRRKP+ IYTSNGVSLH
Sbjct: 300 EVAYYARGQKLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYTSNGVSLH 359
Query: 699 ELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
EL+I LS + ++K+ND +C +C DGG+LL CD CPRAFH +C S+ IP G W+C+ C
Sbjct: 360 ELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQIC 419
Query: 759 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 818
+TF +E+ V +NA+A AAGR+EGVDP Q+ RCIRIV+ E+GGCVLCR DF +S
Sbjct: 420 QHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRS 479
Query: 819 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
FG RT+I+CDQCE+EYHVGCL+DH M L+ELP+G W CC DC RI+ L+ L+ R E
Sbjct: 480 GFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLENLLIRVAE 539
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
+LPE+ LDVIKKK + + DVRW++L GK AS TR LL +AVS+FH+ FDP
Sbjct: 540 RLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGK--IASPETRPLLLEAVSMFHECFDP 597
Query: 939 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
I++ A+ DLIPAMVYGR+ + QD+ GMYCA+L VN VVSAG+ RIFG+++AELPLVAT
Sbjct: 598 IVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAELPLVAT 657
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR- 1057
+G+GYFQ+LF CIE+LL FLNVK LVLP+A EA +IWT KFGFS M + YR
Sbjct: 658 RYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKPNQLTNYRM 717
Query: 1058 NDYPLMIFQGTSMLQKPVPKCRIVGKSV 1085
N + +M F+GT+ML K VP+CR++ +
Sbjct: 718 NCHQIMAFKGTNMLHKTVPQCRVINTQL 745
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/697 (50%), Positives = 453/697 (64%), Gaps = 51/697 (7%)
Query: 434 VSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV 493
VSGLRGV++ GI C C C+G +V++P+ FE+HA +R EYI ENGK+L D++
Sbjct: 32 VSGLRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRA 91
Query: 494 CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL-----MLLCKSCVELKE 548
C+ +PL LE ++ ++ S +K C C+ F ++ +E + + S E
Sbjct: 92 CRGAPLHDLEATIQNIVCSPPEEKYFTCKRCKGRFPSSCMERVGPICSSCVESSKSEESS 151
Query: 549 SQAGSAEIKEPL-------SHSSEMEPQPPSVELEESPA--------------------P 581
S I+ P S +SEM P P
Sbjct: 152 KNVVSKRIRSPRPVLVSKSSCASEMSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILP 211
Query: 582 SGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRK----------DLRMHKLVFEE 631
++T + + +T+S+S S +KS KIT+K D R+HKLVFEE
Sbjct: 212 RRKVTPKTKKKSLSVKLKTTSNSNCLSPQIKSEWKITKKLVPYSFPTCGDNRLHKLVFEE 271
Query: 632 GGLEDGAEVGYFVRGE------KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
GL DG+E+ Y+ G+ K L G+KKG GI+C CCN+E+SPSQFE HAGWASR+KP
Sbjct: 272 NGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRKKP 331
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
+ +IYTSNGVSLHELSI LS +R +S+ +NDDLC +C DGG+LL CD CPRAFH +C SL
Sbjct: 332 YAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGCPRAFHKECASL 391
Query: 746 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 805
IP G W+C++C N FQ+EKFV YN NA AAGR+EGVDP Q+ RCIRIV+ D EL
Sbjct: 392 SSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAELS 451
Query: 806 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
C LCRG DF KS FG RT+ILCDQCE+EYHVGCL+DH M L+ELPKG WLCC DC RI
Sbjct: 452 ACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCTRI 511
Query: 866 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 925
+ L+ ++ RG E+LP++ L VIKKK EE G D D +VRWR+L GKK AS TR LL
Sbjct: 512 HSTLENVLVRGAERLPKSLLAVIKKKQEEKGLDPINDINVRWRLLSGKK--ASPETRPLL 569
Query: 926 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 985
+AVSIFH+ FDPI+++ S DLI AMVYG+S RGQ++ GMYCA+L VN VVSAG+ RI
Sbjct: 570 LEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLIVNSSVVSAGMLRI 629
Query: 986 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1045
FG ++AELPLVATSN G+GYFQ+LF CIE+LL F+ VK LVLP+A EAQ+IWT+KFGF
Sbjct: 630 FGTDIAELPLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAAEEAQSIWTDKFGF 689
Query: 1046 SMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIV 1081
S + +E Y RN + FQGT+ML K VP CR++
Sbjct: 690 SKIKPDELANYRRNCNQFVTFQGTNMLHKMVPPCRVI 726
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 5/82 (6%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ +G + Y G ++ L G KGSGI C C N ++P+ FE+HAG
Sbjct: 269 FEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRC----CNTEISPSQFEVHAG 324
Query: 470 -SSNKRPPEYIYLENGKTLRDI 490
+S K+P YIY NG +L ++
Sbjct: 325 WASRKKPYAYIYTSNGVSLHEL 346
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/743 (48%), Positives = 471/743 (63%), Gaps = 83/743 (11%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 170 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSXAXVPSTLKGFPTKL 216
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 217 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 276
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 277 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 335
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A P P+ + ++ PS
Sbjct: 336 ISEVGTSDSLVLCESCVGLKESHA---------------SPAQPTGQQKQRLCPS----- 375
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
++H +++ I D+ +HKL F E L +G+EV Y+VRGE
Sbjct: 376 -------------AAHQEVR---------IMGGDVGLHKLAFGENDLPEGSEVSYYVRGE 413
Query: 648 -------KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
+ L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHEL
Sbjct: 414 VGTMRSKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHEL 473
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
S+ L R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+N
Sbjct: 474 SLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLN 533
Query: 761 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 820
T QK KFVE NANA AAGR+ GVDP Q+ RCIRIV++ E GGC LCR +F S F
Sbjct: 534 TLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKSQTDEAGGCALCRRHEFSTSGF 593
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
G TV++CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+L
Sbjct: 594 GPHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKGKWFCCRDCKDINSSLRKIVVRREEEL 653
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
P+ L +IKK++ GS + + D++WR+L G+ A++ +LLS+A+S+FH++F+PI
Sbjct: 654 PDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRXASATEAG-SLLSQALSLFHEQFNPIA 712
Query: 941 ESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1000
++ + DL+ MV+ S ++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +
Sbjct: 713 DAEGR-DLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRS 771
Query: 1001 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1060
DCQGQ V +LVLP+A A+++W NKF F M +EE N D+
Sbjct: 772 DCQGQ------------------VNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDF 813
Query: 1061 PLMIFQGTSMLQKPVPKCRIVGK 1083
+M FQGTSMLQKPVP+ R + +
Sbjct: 814 QMMTFQGTSMLQKPVPEYRRISQ 836
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 47/78 (60%), Gaps = 10/78 (12%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 1 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 54
Query: 59 PPEVKKLKTYESRKKRKR 76
P + L+ S KKR++
Sbjct: 55 PSRNRVLEN--SIKKRQK 70
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/734 (46%), Positives = 461/734 (62%), Gaps = 51/734 (6%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
++R+ ++ E+GIL+G+SV+Y+ K + GLRG++K GI C C C V
Sbjct: 256 LIRRRPETVRELFETGILDGLSVVYMGTVKSQA---FGLRGIIKDGGILCSCSSCDWAHV 312
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
++ + FE+HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKC 372
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEI----KEPLSHSSEME-----PQ 569
C C+ +F + + LC SC E++ SQA A + P +S ++ +
Sbjct: 373 FTCKRCKGAFPFSSLGHRGFLCMSCSEVETSQASPAAMWTSTSSPACIASPVKSRLKITR 432
Query: 570 PPSVELEESP---APSGELTDTSNR-------------SPEPN-SAQTSSHSKMK----S 608
PS + SP +P G T R S N S+Q SK K
Sbjct: 433 KPSESMSISPVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSKFKKMLTQ 492
Query: 609 SSVKSHG---------------KITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGY 653
SV ++TRKD +HKLVFE GGL +G E+GY+ RG+K L GY
Sbjct: 493 YSVTPKAVKSVSLSVSSKKRSYRLTRKDQGLHKLVFERGGLPEGTELGYYARGQKLLGGY 552
Query: 654 KKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK 713
K G GI C CC SEVSPS FEAHAGWASRRKP+ +IYTSNGVSLHE + S R +S+
Sbjct: 553 KMGAGIYCYCCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSAN 612
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
+N+DLC IC DGG+LL CDSCPRAFHI+CVSLP IP G WHC+YC N F E EYN N
Sbjct: 613 DNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVN 672
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
+ A G++EGVDP Q RCIR+V+ + E GCVLC G DFC+S FG RT+I+CDQCE+
Sbjct: 673 SSAVGQLEGVDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEK 732
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE 893
EYH+GCL + DL+ELPKG W C DC RIN LQKL+ G E L ++SL +I++K E
Sbjct: 733 EYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQE 792
Query: 894 ESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 953
+ + D D+RWR++ GK S +R LLS+A++IFHD FDPI++ S +LIP MV
Sbjct: 793 RTDVYSISDLDIRWRLISGKV--TSPESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRMV 850
Query: 954 YGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFC 1013
YG++ +GQDY G+ CA+LTVN VVSAG+ R+FG+E+AELPLVAT + +GYFQ LF
Sbjct: 851 YGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFS 910
Query: 1014 CIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQ 1072
CIEKLL LNV+++V+P+A EA+ +W NKFGF + E+ +KY + Y ++ F+G SMLQ
Sbjct: 911 CIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQ 970
Query: 1073 KPVPKCRIVGKSVD 1086
KPV +I+ K ++
Sbjct: 971 KPVHAHQIIDKKIE 984
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/716 (48%), Positives = 464/716 (64%), Gaps = 51/716 (7%)
Query: 374 SDVTEVANDGVGGPVKQETVMKPRK---VMRKFYSKLKNFLESGILEGMSVMYIRGSKVK 430
S++TEV +K T +K K V+ K +K ++G+L+G+ V+Y+ K
Sbjct: 77 SELTEVE-------LKTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYV---GCK 126
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDI 490
+ LRG +K GI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D+
Sbjct: 127 KDSTTELRGEIKDGGILCSCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDL 186
Query: 491 MNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCR----VSFSNAGVEELMLLCKSCVEL 546
M C+ +PL TLE ++ + S +K C +CR +S V ++L S L
Sbjct: 187 MRACRAAPLHTLEATIQNFINSPPEEKYFTCKSCRGPLGQYYSPIHVHVVLLNLNSVSLL 246
Query: 547 KESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKM 606
K +G E S SS++ SV+L+ P S K
Sbjct: 247 KLRNSGRQE----QSWSSKL-----SVKLKTVPITS----------------------KC 275
Query: 607 KSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNS 666
S KS +I+++ R+HKL+FEE GL +GAEV Y+ RG+K L G K GI+C CCN+
Sbjct: 276 LSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYYARGQKLLEGIKTRCGIVCRCCNT 335
Query: 667 EVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGG 726
EVSPSQFE HAGWASRRKP+ +IYTSNGVSLHEL+I LS + ++K+ND +C +C DGG
Sbjct: 336 EVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDGG 395
Query: 727 DLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPF 786
+LL CD CPRAFH +C S+ IP G W+C+ C +TF +E+ V YNA+A AAGR+EGVDP
Sbjct: 396 NLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDPI 455
Query: 787 AQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME 846
Q+ RCIRIV+ E+GGCVLCR DF +S FG RT+I+CDQCE+EYHVGCL+DH
Sbjct: 456 EQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKKA 515
Query: 847 DLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVR 906
L+ELP+G W CC DC I+ L+ L+ R E+LPE LDVIKKK E + + DVR
Sbjct: 516 YLKELPEGDWFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDVR 575
Query: 907 WRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGM 966
W++L GK AS TR LL +AVS+FH+ FDPI++ A+ DLIPAMVYGR+ + QD+ GM
Sbjct: 576 WKLLNGK--IASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGGM 633
Query: 967 YCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKT 1026
YCA+L VN VVSAG+ RIFG+++AELPLVAT +G+GYFQ+LF CIE+LL FLNVK
Sbjct: 634 YCALLIVNSSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVKN 693
Query: 1027 LVLPSASEAQAIWTNKFGFSMMTEEEQNKYR-NDYPLMIFQGTSMLQKPVPKCRIV 1081
LVLP+A EA++IWT KFGFS M ++ YR N + +M F+GT+ML K VP+CR+
Sbjct: 694 LVLPAAEEAESIWTEKFGFSKMKLDQLTNYRMNCHQIMAFKGTNMLHKTVPRCRVT 749
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/734 (45%), Positives = 457/734 (62%), Gaps = 51/734 (6%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
++R +++ E+G+L+G+SV+Y+ K + LRG+++ GI C C C V
Sbjct: 256 LIRSRPETVRDLFETGLLDGLSVVYMGTVKSQA---FPLRGIIRDGGILCSCSSCDWANV 312
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
++ + FE+HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEME---------PQ 569
C C+ F + + LCKSC E++ SQA A + S + + +
Sbjct: 373 FTCKRCKGPFPFSSLGHRGFLCKSCSEVETSQASLAATRTSTSAPACITSPVKSRLKITR 432
Query: 570 PPSVELEESPAPSGELTDTSNR----------------SPEPN-SAQTSSHSKMK----S 608
PS SP L +++ + S N S+Q SK K
Sbjct: 433 KPSESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFKKMLTQ 492
Query: 609 SSVKSHG---------------KITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGY 653
SV ++ RKD +HKLVF+ GGL +G E+GY+ RG+K L GY
Sbjct: 493 HSVTPKALKSVSLSVSSKKRSYRLARKDQGLHKLVFDRGGLPEGTELGYYARGQKLLGGY 552
Query: 654 KKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK 713
K G GI C CC EVSPS FEAHAGWASRRKP+ +IYTSNGVSLHE + S R +S+
Sbjct: 553 KMGAGIYCYCCKCEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSHGRKYSAN 612
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
+N+DLC IC DGG+LL CDSCPRAFHI+CVSLP IP G WHC+YC N F E EYN N
Sbjct: 613 DNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVN 672
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
+ A G++EGVDP Q+ RCIR+V+ + E GCVLC G DFC+S FG RT+I+CDQCE+
Sbjct: 673 SSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCEK 732
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE 893
EYH+GCL + DL+ELPKG W C DC RIN LQKL+ G EKL ++SL +I+ K E
Sbjct: 733 EYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQE 792
Query: 894 ESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 953
+ + D D+RWR++ GK S +R LLS+A++IFHD FDPI++ S +LIP MV
Sbjct: 793 RNDVYSISDLDIRWRLISGKV--TSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMV 850
Query: 954 YGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFC 1013
YG++ +GQDY G+ CA+LTVN VVSAG+ R+FG+E+AELPLVAT + +GYFQ LF
Sbjct: 851 YGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFS 910
Query: 1014 CIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQ 1072
CIEKLL LNV+++V+P+A EA+ +W NKFGF + E+ +KY + Y ++ F+G SMLQ
Sbjct: 911 CIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQ 970
Query: 1073 KPVPKCRIVGKSVD 1086
KPV +I+ K+++
Sbjct: 971 KPVDSHQIIDKTIE 984
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 291/464 (62%), Positives = 369/464 (79%), Gaps = 2/464 (0%)
Query: 614 HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQF 673
+G++T+KDL +HKLVF E GL +G EVGY+VRG++ LVGYK+G GI CTCCNSEVSPSQF
Sbjct: 363 YGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQQLLVGYKRGSGIFCTCCNSEVSPSQF 422
Query: 674 EAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDS 733
EAHAGWASRRKP+ HIYTSNGVSLHE SI LS R S +NDDLC IC+DGG+LLCCD
Sbjct: 423 EAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLLCCDG 482
Query: 734 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 793
CPR FH +CVSL IP G W C++C N QKEKFVE+NANA AAGR+ GVDP Q+ RC
Sbjct: 483 CPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKRC 542
Query: 794 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
IRIV T E+GGC LCR +F +S FG RTV+LCDQCE+E+HVGCL++H M+DL+E+PK
Sbjct: 543 IRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVPK 602
Query: 854 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 913
GKW CC DCKRIN +LQKLV GEE+LP L IK+K+ +GS + D D++WR++ G+
Sbjct: 603 GKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICGR 662
Query: 914 KVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTV 973
+ + + +LLS+A+SIFH++FDPI ++A + DL+P MV+G+S R D+ GMYCAILT+
Sbjct: 663 RASSIEAG-SLLSQALSIFHEQFDPIADAAGR-DLLPDMVHGKSTREWDFGGMYCAILTI 720
Query: 974 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1033
+ VVSA FRIFG+E+AELPLVAT +DCQGQGYFQ+LF C+E LLG L V++LVLP+A
Sbjct: 721 SSQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAAE 780
Query: 1034 EAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
A++IWTNKFGF+ +T+E++N +R DY ++ FQGT MLQK VP+
Sbjct: 781 GAESIWTNKFGFNKVTQEQRNNFRRDYQMVTFQGTLMLQKLVPR 824
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/166 (53%), Positives = 121/166 (72%), Gaps = 1/166 (0%)
Query: 375 DVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
+++ V N K+ + + PR V R+F +KLK L++GILE + V YIRGS+ +G G
Sbjct: 181 EISAVNNGEENTGTKRSSGLVPR-VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGE 239
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
SGLRGV+KGSGI C C+ CKG +VVTP +FELHAGSSNKRPPEYIYLENG +LR +MN
Sbjct: 240 SGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAW 299
Query: 495 KDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLC 540
K++ L++L++A+R+ +G S +KK+ FCLNC+ S AG+ +LC
Sbjct: 300 KNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNSKVLC 345
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/45 (73%), Positives = 36/45 (80%), Gaps = 1/45 (2%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRAR 45
MA GTDSEE FVVLS++R G KREF FA+K QS I GSLGRTR R
Sbjct: 1 MAKGTDSEE-FVVLSRVRPGCKREFAFAVKAQSAIAGSLGRTRTR 44
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 13/83 (15%)
Query: 410 FLESGILEGMSV-MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHA 468
F E+G+ EG V Y+RG ++ L G +GSGI C C C N V+P+ FE HA
Sbjct: 378 FGENGLPEGTEVGYYVRGQQL-------LVGYKRGSGIFCTC--C--NSEVSPSQFEAHA 426
Query: 469 GSSNKRPPE-YIYLENGKTLRDI 490
G +++R P +IY NG +L +
Sbjct: 427 GWASRRKPYLHIYTSNGVSLHEF 449
Score = 46.2 bits (108), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 36/56 (64%), Gaps = 5/56 (8%)
Query: 646 GEKFLVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSL 697
GE L G KG GILC+C + + V+P+ FE HAG +++R P ++IY NG SL
Sbjct: 238 GESGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPP-EYIYLENGTSL 292
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 333/701 (47%), Positives = 440/701 (62%), Gaps = 45/701 (6%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K E+G+LEG+ V+Y+ G K + GLRG +K +GI C C C G++V+ P+ FE+
Sbjct: 14 VKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDAGILCSCAFCNGHRVIPPSQFEI 70
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N C+ +PL++LE ++ + +++ C C+
Sbjct: 71 HAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVERTFTCKRCK- 129
Query: 527 SFSNAGVEELMLL--CKSCVELKESQAGSAEIKE--PLSHSSEMEP-QPPSVELEE---- 577
E+++ L K SQ + K+ P S +P + SV L
Sbjct: 130 -------EQVLALEYFKPASLSTSSQDNTPRKKKRKPEEQDSITKPSKSASVYLSSRKRK 182
Query: 578 ----SP------APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS----HGKITRKDLR 623
SP P L SP P ++ + S +D R
Sbjct: 183 YKKISPRLVCFFYPIDILFGLVMLSPFPFLWLVKIFVFIRKYAYLSPFCPFSGYQSQDQR 242
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+H+LVFEEGGL DG E+ Y+ RG+ + Y F L N S SQ + G+A
Sbjct: 243 LHRLVFEEGGLPDGTELAYYARGQVINITYSYPFTFLLLIVNKINSSSQ-KLLGGYA--- 298
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
+IYTSNGVSLHEL+I LS R +SS++NDDLC IC DGG+LL CD CPRAFH C
Sbjct: 299 ----YIYTSNGVSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPRAFHKGCA 354
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
S+P +PSG W+C+YC NTF++EK VE+NANA AAGR G+D Q+ RC RIV+ + E
Sbjct: 355 SIPTVPSGDWYCQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKRCFRIVKNIEAE 414
Query: 804 LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
L GC LCRG DF +S FG RT+ILCDQCE+E+HVGCL+ H M +L+ELPKG W CC DC
Sbjct: 415 LTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKGNWFCCMDCS 474
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 923
RI+ LQKL+ RG EKLP++ L+ IKKKHEE G + + + DVRW +L GK AS +
Sbjct: 475 RIHSTLQKLLIRGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSGK--IASPENKL 532
Query: 924 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIF 983
LLS+A+SIF + FDPI++S DLIP MVYG++ +GQDY GMYCA+LT+N +VSAGI
Sbjct: 533 LLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTINSSIVSAGIL 592
Query: 984 RIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKF 1043
R+FG+E+AELPLVAT N G+GYFQ LF CIEKLL FLNV+ LVLP+A EA++IWT KF
Sbjct: 593 RVFGEEVAELPLVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWTEKF 652
Query: 1044 GFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVGK 1083
GF + E+ NKYR M+ F+GTSMLQK VP CRIV +
Sbjct: 653 GFQKIKPEQLNKYRKSCCQMVRFEGTSMLQKAVPTCRIVNQ 693
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 335/754 (44%), Positives = 441/754 (58%), Gaps = 43/754 (5%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ D N+ + + +P RRFTRSLL+ KVE + + D T ++ V
Sbjct: 299 ISEDNNEASVDLALAPGKPQRRFTRSLLKVKVEARSTNNLLQSKEAIDSTSDSSRSVK-- 356
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + K + K S ++ L +G+LEGM V YI S V L+GV+ G I
Sbjct: 357 KMEMKMSKKVACLTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAV--LKGVITGCNIR 414
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G++ V FE HAGS+ K P ++IYL NG +LRD++ C+ SPLE+LEK +R
Sbjct: 415 CFCLSCNGSKDVCSYFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIR 474
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ + + CLNC S++ E LC+ C+E K+ Q
Sbjct: 475 SSIDPIAKRSYVNCLNCNEHLSSSQTEIFGSFLCQRCLEPKQHQ---------------- 518
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHK 626
+ P+PS SN S P+S ++ S GK+T KD +HK
Sbjct: 519 ----------DPPSPS--YACKSNSSLIPSSKDFLLKKTPLNTKGGSAGKVTTKDTGLHK 566
Query: 627 LVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 686
LVF+ L DG EV Y+V G++ + GY K I C CN VSPS FEAHAG +RRKP+
Sbjct: 567 LVFKV--LLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPY 624
Query: 687 QHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 746
+I+TSNGVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L
Sbjct: 625 DNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLS 684
Query: 747 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 806
G+PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LGG
Sbjct: 685 GVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLGG 742
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 866
C LC+ +DF S F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I
Sbjct: 743 CALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIR 802
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
+L K++ G L E+ +D+I+KKHE G + D+RWR+L G+K AS+ LLS
Sbjct: 803 SSLDKIISDGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLLS 860
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGI 982
AV I H FDPIIE S DLIP MV GR + GQDY GMYCA+LT+ VVSA +
Sbjct: 861 AAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAAL 920
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
R+ G E+AELPLVATS D QG GYFQ+LF CIE++L L +K +LP+A EA+ IW NK
Sbjct: 921 LRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNK 980
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
FGF+ + +E+ + Y N L IF GTS L K +P
Sbjct: 981 FGFTKIPQEQSDAYLNGAHLTIFHGTSNLYKAIP 1014
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 335/754 (44%), Positives = 442/754 (58%), Gaps = 45/754 (5%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ D N+ + + +P RRFTRSLL+ KVE + + D T ++ V
Sbjct: 310 ISEDNNEASVDLALAPGKPQRRFTRSLLKVKVEARSTNNLLQSKEAIDSTSDSSRSVK-- 367
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + K + K S ++ L +G+LEGM V YI S K + L+GV+ G I
Sbjct: 368 KMEMKMSKKVACLTKHPSNIRELLNTGLLEGMPVRYIIPSSKK----AVLKGVITGCNIR 423
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G++ V FE HAGS+ K P ++IYL NG +LRD++ C+ SPLE+LEK +R
Sbjct: 424 CFCLSCNGSKDVCSYFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIR 483
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ + + CLNC S++ E LC+ C+E K+ Q
Sbjct: 484 SSIDPIAKRSYVNCLNCNEHLSSSQTEIFGSFLCQRCLEPKQHQ---------------- 527
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHK 626
+ P+PS SN S P+S ++ S GK+T KD +HK
Sbjct: 528 ----------DPPSPS--YACKSNSSLIPSSKDFLLKKTPLNTKGGSAGKVTTKDTGLHK 575
Query: 627 LVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 686
LVF+ L DG EV Y+V G++ + GY K I C CN VSPS FEAHAG +RRKP+
Sbjct: 576 LVFKV--LLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKPY 633
Query: 687 QHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 746
+I+TSNGVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L
Sbjct: 634 DNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLS 693
Query: 747 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 806
G+PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LGG
Sbjct: 694 GVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLGG 751
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 866
C LC+ +DF S F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I
Sbjct: 752 CALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEIR 811
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
+L K++ G L E+ +D+I+KKHE G + D+RWR+L G+K AS+ LLS
Sbjct: 812 SSLDKIISDGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLLS 869
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGI 982
AV I H FDPIIE S DLIP MV GR + GQDY GMYCA+LT+ VVSA +
Sbjct: 870 AAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAAL 929
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
R+ G E+AELPLVATS D QG GYFQ+LF CIE++L L +K +LP+A EA+ IW NK
Sbjct: 930 LRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNK 989
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
FGF+ + +E+ + Y N L IF GTS L K +P
Sbjct: 990 FGFTKIPQEQSDAYLNGAHLTIFHGTSNLYKAIP 1023
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 321/721 (44%), Positives = 414/721 (57%), Gaps = 98/721 (13%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGI 446
K E M + + LK+ L +G+LEG V YI +G + LRGV+K GI
Sbjct: 143 KMELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAV------LRGVIKRVGI 196
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+ C D+ L+ L+ A+
Sbjct: 197 LCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAI 256
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ +G + K+ C C+ SF+ + LLC SC+E K SQ + K
Sbjct: 257 QNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQNSTRTSK--------- 307
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEP---NSAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ +P S + N SP NS+ S + S G+ITRKD
Sbjct: 308 --------IGRNPTSSAR--RSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKG 357
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+HKL F G L +G +VGY+V G++ L GY K FGI C CCN+ VSPSQFEAHAG A+RR
Sbjct: 358 LHKLAFMSGVLPEGTDVGYYVGGKRLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAARR 417
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
KP+ +IY SNGVSLHELS+ LS R S++++DDLC IC DGG+LL CDSCPRAFH +CV
Sbjct: 418 KPYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECV 477
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
IP GTW CRYC N Q+E + YN NA AAGRI+G+DP Q+ +R IRI TP T
Sbjct: 478 GFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTG 537
Query: 804 LGGCVLC-------------RGR--------------DFCKSRFGRRTVILCDQCEREYH 836
GGC LC R R DF K +F RTV+LCD
Sbjct: 538 FGGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCD------- 590
Query: 837 VGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG 896
Q LP+G W C ADC RI+ L+ L+ RG E + +++IK+K+E+
Sbjct: 591 ------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKA 638
Query: 897 SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 956
+ D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ DLIPAMVYG
Sbjct: 639 LNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG- 694
Query: 957 SHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIE 1016
VVSAG+FR+ G E+AELPLVATS D QG GYFQ+LF CIE
Sbjct: 695 ------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIE 736
Query: 1017 KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQGTS L K VP
Sbjct: 737 RLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLVP 796
Query: 1077 K 1077
K
Sbjct: 797 K 797
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 334/755 (44%), Positives = 441/755 (58%), Gaps = 59/755 (7%)
Query: 331 DGN-DEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVK 389
DG+ D +++E+P RRFTRSLL+ KVE SL DV + A+D VK
Sbjct: 143 DGSKDASVGFTLLLEKPQRRFTRSLLKTKVE---SSLVGS----DDVLDSASDSPPS-VK 194
Query: 390 QETVMKPRKV--MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + +KV + K ++ L +G+LEGM VMYI K + L+GV+ G I
Sbjct: 195 KMEMKMSKKVACLTKHPGNIRELLNTGMLEGMPVMYIIPHSKK----AVLKGVITGCNIR 250
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G + ++ FE HAGS+ K P +YIYL NG +LRD++ SPLE LE+ +R
Sbjct: 251 CFCLSCNGAKAISAYYFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIR 310
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEME 567
+ + CLNC + E +LC+ C+E K+ Q +PL+ S
Sbjct: 311 SSIDPVVKRSRINCLNCNELVLPSSHEN--VLCQVCLESKQPQ-------DPLTASY--- 358
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVK--SHGKITRKDLRMH 625
T N S + + + + SS K S GK+T KD R+H
Sbjct: 359 --------------------TCNGSSSLSRSSKEALLRNISSGKKGGSAGKVTNKDNRLH 398
Query: 626 KLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
KLVF L DG EV Y+V G++ + GY K I C CN VSPS FEAHAG SRRKP
Sbjct: 399 KLVFNV--LLDGTEVAYYVDGQRKVDGYIKDHRIYCNHCNRVVSPSAFEAHAGEGSRRKP 456
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
+ +I+TSNGVSLHEL++K+S + S +E DDLC C GGD+ C CPR+FH CV L
Sbjct: 457 YDNIFTSNGVSLHELAMKISKDMELSERETDDLCRECGQGGDIFPCKICPRSFHPACVGL 516
Query: 746 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 805
+P+ W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LG
Sbjct: 517 SKVPA-EWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLG 574
Query: 806 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
GC LC+ +DF + F RTVILCDQCE+EYHVGCL+ +L+ELP+ +W CC+ C
Sbjct: 575 GCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEEEWFCCSSCSET 634
Query: 866 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 925
+L K++ G + L + L++IKKKHE G D++W++L GK+ A++ LL
Sbjct: 635 RSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSGKR--ATEDGSILL 692
Query: 926 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAG 981
S AV IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA
Sbjct: 693 SAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAA 752
Query: 982 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1041
+ R+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW N
Sbjct: 753 LLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMN 812
Query: 1042 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
KFGFS ++ EE Y N L IF GTS + K VP
Sbjct: 813 KFGFSRISPEELEAYLNGAHLTIFHGTSYMYKAVP 847
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 577 bits (1486), Expect = e-161, Method: Compositional matrix adjust.
Identities = 330/738 (44%), Positives = 431/738 (58%), Gaps = 50/738 (6%)
Query: 346 PLRRFTRSLLQQKVELAKGS-LSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKV--MRK 402
P RRFTRSLL+ KVE S L + D T V K E M +KV + K
Sbjct: 465 PGRRFTRSLLKPKVEAPPASSLVVVPEEPVDSTPETPPSV---TKMEMKMS-KKVAFLSK 520
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
++ L +G+LEGM VMYI + K L+GV+ G I CFC C G++ +T
Sbjct: 521 HPGNTRDLLSTGLLEGMPVMYIIPNSKK----PVLKGVIAGCNIRCFCVKCDGSKTITTY 576
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
FELHAGSS K P EYIYL NG LRD++ C+ SPL++L+K ++ + ++ CL
Sbjct: 577 FFELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCL 636
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPS 582
NC + E+ LC C ES +PQ P+
Sbjct: 637 NCNGELPSQTEEQF--LCHDCC--PESN----------------QPQ----------DPT 666
Query: 583 GELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGY 642
L + + S S++ S +M +S S GK+T KD +HKLVF+ L DG EV Y
Sbjct: 667 SPLACSKSSSSLTPSSKESLLKRMSASKGASTGKVTTKDTGLHKLVFKV--LLDGTEVNY 724
Query: 643 FVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSI 702
+V G+K + GY K I C CN VSPS FEAHAG SRRKP+ +I+TSNGVSLHELS+
Sbjct: 725 YVDGQKKIDGYIKDQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELSM 784
Query: 703 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
+S + S +E DDLC C GGD+ C CPR+FH CV LP +PS W C C
Sbjct: 785 SISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTILV 844
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 822
QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+ +DF + F
Sbjct: 845 QKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKDFNNAVFDE 903
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++ G L E
Sbjct: 904 RTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISEGAHPLSE 963
Query: 883 TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIES 942
+ +D+I+KKHE G + ++RW+++ G+ A++ +LLS AV + H FDPIIE+
Sbjct: 964 SDVDIIRKKHESKGLVMDANTEIRWQLVAGRS--ATEDGNSLLSSAVPVIHQSFDPIIEA 1021
Query: 943 ASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
+ DLIP MV+GR + GQDY GMYCA+LTV VVSA + R+ G ++AELPLVAT
Sbjct: 1022 HTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVAT 1081
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
S D QG GYFQ LF CIE+LL L VK +LP+A EA+AIW KFGFS + +++ Y N
Sbjct: 1082 SMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYLN 1141
Query: 1059 DYPLMIFQGTSMLQKPVP 1076
L +F GT L K +P
Sbjct: 1142 GGHLTVFHGTLNLYKAIP 1159
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 301/586 (51%), Positives = 388/586 (66%), Gaps = 49/586 (8%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 232 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSSALVPSTLKGFPTKL 278
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 279 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 338
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 339 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 397
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A A QP T
Sbjct: 398 ISEVGTSDSLVLCESCVGLKESHASPA--------------QP---------------TG 428
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
TS+RS +T++ SK SS K++G++T+KD+ +HKL F E L +G+EV Y+VRGE
Sbjct: 429 TSDRS-----TKTTTVSKCSSSGSKNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRGE 483
Query: 648 KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE 707
+ L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHELS+ L
Sbjct: 484 RLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLRG 543
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 767
R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+NT QK KF
Sbjct: 544 REPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGKF 603
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
VE NANA AAGR+ GVDP Q+ RCIRIV+ E GGC LCR +F S FG TV++
Sbjct: 604 VERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVMI 663
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDV 887
CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+LP+ L +
Sbjct: 664 CDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLRI 723
Query: 888 IKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 933
IKK++ GS + + D++WR+L G++ A++ +LLS+A+S+FH
Sbjct: 724 IKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAG-SLLSQALSLFH 768
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 63 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 116
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKR--EEEVKSD 97
P + L+ S KKR++ K ++R E+ V+S+
Sbjct: 117 PSRNRVLE--NSIKKRQKSSSLDSQKNNVEERFPEDRVRSN 155
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 276/509 (54%), Positives = 351/509 (68%), Gaps = 5/509 (0%)
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P+ ++L + P EL+ RSP ++ + S S+ S G++TRKD +HKL
Sbjct: 234 PKRNKIKLTKIPKNLKELSPKIARSPTSSARVPKNFSPGAKST--SAGRLTRKDHGLHKL 291
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
VF G L +G +VGY+V G++ L GY K GI C CCN+ VSPSQFE HAG A+RRKP+
Sbjct: 292 VFLSGILPEGTDVGYYVGGKRLLDGYIKEPGIHCHCCNTVVSPSQFEGHAGRAARRKPYH 351
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 747
+IY SNGVSLHELS+ LS R S +++DDLC IC DGG+LL CD+CPRAFH +CV L
Sbjct: 352 NIYMSNGVSLHELSVSLSRGRKTSDRQSDDLCSICSDGGELLLCDTCPRAFHRECVDLTA 411
Query: 748 IPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGC 807
+P GTW CRYC Q+E + YN NA AAGRI+G+D Q+ +R IRI TP+T GGC
Sbjct: 412 VPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQIFTRSIRIATTPETGFGGC 471
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 867
LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+G W C +DC RI+
Sbjct: 472 ALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLTALPEGAWYCSSDCVRISE 531
Query: 868 ALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSK 927
++ L+ G E +P D+IKKK E+ G + D DVRWRVLR K +S+ ++ +LSK
Sbjct: 532 TMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDLDVRWRVLRDK---SSEDSKLVLSK 588
Query: 928 AVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
AV+IFH+ FDPII++ + DLIPAMVYGRS R QDY GMYCA+LTV VVSAG+FRI G
Sbjct: 589 AVAIFHESFDPIIQTTTGRDLIPAMVYGRSVRDQDYTGMYCAVLTVGNTVVSAGLFRIMG 648
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
+E AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A EA +IWT +FGFS
Sbjct: 649 REAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVLPAADEAVSIWTQRFGFSK 708
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
++ +E ++ +FQGTS L K +P
Sbjct: 709 ISRDELLEHLKGARTTVFQGTSTLHKLIP 737
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 398 KVMRKFYSKLKNFLESGIL-EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
++ RK + K SGIL EG V Y G K L G +K GI C C C N
Sbjct: 280 RLTRKDHGLHKLVFLSGILPEGTDVGYYVGGK------RLLDGYIKEPGIHCHC--C--N 329
Query: 457 QVVTPAVFELHAGSSNKRPPEY-IYLENGKTLRDI 490
VV+P+ FE HAG + +R P + IY+ NG +L ++
Sbjct: 330 TVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHEL 364
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 26/35 (74%)
Query: 9 EKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTR 43
+ FV+ S +RVGLKREF FA+ Q+ + SLGRTR
Sbjct: 2 DDFVLRSGVRVGLKREFAFAIASQAALAPSLGRTR 36
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 339/879 (38%), Positives = 501/879 (56%), Gaps = 91/879 (10%)
Query: 227 REKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVG---LGKEYEPGRV 283
+ ++EN E+K + E + +E E V + E C EV L +Y V
Sbjct: 21 KTESENSSRMELKRDHECIAGNIETEASPRKKPVKEALNEEGCSEVSNPILSPKYNASSV 80
Query: 284 QMEMDEEKKNDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVA-----VDGNDEG-K 337
Q I ++ E +++G+ +ST + ++V +S++ +G+ +G
Sbjct: 81 QT---------ITSQVAELASTNQAVLGEITST--SSGNSVPESLSDEEHSRNGSSDGVS 129
Query: 338 TVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDV-------TEVANDGVGGPVKQ 390
T VV+E P + S +K+ K S SK+ R+++ E N P
Sbjct: 130 TTQVVLEIP--KHVSSTGIRKITF-KFSKSKEAYNRTNMRVNTCWNLETRNLHFRAP-NM 185
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + + +K L +GIL+G V YI S+ K L+GV++ SG C C
Sbjct: 186 ELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE-----LQGVIRESGYLCGC 240
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C +V+T FE HAG + P +IYLENGK + I+ K +PL L++ ++ +
Sbjct: 241 SACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIA 300
Query: 511 GSS------SMKKANFCLNCRVSFSNAGVEELMLLCKSCV------ELKESQAGSAEIKE 558
GSS KA+F N V+ ++ +L + +++S GS +
Sbjct: 301 GSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGS---RL 357
Query: 559 PLSHSSEMEPQPPSVELEESPAPSGELT-DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKI 617
PL M+ E+T + + + +P+S S + K SS G I
Sbjct: 358 PLKQKELMK----------------EMTQERKHAAKKPSSYIYGSGLQHKKSS---EGAI 398
Query: 618 TRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHA 677
++D +H+L+F GL DGAE+ Y+V+G++ L GYK+G GI+C+ C+SEVSPSQFEAHA
Sbjct: 399 KKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHA 458
Query: 678 GWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRA 737
GWA+RR+P++HIYTSNG++LH+++I L+ + ++ ++DD+C +C DGGDL+ CD CPRA
Sbjct: 459 GWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRA 518
Query: 738 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 797
FH C+ L +P G W C C+ F ++ V AR RI+ R V
Sbjct: 519 FHPACLELQCLPEGDWRCPCCVENFCPDRKV-----ARPI-RIQ-----------LTRAV 561
Query: 798 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 857
+ P++E+GGCV+CR DF S+F RTV+LCDQCE+E+HVGCL+D G+ DL+ELPK KW
Sbjct: 562 KAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWF 621
Query: 858 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA 917
CC DC R+++ALQ L RG E +P + +I +K+ E G + D++W +L GK
Sbjct: 622 CCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCYK 681
Query: 918 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVV 977
LLS+ +IF + FDPI+ S+ + DLIP MVYGR+ GQ++ GMYC +L V
Sbjct: 682 EH--LPLLSRTTAIFRECFDPIVASSGR-DLIPVMVYGRNISGQEFGGMYCVVLLAKSTV 738
Query: 978 VSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQA 1037
VSAG+ R+FGQE+AELP+VATS + QG+G+F++LF CIE+LL L VKTLVLP+A EA+A
Sbjct: 739 VSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEA 798
Query: 1038 IWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
IWTNK GF M+EE KY + L IF+GTSML+K VP
Sbjct: 799 IWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVP 837
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 299/699 (42%), Positives = 428/699 (61%), Gaps = 60/699 (8%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + + +K L +GIL+G V YI S+ K L+GV++ SG C C
Sbjct: 241 ELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE-----LQGVIRESGYLCGC 295
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C +V+T FE HAG + P +IYLENGK + I+ K +PL L++ ++ +
Sbjct: 296 SACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIA 355
Query: 511 GSS------SMKKANFCLNCRVSFSNAGVEELMLLCKSCV------ELKESQAGSAEIKE 558
GSS KA+F N V+ ++ +L + +++S GS +
Sbjct: 356 GSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGS---RL 412
Query: 559 PLSHSSEMEPQPPSVELEESPAPSGELT-DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKI 617
PL M+ E+T + + + +P+S S + K SS G I
Sbjct: 413 PLKQKELMK----------------EMTQERKHAAKKPSSYIYGSGLQHKKSS---EGAI 453
Query: 618 TRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHA 677
++D +H+L+F GL DGAE+ Y+V+G++ L GYK+G GI+C+ C+SEVSPSQFEAHA
Sbjct: 454 KKRDNDLHRLLFMPNGLPDGAELAYYVKGQRILGGYKQGNGIVCSHCDSEVSPSQFEAHA 513
Query: 678 GWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRA 737
GWA+RR+P++HIYTSNG++LH+++I L+ + ++ ++DD+C +C DGGDL+ CD CPRA
Sbjct: 514 GWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPRA 573
Query: 738 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 797
FH C+ L +P G W C C+ F ++ V AR RI+ R V
Sbjct: 574 FHPACLELQCLPEGDWRCPCCVENFCPDRKV-----ARPI-RIQ-----------LTRAV 616
Query: 798 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 857
+ P++E+GGCV+CR DF S+F RTV+LCDQCE+E+HVGCL+D G+ DL+ELPK KW
Sbjct: 617 KAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKWF 676
Query: 858 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA 917
CC DC R+++ALQ L RG E +P + +I +K+ E G + D++W +L GK
Sbjct: 677 CCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCYK 736
Query: 918 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVV 977
LLS+ +IF + FDPI+ S+ + DLIP MVYGR+ GQ++ GMYC +L V
Sbjct: 737 EH--LPLLSRTTAIFRECFDPIVASSGR-DLIPVMVYGRNISGQEFGGMYCVVLLAKSTV 793
Query: 978 VSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQA 1037
VSAG+ R+FGQE+AELP+VATS + QG+G+F++LF CIE+LL L VKTLVLP+A EA+A
Sbjct: 794 VSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAEA 853
Query: 1038 IWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
IWTNK GF M+EE KY + L IF+GTSML+K VP
Sbjct: 854 IWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVP 892
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 554 bits (1427), Expect = e-154, Method: Compositional matrix adjust.
Identities = 330/807 (40%), Positives = 446/807 (55%), Gaps = 80/807 (9%)
Query: 307 SSMVGKHS----STLCNGES--NVAKSVAVD----GNDEGKTVNVVVERPLRRFTRSLLQ 356
++++G+H + C+G + NV +V D + V+ +P RRFTRSLL+
Sbjct: 421 TTVIGQHQREELTATCHGSNRPNVLDNVLADPPLVTDGAATPVSTSGLKPRRRFTRSLLK 480
Query: 357 QKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQE---TVMKPRK--------------- 398
K E + G SD + A+ + P ++ +++KP
Sbjct: 481 SKPEEDAVTSEVQG---SDDNKDASVDLAPPSERRFTRSLLKPPADTASPPSVKKMEMKM 537
Query: 399 -----VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDC 453
K +K L++G+LEGM VMYI + K + ++GV+ G I CFC C
Sbjct: 538 SKKVACFTKHPGNVKELLQTGLLEGMPVMYIIPNSKK----AVVKGVITGCNIRCFCIKC 593
Query: 454 KGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS 513
G++ ++ FELHAGS+ K P E+IYL NG +LRD++ C S LE+LE+ R +
Sbjct: 594 NGSRALSTYFFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPM 653
Query: 514 SMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSV 573
++ CLNC ++ E LC C++ K+ Q + S M P
Sbjct: 654 VIRSRPNCLNCGGHLPSSETEHF--LCHCCLDSKQPQDPPSPSYSCSKSDSSMTPS---- 707
Query: 574 ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGG 633
++SS S GK+T KD +HKLVF+
Sbjct: 708 ------------------------FKSSSVKISSIKKAGSSGKVTTKDTGLHKLVFKV-- 741
Query: 634 LEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSN 693
L DG EV Y+V G++ + GY K I C C+ VSPS FEAHAG SRRKP+ +I+TSN
Sbjct: 742 LLDGTEVAYYVDGQRKVDGYIKDQRIYCNHCSRVVSPSAFEAHAGEGSRRKPYDNIFTSN 801
Query: 694 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 753
GVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L PS W
Sbjct: 802 GVSLHELSMKISKDMELSERETDDLCRECGLGGDIFPCKMCPRSFHPACVRLSEFPS-EW 860
Query: 754 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 813
C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+ +
Sbjct: 861 FCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKRAIRIVPICD-DLGGCALCKKK 919
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 873
DF + F RTVILCDQCE+EYHVGCL+ DL+ELP G+W CC+ C I L K++
Sbjct: 920 DFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELPDGEWFCCSSCSEIRSCLDKMI 979
Query: 874 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 933
G + L + L++I+KKHE G D D+RW++L G+ A++ LLS AV I H
Sbjct: 980 SDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAGRS--ATEDGSLLLSSAVPIIH 1037
Query: 934 DRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
FDPIIE+ + DLIP MV GR + GQDY GMYCA++T+ VVSA + RI G +
Sbjct: 1038 QSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAVITLGSTVVSAALLRIMGGD 1097
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
+AELPLVATS D QG GYFQ LF C+E++L L +K +LP+A EA+AIW KFGFS +
Sbjct: 1098 VAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLPAAQEAEAIWMKKFGFSRIP 1157
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
+E+ Y N L +F GTS L K VP
Sbjct: 1158 QEQLEAYLNGAHLTVFHGTSNLYKAVP 1184
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 310/721 (42%), Positives = 402/721 (55%), Gaps = 118/721 (16%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGI 446
K E M + + LK+ L +G+LEG V YI +G + + LRGV+K GI
Sbjct: 142 KMELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKR------AVLRGVIKRVGI 195
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+ C D+ L+ L+ A+
Sbjct: 196 LCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAI 255
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ +G + K+ C C+ SF+ + LLC SC+E K SQ + K
Sbjct: 256 QNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQNSTRTSK--------- 306
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEP---NSAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ +P S + N SP NS+ S + S G+ITRKD
Sbjct: 307 --------IGRNPTSSAR--RSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKG 356
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+HKL F G L +G +VGY+V G+ VSPSQFEAHAG A+RR
Sbjct: 357 LHKLAFMSGVLPEGTDVGYYVGGK--------------------VSPSQFEAHAGRAARR 396
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
KP+ +IY SNGVSLHELS+ LS R S++++DDLC IC DGG+LL CDSCPRAFH +CV
Sbjct: 397 KPYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECV 456
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
IP GTW CRYC N Q+E + YN NA AAGRI+G+DP Q+ +R IRI TP T
Sbjct: 457 GFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTG 516
Query: 804 LGGCVLC-------------RGR--------------DFCKSRFGRRTVILCDQCEREYH 836
GGC LC R R DF K +F RTV+LCD
Sbjct: 517 FGGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCD------- 569
Query: 837 VGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG 896
Q LP+G W C ADC RI+ L+ L+ RG E + +++IK+K+E+
Sbjct: 570 ------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQKA 617
Query: 897 SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 956
+ D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ DLIPAMVYG
Sbjct: 618 LNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG- 673
Query: 957 SHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIE 1016
VVSAG+FR+ G E+AELPLVATS D QG GYFQ+LF CIE
Sbjct: 674 ------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCIE 715
Query: 1017 KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQGTS L K VP
Sbjct: 716 RLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLVP 775
Query: 1077 K 1077
K
Sbjct: 776 K 776
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 8 EEKFVVLSKIRVGLKREFEFALKVQSEICGSLG 40
E+KFV+ S +R GLKREF FA+ Q+ + +LG
Sbjct: 10 EDKFVLRSGVRTGLKREFAFAIASQAALSSTLG 42
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 287/694 (41%), Positives = 411/694 (59%), Gaps = 70/694 (10%)
Query: 386 GPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRG-SKVKGPGVSGLRGVVKGS 444
GP K E M + + + S +K L +GIL+G V Y+ S++K L+G++ G
Sbjct: 210 GPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK------LQGIINGG 263
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C +++ FE HAG + P +IYLENG+ + ++ K +PL L++
Sbjct: 264 GYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDE 323
Query: 505 AVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSS 564
+ V GSS V + +A A + S+
Sbjct: 324 VIMEVAGSS------------------------------VNMNSFEAWKASFHQ---DSA 350
Query: 565 EMEPQPPSVELEESPAPSGELTDTSNRSPEPN-SAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ + V+L + P P PN S H K K R+D
Sbjct: 351 NIVVENHDVKLPKLSHPV--------ERPNPNFSNAVLQHKKTAEKGTK------RRDND 396
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+H+L+F GL DGAE+ YFV+G++ L G+K+G GILC+ CN E+SPSQFEAHAG A+RR
Sbjct: 397 LHRLLFMPNGLPDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARR 456
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
+P++HIYT+NG++LH+++I L+ + ++ ++DD+C C +GGDL+ CD CPRA+H C+
Sbjct: 457 QPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCL 516
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
L +P G W C C + V N+ A + G + P +V R R+V+ P+ E
Sbjct: 517 HLQNVPEGVWSCPNCRDK------VGSNSKAISGGSLSFSKP---IVFRLTRVVKAPEYE 567
Query: 804 LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
+GGCV+CR DF ++F RTV+LCDQCERE+HVGCL+D G+ DL+ELPK KW CC +C
Sbjct: 568 IGGCVVCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECS 627
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGT 921
I++ALQ V G + +P++ D+I +KH G D A++ DVRW++L GK D
Sbjct: 628 NIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALN-DVRWQILSGKSRFPED-- 684
Query: 922 RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAG 981
LS+A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V +VVSAG
Sbjct: 685 LPFLSRATAIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAG 743
Query: 982 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1041
+ RIFG+E+AELP+VATS + QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT
Sbjct: 744 LLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTK 803
Query: 1042 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
K GF M+EE+ KY + L IF GTSML+K V
Sbjct: 804 KLGFRKMSEEQLIKYMREVQLTIFNGTSMLEKVV 837
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 296/731 (40%), Positives = 433/731 (59%), Gaps = 72/731 (9%)
Query: 393 VMKPRKVMRKFY-SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCD 451
V P+K++ Y S +K LE+GILEG V YI P V L+G++ G C C
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYI-----STPPVRELQGIIHSGGYLCGCT 213
Query: 452 DCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
C ++V++ FELHAG+ + P +I+LENG+ + +I+ K +P + LE+ +R V G
Sbjct: 214 TCSFSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAG 273
Query: 512 SS------------SMKKANFCLNC-----------RVSFSNAGVEELMLL------CKS 542
S+ + +N + C V + + V +L C+S
Sbjct: 274 SALNEEGLQAWKGDGQQVSNRVIACLTGTISRNIPLSVIYLDPVVTYHFVLLSETDSCES 333
Query: 543 CVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPN--SAQT 600
C L + + L+ P SVE P E T + EP + +
Sbjct: 334 CSYLGSGTGPGLDESQSLT--------PCSVENHYFP----EKTYAKDTLDEPKRIAKKL 381
Query: 601 SSHSKMKSSSVK-SHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGI 659
+SH K S G ++D +H+L+F GL DG E+ Y+V+ +K L GYK+G GI
Sbjct: 382 TSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLHGYKQGSGI 441
Query: 660 LCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLC 719
+C+CC+ E+SPSQFEAHAG A+RR+P++HI+ S+G+SLH++++ L+ ++ ++DD+C
Sbjct: 442 VCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMC 501
Query: 720 GICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN---TFQKEKFVEYNANARA 776
IC DGGDLL C CP+AFH C+ +P GTW+C C + + +K + + NAR
Sbjct: 502 SICGDGGDLLLCAGCPQAFHTACLKFQSVPEGTWYCSSCNDGPISSKKATATDPSGNARP 561
Query: 777 AGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYH 836
+V R R+V+ P++E+GGCV CR DF +F RTVILCDQCE+EYH
Sbjct: 562 ------------IVIRLSRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYH 609
Query: 837 VGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG 896
VGCL+++G+ DL+E+P+ KW CC+DC RI+ A+Q V G + +P LD+I +K E G
Sbjct: 610 VGCLRENGLCDLKEIPQEKWFCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREKG 669
Query: 897 --SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY 954
+DN D V WR+L GK LLS+A IF + FDPI+ + + DLIP MVY
Sbjct: 670 IFTDNG-DI-VEWRILSGKSRYPEH--LPLLSRAAVIFRECFDPIVAKSGR-DLIPVMVY 724
Query: 955 GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCC 1014
GR+ GQ++ GMYC +L VN +VVSA + RIFGQ++AELP+VATS + QG+GYFQ L+ C
Sbjct: 725 GRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQQVAELPIVATSREYQGRGYFQGLYAC 784
Query: 1015 IEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKP 1074
+E LL LNV+ LVLP+A EA++IWT KFGF+ M++++ +Y+ + L IF+GTSML+K
Sbjct: 785 VENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKK 844
Query: 1075 VPKCRIVGKSV 1085
VPK + +S
Sbjct: 845 VPKTTSLSEST 855
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 262/463 (56%), Positives = 320/463 (69%), Gaps = 23/463 (4%)
Query: 615 GKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFE 674
K +D MHK+VF G L++G +VGY+V G+ VSPSQFE
Sbjct: 438 AKNNSRDKGMHKVVFMSGVLQEGTDVGYYVGGK--------------------VSPSQFE 477
Query: 675 AHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSC 734
AHAG A+RRKP+ +IY SNGVSLHELSI L + S++++DDLC IC DGG LL CD+C
Sbjct: 478 AHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDTC 537
Query: 735 PRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCI 794
PRAFH +CVSL P GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R I
Sbjct: 538 PRAFHRECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSI 597
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 854
RI TP+T GGC LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+G
Sbjct: 598 RIATTPETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEG 657
Query: 855 KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK 914
W C DC RIN LQ L++ G E +P LDVIKKK E G + D DVRWRVL+ K
Sbjct: 658 AWYCSTDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDK- 716
Query: 915 VDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVN 974
+SD ++ +LSKAV+IFH+ FDPII+ ++ DLIPAMVYGRS R QDY GMYCA+LTVN
Sbjct: 717 --SSDDSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTVN 774
Query: 975 QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASE 1034
VVSAG+FRI G E+AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A E
Sbjct: 775 NTVVSAGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAEE 834
Query: 1035 AQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
A++IWT +FGF+ ++++E +Y +FQGTS L K V K
Sbjct: 835 AESIWTERFGFTKISQDELREYLKGGRTTVFQGTSNLHKLVAK 877
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 58/97 (59%)
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
QVV+P FE+HAGS+ K P +YI+LENG L D++ C + L+ LE AVR +G + K
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGS 553
+ C C+ SFS + L C SC+E K ++ S
Sbjct: 406 RTFRCKGCKSSFSTLCSGKFALFCDSCLESKGAKNNS 442
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 303/804 (37%), Positives = 455/804 (56%), Gaps = 76/804 (9%)
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERP------- 346
D+ + V++G++E S S + +++V S + + + V+E P
Sbjct: 70 DVSSQPVKSGLVECSGSDFGSEETVSDDASVVGSSQTEQSSDVLPSRFVLEIPKHLSSTG 129
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSK 406
+ + T L + K E L KD + V ++ K + V + S
Sbjct: 130 ITKITFKLSKPKKEFDDLPLIKDHTWDAGVVKMP--------------KKKIVSLSYPSN 175
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K LE+GILEG V YI P V L G++ G C C C ++V++ FE
Sbjct: 176 VKKLLETGILEGARVKYI-----STPPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQ 230
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HAG+ + P +I+LEN + + +I+ K +P LE+ +R V GS+ LN
Sbjct: 231 HAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSA--------LN--- 279
Query: 527 SFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSG--- 583
EE + K+ + Q+ S + ++ S + P ++ +S P
Sbjct: 280 -------EEGLRAWKASFQ----QSNSMSDRNYITDHSTVSYLGPGLDESQSLTPCSVEN 328
Query: 584 ----ELTDTSNRSPEPN--SAQTSSHSKMKSSSVK-SHGKITRKDLRMHKLVFEEGGLED 636
E T + EP + + +SH K S G ++D +H+L+F GL D
Sbjct: 329 HYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPD 388
Query: 637 GAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS 696
G E+ Y+V+ +K L GYK+G GI+C+CC+ E+SPSQFEAHAG A+RR+P++HI+ S+G+S
Sbjct: 389 GTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGLS 448
Query: 697 LHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 756
LH++++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +P GTW+C
Sbjct: 449 LHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCS 508
Query: 757 YCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 813
C + + +K + + NAR +V R R+V+ P++++GGCV CR
Sbjct: 509 SCNDGPISSKKATTTDPSGNARP------------IVIRLSRVVKAPESDIGGCVFCRSH 556
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 873
DF +F RTVILCDQCE+EYHVGCL+++G DL+E+P+ KW CC++C RI+ A+Q V
Sbjct: 557 DFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRIHTAVQNSV 616
Query: 874 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 933
G + LP LD+I +K E G + V WR+L GK LLS+A IF
Sbjct: 617 SCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEH--LPLLSRAAVIFR 674
Query: 934 DRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 993
+ FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIFGQE+AEL
Sbjct: 675 ECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQEVAEL 733
Query: 994 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1053
P+VATS + QG+GYFQ L+ C+E LL LNV+ LVLP+A EA++IWT KFGF+ M++++
Sbjct: 734 PIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQL 793
Query: 1054 NKYRNDYPLMIFQGTSMLQKPVPK 1077
+Y+ + L IF+GTSML+K VPK
Sbjct: 794 QEYQKEVQLTIFKGTSMLEKKVPK 817
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 302/699 (43%), Positives = 417/699 (59%), Gaps = 62/699 (8%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + + F S +K L +GIL+G V YI + L G++ G G C C
Sbjct: 203 ELKMSKKVLPNTFPSNVKKLLSTGILDGARVKYISPQRE-------LYGIIDGGGYLCGC 255
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V+T FELHAG+ + P +IYLENGK + I+ K +PL +++ ++
Sbjct: 256 PSCNFSRVLTAYEFELHAGAKTRHPNNHIYLENGKPICSIIQELKAAPLGAVDEVIKDAA 315
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE-IKEPLSHSSEMEPQ 569
GSS +N EE + K+ + G+ E L +S
Sbjct: 316 GSS--------IN----------EEFFQVWKASLHQCNGIIGADEKCYSMLPYSPHSLGS 357
Query: 570 PPSVELEESPAPSGELTDTSN---RSPEPNSAQTSSHSKMKSSSVKSHGKIT------RK 620
S LEES P SN R +S++ + + SS+ SH K T R+
Sbjct: 358 YSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRPSSL-SHPKKTNEGGTRRR 416
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWA 680
D +H+L+F GL DGAE+ Y+++G+K L GYK+G GI+C+CC+ E+SPSQFEAHAG A
Sbjct: 417 DNDLHRLLFMPNGLPDGAELAYYIKGQKMLAGYKQGNGIVCSCCDREISPSQFEAHAGMA 476
Query: 681 SRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHI 740
+RR+P++HIYTSNG++LH+++ L+ + ++ +DD+C C DGGDL+ C+SCPRAFH+
Sbjct: 477 ARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLIFCESCPRAFHL 536
Query: 741 DCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTP 800
C+ L +PS WHC C KF +R+ +V R R+V+TP
Sbjct: 537 VCLGLKYVPSDVWHCPNC------NKFGHGGNFSRS------------IVIRLTRVVKTP 578
Query: 801 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 860
+ E+GGCV CR DF F RTVILCDQCERE+HVGCL+D+G+ DL+E+PK W C
Sbjct: 579 EYEVGGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKEIPKDNWFCSN 638
Query: 861 DCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG---SDNAVDFDVRWRVLRGKKVDA 917
DC RI ALQ V G + +P L++I KH E G A DF +WR+L GK
Sbjct: 639 DCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDF--QWRILMGKSRYQ 696
Query: 918 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVV 977
D +LLS A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V VV
Sbjct: 697 ED--LSLLSAAAAIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVLLLVKNVV 753
Query: 978 VSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQA 1037
VSAG+ RIFG+++AELPLVATS + QG+GYFQ+LF CIE+LL LNV LVLP+A EA++
Sbjct: 754 VSAGLLRIFGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEEAES 813
Query: 1038 IWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
IWT +FGF MTEE+ ++Y + L IF+GTSML+K VP
Sbjct: 814 IWTRRFGFRKMTEEQLSQYTRELQLTIFKGTSMLEKEVP 852
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 282/687 (41%), Positives = 416/687 (60%), Gaps = 71/687 (10%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+G V YI PG L+G++ G G C C
Sbjct: 230 ELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYN-----PGKVELQGIIDGGGYLCGC 284
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENG+ + I+ K +PL L++ ++ V
Sbjct: 285 SMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIKNVA 344
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS ++ SF A E L+ + +++ S+S+++ P
Sbjct: 345 GSSVNEE---------SF-QAWKESLL-------------QSNGKVQAHKSYSTKLVGMP 381
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+N P S+ TS+ ++ S + G R+D +H+L+F
Sbjct: 382 -----------------HTNIRP---SSYTSNSGVLQKRS--ADGCTKRRDNDLHRLLFM 419
Query: 631 EGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY 690
GL DGAE+ Y+V+G+K L GYK+G GI+C CC+ E+SPSQFEAHAG A+RR+P++HIY
Sbjct: 420 PNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIY 479
Query: 691 TSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS 750
TSNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C+ L +P
Sbjct: 480 TSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPD 539
Query: 751 GTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 810
W C C++ N R + + + + R R+ +TP+ E+GGCV+C
Sbjct: 540 SGWQCLNCIDN---------AGNGRESSIVRPI------MIRLTRVDKTPEVEMGGCVVC 584
Query: 811 RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ 870
R DF ++F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC RI ALQ
Sbjct: 585 REHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYAALQ 644
Query: 871 KLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
V G E +P + ++I +KHE+ G + A++ D++WR+L GK LLS+A
Sbjct: 645 NSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMN-DIQWRILSGKSRYPEH--LPLLSRA 701
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 988
+IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN VVVSAG+ RIFG+
Sbjct: 702 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFGR 760
Query: 989 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1048
+AELPLVATS QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF M
Sbjct: 761 NVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKM 820
Query: 1049 TEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+E++ +K+ + L +F TSML+K V
Sbjct: 821 SEDQLSKHLREVQLTLFNKTSMLEKTV 847
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 285/690 (41%), Positives = 408/690 (59%), Gaps = 38/690 (5%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+ V YI S + L G++ G G C C
Sbjct: 206 ELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERE-----LDGIIDGGGYLCGC 260
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +IYLENGK + I+ K +PL ++ ++ V
Sbjct: 261 SSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 320
Query: 511 GSSSMKKANFCLNCRVSFSNA--GVEELMLLCKSCVELKESQAGSAEIKEPLSH-SSEME 567
GSS ++ ++ SNA G ++ C+ S +KE SS
Sbjct: 321 GSSINEEFFRVWKASLNQSNALVGADKKSYSELPCLPHSHVSYASQALKESFCPISSSFL 380
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
V + + SG TS R P+ S +K K ++ + ++D +H+L
Sbjct: 381 YNNNFVSQQTNMETSGVNKQTSKR---PSFYVPGSATKQKKTA---ESGVRKRDNDLHRL 434
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+F GL DG E+ Y+V+G+K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+P++
Sbjct: 435 LFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEIEISPSQFESHAGMSARRQPYR 494
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 747
HIYTSN ++LH+++I L+ + ++ DD+C C DGGDL+ C SCPRAFH C+ L
Sbjct: 495 HIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHD 554
Query: 748 IPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGC 807
P G WHC C F +V R R+V+TP+ ++GGC
Sbjct: 555 TPEGAWHCPNCNKLGHGGNFAR------------------PIVIRLTRVVKTPEYDVGGC 596
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 867
+CR DF F RTVILCDQCE+E+HVGCL++ G+ DL+E+PK W CC DC I +
Sbjct: 597 AVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYV 656
Query: 868 ALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALL 925
AL+ V G + +P + L+ I +KH E G D A +DV+W++L GK + D +LL
Sbjct: 657 ALRNSVSTGVQTIPVSLLNTINRKHVEKGLLVDEAA-YDVQWQILMGKSRNRED--LSLL 713
Query: 926 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 985
S A +IF + FDPI+ + DLIP MVYGR+ GQ++ GMYC +LTV VVVSAG+ RI
Sbjct: 714 SGAAAIFRECFDPIVAKTGR-DLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRI 772
Query: 986 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1045
FG+E+AELPLVAT+ + QG+GYFQ+LF CIE+LL LNV+ LVLP+A EA++IWT +FGF
Sbjct: 773 FGREVAELPLVATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFGF 832
Query: 1046 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
M+E + KY ++ L IF+GTSML+K V
Sbjct: 833 RKMSEGQLLKYTREFQLTIFKGTSMLEKEV 862
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 282/687 (41%), Positives = 416/687 (60%), Gaps = 71/687 (10%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+G V YI PG L+G++ G G C C
Sbjct: 233 ELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYN-----PGKVELQGIIDGGGYLCGC 287
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENG+ + I+ K +PL L++ ++ V
Sbjct: 288 SMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIKNVA 347
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS ++ SF A E L+ + +++ S+S+++ P
Sbjct: 348 GSSVNEE---------SF-QAWKESLL-------------QSNGKVQAHKSYSTKLVGMP 384
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+N P S+ TS+ ++ S + G R+D +H+L+F
Sbjct: 385 -----------------HTNIRP---SSYTSNTGVLQKRS--ADGCTKRRDNDLHRLLFM 422
Query: 631 EGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY 690
GL DGAE+ Y+V+G+K L GYK+G GI+C CC+ E+SPSQFEAHAG A+RR+P++HIY
Sbjct: 423 PNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHIY 482
Query: 691 TSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS 750
TSNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C+ L +P
Sbjct: 483 TSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVPD 542
Query: 751 GTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 810
W C C + N R + + + + R R+ +TP+ E+GGCV+C
Sbjct: 543 SGWQCLNCRDN---------AGNGRESSIVRPI------MIRLTRVDKTPEFEMGGCVVC 587
Query: 811 RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ 870
R DF ++F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC RI +ALQ
Sbjct: 588 REHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVALQ 647
Query: 871 KLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
V G E +P + ++I +KHE+ G + A++ D++WR+L GK LLS+A
Sbjct: 648 NSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMN-DIQWRILSGKSRYPEH--LPLLSRA 704
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 988
+IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN VVVSAG+ RIFG+
Sbjct: 705 AAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFGR 763
Query: 989 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1048
+AELPLVATS QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF M
Sbjct: 764 NVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRKM 823
Query: 1049 TEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+E++ +K+ + L +F TSML+K V
Sbjct: 824 SEDQLSKHLREVQLTLFNKTSMLEKTV 850
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 279/693 (40%), Positives = 411/693 (59%), Gaps = 49/693 (7%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V F + +K L +GIL+G +V YI PG L G++ G C C
Sbjct: 242 ELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYN-----PGKVELDGIIGDGGYLCGC 296
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENGK + I++ K + T ++ ++ V
Sbjct: 297 SMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKNVA 356
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS + + +F E L+ K K+ S I P +++S+
Sbjct: 357 GSS-INEGSF---------QVWKESLLQSNKKVPTQKKYSTKSTGI--PHTYNSQSIESA 404
Query: 571 PSV-------ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLR 623
S E+ + + +P++ S K S+ G ++D
Sbjct: 405 SSFSSLRVRNHFEQQMYVNQTADEWKRVVKKPSTYTYYSGIPQKRSA---DGCTKKRDND 461
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
+H+L+F GL DGAE+ Y+V+G+K L GYK+G GI+C CC+ E+SPSQFEAHAG A+RR
Sbjct: 462 LHRLLFMPNGLPDGAELAYYVKGQKLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARR 521
Query: 684 KPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
+P++HIY SNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C+
Sbjct: 522 QPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACL 581
Query: 744 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 803
L +P WHC C + E+ G P ++ R R+ + P+ E
Sbjct: 582 GLHSVPESGWHCLNCEDNTGDER---------------GARP---IMIRLTRVDKEPEYE 623
Query: 804 LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
+GGCV+CR DF +F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC
Sbjct: 624 VGGCVVCRANDFSVDKFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCN 683
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDF-DVRWRVLRGKKVDASDGTR 922
RI +ALQ V G + +P + ++I +KHE+ G D D++WR+L GK A
Sbjct: 684 RIYVALQNSVSAGADTIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAEH--L 741
Query: 923 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGI 982
LLS+A +IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN +VVSAG+
Sbjct: 742 PLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSAGL 800
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
RIFG+ +AELPLVATS + QG+GYFQ+LF CIE+LL LNV+ LVLP+A +A++IWT K
Sbjct: 801 LRIFGRNIAELPLVATSREHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWTKK 860
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
GF M+E++ K+ + L +F TS+L+K V
Sbjct: 861 LGFHKMSEDQLTKHLKEVQLTLFNKTSVLEKMV 893
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 511 bits (1316), Expect = e-141, Method: Compositional matrix adjust.
Identities = 271/689 (39%), Positives = 397/689 (57%), Gaps = 53/689 (7%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 182 FLSNVKKLLGTGILDGARVKYLSTSAARE-----LQGIIHSGGYLCGCTACDFSKVLGAY 236
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS---------- 512
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS
Sbjct: 237 EFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIAPPDVLEEVIRKVAGSALSEEGFQAW 296
Query: 513 --SSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
S + N + + + L+ S L ESQ+ + E ++
Sbjct: 297 KGSFQQDKNMTEDDSNHIMDHSFQSLVSYPGSGWSLDESQSSTPCFPEDNYFREKI---- 352
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVF 629
T + + +P + + +SH M S G ++D +H+L+F
Sbjct: 353 --------------CTKDTRHAHKPKAKKLTSHMFGMGCHKKVSGGGKWKRDNDLHRLLF 398
Query: 630 EEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
GL DG E+ Y+V+ +K L GYK+G GI+C+CC++++SPSQFEAHAG A RR+P++ I
Sbjct: 399 LPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGMAGRRQPYRRI 458
Query: 690 YTSNGVSLHELSIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 748
+ S+G+SLH++++ L+ ++ ++DD+C IC +GGDLL C CP+AFH C+ +
Sbjct: 459 HISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAFHTACLKFQSM 518
Query: 749 PSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDP-FAQMVSRCIRIVQTPDTELGGC 807
P GTW+C C N + DP +V R R+V+ P++E+GGC
Sbjct: 519 PEGTWYCSSC------------NDGPTSCKIATASDPNLKPIVIRLTRVVKAPESEIGGC 566
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 867
V CR DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW CC+DC RI+
Sbjct: 567 VFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCCSDCSRIHR 626
Query: 868 ALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSK 927
LQ G + +P LD I +K+ E G V WR+L GK LLS+
Sbjct: 627 VLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPEH--LPLLSR 684
Query: 928 AVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIFG
Sbjct: 685 AATIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFG 743
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
Q++AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++IWTNKFGF+
Sbjct: 744 QKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTNKFGFTK 803
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
MTE +Y+ + L IF+GTSML+K VP
Sbjct: 804 MTEHRLQRYQREVQLTIFKGTSMLEKKVP 832
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 264/623 (42%), Positives = 377/623 (60%), Gaps = 63/623 (10%)
Query: 456 NQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSM 515
++++ FE HAG + P +IYLENG+ + ++ K +PL L++ + V GSS
Sbjct: 214 TEILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSS-- 271
Query: 516 KKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVEL 575
V + +A A + S+ + + V+L
Sbjct: 272 ----------------------------VNMNSFEAWKASFHQ---DSANIVVENHDVKL 300
Query: 576 EESPAPSGELTDTSNRSPEPN-SAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGL 634
+ P P PN S H K K R+D +H+L+F GL
Sbjct: 301 PKLSHPV--------ERPNPNFSNAVLQHKKTAEKGTK------RRDNDLHRLLFMPNGL 346
Query: 635 EDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG 694
DGAE+ YFV+G++ L G+K+G GILC+ CN E+SPSQFEAHAG A+RR+P++HIYT+NG
Sbjct: 347 PDGAELAYFVKGQRILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTNG 406
Query: 695 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
++LH+++I L+ + ++ ++DD+C C +GGDL+ CD CPRA+H C+ L +P G W
Sbjct: 407 LTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVWS 466
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C + V N+ A + G + P +V R R+V+ P+ E+GGCV+CR D
Sbjct: 467 CPNCRDK------VGSNSKAISGGSLSFSKP---IVFRLTRVVKAPEYEIGGCVVCRRHD 517
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
F ++F RTV+LCDQCERE+HVGCL+D G+ DL+ELPK KW CC +C I++ALQ V
Sbjct: 518 FSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTVL 577
Query: 875 RGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 932
G + +P++ D+I +KH G D A++ DVRW++L GK D LS+A +IF
Sbjct: 578 NGAQIIPDSLSDLIIRKHVGKGLLVDEALN-DVRWQILSGKSRFPED--LPFLSRATAIF 634
Query: 933 HDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
+ FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V +VVSAG+ RIFG+E+AE
Sbjct: 635 RECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVAE 693
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
LP+VATS + QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF M+EE+
Sbjct: 694 LPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEEQ 753
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPV 1075
KY + L IF GTSML+K V
Sbjct: 754 LIKYMREVQLTIFNGTSMLEKVV 776
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 66/149 (44%), Gaps = 44/149 (29%)
Query: 343 VERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK 402
VERP F+ ++LQ K KG+ +D +D+ +
Sbjct: 307 VERPNPNFSNAVLQHKKTAEKGTKRRD----NDLHRLL---------------------- 340
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F+ +G+ +G + Y VKG + G G +G+GI C C N+ ++P+
Sbjct: 341 -------FMPNGLPDGAELAYF----VKGQRILG--GFKQGNGI--LCSHC--NREISPS 383
Query: 463 VFELHAGSSNKRPP-EYIYLENGKTLRDI 490
FE HAG + +R P +IY NG TL DI
Sbjct: 384 QFEAHAGMAARRQPYRHIYTTNGLTLHDI 412
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 271/687 (39%), Positives = 397/687 (57%), Gaps = 47/687 (6%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 175 FPSNVKKLLATGILDGARVKYLSISPARE-----LQGIIHSGGYLCGCTVCDFSKVLGAY 229
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS+ + + F
Sbjct: 230 EFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVAPPDVLEEVIRKVAGSA-LSEEGF-Q 287
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPS 582
+ SF +M + + + P S S E Q + E+
Sbjct: 288 AWKESFQQDDSNHIM-----------DYSFQSLVSYPGSGWSIDESQSSTPYFPENNYFR 336
Query: 583 GELTDTSNRS-PEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEV 640
+++ R +P + + +SH M + G ++D +H+L+F GL DG E+
Sbjct: 337 KKISTKDTRHEHKPKAKKVTSHMFGMGCHKKAAGGGKWKRDNDLHRLLFLPNGLPDGTEL 396
Query: 641 GYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
YFV+ +K L GYK+G GI+C+CC++E+SPSQFEAHAG A RR+P++HI+ S+G+SLH++
Sbjct: 397 AYFVKSQKLLQGYKQGSGIVCSCCDTEISPSQFEAHAGMAGRRQPYRHIHISSGLSLHDI 456
Query: 701 SIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +P GTW+C C
Sbjct: 457 AMSLADGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSC- 515
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQ-TPDTELGGCVLC------RG 812
N + DP + + I I + + C R
Sbjct: 516 -----------NDGPTSCKTATATDPNLKSIVGSIAIFSLSAHIRVLHSAYCFSPISDRS 564
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 872
DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW CC+DC RI+ ALQ
Sbjct: 565 LDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLCDLKGIPQDKWFCCSDCSRIHTALQSS 624
Query: 873 VDRGEEKLPETSLDVIKKKHEESGS--DNAVDFDVRWRVLRGKKVDASDGTRALLSKAVS 930
G + +P LD I +K+ E G DN + V WR+L GK A LLS+A +
Sbjct: 625 ASCGPQTIPTVLLDTISRKYREKGICIDNGDN--VEWRMLSGKSRYAEH--LPLLSRAAT 680
Query: 931 IFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIFGQ++
Sbjct: 681 IFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQKV 739
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++IWT KFGF+ MTE
Sbjct: 740 AELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTKKFGFTKMTE 799
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ KY+ + L IF+GTSML+K VP+
Sbjct: 800 HQLQKYQREVQLTIFKGTSMLEKKVPR 826
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 288/811 (35%), Positives = 438/811 (54%), Gaps = 85/811 (10%)
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERP------- 346
D+ + V++G++E S S + +++V S + + + V+E P
Sbjct: 61 DVSSQPVKSGLVECSGSDFGSEETVSDDASVVGSSQTEQSSDVLPSRFVLEIPKHLSSTG 120
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSK 406
+ + T L + K E L KD + V ++ K + V + S
Sbjct: 121 ITKITFKLSKPKKEFDDLPLIKDHTWDAGVVKMP--------------KKKIVSLSYPSN 166
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K LE+GILEG V YI P V L G++ G C C C ++V++ FE
Sbjct: 167 VKKLLETGILEGARVKYI-----STPPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQ 221
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS------------S 514
HAG+ + P +I+LEN + + +I+ K +P LE+ +R V GS+
Sbjct: 222 HAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSALNEEGLRAWKGDG 281
Query: 515 MKKANFCLNCR--VSFSNAGVEELMLLCKSCVELKESQA--GSAEIKEPLSHSSEMEPQP 570
+ +N + C + + LL V L E+ + + + L S + P
Sbjct: 282 QQVSNRVIACLTGIILRIIPLSVNALLSYHFVLLSETDSCESCSYLGPGLDESQSLTP-- 339
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVK-SHGKITRKDLRMHKLVF 629
+E + P+ + + +SH K S G ++D +H+L+F
Sbjct: 340 --CSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLF 397
Query: 630 EEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
GL DG E+ Y+V+ +++SPSQFEAHAG A+RR+P++HI
Sbjct: 398 MPNGLPDGTELAYYVK--------------------TQISPSQFEAHAGMAARRQPYRHI 437
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 749
+ S+G+SLH++++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +P
Sbjct: 438 FISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMP 497
Query: 750 SGTWHCRYCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 806
GTW+C C + + +K + + NAR +V R R+V+ P++++GG
Sbjct: 498 EGTWYCSSCNDGPISSKKATTTDPSGNARP------------IVIRLSRVVKAPESDIGG 545
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 866
CV CR DF +F RTVILCDQCE+EYHVGCL+++G DL+E+P+ KW CC++C RI+
Sbjct: 546 CVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRIH 605
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
A+Q V G + LP LD+I +K E G + V WR+L GK LLS
Sbjct: 606 TAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEH--LPLLS 663
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 986
+A IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIF
Sbjct: 664 RAAVIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIF 722
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
GQE+AELP+VATS + QG+GYFQ L+ C+E LL LNV+ LVLP+A EA++IWT KFGF+
Sbjct: 723 GQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFT 782
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
M++++ +Y+ + L IF+GTSML+K VPK
Sbjct: 783 KMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 813
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 234/468 (50%), Positives = 314/468 (67%), Gaps = 25/468 (5%)
Query: 620 KDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGW 679
KD +HKLVF+ GGL +G E+GY+ RG+ ++ + C SE + E
Sbjct: 492 KDQGLHKLVFDRGGLPEGTELGYYARGQTYITVDRN--------C-SEATKWALEYIVTV 542
Query: 680 ASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFH 739
AS + +IYTSNGVSLHE + S R +S+ +N+DLC IC DGG+LL CDSCPRAFH
Sbjct: 543 AS--ASYFYIYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAFH 600
Query: 740 IDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT 799
I+CVSLP IP G WHC+YC N F E EYN N+ A G++EGVDP Q+ RCIR+V+
Sbjct: 601 IECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVKN 660
Query: 800 PDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCC 859
+ E G S FG RT+I+CDQCE+EYH+GCL + DL+ELPKG W C
Sbjct: 661 MEAETNG-----------SGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFCS 709
Query: 860 ADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD 919
DC RIN LQKL+ G EKL ++SL +I+ K E + + D D+RWR++ GK S
Sbjct: 710 MDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKV--TSP 767
Query: 920 GTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVS 979
+R LLS+A++IFHD FDPI++ S +LIP MVYG++ +GQDY G+ CA+LTVN VVS
Sbjct: 768 ESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVVS 827
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
AG+ R+FG+E+AELPLVAT + +GYFQ LF CIEKLL LNV+++V+P+A EA+ +W
Sbjct: 828 AGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPLW 887
Query: 1040 TNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIVGKSVD 1086
NKFGF + E+ +KY + Y ++ F+G SMLQKPV +I+ K+++
Sbjct: 888 MNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQIIDKTIE 935
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 71/122 (58%), Gaps = 3/122 (2%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+++ E+G+L+G+SV+Y+ K + LRG+++ GI C C C V++ + FE+
Sbjct: 264 VRDLFETGLLDGLSVVYMGTVKSQA---FPLRGIIRDGGILCSCSSCDWANVISTSKFEI 320
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K C C+
Sbjct: 321 HACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKRFTCKRCKE 380
Query: 527 SF 528
F
Sbjct: 381 GF 382
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 262/697 (37%), Positives = 385/697 (55%), Gaps = 71/697 (10%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 182 FLSNVKKLLGTGILDGARVKYLSTSAARE-----LQGIIHSGGYLCGCTACDFSKVLGAY 236
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS---------- 512
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS
Sbjct: 237 EFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIAPPDVLEEVIRKVAGSALSEEGFQAW 296
Query: 513 --SSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
S + N + + + L+ S L ESQ+ + E ++
Sbjct: 297 KGSFQQDKNMTEDDSNHIMDHSFQSLVSYPGSGWSLDESQSSTPCFPEDNYFREKI---- 352
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVF 629
T + + +P + + +SH M S G ++D +H+L+F
Sbjct: 353 --------------CTKDTRHAHKPKAKKLTSHMFGMGCHKKVSGGGKWKRDNDLHRLLF 398
Query: 630 EEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
GL DG E+ Y+V+ +K L GYK+G GI+C+CC++++SPSQFEAHAG A RR+P++ I
Sbjct: 399 LPNGLPDGTELAYYVKSQKLLQGYKQGSGIVCSCCDTKISPSQFEAHAGMAGRRQPYRRI 458
Query: 690 YTSNGVSLHELSIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 748
+ S+G+SLH++++ L+ ++ ++DD+C IC +GGDLL C CP+AFH C+ +
Sbjct: 459 HISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAFHTACLKFQSM 518
Query: 749 PSGTWHCRYCMNTFQKEK---------FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT 799
P GTW+C C + K + NAN + P +
Sbjct: 519 PEGTWYCSSCNDGPTSCKIATASWLYTYFNLNANILVLHSAYSLSPISD----------- 567
Query: 800 PDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCC 859
R DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW CC
Sbjct: 568 -----------RSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCC 616
Query: 860 ADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD 919
+DC RI+ LQ G + +P LD I +K+ E G V WR+L GK
Sbjct: 617 SDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPEH 676
Query: 920 GTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVS 979
LLS+A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVS
Sbjct: 677 --LPLLSRAATIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVS 733
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
A + RIFGQ++AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++IW
Sbjct: 734 AALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIW 793
Query: 1040 TNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
TNKFGF+ MTE +Y+ + L IF+GTSML+K VP
Sbjct: 794 TNKFGFTKMTEHRLQRYQREVQLTIFKGTSMLEKKVP 830
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 224/468 (47%), Positives = 319/468 (68%), Gaps = 18/468 (3%)
Query: 613 SHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQ 672
S G ++D +H+L+F GL DG E+ Y+V+ +K L GYK+G GI+C+CC+ E+SPSQ
Sbjct: 70 SEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQKLLQGYKQGSGIVCSCCSREISPSQ 129
Query: 673 FEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCD 732
FEAHAG A+RR+P++HI+ S+G+SLH++++ L+ ++ ++DD+C IC DGGDLL C
Sbjct: 130 FEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCA 189
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQM 789
CP+AFH C+ +P GTW+C C + + +K + + NAR +
Sbjct: 190 GCPQAFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARP------------I 237
Query: 790 VSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQ 849
V R R+V+ P++++GGCV CR DF +F RTVILCDQCE+EYHVGCL+++G DL+
Sbjct: 238 VIRLSRVVKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLK 297
Query: 850 ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
E+P+ KW CC++C RI+ A+Q V G + LP LD+I +K E G + V WR+
Sbjct: 298 EIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRI 357
Query: 910 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 969
L GK LLS+A IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC
Sbjct: 358 LSGKSRYPEH--LPLLSRAAVIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCL 414
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
+L VN +VVSA + RIFGQE+AELP+VATS + QG+GYFQ L+ C+E LL LNV+ LVL
Sbjct: 415 VLIVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVL 474
Query: 1030 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
P+A EA++IWT KFGF+ M++++ +Y+ + L IF+GTSML+K VPK
Sbjct: 475 PAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 522
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 13/83 (15%)
Query: 410 FLESGILEGMSV-MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHA 468
F+ +G+ +G + Y++ K+ L+G +GSGI C C ++ ++P+ FE HA
Sbjct: 86 FMPNGLPDGTELAYYVKTQKL-------LQGYKQGSGIVCSC----CSREISPSQFEAHA 134
Query: 469 GSSNKRPP-EYIYLENGKTLRDI 490
G + +R P +I++ +G +L DI
Sbjct: 135 GMAARRQPYRHIFISSGLSLHDI 157
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 216/410 (52%), Positives = 268/410 (65%), Gaps = 16/410 (3%)
Query: 650 LVGYKKGFGILCTC--CN-SEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL 706
L G G I C C CN S+VSPS FEAHAG +RRKP+ +I+TSNGVSLHELS+K+S
Sbjct: 329 LKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNGVSLHELSMKISK 388
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 766
+ S +E DDLC C GGD+ C CPR+FH CV L G+PS W+C C N QKEK
Sbjct: 389 DMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVPS-EWYCDNCSNLVQKEK 447
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 826
+ N NA+AAGR GVD Q++ R IRIV ELG +DF S F RTVI
Sbjct: 448 ALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQ------KDFNNSVFDERTVI 501
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLD 886
LCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++ G L E+ +D
Sbjct: 502 LCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIISDGALILAESDID 561
Query: 887 VIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL 946
+I+KKHE G + D+RWR+L G+K AS+ LLS AV I H FDPIIE S
Sbjct: 562 IIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLLSAAVPIIHQSFDPIIEVQSGR 619
Query: 947 DLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
DLIP MV GR + GQDY GMYCA+LT+ VVSA + R+ G E+AELPLVATS D
Sbjct: 620 DLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEVAELPLVATSKDL 679
Query: 1003 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
QG GYFQ+LF CIE++L L +K +LP+A EA+ IW NKFGF+ + +E+
Sbjct: 680 QGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQEQ 729
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 109/253 (43%), Gaps = 64/253 (25%)
Query: 247 AVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLE 306
AV E K + +AM+++E G++ E + +++ ++++N + +
Sbjct: 198 AVAAETAKPD---MAMELQEPPTVTAANGRDVS--------HESFEQNLQHQVLDNALTD 246
Query: 307 SSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVEL-AKGS 365
S++ A+S A + G +P RRFTRSLL+ K E S
Sbjct: 247 PSLL--------------AESTATPASTAGL-------KPARRFTRSLLKNKPEEEPTAS 285
Query: 366 LSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIR 425
S+D V+ ++ D V + + P K R+F L L+ +L
Sbjct: 286 KSQDPA----VSMISEDNNEASV--DLALAPEKPQRRFTRSLLK-LQKAVL--------- 329
Query: 426 GSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPP-EYIYLENG 484
+GV+ G I CFC C G++ V+P+ FE HAG +R P + I+ NG
Sbjct: 330 ------------KGVIAGCNIRCFCLSCNGSK-VSPSAFEAHAGEGTRRKPYDNIFTSNG 376
Query: 485 KTLRDI-MNVCKD 496
+L ++ M + KD
Sbjct: 377 VSLHELSMKISKD 389
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 414 bits (1065), Expect = e-112, Method: Compositional matrix adjust.
Identities = 222/501 (44%), Positives = 302/501 (60%), Gaps = 30/501 (5%)
Query: 591 RSPEPNSAQTS-SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKF 649
RS E + Q S H+ S+S S I +D +HKLVF+E LEDGA VGYFV EK
Sbjct: 56 RSNESKNLQPSPKHTTQTSNSHASPTTINYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQ 115
Query: 650 LVG--YKKGFGILCTCC---------------NSEVSPSQFEAHAGWASRRKPFQHIYTS 692
L G K GILC CC + +VSPS+FEAHAGWASRRKP+ HI T+
Sbjct: 116 LQGEINIKQSGILCDCCKEVILEFFFCQIKKLDEQVSPSKFEAHAGWASRRKPYFHIRTT 175
Query: 693 NGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 752
+GVSLH+L+I + S +D+ C C G+LLCCD C RAFH+ C+ + P
Sbjct: 176 DGVSLHQLAINHRI----SISNSDEHCSKCKQRGNLLCCDGCQRAFHLGCIPVESPPKEK 231
Query: 753 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 812
W+C YC N QK+K VE+ N +I DP Q+ C V+ + E C LC
Sbjct: 232 WYCEYCRNKLQKDKNVEHKENVVTTQKIIESDPSEQIAKICTLSVKHKEVEHSSCALCSE 291
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 872
R F F TV++CDQCE++YHVGCLKDH M +L+++PK W C DC I++ L+
Sbjct: 292 RHFNNGEFSPWTVMICDQCEKDYHVGCLKDHNMANLKKVPKHYWFCGVDCYDIHMKLKNF 351
Query: 873 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 932
+ RG+ L ++ L +IK K E+ G + D++W+V + + + T +LLS V+IF
Sbjct: 352 MARGDVLLSDSLLSLIKNKKEQKGLETEFGLDIKWKVFNRQLIVSKIITSSLLSDVVTIF 411
Query: 933 HDRFDPIIESASKLDLIPAMVYGRSHRGQDYH-GMYCAILTVNQVVVSAGIFRIFGQELA 991
H++FD I+ + +K+DLIPAMV GR + + Y GMYCA+L VNQVVVSAGIFR+FG+E+A
Sbjct: 412 HEQFDSIVVTGTKIDLIPAMVKGRKIKDKYYFGGMYCAVLIVNQVVVSAGIFRVFGKEVA 471
Query: 992 ELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEE 1051
EL L+AT + Q QG+F+ L CIE +L L V+ LVLP+A EA+++W +KFGF TE
Sbjct: 472 ELSLIATKAEYQKQGFFKCLLSCIENVLKELKVERLVLPAAHEAESMWIDKFGF---TEP 528
Query: 1052 EQ----NKYRNDYPLMIFQGT 1068
Q YR + + +G
Sbjct: 529 NQGLGRRYYRRSWSFHLNKGV 549
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 307/849 (36%), Positives = 424/849 (49%), Gaps = 126/849 (14%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MAN T++ E VV+S IR G+KREF +K Q+E S+G+ R + Q+ +G P
Sbjct: 1 MANRTEAGE--VVISSIRTGMKREFAMMMKAQAECGISIGQKRVTRSQNGSLNGRASISP 58
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDSEN 120
+ +T S KR+++E +VV + NE+ E SKD+ E
Sbjct: 59 Q-NGSRTKSSTVKRRKKEANVVTADQS-----------LSHSNEKLDKTEELRSKDEGEK 106
Query: 121 MGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG--- 177
VE++ + SE E +D VV G E++KN V + E +
Sbjct: 107 -----------VEKLPLTSEWEEPKSD----VVDGASEDEKNGT-AVDSEIGEQETACVM 150
Query: 178 ---ELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDL 234
EL+ +GGS + + + V V+ E N L + E+ L
Sbjct: 151 EREELNSLGGSQKEKLINGDVAAKPVDVDNSGNGEENNGLVDCGGLFQKEL--------L 202
Query: 235 I-GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYE-PGRVQMEMDEEKK 292
I G+V + +EV D+SD V C G+ KE E G V +M + K
Sbjct: 203 INGDVAEKPKEV---------DKSDNDKQSDSLVNCGG-GVEKEKELNGDVTEKMMDVDK 252
Query: 293 NDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGND--------EGKTVNVVVE 344
D + +L + ++E +M S C +S K+ G+ E V+ E
Sbjct: 253 TDNKEQL--DSLMELAMEELPMSGDCKTKSEAIKADCAYGSALATVPESVENSKVSTSSE 310
Query: 345 RPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFY 404
+PLRRFTRS L+ K + S ++D + +G K E M + + K
Sbjct: 311 KPLRRFTRSCLKPKQQAMSASPAEDTKAEDALESDEASAIGTTSKLEMKMSKKVALVKIP 370
Query: 405 SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVF 464
+KLK L +G+LEG+ V Y RG KG L+GV++GSGI CFC +C G +VVTP F
Sbjct: 371 TKLKGLLATGLLEGLPVRYARGRPEKG-----LQGVIQGSGILCFCQNCGGTKVVTPNQF 425
Query: 465 ELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC 524
E+HAGSSNKRPPEYIYL+NGKTLRD++ CKD+P + LE A+R G+ +K+ CLNC
Sbjct: 426 EMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEAAIRNATGAGDARKSTVCLNC 485
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEP-LSHSSEME---------------- 567
+ S A L C SC+ K+SQ +++ + S ++E
Sbjct: 486 KASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRDGQLEFIFLLNYYWADDLYKL 545
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P L+ SP+ + L +S E S+ T SK+ HG++TRKDLRMHKL
Sbjct: 546 GLPDLRGLQWSPSSNSVL-----KSTERMSSGTCPPSKV-------HGRLTRKDLRMHKL 593
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
VFE L DG + Y+VRG +VSPSQFEAHAG ASRRKP
Sbjct: 594 VFEGDVLPDGTALAYYVRG--------------------KVSPSQFEAHAGCASRRKPGW 633
Query: 688 HIYTSNGVSLHELS--IKLSLER---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 742
+ + + + L I LE+ P + DLC IC DGGDLLCCD+CPRAFH +C
Sbjct: 634 YWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADGGDLLCCDNCPRAFHTEC 693
Query: 743 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 802
VSLP IP GTW+C+YC N F KEKF + +ANA AAGR+ G+D Q+ R IRIV T
Sbjct: 694 VSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQITKRSIRIVDTLHA 752
Query: 803 ELGGCVLCR 811
E+G CVLCR
Sbjct: 753 EVGVCVLCR 761
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 223/525 (42%), Positives = 302/525 (57%), Gaps = 64/525 (12%)
Query: 541 KSCVELKESQAGSAEIKEPLSHSSEMEPQP----PSVELEESPAPSGELTDTSNRSPEPN 596
K CV+ ++++ S S +P P +++ SP T ++RSPEP
Sbjct: 107 KFCVDYRDAKVVSGLFSSSCEESKNRQPSPRHTIETIDSRVSP------TTVNHRSPEPV 160
Query: 597 SAQTSSHSKMKSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYK 654
QTSS+ MK ++ KS +ITRKD +HKLVF+E LEDGA VGYF RG+
Sbjct: 161 VPQTSSYKGMKCNTYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGK------- 213
Query: 655 KGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKE 714
VSPS+FEAHAG ASRRKP+ +I T++GVSLHEL+ R S +
Sbjct: 214 -------------VSPSKFEAHAGRASRRKPYSYIRTADGVSLHELAN----NRRISMSD 256
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 774
+D+ C C G+LL CD C R+FH++C+ L P +C YC N F K+K V++ N
Sbjct: 257 SDERCSHCEQVGNLLWCDRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKEND 316
Query: 775 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 834
A GRI DP Q+ C + + + G C LC RDF + G RTV++C QCE+E
Sbjct: 317 VATGRIAEGDPSEQITEVCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEKE 376
Query: 835 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 894
+HV CLKDH M +L ELPK KW C DC I++ LQKL+ RGE +L
Sbjct: 377 FHVECLKDHNMANLVELPKDKWFCGIDCDDIHMKLQKLMARGEAEL-------------- 422
Query: 895 SGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY 954
D++WR+L K + L+SKA +IFH+RF I + +K+DLI AM+Y
Sbjct: 423 -------GLDIKWRLLNTKLNNPKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAMLY 475
Query: 955 GRSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQ------GY 1007
G GQ + GMYCA+L +V+ AGIFR+ GQE+AELPLVAT+ Q + GY
Sbjct: 476 GMEIEGQYSFEGMYCAVLYFKKVIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSGY 535
Query: 1008 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
F+SLF CIE +L L VKTLVLP+A EA+++W +KFGF+ ++E
Sbjct: 536 FRSLFSCIENMLRHLKVKTLVLPAAHEAESMWIDKFGFTKPNDKE 580
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 33/51 (64%), Gaps = 1/51 (1%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNV 51
M N DSE FV+ SK+R G KREF FAL SEI SL +TR RK Q+ V
Sbjct: 1 MTNFNDSE-LFVIKSKVRTGHKREFSFALNSYSEIGTSLSKTRPRKNQNMV 50
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/337 (57%), Positives = 236/337 (70%), Gaps = 4/337 (1%)
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
CV L GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R IRI T +
Sbjct: 5 CVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTLE 64
Query: 802 TELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 861
T GGC LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+G W C D
Sbjct: 65 TGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCSTD 124
Query: 862 CKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG-SDNAVDFDVRWRVLRGKKVDASDG 920
C RIN LQ L++ G E + LDVIKKK E G +D+ D DVRWRVL+ K +SD
Sbjct: 125 CVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDK---SSDD 181
Query: 921 TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSA 980
++ +LSKAV+IFH+ FDPII+ ++ DLIPAMVYGRS R QDY GMYC +LTVN +VVSA
Sbjct: 182 SKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSA 241
Query: 981 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1040
G+FRI G E+AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A EA++IWT
Sbjct: 242 GLFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWT 301
Query: 1041 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
+FGF+ ++++E +Y +FQGTS L K V K
Sbjct: 302 ERFGFAKISQDELREYLKGGRTTVFQGTSNLHKLVAK 338
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/386 (50%), Positives = 250/386 (64%), Gaps = 7/386 (1%)
Query: 695 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
VSLHELS+ +S + S +E DDLC C GGD+ C CPR+FH CV LP +PS W
Sbjct: 2 VSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWF 61
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+ +D
Sbjct: 62 CDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKD 120
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
F + F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++
Sbjct: 121 FNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMIS 180
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
G L E+ +D+I+KKHE G + ++RW+++ G+ A++ +LLS AV + H
Sbjct: 181 GGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRS--ATEDGNSLLSSAVPVIHQ 238
Query: 935 RFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
FDPIIE+ + DLIP MV+GR + GQDY GMYCA+LTV VVSA + R+ G ++
Sbjct: 239 SFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDV 298
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AELPLVATS D QG GYFQ LF CIE+LL L VK +LP+A EA+AIW KFGFS + +
Sbjct: 299 AELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQ 358
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPVP 1076
++ Y N L +F GT L K +P
Sbjct: 359 DQMEAYLNGGHLTVFHGTLNLYKAIP 384
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 369 bits (948), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 185/370 (50%), Positives = 235/370 (63%), Gaps = 41/370 (11%)
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 767
R S++++DDLC IC DGG+LL CDSCPRAFH +CV IP GTW CRYC N Q+E
Sbjct: 234 RNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESS 293
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
+ YN NA AAGRI+G+DP Q+ +R IRI TP T GGC LCR DF K +F RTV+L
Sbjct: 294 LAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLL 353
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDV 887
CD Q LP+G W C ADC RI+ L+ L+ RG E + +++
Sbjct: 354 CD-------------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEI 394
Query: 888 IKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLD 947
IK+K+E+ + D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ D
Sbjct: 395 IKRKYEQKALNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRD 451
Query: 948 LIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1007
LIPAMVYG + VVSAG+FR+ G E+AELPLVATS D QG GY
Sbjct: 452 LIPAMVYGNT-------------------VVSAGLFRVMGSEIAELPLVATSRDSQGLGY 492
Query: 1008 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQG 1067
FQ+LF CIE+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQG
Sbjct: 493 FQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQG 552
Query: 1068 TSMLQKPVPK 1077
TS L K VPK
Sbjct: 553 TSTLHKLVPK 562
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 83/147 (56%), Gaps = 3/147 (2%)
Query: 432 PGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM 491
P + LRGV+K GI C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+
Sbjct: 70 PPRAVLRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDIL 129
Query: 492 NVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQA 551
C D+ L+ L+ A++ +G + K+ C C+ SF+ + LLC SC+E K SQ
Sbjct: 130 RACSDATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQN 189
Query: 552 GSAEIK---EPLSHSSEMEPQPPSVEL 575
+ K P S + + + P +L
Sbjct: 190 STRTSKIGRNPTSSARRSKNESPGSKL 216
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 650 LVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
L G K GILC+C + + VSP FE HAG ++++ P +I+ NG +LH++
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAG-STKKHPSDYIFLENGNNLHDI 128
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 356 bits (913), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 174/367 (47%), Positives = 240/367 (65%), Gaps = 26/367 (7%)
Query: 641 GYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
G+F +K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+P++HIYTSNG++LH++
Sbjct: 13 GFFF--QKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHDI 70
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
+I L+ + ++ DD+C C DGGDL+ C SCPRAFH C+ L P G WHC C
Sbjct: 71 AISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCNK 130
Query: 761 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 820
F +V R R+V+TP+ ++GGC +CR DF F
Sbjct: 131 LGHGGNFAR------------------PIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDTF 172
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
RTVILCDQCE+E+HVGCL++ G+ DL+E+PK W CC DC I +AL+ V G + +
Sbjct: 173 DDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQTI 232
Query: 881 PETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
P + L++I +KH E G D A +DV+W++L GK + D +LLS A +IF + FDP
Sbjct: 233 PASLLNIINRKHVEKGLLVDEAA-YDVQWQILMGKSRNRED--LSLLSGAAAIFRECFDP 289
Query: 939 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
I+ + DLIP MVYGR+ GQ++ GMYC +LTV VVVSAG+ RIFG+E+AELPLVAT
Sbjct: 290 IVAKTGR-DLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVAT 348
Query: 999 SNDCQGQ 1005
+ + QG+
Sbjct: 349 NREHQGK 355
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPP-EYIYLENGKTLRDI 490
L G +G+GI C C + + ++P+ FE HAG S +R P +IY NG TL DI
Sbjct: 20 LGGYKQGNGIVCSCCEVE----ISPSQFESHAGMSARRQPYRHIYTSNGLTLHDI 70
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 336 bits (861), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 233/727 (32%), Positives = 373/727 (51%), Gaps = 101/727 (13%)
Query: 377 TEVANDGVGGPVKQ-ETVMKPRKVMRKFYSKLKNFLESG---------ILEGMSVMYIRG 426
+E +DG P ++ + + +P + R KLK+ + G E M+ + G
Sbjct: 245 SEEGSDGQKSPEERGDAIARPSQAPRAAAIKLKSMVPVGDKLLKAPRNAKELMATRLMEG 304
Query: 427 SKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
V+ G+ L G++K G+ C C +C+G+ +V+ + FE H+GS++ P + IYLENG
Sbjct: 305 HHVRCSCRGIQ-LTGMLKDMGVQCDCRNCRGSVIVSISAFEAHSGSTSHHPSDNIYLENG 363
Query: 485 KTLRDIMNV------CKDSPLETLEKAVRMVLG-----SSSMKKAN-------FCLNCRV 526
K LRDI++ C D+ L L+ A+ V G S K N +C R
Sbjct: 364 KNLRDILSAGQEAADCGDNILRALKMAIGDVQGVEKSKSKCAKCGNPEEGDLIYCKGARC 423
Query: 527 SF-SNAGVEELM------LLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
S +++G E+ C C + K+ A S ++K P+S +E E
Sbjct: 424 SVVAHSGCVEIANPHLGDWFCGKCEKTKKPHA-SVKVKRPISSGAEKE------------ 470
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
S+++ +++R D +HK +F GGLE+G E
Sbjct: 471 -----------------------DSRVREKDATVSARLSR-DAHLHKALFLPGGLENGTE 506
Query: 640 VGYFVRGEKFLVGYKKGFGILCTCCN----SEVSPSQFEAHAGWASRRKPFQHIYT-SNG 694
+GY+ + + L G K+G GI C+CCN S++S +FE HAG +RR P+ +I +G
Sbjct: 507 LGYYTKSQLKLKGVKRGKGICCSCCNKEASSDISCFEFEQHAGCEARRNPYGNILVLVDG 566
Query: 695 VSLHELSIKLSLERPFSSKEN-------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 747
SL ++ L+ + ++N + C C G+L C+ C A+ +C
Sbjct: 567 RSLKDVCKDLTHKNKLGEQQNCEPLARDVNCCYECSSSGELKTCNGCEEAWCDNCTKGEE 626
Query: 748 IPSGT-WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDTEL 804
+ S + W+CR C N K V N + E ++ RCIR ++ E+
Sbjct: 627 VDSDSKWYCRMCRNDTLK---VAQNGQKVSGKHQEESSSITEIDERGRCIRHLEG-HREV 682
Query: 805 GGCVLCRGRDFCKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
GGC +C+ + K+ F T+++CDQC REYHV CLKD GM++L ELP+G+W C CK
Sbjct: 683 GGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGMDNLNELPEGEWFCQKGCK 742
Query: 864 RINLALQKLVDRGEEKLPETSL-DVIKKKHEESG---SDNAVDFDVRWRVLRGKKVDASD 919
I+ L +LV G E L + + ++ + + ++SG ++ W++L GK +S
Sbjct: 743 VIDEILTQLVAIGPESLSHSIISELPENRQQKSGVIEKAESISPSFEWQILCGK--GSSP 800
Query: 920 GTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVS 979
L++AV+IF + DPI ++ + +LIP MV R + D+ G++C +L +N VVS
Sbjct: 801 ANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDYDFEGVFCVVLKLNGKVVS 860
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
A + +IFG+E AE+PLVATS QGQG+ ++L IE+LLG L+V+ LVLP+A + +++W
Sbjct: 861 AALLQIFGREFAEVPLVATSLPHQGQGFCKALMTTIERLLGVLSVERLVLPTAKDTESLW 920
Query: 1040 TNKFGFS 1046
NKFGFS
Sbjct: 921 VNKFGFS 927
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 327 bits (839), Expect = 2e-86, Method: Composition-based stats.
Identities = 157/274 (57%), Positives = 198/274 (72%), Gaps = 2/274 (0%)
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 873
DF KS F RTVI+CDQCEREYHVGCLK+H M DL+ELP+G+W C +DC RI+ ALQ L+
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 874 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 933
RG + LP +DVI+KK + G + + D+RW++L GK DA +R LLSKAV+IFH
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAE--SRLLLSKAVAIFH 120
Query: 934 DRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 993
+ FDPI+++ + DLIP MVYGR+ R QDY G+YCA+LTV VVSAGI R+ G E+AEL
Sbjct: 121 ESFDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAEL 180
Query: 994 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1053
PLVATS + QGQGYFQSLF CIE+LL L VK VLP+A EA++IWT KFGF+ +T +E
Sbjct: 181 PLVATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDEL 240
Query: 1054 NKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
+KY N +FQGTS L KPV + + G
Sbjct: 241 HKYLNGARTTVFQGTSTLHKPVTVPHVSSRETQG 274
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 326 bits (836), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 227/666 (34%), Positives = 340/666 (51%), Gaps = 64/666 (9%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV--- 493
L G+++ G+ C C CKG QVV+ + FE H+GS++ P IYLENGK LRDI++
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 494 ---CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQ 550
C L L+ A+ + G KK C C V SC E K S
Sbjct: 630 SADCGGDILGALKHAIGEIQGIP--KKEGACGKCGKREGGDFV--------SCKEPKCSA 679
Query: 551 AGSAE-IKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSS 609
AE + P H ++ E + P T + P + + + K K
Sbjct: 680 VYHAECVGLPSPH--RVDWFCAKCEKAQVKMPK---TVLKMKRPPAVTDREDTRLKEKEL 734
Query: 610 SVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVS 669
+V+S +D +HK +F GGL DG E+GY+ R + L G K+G GI C CCN E+S
Sbjct: 735 TVRS-----ARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCKCCNQEIS 789
Query: 670 PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDG---- 725
S FE HAG SRR P+ I ++G SL ++ +L+ + SK D + G
Sbjct: 790 CSAFEQHAGCESRRNPYGSILLADGRSLKDMCKELA----YQSKLGDRAHQVARTGDVKS 845
Query: 726 -------GDLLCCDSCPRAFHIDC---VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANAR 775
G L C + I+ S SG C+ C +K VE A+
Sbjct: 846 SSGSEEQGVLASSQRCESTWCINFGTRFSCQEADSGHPLCKIC------QKNVE-GAHKT 898
Query: 776 AAGRIEGVD--PFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
+ R++ P R +R+ Q PD+ GC +C+ K F T+++CDQC R
Sbjct: 899 SKKRVDATANIPATDDTGRNVRLFQAPDSS-SGCAICKKWTLKKCGFDM-TMLVCDQCGR 956
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL-DVIKKKH 892
EYHVGCL++ G+ L ELP+ +W C +C+ I L +LV G E L + + D+++ +
Sbjct: 957 EYHVGCLRESGI--LDELPEAEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESRQ 1014
Query: 893 EESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 949
+ G + W++L G + +G L++AV IF + DPI ++ S ++I
Sbjct: 1015 HQQGIVEMAESSSPVFGWQILHGAGENPVNGR--TLAQAVEIFTECSDPIKDAPSGQNMI 1072
Query: 950 PAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQ 1009
P MVY R + D+ G+YC +LT+N+ VVS + +IFG+E+AE+PL+ATS D Q QG+ +
Sbjct: 1073 PIMVYSRRFKDYDFDGIYCVVLTLNEKVVSTALLQIFGREVAEVPLIATSVDHQDQGFCK 1132
Query: 1010 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTS 1069
+L IE+LLG LNV+ LVLP++ A+ +W N+FGFS M + + R+ L++F GT+
Sbjct: 1133 ALMTTIERLLGVLNVERLVLPASKNAEFVWVNRFGFSRMEDAQLKHIRSMMGLLVFTGTT 1192
Query: 1070 MLQKPV 1075
ML K +
Sbjct: 1193 MLVKHI 1198
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 320 bits (819), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 230/730 (31%), Positives = 376/730 (51%), Gaps = 86/730 (11%)
Query: 374 SDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESG---------ILEGMSVMYI 424
SD E +G GPV+ P + R +KLK+ + G E M+ +
Sbjct: 350 SDGQESHEEGDTGPVR------PSQAPRAAATKLKSMVPIGDKLLKAPRNAKELMATRLM 403
Query: 425 RGSKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLE 482
G V+ G+ L G++K G+ C C +CK + +V+ + FE H+GS++ P + IYLE
Sbjct: 404 EGHFVRCSCRGIQ-LTGMLKDMGVRCDCRNCKSSVIVSISAFEAHSGSTSHHPSDNIYLE 462
Query: 483 NGKTLRDIMNV------CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL 536
NGK LRDI++ C D+ L L+ A+ + G K C C S E
Sbjct: 463 NGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKWKVT--CAKCWNSD-----EGD 515
Query: 537 MLLCKS--CVELKESQA---GSAEIKEPLSHSSEMEPQP-PSVELEESPAPSGELTDTSN 590
++ CK C + S+ + + + E +P +V+++ S + E D
Sbjct: 516 LIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATVKVKRSISSGTEKDD--G 573
Query: 591 RSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFL 650
R E ++ +++ +D +HK +F GGLEDG E+GY+ + + L
Sbjct: 574 RVREKDATESTR---------------LNRDAHLHKALFLPGGLEDGTELGYYTKSQLKL 618
Query: 651 VGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY-TSNGVSLHELSIKLSLERP 709
G K+G E++ +FE HAG +RR P+ +I ++G SL ++ +L+ +
Sbjct: 619 KGVKRGEAFKKVVV--EINCYKFEQHAGCEARRNPYGNILLVADGRSLKDVCKELAHKNK 676
Query: 710 FSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDCV-SLPGIPSGTWHCRYC--- 758
KE + C C G+L C C + C L G W+CR C
Sbjct: 677 LGEKEKRVARAGKVNSCYECGTRGELKNCHGCVETWCNSCTKGLETDSDGKWYCRMCRQD 736
Query: 759 -MNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDTELGGCVLCRGRDF 815
+N Q E+ R+ IEG+ A+ RC+R ++ E+GGC +C+ +
Sbjct: 737 TLNVAQIEQ-------KRSNKHIEGMSNIAETDERDRCVRHLEG-HREVGGCAICKKWNL 788
Query: 816 CKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
K+ F T+++CDQC REYHV CLKD G++DL ELP+G+W C DCK I+ L +LV
Sbjct: 789 SKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPEGEWFCQKDCKVIDEILTQLVA 848
Query: 875 RGEEKLPETSL-DVIKKKHEESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAVS 930
G E L ++ + ++++ + +++G+ + + W++L GK + ++ L++A++
Sbjct: 849 NGPELLTDSIISELLESRQQQTGAKDKAESSCPSFAWQILCGKSGNTAN--TQTLAEAIN 906
Query: 931 IFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
IF + DPI ++ + +LIP MV R + D+ G++C +L +N+ VVSA + +IFG E+
Sbjct: 907 IFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVFCIVLKLNEKVVSAALLQIFGGEI 966
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AE+PLVATS QGQG+ ++L IE+LLG L+V+ LVLP+A ++IW NKFGFS + +
Sbjct: 967 AEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGFSRVPD 1026
Query: 1051 EEQNKYRNDY 1060
+E + R +
Sbjct: 1027 DEGSFLRCTF 1036
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 317 bits (813), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 250/817 (30%), Positives = 384/817 (47%), Gaps = 147/817 (17%)
Query: 377 TEVANDGVGGPVKQETVM-KPRKVMRKFYSKLKNFLESG---------ILEGMSVMYIRG 426
+E +DG P + + V+ +P + R +KLK+ + G E M+ + G
Sbjct: 522 SEEGSDGQDSPEEGDDVLVRPSQAPRAAATKLKSMVPVGDKLVRAPRNAKELMATRLMEG 581
Query: 427 SKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
V+ G+ L G++K G+ C C +CKG+ +V+ + FE H+GS++ P + IYLENG
Sbjct: 582 HFVRCSCRGIQ-LTGMLKDMGVQCNCRNCKGSVIVSISAFEAHSGSTSHHPSDNIYLENG 640
Query: 485 KTLRDIMNV------CKDSPLETLEKAVRMVLG------------SSSMKKANFCLNCRV 526
K LRDI++ C D+ L L+ A+ + G SS +C R
Sbjct: 641 KNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKRKVTCAKCESSQEGDLIYCKGARC 700
Query: 527 SFSN----AGVEELML---LCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
S G+ L C C + K+ A +A++K +S ++ P+ V L
Sbjct: 701 SVVAHSRCVGIANPQLGDWFCGKCEKTKKHHA-AAKVKRSISGGAD--PEDGKVRLL--- 754
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
R + ++ ++S K ++SV+ +D +HK +F GGL DG E
Sbjct: 755 -----------RDLQGYASPSASKDKAATASVR-----LNRDAHLHKALFLPGGLVDGTE 798
Query: 640 VGYFVRGEKFLVGYKKGFGILCTCCNSEVS------------PSQ--------------- 672
+GY+ + + L G K+G GI C+CCN E PSQ
Sbjct: 799 LGYYTKSQLKLKGVKRGEGICCSCCNEETRHVQGGSRGGLMFPSQQDGTYYVFSQVKEIH 858
Query: 673 -------------------------FEAHAGWASRRKP-FQHIY-----TSN-------- 693
FE HAG +RR P + H+ TSN
Sbjct: 859 VTTFNEEMKFLSNDLDDIQQISCYEFEQHAGCEARRNPGYYHVVAKYAETSNEYDSYGNI 918
Query: 694 -----GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 748
G SL ++ L+ + KE G L DS + VSL GI
Sbjct: 919 LLVPDGRSLKDVCKDLAHKNKLDEKEKRAARGGESRFSLELEQDSVFNNSTVAIVSLIGI 978
Query: 749 ---PSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDTE 803
G W+CR C K V N + +EG+ A+ RCIR + P E
Sbjct: 979 EKDSEGRWYCRMCRQDSLK---VAQNGQKGSEKIMEGMSNIAETNEQGRCIRHLDGP-RE 1034
Query: 804 LGGCVLCRGRDFCKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 862
+GGC +C+ + K+ F T+++CDQC REYHV CLKD G++DL ELP G+W C DC
Sbjct: 1035 VGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKDC 1094
Query: 863 KRINLALQKLVDRGEEKLPETSLD-VIKKKHEESGSDNAVDFD---VRWRVLRGKKVDAS 918
K I+ L +LV G E L +++ +++ + + S + ++ W++L GK +
Sbjct: 1095 KVIDEILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSPA 1154
Query: 919 DGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVV 978
D L++A +IF D DPI + + +LIP MV R + D+ G++C +L +N VV
Sbjct: 1155 D--VQTLAEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKVV 1212
Query: 979 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1038
SA + +IFG+E+AE+PL+ATS GQ + ++L IE+LLG L+V+ LVLP+A +++
Sbjct: 1213 SAALLQIFGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTESV 1272
Query: 1039 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
W NKFGFS + E++ L F GTSM+ K +
Sbjct: 1273 WINKFGFSRVQEDQLKSICTTIRLTTFTGTSMVVKAI 1309
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 317 bits (811), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 222/701 (31%), Positives = 343/701 (48%), Gaps = 120/701 (17%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+ V YI S + L G++ G G C C
Sbjct: 208 ELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERE-----LDGIIDGGGYLCGC 262
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +IYLENGK + I+ K +PL ++ ++ V
Sbjct: 263 SSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 322
Query: 511 GSSSMKKANFCLNCRVSFSNA--GVEELMLLCKSCVELKESQAGSAEIKE---PLSHSSE 565
GSS ++ ++ SNA G ++ C+ S +KE P+S S
Sbjct: 323 GSSINEEFFRVWKASLNQSNALVGADKKCHSELPCLPHSHVSYASQALKESFCPISSSFL 382
Query: 566 MEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMH 625
S ++ SG TS R P+ S +K K ++ + ++D +H
Sbjct: 383 YNNNFVSQQMYMET--SGVNKQTSKR---PSLYFPGSATKQKKTA---ESGVRKRDNDLH 434
Query: 626 KLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
+L+F GL DG E+ Y+V+G+K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+P
Sbjct: 435 RLLFMPNGLPDGTELAYYVKGQKILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQP 494
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
++HIYTSNG++LH+++I L+ + ++ DD+C C DGGDL+ +H+
Sbjct: 495 YRHIYTSNGLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLM--------WHV----- 541
Query: 746 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 805
W R + +I+G + V R R+ + P+ +G
Sbjct: 542 -------WIYRILLKVL-------------GIVQIDGGNFARPTVIRLTRVGKIPEYNVG 581
Query: 806 GCVLCRGR------DFCKSRFGR-RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CV+CR DF + + V+L ++E+HVGCL++ G+ DL+E+P+ W C
Sbjct: 582 DCVVCRLNLLKFLIDFTLANSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWFC 641
Query: 859 CADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVD 916
C DC I +AL+ V G +K+P + L++I +KH E G D A +DV+W++L GK +
Sbjct: 642 CQDCNNIYVALRNSVSTGVQKIPASLLNIINRKHVEKGLLVDEAA-YDVQWQILMGKSRN 700
Query: 917 ASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 976
D +LLS A +IF + S + H M C
Sbjct: 701 RED--LSLLSGAAAIFR------VSSCT------------------LHLMKC-------- 726
Query: 977 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1036
F F L F CIE+LL LNV+ LVLP+ A+
Sbjct: 727 ----WFFLAFNSLFNPL------------------FSCIERLLCSLNVEQLVLPA---AE 761
Query: 1037 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
IWT +FGF M+E + KY ++ L IF+GTSML+K VP+
Sbjct: 762 TIWTRRFGFRKMSEGQLLKYTREFQLTIFKGTSMLEKEVPR 802
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 178/272 (65%), Gaps = 6/272 (2%)
Query: 809 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLA 868
LCR +DF + F RTVILCDQCE+EYHVGCLK+ +L+ELP+G+W CC+ C +
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 869 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
L K++ G + L E L++IKKKH G D++W++L GK+ DG+ LLS A
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRT-TEDGS-ILLSAA 129
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFR 984
V IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA + R
Sbjct: 130 VPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLR 189
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1044
+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW KFG
Sbjct: 190 VMGGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFG 249
Query: 1045 FSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
FS T EE Y N L IF GTS L K VP
Sbjct: 250 FSRTTPEELEAYLNGAHLTIFHGTSYLYKAVP 281
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 222/740 (30%), Positives = 349/740 (47%), Gaps = 120/740 (16%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV--- 493
L G++K G+ C C +CKG+ +V+ + FE H+GS++ P + IYLENGK LRD+++
Sbjct: 545 LTGMLKDMGVQCNCRNCKGSMIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQE 604
Query: 494 ---CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKES 549
C D+ L L+ A+ + G K C C G EE ++ CK
Sbjct: 605 AADCGDNILRALKMAIGDIQGVEKSKVT--CAEC------GGSEEGDLIYCK-------- 648
Query: 550 QAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSS----HSK 605
+SHS + P + G+ T R ++ S S+
Sbjct: 649 ----GARCSVVSHSRCVGSANPQL----GDWFCGKCEKTKKRHAAAKVKRSISAGTEDSE 700
Query: 606 MKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCN 665
++ + + ++ R D + K +F GGL DG E+GY+ + + L G K+G GI C+CCN
Sbjct: 701 VRDKATTASARLNR-DAHLRKALFLPGGLVDGTELGYYTKSQLKLKGVKRGEGICCSCCN 759
Query: 666 SEVSPSQFEAHAGWASRRKPFQHIY-TSNGVSLHELSIKLSLERPFSSKENDDL------ 718
E+S +FE HAG +RR P+ +I ++G SL ++S +L+ + KE D
Sbjct: 760 KEISCYEFEQHAGCEARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEVC 819
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCV-SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAA 777
C C + G+L C SC A+ C + G W+CR C Q V N +
Sbjct: 820 CYECSNSGELKRCHSCEEAWCDKCTKGMETDSEGRWYCRMCR---QDSLKVAQNGHKGTD 876
Query: 778 GRIEGVDPFAQM--VSRCIRIVQTPDTELGGCVLCRGRDFCKSRF-GRRTVILCDQCERE 834
IEG+ A+ RC+R ++ P E+GGC +C+ + K+ F T+++CDQ
Sbjct: 877 KIIEGMSNIAETDEKGRCVRHLEGP-REVGGCAICKKWNLSKTGFVDGMTILVCDQVRSL 935
Query: 835 YHVG-----CLKDHGMED-----------LQELPKGKWLCCADCKRINLALQKLVDRGEE 878
+ K +G D QELP+G+W C DCK I+ L +LV G E
Sbjct: 936 NQMLPGTRITWKVNGFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANGPE 995
Query: 879 KLPETSL-DVIKKKHEESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAVSIF-- 932
L + + ++++ + ++S ++ W++L G+ S L++A +IF
Sbjct: 996 LLSNSIISELLESRQQQSSVKVKLESSNPRFGWQILCGE--GGSSANVQTLAEAANIFTS 1053
Query: 933 --------------------------------------HDRFDPIIESASKLDL----IP 950
FDP S D+ +
Sbjct: 1054 IDDINLPYLWLVVGNYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNVA 1113
Query: 951 AMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQS 1010
+V R + D+ G++C +L +N+ VVSA + +IFG+E+AE+PL+ATS QGQG+ ++
Sbjct: 1114 LVVCSRRAKDHDFEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCKA 1173
Query: 1011 LFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSM 1070
L IE+LLG L+V+ LVLP+A ++IW NKFGFS + E++ + LM F GT M
Sbjct: 1174 LMTTIERLLGVLSVERLVLPTAKNTESIWINKFGFSRVPEDQLKRICTTIRLMTFTGTRM 1233
Query: 1071 LQK---PVPKCRIVGKSVDG 1087
L K P+ RI +S DG
Sbjct: 1234 LGKAITPMTLNRIQRQSRDG 1253
Score = 43.9 bits (102), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 54/105 (51%), Gaps = 12/105 (11%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
V+ + ++ R + + FL G+++G + Y S++K L+GV +G GI
Sbjct: 701 VRDKATTASARLNRDAHLRKALFLPGGLVDGTELGYYTKSQLK------LKGVKRGEGIC 754
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPP--EYIYLENGKTLRDI 490
C C C N+ ++ FE HAG +R P + + +G++L+D+
Sbjct: 755 CSC--C--NKEISCYEFEQHAGCEARRNPYGNILLVADGRSLKDV 795
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 63/152 (41%), Gaps = 28/152 (18%)
Query: 641 GYFVRGE---KFLVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSN 693
G+FVR L G K G+ C C N + VS S FEAH+G ++ P +IY N
Sbjct: 533 GHFVRCSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSISAFEAHSG-STSHHPSDNIYLEN 591
Query: 694 GVSLHEL----------------SIKLSLERPFSSKENDDLCGIC--MDGGDLLCCDS-- 733
G +L ++ ++K+++ +++ C C + GDL+ C
Sbjct: 592 GKNLRDVLSAGQEAADCGDNILRALKMAIGDIQGVEKSKVTCAECGGSEEGDLIYCKGAR 651
Query: 734 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
C H CV G W C C T ++
Sbjct: 652 CSVVSHSRCVGSANPQLGDWFCGKCEKTKKRH 683
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/275 (50%), Positives = 177/275 (64%), Gaps = 6/275 (2%)
Query: 806 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
G + R +DF + F RTVILCDQCE+EYHVGCL+ +L+ELP+G+W CC+ C
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 866 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 925
+L K++ G + L E L++I+KKHE G D++W++L GK+ A++ LL
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKR--ATEEGSILL 172
Query: 926 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAG 981
S AV IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA
Sbjct: 173 SAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAA 232
Query: 982 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1041
+ R+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW
Sbjct: 233 LMRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMK 292
Query: 1042 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
KFGFS + EE N L IF GTS L K VP
Sbjct: 293 KFGFSRIPPEELEACLNGAHLTIFHGTSYLYKAVP 327
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 253 bits (646), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 168/525 (32%), Positives = 242/525 (46%), Gaps = 137/525 (26%)
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAF 738
+ IY ++G SLH+ + L +E+ + E+ D C C D GDL C CP A+
Sbjct: 961 YTSIYLADGRSLHDAAQSLVVEQNVKQEGVTPAKIEHLDQCVECGDSGDLQLCTGCPNAY 1020
Query: 739 H-------------------------IDCV-SLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
H +DC+ S G + C C Q+++F
Sbjct: 1021 HQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCPDC----QEQRFGGTKD 1076
Query: 772 -----ANARAAGRIEGVDPFAQMVSRCIRIVQTPDT-ELGGCVLCRGRDFCKSRFGRRTV 825
R+ G + + +M+ RC R++Q P+ LGGCV C+ DF K+ FG +T
Sbjct: 1077 RRRSMTKRRSKGAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCVFCKSGDFAKTGFGPKTT 1136
Query: 826 ILCDQCEREYHVGCLK-------------------------------------DHGMED- 847
+LCDQCEREYHVGCLK D G E
Sbjct: 1137 LLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCVATVRLRDCDRGCESQ 1196
Query: 848 ----------------LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKK- 890
QELP+G+W C DCK I+ L LV G E L ++ + + +
Sbjct: 1197 GIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLRT 1256
Query: 891 ------KHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDR--------- 935
+ E++ + F+ W++L G+ D S+G L++AV IF R
Sbjct: 1257 NQARLERSEDATESSCSGFE--WQLLHGRGGDPSNGKA--LAEAVQIFSVRNLSDPGFPV 1312
Query: 936 -------------------FDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 976
FDPI + S DLIP MVY RS R QD+ G+YC +L +
Sbjct: 1313 RTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGGIYCVVLKYDNR 1372
Query: 977 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1036
VVS + R+FG++LAELPL+AT+ QGQG+ ++L IE+LLG L V+ L LP+A A+
Sbjct: 1373 VVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVERLALPAAEGAE 1432
Query: 1037 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1081
IW NKFGF M E + ++ +D +M+F G+ ML+K +P I
Sbjct: 1433 GIWLNKFGFRRMAEGQVKQFHSDLNMMVFTGSFMLEKEIPPLEIT 1477
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 140/303 (46%), Gaps = 84/303 (27%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++++ + K L+SG+L+G V Y+ RG + L G+++ G+ C C CKG Q
Sbjct: 585 LLKEAPASAKLLLQSGLLDGHHVRYMGRGGHIM------LTGIIQEGGVLCDCSSCKGVQ 638
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKK 517
VV + FE HAGSS + P ++I+LENGK L+DI+ + + ++ +K
Sbjct: 639 VVNVSAFEKHAGSSARHPSDFIFLENGKCLKDILEIGWN----------------ANKQK 682
Query: 518 ANFCLNCRVSFSNAGVEELMLLCKSCV-ELKESQAGSAEIKEPLSHSSEMEPQPPSVELE 576
N +M + KS + E+ + + ++ PL + QP L+
Sbjct: 683 MN----------------IMDVLKSAIGEVGGVKVQISSLEHPL-----IAIQP---ALK 718
Query: 577 ESPAPSGELTDTSNRSP---EPNSAQTSSHSKMKSSS-------VKSHGKITRKDL---- 622
+ P P L DT R P +P + Q + KM S + VK+ G R +
Sbjct: 719 KLPQPK-SLLDTKPRVPVDVKPRTPQGDTKPKMPSDTKARFTPEVKARGSDARASMPRLD 777
Query: 623 ---------------------RMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILC 661
+HK +F GGLED EVGY+V+G+K L G KKG GILC
Sbjct: 778 RTPREKETASPPVLSRESSGANLHKALFLPGGLEDDTEVGYYVKGQKSLAGVKKGAGILC 837
Query: 662 TCC 664
+CC
Sbjct: 838 SCC 840
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 233 bits (595), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 144/365 (39%), Positives = 194/365 (53%), Gaps = 53/365 (14%)
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP-----VKQETVMKPRKVMR 401
LRRFTRSLL + + LS T +N P + K K
Sbjct: 205 LRRFTRSLLVKDKDSNDDDLS-------GTTTASNASSPSPKTNTTSTSRSSNKGNKNTN 257
Query: 402 KFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
K + L+ L +G+LEG V YI +G + LRGV+K GI C C CKG +VV
Sbjct: 258 KIPTNLRELLATGMLEGQPVKYIMRKGKRAV------LRGVIKRIGILCSCSSCKGRKVV 311
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
+P FE+HAGS+ K P +YI+LENG L D++ C ++ L+ LE A+R +G + ++
Sbjct: 312 SPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIF 371
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
C +C+ SFS + L C SC+E K + K +S S Q S ++ +S
Sbjct: 372 RCKSCKSSFSTLRSGKFALFCDSCLESKGA-------KNNISSSKVGRSQTSSAKVYKSA 424
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
+P KSSSV G++TRKD MHK+VF G L +G +
Sbjct: 425 SPGA-----------------------KSSSV---GRLTRKDKGMHKVVFMSGILPEGTD 458
Query: 640 VGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE 699
VGY+V G++ L GY K GI C CC++ VSPSQFE HAG A+RRKP+ +IY SNGVSLHE
Sbjct: 459 VGYYVGGKRLLDGYIKELGIYCHCCSTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHE 518
Query: 700 LSIKL 704
LSI L
Sbjct: 519 LSISL 523
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 229 bits (584), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 196/749 (26%), Positives = 335/749 (44%), Gaps = 160/749 (21%)
Query: 384 VGGPVK---QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
+ PVK ET +P +++ ++ L +G+LEG V Y + + G
Sbjct: 224 ITAPVKVRVTETKYRPESILKD----VRGLLSTGLLEGFRVTYKKNEVER-------IGR 272
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
+ G G SC C +C ++ FE H+G S+ +I+L++G +L ++ K + L+
Sbjct: 273 INGQGYSCGCSECGYRNIMNACEFEQHSGESSNNQNNHIFLDSGISLYMVIQGLKYTKLD 332
Query: 501 TLEKAVRMVLG-------------------------------SSSMKKANFCLNCRVSFS 529
L + V+ + S +++N L + S
Sbjct: 333 MLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFDDAPSDPCSTQSSQESNIALTDSLKDS 392
Query: 530 NAGVEELMLLC----KSCVELKESQAGSAEIKEP-LSHSSEME----PQPPSVELEESPA 580
+ ++ +S + K G E P LS S E E S++ EE+P+
Sbjct: 393 TSNASSILNWSSFRRRSDRQFKR---GGTETSTPILSRSPEKEISDLSTSTSMKSEETPS 449
Query: 581 --PSGELTD--TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
+G LT T + P P+ H+ S+ K G+ +D +H ++F+EGGL D
Sbjct: 450 ENTAGLLTTDVTVIQDPPPD------HNV--DSNSKDLGQPKVRDNTLHPMLFKEGGLPD 501
Query: 637 GAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSN 693
+ Y ++ GE GYK G GI+C CC+ EV +PSQFE H G RR+P++ IYTS+
Sbjct: 502 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQPYRSIYTSD 561
Query: 694 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 753
G++LHEL++KL + SS N D LP + SG+
Sbjct: 562 GLTLHELALKL--QDGLSSNVNID--------------------------ELPTLTSGSG 593
Query: 754 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 813
EY+ +R + P + + + V++ C +CR
Sbjct: 594 K--------------EYSTTSRPI-----IVPLKRTLQERVLTVES-------CYMCRKP 627
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMED-------LQELPKGKWLCCADCKRIN 866
++ C+QCER HV C ++G++ L E + ++CC C+ +
Sbjct: 628 HTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLR 686
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
+L + + KK E+ + +++ W++L G + + +
Sbjct: 687 ASLHEGL----------------KKREDIAFLRRIRYNICWQLLNGTNMRSD-----VQH 725
Query: 927 KAVSIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFR 984
+ + IF D F E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I +
Sbjct: 726 QVIEIFKDAF---AETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILK 782
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1044
+ +E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G
Sbjct: 783 VRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLG 842
Query: 1045 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
++++++E+++ +PL++F S++QK
Sbjct: 843 YTILSDEQKHSMLMAHPLVMFANLSLVQK 871
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 167/574 (29%), Positives = 252/574 (43%), Gaps = 152/574 (26%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++++ + K L+SG+L+G V Y+ RG + L G+++ G+ C C CKG Q
Sbjct: 582 LLKEAPASAKLLLQSGLLDGHHVRYLGRGGHIM------LTGIIQEGGVLCDCSSCKGVQ 635
Query: 458 V---------------------------------------------------VTPAVFEL 466
V V + FE
Sbjct: 636 VTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFEK 695
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE--KAVRMVLGSSSMKKANFCLNC 524
HAGSS + P ++I+LENGK L+DI+ + ++ + + ++ +G +K
Sbjct: 696 HAGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNVMDVLKSAIGEVGGEKVQI---- 751
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGE 584
+S + L+ E K Q +P +++P+ P V+++
Sbjct: 752 -ISLDHP------LIAIQPAEKKLPQPRLVLDTKP-RVPVDLKPRMPQVDMKPR-----V 798
Query: 585 LTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD--------------------LRM 624
+ DT +R P A+++S K + V++ I R D +
Sbjct: 799 MLDTRSRMPLDTKAKSTSDVKARGGDVRA--TIPRLDRTTREKEAASPPVPSRESSGANL 856
Query: 625 HKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNS------------------ 666
HK +F GGLED EVGY+V+G+KFL G K+G GILC+CC
Sbjct: 857 HKALFLPGGLEDDIEVGYYVKGQKFLAGLKRGAGILCSCCQQVARNGMSSILMDSVVCGL 916
Query: 667 ------EVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKEND---- 716
+S S FE HAGW SRR P+ IY ++G SLH+ + L +E+ + N
Sbjct: 917 TYEGERMISCSLFEQHAGWGSRRNPYTSIYLADGRSLHDAAQSLVVEQTVKQEGNTPAKI 976
Query: 717 ---DLCGICMDGGDLLCCDSCPRAFHIDCVS-LPGIPSGTWHC------RYCMNTFQKEK 766
D C C D GDL C CP A+H DC+ + SG + C RY ++
Sbjct: 977 EHLDQCVECGDRGDLQLCTRCPNAYHQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRRS 1036
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT-ELGGCVLCRGRDFCKSRFGRRTV 825
V N R+ G + + ++ RC R++Q P+ LGGCV C+ DF K+ FG +T
Sbjct: 1037 MV----NRRSKGAAKTLLSKDRVTGRCTRLLQVPEAVVLGGCVFCKSGDFAKTGFGPKTT 1092
Query: 826 ILCDQ----------CEREYHVGCLKDHGMEDLQ 849
+LCDQ CEREYHVGCLK HG+EDL+
Sbjct: 1093 LLCDQVSVGDMKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 52/78 (66%)
Query: 1006 GYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIF 1065
G+ ++L IE+LLG L V+ L LP+A A+ IW N+FGF M EE+ ++ +D +M+F
Sbjct: 1477 GHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMVF 1536
Query: 1066 QGTSMLQKPVPKCRIVGK 1083
G+SML+K +P I G+
Sbjct: 1537 TGSSMLEKDIPPLEITGQ 1554
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 50/93 (53%), Gaps = 7/93 (7%)
Query: 848 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE--SGSDNAVDFD- 904
QELP+G+W C DCK I+ L LV G E L ++ + + K ++ GS++A +
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 905 --VRWRVLRGKKVDASDGTRALLSKAVSIFHDR 935
W++L G+ D S+G L++ V IF R
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKA--LAEVVQIFSVR 1292
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 106/192 (55%), Positives = 140/192 (72%), Gaps = 3/192 (1%)
Query: 890 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 949
KK EE + + DVRWR+L G+ AS T+ LL +AVS+FH+ FDPI++ A+ DLI
Sbjct: 2 KKQEERCLEPLREIDVRWRLLNGRV--ASPETKPLLLEAVSMFHECFDPIVDPATGRDLI 59
Query: 950 PAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQ 1009
PAMV GR+ R QD+ GMYCA+L VN V SA + RIFG ++AELPL+AT N +G+GYFQ
Sbjct: 60 PAMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQ 119
Query: 1010 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY-PLMIFQGT 1068
+LF CIE+LL FL+VK LVLP+A EA++IW +KFGFS M ++ YR +Y +M F+GT
Sbjct: 120 TLFSCIERLLSFLSVKNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGT 179
Query: 1069 SMLQKPVPKCRI 1080
ML K VP+CR+
Sbjct: 180 VMLHKTVPQCRV 191
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 211 bits (537), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 191/749 (25%), Positives = 327/749 (43%), Gaps = 183/749 (24%)
Query: 384 VGGPVK---QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
+ PVK ET +P +++ ++ L +G+LEG V Y
Sbjct: 346 ITAPVKVRVTETKYRPESILKD----VRGLLSTGLLEGFRVTY----------------- 384
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
K N+++ FE H+G S+ +I+L++G +L ++ K + L+
Sbjct: 385 -------------KKNEIMNACEFEQHSGESSNNQNNHIFLDSGISLYMVIQGLKYTKLD 431
Query: 501 TLEKAVRMVLG-------------------------------SSSMKKANFCLNCRVSFS 529
L + V+ + S +++N L + S
Sbjct: 432 MLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFDDAPSDPCSTQSSQESNIALTDSLKDS 491
Query: 530 NAGVEELMLLC----KSCVELKESQAGSAEIKEP-LSHSSEME----PQPPSVELEESPA 580
+ ++ +S + K G E P LS S E E S++ EE+P+
Sbjct: 492 TSNASSILNWSSFRRRSDRQFKR---GGTETSTPILSRSPEKEISDLSTSTSMKSEETPS 548
Query: 581 --PSGELTD--TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
+G LT T + P P+ H+ S+ K G+ +D +H ++F+EGGL D
Sbjct: 549 ENTAGLLTTDVTVIQDPPPD------HNV--DSNSKDLGQPKVRDNTLHPMLFKEGGLPD 600
Query: 637 GAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSN 693
+ Y ++ GE GYK G GI+C CC+ EV +PSQFE H G RR+P++ IYTS+
Sbjct: 601 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQPYRSIYTSD 660
Query: 694 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 753
G++LHEL++KL + SS N D LP + SG+
Sbjct: 661 GLTLHELALKL--QDGLSSNVNID--------------------------ELPTLTSGSG 692
Query: 754 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 813
EY+ +R + P + + + V++ C +CR
Sbjct: 693 K--------------EYSTTSRPI-----IVPLKRTLQERVLTVES-------CYMCRKP 726
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMED-------LQELPKGKWLCCADCKRIN 866
++ C+QCER HV C ++G++ L E + ++CC C+ +
Sbjct: 727 HTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLR 785
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
+L + + KK E+ + +++ W++L G + + +
Sbjct: 786 ASLHEGL----------------KKREDIAFLRRIRYNICWQLLNGTNMRSD-----VQH 824
Query: 927 KAVSIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFR 984
+ + IF D F E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I +
Sbjct: 825 QVIEIFKDAF---AETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILK 881
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1044
+ +E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G
Sbjct: 882 VRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLG 941
Query: 1045 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
++++++E+++ +PL++F S++QK
Sbjct: 942 YTILSDEQKHSMLMAHPLVMFANLSLVQK 970
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKG----NQVVTPAVFELHAGS 470
+L+G+ V Y R K + L G V G +C C G +V++ FE HAG
Sbjct: 31 LLQGVPVTY-RFEKHN----AKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGV 85
Query: 471 SNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
++K +I+L NG++L ++ + ++ P E +A RM G
Sbjct: 86 TSKNQNGHIFLRNGRSLYELFHKLREVPAEKFPEAFRMAAG 126
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 208 bits (529), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 190/711 (26%), Positives = 308/711 (43%), Gaps = 116/711 (16%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 115 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 167
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG------------SS 513
H G S ++I+L+ G +L ++ K L L + +G +S
Sbjct: 168 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKAS 227
Query: 514 SMKKANF--------CLNCRVSFSNAG--VEELMLLCKSCVELKESQAG-SAEIKEPLSH 562
K+ ++ CL AG + L K V S SA +
Sbjct: 228 FQKRKDYLDAVASDGCLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNWSASKRRSGRR 287
Query: 563 SSEMEPQPPSVELEESPAPSG--ELTDTSNR--SPEPNSAQTSSHSK-----MKSSSVKS 613
+ + + SP G TDTS + + E + SS K +++ S+ S
Sbjct: 288 FRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHRLSLSSPVKITQRPLRNCSIDS 347
Query: 614 HGKITR-KDLRMHKLVFEEGGLEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPS 671
K ++ +D +H L+F+E GL D + Y ++ GE GYK+G I+C CCN E SPS
Sbjct: 348 KSKESKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPS 407
Query: 672 QFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCC 731
FE HAG RR+P+ +IYT G+SLH+ + L L+ + D+ + L
Sbjct: 408 HFEEHAGMGRRRQPYHNIYTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTS 465
Query: 732 DSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVS 791
C R P SG + P + +
Sbjct: 466 SGCGRE--------PSTTSGP------------------------------IVPLKRTLQ 487
Query: 792 RCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GM 845
R+V+T C C T+I C+QCER H+ C + +
Sbjct: 488 E--RVVETES-----CYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPL 540
Query: 846 EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDV 905
E L+E +LCC +C+ +L+ ++ G EK + +++ + ++
Sbjct: 541 EILKEYMCFHFLCCQECQ----SLRARLEEGLEKC--VGITFLRR----------IRSNI 584
Query: 906 RWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYH 964
WR+L G +DAS + + + + IF D F + S D+I MV G++ + +D+
Sbjct: 585 CWRLLSG--MDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFR 640
Query: 965 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1024
GMYCA+LT + VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV
Sbjct: 641 GMYCALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNV 700
Query: 1025 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
L P E IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 701 SLLTAPVDPEMAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 751
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/286 (37%), Positives = 153/286 (53%), Gaps = 38/286 (13%)
Query: 805 GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK--------- 855
GGCVLC+ DF + FG RT+I+CDQCEREYH+GCL +HG L ELP+GK
Sbjct: 575 GGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLAEHGRAHLTELPEGKASLYDILLT 634
Query: 856 ------WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
W C +CK I +++ V +S+ V + + W+V
Sbjct: 635 LHLNGEWHCSPECKGIATRMRERV---------SSVPV------------PLQGEYSWQV 673
Query: 910 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 969
LRGK D + T L A I + FDPI++ + DL+ AMVY + DY GMY A
Sbjct: 674 LRGK--DGTHATTWALKAAQEILTESFDPILDLVTGADLMMAMVYAQELGDWDYTGMYTA 731
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
+L V + +FR+FG++LAE+PLVAT + QG+ + L E L V++L L
Sbjct: 732 VLRRRGKAVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLMAAFEDYFRSLGVQSLCL 791
Query: 1030 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
P+A W + FGF+ +T EEQ ++ ++IF GT +LQKP+
Sbjct: 792 PAAQSTVETWIHGFGFAAITPEEQAATCSELRVLIFPGTELLQKPL 837
Score = 96.3 bits (238), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 113/411 (27%), Positives = 167/411 (40%), Gaps = 102/411 (24%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSG-ISCFCDDCKGNQVVTPAV-- 463
L+ L+ G L G V + G L G + G I+C C C+ + TP V
Sbjct: 230 LREVLKGGALRGQPVFFQSRH-----GDLLLNGSITEEGQIACPCKQCRAKK--TPGVSC 282
Query: 464 --FELHAGSSNKRPPEYIYLEN-GKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAGS +RP E IYL N +L++ + D G S+ + +
Sbjct: 283 SEFEEHAGSRERRPGESIYLTNLSISLKEFCALVNDE-------------GRSADRHGSA 329
Query: 521 CLNC--------------RVSFSNAGVEELM---LLCKSCVELKESQAGSAEIK-EPLSH 562
C C V AG+EE+ C +CV A A++K +PL
Sbjct: 330 CGLCMDGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDACV------ARGAQLKAKPLP- 382
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS-HGKITRKD 621
S +P P+ + E + + + + + M + +++ G ++
Sbjct: 383 -SPQKPSKPAKWRQPKQKAPKEKKHGGSGAAKKAAHAGAPRVHMAAPALRVVSGARRERN 441
Query: 622 LRMHKLVF---EEGGLEDGAEVGYFV-RGEKFLVGYKK-------GFGILCTCCNSEVSP 670
HK +F E GGL DG V Y +GE+ L G + GILC CCN +S
Sbjct: 442 SNKHKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCACCNGVISC 501
Query: 671 SQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS---LERPFSSKENDDLCGI------ 721
SQFEAHAG SRR P+ +I+T+ GVSL +L+ + E P S + LC +
Sbjct: 502 SQFEAHAGRGSRRAPYDNIFTAAGVSLRKLACLMPASEAESPISHRPA-ALCAVADRRAL 560
Query: 722 -----------CMDGGDLLC-----------------CDSCPRAFHIDCVS 744
+ GG +LC CD C R +HI C++
Sbjct: 561 EPELVTVSGEAALHGGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLA 611
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 61/108 (56%), Gaps = 9/108 (8%)
Query: 659 ILCTC--CNSEVSP----SQFEAHAGWASRRKPFQHIYTSN-GVSLHELSIKLSLERPFS 711
I C C C ++ +P S+FE HAG + R+P + IY +N +SL E ++ E S
Sbjct: 265 IACPCKQCRAKKTPGVSCSEFEEHAG-SRERRPGESIYLTNLSISLKEFCALVNDEG-RS 322
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ + CG+CMDGGDLLCCD CP A H C L +P G W C C+
Sbjct: 323 ADRHGSACGLCMDGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDACV 370
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 150/513 (29%), Positives = 238/513 (46%), Gaps = 97/513 (18%)
Query: 592 SPEPNSAQTSSHSKMKSS----SVKSHGKITRKDLR-----------------MHKLVFE 630
SP P +A T++HSK S+ S+ S KIT++ LR +H L+F+
Sbjct: 342 SPLP-TAVTTNHSKHDSTNLGLSLSSPVKITQRPLRNCSIDSKSKESKTRDTTLHPLIFK 400
Query: 631 EGGLEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
E GL D + Y ++ GE GYK+G I+C CCN E SPS FE HAG RR+P+ +I
Sbjct: 401 EDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNI 460
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 749
YT G+SLH+ + L L+ + D+ + L C R P
Sbjct: 461 YTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGRE--------PSTT 510
Query: 750 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 809
SG + P + + R+V+T C
Sbjct: 511 SGP------------------------------IVPLKRTLQE--RVVETE-----SCYF 533
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQELPKGKWLCCADCK 863
C T+I C+QCER H+ C + +E L+E +LCC +C+
Sbjct: 534 CGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQ 593
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 923
+L+ ++ G EK + +++ + ++ WR+L G +DAS +
Sbjct: 594 ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWRLLSG--MDASRDVKL 635
Query: 924 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMYCAILTVNQVVVSAGI 982
+ + + IF D F + S D+I MV G++ + +D+ GMYCA+LT + VVSA I
Sbjct: 636 YMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAI 693
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
++ +++AEL L+AT ++C+ +GYF L IE L NV L P E IW+ K
Sbjct: 694 LKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEK 753
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
GF++++ EE+ +PL++F+ ++QK +
Sbjct: 754 LGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 786
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 8/97 (8%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 115 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 167
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETL 502
H G S ++I+L+ G +L ++ K L L
Sbjct: 168 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNML 204
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 202/432 (46%), Gaps = 75/432 (17%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL------------- 704
GI C CCN + FE HAG +S P H++ +G+SL + ++
Sbjct: 607 GIKCMCCNEIFTVENFEVHAG-SSTPLPSAHMFLKDGMSLSQCLVEFMGGNKPRDPHPLH 665
Query: 705 -SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQ 763
L+ S E+D +C +C DGGDLL CD+CP ++H DCV L IP G W+C C
Sbjct: 666 ARLKGKNSDLESDSICSVCHDGGDLLLCDNCPSSYHHDCVGLEAIPEGNWYCPSCR---- 721
Query: 764 KEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRR 823
S C PDT S+F +
Sbjct: 722 --------------------------CSICNLSDYDPDT---------------SQFTEK 740
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
T++ CDQCEREYHVGC ++ + L P+G W C C + LQ+L+ + E
Sbjct: 741 TIVYCDQCEREYHVGCTRNSDNQ-LICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEG 799
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
I K +GSD+ D+D ++ A + L AV I H+ F IIE
Sbjct: 800 VSWTILKFCSGNGSDHG-DYD--------DEIMADHYGK--LCVAVGILHECFVTIIEPR 848
Query: 944 SKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1001
++ D+ +V+ R R ++ G Y +L +S G FRI GQ+ AELPL+ TS+
Sbjct: 849 TQSDISEDIVFNRESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTSSP 908
Query: 1002 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1061
+ QG + L +EKLL L V+ L+LP+ E WT FGF++M+ ++ + +
Sbjct: 909 YRRQGMCRLLINELEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNSDRLELAGN-S 967
Query: 1062 LMIFQGTSMLQK 1073
++ FQGT+M QK
Sbjct: 968 ILSFQGTTMCQK 979
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 6/57 (10%)
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTL 487
GPG+ G ++ GI C C N++ T FE+HAGSS P +++L++G +L
Sbjct: 595 GPGIK--EGSIRRDGIKCMC----CNEIFTVENFEVHAGSSTPLPSAHMFLKDGMSL 645
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 144/495 (29%), Positives = 224/495 (45%), Gaps = 85/495 (17%)
Query: 600 TSSHSKMKSSSVKSHGKITRKDLR---MHKLVFEEGGLEDGAEVGYFVR--GEKFLVGYK 654
T SH+++ SS+ HG+I R + L+ ++ L ++ Y R G + G
Sbjct: 680 TMSHARVLSST---HGQILPYQHRAKTVLSLLVDKNILLPRVKLTYKQRSDGPRLKEGAV 736
Query: 655 KGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL------------SI 702
GI C CCN + FE HAG S R P HI+ +G L + S+
Sbjct: 737 TKDGIKCRCCNELFTLESFEVHAG-CSTRLPAAHIFLKDGRPLSQCLVELMGENKPKESL 795
Query: 703 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
+ L+ +S E+D +C IC +GG++L CD+CP +FH CV L P G+W+C C
Sbjct: 796 HVRLKTNYSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWYCPSCR--- 852
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFC--KSRF 820
C +C D+ ++F
Sbjct: 853 --------------------------------------------CSICDSSDYDPDTNKF 868
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
+T++ CDQCEREYHVGC+++ G + L P+G W C C I LQ L+ +
Sbjct: 869 TEKTIMYCDQCEREYHVGCMRNKG-DQLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPTP 927
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
E I + E+ S + ++ GK L A+ + H+ F II
Sbjct: 928 VEGLSCTILRFDRENASQHGDFYNEIIAEQYGK-----------LCIALDVLHECFVTII 976
Query: 941 ESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
E +++ DL +V+ R R ++ G Y IL + ++S G FR+ G++ AELPL+ T
Sbjct: 977 EPSTRRDLSEDIVFNRESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAELPLIGT 1036
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ QG + L +EKLL L V+ LVLP+ + WT FGF M+ ++ +
Sbjct: 1037 RVQYRRQGMCRLLMNELEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAMSFSDRFELA- 1095
Query: 1059 DYPLMIFQGTSMLQK 1073
+ ++ FQGT++ QK
Sbjct: 1096 ESSILSFQGTTICQK 1110
Score = 43.9 bits (102), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 15/107 (14%)
Query: 439 GVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN--VCKD 496
G V GI C C N++ T FE+HAG S + P +I+L++G+ L + + ++
Sbjct: 734 GAVTKDGIKCRC----CNELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVELMGEN 789
Query: 497 SPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSC 543
P E+L VR+ S + + C C N G E +LLC +C
Sbjct: 790 KPKESLH--VRLKTNYSDTESDSICSIC-----NEGGE--ILLCDNC 827
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/445 (29%), Positives = 194/445 (43%), Gaps = 99/445 (22%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE---------- 707
GI+C+CC S S FEAH+G S+R+ I+ + G SL +L ++ + E
Sbjct: 28 GIVCSCCKKLWSLSGFEAHSG-TSQRRACASIFNNKGESLLDLQVQ-AWELLDSKVNPKE 85
Query: 708 --RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ S ENDD CG+C DGG L+CCD CP +H+ C+ L +P G W C C
Sbjct: 86 NVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR------ 139
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK--SRFGRR 823
C +C G ++ S F
Sbjct: 140 -----------------------------------------CAICGGSEYNADGSSFNEM 158
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
TV+LCDQCEREYHV CL GM + P W C C +I L+KLV
Sbjct: 159 TVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFEGLRKLV---------- 208
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGT----------RALLSKAVSIFH 933
G N + + W +LR + D + R+ L+ A+ +
Sbjct: 209 ------------GISNTIGEGLSWTLLRSGEDDLPSASSMNREQMAEHRSKLAVALGVMQ 256
Query: 934 DRFDPIIESASKLDLIPAMVYGRSH---RGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
+ F P+++ +K+DL+ ++Y R ++ G Y +L + V+S RI G L
Sbjct: 257 ECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLL 316
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AE+PL+ T + QG + L IE LL L +++ VLP+ E W N FGF M
Sbjct: 317 AEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAP 376
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPV 1075
++ + ++ F G ++LQKP+
Sbjct: 377 TQRLELVK-LSVVSFPGVTLLQKPL 400
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 192 bits (488), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 142/464 (30%), Positives = 213/464 (45%), Gaps = 78/464 (16%)
Query: 620 KDLRMHKLVFEEGGLEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAG 678
+D +H L+F+EGGL D + Y ++ GE GYK G GI+C CC+ E +PS FE HAG
Sbjct: 105 RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNCCSQEFAPSHFEEHAG 164
Query: 679 WASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICM-DGGDLLCCDSCPRA 737
RR+P+ +IYT G +LH+L+ L L+ +S D+ D +L C R
Sbjct: 165 MGRRRQPYHNIYTPEGSTLHKLA--LQLQDHLNSNGFDNASVSSFSDYPNLTSASGCGRQ 222
Query: 738 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 797
P SG P + R +
Sbjct: 223 --------PSTTSG---------------------------------PIVPLK----RTL 237
Query: 798 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQEL 851
Q E C C +I C+QCER HV C + +E L+E
Sbjct: 238 QGRVVETESCYFCGYGHTTIGNIDPDMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEY 297
Query: 852 PKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR 911
++LCC +C Q L DR EE L +K EE + ++ WR+L
Sbjct: 298 VCFRFLCCQEC-------QLLRDRLEEGL---------EKCEEIAFLRRIRSNICWRLLS 341
Query: 912 GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL-DLIPAMVYGRS-HRGQDYHGMYCA 969
G +DAS + + + + IF D F +ES + D+ MV ++ + +D+ GMYCA
Sbjct: 342 G--MDASRDVKLFMPQVIDIFKDAF---VESTDEHSDIFSDMVNCKNGDQEKDFRGMYCA 396
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
+LT + VVSA I ++ +++AEL L+AT +C+ +GYF L IE L NV L
Sbjct: 397 LLTASTHVVSAAILKVRMEQIAELVLIATRRECRKKGYFILLLKSIEANLRAWNVSLLTA 456
Query: 1030 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
P E IW+ K GF++++ EE+ +PL++F+ ++QK
Sbjct: 457 PVDPEMAQIWSEKLGFTILSAEEKESVLESHPLVMFKNLVLVQK 500
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 137/475 (28%), Positives = 204/475 (42%), Gaps = 97/475 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEK--FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
+ E L + +V Y + + L G + GI+C+CC S S FEAH+G S+R+
Sbjct: 21 LLETKMLHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSG-TSQRRA 79
Query: 686 FQHIYTSNGVSLHELSIKLSLE------------RPFSSKENDDLCGICMDGGDLLCCDS 733
I+ + G SL +L ++ + E + S ENDD CG+C DGG L+CCD
Sbjct: 80 CASIFNNKGESLLDLQVQ-AWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDH 138
Query: 734 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 793
CP +H+ C+ L +P G W C C EYNA+
Sbjct: 139 CPSTYHLSCLLLKELPEGEWFCPSCRCAICGGS--EYNADG------------------- 177
Query: 794 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
S F TV+LCDQCEREYHV CL GM + P
Sbjct: 178 ------------------------SSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPD 213
Query: 854 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 913
W C C +I L+KLV G N + + W +LR
Sbjct: 214 DSWFCGDHCDKIFQGLRKLV----------------------GISNNIGEGLSWTLLRSG 251
Query: 914 KVDASDGT----------RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH---RG 960
+ D R+ L+ A+ + + F P+++ +K+DL+ ++Y R
Sbjct: 252 EDDLPSANSMNREQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 311
Query: 961 QDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLG 1020
++ G Y +L + V+S RI G LAE+PL+ T + QG + L IE LL
Sbjct: 312 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQ 371
Query: 1021 FLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
L +++ VLP+ E W N FGF M ++ + ++ F G ++LQKP+
Sbjct: 372 RLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLELVK-LSVVSFPGVTLLQKPL 425
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/277 (36%), Positives = 152/277 (54%), Gaps = 31/277 (11%)
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 866
CVLC DF + F +TV++CDQCE+EYH+GCL+ H M D+Q +P+G+W C +C RI
Sbjct: 1512 CVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVRIR 1571
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
AL + V GE +P RW++LRGK + T LS
Sbjct: 1572 DALGEDVAAGEVLMPGNPA-------------------YRWQILRGK--NGRQQTWHALS 1610
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV-------VVS 979
++I + FDPII++ S DL+PAMV + D+ GMY +L V
Sbjct: 1611 TVLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVL 1670
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCI-EKLLGFLNVKTLVLPSASEAQAI 1038
A +FR+ G +AE+PLVAT DC+ QG+ ++L + KLLG L V+ +VLP+ ++AQ
Sbjct: 1671 AALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLG-LGVRAIVLPATADAQPA 1729
Query: 1039 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
W + F + E R+++ ++IF T++L +P+
Sbjct: 1730 W-RQLQFQDLDEPSTRVARSEHRMVIFPHTTVLARPL 1765
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 632 GGLEDGAEVGYFVRGEKFLVGY-------KKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
GGL+ V Y ++G++ L G GILC C++ +S S FEAHAG RR
Sbjct: 1359 GGLKHDDIVWYTLQGKRLLRGVVQLQACGAATSGILCGHCDTIISASAFEAHAGHKHRRN 1418
Query: 685 PFQHIYTSNGVSLHELSIKL 704
P++ I T +G++L ++ +L
Sbjct: 1419 PYEAILTDDGMTLKRIAERL 1438
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 135/456 (29%), Positives = 200/456 (43%), Gaps = 99/456 (21%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 704
R + L G+ GI C CC+ ++ S+FE HAG + R+PFQ+IY +GVSL E I
Sbjct: 679 RTKVMLEGWVTRDGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLDSGVSLLECQIDA 737
Query: 705 -----SLER-PFSS------KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 752
S+ER F S NDD CGIC DGGDL+CCD CP FH C+ + +P G
Sbjct: 738 WNRQESIERIGFHSVNTDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIMMLPPGD 797
Query: 753 WHCRYCMNTF---QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 809
WHC C F E FV+ + T +EL C L
Sbjct: 798 WHCPNCTCKFCGIASEDFVQEDG--------------------------TNVSELLTCSL 831
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 869
C ++YH CL+D + C C+ + L
Sbjct: 832 ---------------------CAKKYHKSCLQDVDAPCIDFNNSTPCFCGKTCRELFEQL 870
Query: 870 QKLVDRGEEKLPETSLDVIKKKHE-ESGSDNAVDFDVRWRVLRGKKVDASDGTRAL---- 924
QK + KHE ESG W ++ +D + L
Sbjct: 871 QKY---------------LGIKHELESG--------FSWSLVHRMDIDLDMSLQGLPQRV 907
Query: 925 -----LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVV 977
L+ A+S+ + F PI++ S +++I ++Y G + +Y G Y AIL +
Sbjct: 908 ECNSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNRLNYSGFYAAILERGDEI 967
Query: 978 VSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQA 1037
+SA R G +LAE+P + T + + QG + LF IE L L V+ L++P+ SE
Sbjct: 968 ISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRRLFSAIESALCSLKVQKLIIPAISELTH 1027
Query: 1038 IWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
WT FGF+ +++ + + ++ +++F G MLQK
Sbjct: 1028 TWTGVFGFTTLSDSLKQELKS-MNMLVFPGIDMLQK 1062
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 129/441 (29%), Positives = 194/441 (43%), Gaps = 85/441 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL--------SLERP 709
GI C+CC S S+FEAHAG +S + +I+ +G SL E +++ +
Sbjct: 594 GIKCSCCQEVFSLSRFEAHAG-SSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKES 652
Query: 710 FSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
FS K END +C +C GGDL+ CD CP +FH C+ L G W C C
Sbjct: 653 FSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSCC--- 709
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 822
C +C F
Sbjct: 710 --------------------------------------------CGICGENKFDGGSEQD 725
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEE 878
V C QCER+YHVGCL+ G L P G W C CK+I L LQKL+ + G +
Sbjct: 726 NVVFSCYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 785
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
L T L I+ K ++ D+ ++A + L+ A+ + H+ F+P
Sbjct: 786 NLTWTLLKPIRSK--------GLEIDL-------PDIEALTEVYSKLNIALGVMHECFEP 830
Query: 939 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+ E ++ D++ +++ G ++ G Y +L N ++S R++G+++AE+PL+
Sbjct: 831 VKEPHTRRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLI 890
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
T + G L +EK L L V+ LVLP+ WT FGFS MT+ E+ ++
Sbjct: 891 GTRFQYRRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRF 950
Query: 1057 RNDYPLMIFQGTSMLQKPVPK 1077
DY + FQ T M QK + K
Sbjct: 951 L-DYSFLDFQDTVMCQKLLMK 970
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 131/441 (29%), Positives = 204/441 (46%), Gaps = 85/441 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK-------------- 703
GI C+CC+ ++ S+FE HAG + +P ++IY G SL + ++
Sbjct: 571 GIHCSCCDEVITISKFEMHAG-SRVGQPLENIYVHTGSSLLQCLLESWNKQNEPQCKGYN 629
Query: 704 ---LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
+ +E P NDD CGIC DGGDL+CCDSCP FH C+ + PSG WHC YC
Sbjct: 630 FVDVDVEDP-----NDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC-- 682
Query: 761 TFQKEKFVEYNANARAAGRIE-GVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
+ + G++ G+ P E VLC+
Sbjct: 683 ------------SCKVCGQVTIGLHPMDD------------HHEAAADVLCK-------- 710
Query: 820 FGRRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
CD CE +YH C++ ++ D P C C+ ++ LQ+L+ ++
Sbjct: 711 --------CDLCEEKYHPICVQMNNASGDDVNNP---LFCGKKCQMLHERLQRLLGVRQD 759
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
S +I++ SD VD DV +K+ + + L+ A+ + + F P
Sbjct: 760 MKEGFSWTLIRR------SD--VDSDVSLCNEVAQKIKCN----SELAVALFVMDECFLP 807
Query: 939 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+I+ S ++LI ++Y G + ++ G Y AIL + V+ A RI G ELAE+P +
Sbjct: 808 VIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFI 867
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
T + QG + IE +L LNV+ LV+P+ SE + W + FGF + E + +
Sbjct: 868 GTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDETTKQRM 927
Query: 1057 RNDYPLMIFQGTSMLQKPVPK 1077
R L++F G MLQK + K
Sbjct: 928 RK-MSLLVFPGVEMLQKLLLK 947
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGIL---EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L ++ GIL E + M R ++VK L G + GI C C D +V+T +
Sbjct: 536 LAWMIDLGILSLDEKVQYMNQRKTRVK------LEGRLTRDGIHCSCCD----EVITISK 585
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P E IY+ G +L
Sbjct: 586 FEMHAGSRVGQPLENIYVHTGSSL 609
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 181 bits (458), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 202/437 (46%), Gaps = 85/437 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK-------------- 703
GI C+CC+ ++ S+FE HAG + +P ++IY G SL + ++
Sbjct: 598 GIHCSCCDEVITISKFEMHAG-SRVGQPLENIYVHTGSSLLQCLLESWNKQNEPQCKGYN 656
Query: 704 ---LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
+ +E P NDD CGIC DGGDL+CCDSCP FH C+ + PSG WHC YC
Sbjct: 657 FVDVDVEDP-----NDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC-- 709
Query: 761 TFQKEKFVEYNANARAAGRIE-GVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
+ + G++ G+ P E VLC+
Sbjct: 710 ------------SCKVCGQVTIGLHPMDD------------HHEAAADVLCK-------- 737
Query: 820 FGRRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
CD CE +YH C++ ++ D P C C+ ++ LQ+L+ ++
Sbjct: 738 --------CDLCEEKYHPICVQMNNASGDDVNNP---LFCGKKCQMLHERLQRLLGVRQD 786
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
S +I++ SD VD DV +K+ + + L+ A+ + + F P
Sbjct: 787 MKEGFSWTLIRR------SD--VDSDVSLCNEVAQKIKCN----SELAVALFVMDECFLP 834
Query: 939 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+I+ S ++LI ++Y G + ++ G Y AIL + V+ A RI G ELAE+P +
Sbjct: 835 VIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFI 894
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
T + QG + IE +L LNV+ LV+P+ SE + W + FGF + E + +
Sbjct: 895 GTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDETTKQRM 954
Query: 1057 RNDYPLMIFQGTSMLQK 1073
R L++F G MLQK
Sbjct: 955 RK-MSLLVFPGVEMLQK 970
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGIL---EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L ++ GIL E + M R ++VK L G + GI C C D +V+T +
Sbjct: 563 LAWMIDLGILSLDEKVQYMNQRKTRVK------LEGRLTRDGIHCSCCD----EVITISK 612
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P E IY+ G +L
Sbjct: 613 FEMHAGSRVGQPLENIYVHTGSSL 636
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 133/472 (28%), Positives = 201/472 (42%), Gaps = 105/472 (22%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGW---ASRRKPFQHIYTSNGVSLHELSIKL---- 704
G+ G GI CTCCN ++ ++F AHA + RR+ + ++ +G SL + ++L
Sbjct: 422 GFITGEGIRCTCCNKTLTVAEFAAHATARRGSDRREAWARVFLKDGRSLSQCLVELMRRD 481
Query: 705 -----------SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 753
++ S E D +C IC DGGDLL CD+CP AFH CV L P G W
Sbjct: 482 VAVVAARNGDVRVKEKCSDPEGDSVCSICNDGGDLLLCDNCPSAFHHACVGLQATPEGDW 541
Query: 754 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 813
C C C +C G
Sbjct: 542 FCPSCR-----------------------------------------------CGVCGGS 554
Query: 814 DFCKSR-----FGRRTVILCDQCEREYHVGCLKDHGM--------------EDLQELPKG 854
DF + F +T+I CDQCEREYHVGC++ G E+ +E P
Sbjct: 555 DFDATAAGGGGFTDKTIIYCDQCEREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWP-- 612
Query: 855 KWLCCADCKRINLALQKLVDRGEEK---LPET--------SLDVIKKKHEESGSDNAVDF 903
WLC +C + LQ L E+ +P T SL +++++ S A
Sbjct: 613 -WLCSPECGEVFRHLQGLAAVARERSIPIPTTVPTTVEGVSLSILRRRRRRPISMVATGS 671
Query: 904 DVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQ 961
+ ++ + L A+ + H+ F +IE ++ DL +V+ R R
Sbjct: 672 GCQ-----EEEEEEDAAEHGQLCSALDVLHECFVTLIEPRTQTDLTADIVFNRESELRRL 726
Query: 962 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1021
++ G Y L +++ G R+ G E+AELPLV T + QG L +EK+L
Sbjct: 727 NFRGYYVVGLEKAGELITVGTLRVLGTEVAELPLVGTRFAHRRQGMCHLLVTELEKVLRQ 786
Query: 1022 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+ V+ LVLP+ E +WT GF MT + + ++ ++ FQGT+M K
Sbjct: 787 VGVRRLVLPAVPELLPMWTASLGFHPMTRSDVMEIAAEHAILSFQGTTMCHK 838
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 180 bits (456), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 131/446 (29%), Positives = 208/446 (46%), Gaps = 78/446 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 701
L G+ GI C+CCN S +F HAG + KP+++I +G+ + L +
Sbjct: 922 LLEGFATRDGINCSCCNEVYSVLEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 979
Query: 702 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
++ +ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 980 MQSDVERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWC 1039
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F E + G + A + D
Sbjct: 1040 CANCSCKFCHEHSSD------------GAEDTADV------------------------D 1063
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
+ ++ C QCE +YH C + G C C+ + LQ L+
Sbjct: 1064 Y--------SLHTCSQCEEQYHEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLA 1115
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRALLSKAVSIFH 933
++ PE S V+++ HE DV VL K+V+ + + ++ A+S+
Sbjct: 1116 VKKDLEPEYSCRVVQRIHE----------DVPEEVLALDKRVECN----SRIAVALSLMD 1161
Query: 934 DRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELA 991
+ F PII+ + ++LI +VY G + D+ G Y IL +++A RI G +LA
Sbjct: 1162 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 1221
Query: 992 ELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEE 1051
E+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + +
Sbjct: 1222 EMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDS 1281
Query: 1052 EQNKYRNDYPLMIFQGTSMLQKPVPK 1077
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1282 EKQEVKS-ISMLVFPGTGLLQKPLLK 1306
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/471 (27%), Positives = 205/471 (43%), Gaps = 96/471 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ + G +E +V Y R + L G+ GI C CC+ ++ S+FE HAG + +P+Q
Sbjct: 610 LIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQ 668
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSCP 735
+IY +GVSL + I + + K NDD CGIC DGGDL+CCD CP
Sbjct: 669 NIYLESGVSLLQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDGCP 728
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH C+ + +P G WHC C F C
Sbjct: 729 STFHQSCLDIQMLPPGEWHCPNCTCKF------------------------------CGI 758
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
+T D + + RT IL CE++YH C K+ M+ L
Sbjct: 759 ASETSDKDDASVNVL------------RTCIL---CEKKYHDSCTKE--MDTLPNNINSS 801
Query: 856 WL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 913
L C +CK ++ L+K + G+ + ++ W ++
Sbjct: 802 SLSFCGKECKELSEYLKKYL----------------------GTKHELEAGFSWCLIHRS 839
Query: 914 KVDASDGTRAL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 962
D+ R L L+ A+++ + F P+I+ S ++LI ++Y G +
Sbjct: 840 DEDSEAACRGLTQRVECNSKLAIALTVMDECFLPVIDRRSGINLIRNILYNSGSNFSRLS 899
Query: 963 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1022
Y G Y AIL +++A R G ++AE+P + T + + QG + LF IE L L
Sbjct: 900 YSGFYTAILERGDEIIAAASIRFHGTKIAEMPFIGTRHIYRRQGMCRRLFSAIELALCSL 959
Query: 1023 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
V+ LV+P+ +E WT FGF+ + E + + ++ +M+F G MLQK
Sbjct: 960 KVEKLVIPAVAELTHTWTTVFGFTYLDESLRQEMKS-LNMMVFPGIDMLQK 1009
Score = 43.5 bits (101), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 607 LAWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 656
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 657 LHAGSKLPQPYQNIYLESGVSL 678
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 206/447 (46%), Gaps = 82/447 (18%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 704
R + L G+ GI C CC+ ++ S+FE HAG + R+PFQ+I +GVSL + +
Sbjct: 417 RTKVMLEGWITRDGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDA 475
Query: 705 SLERPFSSKEN------------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 752
+ S + DD CGIC DGGDL+CCD CP FH C+++ +PSG
Sbjct: 476 WNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGD 535
Query: 753 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 812
WHC C F G +G + DT +
Sbjct: 536 WHCPNCTCKF--------------CGMADGSN-------------AEDDTTVS------- 561
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQELPKGKWLCCADCKRINLALQ 870
++ C CE++YH C++ D + D C C+ + LQ
Sbjct: 562 -----------ELVTCSLCEKKYHTSCIQGVDAVLSDTNN--PSTSFCGQGCRELFEHLQ 608
Query: 871 KLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKVDASDGTRALLSKA 928
K + +E S +I + + GSD +V RG ++V+++ + L+ A
Sbjct: 609 KFIGVKQELEAGFSWSLIHRT--DPGSDTSV---------RGFPQRVESN----SKLAIA 653
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIF 986
+++ + F I++ S+++LI ++Y R +Y G Y AIL ++ A RI
Sbjct: 654 LTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIH 713
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G +LAE+P + T + + QG + LFC IE L L V+ L++P+ SE WT FGF+
Sbjct: 714 GTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFN 773
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+ E + + R+ +++F GT MLQK
Sbjct: 774 PLEESHKQELRS-LNMLVFPGTDMLQK 799
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 206/447 (46%), Gaps = 82/447 (18%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 704
R + L G+ GI C CC+ ++ S+FE HAG + R+PFQ+I +GVSL + +
Sbjct: 585 RTKVMLEGWITRDGIHCRCCSKILTVSKFEIHAG-SKLRQPFQNIVLDSGVSLLQCQVDA 643
Query: 705 SLERPFSSKEN------------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 752
+ S + DD CGIC DGGDL+CCD CP FH C+++ +PSG
Sbjct: 644 WNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDGCPSTFHQSCLNIQMLPSGD 703
Query: 753 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 812
WHC C F G +G + DT +
Sbjct: 704 WHCPNCTCKF--------------CGMADGSN-------------AEDDTTVS------- 729
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQELPKGKWLCCADCKRINLALQ 870
++ C CE++YH C++ D + D C C+ + LQ
Sbjct: 730 -----------ELVTCSLCEKKYHTSCIQGVDAVLSDTNN--PSTSFCGQGCRELFEHLQ 776
Query: 871 KLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKVDASDGTRALLSKA 928
K + +E S +I + + GSD +V RG ++V+++ + L+ A
Sbjct: 777 KFIGVKQELEAGFSWSLIHRT--DPGSDTSV---------RGFPQRVESN----SKLAIA 821
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIF 986
+++ + F I++ S+++LI ++Y R +Y G Y AIL ++ A RI
Sbjct: 822 LTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGFYTAILERGDEIICAASIRIH 881
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G +LAE+P + T + + QG + LFC IE L L V+ L++P+ SE WT FGF+
Sbjct: 882 GTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEMLIIPAISELMHTWTVGFGFN 941
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+ E + + R+ +++F GT MLQK
Sbjct: 942 PLEESHKQELRS-LNMLVFPGTDMLQK 967
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 81/131 (61%), Positives = 105/131 (80%), Gaps = 1/131 (0%)
Query: 952 MVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1011
MVYGR+ RGQ++ GMYCA+L VN VVSA + RIFG ++AELPLVATSN G+GYFQ+L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 1012 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSM 1070
F CIE+LL F+NVK+LVLP+A EA++IWT+KFGFS M +E + YR ++ M+ F+GT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 1071 LQKPVPKCRIV 1081
L K VP CRI+
Sbjct: 121 LHKLVPPCRII 131
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 176 bits (446), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 132/472 (27%), Positives = 205/472 (43%), Gaps = 98/472 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ + G +E +V Y R + L G+ GI C CC+ ++ S+FE HAG + +P+Q
Sbjct: 611 LIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQ 669
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSCP 735
+IY +GVSL + I + + K NDD CGIC DGGDL+CCD CP
Sbjct: 670 NIYLESGVSLLQCQIDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGCP 729
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH C+ + +P G W C C F A+G E D
Sbjct: 730 STFHQSCLDIQMLPPGEWRCMNCTCKFC----------GIASGTSEKDDASV-------- 771
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
CVL +C+ CE++YH C K+ M+ L
Sbjct: 772 -----------CVL----------------HICNLCEKKYHDSCTKE--MDTLPNNINSS 802
Query: 856 WL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE-ESGSDNAVDFDVRWRVLRG 912
L C +CK ++ L+K + KHE ESG W ++
Sbjct: 803 SLSFCGKECKELSEHLKKY---------------LGTKHELESG--------FSWSLIHR 839
Query: 913 KKVDASDGTRAL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQ 961
D+ R + L+ +++ + F P+I+ S ++LI ++Y G +
Sbjct: 840 TDDDSEAACRGISQRVECNSKLAITLTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRL 899
Query: 962 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1021
Y G Y AIL +++A R G ++AE+P + T + + QG + LF IE L
Sbjct: 900 SYSGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHIYRRQGMCRRLFSAIESTLCS 959
Query: 1022 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
L V+ LV+P+ +E WT FGF+ + + + + ++ +M+F G MLQK
Sbjct: 960 LKVEKLVIPAIAEVTNTWTTVFGFTHLDKSLRQEMKS-LNMMVFPGIDMLQK 1010
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 608 LAWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 657
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 658 LHAGSKLPQPYQNIYLESGVSL 679
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 879 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 936
Query: 707 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 937 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 996
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F +E + + A I VD
Sbjct: 997 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 1021
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 1022 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 1072
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 1073 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 1119
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 1120 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 1179
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1180 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1239
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 1240 KQEVKS-TSMLVFPGTGLLQKPLLK 1263
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 137/492 (27%), Positives = 210/492 (42%), Gaps = 87/492 (17%)
Query: 604 SKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLV-----GYKKGFG 658
SK ++ V + I K L + + + + +V Y V+G V G G
Sbjct: 609 SKKRAQKVSAPSLINHKPLNVLSYLIDNSIILPRCKVYYKVKGRHRKVCTLADGKITRDG 668
Query: 659 ILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP--------- 709
I C CC S FE HA +S +P I+ +G SL + IK+ +
Sbjct: 669 IKCNCCMGIYSFVGFENHASGSSTCRPSARIFLEDGRSLLDCQIKMMHDHKTRETTGKSF 728
Query: 710 --FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 767
S END +C +C GG+L+ CD CP +FH C+ L IP+G W C C ++
Sbjct: 729 SGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR- 787
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
+I+G D Q++
Sbjct: 788 -----------KIDGDDEVGQLLP------------------------------------ 800
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL----VDRGEEKLPET 883
C QCE +YHV CL ++G D+ G W C DC++I L KL V G + L T
Sbjct: 801 CIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWT 859
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
+ I E S K D + + L+ A+S+ H+ F+P+ ES
Sbjct: 860 LVKFINPDSCEHDSS---------------KSDLLAESYSKLNLAISVMHECFEPLKESL 904
Query: 944 SKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1001
+ DL+ +++ R ++ G Y +L N+ ++S R++G+++AE+PLV T
Sbjct: 905 TNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPLVGTRLQ 964
Query: 1002 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1061
+ +G L +EK L L V+ LVLP+ WT FGF+ MT E++++ DY
Sbjct: 965 YRRRGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQFL-DYT 1023
Query: 1062 LMIFQGTSMLQK 1073
+ FQG M QK
Sbjct: 1024 FLDFQGAIMCQK 1035
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 135/464 (29%), Positives = 204/464 (43%), Gaps = 78/464 (16%)
Query: 632 GGLEDGAEVGYFVR--GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
G + A+V Y R L G+ GI C CC+ + S+FE HAG +P Q+I
Sbjct: 152 GTVPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAG-MKLCEPSQNI 210
Query: 690 YTSNGVSLHELSIK------------LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRA 737
G+SL + + L + NDD CGIC DGGDL+CCD CP
Sbjct: 211 ILETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPST 270
Query: 738 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 797
FH C+ + PSG WHC YC F F+ + +
Sbjct: 271 FHQSCLDIQKFPSGDWHCIYCSCKF--------------------CGMFSGNTDQMNYNL 310
Query: 798 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 857
D+ L + C CE +YH C + G + + +
Sbjct: 311 DVNDSAL---------------------LTCQLCEEKYHHMCTQ--GEDSILDDSSSPSF 347
Query: 858 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKV 915
C C+ + LQ L+ E S ++++ V FD+ L G +KV
Sbjct: 348 CGKTCRELFEQLQMLLGVKHELEDGFSWTLVQR--------TEVGFDIS---LNGIPQKV 396
Query: 916 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTV 973
+ + + L+ A+SI + F PI++ S ++LI ++Y G + +Y G + AIL
Sbjct: 397 ECN----SKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILER 452
Query: 974 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1033
+ ++SA RI G +LAE+P + T + + QG + L IE L LNV+ LV+P+ S
Sbjct: 453 GEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAIS 512
Query: 1034 EAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
E WT+ FGF + + + RN +++F GT MLQKP+ K
Sbjct: 513 ELMQTWTSVFGFKPLEVSSRKEMRN-MNMLVFHGTDMLQKPLLK 555
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 143/272 (52%), Gaps = 29/272 (10%)
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 866
CVLC +F + F +TV++CDQCE+EYH+GCL+ H M D+Q +P+G+W C +C RI
Sbjct: 1786 CVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRKHKMVDMQAVPEGEWFCSDECVRIR 1845
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
L K ++ GE + S N RW+ +RG+ D + T L
Sbjct: 1846 ELLTKSLEEGETTM----------------SGNPA---YRWQFIRGR--DGTKATARALK 1884
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV-------VVS 979
+ I + FDPII++ S DL+P MV+ S D+ GMY +L V
Sbjct: 1885 TVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQGMYSILLRYRGADKEARGRPVL 1944
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
AG+ R+ G +AE+PLVAT DC+ QG+ ++L + L L V+ +VLP+ ++A W
Sbjct: 1945 AGLVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHRLIALGVRAMVLPATADALPAW 2004
Query: 1040 TNKFGFSMMTEEEQNKYRNDYPLMIFQGTSML 1071
+ F + E R ++ ++IF TS++
Sbjct: 2005 -RQLAFMDLDEGSVRVARGEHRMIIFPHTSVV 2035
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 61/108 (56%), Gaps = 10/108 (9%)
Query: 606 MKSSSVKSHGKITRKDLRMHKLVFEEG--GLEDGAEVGYFVRGEKFLVGY------KKGF 657
M++S+ ++ R+ +R L F+ G L+DG V Y ++G++ L G G
Sbjct: 1507 MRASAARTSAS-QRRPIRSRTL-FDGGPDSLQDGERVHYTIQGQRLLSGTVVIVQRTAGS 1564
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS 705
GILC CC+ +S S FE+HAG RR P++ I T NG +L ++ +++
Sbjct: 1565 GILCDCCSKVISASAFESHAGHKHRRNPYESIMTDNGTTLKRIAERIT 1612
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 724 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 781
Query: 707 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 782 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 841
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F +E + + A I VD
Sbjct: 842 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 866
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 867 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 917
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 918 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 964
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 965 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 1024
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1025 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1084
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 1085 KQEVKS-TSMLVFPGTGLLQKPLLK 1108
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 175 bits (443), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 88/441 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL--------SLERP 709
GI C+CC S S+FEAHAG +S + +I+ +G SL E +++ +
Sbjct: 849 GIKCSCCQEVFSLSRFEAHAG-SSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKES 907
Query: 710 FSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
FS K END +C +C GGDL+ CD CP +FH C+ L +P G W C C
Sbjct: 908 FSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSCC--- 964
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 822
C +C F
Sbjct: 965 --------------------------------------------CGICGENKFDGGSEQD 980
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEE 878
V C QCER+ CL+ G L P G W C CK+I L LQKL+ + G +
Sbjct: 981 NVVFSCYQCERQ---CCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 1037
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
L T L I+ K ++ D+ ++A + L+ A+ + H+ F+P
Sbjct: 1038 NLTWTLLKPIRSK--------GLEIDL-------PDIEALTEVYSKLNIALGVMHECFEP 1082
Query: 939 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+ E ++ D++ +++ G ++ G Y +L N ++S R++G+++AE+PL+
Sbjct: 1083 VKEPHTRRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLI 1142
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
T + G L +EK L L V+ LVLP+ WT FGFS MT+ E+ ++
Sbjct: 1143 GTRFQYRRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRF 1202
Query: 1057 RNDYPLMIFQGTSMLQKPVPK 1077
DY + FQ T M QK + K
Sbjct: 1203 L-DYSFLDFQDTVMCQKLLMK 1222
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/445 (28%), Positives = 203/445 (45%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER 708
L G GI C+CC+ +S +F AHAG + P+++I +G + L ++
Sbjct: 923 LLEGIVTRDGIDCSCCSKVLSVLEFVAHAG-SEVNTPYRNILV-DGQDIDLLHCLINAWN 980
Query: 709 PFSSKE--------------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
S E NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 981 MQSDAEKQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWC 1040
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C KF ++N +G + A + S
Sbjct: 1041 CANCSC-----KFCHEHSN-------DGAEDTADVDS----------------------- 1065
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH C ++ G C C+ + LQ L+
Sbjct: 1066 ---------SLHTCSQCEEQYHEACSPENDSITNLSSQTGNLFCQQSCRLLFEELQNLLA 1116
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S V+++ HE DV VL +D + ++ A+S+ +
Sbjct: 1117 VKKDLEPEYSCRVVQRIHE----------DVPEEVL---PLDTRVECNSKIAVALSLMDE 1163
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+ G Y IL +++A RI G +LAE
Sbjct: 1164 CFLPIVDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAE 1223
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT++FGFS + + E
Sbjct: 1224 MPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSE 1283
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 1284 KEEVKS-ISMLVFPGTGLLQKPLLK 1307
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 132/464 (28%), Positives = 211/464 (45%), Gaps = 78/464 (16%)
Query: 628 VFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ + G +E +V Y R + L G+ GI C CC+ ++ S+FE HAG + +P+Q
Sbjct: 626 LIDSGTVELSQKVQYRRRKKVMLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPYQ 684
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSCP 735
+IY +GVSL + I+ + S K NDD CGIC DGGDL+CCD CP
Sbjct: 685 NIYLESGVSLLQCQIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGCP 744
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH C+ + +P G WHC C F A+G E D +
Sbjct: 745 STFHQSCLDIQMLPLGEWHCPNCTCKFC----------GIASGNSEKDDA-------SVY 787
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
++Q +C+ CE++YH C K+ M++L
Sbjct: 788 VLQ----------------------------ICNLCEKKYHDSCTKE--MDNLPNNINTS 817
Query: 856 WL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 913
L C +CK ++ L+K + E S +I + E+S A + RV
Sbjct: 818 SLSFCGKECKELSEHLKKYLGTKHELEAGFSWSLIHRIDEDS---EAACRGISQRVECNS 874
Query: 914 KVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAIL 971
K L+ A+++ + F P+I+ S ++LI ++Y G + +Y G Y A L
Sbjct: 875 K----------LAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTATL 924
Query: 972 TVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPS 1031
++++ R G ++AE+P + T + + QG + LF IE L L V+ LV+P+
Sbjct: 925 ERGDEIIASASIRFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIPA 984
Query: 1032 ASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+E WT FGF+ + E + + ++ +M+F G ML KP+
Sbjct: 985 IAELTNTWTTVFGFTHLDESLRQEMKS-LNMMVFPGIDMLMKPL 1027
Score = 43.1 bits (100), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 623 LSWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 672
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 673 LHAGSKLPQPYQNIYLESGVSL 694
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 132/447 (29%), Positives = 203/447 (45%), Gaps = 81/447 (18%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + + +F AHAG KP++++ +G+ L +S
Sbjct: 896 LLEGLVTRDGIQCRCCSKDFALLEFVAHAG-GQVSKPYRNVLV-DGLDKDLLHCLISAWD 953
Query: 707 -----ER----PFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 954 KQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWR 1013
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C KF + ++N A A++ S C
Sbjct: 1014 CANCCC-----KFCQEHSNDDAPD-------IAEVDSLCT-------------------- 1041
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK--GKWLCCADCKRINLALQKL 872
C QCE YH C + E+ +P G C C+ + LQ L
Sbjct: 1042 -------------CSQCEENYHPVCSPE--TENPSSVPSQAGDLFCQQSCRLLFEELQNL 1086
Query: 873 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 932
+ ++ PE + +IK HE+ + A+ D R + ++ A+S+
Sbjct: 1087 LAVKKDLEPEFACRIIKCIHEDV-PETALALDERVEC------------NSKIAVALSLM 1133
Query: 933 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
+ F PII+ + ++LI +VY G + D+ G Y IL +VSA RI G +
Sbjct: 1134 DECFLPIIDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKC 1193
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AE+P + T N + QG + L IE +L L V+ L++P+ SE WT+KFGFS +
Sbjct: 1194 AEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLKVQKLIIPAISELVDTWTSKFGFSPLEV 1253
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1254 SEKQEVKS-ISMLVFPGTGLLQKPLLK 1279
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 124/445 (27%), Positives = 199/445 (44%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 860 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 917
Query: 707 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 918 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 977
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F +E + + A I VD
Sbjct: 978 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 1002
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 1003 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 1053
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 1054 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 1100
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+ G Y +L +++A RI G +LAE
Sbjct: 1101 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAE 1160
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1161 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1220
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 1221 KQEVKS-TSMLVFPGTGLLQKPLLK 1244
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 332 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 389
Query: 707 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 390 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 449
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F +E + + A I VD
Sbjct: 450 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 474
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 475 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 525
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 526 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 572
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 573 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 632
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 633 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 692
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 693 KQEVKS-TSMLVFPGTGLLQKPLLK 716
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/469 (26%), Positives = 198/469 (42%), Gaps = 96/469 (20%)
Query: 652 GYKKGFGIL-CTCCNSEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKL-- 704
G+ G GI+ C CCN + ++ EAHA G RR+ + ++ +G SL ++L
Sbjct: 352 GFITGEGIIRCKCCNKTFTVAELEAHATGGIGTDDRREAWARVFVEDGRSLSLCLMELMR 411
Query: 705 ------------------SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 746
++ S +E D +C +C+D G+LL CD CP AFH CV L
Sbjct: 412 RDDVGAAAANRNRNGSVMRVKEKCSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQ 471
Query: 747 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 806
P G W C C
Sbjct: 472 ATPEGDWCCPLCR----------------------------------------------- 484
Query: 807 CVLCRGRDF---CKSRFGRRTVILCDQCEREYHVGCLKDHGMED------LQELPKGK-- 855
C +C G D F +T+I C+QCEREYHVGC++ G E+ + L + +
Sbjct: 485 CGVCGGSDLDDDTAEGFTDKTIIYCEQCEREYHVGCMRRGGSEEESAAEWCRRLSESEGP 544
Query: 856 ------WLCCADCKRINLALQKLVDRGEEK-LPETSLDVIKKKHEESGSDNAVDFDVRWR 908
WLC +C + LQ LV + +P S + + + R
Sbjct: 545 EEEWRPWLCSPECGEVFQHLQALVASSRARSIPHYS----RGAYHSAPCGRRRYMSTVTR 600
Query: 909 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGM 966
+ R + + L A+ + H+ FD ++E ++ DL +V+ + R ++ G
Sbjct: 601 ITRWQHEEEDAADHGQLCAALDVLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGY 660
Query: 967 YCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKT 1026
Y L +++ G R+FG ++AELPLV T + QG + L +EK+L + V+
Sbjct: 661 YVVGLEKAGELINVGTLRVFGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRR 720
Query: 1027 LVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
LVLP+ E +WT GF MT + + ++ ++ F+GT+M QK +
Sbjct: 721 LVLPAVPELMPMWTASLGFHAMTRSDVMEMAVEHAILSFKGTTMCQKTL 769
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/465 (28%), Positives = 210/465 (45%), Gaps = 78/465 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVR-GEKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
+ E G ++ +V Y R G K ++ G+ GI C CC+ ++ S+FE HAG S +
Sbjct: 523 WLIESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-Q 581
Query: 685 PFQHIYTSNGVSLHELSIK----------LSLER--PFSSKENDDLCGICMDGGDLLCCD 732
PFQ+IY +G SL + ++ L+L + NDD CGIC DGGDL+CCD
Sbjct: 582 PFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCD 641
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
CP +H +C+ + +PSG WHC C F A A+G +G F ++S
Sbjct: 642 GCPSTYHQNCLGMQVLPSGDWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS- 690
Query: 793 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 852
C CER YH CL D + +Q
Sbjct: 691 -----------------------------------CGMCERRYHQLCLNDEAHK-VQSFG 714
Query: 853 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 912
C C + LQK + E S +I + +S +++ +
Sbjct: 715 SASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSA 765
Query: 913 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 970
++++ + L+ ++I + F PI++ S +DLI ++Y G + +Y G Y AI
Sbjct: 766 QRIE----NNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAI 821
Query: 971 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1030
L ++SA R G +LAE+P + T + + QG + LF IE + L V+ LV+P
Sbjct: 822 LERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIP 881
Query: 1031 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ + WT FGF+ + + + + R+ ++F G MLQKP+
Sbjct: 882 AIPDFLHAWTGNFGFTPLDDSVRKEMRS-LNTLVFPGIDMLQKPL 925
Score = 44.3 bits (103), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 48/84 (57%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 521 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 570
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 571 FEIHAGSKSCQPFQNIYLESGASL 594
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 198/433 (45%), Gaps = 83/433 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS--------LERP 709
GI CTCC + S F HAG +S +P I+ +G SL + +++ E+P
Sbjct: 716 GIRCTCCQNLYGLSGFANHAGGSSNCRPSACIFLKDGRSLLDCMMEVMQDHRTREITEKP 775
Query: 710 FSS---KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 766
+ END++C +C GG+L+ CD CP A+H +C++L GIP G W C C
Sbjct: 776 HNDLFEGENDNICSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPSCR------- 828
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 826
C +C ++ G +
Sbjct: 829 ----------------------------------------CGICGQNKIEETEDGH--FL 846
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEEKLPE 882
C QCE +YHV CL++ +D + K W C +C+R+ LQ L+ + G + L
Sbjct: 847 TCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLGKPVLVGADNLTW 905
Query: 883 TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIES 942
T ++K + E+ + G + D + LS A+S+ H+ F+P+
Sbjct: 906 T---LVKYVNSETCG------------VGGAESDLVVENYSKLSVALSVMHECFEPLHNP 950
Query: 943 ASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1000
S D++ +++ RS + ++ G Y +L N+ ++S RIFG+++AE+PLV T
Sbjct: 951 FSSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRF 1010
Query: 1001 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1060
+ G + L +EK L L V+ LVLP+ WTN FGF MT E++++ DY
Sbjct: 1011 QYRRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFL-DY 1069
Query: 1061 PLMIFQGTSMLQK 1073
+ FQGT M QK
Sbjct: 1070 SFLDFQGTVMCQK 1082
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 194/443 (43%), Gaps = 86/443 (19%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG--VSLHEL--------- 700
G K GI+C CCN +S + F+ HAG+ R P +++ +G +L +L
Sbjct: 790 GLIKKEGIMCKCCNMVLSVTNFKNHAGFKQSR-PCLNVFMKSGKPFTLCQLQAWSAEYKT 848
Query: 701 --SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
S + + R ENDD CG+C DGG+L+CCD+CP FH C+S +P G+W+C C
Sbjct: 849 RKSRTIKVVRTADDDENDDSCGLCGDGGELICCDNCPSTFHQACLSTEELPEGSWYCPNC 908
Query: 759 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 818
C +C K
Sbjct: 909 T-----------------------------------------------CWICGELVNDKE 921
Query: 819 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG----KWLCCADCKRINLALQKLVD 874
C QCE +YH C K+ + + KG W C C+ + LQ V
Sbjct: 922 DINSSNAFKCSQCEHKYHDSCWKN------KTIGKGGASDTWFCGGSCQAVYFGLQSRVG 975
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++K HE+ +V +++ + L+ A++I +
Sbjct: 976 IINHIADGVCWTLLKCIHEDQ------------KVHSAQRLALKAECNSKLAVALTIMEE 1023
Query: 935 RFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F +++ + +D+IP ++Y RS + ++HG Y +L + V++S RI G +AE
Sbjct: 1024 CFQSMVDPRTGIDMIPHVLYNWRSEFARLNFHGFYTVVLEKDDVLLSVASIRIHGATVAE 1083
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+PL+AT ++ + QG + L IE++L V+ LV+ + + WT FGF+ M+ +E
Sbjct: 1084 MPLIATCSNYRRQGMCRRLMTAIEEMLISFKVEKLVVSAIPDLVETWTEGFGFTPMSNDE 1143
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPV 1075
+ N LM+F GT +L+KP+
Sbjct: 1144 KQSL-NKINLMVFPGTILLKKPL 1165
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 192/435 (44%), Gaps = 83/435 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK---------LSLER 708
GI C CC + + FE+HAG ++ +P +I +G SL + K ++ E
Sbjct: 550 GIECNCCQKIFTLTGFESHAG-STNHRPAANIILEDGRSLLDCQRKKKPRIKMQRVTREA 608
Query: 709 PFSSKEN------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
+ ++N D +C +C DGGDL+ CD CP FH +CV L IP G W C C
Sbjct: 609 KWKGRQNQHQGETDYICSVCHDGGDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGI 668
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTP-DTELGGCVLCRGRDFCKSRFG 821
E +YN VQ P D+ L
Sbjct: 669 CGENKFKYN-------------------------VQEPKDSRL----------------- 686
Query: 822 RRTVILCDQCEREYHVGCLKDHGMEDLQEL-PKGKWLCCADCKRINLALQKLVDRGEEKL 880
+ CDQCER+YH+GCL++ G+ L+ PK W C C+ I + LQ L+ +
Sbjct: 687 ----LSCDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVG 742
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
P+ + K + D V+A G + L AV + H+ F+P
Sbjct: 743 PDNLTWTLWKFMDSDSCD----------------VEAPTGKHSKLDLAVEVIHECFEPAT 786
Query: 941 ESASKLDLIPAMVYGR--SHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
E+ + D+ +++ R + ++ G Y +L N +++ R+FG ++AE+PLV T
Sbjct: 787 ETYTGRDIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGT 846
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ G + L +EK L L V+ L+LP+ W N FGFS +T+ E+ +Y
Sbjct: 847 RFLFRRLGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYL- 905
Query: 1059 DYPLMIFQGTSMLQK 1073
D+ + F GT QK
Sbjct: 906 DHTFLDFPGTIKCQK 920
Score = 46.6 bits (109), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 71/159 (44%), Gaps = 16/159 (10%)
Query: 340 NVVVERPLRRFTRSLLQQKVELAKGSLSKDGG----KRSDVTEVANDGVGGPVKQ--ETV 393
N + RP R T L++ + K ++ +R V E G +Q T
Sbjct: 439 NASLPRPRREKTIETLKKLRDYQKSHQEQNASPLKQRRGKVIETLEKPRDGQKRQSSRTA 498
Query: 394 MK---PRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
M+ PR K L +++ ++ + RGSK GPG RG + GI C C
Sbjct: 499 MQGVTPRSSKCKPRCALSWMIDNNLVSPGEKVSYRGSK--GPG-ELTRGRITREGIECNC 555
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRD 489
C+ ++ T FE HAGS+N RP I LE+G++L D
Sbjct: 556 --CQ--KIFTLTGFESHAGSTNHRPAANIILEDGRSLLD 590
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 130/465 (27%), Positives = 209/465 (44%), Gaps = 78/465 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVR-GEKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
+ E G ++ +V Y R G K ++ G+ GI C CC+ ++ S+FE HAG S +
Sbjct: 523 WLIESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-Q 581
Query: 685 PFQHIYTSNGVSLHELSIK----------LSLER--PFSSKENDDLCGICMDGGDLLCCD 732
PFQ+IY +G SL + ++ L+L + NDD CGIC DGGDL+CCD
Sbjct: 582 PFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCD 641
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
CP +H +C+ + +PSG WHC C F A A+G +G
Sbjct: 642 GCPSTYHQNCLGMQVLPSGDWHCPNCTCKFCDA--------AVASGGKDG---------- 683
Query: 793 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 852
+++ C CER YH CL D + +Q
Sbjct: 684 ----------------------------NSISLLSCGMCERRYHQLCLNDEAHK-VQSFG 714
Query: 853 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 912
C C + LQK + E S +I + +S +++ +
Sbjct: 715 SASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSA 765
Query: 913 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 970
++++ + L+ ++I + F PI++ S +DLI ++Y G + +Y G Y AI
Sbjct: 766 QRIE----NNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAI 821
Query: 971 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1030
L ++SA R G +LAE+P + T + + QG + LF IE + L V+ LV+P
Sbjct: 822 LERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIP 881
Query: 1031 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ + WT FGF+ + + + + R+ ++F G MLQKP+
Sbjct: 882 AIPDFLHAWTGNFGFTPLDDSVRKEMRS-LNTLVFPGIDMLQKPL 925
Score = 44.3 bits (103), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 48/84 (57%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 521 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 570
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 571 FEIHAGSKSCQPFQNIYLESGASL 594
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 706
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 195 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 252
Query: 707 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 253 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 312
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F +E + + A I VD
Sbjct: 313 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 337
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 338 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 388
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 389 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 435
Query: 935 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 436 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 495
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 496 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 555
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ + ++ +++F GT +LQKP+ K
Sbjct: 556 KQEVKS-TSMLVFPGTGLLQKPLLK 579
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 169 bits (428), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 81/132 (61%), Positives = 100/132 (75%), Gaps = 1/132 (0%)
Query: 952 MVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1011
MVYGR GQ++ GMYCAIL VN VVSA + R+FGQ++AELPLVATSN G+GYFQ+L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 1012 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSM 1070
F CIE+LL FL VK LVLP+A EA++IWT KFGF + ++ + YR M+ F+GTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 1071 LQKPVPKCRIVG 1082
LQK VP CR+VG
Sbjct: 121 LQKTVPSCRVVG 132
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 169 bits (428), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 132/449 (29%), Positives = 197/449 (43%), Gaps = 85/449 (18%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP---FQHIYTSNGVSLHELSIKL---- 704
G G GI C CC + F+ HA R F +I+ +G SL + +++
Sbjct: 349 GRISGSGIKCNCCGKVYTLCGFDYHASGKQGRTATSIFSNIFLEDGRSLLDCQMQIMHDH 408
Query: 705 -------SLERPFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
LER SSK END +C +C GG+L+ CD CP +FH C+ L +P G W
Sbjct: 409 TKNLGEEPLERWQSSKDQVENDHICSVCHYGGELILCDQCPSSFHKSCLGLMDVPDGDWF 468
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C C +I G C L R D
Sbjct: 469 CSSC----------------------------------CCKIC-------GQC-LKRDSD 486
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
G V+ C QCER+YHV CL + E L+ PK W C C++I L L +L+
Sbjct: 487 LSMEDDG---VLDCTQCERKYHVVCLGNKREECLEYFPKEHWFCSKRCQQIFLGLHELLG 543
Query: 875 R----GEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVS 930
+ G L T L I+ + SD ++A ++L+ A+
Sbjct: 544 KKIPVGLHNLTWTLLKSIQFNDQCEASD----------------IEALSENYSMLNIALD 587
Query: 931 IFHDRFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQ 988
+ H+ FDP+ E +K DL+ +++ RS + ++HG Y +L + +S R++G+
Sbjct: 588 MMHEFFDPVEEPHTKRDLLKDVIFSKRSELNRLNFHGFYTVLLQKDDEFISVATVRVYGE 647
Query: 989 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1048
++AE+PLV T + G L +EK L L V+ L+LP+ A W FGFS +
Sbjct: 648 KVAEIPLVGTRFQYRRLGMCCILMNVLEKKLRELGVQRLILPAVPSALNTWIGSFGFSKL 707
Query: 1049 TEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
TE ++ + DY + FQ T M K + K
Sbjct: 708 TELDRLQLL-DYTFLDFQDTIMCHKLLTK 735
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 169 bits (428), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 128/449 (28%), Positives = 203/449 (45%), Gaps = 81/449 (18%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS---LHELS 701
R + L G GI C+CC+ V+ +F AHAG KP+++I +G+ LH L
Sbjct: 946 RSKVLLEGLITRDGINCSCCSKVVTVLEFVAHAG-GQLSKPYRNILV-DGLDNDLLHCLI 1003
Query: 702 IKLSLER--------PFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS 750
I + P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS
Sbjct: 1004 IAWDKQSDSERQAFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPS 1063
Query: 751 GTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 810
W C C K + N A E +DP
Sbjct: 1064 DDWRCTNCSC-----KLCHEHLNHDAPDNAE-IDPLHS---------------------- 1095
Query: 811 RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK--GKWLCCADCKRINLA 868
C QCE++YH C + E L + G C C+ +
Sbjct: 1096 -----------------CSQCEKKYHPSCSPE--TEKLSSVSSQAGNHFCQQSCRLLFEE 1136
Query: 869 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
LQ L+ ++ PE + +I+ HE++ + +D D R + ++ A
Sbjct: 1137 LQNLLAVEKDLGPEYACRIIQCIHEDA-PETVLDLDGRVEC------------NSKIAVA 1183
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 986
+S+ + F PI++ + ++LI +VY G + D+ G Y IL +VSA RI
Sbjct: 1184 LSLMDECFLPIVDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIH 1243
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G +LAE+P + T + + QG + L IE +L L V+ L++P+ +E WT+KFGFS
Sbjct: 1244 GTKLAEMPFIGTRHMYRRQGMCRRLLDGIEMILSSLKVEKLIIPAINELVDTWTSKFGFS 1303
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ ++ + ++ +++F GT +LQK +
Sbjct: 1304 PLEVSDKQEVKS-INMLVFPGTGLLQKSL 1331
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/447 (27%), Positives = 205/447 (45%), Gaps = 79/447 (17%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 701
L G+ GI C+CC+ +S +F HAG + KP+++I +G+ + L +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 984
Query: 702 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
++ ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +P+ W
Sbjct: 985 MQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWC 1044
Query: 755 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 814
C C F E + +A ++
Sbjct: 1045 CSNCSCKFCHEHSSD---DAEDTADVDS-------------------------------- 1069
Query: 815 FCKSRFGRRTVILCDQCEREYHVGCLKD-HGMEDLQELPKGKWLCCADCKRINLALQKLV 873
++ C QCE + C D + G C C+ + LQ L+
Sbjct: 1070 ---------SLHTCSQCEEQCTEACSPDIDSIATNLSSQTGNLFCQQSCRLLFEELQNLL 1120
Query: 874 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRALLSKAVSIF 932
++ PE S V+++ HEE V VL K+V+ + + ++ A+S+
Sbjct: 1121 AVKKDLEPEYSCRVVQRIHEE----------VPEEVLALDKRVECN----SKIAVALSLM 1166
Query: 933 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
+ F PI++ + ++LI +VY G + D+ G Y IL +++A RI G +L
Sbjct: 1167 DECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRIHGTKL 1226
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AE+P + T N + QG + L IE +L LN++ L++P+ +E WT+KFGFS + +
Sbjct: 1227 AEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDD 1286
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1287 SEKQEVKS-VSMLVFPGTGLLQKPLLK 1312
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 136/477 (28%), Positives = 216/477 (45%), Gaps = 90/477 (18%)
Query: 628 VFEEGGLEDGAEVGYFVRGEK--FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
+ + G ++ +V Y R + L G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 603 LIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEIHAG-SKLRQP 661
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSK-------------ENDDLCGICMDGGDLLCCD 732
FQ+I+ +G+SL + + + R SK NDD CGIC DGGDL+CCD
Sbjct: 662 FQNIFLESGLSLLQCQ-RDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCD 720
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
CP FH C+ + P G WHC C +Y A
Sbjct: 721 GCPSTFHQSCLDILIPPPGDWHCPNCT--------CKYCGVA------------------ 754
Query: 793 CIRIVQTPDT---ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DL 848
I I Q +T E+ C+L CE+++H C +E D
Sbjct: 755 SIDICQGDNTSVSEISTCIL---------------------CEKKFHESC----NLEMDT 789
Query: 849 QELPKG--KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVR 906
G C C+ + +LQK + E S +I++ E+S DV
Sbjct: 790 PVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRRTSEDS--------DVS 841
Query: 907 WRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYH 964
R L ++++++ + L+ A+++ + F PI++ S ++LI ++Y G + +Y
Sbjct: 842 VRGL-SQRIESN----SKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYS 896
Query: 965 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1024
G Y AIL ++SA R G +LAE+P + T + + QG + LFC IE L V
Sbjct: 897 GFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKV 956
Query: 1025 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1081
+ L++P+ +E W FGFS + + + R +++F GT MLQK + + IV
Sbjct: 957 EKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRL-MNMLVFPGTDMLQKLLIQETIV 1012
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/456 (27%), Positives = 197/456 (43%), Gaps = 88/456 (19%)
Query: 635 EDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR----------K 684
+DG+ V Y G G GI+C CC+ +S S+F++HAG+ R +
Sbjct: 862 KDGSVVKY---------GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGR 912
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHID 741
PF + E + S R E NDD CGIC DGG+L+CCD+CP FH
Sbjct: 913 PFMLCQLQAWST--EYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHS 970
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
C+S+ +P G W+C N R G + + I + D
Sbjct: 971 CLSIQELPEGNWYC--------------LNCTCRICGDLVNFE----------EISSSSD 1006
Query: 802 TELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 861
+ C QCE++YH CLK ++ E W C
Sbjct: 1007 -----------------------ALKCFQCEQKYHGQCLKQRDIDSGVE--SHIWFCSGS 1041
Query: 862 CKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGT 921
C++I ALQ + + S +++ H + ++L ++
Sbjct: 1042 CQKIYAALQSQLGLTNQFANGFSWTLLRCIHYDQ------------KILSTARLAMMAEC 1089
Query: 922 RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG--RSHRGQDYHGMYCAILTVNQVVVS 979
+ L A++I + F +++ + +D+IP +VY S D+HG Y IL + V++
Sbjct: 1090 NSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLC 1149
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
R+ G ELAE+PL+AT + + QG + L IE++L VK LV+ + W
Sbjct: 1150 VASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETW 1209
Query: 1040 TNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
T FGF + E E+ + + + LM+F GT +L+K +
Sbjct: 1210 TEGFGF-VTVENEEKQSLHKFNLMVFPGTVLLKKAL 1244
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 167 bits (424), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 136/477 (28%), Positives = 216/477 (45%), Gaps = 90/477 (18%)
Query: 628 VFEEGGLEDGAEVGYFVRGEK--FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
+ + G ++ +V Y R + L G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 485 LIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEIHAG-SKLRQP 543
Query: 686 FQHIYTSNGVSLHELSIKLSLERPFSSK-------------ENDDLCGICMDGGDLLCCD 732
FQ+I+ +G+SL + + + R SK NDD CGIC DGGDL+CCD
Sbjct: 544 FQNIFLESGLSLLQCQ-RDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCD 602
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
CP FH C+ + P G WHC C +Y A
Sbjct: 603 GCPSTFHQSCLDILIPPPGDWHCPNCT--------CKYCGVA------------------ 636
Query: 793 CIRIVQTPDT---ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DL 848
I I Q +T E+ C+L CE+++H C +E D
Sbjct: 637 SIDICQGDNTSVSEISTCIL---------------------CEKKFHESC----NLEMDT 671
Query: 849 QELPKG--KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVR 906
G C C+ + +LQK + E S +I++ E+S DV
Sbjct: 672 PVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRRTSEDS--------DVS 723
Query: 907 WRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYH 964
R L ++++++ + L+ A+++ + F PI++ S ++LI ++Y G + +Y
Sbjct: 724 VRGL-SQRIESN----SKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYS 778
Query: 965 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1024
G Y AIL ++SA R G +LAE+P + T + + QG + LFC IE L V
Sbjct: 779 GFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKV 838
Query: 1025 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1081
+ L++P+ +E W FGFS + + + R +++F GT MLQK + + IV
Sbjct: 839 EKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRL-MNMLVFPGTDMLQKLLIQETIV 894
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 167 bits (424), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 197/446 (44%), Gaps = 77/446 (17%)
Query: 645 RGEKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK 703
RG K ++ G+ GI C CC+ ++ S+FE HAG + +PFQ+IY +G SL + ++
Sbjct: 525 RGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAG-SKACQPFQNIYLESGASLLQCQVR 583
Query: 704 ------------LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
L NDD CGIC DGGDL+CCD CP +H C+ + +PSG
Sbjct: 584 AWNMQKDATNVGLHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQTCLGMQVLPSG 643
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
WHC C F A A+G +G F ++S
Sbjct: 644 DWHCPNCTCKFCDA--------AVASGGKDG--NFLSLLS-------------------- 673
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
C CER YH CL D + +Q C C + LQK
Sbjct: 674 ----------------CSMCERRYHQLCLSDEA-QKVQSFGSASSFCGPKCLELFEKLQK 716
Query: 872 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 931
+ E S +I + +S ++ L ++++ + L+ ++I
Sbjct: 717 YLGVKNEIEGGYSWSLIHRVDTDSDINSQ---------LSAQRIE----NNSKLAVGLAI 763
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
+ F PI++ S ++LI ++Y G + +Y G Y AIL ++SA R G +
Sbjct: 764 MDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGTQ 823
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
LAE+P + T + + QG + LF IE + L V+ LV+P+ + WT FGF+ +
Sbjct: 824 LAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD 883
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ + + R+ ++F G MLQKP+
Sbjct: 884 DSVRKEMRS-LNTLVFPGIDMLQKPL 908
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 47/84 (55%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 504 LSWLIESGVVQLRQKVQYMKRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 553
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P + IYLE+G +L
Sbjct: 554 FEIHAGSKACQPFQNIYLESGASL 577
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 124/456 (27%), Positives = 196/456 (42%), Gaps = 88/456 (19%)
Query: 635 EDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR----------K 684
+DG+ V Y G G GI+C CC+ +S S+F++HAG+ R +
Sbjct: 862 KDGSVVKY---------GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGR 912
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHID 741
PF + E + S R E NDD CGIC DGG+L+CCD+CP FH
Sbjct: 913 PFMLCQLQAWST--EYKTRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHS 970
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
C+S+ +P G W+C N R G + + I + D
Sbjct: 971 CLSIQELPEGNWYC--------------LNCTCRICGDLVNFE----------EISSSSD 1006
Query: 802 TELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 861
+ C QCE++YH CLK + E W C
Sbjct: 1007 -----------------------ALKCFQCEQKYHGQCLKQRDINSGVE--SHIWFCSGS 1041
Query: 862 CKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGT 921
C++I ALQ + + S +++ H + ++L ++
Sbjct: 1042 CQKIYAALQSQLGLTNQFANGFSWTLLRCIHYDQ------------KILSTARLAMMAEC 1089
Query: 922 RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG--RSHRGQDYHGMYCAILTVNQVVVS 979
+ L A++I + F +++ + +D+IP +VY S D+HG Y IL + V++
Sbjct: 1090 NSRLVVALTIMEECFLSMVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLC 1149
Query: 980 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1039
R+ G ELAE+PL+AT + + QG + L IE++L VK LV+ + W
Sbjct: 1150 VASIRVHGSELAEMPLIATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETW 1209
Query: 1040 TNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
T FGF + E E+ + + + LM+F GT +L+K +
Sbjct: 1210 TEGFGF-VTVENEEKQSLHKFNLMVFPGTVLLKKAL 1244
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 167 bits (422), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 122/476 (25%), Positives = 205/476 (43%), Gaps = 80/476 (16%)
Query: 627 LVFEEGGLEDGAEVGYF--VRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
+ + G L+ +V Y R L G+ GI C CC+ ++ S+FE HAG + R+
Sbjct: 583 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAG-SKLRQ 641
Query: 685 PFQHIYTSNGVSLHELSIKLSLERP------------FSSKENDDLCGICMDGGDLLCCD 732
PFQ+I+ ++GVSL + I ++ + NDD CGIC DGGDL+CCD
Sbjct: 642 PFQNIFLNSGVSLLQCQIDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCD 701
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
CP FH C+ + P G WHC C F
Sbjct: 702 GCPSTFHQRCLDIRMFPLGDWHCPNCTCKF------------------------------ 731
Query: 793 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 852
C +++ +G C CE++YH C+ +
Sbjct: 732 CKAVIEDVTQTVGANT-------------------CKMCEKKYHKSCMPKANVTPADTTE 772
Query: 853 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 912
C CK ++ ++K V E S ++ + E + SD + L G
Sbjct: 773 PITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHR--ECTNSDLS---------LSG 821
Query: 913 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 970
+ ++ L+ A+++ + F PII+ S ++++ ++Y G + ++ G Y A+
Sbjct: 822 HPHIVENNSK--LALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNRLNFGGFYTAL 879
Query: 971 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1030
L +V++ R G LAE+P + T + + QG + LF +E L L VK L++P
Sbjct: 880 LERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESALQHLKVKLLIIP 939
Query: 1031 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVD 1086
+ ++ +W +KFGF + + + + R+ L+ F G +LQK + R +VD
Sbjct: 940 ATADFSHVWISKFGFRQVEDSLKKEMRS-MNLLTFPGIDVLQKELLAPRHTESAVD 994
Score = 40.0 bits (92), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 66/141 (46%), Gaps = 17/141 (12%)
Query: 353 SLLQQKVELAKGSLSK--DGGKRSD---VTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
S L + E + GS S DGGK S T + V G + P R + L
Sbjct: 525 SPLHHQTEKSTGSSSHRVDGGKSSKHGRSTLLVRRSVRGDNSESDGFVPSSEKRTVLAWL 584
Query: 408 KNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
++SG L+ VMY+ + + + L G + GI C C C ++++ + FE+
Sbjct: 585 ---IDSGTLQLSEKVMYMNQRRTR----AMLEGWITRDGIHCGC--C--SKILAVSKFEI 633
Query: 467 HAGSSNKRPPEYIYLENGKTL 487
HAGS ++P + I+L +G +L
Sbjct: 634 HAGSKLRQPFQNIFLNSGVSL 654
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 128/446 (28%), Positives = 202/446 (45%), Gaps = 77/446 (17%)
Query: 645 RGEKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK 703
RG K ++ G+ GI C CC+ ++ S+FE HAG S +PFQ+IY +G SL + ++
Sbjct: 4 RGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-QPFQNIYLESGASLLQCQVR 62
Query: 704 ----------LSLER--PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
L+L + NDD CGIC DGGDL+CCD CP +H +C+ + +PSG
Sbjct: 63 AWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSG 122
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
WHC C F A A+G +G F ++S
Sbjct: 123 DWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS-------------------- 152
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
C CER YH CL D + +Q C C + LQK
Sbjct: 153 ----------------CGMCERRYHQLCLNDEAHK-VQSFGSASSFCGPKCLELFEKLQK 195
Query: 872 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 931
+ E S +I + +S +++ + ++++ + L+ ++I
Sbjct: 196 YLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSAQRIE----NNSKLAVGLAI 242
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
+ F PI++ S +DLI ++Y G + +Y G Y AIL ++SA R G +
Sbjct: 243 MDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQ 302
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
LAE+P + T + + QG + LF IE + L V+ LV+P+ + WT FGF+ +
Sbjct: 303 LAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD 362
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ + + R+ ++F G MLQKP+
Sbjct: 363 DSVRKEMRS-LNTLVFPGIDMLQKPL 387
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 10/66 (15%)
Query: 422 MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYL 481
M RG+KV L G + GI C C ++++T + FE+HAGS + +P + IYL
Sbjct: 1 MRRRGAKVM------LEGWITREGIHCDC----CSKILTVSRFEIHAGSKSCQPFQNIYL 50
Query: 482 ENGKTL 487
E+G +L
Sbjct: 51 ESGASL 56
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 127/446 (28%), Positives = 202/446 (45%), Gaps = 77/446 (17%)
Query: 645 RGEKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK 703
RG K ++ G+ GI C CC+ ++ S+FE HAG + +PFQ+IY +G SL + ++
Sbjct: 24 RGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVR 82
Query: 704 ----------LSLER--PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
L+L + NDD CGIC DGGDL+CCD CP +H +C+ + +PSG
Sbjct: 83 AWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSG 142
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
WHC C F A A+G +G F ++S
Sbjct: 143 DWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS-------------------- 172
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
C CER YH CL D + +Q C C + LQK
Sbjct: 173 ----------------CGMCERRYHQLCLNDEAHK-VQSFGSASSFCGPKCLELFEKLQK 215
Query: 872 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 931
+ E S +I + +S +++ + ++++ + L+ ++I
Sbjct: 216 YLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSAQRIE----NNSKLAVGLAI 262
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
+ F PI++ S +DLI ++Y G + +Y G Y AIL ++SA R G +
Sbjct: 263 MDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQ 322
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
LAE+P + T + + QG + LF IE + L V+ LV+P+ + WT FGF+ +
Sbjct: 323 LAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD 382
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ + + R+ ++F G MLQKP+
Sbjct: 383 DSVRKEMRS-LNTLVFPGIDMLQKPL 407
Score = 43.5 bits (101), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C ++++T +
Sbjct: 3 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC----CSKILTVSR 52
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 53 FEIHAGSKSCQPFQNIYLESGASL 76
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 166 bits (419), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 193/432 (44%), Gaps = 79/432 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK------------LS 705
G++CTCCN VS S+F+ HAG+ ++ P +++ +G ++
Sbjct: 657 GVVCTCCNKTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 715
Query: 706 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
LE+ NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W+C C E
Sbjct: 716 LEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWICSE 775
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
V NA R +DF
Sbjct: 776 -LVSDNAE-------------------------------------RSQDF---------- 787
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 788 -KCSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSW 844
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
++K E+ V +++ + L+ A+SI + F +++ +
Sbjct: 845 SILKCFQEDG------------MVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 892
Query: 946 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+D+IP ++Y G + D+ G Y ++ + V++S R+ G +AE+PLVAT + +
Sbjct: 893 IDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYR 952
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 953 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 1011
Query: 1064 IFQGTSMLQKPV 1075
+F GT++L+K +
Sbjct: 1012 VFPGTTLLKKTL 1023
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 166 bits (419), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 128/479 (26%), Positives = 199/479 (41%), Gaps = 105/479 (21%)
Query: 634 LEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR--------- 683
L +GA V Y + + G GILC CCN S + F+ HAG R
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 684 ---------------------KPFQHIYTSNGVSLHELSIKLS--LERPFSSKENDDLCG 720
+P T + +L + +K S + ENDD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 721 ICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRI 780
+C DGG L+CCD CP FH+ C+ L +P G W C C + GR
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRCC--------------CASCGR- 177
Query: 781 EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL 840
DP Q TE ++ CDQCEREYH C+
Sbjct: 178 SLYDPTIQ-------------TE---------------------ILYCDQCEREYHSNCV 203
Query: 841 KDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNA 900
M + C C +I L+KLV R + S +++ +H + +N+
Sbjct: 204 PGSAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSEENS 260
Query: 901 VDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRG 960
K+++ L+ A+++ + F P+I+ S +D++ ++Y R RG
Sbjct: 261 -------------KLESVADLNTRLALALTVIQECFRPMIDPRSNIDMVSHILYNR--RG 305
Query: 961 QD----YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIE 1016
+D + G Y +L Q ++S R+ G AE+P + T + + QG + L I+
Sbjct: 306 EDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQ 365
Query: 1017 KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
++L L V+TLVLP+ +E WT+ FGF +T Q + ++ F G+S+LQKP+
Sbjct: 366 QVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLT-AAQGIQLMELNIVTFPGSSVLQKPL 423
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 166 bits (419), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 193/432 (44%), Gaps = 79/432 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK------------LS 705
G++CTCCN VS S+F+ HAG+ ++ P +++ +G ++
Sbjct: 613 GVVCTCCNKTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 671
Query: 706 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
LE+ NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W+C C E
Sbjct: 672 LEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWICSE 731
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
V NA R +DF
Sbjct: 732 -LVSDNAE-------------------------------------RSQDF---------- 743
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 744 -KCSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSW 800
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
++K E+ V +++ + L+ A+SI + F +++ +
Sbjct: 801 SILKCFQEDG------------MVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 848
Query: 946 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+D+IP ++Y G + D+ G Y ++ + V++S R+ G +AE+PLVAT + +
Sbjct: 849 IDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYR 908
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 909 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 967
Query: 1064 IFQGTSMLQKPV 1075
+F GT++L+K +
Sbjct: 968 VFPGTTLLKKTL 979
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/474 (27%), Positives = 201/474 (42%), Gaps = 82/474 (17%)
Query: 619 RKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGF--GILCTCCNSEVSPSQFEAH 676
R+ + + E L G V F RG + +V F GI+C CC + S FEAH
Sbjct: 379 RESYSIVSWLIENKVLVSGTHV--FCRGSENIVKRGSIFSDGIVCNCCRVNFTVSGFEAH 436
Query: 677 AGWASRRKPFQHIYTSNGVSLHELSIKLSLER-------PFSSKENDDLCGICMDGGDLL 729
AG +R +P I +G SL + + ++ S ND++C IC GGDL+
Sbjct: 437 AG-CTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEANSEANNDNVCSICGFGGDLV 495
Query: 730 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM 789
CD CP AFH+ C+ L +P G W C C
Sbjct: 496 LCDRCPSAFHLGCLGLDRVPDGDWFCPTC------------------------------- 524
Query: 790 VSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG---RRTVILCDQCEREYHVGCLKDHGM- 845
C +I P CK ++C QCE+++H GC+K
Sbjct: 525 ---CCKICYRPK--------------CKQECADGNENNFLVCVQCEQKFHFGCVKTTRFG 567
Query: 846 --EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDF 903
+ K W C C + L L+KL+ + + + ++K S D+ DF
Sbjct: 568 SSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNINWTLLK---NVSSDDDGGDF 624
Query: 904 DVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS--HRGQ 961
K++A A+ + ++ F+P I++ S +LI +V+ R H+
Sbjct: 625 TSNEFSQEKHKLNA----------ALGVLYEGFNPTIDALSGRELIKDLVFSRDSEHKRL 674
Query: 962 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1021
++ G Y IL V+S RIFGQ++AE+ VAT +G+G + L +E+ L
Sbjct: 675 NFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQHRGRGMCRLLMDELEEQLTR 734
Query: 1022 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
L V LVL S+ +A WT FGF+ MT E++ K D + F + M KP+
Sbjct: 735 LGVGRLVLHSSEDAINTWTKSFGFARMTSEDKCKL-IDNTFLEFHNSIMCLKPL 787
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 22/124 (17%)
Query: 371 GKRSDVTEVA-----NDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIR 425
G+R V +++ ++G G K+ V+K R +R+ YS + +E+ +L + ++ R
Sbjct: 348 GRRGKVLKMSIMKKNSEGFG---KRGKVLK-RGGIRESYSIVSWLIENKVLVSGTHVFCR 403
Query: 426 GSK--VKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLEN 483
GS+ VK RG + GI C C C+ N V+ FE HAG + RP I LE+
Sbjct: 404 GSENIVK-------RGSIFSDGIVCNC--CRVNFTVSG--FEAHAGCTRHRPSISILLED 452
Query: 484 GKTL 487
G++L
Sbjct: 453 GRSL 456
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 188/446 (42%), Gaps = 108/446 (24%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE-LSIKLSLERPFSSK--- 713
GI+C CC +S S F AHAG + H +S G+ L S L L +S++
Sbjct: 889 GIVCNCCKKNLSVSDFMAHAGRS-------HPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 714 --------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C G K
Sbjct: 1002 -----------------------------------------------CQVCGGPFSEKEV 1014
Query: 820 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 879
+ C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1015 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV------ 1067
Query: 880 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAV 929
G+DN +D D+ W +LR G+K+ + L L+ A+
Sbjct: 1068 ----------------GTDNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVAL 1111
Query: 930 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
++ + F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G
Sbjct: 1112 TLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHG 1171
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
+ AELP +ATS D + QG + L IEK+L NVK LVL + E + W + FGF
Sbjct: 1172 TKAAELPFIATSVDFRRQGMCRILMSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKP 1231
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+ + E+ + N LM+F GTS+L K
Sbjct: 1232 IEDAERKQLHN-VNLMLFPGTSLLTK 1256
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 194/455 (42%), Gaps = 96/455 (21%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER--------- 708
GI C CCN + + FE H S R HI +G SL + I + +
Sbjct: 829 GIKCCCCNKLYTINGFEIHVSGTSSRSA-AHILLEDGKSLLDCQILWNKKTRSFKNQAST 887
Query: 709 ----PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQK 764
+S END +C IC GG L+ CD CP +FH C+ L +P G W C C
Sbjct: 888 CGKGDYSKDENDYICSICHFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICG 947
Query: 765 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 824
+ + +AN IV P
Sbjct: 948 QNKLSEHAN----------------------IVDGP------------------------ 961
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETS 884
+ C QCE +YHV CL+ G + K W C CK+I LQKL+ + S
Sbjct: 962 FLTCYQCECKYHVQCLR--GTKKFGSCSKPHWFCNKHCKQIYWGLQKLLGK--------S 1011
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDAS----------DGTRALLSKAVSIFHD 934
+ V G DN + W +L+ D + ++ L+ A+ + H+
Sbjct: 1012 IPV--------GGDN-----LTWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHE 1058
Query: 935 RFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F+P+ E ++ D++ +++ R + ++ G Y +L N+ +++ R++G+++AE
Sbjct: 1059 CFEPVREQHTRRDIVEDVIFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAE 1118
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+PLV T + G L +E+ L L V+ LVLP+ WT FGFS MT+ E
Sbjct: 1119 VPLVGTRFQYRRLGMCHILMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSE 1178
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
++++ N Y + FQ T M QK + K +V S+ G
Sbjct: 1179 RSEFLN-YTFLNFQETVMCQKFLLKNTVVPSSLSG 1212
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 198/447 (44%), Gaps = 77/447 (17%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK- 703
R + L G+ GI C CC+ ++ S+FE HAG + R+PFQ+IY +G SL + I+
Sbjct: 5 RTKVMLEGWVTRDGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLESGASLLDCQIEA 63
Query: 704 LSLERPF-----------SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 752
+ + P + NDD CG+C DGGDL+CCD CP FH C+ + +P G
Sbjct: 64 WNRQEPVKRLGFQAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPPGD 123
Query: 753 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 812
WHC C F C +
Sbjct: 124 WHCPNCSCKF--------------------------------------------CGVASD 139
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 872
++F + ++ C C ++YH C+++ + C C+ + LQK
Sbjct: 140 KNFQRDDTTVSKLLTCSLCVKKYHKSCMQEINTLSIDTNNSVASFCGKKCRELFEQLQKY 199
Query: 873 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKVDASDGTRALLSKAVS 930
+ E S +I + +S + L+G ++V+ + + L+ ++S
Sbjct: 200 LGVKHELEAGFSWSLIHRTDADSDTS-----------LQGLPQRVECN----SKLAVSLS 244
Query: 931 IFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 988
+ + F PI++ S ++LI ++Y G + ++ G Y IL ++SA R G
Sbjct: 245 VMDECFLPIVDRRSGINLIQNVLYNCGSNFNRLNFGGFYALILERGDEIISAASIRFHGT 304
Query: 989 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1048
LAE+P + T + + QG + LF IE L L V+ L++P+ SE WT FGF+ +
Sbjct: 305 RLAEMPFIGTRHMYRRQGMCRRLFYAIESTLCSLKVEKLIIPAISELMHTWTEVFGFTTL 364
Query: 1049 TEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
E + + ++ +++F G MLQK +
Sbjct: 365 DESLKQELKS-MNMLVFPGIDMLQKQL 390
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRD 489
L G V GI C C ++++T + FE+HAGS ++P + IYLE+G +L D
Sbjct: 10 LEGWVTRDGIHCGC----CSKILTVSKFEIHAGSKLRQPFQNIYLESGASLLD 58
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 160 bits (405), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 122/446 (27%), Positives = 187/446 (41%), Gaps = 108/446 (24%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL-----HELSIKLSLERPFSS 712
G++C CC +S S F AHAG F H +S G+ L + L + F S
Sbjct: 913 GVVCNCCKKTLSVSGFMAHAG-------FSHPQSSLGLFLESGKSYTLCQVEAWSAEFMS 965
Query: 713 K-------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 966 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHPACLSAKELPEGSWYCHNCT 1025
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C G K
Sbjct: 1026 -----------------------------------------------CQICGGPVSEKEV 1038
Query: 820 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 879
+ C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1039 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV------ 1091
Query: 880 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAV 929
G++N +D ++ W +LR G+K+ + L L+ A+
Sbjct: 1092 ----------------GTENILDSELSWSILRCNNDGQKLHSVQKIACLAECNMKLAVAL 1135
Query: 930 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
++ + F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G
Sbjct: 1136 TLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHG 1195
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
+ AELP +ATS D + QG + L IEK+L NVK LVL + E + W + FGF
Sbjct: 1196 TKAAELPFIATSVDYRRQGMCRILMNIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKP 1255
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+ + E+ + N LM+F GTS+L K
Sbjct: 1256 IEDAERKQLHN-VNLMLFPGTSLLTK 1280
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 159 bits (402), Expect = 7e-36, Method: Composition-based stats.
Identities = 73/122 (59%), Positives = 93/122 (76%)
Query: 962 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1021
++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +DCQGQGYFQ+L+ CIE+LL F
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 1022 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1081
L V +LVLP+A A+++W NKF F M +EE N D+ +M FQGTSMLQKPVP+ R +
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPVPEYRRI 136
Query: 1082 GK 1083
+
Sbjct: 137 SQ 138
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/433 (25%), Positives = 186/433 (42%), Gaps = 79/433 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKLSLERPFS- 711
GI+CTCC+ ++ S+F+ HAG+ R P +I+ +G L S + R +
Sbjct: 729 GIICTCCDKVLTLSEFKFHAGFTVNR-PCLNIFMESGEPFTLCLLQAWSAEYKARRSQNQ 787
Query: 712 ------SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ +NDD CG+C +GG+L+CCD+CP FH+ C+S IP G W+C C
Sbjct: 788 AVHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------ 841
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
C +C K
Sbjct: 842 -----------------------------------------CRICGNLVIDKDTSDAHDS 860
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELP-KGKWLCCADCKRINLALQKLVDRGEEKLPETS 884
+ C QCE +YH CL+D D QE+ W C C+ + LQ V + S
Sbjct: 861 LQCSQCEHKYHEKCLEDR---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGIS 917
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 944
+++ H++ +A +W L+ L+ A++I + F + + +
Sbjct: 918 WTLLRCIHDDQKVHSA-----QWFALKAV-------CNTKLAVALTIMEECFVSMFDPRT 965
Query: 945 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
+ +IP ++Y G ++ G Y +L V++S R+ G +AE+PL+AT +
Sbjct: 966 GIHMIPQVLYNWGSEFARLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQY 1025
Query: 1003 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1062
+ QG + L IE++L V+ LV+ + + WT FGF + + E+ + N L
Sbjct: 1026 RRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIERQRL-NKINL 1084
Query: 1063 MIFQGTSMLQKPV 1075
M+F GT +L K +
Sbjct: 1085 MVFPGTVLLVKSL 1097
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 187/444 (42%), Gaps = 81/444 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 708
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 709 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 764
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK 699
Query: 765 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 824
++ ++ + + L C L
Sbjct: 700 DEAAKHETSTLPS--------------------------LSSCRL--------------- 718
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET- 883
CE +YH C+ G + C C+ + LQ + + LPE
Sbjct: 719 ------CEEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGF 769
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
S +++ S D D+ ++ A ++ A S+ + F P+++
Sbjct: 770 SWSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHR 816
Query: 944 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1001
S ++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 817 SGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYM 876
Query: 1002 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1061
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + + E+ +N
Sbjct: 877 YRRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKN-LN 935
Query: 1062 LMIFQGTSMLQKPVPKCRIVGKSV 1085
L++F G ML K + K +I V
Sbjct: 936 LLVFPGVDMLGKSLVKEKITDSVV 959
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/434 (27%), Positives = 185/434 (42%), Gaps = 90/434 (20%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKE--- 714
GI C CCN + ++FEAHAG S +PF++IY G SL + + + S+ +
Sbjct: 800 GIQCDCCNKTFTSAEFEAHAGGKSC-QPFENIYLETGSSLLQCQLDSWYKEDDSAHKGFH 858
Query: 715 ---------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
NDD CGIC DGGDL+CCDSCP FH C+ + PSG WHC YC+ F
Sbjct: 859 FIDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCLEIRKFPSGLWHCMYCLCKF--- 915
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
G + G + C R D + +
Sbjct: 916 -----------CGMVGG--------NTCQR-----DGNMAAV--------------SHAL 937
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C CE +YH C ++ + + P C +C+ + LQ L +E S
Sbjct: 938 VTCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQELYERLQMLFGVKQELEAGFSW 995
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
+++ + SD +V + W+V KV + A+ I + F P+++ S
Sbjct: 996 TFVRRF--DVSSDISVS-GMSWKVDCNSKV----------AVALQIMDECFVPMVDHKSG 1042
Query: 946 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
++LI +VY G + +Y G + A+L +++A R F +P+
Sbjct: 1043 VNLIRNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAASIRYF----YSMPV-------- 1090
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
S + L LNV LV+P+ SE WT+ FGF + ++ RN +M
Sbjct: 1091 ------SFHSSLSMGLCSLNVGKLVIPAISELTGTWTSVFGFKHLEGSDKQIMRN-MNMM 1143
Query: 1064 IFQGTSMLQKPVPK 1077
+F G MLQKP+ K
Sbjct: 1144 VFPGVDMLQKPLLK 1157
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 110/432 (25%), Positives = 183/432 (42%), Gaps = 77/432 (17%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH---------ELSIKLSLER 708
GI+C CC ++ S+F+ HAG+ R P +I+ +G E + S +
Sbjct: 729 GIICICCGKVLTLSEFKFHAGFTLNR-PCLNIFMESGEPFTLCLLQAWSTEYKARKSQNQ 787
Query: 709 PFSSKEND---DLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ END D CG+C +GG+L+CCD+CP FH+ C+S IP G W+C C
Sbjct: 788 AVHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------ 841
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
C +C K
Sbjct: 842 -----------------------------------------CRICGNLVIDKDTLDAHDS 860
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C QCE +YH CL+D ++ L W C C+ + LQ V + S
Sbjct: 861 LQCSQCEHKYHEKCLEDRDKQEGAIL--DTWFCGQSCQEVYSGLQSQVGLVNQVADGISW 918
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
+++ H++ +A +W L+ L+ A++I + F + + +
Sbjct: 919 TLLRCIHDDQKVHSA-----QWFALKAV-------CNTKLAVALTIMEECFVSMFDPRTG 966
Query: 946 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+ LIP ++Y G ++ G Y +L + V++S R+ G +AE+PL+AT + +
Sbjct: 967 IHLIPQVLYNWGSEFARLNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYR 1026
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L V+ LV+ + + WT FGF + + E+ + N LM
Sbjct: 1027 RQGMCRLLVTAIEQVLISFKVEKLVISAIPDLVETWTKGFGFIPVDDIERQRL-NKINLM 1085
Query: 1064 IFQGTSMLQKPV 1075
+F GT +L K +
Sbjct: 1086 VFPGTVLLVKSL 1097
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 156 bits (394), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 124/437 (28%), Positives = 187/437 (42%), Gaps = 82/437 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKE--- 714
GI C CC + S FEAHAG + +P ++I+ NG SL ++ + S ++
Sbjct: 18 GIQCDCCGETFAISDFEAHAG-SKSCQPLKNIFLENGPSLLHCQLESWHRQDESDRKGFH 76
Query: 715 ---------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
NDD CGIC DGG+L+CCDSCP FH C+ + +PSG W+C YC F
Sbjct: 77 FVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYCSCKF--- 133
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
A G +D R +
Sbjct: 134 -------CGMAGGDACQMDE-------------------------------NDAAARPAL 155
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C CE +YH C+ + + + C C+ ++ LQ L+ E +
Sbjct: 156 LTCCLCEEKYHHSCIP--AEDTINDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAW 213
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRAL-----LSKAVSIFHDRFDPII 940
V+++ FDV G + S R + ++ A+ I + F P+
Sbjct: 214 TVVRR------------FDV------GSDITLSGMHRKVECNSKVAVALHIMDECFLPMP 255
Query: 941 ESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
+ S ++LI +VY G + +Y G AIL V+SA RI G +LAE+P + T
Sbjct: 256 DHRSGVNLIRNIVYNFGSNFNRLNYCGFLTAILERGDEVISAASIRIHGNQLAEMPFIGT 315
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ + QG + L IE L LNV+ LV+P+ SE + WT+ FGF + + K R
Sbjct: 316 RHMYRRQGMCRRLLGAIETALCSLNVEKLVIPAISELRETWTSVFGFKQLEGLSKQKMRY 375
Query: 1059 DYPLMIFQGTSMLQKPV 1075
++ F G MLQKP+
Sbjct: 376 -MKMVAFPGVDMLQKPL 391
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 184/443 (41%), Gaps = 102/443 (23%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---- 713
GILC CC +S S F+AHAG R ++ +G S ++ S +
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLG-LFLQSGKSYTLCQVEAWSAELMSRRSDAY 1025
Query: 714 --------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C
Sbjct: 1026 GRKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYC---------- 1075
Query: 766 KFVEYNANARAAGRI---EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 822
+N R+ G + V F+ ++ +C+
Sbjct: 1076 ----HNCTCRSCGNPVNEKEVSSFSDIL-KCL---------------------------- 1102
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
QC YH C+ D M W C CK I + L
Sbjct: 1103 -------QCGDAYHNTCI-DRVMLPSDGKRSDTWFCGRYCKEIFMGL------------- 1141
Query: 883 TSLDVIKKKHEESGSDNAVDFDVRWRVLR----------GKKVDASDGTRALLSKAVSIF 932
H + G +N ++ D+ W +LR +K+ L+ A+++
Sbjct: 1142 ---------HSQVGVENVINNDLSWTILRCNSDGQRLHSAQKIGLMTECNTKLAVALTLL 1192
Query: 933 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
+ F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G +
Sbjct: 1193 EECFIRMVDPRTGVDMIPHVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIRLHGTKA 1252
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AELP +ATS D + QG + L IEK+L +V+ LVL + E W + FGF + +
Sbjct: 1253 AELPFIATSVDYRRQGMCRRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFGFKPIED 1312
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQK 1073
+E+ + RN LM+F G S+L K
Sbjct: 1313 DEKKQLRN-VNLMLFPGASLLTK 1334
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/438 (27%), Positives = 184/438 (42%), Gaps = 95/438 (21%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSLER---- 708
GI C CC + + S+F+ HAG + P +++ G S L SI+ + +
Sbjct: 914 GIQCQCCGTTFTMSKFKCHAG-LRQEVPSLNLFLDTGKSYSLCQLQAWSIEQKVRKEHAK 972
Query: 709 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ +NDD CG C DGG+L+CCD+CP ++H C+S IP G W+C C+
Sbjct: 973 DTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCL------ 1026
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
C +C K
Sbjct: 1027 -----------------------------------------CDICGEVIDSKELVTSLPA 1045
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C QCER+YHV C+ E G W C C I + + V
Sbjct: 1046 LDCSQCERQYHVKCVS--AKVPCNEDGSGTWFCGRKCHEIYMTFRSRV------------ 1091
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRG---KKVDASDGTRAL------LSKAVSIFHDRF 936
G + +D D+ + VLR KKV ++ + L A SI + F
Sbjct: 1092 ----------GVPDHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECF 1141
Query: 937 DPIIESASKLDLIPAMVYG-RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPL 995
PI++ + +D+IP+++Y RS +Y G Y +L + +VS R+ G LAE+PL
Sbjct: 1142 LPILDPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPL 1201
Query: 996 VATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNK 1055
VATS + + QG + L IE++L L V+ L+L + WT+ FGF + E ++ K
Sbjct: 1202 VATSKENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDK-K 1260
Query: 1056 YRNDYPLMIFQGTSMLQK 1073
+ L GT +L+K
Sbjct: 1261 RLSKVRLAAVPGTVLLKK 1278
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 140/535 (26%), Positives = 225/535 (42%), Gaps = 126/535 (23%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 573 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMI 632
Query: 630 EEGGLEDGAEVGYF-VRGEKF-LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ G + + A+V Y +G + L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 633 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 691
Query: 688 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 735
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 692 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 751
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH+ C+ + +PSG WHC C
Sbjct: 752 STFHLACLGIK-MPSGDWHCSSC------------------------------------- 773
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRT----VILCDQCEREYHVGCLKDHGMEDLQEL 851
+CR FC S T ++ C QC R+YH C + ++
Sbjct: 774 -------------ICR---FCGSTQEITTSSAELLSCLQCSRKYHQVCAPGTMKDSVKAE 817
Query: 852 PKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
C C++I L+KL+ G NA++ W +
Sbjct: 818 SNSSTDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSL 855
Query: 910 LR--GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSH 958
+R K+ A +A L + A S+ + F P I+ S +++I ++Y G
Sbjct: 856 VRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDF 915
Query: 959 RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1018
++ Y IL V+SA RI G +LAE+P + T + QG L IE
Sbjct: 916 NRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESA 975
Query: 1019 LGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
L LNV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 976 LSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 1029
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 655 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 698
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 140/535 (26%), Positives = 225/535 (42%), Gaps = 126/535 (23%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 572 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMI 631
Query: 630 EEGGLEDGAEVGYF-VRGEKF-LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ G + + A+V Y +G + L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 632 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 690
Query: 688 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 735
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 691 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 750
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH+ C+ + +PSG WHC C
Sbjct: 751 STFHLACLGIK-MPSGDWHCSSC------------------------------------- 772
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRT----VILCDQCEREYHVGCLKDHGMEDLQEL 851
+CR FC S T ++ C QC R+YH C + ++
Sbjct: 773 -------------ICR---FCGSTQEITTSSAELLSCLQCSRKYHQVCAPGTMKDSVKAE 816
Query: 852 PKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 909
C C++I L+KL+ G NA++ W +
Sbjct: 817 SNSSTDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSL 854
Query: 910 LR--GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSH 958
+R K+ A +A L + A S+ + F P I+ S +++I ++Y G
Sbjct: 855 VRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDF 914
Query: 959 RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1018
++ Y IL V+SA RI G +LAE+P + T + QG L IE
Sbjct: 915 NRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESA 974
Query: 1019 LGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
L LNV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 975 LSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 1028
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 654 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 697
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 139/531 (26%), Positives = 224/531 (42%), Gaps = 118/531 (22%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 150 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYVPYRWKRTVLSWMI 209
Query: 630 EEGGLEDGAEVGYFVR--GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 687
+ G + + A+V Y + L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 210 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 268
Query: 688 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 735
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 269 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 328
Query: 736 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 795
FH+ C+ + +PSG WHCR C+ R G +
Sbjct: 329 STFHLACLGIK-MPSGDWHCRSCI--------------CRFCGSTQE------------- 360
Query: 796 IVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
+ T EL C+ QC R+YH C + ++
Sbjct: 361 -ITTSSAELLSCL---------------------QCSRKYHQVCAPGTMKDSVKAESNSS 398
Query: 856 W--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-- 911
C C++I L+KL+ G NA++ W ++R
Sbjct: 399 TDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSLVRCF 436
Query: 912 GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 962
K+ A +A L + A S+ + F P I+ S +++I ++Y G +
Sbjct: 437 PDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLN 496
Query: 963 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1022
+ Y IL V+SA RI G +LAE+P + T + QG L IE L L
Sbjct: 497 FSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSL 556
Query: 1023 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
NV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 557 NVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 606
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 232 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 275
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 115/432 (26%), Positives = 194/432 (44%), Gaps = 79/432 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKLSLERP-FS 711
G++CTCCN VS S+F+ HAG+ ++ P +++ +G L S + R +
Sbjct: 645 GVVCTCCNRTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 703
Query: 712 SKE------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
S+E NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W
Sbjct: 704 SEEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSW------------ 751
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
Y ++ E V + R +DF
Sbjct: 752 ----YCSSCSCQICSELVSDNGE----------------------RSQDF---------- 775
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 776 -KCSQCAHKYHGICLQ--GISKRRKLFPETYFCGKNCEKVYTGLSSRVGVINPNADGLSW 832
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
++K E+ +V +++ + L+ A+SI + F +++ +
Sbjct: 833 SILKCFQEDG------------KVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 880
Query: 946 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+D+IP ++Y G + D+ G Y +L + V++S R+ G +AE+PLVAT + +
Sbjct: 881 IDMIPHVLYNWGSNFARLDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVATCSKYR 940
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 941 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 999
Query: 1064 IFQGTSMLQKPV 1075
+F GT +L K +
Sbjct: 1000 VFPGTILLMKTL 1011
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 153 bits (386), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 118/460 (25%), Positives = 197/460 (42%), Gaps = 96/460 (20%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL------- 704
G+ GILC+CC+ +S S F+AHA + R ++ +G S I+
Sbjct: 580 GWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSL-NLCLQSGKSFTLCQIEAWNAEYMD 638
Query: 705 ----SLERPFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ R + + NDD CG C DGG+LLCCD+CP +H C+S+ +P +W+C C+
Sbjct: 639 RRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHNCI 698
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
RI P TE K
Sbjct: 699 ----------------------------------CRICGCPVTE-------------KEI 711
Query: 820 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 879
+I C QC +H C+ + G +E+ +W C CK I L L V
Sbjct: 712 SSFSAIIKCLQCGAAHHDTCV-EMGATAFEEMDSDEWFCGTHCKEIYLGLHGCV------ 764
Query: 880 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG----------KKVDASDGTRALLSKAV 929
G ++++ + W +LR +K+ + + L+ A+
Sbjct: 765 ----------------GVESSLGDGLSWTILRCNSGGQKMHSVQKIAHAIECNSKLAVAL 808
Query: 930 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
++ + F ++++ + +++IP ++Y G + +Y G Y IL + ++ A R+ G
Sbjct: 809 TLMEECFAQMVDTRTGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEEILCAASIRVHG 868
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
+ AELP +AT + + +G + L IE++L +VK LVL + E + W + FGF
Sbjct: 869 MKAAELPFIATCREHRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELVSTWVSGFGFKP 928
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1087
+ EE + K + LM+F GTS+L K + + KS G
Sbjct: 929 I-EEYERKQLDTINLMLFPGTSLLIKSLEDGTLTEKSGAG 967
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 154/289 (53%), Gaps = 34/289 (11%)
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC-------LKDHGMED 847
R+VQT C CR D + T+ C+QCER HV C +K +E
Sbjct: 68 RVVQTE-----SCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 848 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRW 907
L+E ++LCC +C+ + L+ V++GEE + +++ + ++ W
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG-VEKGEE------IAFLRQ----------IRSNICW 165
Query: 908 RVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGM 966
R+L K DAS + +S+A+ IF D F ++ S D+ MVYG++ G+ D+ GM
Sbjct: 166 RLL--SKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGM 221
Query: 967 YCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKT 1026
YC +LT + VVSA I ++ ++ AEL L+AT ++C+ +GYF+ L IE L NV
Sbjct: 222 YCVVLTASTHVVSAAILKVRVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSL 281
Query: 1027 LVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
L+ P E IW+ K GF++++ EE+ +PL++FQ ++QKP+
Sbjct: 282 LMAPVDPEMAQIWSEKLGFTILSAEEKKSMLESHPLVMFQDLVLVQKPL 330
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 185/443 (41%), Gaps = 106/443 (23%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---- 713
GILC CC S S F+ H G S K ++ +G S ++ S K
Sbjct: 956 GILCNCCTKTFSISDFKVHGG-CSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDAS 1014
Query: 714 --------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C
Sbjct: 1015 GRKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC---------- 1064
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
+N R+ G +P ++ K +
Sbjct: 1065 ----HNCTCRSCG-----NPLSE----------------------------KEVSTFSAI 1087
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKL 880
+ C QC YH C+ D + LP G W C CK I + L
Sbjct: 1088 LKCLQCGDSYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL----------- 1130
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVS 930
H G +N +D ++ W +L+ G+K+ +S + L+ A++
Sbjct: 1131 -----------HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALT 1179
Query: 931 IFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 990
I + F +++ + +D+IP ++ + DY G Y IL ++ R+ G +
Sbjct: 1180 ILEECFVRMVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKA 1237
Query: 991 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1050
AELP +ATS D + QG + L IE +L +V+TLVL + E W + FGF + +
Sbjct: 1238 AELPFIATSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIED 1297
Query: 1051 EEQNKYRNDYPLMIFQGTSMLQK 1073
E+ + RN LM+F GTS+L K
Sbjct: 1298 NEKKQLRN-VNLMLFPGTSLLTK 1319
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 190/445 (42%), Gaps = 86/445 (19%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 706
GI C+CC + S FE HA S R +I+ +G SL E ++ L
Sbjct: 635 GIKCSCCRRIFTISGFEVHANGGSCRAA-ANIFLDDGRSLLECQVEAYETRKKAQPPDIL 693
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 766
+ END +C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 694 KMKLRQGENDVICSVCHYGGKLILCDGCPSAFHANCLGLEEVPDGDWFCESCC------- 746
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK--SRFGRR- 823
G C G+ F K S++ +
Sbjct: 747 -------------------------------------CGAC----GQFFLKATSKYAKEE 765
Query: 824 TVILCDQCEREYHVGCLKDHGMED-LQELPKGKWLCCADCKRINLALQKLVDR----GEE 878
I C QCE +YH CL+ G D L KW C DC+ I + L +L+ + G E
Sbjct: 766 KFISCKQCELKYHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELIGKPREVGVE 825
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 938
KL + +++ D+A K++A LS A+ + H+ F+P
Sbjct: 826 KL---TWRLVQSFEPNMYGDDAY------------KIEAVAENHCKLSVALDVMHELFEP 870
Query: 939 IIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+ DL +++ R + + ++ G Y +L N+ +V+ RI G+++AE+P +
Sbjct: 871 VKRPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFI 930
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
T + +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ ++
Sbjct: 931 GTRFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKEF 990
Query: 1057 RNDYPLMIFQGTSMLQKPVPKCRIV 1081
+ L+ F T + QK + K +V
Sbjct: 991 LK-FTLLEFGRTILCQKILIKSSVV 1014
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1179
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 124/443 (27%), Positives = 186/443 (41%), Gaps = 69/443 (15%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 708
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 709 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W YC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAW---YC------- 689
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
YN + + + E + L C LC + C +
Sbjct: 690 ----YNCSCKFCEKDEAAKHETSTLP-----------SLSSCRLCEEK--CSKHYPH--- 729
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET-S 884
L D H C+ G + C C+ + LQ + + LPE S
Sbjct: 730 TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGFS 780
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 944
+++ S D D+ ++ A ++ A S+ + F P+++ S
Sbjct: 781 WSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHRS 827
Query: 945 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 828 GVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMY 887
Query: 1003 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1062
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + + E+ +N L
Sbjct: 888 RRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKN-LNL 946
Query: 1063 MIFQGTSMLQKPVPKCRIVGKSV 1085
++F G ML K + K +I V
Sbjct: 947 LVFPGVDMLGKSLVKEKITDSVV 969
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/438 (25%), Positives = 188/438 (42%), Gaps = 81/438 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKL-------S 705
GI+C CCN +S ++F++HAG+ R P +++ +G L S +
Sbjct: 150 GIMCKCCNMVLSVTKFKSHAGFKLNR-PCSNLFMESGKPFTLCQLQAWSAEYKSRKSGTQ 208
Query: 706 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ R +NDD CG+C DGG+L+CCD+CP FH C+ +P G+W YC N
Sbjct: 209 VVRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPEGSW---YCPNC---- 261
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
C +C D + +V
Sbjct: 262 ----------------------------------------TCWICG--DLVNDKEASSSV 279
Query: 826 --ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
C QCE +YH C G + + L W C C+ + L V
Sbjct: 280 GAYKCLQCEHKYHGAC--QQGKQTHEGLVSDAWFCSGSCQEVYSGLHSRVGINNPIADGF 337
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
+++ HE+ +VL +++ + L+ A++I + F +++
Sbjct: 338 CWTLLRCIHEDQ------------KVLSAQRLALKAECNSKLAVALTIMEECFQSMVDPR 385
Query: 944 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1001
+ +D+IP +Y G ++ G Y +L + V+VSA R+ G +AE+PL+AT ++
Sbjct: 386 TGIDMIPHALYNWGSDFARLNFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSN 445
Query: 1002 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1061
+ QG + L IE++L V+ LV+ + + WT FGF ++++E+ N
Sbjct: 446 YRRQGMCRHLMTAIEEMLISYKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSL-NKIN 504
Query: 1062 LMIFQGTSMLQKPVPKCR 1079
M+F GT +L+K + K +
Sbjct: 505 FMVFPGTILLKKQLYKTK 522
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 123/441 (27%), Positives = 187/441 (42%), Gaps = 73/441 (16%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLERP---- 709
GI C CC+ S FE HAG R +PF+ +Y G SL H+ K S +
Sbjct: 678 GIRCNCCDEVFSVLDFEVHAG-GKRNQPFKSLYLEGGNSLLQCLHDFMNKQSESQHKGYH 736
Query: 710 ----FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 764
S NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 737 FVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEK 796
Query: 765 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 824
+ ++ +A + L C LC + C + +
Sbjct: 797 VEAAIHDTSALHS--------------------------LSSCRLCEEK--CSNHYPH-- 826
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET- 883
L D H C+ G + C C+ + LQ L+ + LPE
Sbjct: 827 -TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLLIGV-KHPLPEGF 876
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
S +++ S D D+ ++ K ++ A S+ + F P+++
Sbjct: 877 SWSFLRRFELPS---EVADCDISEKIAYNAK----------MAVAFSVMDECFSPLVDHR 923
Query: 944 SKLDLIPAMVYG---RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1000
S ++L+ +VY HR ++ A+L +++ RI G +LAE+P + T
Sbjct: 924 SGVNLLQNIVYNFWSNFHR-LNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRY 982
Query: 1001 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1060
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + E E+ +N
Sbjct: 983 MYRRQGMCRRLMDGIESALGSLKVAKLVIPAVPELIDTWTSGFGFTPVNESEKKTIKN-L 1041
Query: 1061 PLMIFQGTSMLQKPVPKCRIV 1081
L++F G ML K + K +I
Sbjct: 1042 NLLVFPGVDMLGKSLVKEQIT 1062
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 190/444 (42%), Gaps = 96/444 (21%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 708
GI C CC + + S+F+ HAG + P +++ G S L SI K+ ER
Sbjct: 903 GIRCRCCGTTFTMSKFKCHAG-LRQEVPSLNLFLGTGKSYSLCLLQAWSIEQKVRKERVK 961
Query: 709 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+ +NDD CG C DGG+L+CCD+CP ++H C+ IP G W+C C+
Sbjct: 962 DTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLPCQDIPDGNWYCSSCL------ 1015
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
C +C K
Sbjct: 1016 -----------------------------------------CNICGEVITSKELRTSLPA 1034
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C QCER+YHV C+ E G W C C++I + + V
Sbjct: 1035 LECSQCERQYHVKCVS--AKVSCNEDGPGTWFCGRKCQQIYMIFRSRV------------ 1080
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLR--GKKVDASDGTRALLSK-------AVSIFHDRF 936
G + VD D+ +LR G K + G AL+++ A+SI + F
Sbjct: 1081 ----------GVPDHVDNDLSCTILRNNGDKKVRTAGEIALMAECNMKLMIALSIMEECF 1130
Query: 937 DPIIESASKLDLIPAMVYG-RSHRGQDYH-GMYCAILTVNQVVVSAGIFRIFGQELAELP 994
PI++ + +D+IP+++Y RS H G Y +L + +VS R+ G +AE+P
Sbjct: 1131 LPILDPRTGIDIIPSILYNWRSDFIHFNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMP 1190
Query: 995 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1054
LVATS + + QG + L IE++L L V+ L+L + WT+ FGF + + ++
Sbjct: 1191 LVATSTENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLVETWTSTFGFREIDDSDK- 1249
Query: 1055 KYRNDYPLMIFQGTSMLQKPVPKC 1078
K + L GT +L+K + +C
Sbjct: 1250 KRLSMVRLAAVPGTVLLKKNLCEC 1273
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 149 bits (376), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 141/540 (26%), Positives = 228/540 (42%), Gaps = 107/540 (19%)
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQ--TSSHSKMKSSSVKSHGKITRK 620
+S E EL +SP E D N P+P A TS ++ + S ++
Sbjct: 168 ASSFEKPKDKYELVKSPVLPSE-QDVWNEKPQPRKAAKGTSRRNQRTALSYLVDRELISP 226
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWA 680
R+H V +G L V + RG G+ I C CC++ S+FEAH G +
Sbjct: 227 GDRVHCNVTRDGRL-----VTW--RGSITNEGF-----IKCDCCSNLFPISKFEAHTG-S 273
Query: 681 SRRKPFQHIYTSNGVSLHELSIKL------------SLERPFSSKENDD----------- 717
++ +P +I+ +G SL + +L + E+ + +N D
Sbjct: 274 TKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHNDNADSDTHGLDKNDC 333
Query: 718 LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAA 777
+C +C GG+L+ CD CP AFH C+ + GIPSG W+C C + +++
Sbjct: 334 ICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCCCKICGQVTYDFD------ 387
Query: 778 GRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHV 837
+ V F RC+ QCE+ H+
Sbjct: 388 ---DQVSSFDTSFVRCV-----------------------------------QCEQNVHI 409
Query: 838 GCLKD-HGMEDL-QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 895
GC+K +ED Q + + W C C+ I++ LQ L+ +++P + E
Sbjct: 410 GCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLL---WKQIP------VGDARE-- 458
Query: 896 GSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG 955
++ W +++ S+ R L+KA+ + H F P+ + +K DLI +
Sbjct: 459 --------NLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLS 510
Query: 956 RSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFC 1013
+ + ++ G Y AIL VV+ R++G E+AE+PLVAT + G + L
Sbjct: 511 KRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLN 570
Query: 1014 CIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+E L + VK L LP+ EA WT FGF+ MT+ ++ Y + FQ T QK
Sbjct: 571 ELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK-YTFLGFQHTVRCQK 629
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 127/496 (25%), Positives = 197/496 (39%), Gaps = 119/496 (23%)
Query: 650 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL---SIKLSL 706
L G G GI C CCN V+ S F AHAG + + P ++IYT SL + S
Sbjct: 840 LEGKITGDGIHCGCCNEIVTISDFGAHAG-SKQSDPLKNIYTEEETSLLQCLLDSWNKQD 898
Query: 707 ERPFSS---------KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 757
E S NDD CG+C DGGDL+CCD CP FH C+ + PSG WHC Y
Sbjct: 899 ESELKSFHFFDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHKSCLDIKKFPSGDWHCAY 958
Query: 758 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 817
C F R+V GG +
Sbjct: 959 CCCKF-------------------------------CRLV-------GG---SSNQSVVN 977
Query: 818 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 877
F ++ C CE ++H+ C++ +G + + K C C+ ++ L+ L+
Sbjct: 978 DEFTMPALLTCHLCEEKFHISCVEANGGK--TDDSKDALFCGNKCQELSERLEMLLGVKH 1035
Query: 878 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 937
E S I++ SD D + L V+ + + L+ A+SI ++ F
Sbjct: 1036 EIEDGFSWSFIRR------SDVGCDLSLTNPQL----VECN----SKLAVALSIMNECFM 1081
Query: 938 PIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPL 995
P I+ S +L+ +++Y G + + DY G IL ++ R+ G LAE+P
Sbjct: 1082 PYIDHRSGTNLLRSILYNCGSNFKRLDYSGFITVILERGDEIICVASIRVHGNRLAEMPY 1141
Query: 996 VATSNDCQGQGYFQSLFCCIE--------------------------------------- 1016
+ T + QG + L IE
Sbjct: 1142 IGTRYMYRRQGMCRRLLNAIESEAVYGLSVNLGACPMVVTVRVCNYSMNGSRGFVFDVLR 1201
Query: 1017 -------KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTS 1069
K L L+V+ LV+P+ SE + WT+ FGF + + + + N+ L++F
Sbjct: 1202 GNDWGGPKALSSLDVELLVIPAISELRETWTSVFGFEPLKQTSK-QITNNMNLLVFPHVD 1260
Query: 1070 MLQKPVPKCRIVGKSV 1085
+LQK + K I +++
Sbjct: 1261 LLQKKISKHAIANENL 1276
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 149 bits (375), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 145/540 (26%), Positives = 230/540 (42%), Gaps = 107/540 (19%)
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQ--TSSHSKMKSSSVKSHGKITRK 620
+S E EL +SP E D N P+P A TS ++ + S ++
Sbjct: 191 ASSFEKPKDKYELVKSPVLPSE-QDVRNEKPQPRKAAKGTSRRNQRTALSYLVDRELISP 249
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWA 680
R+H V +G L V + RG G+ I C CC++ S+FEAH G +
Sbjct: 250 GDRVHCNVTRDGRL-----VTW--RGSITNEGF-----IKCDCCSNLFPISKFEAHTG-S 296
Query: 681 SRRKPFQHIYTSNGVSLHELSIKL------------SLERPFSSKENDD----------- 717
++ +P +I+ +G SL + +L + E+ + +N D
Sbjct: 297 TKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHNDNADSDTHGLDKNDC 356
Query: 718 LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAA 777
+C +C GG+L+ CD CP AFH C+ + GIPSG W+C C +
Sbjct: 357 ICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCC--------------CKIC 402
Query: 778 GRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHV 837
G++ F VS + DT CV QCE+ H+
Sbjct: 403 GQV--TYDFDDQVS-------SFDTSFVRCV---------------------QCEQNVHI 432
Query: 838 GCLKD-HGMEDL-QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 895
GC+K +ED Q + + W C C+ I++ LQ L+ +++P + E
Sbjct: 433 GCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLL---WKQIP------VGDARE-- 481
Query: 896 GSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG 955
++ W +++ S+ R L+KA+ + H F P+ + +K DLI +
Sbjct: 482 --------NLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFLS 533
Query: 956 RSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFC 1013
+ + ++ G Y AIL VV+ R++G E+AE+PLVAT + G + L
Sbjct: 534 KRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLLN 593
Query: 1014 CIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+E L + VK L LP+ EA WT FGF+ MT+ ++ Y + FQ T QK
Sbjct: 594 ELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK-YTFLGFQHTVRCQK 652
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 123/479 (25%), Positives = 199/479 (41%), Gaps = 88/479 (18%)
Query: 624 MHKLVFEEGGLEDGAEVGYFV--RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWAS 681
++ + + G + + +V Y R + L G GI C CC+ ++ +FE HAG +
Sbjct: 629 VYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITREGIFCGCCSKILTVGKFELHAG-SK 687
Query: 682 RRKPFQHIYTSNG-VSLHELSI------KLSLERPF-----SSKENDDLCGICMDGGDLL 729
+KP+ +I+ G VSL + + L + F E+DD C IC DGGDL+
Sbjct: 688 EKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGFYKIDKGEDEHDDTCAICGDGGDLV 747
Query: 730 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM 789
CCD C FH+DC+ + +PSG W+CR C+ F
Sbjct: 748 CCDHCASTFHLDCLGIK-LPSGDWYCRSCLCRF--------------------------- 779
Query: 790 VSRCIRIVQTPDTE---LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME 846
C + P + L C+ C R+YH C G +
Sbjct: 780 ---CGFPQEKPSSSPELLLSCLQC---------------------SRKYHQTCSSGTGTD 815
Query: 847 DLQELPKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFD 904
+P C C++I L KL+ IK E S + V
Sbjct: 816 FDCTIPGTSIDCFCSPGCRKIYKRLNKLLG-------------IKNHMEAGFSWSLVHCF 862
Query: 905 VRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 962
+ + K + + ++ A ++ + F P I+ S +++I + Y G D
Sbjct: 863 PNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHIDERSGINMIHNVAYNCGSDFSRLD 922
Query: 963 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1022
+ G Y IL V+SA RI G +LAE+P + T + QG + L IE L L
Sbjct: 923 FSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGMYRHQGMLRRLLNGIESALCSL 982
Query: 1023 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1081
NV+ LV+ + +E + WT FGF + ++ + ++ L+I GT +L+K + + + V
Sbjct: 983 NVQKLVVSAVTEMENTWTTVFGFKPVQPSKKQRIKS-LNLLIMNGTGLLEKRLLQTQTV 1040
Score = 43.1 bits (100), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 12/91 (13%)
Query: 396 PRKVMRKFYSKLKNFLESGIL-EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCK 454
P + R YS + ++ G++ E M V Y+ ++ + L G + GI C C C
Sbjct: 622 PYEWKRTVYSWM---IDLGVVSEDMQVKYMNNNRTR----EMLAGKITREGIFCGC--C- 671
Query: 455 GNQVVTPAVFELHAGSSNKRPPEYIYLENGK 485
++++T FELHAGS K+P I+LE GK
Sbjct: 672 -SKILTVGKFELHAGSKEKKPYANIFLEGGK 701
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 706
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 892 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 950
Query: 707 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 951 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 1010
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C D K R
Sbjct: 1011 -----------------------------------------------CRICG--DLVKDR 1021
Query: 820 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 877
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 1022 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 1078
Query: 878 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 937
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 1079 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1126
Query: 938 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 992
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1127 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1183
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1184 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1243
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPV 1075
+ + LM+F GT +L+K +
Sbjct: 1244 KASLKK-INLMVFPGTILLKKSL 1265
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 706
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 956 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 1014
Query: 707 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 1015 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 1074
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C D K R
Sbjct: 1075 -----------------------------------------------CRICG--DLVKDR 1085
Query: 820 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 877
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 1086 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 1142
Query: 878 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 937
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 1143 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1190
Query: 938 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 992
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1191 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1247
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1248 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1307
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPV 1075
+ + LM+F GT +L+K +
Sbjct: 1308 KASLKK-INLMVFPGTILLKKSL 1329
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 652 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 706
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 795 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 853
Query: 707 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 854 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 913
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C D K R
Sbjct: 914 -----------------------------------------------CRICG--DLVKDR 924
Query: 820 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 877
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 925 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 981
Query: 878 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 937
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 982 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1029
Query: 938 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 992
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1030 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1086
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1052
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1087 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1146
Query: 1053 QNKYRNDYPLMIFQGTSMLQKPV 1075
+ + LM+F GT +L+K +
Sbjct: 1147 KASLKK-INLMVFPGTILLKKSL 1168
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 117/435 (26%), Positives = 189/435 (43%), Gaps = 77/435 (17%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 708
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 975 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 1033
Query: 709 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
P + ENDD CG+C DGG+L+CCD+CP ++H DC+ IP G+W+C
Sbjct: 1034 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY--------- 1084
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
RC+ C +C K
Sbjct: 1085 --------------------------RCL------------CDICGEVINLKELRSSLPA 1106
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C QCER+YH C+ + + + P W C C++I + L+ V + S
Sbjct: 1107 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1165
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
V++ ++ RV + L A+SI + F PII++ +
Sbjct: 1166 TVLRNNGDQ-------------RVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTG 1212
Query: 946 LDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+D+IP ++Y RS DY G Y +L + ++S R+ G +AE+PL+AT + +
Sbjct: 1213 IDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1272
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L L V+ L+L + WT FGF + + ++ K + L+
Sbjct: 1273 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDR-KNLSRLRLV 1331
Query: 1064 IFQGTSMLQKPVPKC 1078
GT +L++ + +C
Sbjct: 1332 SVPGTVLLKRNLYEC 1346
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 116/439 (26%), Positives = 186/439 (42%), Gaps = 82/439 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 706
GI C+CC + + FE HA AS +I+ +G SL E ++ L
Sbjct: 635 GIKCSCCRRVFTINGFEVHANGASCSGA-ANIFLDDGRSLLECQVEAYKKRKKAQPPDML 693
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH-----CRYCMNT 761
+ END C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 694 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 753
Query: 762 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 821
F K + NA+ + +F
Sbjct: 754 FLKTT----STNAK-----------------------------------------EEKF- 767
Query: 822 RRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
I C QCE +YH CL+ D + L ++ KW C DC+ I + L L+ + E
Sbjct: 768 ----ISCKQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE-- 821
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
S++ + + +S N D K++A+ +LS A+ + H+ F+P+
Sbjct: 822 --VSVEKLTWRLVQSLEPNMYGDD-------ASKIEAAAENHCILSVALDVMHELFEPVK 872
Query: 941 ESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
DL +++ R + + ++ G Y +L N +VS RI G+++AE+P + T
Sbjct: 873 RPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 932
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ +
Sbjct: 933 RFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK 992
Query: 1059 DYPLMIFQGTSMLQKPVPK 1077
+ L+ F T + +K + K
Sbjct: 993 -FTLLEFGRTILCEKILIK 1010
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 146 bits (368), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 118/444 (26%), Positives = 189/444 (42%), Gaps = 90/444 (20%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSI--------------- 702
GI C CC S S F+AH + +P ++++ NG SL +
Sbjct: 68 GIKCNCCQKLFSLSGFQAHVTGNNICRPAENLFLGNGKSLVSCQVELMRKKIMRFNQEPV 127
Query: 703 ----------KLSLERPFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
K L P S+ ND +C IC GGDL+CCD CP +FH
Sbjct: 128 VRATGTGSRSKFRLLAPLGSENCNDYVCSICHYGGDLICCDRCPSSFHAT---------- 177
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
C+N IE V P C D++
Sbjct: 178 ------CLN-------------------IERV-PEGDWFCPCCCCGICGDSQF------- 204
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
D +F +++ C QCER++H C K+ GM +E W CC C+ + LQ+
Sbjct: 205 --DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQ 258
Query: 872 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 931
L+ G+ L +L K + +++ VD+D+ A + LS A+ +
Sbjct: 259 LL--GKPILVGHNLTCTLIKPMQYQAEDRVDYDL----------AAMAENYSKLSVALEV 306
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
H+ FDP+ + +K DL+ +++ G + ++ G Y +L N +++ + RI+G +
Sbjct: 307 MHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDK 366
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
+AE+PL+ T + G + L IEK L L V+ LVLP++ WT FGF+ MT
Sbjct: 367 VAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPMT 426
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQK 1073
E ++ + + + F T M +K
Sbjct: 427 ESDRLDFLG-FTFLDFHDTIMCKK 449
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 146 bits (368), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 117/435 (26%), Positives = 189/435 (43%), Gaps = 77/435 (17%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 708
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 924 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 982
Query: 709 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
P + ENDD CG+C DGG+L+CCD+CP ++H DC+ IP G+W+C
Sbjct: 983 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY--------- 1033
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
RC+ C +C K
Sbjct: 1034 --------------------------RCL------------CDICGEVINLKELRSSLPA 1055
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 885
+ C QCER+YH C+ + + + P W C C++I + L+ V + S
Sbjct: 1056 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1114
Query: 886 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 945
V++ ++ RV + L A+SI + F PII++ +
Sbjct: 1115 TVLRNNGDQ-------------RVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTG 1161
Query: 946 LDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1003
+D+IP ++Y RS DY G Y +L + ++S R+ G +AE+PL+AT + +
Sbjct: 1162 IDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1221
Query: 1004 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1063
QG + L IE++L L V+ L+L + WT FGF + + ++ K + L+
Sbjct: 1222 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDR-KNLSRLRLV 1280
Query: 1064 IFQGTSMLQKPVPKC 1078
GT +L++ + +C
Sbjct: 1281 SVPGTVLLKRNLYEC 1295
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 164/377 (43%), Gaps = 93/377 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 834 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 888
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 889 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 938
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 939 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 996
+++ + +D+IP ++Y G + DY G Y IL ++ R+ G + AELP +
Sbjct: 168 MVDPRTGVDMIPHVLYNKGSNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFI 227
Query: 997 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1056
ATS D + QG + L IE +L +V+TLVL + E W + FGF + + E+ +
Sbjct: 228 ATSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQL 287
Query: 1057 RNDYPLMIFQGTSMLQK 1073
RN LM+F GTS+L K
Sbjct: 288 RN-VNLMLFPGTSLLTK 303
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 144 bits (363), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/448 (28%), Positives = 189/448 (42%), Gaps = 99/448 (22%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL--------HELSIKLSLERP 709
GI +CC SP FEAHAG +S + +I+ + SL H ++ K +
Sbjct: 279 GIKNSCCQEVFSPRGFEAHAG-SSFHQSDANIFLEDEGSLLEGQRQMVHRITGKSFTKES 337
Query: 710 FSSKE-------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
K+ NDD+C +C GGDL+ CD CP FH C+ L +P G W C C
Sbjct: 338 SHGKKSNGDQCNNDDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSCCCRI 397
Query: 763 QKE-KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 821
E +F EY+ D F +F
Sbjct: 398 CGENRFDEYSEE----------DNF--------------------------------KFS 415
Query: 822 RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLP 881
C QCE +YHVGCL+ L+ P G C C++I L+ + +P
Sbjct: 416 ------CHQCELQYHVGCLRKQRHVKLETYPDGTRFCSTQCEKI---FLGLLKLLGKPIP 466
Query: 882 ETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD----GTRAL------LSKAVSI 931
G DN + W +L+ + D +AL L+ A+++
Sbjct: 467 -------------VGVDN-----LTWTLLKPTISEWFDMDVPDNKALTEVYSKLNIALNV 508
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
H+ F+PI E + DL+ +++ G + ++ G Y +L N ++S R+ G++
Sbjct: 509 MHECFEPIKEPHTGRDLVEDVIFCRGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEK 568
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
+AE+PLV T + + G + L IEK L L V+ L LP+A W FGFS MT
Sbjct: 569 VAEVPLVGTRSQYRRLGMCRILINEIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMT 628
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
+ E+ + DY + FQ T M QK + K
Sbjct: 629 DSERLTFL-DYTFLDFQDTVMCQKLLMK 655
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 122/462 (26%), Positives = 196/462 (42%), Gaps = 113/462 (24%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----------------HEL- 700
I+C CC+ + + FE+HAG +R +P I +G SL H +
Sbjct: 37 AIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVV 95
Query: 701 --------SI-------KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
SI + + S +ND++C IC GGDL CD CP AFH+ C+ L
Sbjct: 96 NENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSICGFGGDLALCDRCPSAFHLGCLGL 155
Query: 746 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 805
+P G W C C C +I P
Sbjct: 156 NRVPIGEWFCPTC----------------------------------CCKICYRP----- 176
Query: 806 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
+ + CK +++C QCE++YH GC+K G+E + W C C +
Sbjct: 177 -----KCKQECKDH-KDNNILVCVQCEQKYHFGCVKAVGIE-FNHMEN--WFCSVVCGNM 227
Query: 866 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG-KKVDASDGTR-- 922
L L+KL+ G V ++ W +++ VD + +
Sbjct: 228 FLCLKKLL----------------------GKPIKVADNLTWTLVKNVSSVDDKEFNQKE 265
Query: 923 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS--HRGQDYHGMYCAILTVNQVVVSA 980
+ L+ A+ + ++ F+P ++ S +LI +V+ R H ++ G Y IL V+S
Sbjct: 266 SKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISV 325
Query: 981 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1040
RI+GQ++AE+ VAT + QG L IEK L L V+ L+L S+ +A IWT
Sbjct: 326 ATVRIYGQKVAEVVFVATKEQYRRQGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWT 385
Query: 1041 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSM----LQKPVPKC 1078
FGF+ M +++ ++ D+ + FQ ++M L+ P+ C
Sbjct: 386 KSFGFARMASKDKCQFI-DHTFLEFQNSTMCLKALKTPIWSC 426
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 115/444 (25%), Positives = 190/444 (42%), Gaps = 90/444 (20%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER--------- 708
GI C CC S + F+AH + +P ++++ NG SL ++L ++
Sbjct: 68 GIKCNCCQKLFSLTGFQAHVTGNNICRPAENLFLGNGKSLVSCQVELMRKKIMMFDQGPA 127
Query: 709 ----------------PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
P S+ ND +C IC GGDL+CCD CP +FH
Sbjct: 128 VRAAGTGSRSKFRSLAPLGSENCNDYVCSICHYGGDLICCDRCPSSFHAA---------- 177
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
C+N IE V P C D++
Sbjct: 178 ------CLN-------------------IESV-PEGDWFCPCCCCGICGDSQF------- 204
Query: 812 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
D +F +++ C QCER++H C K+ GM +E W CC C+ + LQ+
Sbjct: 205 --DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE----HWFCCKTCEMMQWGLQQ 258
Query: 872 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 931
L+ G+ L +L K + +++ D+D+ A + LS A+ +
Sbjct: 259 LL--GKPILVGQNLTCTLIKPMQYQAEDREDYDLA----------AMAENYSKLSVALEV 306
Query: 932 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 989
H+ FDP+ + +K DL+ +++ G + ++ G Y +L N +++ + RI+G +
Sbjct: 307 MHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVLLERNDELIAVALLRIYGDK 366
Query: 990 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1049
+AE+PL+ T + G + L IEK L L V+ LVLP++ WT FGF+ MT
Sbjct: 367 VAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLPASRSVLNTWTTSFGFTPMT 426
Query: 1050 EEEQNKYRNDYPLMIFQGTSMLQK 1073
E ++ + + + F T M +K
Sbjct: 427 ESDRLDFLG-FTFLDFHDTIMCKK 449
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/442 (26%), Positives = 187/442 (42%), Gaps = 92/442 (20%)
Query: 666 SEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKLSLE-------------- 707
+E + ++F H +S R+P+ ++ +G SL + ++L +
Sbjct: 332 AEYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKK 391
Query: 708 -RPFSSK---------ENDD-LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 756
P+ + E+DD +C +C D G+LL CD CP FH CV L P G W C
Sbjct: 392 YLPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCP 451
Query: 757 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFC 816
C G + DP A ++ F
Sbjct: 452 ACTCAI--------------CGSSDLDDPPATTTTQ---------------------GFS 476
Query: 817 KSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
R VI C+QC REYHVGC++ D+G+ + +G WLC C +I L L++L
Sbjct: 477 SDRM----VISCEQCRREYHVGCMRERDNGLWYPEADGEGPWLCSEACSKIYLRLEELA- 531
Query: 875 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 934
V++ S ++ + RG D + A L A+ + +
Sbjct: 532 ------------VVQAPCRSVASGLSLVV-----LRRGAARDGEEEEHAKLCMALDVLRE 574
Query: 935 RFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 992
F +IE ++ DL +V+ R D+ G Y L +++ R++G+E+AE
Sbjct: 575 CFVTLIEPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAE 634
Query: 993 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN-KFGFSMMTEE 1051
+PLV T + QG + L I+KLLG + V+ LVLP+ E A WT FGF M +
Sbjct: 635 VPLVGTRFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFREMGQA 694
Query: 1052 EQNKYRNDYPLMIFQGTSMLQK 1073
++ + + ++ FQGT M K
Sbjct: 695 DRQDVAH-HAILRFQGTIMCHK 715
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 185/442 (41%), Gaps = 92/442 (20%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKEND- 716
GI C CC + FE+HAG + +P ++I NG SL ++ LE E+D
Sbjct: 18 GIQCDCCGETFAILDFESHAG-SKSCQPLKNICLENGHSL----LQCQLESWNKQDESDR 72
Query: 717 ---------------DLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 761
D CGIC DGG+L+CCDSCP FH C+ + PSG W+C YC
Sbjct: 73 KGFHFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYCSCK 132
Query: 762 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 821
F A G +D DT
Sbjct: 133 F----------CGMAGGDTCQMDE--------------NDTA-----------------A 151
Query: 822 RRTVILCDQCEREYHVGC-LKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
+ ++ C CE +YH C L ++ + D C C+ + LQ L+ E
Sbjct: 152 QPALLACCLCEEKYHHSCILAENTVNDGYSSVS---FCGKKCQELYDKLQALLGVKHEME 208
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRAL-----LSKAVSIFHDR 935
+ ++++ FDV G + S R + ++ A+ I +
Sbjct: 209 EGFAWTLVRR------------FDV------GSDISLSGMHRKVECNSKVAVALHIMDEC 250
Query: 936 FDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 993
F P+ + S ++LI +VY G + +Y G AIL ++SA RI G LAE+
Sbjct: 251 FLPMPDHRSGVNLIRNIVYNFGSNFNRLNYSGFLTAILERGDEIISAASIRIHGNHLAEM 310
Query: 994 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1053
P + T + + QG + L IE L LNV+ LV+P+ SE + WT+ FGF + +
Sbjct: 311 PFIGTRHMYRRQGMCRRLLSAIETALCSLNVEKLVIPAISELRETWTSVFGFKPLEGSSK 370
Query: 1054 NKYRNDYPLMIFQGTSMLQKPV 1075
K RN ++ F G MLQKP+
Sbjct: 371 QKMRN-MKMVAFPGIDMLQKPL 391
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 93/375 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 834 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 888
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 889 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 938
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGRKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 939 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
+++ + +D+IP ++ + DY G Y IL ++ R+ G + AELP +AT
Sbjct: 168 MVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIAT 225
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
S D + QG + L IE +L +V+TLVL + E W + FGF + + E+ + RN
Sbjct: 226 SVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN 285
Query: 1059 DYPLMIFQGTSMLQK 1073
LM+F GTS+L K
Sbjct: 286 -VNLMLFPGTSLLTK 299
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 126/490 (25%), Positives = 209/490 (42%), Gaps = 103/490 (21%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEK--FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
+ + G + +V Y K L G G GI C CC+ + S FE HAG +
Sbjct: 463 WMIDSGTVLQNGKVHYMPHKSKSAVLDGEITGNGIHCGCCDKIFTISDFELHAG-SKLAD 521
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKE------------NDDLCGICMDGGDLLCCD 732
P ++IY G SL + + ++ S ++ NDD CG+C DGGDL+CCD
Sbjct: 522 PLKNIYVGEGTSLLQCLLDSWNKQDESERKGFHFVDVAGEDPNDDTCGVCGDGGDLICCD 581
Query: 733 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
C +TF +
Sbjct: 582 G------------------------CPSTFHQG--------------------------- 590
Query: 793 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT------------VILCDQCEREYHVGCL 840
C+ I + P + C+ C + FC S G ++ C CE +YH C+
Sbjct: 591 CLDIKKFPSGDWH-CIYCCCK-FCGSVSGSSNQRDDNDELIVSKLLTCQLCEEKYHRSCI 648
Query: 841 K--DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSD 898
+ D +D +++ + C C+ ++ L+ L+ E S I++
Sbjct: 649 EANDANTDDSRDV----FFCGNRCQELSERLEMLLGVKHEMEDGYSWTFIRR-------- 696
Query: 899 NAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH 958
+ V FD ++ + V+ + + L+ AVSI + F P I+ S ++LI +++Y R
Sbjct: 697 SDVGFDAS--QIKPQMVECN----SKLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGS 750
Query: 959 RGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIE 1016
+Y G AIL ++SA RI G +LAE+P + T + QG + L +E
Sbjct: 751 NFNRLNYSGFVTAILERGDEIISAASIRIRGNQLAEMPFIGTRYMYRRQGMCRRLLNAVE 810
Query: 1017 KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
LG LNV+ LV+P+ SE + WT+ FGF + E + ++ L++F MLQK +
Sbjct: 811 WGLGSLNVELLVIPAISELRETWTSVFGFESL-ESTSKQILHNKNLLVFPHVDMLQKKIS 869
Query: 1077 KCRIVGKSVD 1086
K ++ G++++
Sbjct: 870 KHKLAGQNLN 879
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 125/495 (25%), Positives = 208/495 (42%), Gaps = 97/495 (19%)
Query: 603 HSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVR-GEKFLVGYKKGFGILC 661
H +SSS ++K + L+ + +E+ A++ Y GE+ L G G GI C
Sbjct: 136 HYSTRSSSKVDSESPSQKRTILSWLI-DSRVVENNAKIVYKNEAGEQILQGVLTGDGIWC 194
Query: 662 TCCNSEVSPSQFEAHAGWASRRKPFQHIYTS-NGVSL-------------HELS-IKLSL 706
+CCN+ ++ S+F+ HAG R P+Q I+ S G+SL EL L
Sbjct: 195 SCCNTVITVSEFQLHAGDEPNR-PYQRIFISETGLSLLTCQAEAWNQQGIPELQGYHLIE 253
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 766
R S + DD C +C DGG+L+CCD CP +HI C+ + P G W C C
Sbjct: 254 PREDVSDKYDDACVVCADGGNLICCDKCPSTYHISCLQMEDEPQGEWRCPAC-------- 305
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 826
C C F S V
Sbjct: 306 ---------------------------------------ACKFCHTHAFDIS------VF 320
Query: 827 LCDQCEREYHVGCLKDH-GM---EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
C QC+++YH C +++ GM ++ C + C +I L++LV E
Sbjct: 321 TCSQCDKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDEG 380
Query: 883 TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIES 942
+ ++++ E+G +D R L K+ + AV++ + F+P+I+
Sbjct: 381 LTWTLLRRMDPEAGVYLEESYD---RTLCNSKI----------AVAVAVMEECFEPVIDR 427
Query: 943 ASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1000
++++++ +++Y G + + G Y AIL +S RI G +LAE+P +AT
Sbjct: 428 HTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIAT-- 485
Query: 1001 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1060
+ ++ L C + L+ +V+ LV+PS + W +GF + + +
Sbjct: 486 ----RPSYRRLGMCHKLLVAIESVQYLVIPSIEQRVRRWEESYGFQAIENKVMGELIKVK 541
Query: 1061 PLMIFQGTSMLQKPV 1075
LM F LQKP+
Sbjct: 542 SLM-FHCAIRLQKPL 555
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 143/288 (49%), Gaps = 34/288 (11%)
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL------KDHGMEDL 848
R+VQT C C + ++ C+QCER HV C K +E L
Sbjct: 273 RVVQTES-----CYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEIL 327
Query: 849 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 908
++ +LCC +C+ + + L+ + +K EE + ++ WR
Sbjct: 328 KDYLCFHFLCCQECQSLRVRLEGM-----------------EKCEEIAFLGRIRSNICWR 370
Query: 909 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGMY 967
+L DAS + LS+ + IF D F +ES I MVYG++ G+ D+ GMY
Sbjct: 371 LL--SSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMY 425
Query: 968 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1027
C +LT + VVSA I ++ + AEL L+AT ++C+ +GYF+ L IE L NV L
Sbjct: 426 CVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLL 485
Query: 1028 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ P E IW++K GF++++ +E+ +PL++F+ ++QKP+
Sbjct: 486 MAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 533
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 77/107 (71%), Gaps = 2/107 (1%)
Query: 600 TSSHSKMKSSSVKSHGKITRK-DLRMHKLVFEEGGLEDGAEVGYFVR-GEKFLVGYKKGF 657
TS+ + SSSV S+ + ++K D +H L+F+E GL D + Y ++ GE L GYK+G
Sbjct: 118 TSAQRPLPSSSVGSNSRESKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGA 177
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 704
GI+C CCN EVSPS+FE HAG RR+P+Q+IYTS G++LH+++++L
Sbjct: 178 GIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYTSQGLTLHDVALQL 224
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 10/82 (12%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ + + Y K+K G + L+G +G+GI C C C NQ V+P+ FE HAG
Sbjct: 148 FKEAGLPDNTLLTY----KLKN-GEALLQGYKQGAGIVCNC--C--NQEVSPSEFEKHAG 198
Query: 470 SSNKRPP-EYIYLENGKTLRDI 490
+R P + IY G TL D+
Sbjct: 199 MGKRRQPYQNIYTSQGLTLHDV 220
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 93/375 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 834 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 888
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 889 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 938
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 939 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
+++ + +D+IP ++ + DY G Y IL ++ R+ G + AELP +AT
Sbjct: 168 MVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIAT 225
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
S D + QG + L IE +L +V+TLVL + E W + FGF + + E+ + RN
Sbjct: 226 SVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN 285
Query: 1059 DYPLMIFQGTSMLQK 1073
LM+F GTS+L K
Sbjct: 286 -VNLMLFPGTSLLTK 299
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 143/288 (49%), Gaps = 34/288 (11%)
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL------KDHGMEDL 848
R+VQT C C + ++ C+QCER HV C K +E L
Sbjct: 304 RVVQTES-----CYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEIL 358
Query: 849 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 908
++ +LCC +C+ + + L+ + +K EE + ++ WR
Sbjct: 359 KDYLCFHFLCCQECQSLRVRLEGM-----------------EKCEEIAFLGRIRSNICWR 401
Query: 909 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGMY 967
+L DAS + LS+ + IF D F +ES I MVYG++ G+ D+ GMY
Sbjct: 402 LL--SSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMY 456
Query: 968 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1027
C +LT + VVSA I ++ + AEL L+AT ++C+ +GYF+ L IE L NV L
Sbjct: 457 CVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLL 516
Query: 1028 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
+ P E IW++K GF++++ +E+ +PL++F+ ++QKP+
Sbjct: 517 MAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 564
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 77/107 (71%), Gaps = 2/107 (1%)
Query: 600 TSSHSKMKSSSVKSHGKITRK-DLRMHKLVFEEGGLEDGAEVGYFVR-GEKFLVGYKKGF 657
TS+ + SSSV S+ + ++K D +H L+F+E GL D + Y ++ GE L GYK+G
Sbjct: 149 TSAQRPLPSSSVGSNSRESKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGA 208
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 704
GI+C CCN EVSPS+FE HAG RR+P+Q+IYTS G++LH+++++L
Sbjct: 209 GIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYTSQGLTLHDVALQL 255
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 10/82 (12%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ + + Y K+K G + L+G +G+GI C C C NQ V+P+ FE HAG
Sbjct: 179 FKEAGLPDNTLLTY----KLKN-GEALLQGYKQGAGIVCNC--C--NQEVSPSEFEKHAG 229
Query: 470 SSNKRPP-EYIYLENGKTLRDI 490
+R P + IY G TL D+
Sbjct: 230 MGKRRQPYQNIYTSQGLTLHDV 251
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 139 bits (351), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 120/449 (26%), Positives = 185/449 (41%), Gaps = 99/449 (22%)
Query: 650 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-VSLHELSIKLSLER 708
LVG GI C CC+ ++ ++FE HAG + ++P+ +I+ +G +SL + + ++
Sbjct: 772 LVGRVTREGICCDCCSKILTVAKFELHAG-SKEQQPYANIFLEDGGLSLFQCLLNAWDKQ 830
Query: 709 PFSSKEN-----------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 757
+ K+ DD CGIC DGGDLLCCD C FH+ C+ + +PSG W CR
Sbjct: 831 AQNEKKGFYKIDPADDPDDDTCGICGDGGDLLCCDRCTSTFHVACLGIE-MPSGDWFCRN 889
Query: 758 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 817
C+ F G E +P EL C+ C
Sbjct: 890 CICKF--------------CGSAE-------------ERTSSP-AELLSCLQC------- 914
Query: 818 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKW--LCCADCKRINLALQKLVDR 875
R+YH C + E + P C C +I L++L+
Sbjct: 915 --------------SRKYHQVCAQGIEREFVSTTPSASIDCFCSPGCTKIYKRLKRLL-- 958
Query: 876 GEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR--GKKVDASDGTRALL-------S 926
G N ++ W ++R S +A L +
Sbjct: 959 --------------------GLKNDLEAGFSWSLVRCFADTEATSTKKKAQLVHCNSKTA 998
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFR 984
A S+ + F P I+ S +++I +VY G ++ G Y IL V+SA R
Sbjct: 999 LAFSVLDECFLPRIDERSGINIIHNVVYNCGSDFSRLNFSGFYTFILERGDEVISAATVR 1058
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1044
I G + AE+P + T + QG L IE L LNV+ LV+P+ E Q W+ FG
Sbjct: 1059 IHGTDFAEMPFIGTRGMYRHQGMCHRLLDAIESALCSLNVRRLVIPAIPELQNTWSTVFG 1118
Query: 1045 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
F + ++ K ++ L+I GT +L+K
Sbjct: 1119 FKPVGPTKKQKIKS-VNLLIIHGTGLLEK 1146
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 139 bits (350), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 119/455 (26%), Positives = 191/455 (41%), Gaps = 109/455 (23%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----------------HEL- 700
I+C CC+ + + FE+HAG +R +P I +G SL H +
Sbjct: 37 AIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVV 95
Query: 701 --------SI-------KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
SI + + S ND++C IC GGDL CD CP AFH+ C+ L
Sbjct: 96 NENQKKNHSIVKENRKKNHCVVKEKSEANNDNVCSICGFGGDLALCDRCPSAFHLGCLGL 155
Query: 746 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 805
+P G W C C C +I P
Sbjct: 156 NRVPIGEWFCPTC----------------------------------CCKICYRP----- 176
Query: 806 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
+ + CK +++C QCE++YH GC+K G+E + W C C +
Sbjct: 177 -----KCKQECKDH-KDNNILVCVQCEQKYHFGCVKAVGIE-FNHMEN--WFCSVVCGNM 227
Query: 866 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG-KKVDASDGTR-- 922
L L+KL+ G V ++ W +++ VD + +
Sbjct: 228 FLCLKKLL----------------------GKPIKVADNLTWTLVKNVSSVDDKEFNQKE 265
Query: 923 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS--HRGQDYHGMYCAILTVNQVVVSA 980
+ L+ A+ + ++ F+P ++ S +LI +V+ R H ++ G Y IL V+S
Sbjct: 266 SKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISV 325
Query: 981 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1040
RI+GQ++AE+ VAT + QG L IEK L L V+ L+L S+ +A WT
Sbjct: 326 ATVRIYGQKVAEVVFVATKEQYRRQGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWT 385
Query: 1041 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
FGF+ M +++ ++ D+ + FQ ++M K +
Sbjct: 386 RSFGFARMASKDKCQFI-DHTFLEFQNSTMCLKAL 419
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 136 bits (342), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 108/335 (32%), Positives = 157/335 (46%), Gaps = 59/335 (17%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVM-YIRGSKVKGPGVSGLRGVVKGSG 445
P+ + PR K + +G+LEG V RG + L G+ + G
Sbjct: 303 PIGNKLTKPPRNA--------KELMATGLLEGHYVHCSCRGEQ--------LTGIFQDMG 346
Query: 446 ISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV------CKDSPL 499
+ C C CKG QVV+ + FE H+GS++ P + IYLENGK LRDI++ C D+ L
Sbjct: 347 VVCNCRICKGTQVVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNIL 406
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE---I 556
L+ A+ + G + K C+ C + G E + SC K S A AE +
Sbjct: 407 RALQHAIGEIQG---ISKEMTCVKC---GKHEGGEFI-----SCKGAKCSAAYHAECVGV 455
Query: 557 KEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGK 616
K P LE+ E T R P+P S + +S GK
Sbjct: 456 KSP--------------HLEDWFCAKCEKTQA--RKPQPLLKVKRSPAGTDKEDARSKGK 499
Query: 617 ----ITR--KDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSP 670
I R +D +HK +F GGL DG E+GY+ R + L G K+G GI C+CCN E++
Sbjct: 500 EQTMIARSARDAHLHKALFLPGGLADGTELGYYARNQCILKGVKQGGGICCSCCNQEITC 559
Query: 671 SQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS 705
S FE HA +R+ P+ I ++G SL ++ +L+
Sbjct: 560 SAFERHARCEARQNPYGSILLADGRSLKDMCKELA 594
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 83/185 (44%), Gaps = 30/185 (16%)
Query: 601 SSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGIL 660
SS SK+K+ V K+T+ +L+ GL +G V RGE+ L G + G++
Sbjct: 293 SSASKLKAM-VPIGNKLTKPPRNAKELMAT--GLLEGHYVHCSCRGEQ-LTGIFQDMGVV 348
Query: 661 CTC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL------------SIKL 704
C C C VS S FEAH+G ++ P +IY NG +L ++ +I
Sbjct: 349 CNCRICKGTQVVSISAFEAHSG-STSHHPSDNIYLENGKNLRDILSAGQESADCGDNILR 407
Query: 705 SLERPFS-----SKENDDL-CGICMDGGDLLCCDS--CPRAFHIDCVSLPGIPSGTWHCR 756
+L+ SKE + CG +GG+ + C C A+H +CV + W C
Sbjct: 408 ALQHAIGEIQGISKEMTCVKCGK-HEGGEFISCKGAKCSAAYHAECVGVKSPHLEDWFCA 466
Query: 757 YCMNT 761
C T
Sbjct: 467 KCEKT 471
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 135 bits (341), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 125/533 (23%), Positives = 214/533 (40%), Gaps = 120/533 (22%)
Query: 569 QPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLV 628
+PP + +P+ +GE + TS P A+T S + HG I + +++ +
Sbjct: 62 RPPKRTRKSTPS-AGEAS-TSGSKPAEYVARTIM------SRLIDHGMIIENE-KIYYVA 112
Query: 629 FEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 688
EG + + + E + G + G+ C CCN ++ FE HAG + ++P++H
Sbjct: 113 DREGDSDARKQS----KKEVLMKGRARREGVRCECCNEVMTVWDFETHAG-SVLQRPYEH 167
Query: 689 IYTS---------------NGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDS 733
I+ + + V + + + R +S ++DD C IC DGGDL+CC+
Sbjct: 168 IHVARSNSSLLQCQFEVWQSNVEVERRTFNEIVPRNGASDKHDDACLICADGGDLICCEK 227
Query: 734 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 793
C H+ C+ L IP G W C YC+
Sbjct: 228 CWSTSHLKCMGLERIPQGDWICPYCV---------------------------------- 253
Query: 794 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
+ D +L CV QC+++YH CL + DL
Sbjct: 254 CKHCNKNDKDLQTCV---------------------QCDKKYHCQCLVSNKELDLN--AS 290
Query: 854 GKWLCC-ADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 912
G+ L C + C + LQ LV KHE G W +L+
Sbjct: 291 GETLACDSHCGEVYEKLQSLV---------------GVKHELEGG-------FCWTLLQR 328
Query: 913 KKVDASDG--------TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--D 962
+ D D + ++ A + + F II+ ++++++ ++ Y R +
Sbjct: 329 MEPDNLDFKDLHLITECNSKIALAWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRIN 388
Query: 963 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1022
+ G Y AIL N ++SA R+ G +LAE+P + T + + G + L +E + +
Sbjct: 389 FRGFYTAILEKNDDIISAATIRVHGTDLAEMPFIGTRHLYRQNGMSRMLLVTLESIFSVM 448
Query: 1023 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
V+ L++PS E +W K GFS + + K N + + F LQK +
Sbjct: 449 GVEHLIIPSVQELTEMWEGKCGFSPIEDAVSQKITN-WNTLTFPSAVRLQKAL 500
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 157/385 (40%), Gaps = 97/385 (25%)
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 767
RP + +DD CG+C DGG+LLCCDSCP FH C+++ +P G W C YC
Sbjct: 445 RPGAKDSSDDACGVCGDGGELLCCDSCPSTFHPACLAM-KVPQGWWACHYCR-------- 495
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
CVLC D + +
Sbjct: 496 ---------------------------------------CVLCMAND-------DQGLST 509
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWL---CCADCKRINLALQKLVDRGEEKLPETS 884
C C +YH C + L G+ + C CK+++ L +V
Sbjct: 510 CQHCSLKYHEVCRRP-------SLSNGRGIGAYCSETCKKVSARLSDMV----------- 551
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA--SDGTRAL------LSKAVSIFHDRF 936
G N + W +L+ +K +A S T A+ L+ A+ + ++ F
Sbjct: 552 -----------GVTNHTEDGFSWALLKIQKDEAVSSQDTAAVLECNVKLAVALGVLNECF 600
Query: 937 DPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELP 994
+P + +K+D++ VY G + Y G Y +L + ++A + RI G ++AE+P
Sbjct: 601 NPAKDRRTKIDMLHQAVYSLGSEFKRVSYEGFYTMVLDKDGETIAAALLRIHGTKVAEMP 660
Query: 995 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1054
AT + QG + L +E++L + V LV+P+ + WT F F + + E
Sbjct: 661 FAATLPAYRKQGMMRRLVNAVEQVLASVQVDKLVIPAIAALVDTWTRSFSFRPLLDPESR 720
Query: 1055 KYRNDYPLMIFQGTSMLQKPVPKCR 1079
+ L++ GT++L KPV R
Sbjct: 721 EEIRRRSLVVIAGTTLLHKPVAAAR 745
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 133 bits (334), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 145/565 (25%), Positives = 229/565 (40%), Gaps = 76/565 (13%)
Query: 20 GLKREFEFALKVQSEICGSLGRTRA-RKVQSNVDSGCVLGPPEVKKLKTYESRKKRKRQE 78
+KRE FAL+ SEI S GRTR+ R + S D +K+ K +
Sbjct: 26 AMKRELAFALQSLSEISASPGRTRSGRPISSLPDPSASAS---LKRRKRSDPPAPAPAPA 82
Query: 79 QSVVVKETEDKREEEVKSDVFDVINE---RERPIREKESKDDSENMGVGERGALMNVEEV 135
+V T E + D+I P D + + V + L +
Sbjct: 83 ADLVSPPTPPIDAEPPNQPLRDIIQPLHGSNPPTVSDHRSDSNADQEVVPQSMLEAPQPF 142
Query: 136 KVVSERREEGNDEFGKVVIGVEEEKKNEC---DEVLMNVEENKYGELDGMGGSARTEEEK 192
+ + + K + E+ E D + + + DG S
Sbjct: 143 HAETATEDSAMEVIAKPNLSAAEDSAAETATEDSAMEVIAKPNVAAEDGASKSN------ 196
Query: 193 NECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEE 252
V E+ + T + + S A D G E ++A +
Sbjct: 197 -------VATEDTAAVAPTLLQTTIAD---SASKPNVATEDTAGVPPTLLETILAAGDHH 246
Query: 253 KKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGK 312
D + +K + V + G VQ + ++++ V+N +L ++M+ +
Sbjct: 247 ----CDNTNSNGASLKQQAVTIAT----GNVQCDNSNSNGARLQQQAVDNVLLSNTMLAE 298
Query: 313 HS-----------------STLCNGESNVAKSVAV------DG-NDEGKTVNVVVERPLR 348
+ S L N + + +S A DG ND +++E+P R
Sbjct: 299 DAATPAAAADLKPARRFTRSLLKNNKPDKEESAACESQATPDGSNDASFDFPLLLEKPQR 358
Query: 349 RFTRSLLQQKVELAK-GSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKV--MRKFYS 405
RFTRSLL+ KVE + GS D + A+D PVK+ + +KV + K
Sbjct: 359 RFTRSLLKTKVESSLVGS--------DDALDSASDS--PPVKKMEMKMSKKVACLTKHPG 408
Query: 406 KLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
++ L +G+LEGM VMYI K + L+GV+ G I CFC C G++ V+ FE
Sbjct: 409 NIRELLNTGLLEGMPVMYIIPHSKK----AVLKGVITGCNIRCFCLSCNGSKAVSAYYFE 464
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCR 525
HAGS+ K P +YIYL NG +LRD++ SPLE LEK +R + + + CLNC
Sbjct: 465 QHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCLNCN 524
Query: 526 VSFSNAGVEELMLLCKSCVELKESQ 550
+ E +LC+ C+E K+ Q
Sbjct: 525 EPVLPSSQSE-NVLCQVCLESKQPQ 548
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
CV L +PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D
Sbjct: 637 CVGLRKVPS-EWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVPISD 695
Query: 802 TELGGCVLCR 811
+LGGC LC+
Sbjct: 696 -DLGGCALCK 704
Score = 40.0 bits (92), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 633 GLEDGAEVGYFVRGEK--FLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPF 686
GL +G V Y + K L G G I C C CN VS FE HAG ++++ P
Sbjct: 417 GLLEGMPVMYIIPHSKKAVLKGVITGCNIRCFCLSCNGSKAVSAYYFEQHAG-STKKHPA 475
Query: 687 QHIYTSNGVSLHEL 700
+IY NG SL ++
Sbjct: 476 DYIYLGNGNSLRDV 489
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 132 bits (333), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/453 (25%), Positives = 174/453 (38%), Gaps = 117/453 (25%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 706
GI C+CC + + FE HA AS +I+ +G SL E ++ L
Sbjct: 626 GIKCSCCRRVFTINGFEVHANGASCSGA-ANIFLDDGRSLLECQVEAYKKRKKAQPPDML 684
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH-----CRYCMNT 761
+ END C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 685 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 744
Query: 762 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 821
F K + NA+
Sbjct: 745 FLKTT----STNAK---------------------------------------------- 754
Query: 822 RRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 880
I C QCE +YH CL+ D + L ++ KW C DC
Sbjct: 755 EEKFISCKQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDC------------------ 796
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
EES N D K++A+ +LS A+ + H+ F+P+
Sbjct: 797 ------------EESLEPNMYGDD-------ASKIEAAAENHCILSVALDVMHELFEPVK 837
Query: 941 ESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
DL +++ R + + ++ G Y +L N +VS RI G+++AE+P + T
Sbjct: 838 RPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 897
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ +
Sbjct: 898 RFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK 957
Query: 1059 DYPLMIFQGTSMLQK---------PVPKCRIVG 1082
+ L+ F T + +K P+P +G
Sbjct: 958 -FTLLEFGRTILCEKILIKSGVADPIPSIASLG 989
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 150/364 (41%), Gaps = 79/364 (21%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 774
+DD CG+C DGG+LLCCDSCP FH C+++ +P G W C YC
Sbjct: 511 SDDACGVCADGGELLCCDSCPSTFHPACLAM-KVPEGLWACHYCR--------------- 554
Query: 775 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 834
CVLC D + + C C +
Sbjct: 555 --------------------------------CVLCMAND-------DQGLSRCQHCTLK 575
Query: 835 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK-KKHE 893
YH C L C CK+++ L ++ S ++K +K E
Sbjct: 576 YHEIC-----RPSLSNGRGNGAYCSETCKKVSAQLSDMIGITNHTEDGFSWALLKIQKDE 630
Query: 894 ESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 953
S N+ D VL L+ A+ + ++ F+P+ + +K+D++ V
Sbjct: 631 PVSSQNSPD------VLE---------CNVKLAVALGVLNECFNPVKDRRTKIDMLHQAV 675
Query: 954 Y--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1011
Y G + Y G Y +L N ++SA + RI G ++AE+P T + QG + L
Sbjct: 676 YSLGSEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRKQGMMRRL 735
Query: 1012 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSML 1071
+E++L + V+ LV+P+ + W F F + E + + R L++ GT++L
Sbjct: 736 VNAVEQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPELKEEIRRRS-LVVITGTTLL 794
Query: 1072 QKPV 1075
QKPV
Sbjct: 795 QKPV 798
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 145/288 (50%), Gaps = 27/288 (9%)
Query: 795 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDL 848
R +Q E C C T+I C+QCER H+ C + +E L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 849 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 908
+E +LCC +C+ +L+ ++ G EK + +++ + ++ WR
Sbjct: 130 KEYMCFHFLCCQECQ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWR 173
Query: 909 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMY 967
+L G +DAS + + + + IF D F + S D+I MV G++ + +D+ GMY
Sbjct: 174 LLSG--MDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 229
Query: 968 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1027
CA+LT + VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV L
Sbjct: 230 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 289
Query: 1028 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
P E IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 290 TAPVDPEMAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 337
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 126/543 (23%), Positives = 208/543 (38%), Gaps = 161/543 (29%)
Query: 607 KSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRG------EKFLVGYKKGFGIL 660
+ S + T+K + L+ + G + DG V Y G EK + G G+
Sbjct: 355 RRSGTDAVAPATKKHTILTWLI-DGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVH 413
Query: 661 CTCCNSEVSPSQFEAHAGWASR------RKPFQHIYTSNGVSL----------------- 697
C+CC+ V FEAHAG R R+P++ + +G SL
Sbjct: 414 CSCCDGVVPLPVFEAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFH 473
Query: 698 -------------------------HELSIKLSLERPFSSK----------ENDDLCGIC 722
L + +ER S + +DD CG+C
Sbjct: 474 AQAKVRAALEQEEDKCSQAKRRLLAKHLKKGVVVERIMSPRMEKIKAGEKDSSDDACGVC 533
Query: 723 MDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 782
DGG+LLCCDSC FH +C+++ +P G+W C YC
Sbjct: 534 ADGGELLCCDSCTSTFHPECLAI-KVPEGSWSCHYCR----------------------- 569
Query: 783 VDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC--L 840
CVLC D + + C QC R+YH C L
Sbjct: 570 ------------------------CVLCMSNDDLQG------LSTCQQCARKYHESCRPL 599
Query: 841 KDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNA 900
+G D+ C CK++ L ++ +G N
Sbjct: 600 PGNGC-DIGTY------CGETCKKLFSQLAQV----------------------TGVTNP 630
Query: 901 VDFDVRWRVLRGKKVD--ASDGTRALLSK------AVSIFHDRFDPIIESASKLDLIPAM 952
W +LR +K + +S+ A+L + A+ +F++ F+P+ + +K+D++
Sbjct: 631 TGDGFWWALLRIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQA 690
Query: 953 VY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQS 1010
VY G + Y G Y +L + +VSA + RI G ++AE+P T + QG +
Sbjct: 691 VYSLGSQFKRLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRR 750
Query: 1011 LFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSM 1070
L +E++L + V+ LV+P+ W F F + + + + + L++ GT++
Sbjct: 751 LVSAVEQVLASVQVEKLVIPAIDSLVDTWKRSFFFRPVDPQLREELKR-LSLVVITGTTL 809
Query: 1071 LQK 1073
L K
Sbjct: 810 LHK 812
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 130 bits (326), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 164/366 (44%), Gaps = 64/366 (17%)
Query: 710 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVE 769
+ +NDD CG+C DGG+L+CCD+CP ++H+ C+ IP G+W+C C
Sbjct: 995 LQADQNDDTCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR---------- 1044
Query: 770 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCD 829
D ++VS + P T L C
Sbjct: 1045 -------------CDVCGEVVSS-----KEPRTPLHA-------------------FECS 1067
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 889
QCER+YH+ C+ + + + P G W C C++I +L+ V +P+ D
Sbjct: 1068 QCERQYHIKCISGKVLCNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPDHLDDGFS 1121
Query: 890 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 949
+ D V R+ + A + ++ A+SI + F PI + + +D++
Sbjct: 1122 CTILHNNGDQKV------RMAADIALLAECNMKLII--ALSILEECFLPIFDPRTGMDIM 1173
Query: 950 PAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1007
P ++Y RS+ DY G Y +L + ++S R+ G +AE+PL+AT + + QG
Sbjct: 1174 PLILYNWRSNFVHLDYKGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGM 1233
Query: 1008 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQG 1067
+ + IE++L L V+ L+L + WT+ FGF + + ++ K + L G
Sbjct: 1234 CRRIVDYIEQMLKSLKVEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKL-SKIRLASVPG 1292
Query: 1068 TSMLQK 1073
T +L+K
Sbjct: 1293 TVLLKK 1298
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 129 bits (325), Expect = 6e-27, Method: Composition-based stats.
Identities = 52/97 (53%), Positives = 72/97 (74%)
Query: 659 ILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDL 718
I+C+CC E+SPSQFE+HAG ++RR+P++HIYTSNG+SLH+++I L+ + ++ DD+
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 755
C DGGDL+ C SCPRAFH C+ L P G WHC
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 129 bits (325), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/460 (25%), Positives = 185/460 (40%), Gaps = 99/460 (21%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 708
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 428 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 486
Query: 709 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 764
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 487 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK 546
Query: 765 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 824
++ ++ + + L C L
Sbjct: 547 DEAAKHETSTLPS--------------------------LSSCRL--------------- 565
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE-T 883
CE +YH C+ G + C C+ + LQ + + LPE
Sbjct: 566 ------CEEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGF 616
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 943
S +++ S D D+ ++ A ++ A S+ + F P+++
Sbjct: 617 SWSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHR 663
Query: 944 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1001
S ++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 664 SGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYM 723
Query: 1002 CQGQG--------------YFQSLFCCIEKLLGFLNVKTLVLPSA--SEAQAIWTNKFGF 1045
+ QG YF +F I ++L L+V A + + + FGF
Sbjct: 724 YRRQGMCRRLMDGIESFVAYFSQMFLAISEVL--LDVWQFCCYPACFGDGPFCFFSGFGF 781
Query: 1046 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSV 1085
+ + + E+ +N L++F G ML K + K +I V
Sbjct: 782 APVNDSEKKTIKN-LNLLVFPGVDMLGKSLVKEKITDSVV 820
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 129 bits (324), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 158/374 (42%), Gaps = 82/374 (21%)
Query: 710 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVE 769
+ +NDD CG C DGG+L+CCD+CP ++H C+S IP G W YC
Sbjct: 3 LQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNW---YC----------- 48
Query: 770 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCD 829
S C+ C +C K + C
Sbjct: 49 ---------------------SSCL------------CDICGEVIDSKELVTSLPALDCS 75
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 889
QCER+YHV C+ E G W C C I + + V
Sbjct: 76 QCERQYHVKCVS--AKVPCNEDGSGTWFCGRKCHEIYMTFRSRV---------------- 117
Query: 890 KKHEESGSDNAVDFDVRWRVLRG---KKVDASDGTRAL------LSKAVSIFHDRFDPII 940
G + +D D+ + VLR KKV ++ + L A SI + F PI+
Sbjct: 118 ------GVPDHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPIL 171
Query: 941 ESASKLDLIPAMVYG-RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATS 999
+ + +D+IP+++Y RS +Y G Y +L + +VS R+ G LAE+PLVATS
Sbjct: 172 DPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATS 231
Query: 1000 NDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRND 1059
+ + QG + L IE++L L V+ L+L + WT+ FGF + E ++ + +
Sbjct: 232 KENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRL-SK 290
Query: 1060 YPLMIFQGTSMLQK 1073
L GT +L+K
Sbjct: 291 VRLAAVPGTVLLKK 304
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 184/459 (40%), Gaps = 87/459 (18%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 708
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 709 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W YC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAW---YC------- 689
Query: 766 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 825
YN + + + E + L C LC + C +
Sbjct: 690 ----YNCSCKFCEKDEAAKHETSTLP-----------SLSSCRLCEEK--CSKHYPH--- 729
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET-S 884
L D H C+ G + C C+ + LQ + + LPE S
Sbjct: 730 TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGFS 780
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 944
+++ S D D+ ++ A ++ A S+ + F P+++ S
Sbjct: 781 WSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHRS 827
Query: 945 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 828 GVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMY 887
Query: 1003 QGQG--------------YFQSLFCCIEKLLGFLNVKTLVLPSA--SEAQAIWTNKFGFS 1046
+ QG YF +F I ++L L+V A + + + FGF+
Sbjct: 888 RRQGMCRRLMDGIESFVAYFSQMFLAISEVL--LDVWQFCCYPACFGDGPFCFFSGFGFA 945
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSV 1085
+ + E+ +N L++F G ML K + K +I V
Sbjct: 946 PVNDSEKKTIKN-LNLLVFPGVDMLGKSLVKEKITDSVV 983
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 114/450 (25%), Positives = 178/450 (39%), Gaps = 120/450 (26%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRR-------KPFQHIYTSNGVSLHELSIKL------ 704
GI C+CC + S FE HAG S R KP I+ +G SL E I++
Sbjct: 619 GIKCSCCGKVYTLSGFELHAGIKSCRSGASKYCKPAASIFLDDGRSLLECQIQMMRDKEM 678
Query: 705 ---------SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 755
SL+ + ND +C +C GG+L+ CD
Sbjct: 679 SNHKAETPDSLKGSWDRDGNDHVCSVCHYGGELILCD----------------------- 715
Query: 756 RYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF 815
+C ++F K RC+ + PD + C C +
Sbjct: 716 -HCPSSFHK---------------------------RCLGMKDVPDGDWF-CPSCCCKIC 746
Query: 816 CKSRFGRRT------VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 869
+++ + T V+ C QCE +YH+ CL + + ++ PK C C+ +
Sbjct: 747 GQNKLKKDTKDFIDGVLNCTQCEHQYHIMCLSNSWTDKWKDHPKENSFCSKKCEVYMQSD 806
Query: 870 QKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 929
Q +D ++ ET ++ K L A+
Sbjct: 807 QHKLDAFDD---ETLVETYSK----------------------------------LKIAL 829
Query: 930 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 987
+ H+ F+PI E + DL+ +++ G ++ G Y +L N +VS RI G
Sbjct: 830 DVVHECFEPIEEPRTGRDLMKDVIFSNGSELNRLNFQGFYTILLEKNDELVSVATVRIHG 889
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
++AE+PLV T + G + L +EK L L V+ LVLP+ WT FGFS
Sbjct: 890 DKVAEIPLVGTRFQFRQLGMCRILMDVLEKKLMELGVQRLVLPAVPGVLNTWTGSFGFSK 949
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
MT+ E+ ++ DY + FQ T M QK + K
Sbjct: 950 MTDSERLQFV-DYTFLDFQDTVMCQKLLMK 978
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 155/369 (42%), Gaps = 95/369 (25%)
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 770
++ +D CGIC +GG+L+CC++ C TF E
Sbjct: 264 AAANSDQCCGICNEGGELVCCET------------------------CPLTFHME----- 294
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTP-DTELGGCVLCRGRDFCKSRFGRRTVILCD 829
C+ +++ P D LC C RT C+
Sbjct: 295 ----------------------CVSLLEVPKDAWFCFRCLC-----CHCGEPLRTQP-CE 326
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 889
QCER +H GC D L + C + C I L ++V +++ +
Sbjct: 327 QCERCFHPGCCDD------AILAGDFFFCSSGCWNIFQRLAEMV---------ATVNPLG 371
Query: 890 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 949
+ ++ W +LR + D LL++A+ + RFDP+++ ++LD +
Sbjct: 372 RS------------ELSWSLLRRGRCDDK-----LLAEALQVISSRFDPVLDCWTQLDYL 414
Query: 950 PAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1007
AMV+ RSH D+ G Y A+L VV + RI G LAE+P +AT +GQG
Sbjct: 415 DAMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGI 474
Query: 1008 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN-DYPLMIFQ 1066
+SLF +E++L L V+ +VL +A + + +W N F F M + + K R L+
Sbjct: 475 CRSLFTAVEEMLARLGVEMMVLLAAKDTEKMWKNSFEFHAM--DRKLKARTVALGLVALN 532
Query: 1067 GTSMLQKPV 1075
G LQK V
Sbjct: 533 GAGFLQKSV 541
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/95 (60%), Positives = 72/95 (75%), Gaps = 1/95 (1%)
Query: 989 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1048
++AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IWT KFGF +
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 1049 TEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1082
++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 230
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 146/290 (50%), Gaps = 37/290 (12%)
Query: 795 RIVQTPDTELGGCVLCRGRDFCKS--RFGRRTVILCDQCEREYHVGCLKDHGMED----- 847
R +Q + C LC D C + ++ C+QCER HV C ++G++
Sbjct: 622 RTLQQIADKTESCRLCG--DACTTIGTISEDMIVFCNQCERPCHVKCY-NNGLQKQKGPL 678
Query: 848 --LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDV 905
L E + + CC C+ + +L +++++ E+ I++K V
Sbjct: 679 NVLAEYMQFHFFCCQKCQLLRASLHEVLNKREK---------IRQKRSY----------V 719
Query: 906 RWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYH 964
W++L G ++ + + + + IF F +AS +I MV + G+ D+
Sbjct: 720 FWQILNG--MNPGINVQKYIHQVIEIFKVAFPKT--AASDFGVIQDMVNAKDVGGEKDFR 775
Query: 965 GMYCAILTVN-QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLN 1023
GMYCA+LT + ++VVSA + ++ +E+AEL +VAT N + +GYF L IE L +N
Sbjct: 776 GMYCAVLTTSSKLVVSAAVLKVRTEEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMN 835
Query: 1024 VKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
V+ L E ++IW+ K GF++++ EE+ +PL++F+ +++QK
Sbjct: 836 VRLLTALVDPEMESIWSKKLGFTILSGEEKETLLEAHPLVMFEDLTLMQK 885
Score = 96.3 bits (238), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 152/363 (41%), Gaps = 86/363 (23%)
Query: 387 PVKQ-ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSG 445
PV+ ET +P +++ ++ L +G+LEG SV Y + K+
Sbjct: 257 PVRMAETKYRPETLLKD----VRGLLSTGLLEGFSVTYKKNGKMNA-------------- 298
Query: 446 ISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKA 505
C+ FE HAG S+ ++I+L++G +L ++ K L L
Sbjct: 299 ----CE------------FEQHAGQSSNNQNDHIFLDSGISLYKLIQALKYKKLHLLADL 342
Query: 506 VRMVLG---------------------------------SSSMKKANFCLNCRVSFSNAG 532
+ G SS A ++ + S SN G
Sbjct: 343 IEEQTGLPPNLIEYGKWKASFEVQNDDLEDAASDHCSTQSSQDSDAGVTISMKESTSN-G 401
Query: 533 VEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPS------VELEESPA--PSGE 584
+ L + + G+A + LS S E S + EE+P+ +G
Sbjct: 402 ISNLNWSAFRRPRWQYKRGGTATSTQTLSRSPEKGISGLSTGTSMKINTEETPSENTAGP 461
Query: 585 LTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFV 644
L EP + +S + K +D +H+LVF+EGG+ + + Y +
Sbjct: 462 LHSEVTIVQEPPRGHSVGPKSKESRTSKV------RDNSLHQLVFKEGGVPELTILTYKL 515
Query: 645 R-GEKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELS 701
+ GE GYK+G ILC CC+ EV +PS FE HAG RR+P+++IYT G++LHEL+
Sbjct: 516 KHGEVLKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGKRRQPYRNIYTPEGLTLHELA 575
Query: 702 IKL 704
+KL
Sbjct: 576 LKL 578
Score = 46.6 bits (109), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 78/317 (24%), Positives = 122/317 (38%), Gaps = 68/317 (21%)
Query: 437 LRGVVKGSGISCFC----DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
L G V G C C D +G + ++ FE HAG+ +K +I+L NG +L + +
Sbjct: 26 LEGFVAEGGYVCACTSPCDYRRGGKALSALQFEKHAGAVSKNQNGHIFLRNGTSLYALFH 85
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNC------RVSFSNAGVEELMLLCKSCVE- 545
++ P + + RM G A R+ S+ E+ C + E
Sbjct: 86 ALREVPAQRFAEDFRMAAGVQMTVPAATGAPLAAREPDRLQASDLTAEQAP--CAAAQEA 143
Query: 546 -----LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQT 600
L E + S + + +HSS E P S PS L D + E +A
Sbjct: 144 QTLDNLTEEEKASLSLLDLRAHSSVTESDPMDGIEGSSIHPSLSLRDVAE---EIENAAR 200
Query: 601 SSHSKMKSSSVK-----SHGKIT------------RKDLRMHKLVFEEGGLEDG------ 637
HS V+ S G +T + +RM +++ +E
Sbjct: 201 CDHSMPDVGGVRDAPRDSGGLLTDTLVTVPVMEFKYQPVRMADTKYDQIMMESKYQPVRM 260
Query: 638 AEVGYF-------VRG-------EKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 683
AE Y VRG E F V YKK N +++ +FE HAG +S
Sbjct: 261 AETKYRPETLLKDVRGLLSTGLLEGFSVTYKK---------NGKMNACEFEQHAGQSSNN 311
Query: 684 KPFQHIYTSNGVSLHEL 700
+ HI+ +G+SL++L
Sbjct: 312 QN-DHIFLDSGISLYKL 327
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 117 bits (292), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 144/383 (37%), Gaps = 92/383 (24%)
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 770
+S +D +C C GGDL+ CD CP FH+ C+ L +P W C C
Sbjct: 138 NSSLSDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC------------ 185
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPD--TELGGCVLCRGRDFCKSRFGRRTVILC 828
C + D T C+ C
Sbjct: 186 ----------------------CCELCGKGDSSTSTNACLQC------------------ 205
Query: 829 DQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVI 888
R YHV CL G + P + C C + L +L+
Sbjct: 206 ---ARAYHVHCLTKDGCLLPTDYPSENF-CSKSCYELCAQLHQLL--------------- 246
Query: 889 KKKHEESGSDNAVDFD-VRWRVLRGKKVDASD-------GTRALLSKAVSIFHDRFDPII 940
G N D + W + R K D + T + + + H+ F +
Sbjct: 247 -------GISNPTSVDGLTWTLTRSSK-DVYNFPGMPRSSTHVKSFQILRVMHECFRSVK 298
Query: 941 ESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
E ++ D++ ++Y G + ++HG Y +L +VS RI G + AE+PLVAT
Sbjct: 299 EPHTQKDMVTDLIYNSGSKFKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVAT 358
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ + QG + L + KLL V+ L+LP+ + + +W FGFS M E+ + +
Sbjct: 359 PFNFRRQGMCRLLMQEVLKLLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQL-S 417
Query: 1059 DYPLMIFQGTSMLQKPVPKCRIV 1081
Y + FQGT MLQ + RI
Sbjct: 418 GYSFVGFQGTMMLQNVLTSSRIT 440
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 157/395 (39%), Gaps = 87/395 (22%)
Query: 645 RGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-VSLHELSIK 703
R + L G GI C CC+ ++ ++FE HAG + +KP+ +I+ G VSL + +
Sbjct: 672 RTREMLAGKITREGIFCGCCSKILTIAKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLD 730
Query: 704 LSLERPFSSK------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 751
+ E+ S+ E+DD C IC DGGDL+CCD C FH+DC+ + +PSG
Sbjct: 731 -AWEKHTQSENKGFYKIDKGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSG 788
Query: 752 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE---LGGCV 808
W+CR C+ F C + P + L C+
Sbjct: 789 DWYCRSCLCRF------------------------------CGFPQEKPSSSPELLLSCL 818
Query: 809 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKW--LCCADCKRIN 866
C R+YH C G + +P C C++I
Sbjct: 819 QC---------------------SRKYHQTCSSGTGTDSGCTMPGTSIDCFCSPGCRKIY 857
Query: 867 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 926
L KL+ IK E S + V + + K + + +
Sbjct: 858 KRLNKLLG-------------IKNHMEAGFSWSLVHCFANDQAMPNKNKEKLAQCNSKTA 904
Query: 927 KAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFR 984
A ++ + F P I+ S +++I + Y G D+ G Y IL V++A R
Sbjct: 905 LAFTVLDECFQPHIDDRSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVIAAASVR 964
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLL 1019
I G +LAE+P + T + QG + L IE ++
Sbjct: 965 IHGTDLAEMPFIGTRGMYRHQGMCRRLLNGIESVI 999
Score = 43.5 bits (101), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 8/71 (11%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKR 474
I E M V Y+ ++ + L G + GI C C C ++++T A FELHAGS K+
Sbjct: 659 ISEDMHVKYMNSNRTR----EMLAGKITREGIFCGC--C--SKILTIAKFELHAGSKEKK 710
Query: 475 PPEYIYLENGK 485
P I+LE GK
Sbjct: 711 PYANIFLEGGK 721
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 165/393 (41%), Gaps = 96/393 (24%)
Query: 592 SPEPNSAQTSSHSKMKSS----SVKSHGKITRKDLR-----------------MHKLVFE 630
SP P +A T++HSK S+ S+ S KIT++ LR +H L+F+
Sbjct: 296 SPLP-TAVTTNHSKHDSTNLGLSLSSPVKITQRPLRNCSIDSKSKESKTRDTTLHPLIFK 354
Query: 631 EGGLEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 689
E GL D + Y ++ GE GYK+G I+C CCN E SPS FE HAG RR+P+ +I
Sbjct: 355 EDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNI 414
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 749
YT G+SLH+ + L L+ + D+ + L C R P
Sbjct: 415 YTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGRE--------PSTT 464
Query: 750 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 809
SG + P + + R+V+T C
Sbjct: 465 SGP------------------------------IVPLKRTLQE--RVVETE-----SCYF 487
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQELPKGKWLCCADCK 863
C T+I C+QCER H+ C + +E L+E +LCC +C+
Sbjct: 488 CGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQ 547
Query: 864 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 923
+L+ ++ G EK + +++ + ++ WR+L G +DAS +
Sbjct: 548 ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWRLLSG--MDASRDVKL 589
Query: 924 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 956
+ + + IF D F + S D+I MV G+
Sbjct: 590 YMPQVIDIFKDAFMDSTDEHS--DIISDMVNGK 620
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 8/106 (7%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 69 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 121
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
H G S ++I+L+ G +L ++ K L L + +G
Sbjct: 122 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIG 167
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 111/432 (25%), Positives = 170/432 (39%), Gaps = 135/432 (31%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL-----HELSIKL-SLE---- 707
G++C+CC+ + S FEAH G + R+P +I+ + L ++ K+ +LE
Sbjct: 11 GLVCSCCDELFNLSGFEAHTG-SKLRRPAANIFVGDEAQLSIADCQNVAFKMETLESLPG 69
Query: 708 ------RPFSS------------------------KENDDLCGICMDGGDLLCCDSCPRA 737
R F S +D CGIC +GG+L+CC++
Sbjct: 70 LPVARRRKFDSYCQSDEDSGLTTVSSGSDVDYEAAANSDQCCGICNEGGELVCCET---- 125
Query: 738 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 797
C TF E C+ ++
Sbjct: 126 --------------------CPLTFHME---------------------------CVSLL 138
Query: 798 QTP-DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DLQELPKGK 855
+ P D LC C RT C+QCER +H GC D + D G
Sbjct: 139 EVPKDAWFCFRCLC-----CHCGEPLRTQP-CEQCERCFHPGCCDDAILAGDFFFCSSGC 192
Query: 856 WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKV 915
W NL ++L E V E + W +LR +
Sbjct: 193 W---------NLF---------QRLAEMVATVNPLGRSE----------LSWSLLRRGRC 224
Query: 916 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTV 973
D LL++A+ + RFDP+++ ++LD + AMV+ RSH D+ G Y A+L
Sbjct: 225 DDK-----LLAEALQLISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQR 279
Query: 974 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1033
VV + RI LAE+P +AT +GQG +SLF +E++L L V+T+ L +A
Sbjct: 280 GAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVETMALLAAK 339
Query: 1034 EAQAIWTNKFGF 1045
+ + +W N F F
Sbjct: 340 DTEKMWKNSFEF 351
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 80/114 (70%), Gaps = 1/114 (0%)
Query: 969 AILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLV 1028
I+ N VV+AG+ R+FG+E+AELPLVAT + +GYFQ LF CIEKLL LNV ++V
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIV 138
Query: 1029 LPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIV 1081
+P+A E + +W NKFGF + E+ +KY + Y ++ F+G SMLQKPV +I+
Sbjct: 139 VPAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQII 192
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 848 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 889
L+ELPKG W C DC RIN LQKL+ G EKL ++ +I+
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSCFGIIQ 82
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 118/222 (53%), Gaps = 21/222 (9%)
Query: 856 WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKV 915
+LCC +C Q L R EE L E + + + S ++ WR+L G +
Sbjct: 5 FLCCQEC-------QSLRARLEEGL-EKCVGITFLRRIRS--------NICWRLLSG--M 46
Query: 916 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMYCAILTVN 974
DAS + + + + IF D F + S D+I MV G++ + +D+ GMYCA+LT +
Sbjct: 47 DASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTAS 104
Query: 975 QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASE 1034
VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV L P E
Sbjct: 105 THVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPE 164
Query: 1035 AQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1076
IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 165 MAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSLA 206
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 557
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 153/371 (41%), Gaps = 87/371 (23%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 774
N D+C +C GGDLL CD CP AFH C+ L +P +++ +
Sbjct: 255 NCDVCCVCHWGGDLLLCDGCPSAFHHACLGLSSLP-------------EEDLWFCPCCCC 301
Query: 775 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 834
G +E +++L C +QC+R
Sbjct: 302 DICGSMES----------------PANSKLMAC---------------------EQCQRR 324
Query: 835 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 894
+H+ CLK ED + W C + C R+ AL+ L+
Sbjct: 325 FHLTCLK----EDSCIVSSRGWFCSSQCNRVFSALENLL--------------------- 359
Query: 895 SGSDNAV--DFDVRWRVLR----GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDL 948
GS AV D D+ W ++R G+ D D + L AV I H F+P + S DL
Sbjct: 360 -GSKIAVGNDGDLVWTLMRAPNEGEHYD--DEQISKLESAVEILHQGFEPTNDVFSGRDL 416
Query: 949 IPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYF 1008
+ ++Y + G G Y ++ ++ R+ +++ E+PLVAT + + G
Sbjct: 417 VEELIYRKDRTGVG-RGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSYRRSGMC 474
Query: 1009 QSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGT 1068
+ L +EK + + V LVLP+A E WT +FGFS+M E+ + + ++ F GT
Sbjct: 475 RVLMDELEKQMSQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVK-HGMLDFVGT 533
Query: 1069 SMLQKPVPKCR 1079
M K + K R
Sbjct: 534 IMCHKFLQKER 544
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 128/286 (44%), Gaps = 44/286 (15%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKR---------------- 864
++ CD C +H+ CL+ L+ +P+G W C CA C R
Sbjct: 140 LVCCDHCPSTFHLKCLR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHS 193
Query: 865 --INLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 922
+ K + + + + G N VD W +LR + D S
Sbjct: 194 NCVPGCAMKYESSDNQFCSRKCFKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSAENS 253
Query: 923 AL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQD----YHGMYCA 969
L L+ A+++ + F P+I+ S +D++ ++Y R RG+D + G Y
Sbjct: 254 KLESVADLNTRLALALTVIQECFRPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYTV 311
Query: 970 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1029
+L Q ++S R+ G AE+P + T + + QG + L I+++L L V+TLVL
Sbjct: 312 VLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVL 371
Query: 1030 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
P+ +E WT+ FGF +T Q + ++ F G+S+LQKP+
Sbjct: 372 PAIAEFIETWTSAFGFQKLT-AAQGIQLMELNIVTFPGSSVLQKPL 416
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 64/158 (40%), Gaps = 33/158 (20%)
Query: 634 LEDGAEVGYFVR-GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR--------- 683
L +GA V Y + + G GILC CCN S + F+ HAG R
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 684 ---------------------KPFQHIYTSNGVSLHELSIKLS--LERPFSSKENDDLCG 720
+P T + +L + +K S + ENDD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 721 ICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
+C DGG L+CCD CP FH+ C+ L +P G W C C
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC 170
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 112/214 (52%), Gaps = 12/214 (5%)
Query: 345 RPLRRFTRSLLQQKVELAKGS------LSKDGGKRSDVTEVANDGVGGPVKQ--ETVMKP 396
RP +R T+S L+ K++ + + + + R DV +A P K+ E M
Sbjct: 17 RP-KRVTKSKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKKNLELKMSK 75
Query: 397 RKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
+ + +K E+G+LEG+ V+Y+ G K + GLRG +K GI C C C G
Sbjct: 76 KIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDVGILCSCAFCNGR 132
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
+V+ P+ FE+HA +R +YI ENGK+L D++N C+ +PL++LE ++ + ++
Sbjct: 133 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 192
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQ 550
+ C C+ F + V ++ LC C E KES
Sbjct: 193 RTFTCKRCKGIFPSICVGKIGPLCNLCAESKESH 226
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 119/226 (52%), Gaps = 16/226 (7%)
Query: 854 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 913
G C C+ + LQ L+ ++ PE S V+++ HE DV VL
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHE----------DVPEEVL--- 70
Query: 914 KVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAIL 971
+D + ++ A+S+ + F PII+ + ++LI +VY G + D+ G Y IL
Sbjct: 71 ALDKRVECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFIL 130
Query: 972 TVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPS 1031
+++A RI G +LAE+P + T N + QG + L IE +L LNV+ L++P+
Sbjct: 131 ERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPA 190
Query: 1032 ASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1077
+E WT+KFGFS + + E+ + ++ +++F GT +LQKP+ K
Sbjct: 191 ITELVDTWTSKFGFSPLEDSEKQEVKS-ISMLVFPGTGLLQKPLLK 235
>gi|298204724|emb|CBI25222.3| unnamed protein product [Vitis vinifera]
Length = 248
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 71/94 (75%)
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
V++ HA SSNKRPPEYIYLENG +LR +MN K++ L +L++A+R+ +G S +K++ FCL
Sbjct: 139 VWKSHADSSNKRPPEYIYLENGTSLRGVMNAWKNAALYSLDEAIRVAIGCSMIKRSTFCL 198
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEI 556
NC+ AG+ +LC SC++LKESQAG +++
Sbjct: 199 NCKGRIFGAGIGNSKVLCLSCLQLKESQAGPSQV 232
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 125/259 (48%), Gaps = 34/259 (13%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKG---KWLCCADCKRINLALQKLVDRGEEKLP 881
++ CDQC R++HV C + G+ L+ KG W C C+ + LQ L+
Sbjct: 47 ILSCDQCPRKFHVACARSRGLIKLER--KGTCYSWFCSDKCEYVFSGLQHLLG------- 97
Query: 882 ETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD-----GTRALLSKAVSIFHDRF 936
K G+DN + W +L+ + D D + L A+ + H+ F
Sbjct: 98 ---------KSVPVGTDN-----LTWTLLKRVEPDCFDLEVLSANNSKLKLALEVMHECF 143
Query: 937 DPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELP 994
+P ++ + DL+ +++ G + ++ G Y +L N + + R+FG ++AE+P
Sbjct: 144 EPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVRVFGDKVAEVP 203
Query: 995 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1054
VAT + G + L +E+ L L V+ LVLP+A W FGFS+MT ++
Sbjct: 204 FVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFGFSVMTYSDK- 262
Query: 1055 KYRNDYPLMIFQGTSMLQK 1073
K +DYP++ FQGT + QK
Sbjct: 263 KAHSDYPILFFQGTVLCQK 281
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 106 bits (264), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 156/371 (42%), Gaps = 87/371 (23%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 774
N D+C +C GGDLL CD CP AFH C+ L +P +++ +
Sbjct: 253 NCDVCCVCHWGGDLLLCDGCPSAFHHTCLGLSSLP-------------EEDLWFCPCCCC 299
Query: 775 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 834
G +E +++L C +QC+R
Sbjct: 300 DICGSMES----------------PVNSKLMAC---------------------EQCQRR 322
Query: 835 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 894
+H+ CLK E+ + W C + C R++ AL+ L+
Sbjct: 323 FHLKCLK----EEPGIVSCRGWFCSSQCNRVSSALENLI--------------------- 357
Query: 895 SGSDNAV--DFDVRWRVLR----GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDL 948
G AV + D+ W ++R G+ D D + L AV I H F+P + S DL
Sbjct: 358 -GCKIAVGNNGDLVWTLMRAPNEGEHYD--DEQISKLESAVEILHQGFEPTKDVFSGRDL 414
Query: 949 IPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYF 1008
+ +++ + G G Y ++ + ++ R+ +++ E+PLVAT ++ + G
Sbjct: 415 VEELIFRKDRTGVG-RGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMC 472
Query: 1009 QSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGT 1068
+ L +EK + + V LVLP+A E + WT +FGFS+M E+ + + ++ F GT
Sbjct: 473 RVLVDELEKQMSQMGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVK-HGMLDFVGT 531
Query: 1069 SMLQKPVPKCR 1079
M K + K R
Sbjct: 532 VMCHKFLVKER 542
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 106 bits (264), Expect = 8e-20, Method: Composition-based stats.
Identities = 54/146 (36%), Positives = 88/146 (60%), Gaps = 2/146 (1%)
Query: 930 SIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFG 987
+I D D E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I ++
Sbjct: 86 TILSDEQDAFAETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRT 145
Query: 988 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1047
+E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G+++
Sbjct: 146 EEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTI 205
Query: 1048 MTEEEQNKYRNDYPLMIFQGTSMLQK 1073
+++E+++ +PL++F S++QK
Sbjct: 206 LSDEQKHSMLMAHPLVMFANLSLVQK 231
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 124/255 (48%), Gaps = 37/255 (14%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETS 884
+I CD C +H CL E LQ L L ++ ++ G S
Sbjct: 298 LICCDGCPSTFHQSCLDIQLFEQLQML---------------LGVKHELEDG------FS 336
Query: 885 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 944
++++ V FD+ + +KV+ + + L+ A+SI + F PI++ S
Sbjct: 337 WTLVQRTE--------VGFDISLNGIP-QKVECN----SKLAVALSIMDECFLPIVDQRS 383
Query: 945 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1002
++LI ++Y G + +Y G + AIL + ++SA RI G +LAE+P + T +
Sbjct: 384 GINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIY 443
Query: 1003 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1062
+ QG + L IE L LNV+ LV+P+ SE WT+ FGF + + + RN +
Sbjct: 444 RRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEVSSRKEMRN-MNM 502
Query: 1063 MIFQGTSMLQKPVPK 1077
++F GT MLQKP+ K
Sbjct: 503 LVFHGTDMLQKPLLK 517
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 58/122 (47%), Gaps = 15/122 (12%)
Query: 638 AEVGYFVR--GEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 695
A+V Y R L G+ GI C CC+ + S+FE HAG +P Q+I G+
Sbjct: 195 AKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAGM-KLCEPSQNIILETGI 253
Query: 696 SLHELSIK------------LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
SL + + L + NDD CGIC DGGDL+CCD CP FH C+
Sbjct: 254 SLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCL 313
Query: 744 SL 745
+
Sbjct: 314 DI 315
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 68/110 (61%), Gaps = 2/110 (1%)
Query: 702 IKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 761
+K+S + S +E DDLC C GGD+ C CPR+FH CV L +PS W+C C N
Sbjct: 1 MKISKDMELSERETDDLCRECGQGGDIFPCKICPRSFHPACVGLSKVPS-EWYCDNCRNL 59
Query: 762 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 811
QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+
Sbjct: 60 VQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLGGCALCK 108
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 103 bits (256), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 135/345 (39%), Gaps = 109/345 (31%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE-LSIKLSLERPFSSK--- 713
GI+C CC +S S F AHAG + H +S G+ L S L L +S++
Sbjct: 889 GIVCNCCKKNLSVSDFMAHAGRS-------HPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 714 --------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
C +C G F +
Sbjct: 1002 -----------------------------------------------CQVCGGP-FSEKE 1013
Query: 820 FGRRTVIL-CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
+ I C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1014 VSTFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV----- 1067
Query: 879 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKA 928
G+DN +D D+ W +LR G+K+ + L L+ A
Sbjct: 1068 -----------------GTDNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVA 1110
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAIL 971
+++ + F +++ + +D+IP ++Y G + DY G Y IL
Sbjct: 1111 LTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVIL 1155
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/267 (27%), Positives = 127/267 (47%), Gaps = 33/267 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKRINL---ALQKLVDRG- 876
++LCD+C +H C+ LQ P+G W C C C +L + D+
Sbjct: 576 LLLCDKCPSAFHHACV------GLQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 877 ----EEKLPET----SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
+P T SL ++++ S V RW+ ++ DA+D + L A
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRRRYMS----TVTRITRWQ---HEEEDAADHGQ--LCAA 680
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVYGRSHR--GQDYHGMYCAILTVNQVVVSAGIFRIF 986
+ + H+ FD ++E ++ DL +V+ + ++ G Y L +++ G R+F
Sbjct: 681 LDVLHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVF 740
Query: 987 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1046
G ++AELPLV T + QG + L +EK+L + V+ LVLP+ E +WT GF
Sbjct: 741 GNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFH 800
Query: 1047 MMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
MT + + ++ ++ F+GT+M QK
Sbjct: 801 AMTRSDVMEMAVEHAILSFKGTTMCQK 827
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%)
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
S +E D +C +C+DGG+LL CD CP AFH CV L P G W C C
Sbjct: 559 SEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLC 606
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 151/377 (40%), Gaps = 105/377 (27%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 774
+DD CG+C DGG+LLCCD +C +TF E
Sbjct: 436 SDDACGVCADGGELLCCD------------------------FCTSTFHPE--------- 462
Query: 775 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELG----GCVLCRGRDFCKSRFGRRTVILCDQ 830
C+ I + PD C LC D + + C +
Sbjct: 463 ------------------CLAI-EVPDGSWSCHYCRCTLCMSND-------DQDLSTCQE 496
Query: 831 CEREYHVGC--LKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVI 888
C +YH C L +G D+ C CK+++ L +++
Sbjct: 497 CACKYHESCRPLLGNG-RDIGAY------CGEICKKLSAKLSEVI--------------- 534
Query: 889 KKKHEESGSDNAVDFDVRWRVLRGKKVD--ASDGTRALLSK------AVSIFHDRFDPII 940
G N+ + W +LR + + +S G A+L + A+ + + F+P+
Sbjct: 535 -------GVMNSTEDGFSWSLLRIHEDEPASSQGMPAVLERNVKLAVALGVLNQCFNPVK 587
Query: 941 ESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
+ +K+D++ VY G + Y G Y IL + +VS + RI G+++AE+P T
Sbjct: 588 DRRTKIDMLHQAVYSLGSQFKRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGT 647
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1058
+ QG + +E++L + V+TL++P+ + W F F + + + + +
Sbjct: 648 LPAYRKQGMMHRVVSAVEQVLASVQVETLIIPAIASMVDTWKRSFSFRPVDPQLREELKR 707
Query: 1059 DYPLMIFQGTSMLQKPV 1075
L++ GT+ML KPV
Sbjct: 708 -LSLVVITGTTMLHKPV 723
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 100 bits (248), Expect = 5e-18, Method: Composition-based stats.
Identities = 45/69 (65%), Positives = 53/69 (76%), Gaps = 1/69 (1%)
Query: 762 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 821
FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P+ E+ CVLCRG DF KS FG
Sbjct: 2 FQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIVN-PEAEVSACVLCRGYDFSKSGFG 60
Query: 822 RRTVILCDQ 830
R +I+CDQ
Sbjct: 61 PRMIIMCDQ 69
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 85/144 (59%), Gaps = 3/144 (2%)
Query: 936 FDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 993
F PII+ + ++LI +VY G + D+ G Y IL +++A RI G +LAE+
Sbjct: 5 FLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEM 64
Query: 994 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1053
P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + + E+
Sbjct: 65 PFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEK 124
Query: 1054 NKYRNDYPLMIFQGTSMLQKPVPK 1077
+ ++ +++F GT +LQKP+ K
Sbjct: 125 QEVKS-ISMLVFPGTGLLQKPLLK 147
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 85/151 (56%), Gaps = 3/151 (1%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 982
L AV I H+ F IIE ++ D+ +V+ R R ++ G Y +L +VS G
Sbjct: 8 LCIAVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGT 67
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
FRI GQ+ AELPL+ T + + QG + L +EKLL L V+ L+LP+ E WT
Sbjct: 68 FRICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCS 127
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
FGF++M+ E+ + + ++ FQGT+M QK
Sbjct: 128 FGFTVMSNSERLELAGNS-ILSFQGTTMCQK 157
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 90/185 (48%), Gaps = 18/185 (9%)
Query: 628 VFEEGGLEDGAEVGYF--VRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 685
+ + G L+ +V Y R L G+ GI C CC+ +S S+FE HAG + R+P
Sbjct: 537 LIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHAG-SKLRQP 595
Query: 686 FQHIYTSNGVSLHELSIKLSLERP------------FSSKENDDLCGICMDGGDLLCCDS 733
FQ+I+ + GVSL + I ++ + NDD CGIC DGGDL+CCD
Sbjct: 596 FQNIFLNTGVSLFQCQIDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDG 655
Query: 734 CPRAFHIDCVSLPGIPSGTW-HCRYCMNTFQKEKFVEYNANARAAGRIEG--VDPFAQMV 790
CP FH C+ + G W R+ F + +A + G++ + P Q+
Sbjct: 656 CPSTFHQRCLDIRGHLMPDWIFLRFNYRCFLLVIGIAPIVHANSVGQLLKMLLRPRMQIP 715
Query: 791 SRCIR 795
++C+R
Sbjct: 716 AKCVR 720
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 52/96 (54%), Gaps = 2/96 (2%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGI 982
L+ A+++ + F PI++ S ++++ ++Y G + ++ G Y A+L VV++
Sbjct: 770 LALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVASAS 829
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1018
R G LAE+P + T + + QG + LF IE +
Sbjct: 830 IRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESV 865
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 83/151 (54%), Gaps = 3/151 (1%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGI 982
L A+S+ H+ F+P+ ES S DL+ +++ R ++ G Y +L N+ ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
R++G+++AE+PLV T + G L +EK L L V+ LVLP+ WT
Sbjct: 1039 VRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRS 1098
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
FGF+ MT E++++ DY + FQG M QK
Sbjct: 1099 FGFAKMTNLERSQFL-DYTFLDFQGAIMCQK 1128
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 87/219 (39%), Gaps = 60/219 (27%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP-------- 709
GI C CC S FE HA S +P I+ +G SL + IK+ +
Sbjct: 572 GIKCNCCMGIYSFVGFENHASGNSTCRPSASIFLEDGRSLLDCLIKMMHDHKTMETSGKS 631
Query: 710 ---FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 766
S END +C +C GG+L+ CD CP +FH C+ L IP+G W C C ++
Sbjct: 632 FSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR 691
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 826
+I+ D Q++ CI
Sbjct: 692 ------------KIDRDDEVEQLLP-CI-------------------------------- 706
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
QCE +YHV CL++ G D+ G W C DC+++
Sbjct: 707 ---QCEHKYHVRCLEN-GAADISTRYLGNWFCGKDCEKL 741
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 27/277 (9%)
Query: 825 VILCDQCEREYHVGCL------KDHGMEDLQELP----KGKWLCCADCKRINLALQ---K 871
++ CD C +H CL K + D Q+L + WL A I+ Q
Sbjct: 538 LLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRP 597
Query: 872 LVDRGEE---KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK---VDASDGTRAL- 924
L G E ET + + G N + W +L+ +K V + D L
Sbjct: 598 LQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQKDELVTSEDMPVILE 657
Query: 925 ----LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVV 978
L+ A+ + ++ F+P+ + +K+D++ VY G + +Y G Y +L + ++
Sbjct: 658 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 717
Query: 979 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1038
S + R G++LAE+P T Q QG + L +EK+L L V+ LV+P+ ++
Sbjct: 718 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 777
Query: 1039 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
W F F M E +++ + L+ GT++LQKP+
Sbjct: 778 WKRSFSFRPMQAEVRDEAKK-LSLVAITGTTLLQKPI 813
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
+DD CG+C DGG LLCCD+CP FH DC+++
Sbjct: 525 SDDACGVCADGGQLLCCDTCPSTFHPDCLAI 555
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 27/277 (9%)
Query: 825 VILCDQCEREYHVGCL------KDHGMEDLQELP----KGKWLCCADCKRINLALQ---K 871
++ CD C +H CL K + D Q+L + WL A I+ Q
Sbjct: 540 LLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRP 599
Query: 872 LVDRGEE---KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK---VDASDGTRAL- 924
L G E ET + + G N + W +L+ +K V + D L
Sbjct: 600 LQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQKDELVTSEDMPVILE 659
Query: 925 ----LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVV 978
L+ A+ + ++ F+P+ + +K+D++ VY G + +Y G Y +L + ++
Sbjct: 660 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 719
Query: 979 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1038
S + R G++LAE+P T Q QG + L +EK+L L V+ LV+P+ ++
Sbjct: 720 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 779
Query: 1039 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1075
W F F M E +++ + L+ GT++LQKP+
Sbjct: 780 WKRSFSFRPMQAEVRDEAKK-LSLVAITGTTLLQKPI 815
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 745
+DD CG+C DGG LLCCD+CP FH DC+++
Sbjct: 527 SDDACGVCADGGQLLCCDTCPSTFHPDCLAI 557
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 117/259 (45%), Gaps = 33/259 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKRINLALQKLVDRGEEKL 880
+++CD+C +H C+ L+ P+G W C CA C +L +
Sbjct: 423 LLMCDRCPSMFHHACV------GLESTPQGDWFCPACTCAICGSSDL----------DDP 466
Query: 881 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 940
P T+ + SD V + R + D + A L A+ + + F +I
Sbjct: 467 PATTTT------QGFSSDRMV---ISCEQCRRESRDGEEEEHAKLCMALDVLRECFVTLI 517
Query: 941 ESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 998
E ++ DL +V+ R D+ G Y L +++ R++G+E+AE+PLV T
Sbjct: 518 EPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 577
Query: 999 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN-KFGFSMMTEEEQNKYR 1057
+ QG + L I+KLLG + V+ LVLP+ E A WT FG M + ++
Sbjct: 578 RFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIREMGQADRQDVA 637
Query: 1058 NDYPLMIFQGTSMLQKPVP 1076
+ + ++ FQGT M K +P
Sbjct: 638 H-HAILRFQGTIMCHKQLP 655
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 54/122 (44%), Gaps = 29/122 (23%)
Query: 666 SEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKLSLER------------- 708
+E + ++F H +S R+P+ ++ +G SL + ++L +
Sbjct: 332 AEYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKK 391
Query: 709 --PFSSK---------ENDD-LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 756
P+ + E+DD +C +C D G+LL CD CP FH CV L P G W C
Sbjct: 392 YLPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCP 451
Query: 757 YC 758
C
Sbjct: 452 AC 453
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 88.2 bits (217), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 81/140 (57%), Gaps = 13/140 (9%)
Query: 388 VKQETVMKPR--KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
K ETV+K + K +K F S +K+ L +GI +G+ V Y S+ K L+GV
Sbjct: 379 AKNETVVKNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYCTWSREKN-----LQGV 433
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
+KG+G C CD CKG + + FE HAG+ +K P +I+ ENGK++ ++ K+SP E
Sbjct: 434 IKGTGYLCSCDICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQE 493
Query: 501 TLEKAVRMVLGSSSMKKANF 520
L A++ V G +++ + NF
Sbjct: 494 MLFDAIQTVTG-ATINQRNF 512
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 71/120 (59%), Gaps = 6/120 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +GI +G+ V Y+ S+ K L+G++KG+G C CD+C ++ +
Sbjct: 441 NNFPSNVKSLLSTGIFDGVQVKYVSWSREKS-----LKGIIKGTGYLCSCDNCNQSKALN 495
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAG+ K P +IY ENGKT+ ++ K++P + L A++ V G S++ + NF
Sbjct: 496 AYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVTG-STINQKNF 554
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 40/73 (54%), Gaps = 6/73 (8%)
Query: 633 GLEDGAEVGYFVRG-EKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 687
G+ DG +V Y EK L G KG G LC+C CN ++ +FE HAG A + P
Sbjct: 454 GIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTKHPNN 512
Query: 688 HIYTSNGVSLHEL 700
HIY NG +++ +
Sbjct: 513 HIYFENGKTIYAV 525
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 87.0 bits (214), Expect = 5e-14, Method: Composition-based stats.
Identities = 40/73 (54%), Positives = 51/73 (69%)
Query: 1005 QGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI 1064
QGYFQ+LF CIE+LL L VK VLP+A EA++IWT +FGF +T++E +Y +
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTV 62
Query: 1065 FQGTSMLQKPVPK 1077
FQGTS L K VPK
Sbjct: 63 FQGTSTLHKLVPK 75
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 86.3 bits (212), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 114/254 (44%), Gaps = 32/254 (12%)
Query: 290 EKKNDIERELVENGVLESSMVG---KHSSTLC----NGESNVAKSVAVDGNDEGKTVN-- 340
E +DI + N ESS + H C G+ N G +EG ++
Sbjct: 159 ENNSDIPTDQAFNRENESSFISIGDAHIVATCPTYIKGDDNAISISDAYGKEEGNMISFG 218
Query: 341 --------VVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
+ V RP+ + +S + V+ + K+ SD A++ K E+
Sbjct: 219 EFHDAHDMIAVGRPISSYAQSYDESSVQTPEAVQQKEFDA-SDAHATASNTRVAKSKTES 277
Query: 393 VMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGS 444
V + + K RK F S +++ + +G+L+G+ V YI S+ + LRGV+KGS
Sbjct: 278 VSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLDGVPVKYIALSREE------LRGVIKGS 331
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C ++V+ FE HAG K P +IY ENGKT+ I+ + +P L
Sbjct: 332 GYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFD 391
Query: 505 AVRMVLGSSSMKKA 518
++ V G+ +K+
Sbjct: 392 VIQTVFGAPINQKS 405
Score = 44.3 bits (103), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC+C CN ++ +FE HAG ++ P H
Sbjct: 307 GMLDGVPVKYIALSREELRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKH-PNNH 365
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 366 IYFENGKTIYQI 377
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 84/167 (50%), Gaps = 24/167 (14%)
Query: 388 VKQETVMKPRKVMR--------KFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
VK ETV K ++ ++ F S +++ + +GIL+G+ V YI S+ + LRG
Sbjct: 344 VKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSREE------LRG 397
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C +V+ FE HAG K P +IY ENGKT+ I+ + +P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNA--------GVEELML 538
L ++ V G+ +KA N + SF A G EEL L
Sbjct: 458 SLLFDTIQTVFGAPIHQKA--FRNWKESFQAATRELQRIYGKEELNL 502
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 378 GILDGVPVKYISVSREELRGIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKH-PNNH 436
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 437 IYFENGKTIYQI 448
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K ET ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG C
Sbjct: 266 KAETKSSKKEASTSFPSNVRSLISTGMLDGVPVTYVSISREE------LRGVIKGSGYLC 319
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C+ +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 320 GCQTCEFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQT 379
Query: 509 VLGSSSMKKA 518
V GS +KA
Sbjct: 380 VFGSPINQKA 389
Score = 39.7 bits (91), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 17/113 (15%)
Query: 593 PEP-NSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLV 651
P P N A+T S K S+S S+ + L+ G+ DG V Y + L
Sbjct: 261 PVPKNKAETKSSKKEASTSFPSN---------VRSLI--STGMLDGVPVTYVSISREELR 309
Query: 652 GYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 700
G KG G LC C E ++ FE HAG ++ P HIY NG +++++
Sbjct: 310 GVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKH-PNNHIYFENGKTIYQI 361
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 99/199 (49%), Gaps = 16/199 (8%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ G D+ + + V+RPL + S Q V + ++ + + + VA++
Sbjct: 178 LSFGGFDDAHDI-IPVDRPLSSYDHSYDQSSVR-TREAVDEKELRTTTAKAVASNTQATK 235
Query: 388 VKQETVMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
+ E V K R K RK F S +++ + +G+L+G+ V Y+ S+ + LRG
Sbjct: 236 SRTEPVSKNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYVSLSREE------LRG 289
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C ++V+ FE HAG K P +IY ENGKT+ I+ + +P
Sbjct: 290 IIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 349
Query: 500 ETLEKAVRMVLGSSSMKKA 518
L ++ V G+ +K+
Sbjct: 350 SMLFDVIQTVFGAPINQKS 368
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 270 GMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKH-PNNH 328
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 329 IYFENGKTIYQI 340
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 101/214 (47%), Gaps = 25/214 (11%)
Query: 341 VVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVM 400
+ V RP + + Q V ++ + K+ S VA+ VK ETV K ++ +
Sbjct: 259 ISVGRPAAEYDQLYNQSSVHVSTTAHEKELDVSSS-DAVASTLQVAKVKSETVSKNKQEL 317
Query: 401 R--------KFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDD 452
+ F S +++ + +GIL+G+ V Y+ S+ + LRG++KGSG C C
Sbjct: 318 KTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSREE------LRGIIKGSGYLCGCQS 371
Query: 453 CKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS 512
C +V+ FE HAG K P +IY ENGKT+ I+ + +P L ++ V G+
Sbjct: 372 CNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGA 431
Query: 513 SSMKKANFCLNCRVSFSNA--------GVEELML 538
+KA N + SF A G EEL L
Sbjct: 432 PINQKA--FRNWKESFQAATRELQRIYGKEELNL 463
Score = 41.2 bits (95), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 339 GILDGVPVKYVSVSREELRGIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKH-PNNH 397
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 398 IYFENGKTIYQI 409
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 64/103 (62%)
Query: 434 VSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV 493
VSGLRGV++ GI C C C+G +V++P+ FE+HA +R EYI ENGK+L D++
Sbjct: 32 VSGLRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRA 91
Query: 494 CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL 536
C+ +PL LE ++ ++ S +K C C+ F ++ +E +
Sbjct: 92 CRGAPLHDLEATIQNIVCSPPEEKYFTCKRCKGRFPSSCMERV 134
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 7/145 (4%)
Query: 390 QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+E M + F S +K+ L +G+L+G+ V Y+ S+ K L+G++KG+G C
Sbjct: 431 KEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKN-----LKGIIKGTGYLCS 485
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C++C ++ + FE HAG K P +IY ENGKT+ ++ K++P E L A++ V
Sbjct: 486 CENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNV 545
Query: 510 LGSSSMKKANFCLNCRVSFSNAGVE 534
G S + + NF + + S+ A +E
Sbjct: 546 TG-SPINQKNFRI-WKASYQAATLE 568
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 6/73 (8%)
Query: 633 GLEDGAEVGYFVRG-EKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 687
G+ DG V Y EK L G KG G LC+C CN ++ +FE HAG ++ P
Sbjct: 455 GMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKH-PNN 513
Query: 688 HIYTSNGVSLHEL 700
HIY NG +++ +
Sbjct: 514 HIYFENGKTIYAV 526
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 7/145 (4%)
Query: 390 QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+E M + F S +K+ L +G+L+G+ V Y+ S+ K L+G++KG+G C
Sbjct: 410 KEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKN-----LKGIIKGTGYLCS 464
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C++C ++ + FE HAG K P +IY ENGKT+ ++ K++P E L A++ V
Sbjct: 465 CENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNV 524
Query: 510 LGSSSMKKANFCLNCRVSFSNAGVE 534
G S + + NF + + S+ A +E
Sbjct: 525 TG-SPINQKNFRI-WKASYQAATLE 547
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 6/73 (8%)
Query: 633 GLEDGAEVGYFVRG-EKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 687
G+ DG V Y EK L G KG G LC+C CN ++ +FE HAG ++ P
Sbjct: 434 GMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKH-PNN 492
Query: 688 HIYTSNGVSLHEL 700
HIY NG +++ +
Sbjct: 493 HIYFENGKTIYAV 505
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 627 LVFEEGGLEDGAEVGYF--VRGEKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 684
+ + G L+ +V Y R L G+ GI C CC+ ++ S+FE HAG + R+
Sbjct: 550 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAG-SKLRQ 608
Query: 685 PFQHIYTSNGVS--LHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 742
PFQ+I+ ++G + + S+ + + P NDD CGIC DGGDL+CCD CP FH C
Sbjct: 609 PFQNIFLNSGGAGNIGFCSVDVIADDP-----NDDACGICGDGGDLVCCDGCPSTFHQRC 663
Query: 743 VSLPGIPSGTW 753
+ + G W
Sbjct: 664 LDIRGHLMPDW 674
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 81/178 (45%), Gaps = 17/178 (9%)
Query: 313 HSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSK--DG 370
HS++ +G++ A++ N+ G + + + S L + E + GS S DG
Sbjct: 452 HSASDSDGKATFARNFLAIKNEVGNDDSRDSSQGTTSKSESPLHHQTEKSTGSSSHRVDG 511
Query: 371 GKRSD---VTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILE-GMSVMYIRG 426
GK S T + V G + P R + L ++SG L+ VMY+
Sbjct: 512 GKSSKHGRSTLLVRRSVRGDNSESDGFVPSSEKRTVLAWL---IDSGTLQLSEKVMYMNQ 568
Query: 427 SKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
+ + + L G + GI C C C ++++ + FE+HAGS ++P + I+L +G
Sbjct: 569 RRTR----AMLEGWITRDGIHCGC--C--SKILAVSKFEIHAGSKLRQPFQNIFLNSG 618
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 16/131 (12%)
Query: 649 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 701
L G+ GI C+CC+ +S +F HAG + KP+++I +G+ + L +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 984
Query: 702 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 754
++ ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +P+ W
Sbjct: 985 MQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWC 1044
Query: 755 CRYCMNTFQKE 765
C C F E
Sbjct: 1045 CSNCSCKFCHE 1055
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 117/252 (46%), Gaps = 57/252 (22%)
Query: 869 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 928
LQ L+ ++ PE S V+++ HEE + + D K+V+ + + ++ A
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEV-PEEVLALD--------KRVECN----SKIAVA 1288
Query: 929 VSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFR-- 984
+S+ + F PI++ + ++LI +VY G + D+ G Y IL +++A R
Sbjct: 1289 LSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLK 1348
Query: 985 ---------------------------------------IFGQELAELPLVATSNDCQGQ 1005
I G +LAE+P + T N + Q
Sbjct: 1349 EKNILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQ 1408
Query: 1006 GYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIF 1065
G + L IE +L LN++ L++P+ +E WT+KFGFS + + E+ + ++ +++F
Sbjct: 1409 GMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKS-VSMLVF 1467
Query: 1066 QGTSMLQKPVPK 1077
GT +LQKP+ K
Sbjct: 1468 PGTGLLQKPLLK 1479
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 8/146 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +G+L+G++V Y+ S+ K LRG +KG+G C C C GN+V+
Sbjct: 421 NNFPSNVKSLLSTGLLDGVAVKYVSWSREKT-----LRGTIKGTGYLCSCKVC-GNKVLN 474
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HA K P +IY ENGKT+ ++ K++P E L A+ V G S++ + NF
Sbjct: 475 AYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFNAIETVTG-SAINQKNF 533
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVEL 546
L+ + S+ A E + K V L
Sbjct: 534 -LSWKASYEAATRELQRIYGKEEVTL 558
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 5/126 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +GI +G++V Y S+ + L+G++KG+G
Sbjct: 382 PKIKDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSRERN-----LKGMIKGTGY 436
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +CK N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 437 LCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 496
Query: 507 RMVLGS 512
+ V GS
Sbjct: 497 QNVTGS 502
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 6/160 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +G+ +G++V Y S+ V L+G++KG+G
Sbjct: 366 PKSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR----EVRNLKGIIKGTGY 421
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +C N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 422 LCGCGNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 481
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVEL 546
+ V G S + NF + S+ A +E + K V L
Sbjct: 482 QNVTG-SDINHKNFN-TWKASYHVASLELQRIYGKDDVTL 519
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 4/126 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +GI +G++V Y S+ + L+G++KG+G
Sbjct: 382 PKIKDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQ----RNLKGMIKGTGY 437
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +CK N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 438 LCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 497
Query: 507 RMVLGS 512
+ V GS
Sbjct: 498 QNVTGS 503
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 40/67 (59%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
++N D C IC DGGDLLCCD+CPR+FH CV L IP W+C+ C+ K K +
Sbjct: 118 RQNRDHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRCVPIMDKRKHTDEEK 177
Query: 773 NARAAGR 779
RA R
Sbjct: 178 RKRAQER 184
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
K+ D+ C C GG +LCC+SC H+ C L P WHC C+
Sbjct: 1299 KQWDEACYKCNKGGKVLCCESCNHVCHLSCSGLLKKPLDEWHCEECL 1345
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 83.6 bits (205), Expect = 6e-13, Method: Composition-based stats.
Identities = 50/151 (33%), Positives = 78/151 (51%), Gaps = 3/151 (1%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 982
L A + H+ F ++E ++ DL +V+ R R + G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
R++G+++AELPLV T + QG L +E LLG V+ LVLP+ E WT
Sbjct: 69 LRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQTWTGS 128
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
FGF +MT+ ++ + +M FQGT+M K
Sbjct: 129 FGFQVMTQSQKLDIAQ-HTIMCFQGTTMCHK 158
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 265 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 318
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 319 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 378
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 379 TVFGSPINQKA 389
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 70/120 (58%), Gaps = 6/120 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +GI +G+ V Y+ S+ K L+G++KG+G C CD+C ++ +
Sbjct: 442 NNFPSNVKSLLSTGIFDGVQVKYVSWSREKS-----LKGIIKGTGYLCSCDNCNQSKALN 496
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAG+ K P +IY ENGKT+ ++ K++ + L A++ V G S++ + NF
Sbjct: 497 AYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVTG-STINQKNF 555
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 40/73 (54%), Gaps = 6/73 (8%)
Query: 633 GLEDGAEVGYFVRG-EKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 687
G+ DG +V Y EK L G KG G LC+C CN ++ +FE HAG A + P
Sbjct: 455 GIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTKHPNN 513
Query: 688 HIYTSNGVSLHEL 700
HIY NG +++ +
Sbjct: 514 HIYFENGKTIYAV 526
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 111/227 (48%), Gaps = 29/227 (12%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ G D+ + + V RP+ + Q V+ + K+ G ++ VA++
Sbjct: 244 LSFGGFDDAHDI-IPVCRPINNYDHPYDQSSVKTREAVDQKELGAKA----VASNTRATK 298
Query: 388 VKQETVMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
K E V K R K RK F S +++ + +G+L+G+ V YI S+ + LRG
Sbjct: 299 SKSEPVSKNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYISLSRKE------LRG 352
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C ++V+ FE HAG K P +I ENGKT+ I+ +++P
Sbjct: 353 IIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPE 412
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNA--------GVEELML 538
L A++ V G+ +K+ F + + SF A G EELML
Sbjct: 413 SMLFDAIQTVFGAPINQKS-FRI-WKESFKAATRELQRIYGKEELML 457
Score = 39.7 bits (91), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y K L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 333 GMLDGVPVKYISLSRKELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKH-PNNH 391
Query: 689 IYTSNGVSLHEL 700
I NG +++++
Sbjct: 392 ICFENGKTIYQI 403
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGSG C
Sbjct: 267 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREE------LHGIIKGSGYLC 320
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L A++
Sbjct: 321 GCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQT 380
Query: 509 VLGSSSMKKA 518
V GS +K+
Sbjct: 381 VTGSPINQKS 390
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 9/122 (7%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD----LRMHKLVFEEGGLEDGAEVGY 642
D SN + +SAQ + +S KS K+++K+ + G+ DG V Y
Sbjct: 242 DASNANGTLSSAQLAKLRPESASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY 301
Query: 643 FVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 698
+ L G KG G LC C CN ++ +FE HAG ++ P HIY NG +++
Sbjct: 302 VSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKH-PNNHIYFENGKTIY 360
Query: 699 EL 700
++
Sbjct: 361 QI 362
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGSG C
Sbjct: 337 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREE------LHGIIKGSGYLC 390
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L A++
Sbjct: 391 GCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQT 450
Query: 509 VLGSSSMKKA 518
V GS +K+
Sbjct: 451 VTGSPINQKS 460
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 9/122 (7%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD----LRMHKLVFEEGGLEDGAEVGY 642
D SN + +SAQ + +S KS K+++K+ + G+ DG V Y
Sbjct: 312 DASNANGTLSSAQLAKLRPESASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKY 371
Query: 643 FVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 698
+ L G KG G LC C CN ++ +FE HAG ++ P HIY NG +++
Sbjct: 372 VSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKH-PNNHIYFENGKTIY 430
Query: 699 EL 700
++
Sbjct: 431 QI 432
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 113/258 (43%), Gaps = 35/258 (13%)
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQC-EREYHVGCLKDHGMEDLQ-ELPKGK--WLCCADC 862
C C G D GRR ++LC C H GC +D E L E+ G + C +C
Sbjct: 16 CTHCGGGDVEPE--GRRVLVLCSACFAAGTHTGCHEDVTGEPLSSEITHGDGLYFCGKEC 73
Query: 863 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 922
+R AL+ R I+ + E+ + ++ K+ D + R
Sbjct: 74 QRSYEALEAATGRRSR---------IRDEPEQ----------YTFELVHYKQDDRT--VR 112
Query: 923 ALLSKAVSIFHDRFDPIIESASK--LDLI------PAMVYGRSHRGQDYHGMYCAILTVN 974
+ + A+ +F F P+I + L+++ P G ++ AIL +
Sbjct: 113 SAVETAMRMFRTSFAPLIMENGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFRLAILRMG 172
Query: 975 QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASE 1034
+++A R+FG + AE+P V+T + G+ + L +E LL V LV+PS +E
Sbjct: 173 GTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCLVIPSINE 232
Query: 1035 AQAIWTNKFGFSMMTEEE 1052
+WTNKFGF+ + +E
Sbjct: 233 LLPMWTNKFGFAKIATDE 250
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K+E + + F S +++ L +G+L+G+ V YI S+ + LRG++KGSG C
Sbjct: 448 KEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSREE------LRGIIKGSGYLC 501
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C ++V+ FE HAG K P +IY ENGKT+ I+ K +P +L ++
Sbjct: 502 GCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQT 561
Query: 509 VLGSSSMKKA 518
+ GS +K+
Sbjct: 562 ITGSPINQKS 571
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG + P H
Sbjct: 473 GMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAG-CKTKHPNNH 531
Query: 689 IYTSNGVSLHEL 700
IY NG +++ +
Sbjct: 532 IYFENGKTIYGI 543
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENG+T+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 78/150 (52%), Gaps = 19/150 (12%)
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRK--VMRKFYSKLKNFLESGILEGMSVMYIRGSKVK 430
++DVT DG +P K F S +K+ L +G+L+G+ V YI S+ K
Sbjct: 333 KTDVTPKNKDG-----------RPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSREK 381
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDI 490
L+G++KG+G C C +C + + FE HA K P +IY ENGKT+ +
Sbjct: 382 N-----LKGLIKGAGYLCGCQECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGV 436
Query: 491 MNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
+ K++P E L +A++ V G S + + NF
Sbjct: 437 VQELKNTPQEMLFEAIQTVTG-SPINQKNF 465
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 94/185 (50%), Gaps = 13/185 (7%)
Query: 356 QQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK------FYSKLKN 409
QQ+V+ ++ LS+ +S+ A+ K +TV K +++ K F +K+
Sbjct: 419 QQEVQTSEQVLSQKYLAQSNADPAASCTPKATSKTDTVSKNKQINTKKDPSSHFPLNVKS 478
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
L +G+ +G+ V Y+ ++ K +RGV+KGSG C C DC + + FE HA
Sbjct: 479 LLSTGMFDGVPVKYVSWTREKS-----VRGVIKGSGYLCSCKDCNSSNCLNAYEFERHAN 533
Query: 470 SSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFS 529
K P +IY ENGKT+ ++ K++P + L + ++ V G + + NF + S+
Sbjct: 534 CKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTG-CPINQKNF-QTWKASYQ 591
Query: 530 NAGVE 534
A VE
Sbjct: 592 AATVE 596
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 79.7 bits (195), Expect = 9e-12, Method: Composition-based stats.
Identities = 37/70 (52%), Positives = 44/70 (62%)
Query: 742 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 801
CV L GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R IRI T +
Sbjct: 5 CVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTLE 64
Query: 802 TELGGCVLCR 811
T GGC LC+
Sbjct: 65 TGFGGCALCK 74
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 69/134 (51%), Gaps = 4/134 (2%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSG----LRGVVKGS 444
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGS
Sbjct: 448 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHGIIKGS 507
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L
Sbjct: 508 GYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFB 567
Query: 505 AVRMVLGSSSMKKA 518
A++ V GS +K+
Sbjct: 568 AIQTVTGSPINQKS 581
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 94/185 (50%), Gaps = 13/185 (7%)
Query: 356 QQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK------FYSKLKN 409
QQ+V+ ++ LS+ +S+ A+ K +TV K +++ K F +K+
Sbjct: 394 QQEVQTSEQVLSQKYLAQSNADPAASCTPKATSKTDTVSKNKQINTKKDPSSHFPLNVKS 453
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
L +G+ +G+ V Y+ ++ K +RGV+KGSG C C DC + + FE HA
Sbjct: 454 LLSTGMFDGVPVKYVSWTREKS-----VRGVIKGSGYLCSCKDCNSSNCLNAYEFERHAN 508
Query: 470 SSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFS 529
K P +IY ENGKT+ ++ K++P + L + ++ V G + + NF + S+
Sbjct: 509 CKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTG-CPINQKNF-QTWKASYQ 566
Query: 530 NAGVE 534
A VE
Sbjct: 567 AATVE 571
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 77/180 (42%), Gaps = 38/180 (21%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVEY 770
N+D C C +GGDLLCC+ CP AFH+ C P GIP+G W C+ C EY
Sbjct: 43 NNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIPAGEWICKKCK--------AEY 94
Query: 771 NANARA-AGRIEGVDPFAQMVSRCIRIVQTPD-----TELGGCVLCRGRDFCK------- 817
N R A + E PFA +V ++ Q P E+ + G K
Sbjct: 95 QDNTRPIAKQSELRSPFAVLVKDNVK--QNPSEFRLPKEMQYHIPLPGSKKRKKSNGHGS 152
Query: 818 -SRFGRR------TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ 870
++ GRR +I CD C +H+ C+ L P G W+C + A+Q
Sbjct: 153 ATKGGRRGNSFLKPLIHCDYCPLSFHLDCMD----PPLTTTPSGLWMCPNHAEHFLPAMQ 208
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 73/165 (44%), Gaps = 25/165 (15%)
Query: 660 LCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLC 719
C +E SP +H+G + T++G H + + + P ++N+D C
Sbjct: 193 FCVPAAAEGSPKHLVSHSG--------EMCVTTHG---HLPAPPVHSQEPALYQDNEDEC 241
Query: 720 GICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAA 777
+C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+ + + + A A
Sbjct: 242 AVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLREADTAAEQLPA 301
Query: 778 ---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 810
G P RC + P D + GG LC
Sbjct: 302 VPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 346
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 46/70 (65%), Gaps = 3/70 (4%)
Query: 705 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCM-NT 761
SL+ P KEN+D C C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+ N
Sbjct: 236 SLKPPAQPKENEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSCVENV 295
Query: 762 FQKEKFVEYN 771
+ + +E +
Sbjct: 296 TEPGQLLEAD 305
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 74/144 (51%), Gaps = 7/144 (4%)
Query: 376 VTEVANDGVGGPVKQETVMKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
T G+ K+E + +KV F S +++ L +G+L+G+ V YI S+ +
Sbjct: 253 ATHTTASGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKYIAWSREE---- 308
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
LRGV+KGSG C C C ++V+ FE HA K P +IY ENGKT+ I+
Sbjct: 309 --LRGVIKGSGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQEL 366
Query: 495 KDSPLETLEKAVRMVLGSSSMKKA 518
+ P L + ++ + GS +K+
Sbjct: 367 RSIPQNMLFEVIQTITGSPINQKS 390
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 11/139 (7%)
Query: 387 PVKQETVMKPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVV 441
P+ + K K +K F S +K+ L +G+L+G+ V Y+ S+ K L G++
Sbjct: 431 PIDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT-----LEGII 485
Query: 442 KGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLET 501
KG+G C C +C N+ + FE HA K P +I+ ENGKT+ ++ K++P
Sbjct: 486 KGTGYLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGV 545
Query: 502 LEKAVRMVLGSSSMKKANF 520
L A++ V G S + + NF
Sbjct: 546 LFNAIQTVTG-SHINQKNF 563
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 70/129 (54%), Gaps = 11/129 (8%)
Query: 395 KPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+P MRK F S +++ + +G+L+G+ V Y+ ++ + LRG++KGSG C
Sbjct: 313 RPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE------LRGIIKGSGYLCG 366
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C C ++++ FE HAG K P +IY ENGKT+ I+ + +P L ++ +
Sbjct: 367 CQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTI 426
Query: 510 LGSSSMKKA 518
G+ +K+
Sbjct: 427 FGAPINQKS 435
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 337 GMLDGVPVKYVSVTREELRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKH-PNNH 395
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 396 IYFENGKTIYQI 407
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 70/129 (54%), Gaps = 11/129 (8%)
Query: 395 KPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+P MRK F S +++ + +G+L+G+ V Y+ ++ + LRG++KGSG C
Sbjct: 319 RPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE------LRGIIKGSGYLCG 372
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C C ++++ FE HAG K P +IY ENGKT+ I+ + +P L ++ +
Sbjct: 373 CQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTI 432
Query: 510 LGSSSMKKA 518
G+ +K+
Sbjct: 433 FGAPINQKS 441
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 633 GLEDGAEVGYFVRGEKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQH 688
G+ DG V Y + L G KG G LC C CN ++ +FE HAG ++ P H
Sbjct: 343 GMLDGVPVKYVSVTREELRGIIKGSGYLCGCQSCNFSKMLNAYEFERHAGCKTKH-PNNH 401
Query: 689 IYTSNGVSLHEL 700
IY NG +++++
Sbjct: 402 IYFENGKTIYQI 413
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 698 HELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHC 755
H ++ + + P ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C
Sbjct: 247 HLPAVPVHSQDPAPYQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQC 306
Query: 756 RYCMNTFQKEKFVEYNANARAA---GRIEGVDPFAQMVSRCIR 795
C+ + + + A A G P S C+R
Sbjct: 307 SSCVAELGRLREADTAAEQLPAVPDKEEHGAQPGGSHWSTCVR 349
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 84/153 (54%), Gaps = 3/153 (1%)
Query: 928 AVSIFHDRFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRI 985
A+SI + F PII++ + +D+IP ++Y RS DY G Y +L + ++S R+
Sbjct: 1125 ALSIMEECFLPIIDARTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRL 1184
Query: 986 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1045
G +AE+PL+AT + + QG + L IE++L L V+ L+L + WT FGF
Sbjct: 1185 HGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGF 1244
Query: 1046 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKC 1078
+ + ++ K + L+ GT +L++ + +C
Sbjct: 1245 VPIDDLDR-KNLSRLRLVSVPGTVLLKRNLYEC 1276
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 55/98 (56%), Gaps = 13/98 (13%)
Query: 658 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 708
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 975 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 1033
Query: 709 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 743
P + ENDD CG+C DGG+L+CCD+CP ++H DC+
Sbjct: 1034 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCL 1071
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 411
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
E + A G E D + C + GG +LC
Sbjct: 412 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 445
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 446 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 488
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 75/166 (45%), Gaps = 27/166 (16%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 385 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC-----EKEGIQWE 439
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A E + V + + D + C +C+ G C CD
Sbjct: 440 AKEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFCRVCKDGGELLC-----------CD 488
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 489 SCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 529
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 62/146 (42%), Gaps = 20/146 (13%)
Query: 710 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM-NTFQKEK 766
F+ + NDD C +C DGG+L+CCD CPRAFH+ C+ P IP GTW C+ C N +
Sbjct: 286 FAVEHNDDECAVCKDGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLCQSNRLKDRT 345
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRI-------------VQTPDTELGGCVLCRGR 813
+ A VD + +Q+ D E G + G
Sbjct: 346 YTHVQPPATETSSGSAVDFSFFSSLSSTSLSTVSASSSAQSIGLQSSDGERVGIRMACGI 405
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGC 839
+ R +I C QC + YH C
Sbjct: 406 CY----LTRGELITCPQCLQAYHALC 427
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNT 761
P ++N+D C +C DGG+LLCCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 287 PQPHQKNEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQA 341
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 75.1 bits (183), Expect = 2e-10, Method: Composition-based stats.
Identities = 47/128 (36%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 962 DYHGMYCAILTVNQVVVSAGI------------FRIFGQELAELPLVATSNDCQGQGYFQ 1009
D+ G Y L + V A RI G ++AE+PLVAT+ + QG Q
Sbjct: 62 DFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYRRQGMCQ 121
Query: 1010 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTS 1069
L +EK+L L+V+ LVLP+ SE +W + FGFS M+ E+ + +P + FQGT+
Sbjct: 122 VLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLR-FPFLGFQGTT 180
Query: 1070 MLQKPVPK 1077
M QK + K
Sbjct: 181 MFQKILSK 188
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 330 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQ 379
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 74.7 bits (182), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 150
Score = 40.8 bits (94), Expect = 3.9, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 795 RIVQTPDTELGGCVLC-RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
R++ +G LC + D C +I CD C R +H+ CL L+E+P
Sbjct: 80 RVLHPSPVCMGVSCLCQKNEDECAVCRDGGELICCDGCPRAFHLACLS----PPLREIPS 135
Query: 854 GKWLCCADCKRINLALQKLVDRGEEKLPE 882
G W C+ C + +Q++ R EE P+
Sbjct: 136 GTWR-CSSC--LQATVQEVQPRAEEPRPQ 161
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 56/120 (46%), Gaps = 14/120 (11%)
Query: 705 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTF 762
S E ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 214 SQEPALYQQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL 273
Query: 763 QKEKFVEYNANARAA---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 810
+ + + A A G P RC + P D + GG LC
Sbjct: 274 GRLREADTAAEQLPAVPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 333
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 68/153 (44%), Gaps = 26/153 (16%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY- 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P GTW C +C +KE
Sbjct: 327 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHC----EKEGITSVS 382
Query: 771 --NANARAAGRIEGV--DPFAQMVSRCIRIVQTPDTELGGCVLCR-GRDFCKSRFGRRTV 825
N +G V D ++VS D C CR G D +
Sbjct: 383 KGNKECEDSGSEPAVTSDKDGKIVSAVHPTSPEKDEHQEFCTECRDGGD----------L 432
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
I CD C YH+ CL + L +P+G WLC
Sbjct: 433 ICCDNCPASYHIACL----IPPLANIPEGVWLC 461
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ + C C DGGDL+CCD+CP ++HI C+ P IP G W C C
Sbjct: 418 EHQEFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRC 464
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 56/120 (46%), Gaps = 14/120 (11%)
Query: 705 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTF 762
S E ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 203 SQEPALYQQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL 262
Query: 763 QKEKFVEYNANARAA---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 810
+ + + A A G P RC + P D + GG LC
Sbjct: 263 GRLREADTAAEQLPAVPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 322
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 99/229 (43%), Gaps = 36/229 (15%)
Query: 546 LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK 605
L+E +A P + SS P + + P+ E + R P N QT S S
Sbjct: 134 LEEPRATPPAALSPRATSS------PGSQAKAKPSKKPESSAEPQRLPLGNGIQTMSTSV 187
Query: 606 MKSSSVKS---HGKITRKDLRMHKLVFEEGGLEDGAEVG-YFVRGEKF--LVGYK----- 654
++ +V S G + + + VFE GG + +VG F KF +G K
Sbjct: 188 QRAVTVSSGDVPGACGAVEGILIQQVFESGGSKKCIQVGGEFYTPSKFEDPIGGKNKTRS 247
Query: 655 KGFGILCTCCNSEVS-PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK 713
G L ++ S P E AG R + + + LH+
Sbjct: 248 SGLKTLVRAKGTQASAPGGGELRAGQQGRLQAPPALPSEP--QLHQ-------------- 291
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 292 KNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQ 340
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 41/66 (62%), Gaps = 2/66 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEK 766
P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ ++
Sbjct: 301 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQGRSQQD 360
Query: 767 FVEYNA 772
A
Sbjct: 361 LPRAEA 366
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 36/49 (73%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 298 QNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSCLQ 346
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 74.3 bits (181), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 150
Score = 40.4 bits (93), Expect = 5.1, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 795 RIVQTPDTELGGCVLC-RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
R++ +G LC + D C +I CD C R +H+ CL L+E+P
Sbjct: 80 RVLHPSPVCMGVSCLCQKNEDECAVCRDGGELICCDGCPRAFHLACLS----PPLREIPS 135
Query: 854 GKWLCCADCKRINLALQKLVDRGEEKLPE 882
G W C+ C + +Q++ R EE P+
Sbjct: 136 GTWR-CSSC--LQATVQEVQPRAEEPRPQ 161
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 55/97 (56%), Gaps = 11/97 (11%)
Query: 669 SPSQFEAHAGWASRRKPFQHIYT-SNGVSLHELSIKLSLERPFSS---KENDDLCGICMD 724
+P +FE G + R P + ++ V H S+ FSS ++NDD C +C D
Sbjct: 236 TPGKFEDAGGKSKNRSPKPPVRPKASQVPQHVRSLSR-----FSSLWGQKNDDECAVCRD 290
Query: 725 GGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
GG+L+CCD CPRAFH+ C+ P IPSGTW C C+
Sbjct: 291 GGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRCV 327
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/53 (54%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 403 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSCL 455
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 250 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 304
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 378 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 432
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 393 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPQPQ 443
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 17/151 (11%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 982
L A+ + H+ F IIE ++ DL +V+ R R ++ G Y +
Sbjct: 8 LCIALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL------------ 55
Query: 983 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1042
+ G++ AELPL+ T + QG + L +EKLL L V+ L+LP+ + WT
Sbjct: 56 --VCGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGS 113
Query: 1043 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
FGF+ M+ ++ +Y + ++ FQGT+M QK
Sbjct: 114 FGFTEMSYSDRFQYAANI-ILSFQGTTMCQK 143
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 347
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 12/116 (10%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
V+++F K LESG+LEG++V Y+ R +V G GVVK I C C CK +Q
Sbjct: 530 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLG------SGVVKSGVILCNCRHCKSHQ 583
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM---NVCKDSP--LETLEKAVRM 508
+ FE H GS+ + P ++I+L+NG+ LR+++ + +D P + L KAV M
Sbjct: 584 GFNASSFEKHVGSTARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVSM 639
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 2/54 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
P ++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+
Sbjct: 426 PHLHQKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCIQ 479
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 410
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
E + A G E D + C + GG +LC
Sbjct: 411 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 444
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 445 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 487
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPQPQ 358
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 12/116 (10%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
V+++F K LESG+LEG++V Y+ R +V G GVVK I C C CK +Q
Sbjct: 503 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLG------SGVVKSGVILCNCRHCKSHQ 556
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM---NVCKDSP--LETLEKAVRM 508
+ FE H GS+ + P ++I+L+NG+ LR+++ + +D P + L KAV M
Sbjct: 557 GFNASSFEKHVGSTARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVSM 612
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 410
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
E + A G E D + C + GG +LC
Sbjct: 411 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 444
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 445 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 487
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDV 437
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
+ + G +E D + C + GG +LC
Sbjct: 438 SDAEEDNGETGEMEEDDHHMEFCRVC---------KDGGELLC----------------- 471
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 472 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPPMKGKVQKIL 514
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 69/147 (46%), Gaps = 19/147 (12%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRAFH+ C+ L P G+W C +C +KE V
Sbjct: 363 DHQDYCEVCQQGGEIMLCDTCPRAFHLVCLDPELEEAPEGSWSCPHC----EKEGVV--- 415
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A +R+ G D + IR P+ E ++FC +I C +C
Sbjct: 416 AASRSTTPATGGD--MSQNPQNIRKSAQPNEEEKD----EHQEFCNECKDGGDLICCAKC 469
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L E+P+G WLC
Sbjct: 470 PVSYHPECL----YPPLSEIPEGPWLC 492
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 297 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSCLQ 346
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 73.2 bits (178), Expect = 8e-10, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
P ++N+D C +C DGG+L+CCD CPRAFH+ C++ P IPSGTW C C+ E
Sbjct: 71 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSCLQATVSE 129
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 272 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 326
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 30/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ V++
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC-----EKEGVQWE 413
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A + +++ V P+ E CR CK ++ CD C
Sbjct: 414 AK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCR---VCKDG---GELLCCDAC 457
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 458 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKIL 496
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 300 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCCLQ 349
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 72/164 (43%), Gaps = 30/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC------------- 404
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + E + +++ V P+ E CR CK ++ CD C
Sbjct: 405 --EKEGVQWEAKEDNSEIDDDMDDTVGDPEEEDHHMEFCR---VCKDG---GELLCCDAC 456
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 457 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKIL 495
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 75/165 (45%), Gaps = 26/165 (15%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM-NTFQKEK----F 767
NDD C +C DGG+L+CCD CPRAFH+ C+ P IPSG+W C +C + +KEK
Sbjct: 259 NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCRGHRVKKEKAQLPL 318
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG------RDFCK-SRF 820
N + I V ++ + S + T GG C G R+ C
Sbjct: 319 QPQQTNTNCSNSITDVSFYSSLSSSLTSVTATMSAS-GGRNQCSGGELVGVREVCGVCHL 377
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 865
G + C QC + +HV C KG+ +C + C R+
Sbjct: 378 GGGDLTHCFQCLKHFHVNC----------HFSKGRSICLS-CSRL 411
>gi|432852260|ref|XP_004067159.1| PREDICTED: uncharacterized protein LOC101164387 [Oryzias latipes]
Length = 1310
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 771
E++D C +C+ GG+LLCCD CP+ FH+ C P PSG W C C + Q E V+YN
Sbjct: 1104 ESEDFCAVCLIGGELLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDAIQPE--VQYN 1161
Query: 772 A-NARAAGR--IEGVDPFAQMVSRCIRIVQTP 800
N RA+G + G+ Q +I++ P
Sbjct: 1162 CENERASGANPLHGLSACDQRARHYYQIIKRP 1193
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 94/228 (41%), Gaps = 54/228 (23%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEV 668
S G + + + VFE GG + +VG GE +
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEFY------------------- 232
Query: 669 SPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSKE 714
PS+FE +G S KP + G ++S + + P ++
Sbjct: 233 KPSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQK 292
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 65/127 (51%), Gaps = 25/127 (19%)
Query: 688 HIYTSNGVSLHELSIKLSLERP-----------FSSKENDDLCGICMDGGDLLCCDSCPR 736
H+ NG S +++ S+ RP N+D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGGGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 737 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 790
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 791 SRCIRIV 797
+C R++
Sbjct: 928 -KCERLL 933
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 40/59 (67%), Gaps = 2/59 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 769
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ ++ +
Sbjct: 240 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCCLQGRGQQGLAQ 298
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 76/170 (44%), Gaps = 31/170 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 376 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 430
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIV----QTPDTELGGCVLCR--GRDFCKSRFGRRTV 825
A E + +R+ + D + C +C+ G C
Sbjct: 431 AKDEDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVCKDGGELLC--------- 481
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 482 --CDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 524
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 711 SSKENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ +E+DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 459 AEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 511
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 69/127 (54%), Gaps = 25/127 (19%)
Query: 688 HIYTSNGVSLHELSIKLSLERPF--------SSKEND---DLCGICMDGGDLLCCDSCPR 736
H+ NG S +++ S+ RP S+K++D D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGVGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 737 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 790
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 791 SRCIRIV 797
+C R++
Sbjct: 928 -KCERLL 933
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 69/127 (54%), Gaps = 25/127 (19%)
Query: 688 HIYTSNGVSLHELSIKLSLERPF--------SSKEND---DLCGICMDGGDLLCCDSCPR 736
H+ NG S +++ S+ RP S+K++D D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGVGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 737 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 790
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 791 SRCIRIV 797
+C R++
Sbjct: 928 -KCERLL 933
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNT 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 424 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQA 474
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 882
+I CD C R +H+ CL LQE+P G W C+ C + +Q + R EE P+
Sbjct: 439 LICCDGCPRAFHLACLSP----PLQEIPSGTWR-CSSC--LQATVQDMRPRAEEPRPQ 489
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 72.4 bits (176), Expect = 1e-09, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 5/104 (4%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKR 474
+L+G+ V YI S+ K LRG++KGSG C C C ++V+ FE HAG K
Sbjct: 1 MLDGVPVKYIAWSREKE-----LRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKH 55
Query: 475 PPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
P +IY ENGKT+ I+ K +P +L ++ + GS +K+
Sbjct: 56 PNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKS 99
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 925 LSKAVSIFHDRFDPIIESASK--LDLI------PAMVYGRSHRGQDYHGMYCAILTVNQV 976
+ +A+ +F F P++ + LD++ P + G ++ G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMDNGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRGA 280
Query: 977 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1036
VV+A R+FG+ AELP VAT + G + L +E LL V LV+PS
Sbjct: 281 VVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPLL 340
Query: 1037 AIWTNKFGFSMMTEEE 1052
+W KFGF+ +TE+E
Sbjct: 341 PMWAAKFGFTPLTEQE 356
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 13/112 (11%)
Query: 692 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 749
+NG S+ + K + + ++ EN+D C +C++GG+LLCCD CP+ FH+ C +L P
Sbjct: 900 TNGTSI---TKKSPVTQEVNTIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFP 956
Query: 750 SGTWHCRYCMNTFQKEKFVEYNA-NARAAGRIE---GVDPFAQMVSRCIRIV 797
G W C C N + E VEY+ N R A G+D + Q +C ++V
Sbjct: 957 VGEWVCTLCRNPMKPE--VEYDCENTRYAHSYNAQYGLDDYDQ--KKCEKLV 1004
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 42/64 (65%), Gaps = 4/64 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 678 NEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 735
Query: 773 NARA 776
+AR+
Sbjct: 736 DARS 739
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
NDD C +C DGG+L+CCD CPRAFH+ C++ P IPSG+W C C
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERC 201
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 26/165 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKF-VEY 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G+W C +C +KE +
Sbjct: 48 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGSWSCPHC----EKEGISMGS 103
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
+A G D A+ V+ +P+ + ++FC +I C+
Sbjct: 104 QVEGKATGTKMAPDKSAKQVA-----AASPEKD-------EHQEFCTECHDGGDLICCEN 151
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL + L +P+G WLC C CK + + K++
Sbjct: 152 CPVSYHLDCL----IPPLTNIPEGVWLCPRCG-CKPLKARVSKIL 191
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 291 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSCLQ 340
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 7/57 (12%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLP 881
+I CD C R +H+ CL LQE+P G W CC+ C + +Q+ + EE P
Sbjct: 306 LICCDGCPRAFHLACLS----PPLQEIPSGTWRCCS-C--LQGGVQQGLPHAEEPQP 355
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 36/151 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 364 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDG 423
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
+ + AG +E D + C + GG +LC
Sbjct: 424 SDGEEDNGDAGDMEEDDHHMEFCRVC---------KDGGELLC----------------- 457
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C YH+ CL L E+P G+W+C
Sbjct: 458 CDSCPSSYHIHCLN----PPLPEIPNGEWIC 484
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM 759
+ C +C DGG+LLCCDSCP ++HI C++ LP IP+G W C CM
Sbjct: 444 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCM 488
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 421
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 422 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 459
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 460 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 500
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 432 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 469
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 470 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 510
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 408
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 409 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 446
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 447 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 487
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 13/112 (11%)
Query: 692 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 749
+NG S+ + K + + ++ EN+D C +C++GG+LLCCD CP+ FH+ C +L P
Sbjct: 901 TNGTSIMK---KSPVTQEVNTIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFP 957
Query: 750 SGTWHCRYCMNTFQKEKFVEYNA-NARAAGRIE---GVDPFAQMVSRCIRIV 797
G W C C N + E VEY+ N R A G+D + Q +C ++V
Sbjct: 958 VGEWVCTLCRNPMKPE--VEYDCENTRYAHSYNAQYGLDDYDQ--KKCEKLV 1005
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 448
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 449 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 486
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 487 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 527
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 6/75 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCCD CP+ FH+ C +P + PSG W C +C + E +EYN
Sbjct: 705 NEDWCAVCQNGGELLCCDRCPKVFHLSC-HIPALHEPPSGEWFCSFCRDLVSPE--MEYN 761
Query: 772 ANARAAGRIEGVDPF 786
N+ + +G P
Sbjct: 762 CNSNDSPVSDGFPPI 776
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 419 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 478
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 479 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 516
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 517 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 557
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 222 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 281
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 282 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 319
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 320 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 360
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 4/57 (7%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM--NTFQKE 765
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+ T Q++
Sbjct: 383 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRD 439
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 4/57 (7%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM--NTFQKE 765
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+ T Q++
Sbjct: 383 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRD 439
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 729 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 786
Query: 773 NARAAG----RIEGVDPFAQMVSR-CIRIV 797
+AR+ +IEG+ + R C R++
Sbjct: 787 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 816
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 34/166 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDN 418
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ DP + D + C +C+ G C CD
Sbjct: 419 SELDDDLDDAVGDP------------EEEDHHMEFCRVCKDGGELLC-----------CD 455
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 456 VCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKIQKIL 496
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W+C +C + E ++YN
Sbjct: 696 NEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTASPSGEWYCTFCRDLNSPE--MQYNV 753
Query: 773 NA 774
NA
Sbjct: 754 NA 755
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 268 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 325
Query: 773 NARAAG----RIEGVDPFAQMVSR-CIRIV 797
+AR+ +IEG+ + R C R++
Sbjct: 326 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 355
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+
Sbjct: 301 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQ 350
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 920 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYN 976
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 933 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYN 989
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 94/228 (41%), Gaps = 54/228 (23%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEV 668
S G + + + VFE GG + +VG GE +
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEFY------------------- 232
Query: 669 SPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSKE 714
PS+FE +G S KP + G ++S + + P ++
Sbjct: 233 KPSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQK 292
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|345787769|ref|XP_542497.3| PREDICTED: tripartite motif-containing protein 66 [Canis lupus
familiaris]
Length = 1207
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 959 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSLTQPE--MEYD 1016
Query: 772 A-NAR 775
NAR
Sbjct: 1017 CENAR 1021
>gi|281341697|gb|EFB17281.1| hypothetical protein PANDA_004268 [Ailuropoda melanoleuca]
Length = 1217
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 988 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSPTQPE--MEYD 1045
Query: 772 A-NAR 775
NAR
Sbjct: 1046 CENAR 1050
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 70/170 (41%), Gaps = 46/170 (27%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
+ D C C GG+L+ CDSCP +FH+DCV P G+P W C+ C+ +
Sbjct: 1000 HSDRCARCRRGGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLCVLEAESSPL----- 1054
Query: 773 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 832
G +G D + +RC + Q +ILCD C
Sbjct: 1055 ----EGCSDGTDSHCDVCARCYKHGQ--------------------------LILCDVCP 1084
Query: 833 REYHVGCLKDHGMEDLQELPKGKW---LCCADCKRINLALQKLVDRGEEK 879
+H+ C L ++P GKW +C DC+ ++ V+RG K
Sbjct: 1085 LAFHLRCTD----PPLLKVPSGKWTCQICVKDCQPVSSG--GFVNRGPAK 1128
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 62/145 (42%), Gaps = 17/145 (11%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF-VEYNAN 773
D C C G L+ C++CP A+H+ C + P IP+G W C C + QK+ +++
Sbjct: 912 DECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVCTDKSQKKPTGIKFKGK 971
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
R G + + + S + D C CR R G +ILCD C
Sbjct: 972 HR-KGLLPTSSSPSSLSSDLETLGIVADGHSDRCARCR-------RGGE--LILCDSCPL 1021
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLC 858
+H+ C+ L +P WLC
Sbjct: 1022 SFHLDCVD----PPLLGVPPDIWLC 1042
Score = 47.0 bits (110), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 10/71 (14%)
Query: 719 CGICMDGGD---LLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNAN 773
C IC GD +L CD C R H+ C+ P IPSG W+C C + K V+ N+
Sbjct: 778 CRICRRKGDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDC-----RPKIVKQNSR 832
Query: 774 ARAAGRIEGVD 784
R +E D
Sbjct: 833 RRRKSTLEDYD 843
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/48 (56%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ NDD C +C DGG+L+CCD CPR+FH+ C+ P IPSGTW C C
Sbjct: 26 QSNDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDAC 73
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 287 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 346
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 347 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 384
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 385 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 425
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 416
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + EG + +V + D + C +C+ G C CD
Sbjct: 417 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 459
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 460 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 500
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 435 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/228 (29%), Positives = 94/228 (41%), Gaps = 54/228 (23%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEKFLVGYKKGFGILCTCCNSEV 668
S G + + + VFE GG + +VG GE +
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEFY------------------- 232
Query: 669 SPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSKE 714
PS+FE +G S KP + G ++S + + P ++
Sbjct: 233 KPSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQK 292
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 293 NEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 815 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYNC 872
Query: 773 N 773
+
Sbjct: 873 D 873
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 33/149 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 410 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC-----EKEGIQWE 464
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + EG D + D + C +C+ G C CD
Sbjct: 465 AREEGS---EGEDDNGDVGE------MEDDHHMEFCRVCKDGGELLC-----------CD 504
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+W+C
Sbjct: 505 SCPSSYHIHCLN----PPLPEIPNGEWIC 529
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCDSCP ++HI C++ LP IP+G W C C
Sbjct: 489 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 532
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 423
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + EG + +V + D + C +C+ G C CD
Sbjct: 424 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 466
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 467 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 507
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 442 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 37/54 (68%), Gaps = 2/54 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
P ++N+D C +C DGG+L+CCD CPRAFH+ C+ P IPSGTW C C+
Sbjct: 297 PQLQQKNEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCLQ 350
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 423
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + EG + +V + D + C +C+ G C CD
Sbjct: 424 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 466
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 467 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 507
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 442 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
KQ+ ++ F + +++ + +G+L+G+ V Y+ ++ + LRG++KG+ C
Sbjct: 295 KQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE------LRGIIKGTTYLC 348
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C + + FE HAG +K P +IY ENGKT+ I+ + +P +L ++
Sbjct: 349 GCQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQT 408
Query: 509 VLGSSSMKKA 518
+ G+ +KA
Sbjct: 409 IFGAPINQKA 418
>gi|301761576|ref|XP_002916215.1| PREDICTED: tripartite motif-containing protein 66-like [Ailuropoda
melanoleuca]
Length = 1398
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 1150 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSPTQPE--MEYD 1207
Query: 772 A-NAR 775
NAR
Sbjct: 1208 CENAR 1212
>gi|312080256|ref|XP_003142522.1| PHD-finger family protein [Loa loa]
gi|307762316|gb|EFO21550.1| PHD-finger family protein [Loa loa]
Length = 569
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 96/248 (38%), Gaps = 83/248 (33%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKF--- 767
N C CM+GG+LLCCD CP +FH+ C P IPSG W C CM+ +
Sbjct: 55 NSQYCSACMEGGELLCCDRCPASFHLMCHEPPIERSSIPSGKWLCNRCMHAVAHDTSPKI 114
Query: 768 --------VEYNA-NARAAGRIEGVD---------------------------------- 784
VE+++ + R + I G+
Sbjct: 115 IKKVPKLDVEHSSLDVRRSHAISGIKEQSADRSLSALAVLADAALAANAEQFSLPHDLSC 174
Query: 785 -----PFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 839
PF + ++ ++V P C LC R+ + T+I CD C YH+ C
Sbjct: 175 EIPPLPFDERLTS-TQLVPPPK---AICHLCASRN------EKSTIIQCDFCSLCYHLDC 224
Query: 840 LKDHGMEDLQELPKGKWLCCADCKRI-------NLALQK--LVDRGEEKLPE-----TSL 885
L L +PK KW+C A + I L+L+K L+ R + P +
Sbjct: 225 L----TPPLPSVPKDKWMCPAHVEHILDRKLAKTLSLRKRLLIWRKYARQPRNGPYFSVP 280
Query: 886 DVIKKKHE 893
D IKK +E
Sbjct: 281 DCIKKLYE 288
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 31/44 (70%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
DD+C IC GGD++CCD CP +H+ C+ LP IPSG W C C+
Sbjct: 177 DDVCSICGLGGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPACV 220
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
NDD C C DGGDLLCCDSC ++FH+ C++ P IP G W+C C
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSC 110
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
DFC S ++ CD CE+ +H+ CL L+E+P+G W C + CK
Sbjct: 67 DFCSSCKDGGDLLCCDSCEKSFHLMCLN----PPLEEIPEGDWYCNS-CK 111
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 77/171 (45%), Gaps = 33/171 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 443
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQT-----PDTELGGCVLCR--GRDFCKSRFGRRT 824
A E +++S V T D + C +C+ G C
Sbjct: 444 AKDEEFEDFEEDS-EDRVISEVSLGVPTGAEEEDDDHMEFCRVCKDGGELLC-------- 494
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 495 ---CDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 537
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%), Gaps = 6/61 (9%)
Query: 704 LSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRY 757
+SL P ++E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C
Sbjct: 464 VSLGVPTGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPR 523
Query: 758 C 758
C
Sbjct: 524 C 524
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 4/59 (6%)
Query: 800 PDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
P E G +D+C+ +ILCD C R YH+ CL+ +L + P+GKW C
Sbjct: 378 PGDEEGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLE----PELDKAPEGKWSC 432
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 75/166 (45%), Gaps = 36/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 254 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC-----EKEGIQWE 308
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + E +M D + C +C+ G C CD
Sbjct: 309 AREEGSEGDEDNGDVGEMED---------DHHMEFCRVCKDGGELLC-----------CD 348
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 349 SCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 389
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 417
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + EG + V + D + C +C+ G C CD
Sbjct: 418 AKEDNS---EGEETMEDAVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 460
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 461 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQKIL 501
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 436 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 304 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCCLQ 353
>gi|340371689|ref|XP_003384377.1| PREDICTED: hypothetical protein LOC100637285 [Amphimedon
queenslandica]
Length = 400
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 33/47 (70%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ D C IC DGG+LLCCD CP+A+H+ C+ P IP GTWHC C
Sbjct: 315 EHADYCHICQDGGELLCCDRCPKAYHLQCLYPPLKKIPDGTWHCPRC 361
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 40/56 (71%), Gaps = 2/56 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 4 AQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 59
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 350 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 409
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 410 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 447
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 448 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 488
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|402894298|ref|XP_003910303.1| PREDICTED: tripartite motif-containing protein 66, partial [Papio
anubis]
Length = 1041
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 956 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1014
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1015 -MEYDCENAR 1023
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 80/178 (44%), Gaps = 38/178 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE +++
Sbjct: 353 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC----EKEG-IQWE 407
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + + C R+ + GG +LC CD C
Sbjct: 408 PKDDDEEDEDLCEEADDHMEFC-RVCKD-----GGELLC-----------------CDTC 444
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVD-RGEEKLPETSLD 886
YH+ CL L E+P G+WLC C C + +Q+++ + +E P SLD
Sbjct: 445 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQRILHWKWKESSPLLSLD 497
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 39/169 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 422
Query: 772 A---NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVI 826
A N+ +E V A+ + D + C +C+ G C
Sbjct: 423 AKEDNSEGEEILEEVGGDAE---------EEDDHHMEFCRVCKDGGELLC---------- 463
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 464 -CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 441 AEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 424
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 425 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 462
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 463 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 503
>gi|349580356|dbj|GAA25516.1| K7_Rco1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 684
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 113/321 (35%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPTVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
+++CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMICDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHADYDFNQDF 537
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 113 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 172
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G DP + D + C +C+ G C CD
Sbjct: 173 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 210
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 211 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 251
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 303 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 362
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 363 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 400
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 401 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 441
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 355 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 414
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 415 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 452
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 453 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 493
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 380
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + + C R+ + GG +LC CD C
Sbjct: 381 KDDEEEEEEAPGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 417
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVD--RGEEKLP 881
YH+ CL L E+P G+WLC C C + +QK++ GE LP
Sbjct: 418 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKILHWTWGEPPLP 466
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 417
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 418 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 455
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 456 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 496
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 2/64 (3%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
NDD C C DGG+L+CCD CP+AFH+ C+ P IPSG W C +C T K + +
Sbjct: 251 NDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWCCGTRGKRETTQQPP 310
Query: 773 NARA 776
A++
Sbjct: 311 LAKS 314
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 760
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 424
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 425 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 462
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 463 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 503
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 425
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 426 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 463
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 464 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 504
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 640 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 697
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ ++G+ P Q +C R++
Sbjct: 698 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 724
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|355697937|gb|EHH28485.1| hypothetical protein EGK_18929, partial [Macaca mulatta]
Length = 1236
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 983 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1041
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1042 -MEYDCENAR 1050
>gi|355752346|gb|EHH56466.1| hypothetical protein EGM_05877, partial [Macaca fascicularis]
Length = 1236
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 983 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1041
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1042 -MEYDCENAR 1050
>gi|403254198|ref|XP_003919864.1| PREDICTED: tripartite motif-containing protein 66 [Saimiri
boliviensis boliviensis]
Length = 1207
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 954 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1012
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1013 -MEYDCENAR 1021
>gi|297268466|ref|XP_001097566.2| PREDICTED: tripartite motif-containing protein 66 [Macaca mulatta]
Length = 1326
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 956 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1014
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1015 -MEYDCENAR 1023
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+L+CCD CP+ FH+ C SL PSG W+C C + E ++YN
Sbjct: 696 NEDWCAVCQNGGELICCDKCPKVFHLSCHVPSLTASPSGEWYCTLCRDLNSPE--MQYNV 753
Query: 773 NA 774
NA
Sbjct: 754 NA 755
>gi|426367401|ref|XP_004050721.1| PREDICTED: tripartite motif-containing protein 66 [Gorilla gorilla
gorilla]
Length = 1085
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 921 PPAPVENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 979
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 980 -MEYDCENAR 988
>gi|397494634|ref|XP_003818179.1| PREDICTED: tripartite motif-containing protein 66 [Pan paniscus]
Length = 1216
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 963 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1021
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1022 -MEYDCENAR 1030
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 50/102 (49%), Gaps = 11/102 (10%)
Query: 669 SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSS---------KENDDLC 719
+P +FE +G R P T EL + P + ++NDD C
Sbjct: 198 TPGKFEEPSGKNKTRSPKPPTRTKATQGRAELQLSQQSGGPATPAQPPELHLHQKNDDEC 257
Query: 720 GICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+C DGG+L+CCD CPRAFH+ C+ P IPSG W C C+
Sbjct: 258 AVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWRCGCCI 299
>gi|395815210|ref|XP_003781127.1| PREDICTED: tripartite motif-containing protein 66 [Otolemur
garnettii]
Length = 1211
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 958 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1016
Query: 767 FVEYNA-NAR 775
VEY+ NAR
Sbjct: 1017 -VEYDCENAR 1025
>gi|441646789|ref|XP_003254588.2| PREDICTED: tripartite motif-containing protein 66 [Nomascus
leucogenys]
Length = 1223
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 970 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1028
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1029 -MEYDCENAR 1037
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 944
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ ++G+ P Q +C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+ NDD C +C DGG+L+CCD CPRAFH+ C+ P IPSG W C C+
Sbjct: 284 QRNDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCCI 332
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 944
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ ++G+ P Q +C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|410044930|ref|XP_003313014.2| PREDICTED: tripartite motif-containing protein 66 [Pan troglodytes]
Length = 1037
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 784 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 842
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 843 -MEYDCENAR 851
>gi|255956317|ref|XP_002568911.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590622|emb|CAP96817.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 986
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 71/199 (35%), Gaps = 53/199 (26%)
Query: 705 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-- 758
S+E P +N D C C GGDLLCCD+C ++H DC+ P P G WHC C
Sbjct: 538 SIEPP----DNSDDCHQCGKGGDLLCCDTCINSYHFDCLDPPLDPKNPPQGEWHCPKCSI 593
Query: 759 MNTF-------QKEKFVEYNANARAAGRIEGVDP----------------FAQMVSRCIR 795
N+F + K E+ GVD + + R
Sbjct: 594 RNSFSTLIAHSKHYKKTEFQVPQDIKEHFHGVDEGIVFEDGYARNLKHQRYYKPAPHLPR 653
Query: 796 IVQTPDTE----------------LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 839
+ + P + G C+ C F G R +I CD C +H+ C
Sbjct: 654 LTKPPKQDGPVTYANPMLLRETDNKGDCIRCSKCGFTSQ--GTRPIITCDYCPCRFHLDC 711
Query: 840 LKDHGMEDLQELPKGKWLC 858
L PK W+C
Sbjct: 712 LDPPRANPPN--PKVGWMC 728
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 11/95 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 669 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 726
Query: 773 NARAAG----RIEG---VDPFAQMVSRCIRIVQTP 800
+A + + EG + P + V +I++ P
Sbjct: 727 DAPSHNSEKKKTEGLVKLTPIDKRVPDYYKIIKNP 761
>gi|410973162|ref|XP_003993024.1| PREDICTED: tripartite motif-containing protein 66 [Felis catus]
Length = 1211
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ FH+ C LP + P G W C C + Q E +EY
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVFHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--MEY 1019
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1020 DCENAR 1025
>gi|296480218|tpg|DAA22333.1| TPA: hCG23889-like [Bos taurus]
Length = 1213
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 965 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1021
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1022 DCENAR 1027
>gi|359072874|ref|XP_002693153.2| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1215
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1023
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1024 DCENAR 1029
>gi|440906647|gb|ELR56880.1| Tripartite motif-containing protein 66, partial [Bos grunniens mutus]
Length = 1230
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 982 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1038
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1039 DCENAR 1044
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 404
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + + C R+ + GG +LC CD C
Sbjct: 405 KDDEEEEDEVAGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 441
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV--DRGEEKLP 881
YH+ CL L E+P G+WLC C C + +QK++ G+ LP
Sbjct: 442 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKILHWSWGDPPLP 490
>gi|358415619|ref|XP_599031.6| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1322
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1023
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1024 DCENAR 1029
>gi|151946233|gb|EDN64464.1| conserved protein [Saccharomyces cerevisiae YJM789]
Length = 684
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHADYDFNQDF 537
>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
harrisii]
Length = 962
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 721 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 778
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 779 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 805
>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
Length = 912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 671 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 728
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 729 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 755
>gi|348553320|ref|XP_003462475.1| PREDICTED: tripartite motif-containing protein 66-like [Cavia
porcellus]
Length = 1392
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E +EY+
Sbjct: 1144 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1201
Query: 772 A-NAR 775
NAR
Sbjct: 1202 CENAR 1206
>gi|338727101|ref|XP_001500723.2| PREDICTED: tripartite motif-containing protein 66 [Equus caballus]
Length = 1211
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E +EY
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWACTLCRSLTQPE--MEY 1019
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1020 DCENAR 1025
>gi|115385623|ref|XP_001209358.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187805|gb|EAU29505.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 888
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 74/186 (39%), Gaps = 43/186 (23%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------- 758
+EN+D C C G LLCCD C ++H C++ P P G W+C C
Sbjct: 455 QENNDYCRECNGSGQLLCCDGCTDSYHFSCLNPPLDPANPPEGDWYCPRCEIKRPSRKIR 514
Query: 759 --MNTFQKEKFV------EYNANARAA--GRIEGVDPFAQMVSRCIRIVQT---PDTEL- 804
+ +E F +Y A R G+ E PF ++ R +R +T D L
Sbjct: 515 DQLERVSQEDFALPERIRDYFAGVRTGEDGKYEDFIPFPKINPRSMRGTRTGRYDDPHLL 574
Query: 805 ------GGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKDH-GMEDLQ----ELP 852
G + C C ++ G R +I CD C +H+ C M +Q + P
Sbjct: 575 RVHDAKGNLIFCHA---CGRTSNGNRPIIQCDYCPCAFHMDCCDPPLAMPPVQRPGSDKP 631
Query: 853 KGKWLC 858
W+C
Sbjct: 632 HHNWMC 637
>gi|13509324|emb|CAC35389.1| KIAA0298 protein [Homo sapiens]
Length = 1214
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 961 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1019
Query: 767 FVEYN 771
+EY+
Sbjct: 1020 -MEYD 1023
>gi|119589026|gb|EAW68620.1| hCG23889, isoform CRA_a [Homo sapiens]
Length = 1214
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 961 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1019
Query: 767 FVEYN 771
+EY+
Sbjct: 1020 -MEYD 1023
>gi|118344202|ref|NP_001071923.1| zinc finger protein [Ciona intestinalis]
gi|92081546|dbj|BAE93320.1| zinc finger protein [Ciona intestinalis]
Length = 1205
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 32/47 (68%), Gaps = 4/47 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYC 758
N+D CG+C +GGDLLCCDSCP+ FH+ C +P I P G W C C
Sbjct: 933 NEDWCGVCNNGGDLLCCDSCPKVFHVHC-HVPEILAAPPGIWQCMLC 978
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 65/149 (43%), Gaps = 28/149 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC----EKEGI---- 434
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + + R + + D + C +C+ G C CD
Sbjct: 435 -QWEAKDELSEGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGGELLC-----------CD 482
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+W+C
Sbjct: 483 TCPSSYHIHCLN----PPLPEIPNGEWIC 507
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 467 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 510
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 801 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 373 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 426
>gi|326669387|ref|XP_003199000.1| PREDICTED: hypothetical protein LOC566660 [Danio rerio]
Length = 1062
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 36/60 (60%), Gaps = 4/60 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C+ GGDLLCCD CP+ FH+ C P P G W C C + Q E VEY+
Sbjct: 827 ENEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLHSFPVGDWICTLCRDVEQPE--VEYD 884
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 746 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDLSKPE--VEYDC 803
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 804 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 830
>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
Length = 878
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 654 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 711
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 712 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 738
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 18/100 (18%)
Query: 689 IYTSNGVSLHELSIKL-----SLERPFSSKE-------NDDLCGICMDGGDLLCCDSCPR 736
+ +S G SL + S L +P S K+ N+D C +C +GG+LLCCD CP+
Sbjct: 675 VQSSGGSSLPDSSTAAKSKPRQLHKPESEKKAEPEEDPNEDWCAVCQNGGELLCCDKCPK 734
Query: 737 AFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYNAN 773
FH+ C +P + PSG W C +C + E +EY+ N
Sbjct: 735 VFHLSC-HIPALNESPSGEWFCSFCRDLLNPE--MEYDCN 771
>gi|425777912|gb|EKV16064.1| hypothetical protein PDIP_37980 [Penicillium digitatum Pd1]
gi|425779981|gb|EKV18004.1| hypothetical protein PDIG_11750 [Penicillium digitatum PHI26]
Length = 988
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 71/199 (35%), Gaps = 53/199 (26%)
Query: 705 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-- 758
S+E P EN D C C GGDLLCCD+C ++H +C+ P P G WHC C
Sbjct: 537 SIEPP----ENSDDCHQCGKGGDLLCCDTCINSYHFECLDPPLDPKNPPQGEWHCPKCSI 592
Query: 759 MNTF-------QKEKFVEYNANARAAGRIEGVDP----------------FAQMVSRCIR 795
N+F K E+ +GVD + + R
Sbjct: 593 RNSFSTLIAHSNHYKKTEFQLPQDIKEHFQGVDEGIVFDDDYARNLKHQRYYKAAPHLPR 652
Query: 796 IVQTPDTE----------------LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 839
+ + P + G C+ C F G R +I CD C +H+ C
Sbjct: 653 LTKAPKQDGPTIYANPMLLREYDNKGDCIRCSKCGFTSQ--GTRPIITCDYCPCRFHLDC 710
Query: 840 LKDHGMEDLQELPKGKWLC 858
L PK W+C
Sbjct: 711 LDPPRAHPPN--PKVGWMC 727
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 62/150 (41%), Gaps = 33/150 (22%)
Query: 711 SSKENDDL--CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFV 768
S E+D C +C DGGDL+ CD+CP++FH C++L IP G W C C E
Sbjct: 450 SGNESDHFWYCEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTGEGLPEDGD 509
Query: 769 EYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILC 828
N+ A+ E Q C R GG VILC
Sbjct: 510 SSNS-AQEEEEGEEETEHDQFCKVCKR---------GG-----------------DVILC 542
Query: 829 DQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
D C +H+ CL L E+P+G W C
Sbjct: 543 DFCSCVFHLRCLN----PPLGEVPEGDWKC 568
>gi|365989444|ref|XP_003671552.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
gi|343770325|emb|CCD26309.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
Length = 866
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
+ENDD C C+ G LCCD+CPR+FH C+ P +P G W C +C+
Sbjct: 381 QENDDYCSACLQAGSFLCCDTCPRSFHFLCLDPPLDPNNLPEGDWSCHHCL 431
>gi|209977097|ref|NP_055633.1| tripartite motif-containing protein 66 [Homo sapiens]
gi|269849740|sp|O15016.4|TRI66_HUMAN RecName: Full=Tripartite motif-containing protein 66
Length = 1216
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 963 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1021
Query: 767 FVEYN 771
+EY+
Sbjct: 1022 -MEYD 1025
>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
[Monodelphis domestica]
Length = 1143
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 959
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 960 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 986
>gi|432103232|gb|ELK30472.1| Tripartite motif-containing protein 66 [Myotis davidii]
Length = 1324
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 765
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 1096 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE 1154
Query: 766 KFVEYNA-NAR 775
+EY+ NAR
Sbjct: 1155 --MEYDCENAR 1163
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 119 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 176
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 177 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 203
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 60/148 (40%), Gaps = 46/148 (31%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY 770
+E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C YC +E+
Sbjct: 372 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQAEGNQEQ---- 427
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
E D + C + GG +LC CD
Sbjct: 428 ----------EDDDEHQEFCRIC---------KDGGELLC-----------------CDS 451
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH CL L+E+P G+W C
Sbjct: 452 CPSAYHRFCLN----PPLEEVPDGEWKC 475
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ + C IC DGG+LLCCDSCP A+H C++ P +P G W C C
Sbjct: 432 EHQEFCRICKDGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRC 478
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 706 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 763
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 764 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 790
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 6/74 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCCD CP+ FH+ C +P + PSG W C +C + E +EY+
Sbjct: 709 NEDWCAVCQNGGELLCCDKCPKVFHLAC-HIPTLNESPSGEWFCSFCRDLVSPE--MEYD 765
Query: 772 ANARAAGRIEGVDP 785
+++ A E P
Sbjct: 766 CDSKDAPISEKFPP 779
>gi|291384596|ref|XP_002708841.1| PREDICTED: tripartite motif-containing 66 [Oryctolagus cuniculus]
Length = 1211
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 958 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLSQPE- 1016
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1017 -MEYDCENAR 1025
>gi|410919013|ref|XP_003972979.1| PREDICTED: transcription intermediary factor 1-alpha-like [Takifugu
rubripes]
Length = 1006
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 6/74 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + E +EYN
Sbjct: 690 NEDWCAVCQNGGELLCCDKCPKVFHLTCHIPTLIASPSGEWFCSFCRDLVSPE--MEYNC 747
Query: 773 NARAAGRIEGVDPF 786
A EG P
Sbjct: 748 KEGLAS--EGFPPI 759
>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
[Monodelphis domestica]
Length = 1126
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 959
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 960 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 986
>gi|401838195|gb|EJT41934.1| RCO1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 703
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 80/216 (37%), Gaps = 78/216 (36%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C +
Sbjct: 277 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPVDPNHLPEGDWHCNECKFKIFINNSLT 336
Query: 761 TFQKEK--FVEYNAN----ARAAGRIEGVDP----FAQMVSRCIRIVQT----------- 799
T +K + F++ N N A+ RIE ++P + V+T
Sbjct: 337 TLKKNESNFIKQNDNVKMFAKLLFRIESLNPRQFQLPNYIKETFPAVKTGSRGQYSDEND 396
Query: 800 -------------------------PDTEL----GGCVLCRGRDFC-KSRFG-------R 822
PD + G ++C C ++R G
Sbjct: 397 KIPLTDRQLFNTSYGQSITKLDSYNPDIHIDSSSGEFLICYK---CHQTRLGSWSHPENS 453
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
R ++ CD C+ +H+ C+ ++L KW C
Sbjct: 454 RLIMTCDYCQTPWHLDCIPRASFKNL----GSKWKC 485
>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
[Ornithorhynchus anatinus]
Length = 1050
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 809 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 866
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 867 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 893
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 710 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 767
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 768 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 794
>gi|367000541|ref|XP_003685006.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
gi|357523303|emb|CCE62572.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
Length = 669
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 78/219 (35%), Gaps = 80/219 (36%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM---------- 759
+NDD C C G LCCD+CPR+FH C++ P +P G W C C+
Sbjct: 263 DNDDFCSTCQQSGSFLCCDTCPRSFHFLCLNPPLDPDNLPEGDWSCPQCVFKAKNPTIAA 322
Query: 760 -------------------------------NTFQ-----KEKFVEYNANARAAGRIEGV 783
FQ K+ F ANAR + E
Sbjct: 323 YKKSEREFIAELPNKNKLFGKLLFNIQSQNSKQFQLPNTIKDTFENVKANARGQYQDEKE 382
Query: 784 -----------DPFAQMVSRCIRIVQTP----DTELGGCVLCRGRDFC-KSRFG------ 821
P+ Q +++ TP D E G ++C C K++ G
Sbjct: 383 KTPLTERQIFGTPYGQSITKLDGY--TPELHIDPESGEFLICYH---CQKTKMGTWENYE 437
Query: 822 --RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
R ++ CD C+ +H+ CL D L+ L KW C
Sbjct: 438 TESRLIMRCDYCQTPWHLNCLPDVPRASLKNL-GNKWKC 475
>gi|344280903|ref|XP_003412221.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Loxodonta africana]
Length = 1343
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 1090 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1148
Query: 767 FVEYNA-NAR 775
+EY+ NAR
Sbjct: 1149 -MEYDCENAR 1157
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 739 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 796
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 797 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 823
>gi|350588054|ref|XP_003129432.3| PREDICTED: tripartite motif-containing protein 66 [Sus scrofa]
Length = 1250
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 765
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 993 PPAPVENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE 1051
Query: 766 KFVEYNA-NAR 775
+EY+ NAR
Sbjct: 1052 --MEYDCENAR 1060
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 758 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 815
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 816 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 842
>gi|6323720|ref|NP_013791.1| Rco1p [Saccharomyces cerevisiae S288c]
gi|2497139|sp|Q04779.1|RCO1_YEAST RecName: Full=Transcriptional regulatory protein RCO1
gi|763022|emb|CAA88800.1| unknown [Saccharomyces cerevisiae]
gi|190408306|gb|EDV11571.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|259148648|emb|CAY81893.1| Rco1p [Saccharomyces cerevisiae EC1118]
gi|285814078|tpg|DAA09973.1| TPA: Rco1p [Saccharomyces cerevisiae S288c]
gi|365763801|gb|EHN05327.1| Rco1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 684
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|344249575|gb|EGW05679.1| Tripartite motif-containing protein 66 [Cricetulus griseus]
Length = 1297
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 937 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQPE--MEYD 994
Query: 772 A-NAR 775
NAR
Sbjct: 995 CENAR 999
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 767 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 824
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 825 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 851
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 967
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 968 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 994
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 832 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 889
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 890 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 916
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 368 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 425
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 426 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 452
>gi|392297235|gb|EIW08335.1| Rco1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 684
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 669 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 726
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 727 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 753
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 59/156 (37%), Gaps = 51/156 (32%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C N
Sbjct: 374 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 422
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D Q R + GG +LC CD C
Sbjct: 423 -----EGPAEQDDDEHQEFCRICKD--------GGELLC-----------------CDSC 452
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC----CADCK 863
YH CL L E+P G W C CA K
Sbjct: 453 PSAYHTHCLN----PPLVEIPDGDWKCPRCSCAPLK 484
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 903 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 960
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 961 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 987
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 890 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 947
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 948 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 974
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 901 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 958
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 959 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 985
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 59/156 (37%), Gaps = 51/156 (32%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C N
Sbjct: 371 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 419
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D Q R + GG +LC CD C
Sbjct: 420 -----EGPAEQDDDEHQEFCRICKD--------GGELLC-----------------CDSC 449
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC----CADCK 863
YH CL L E+P G W C CA K
Sbjct: 450 PSAYHTHCLN----PPLVEIPDGDWKCPRCSCAPLK 481
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 956
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 575
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 576 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 602
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 970 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1027
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 1028 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1054
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 967
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 968 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 994
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 815 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 872
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 873 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 899
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 771 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 828
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 829 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 855
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 885 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 942
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 943 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 969
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 575
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 576 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 602
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 847 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 904
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 905 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 931
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 956
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 848
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 849 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 875
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 860
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 861 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 887
>gi|426245702|ref|XP_004016644.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66 [Ovis aries]
Length = 1286
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 770
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E +EY
Sbjct: 1038 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--MEY 1094
Query: 771 NA-NAR 775
+ NAR
Sbjct: 1095 DCENAR 1100
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 860
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 861 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 887
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 75/169 (44%), Gaps = 29/169 (17%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 272 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 326
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD-----FCKSRFGRRTVI 826
A R++ + LG + D FC+ ++
Sbjct: 327 AKDEEFE--------DFEEDSEDRVIS--EVSLGVPMGAEEEDDDHMEFCRVCKDGGELL 376
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
CD C YH+ CL L E+P G+WLC C C I +Q+++
Sbjct: 377 CCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQRIL 420
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%), Gaps = 6/61 (9%)
Query: 704 LSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRY 757
+SL P ++E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C
Sbjct: 347 VSLGVPMGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPR 406
Query: 758 C 758
C
Sbjct: 407 C 407
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
AR+AG E + F + + P E G +D+C+ +ILCD C R
Sbjct: 236 ARSAGE-EKEENFKRSCGPGFDLAPVPGDEEGDGYETDHQDYCEVCQQGGEIILCDTCPR 294
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL+ +L + P+GKW C
Sbjct: 295 AYHLVCLE----PELDKAPEGKWSC 315
>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
[Desmodus rotundus]
Length = 951
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 726 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 783
Query: 773 NA 774
+A
Sbjct: 784 DA 785
>gi|354495678|ref|XP_003509956.1| PREDICTED: tripartite motif-containing protein 66-like [Cricetulus
griseus]
Length = 1384
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1136 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQPE--MEYD 1193
Query: 772 A-NAR 775
NAR
Sbjct: 1194 CENAR 1198
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 962 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1019
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 1020 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1046
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 800 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 857
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 858 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 884
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 885 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 942
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 943 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 969
>gi|327264985|ref|XP_003217289.1| PREDICTED: tripartite motif-containing protein 66-like [Anolis
carolinensis]
Length = 1265
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C N + E VEY+
Sbjct: 908 ENEDFCAVCLNGGELLCCDYCPKVFHLLCHVPALLSFPVGEWVCTLCSNVEKPE--VEYD 965
Query: 772 A-NAR 775
NAR
Sbjct: 966 CENAR 970
>gi|156848686|ref|XP_001647224.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
gi|156117909|gb|EDO19366.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
Length = 682
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 84/223 (37%), Gaps = 70/223 (31%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM-------NT- 761
+NDD C C+ G LCCD+CPR+FH C++ P +P G W C +C+ NT
Sbjct: 272 DNDDFCSTCLQTGSFLCCDTCPRSFHFLCLNPPLDPDQLPEGDWSCPHCIFRMKYPTNTA 331
Query: 762 FQK--EKFV-EYNANARAAGR----IEGVDPFAQMVSRCIRIV----------------- 797
F+K ++FV + R G+ +E +P V + +R
Sbjct: 332 FKKAEKEFVNDLPQQNRLFGKLLFQLESTNPIQYRVPKTVREAFQNVKTGARGQYQDDRE 391
Query: 798 -----------------------QTPDTELGG-------CVLC---RGRDFCKSRFGRRT 824
TPDT + C C + + + +
Sbjct: 392 KEPLTEKLIFGTPYGQSITKLDCYTPDTHINTEDNSFLLCFRCGTTKMGTWDNYKTDSKL 451
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 867
++ CD C +H+ C+ + ++ L KW+C + NL
Sbjct: 452 IMKCDYCNTPWHLDCIPEVPRASMKNL-GTKWMCPLHADKANL 493
>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
Length = 1138
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 897 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDIGKPE--VEYDC 954
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 955 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 981
>gi|431919612|gb|ELK18000.1| Tripartite motif-containing protein 66 [Pteropus alecto]
Length = 450
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 765
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 197 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCCSLTQPE 255
Query: 766 KFVEYNA-NAR 775
+EY+ NAR
Sbjct: 256 --MEYDCENAR 264
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 736 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 793
Query: 773 NA 774
+A
Sbjct: 794 DA 795
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 45/166 (27%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C+ + F E
Sbjct: 151 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHCVRSSVNIAFEE-- 208
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
G +E Q D + C +C+ G C CD
Sbjct: 209 -----NGPVE----------------QDDDEHMEFCRVCKDGGELLC-----------CD 236
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YHV CL ++ +P G W C C+ C+ + +QK++
Sbjct: 237 TCTSAYHVFCL----APNVANVPDGIWHCPRCS-CQPLKARVQKIL 277
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 39/64 (60%), Gaps = 6/64 (9%)
Query: 701 SIKLSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWH 754
S+ ++ E +++DD C +C DGG+LLCCD+C A+H+ C++ + +P G WH
Sbjct: 201 SVNIAFEENGPVEQDDDEHMEFCRVCKDGGELLCCDTCTSAYHVFCLAPNVANVPDGIWH 260
Query: 755 CRYC 758
C C
Sbjct: 261 CPRC 264
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 736 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 793
Query: 773 NA 774
+A
Sbjct: 794 DA 795
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 65/153 (42%), Gaps = 36/153 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 767
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + ++
Sbjct: 302 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARDDL 361
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTV 825
+ R EGV+ + D + C +C+ G C
Sbjct: 362 SDGEGEDEEDRRDEGVE-------------EEDDHHIEFCRVCKDGGELLC--------- 399
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C YH+ CL L E+P G+W+C
Sbjct: 400 --CDTCPSSYHIHCLN----PPLPEIPNGEWIC 426
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 386 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 429
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 801 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 292 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 345
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 753 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 810
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 811 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 837
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 730 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 787
Query: 773 NA 774
+A
Sbjct: 788 DA 789
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 793 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 850
Query: 773 NA 774
+A
Sbjct: 851 DA 852
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 717 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 774
Query: 773 NA 774
+A
Sbjct: 775 DA 776
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 704 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 761
Query: 773 NA 774
+A
Sbjct: 762 DA 763
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 719 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 776
Query: 773 NA 774
+A
Sbjct: 777 DA 778
>gi|301627494|ref|XP_002942909.1| PREDICTED: tripartite motif-containing protein 66-like [Xenopus
(Silurana) tropicalis]
Length = 1207
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG++LCCD CP+ FH+ C +L P G W C C N + E VEY+
Sbjct: 964 ENEDFCAVCLNGGEMLCCDRCPKVFHLSCHVPALLSFPVGEWLCTLCRNLTKPE--VEYD 1021
Query: 772 AN 773
+
Sbjct: 1022 CD 1023
>gi|207342360|gb|EDZ70146.1| YMR075Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 671
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 790 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 847
Query: 773 NA 774
+A
Sbjct: 848 DA 849
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 824 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 881
Query: 773 NA 774
+A
Sbjct: 882 DA 883
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 187 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 244
Query: 773 NA 774
+A
Sbjct: 245 DA 246
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 708 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 765
Query: 773 NA 774
+A
Sbjct: 766 DA 767
>gi|256273506|gb|EEU08440.1| Rco1p [Saccharomyces cerevisiae JAY291]
Length = 684
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDCDFNQDF 537
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 NA 774
+A
Sbjct: 849 DA 850
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CDSCPRA+H+ C+ L P G W C +C + + E
Sbjct: 436 DHQDYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHCEKEGVQWEPKEEE 495
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + C R+ + GG +LC CD C
Sbjct: 496 DEYEGEMDDAEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 532
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 533 ISSYHIHCLN----PPLPEIPNGEWLCPRCT-CPMLKGRVQKIL 571
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 64/147 (43%), Gaps = 26/147 (17%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 257 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 316
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G DP + D + C +C+ ++ CD C
Sbjct: 317 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGG---------ELLCCDPC 356
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 357 PSSYHIHCLN----PPLPEIPNGEWLC 379
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD CP ++HI C++ LP IP+G W C +C
Sbjct: 339 EFCRVCKDGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 2 SKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 56
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 NA 774
+A
Sbjct: 849 DA 850
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 775 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 832
Query: 773 NA 774
+A
Sbjct: 833 DA 834
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 441 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 498
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 499 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 525
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 NA 774
+A
Sbjct: 849 DA 850
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 820 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 877
Query: 773 NA 774
+A
Sbjct: 878 DA 879
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 NA 774
+A
Sbjct: 849 DA 850
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 1017 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1074
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 1075 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1101
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 996 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 1053
Query: 773 NA 774
+A
Sbjct: 1054 DA 1055
>gi|444522352|gb|ELV13370.1| Tripartite motif-containing protein 66 [Tupaia chinensis]
Length = 1183
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 943 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1000
Query: 772 A-NAR 775
NAR
Sbjct: 1001 CENAR 1005
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 34/147 (23%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 273 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC-----EKEGIQWE 327
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG + + C R+ + GG +LC CD C
Sbjct: 328 PKEEDEEEEEGGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 364
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 365 PSSYHLHCLN----PPLPEIPNGEWLC 387
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 347 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
Length = 1615
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 1392 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 1449
Query: 773 NA 774
+A
Sbjct: 1450 DA 1451
>gi|13992535|emb|CAC38114.1| KIAA0298 protein [Mus musculus]
gi|148684989|gb|EDL16936.1| tripartite motif-containing 66, isoform CRA_b [Mus musculus]
Length = 1209
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 961 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1018
Query: 772 A-NAR 775
NAR
Sbjct: 1019 CENAR 1023
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 893 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 950
Query: 773 NA 774
+A
Sbjct: 951 DA 952
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 785 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 842
Query: 773 NA 774
+A
Sbjct: 843 DA 844
>gi|344299817|gb|EGW30170.1| hypothetical protein SPAPADRAFT_144654 [Spathaspora passalidarum
NRRL Y-27907]
Length = 604
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 82/217 (37%), Gaps = 62/217 (28%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 769
+K+NDD C C + G +CC++CP++FH C P P W CR C+ EK
Sbjct: 191 TKDNDDFCSSCGNPGIFICCENCPKSFHFTCCDPPIEQPPEDDWFCRECIAKLHPEKIKT 250
Query: 770 YN---ANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDT------- 802
YN + ++E +P + + +R PD
Sbjct: 251 YNDIGVFGQLLNQLEVKNPKVFQLPKHLREDTFIGVTTGDNGDYSDDTTKPDVPSSKNSQ 310
Query: 803 ----------ELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKD--------- 842
+ G LC C +S RT++ CD C YH+ CL D
Sbjct: 311 DVDIESSLYDKAGNPYLCHK---CGESGMNNRTLMHCDYCPLIYHIDCLNDPICGPKTIG 367
Query: 843 ------HGMEDLQELPKG--KWLCCADCKRINLALQK 871
+ +EDL LP G K DC+ ++ +L K
Sbjct: 368 NKWRCPNHIEDL--LPPGYAKLRQFKDCQVVDSSLHK 402
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 250 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 307
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 308 DNLQHSKKGKTAQGLSPVDQR--KCERLL 334
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 50/91 (54%), Gaps = 12/91 (13%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 819 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSRPE--VEYDC 876
Query: 772 -----ANARAAGRIEGVDPFAQMVSRCIRIV 797
A R G + + P Q +C RI+
Sbjct: 877 DDPSLAEKRKLGGAQSMAPIDQ--RKCERIL 905
>gi|149068363|gb|EDM17915.1| rCG40452, isoform CRA_b [Rattus norvegicus]
Length = 1210
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 962 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1019
Query: 772 A-NAR 775
NAR
Sbjct: 1020 CENAR 1024
>gi|410913151|ref|XP_003970052.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1057
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 771
E++D C +C+ GGDLLCCD CP+ FH+ C P PSG W C C ++ E VEY+
Sbjct: 824 ESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDSVLPE--VEYD 881
Query: 772 A-NARAAG 778
N R +G
Sbjct: 882 CENERTSG 889
>gi|45861758|gb|AAS78677.1| transcriptional intermediary factor 1 delta short isoform [Mus
musculus]
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 772 A-NAR 775
NAR
Sbjct: 1052 CENAR 1056
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 700 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 757
Query: 773 NA 774
+A
Sbjct: 758 DA 759
>gi|348579257|ref|XP_003475397.1| PREDICTED: transcription intermediary factor 1-alpha-like [Cavia
porcellus]
Length = 1165
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 940 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 997
Query: 773 NA 774
+A
Sbjct: 998 DA 999
>gi|154290947|ref|XP_001546062.1| hypothetical protein BC1G_15537 [Botryotinia fuckeliana B05.10]
Length = 1238
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 70/185 (37%), Gaps = 42/185 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGT---WHCRYCM-NTFQKEKFV 768
ENDD C C G LLCCD C R+FH CV L P P+ T W C C+ + + K
Sbjct: 845 ENDDYCAACGGNGKLLCCDGCTRSFHFKCVDLSPDSPTLTTEEWFCNECLAQSVPRSKDE 904
Query: 769 EYNANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDTEL------- 804
E +E +P A + + IR +VQ P
Sbjct: 905 EIGYFGPLLANMERKNPSAFHLPKAIREYFENVKTGTEGEYEEGVVQKPKNNRGYDEAPD 964
Query: 805 --------GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK- 855
G +LC G R +I C C +H+ CL D + +E P G+
Sbjct: 965 YFKLKDSKGNPILCHGCHLSAVN-PSRAIIPCTYCSLSWHLDCL-DPPL--AKEPPPGRS 1020
Query: 856 WLCCA 860
W C A
Sbjct: 1021 WRCPA 1025
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 36/164 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 502 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 557
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + + C R+ + GG +LC CD C
Sbjct: 558 KDEEEDEEEPVGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 594
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 595 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 633
>gi|284004879|ref|NP_862901.3| tripartite motif-containing protein 66 isoform 2 [Mus musculus]
gi|342187123|sp|Q924W6.3|TRI66_MOUSE RecName: Full=Tripartite motif-containing protein 66; AltName:
Full=Transcriptional intermediary factor 1 delta
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 772 A-NAR 775
NAR
Sbjct: 1052 CENAR 1056
>gi|187950891|gb|AAI38033.1| Tripartite motif-containing 66 [Mus musculus]
gi|219519295|gb|AAI45107.1| Tripartite motif-containing 66 [Mus musculus]
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 772 A-NAR 775
NAR
Sbjct: 1052 CENAR 1056
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 36/164 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC----EKEGIQWEA 382
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ + C R+ + GG +LC CD C
Sbjct: 383 KGEEEEEEEAAGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 419
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 420 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 458
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 67.0 bits (162), Expect = 5e-08, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 6 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 63
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 64 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 90
>gi|323353060|gb|EGA85360.1| Rco1p [Saccharomyces cerevisiae VL3]
Length = 541
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 760
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 115 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 174
Query: 761 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 799
T +K F++ N N + + I+ +P + V+T
Sbjct: 175 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 234
Query: 800 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 823
PDT + G ++C + ++R G R
Sbjct: 235 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 292
Query: 824 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 883
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 293 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 326
Query: 884 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 942
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 327 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 373
Query: 943 ASKLDLIPAMVYGRSHRGQDY 963
+ +IP + QD+
Sbjct: 374 NGNIQIIPTTSHTDYDFNQDF 394
>gi|347840107|emb|CCD54679.1| hypothetical protein [Botryotinia fuckeliana]
Length = 1267
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 70/185 (37%), Gaps = 42/185 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGT---WHCRYCM-NTFQKEKFV 768
ENDD C C G LLCCD C R+FH CV L P P+ T W C C+ + + K
Sbjct: 874 ENDDYCAACGGNGKLLCCDGCTRSFHFKCVDLSPDSPTLTTEEWFCNECLAQSVPRSKDE 933
Query: 769 EYNANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDTEL------- 804
E +E +P A + + IR +VQ P
Sbjct: 934 EIGYFGPLLANMERKNPSAFHLPKAIREYFENVKTGTEGEYEEGVVQKPKNNRGYDEAPD 993
Query: 805 --------GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK- 855
G +LC G R +I C C +H+ CL D + +E P G+
Sbjct: 994 YFKLKDSKGNPILCHGCHLSAVNPS-RAIIPCTYCSLSWHLDCL-DPPL--AKEPPPGRS 1049
Query: 856 WLCCA 860
W C A
Sbjct: 1050 WRCPA 1054
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 150 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 207
Query: 773 NA 774
+A
Sbjct: 208 DA 209
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 787 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 844
Query: 773 NA 774
+A
Sbjct: 845 DA 846
>gi|74184798|dbj|BAE27995.1| unnamed protein product [Mus musculus]
Length = 1233
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 772 A-NAR 775
NAR
Sbjct: 1052 CENAR 1056
>gi|284004883|ref|NP_001164384.1| tripartite motif-containing protein 66 isoform 3 [Mus musculus]
Length = 1233
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 772 A-NAR 775
NAR
Sbjct: 1052 CENAR 1056
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 72/183 (39%), Gaps = 42/183 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC----VSLPGIPSGTWHCRYCMNTFQKEKFV- 768
N D C C DGG+L+CCD CP +FH+ C + L IP+G W C C +KE +
Sbjct: 56 HNRDSCDACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACRCAMKKENSIG 115
Query: 769 ----EYNANARAAGRIEGVDPFAQMVSRCIRI-VQTPDTE-LGGCVLCRGRDF------- 815
+ NA A V+P + R ++I + P T+ + RG+
Sbjct: 116 NKRKKKNALEVLALAASLVNPKEFELPRELQIPITFPGTDKIDPVSFKRGKHHNSNNING 175
Query: 816 --------------------CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
C+ + +I CD C +H CL L P G+
Sbjct: 176 KIHYHEYGSITPLPARLCFVCRKSCRKAPLIACDYCPLYFHQDCLD----PPLTAFPSGR 231
Query: 856 WLC 858
W+C
Sbjct: 232 WMC 234
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 8/76 (10%)
Query: 718 LCGICMDG---GDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
LC +C L+ CD CP FH DC+ P PSG W C +N F + +
Sbjct: 192 LCFVCRKSCRKAPLIACDYCPLYFHQDCLDPPLTAFPSGRWMCPNHLNHFIDQNLL---T 248
Query: 773 NARAAGRIEGVDPFAQ 788
+ A R++ D FA
Sbjct: 249 SCAATERVKLWDKFAH 264
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 28/149 (18%)
Query: 925 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFR 984
L AV I H+ F IIE ++ D+ +V+ R
Sbjct: 771 LCIAVDILHECFVTIIEPRTQSDISEDIVFNR---------------------------E 803
Query: 985 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1044
I GQ+ AELPL+ T + + QG + L +EKLL L V+ L+LP+ E WT FG
Sbjct: 804 ICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFG 863
Query: 1045 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
F++M+ E+ + + ++ FQGT+M QK
Sbjct: 864 FTVMSNSERLELAGN-SILSFQGTTMCQK 891
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 73 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 130
Query: 773 NA 774
+A
Sbjct: 131 DA 132
>gi|45861756|gb|AAS78676.1| transcriptional intermediary factor 1 delta [Mus musculus]
Length = 1344
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 772 A-NAR 775
NAR
Sbjct: 1154 CENAR 1158
>gi|284004881|ref|NP_001164383.1| tripartite motif-containing protein 66 isoform 1 [Mus musculus]
Length = 1344
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 772 A-NAR 775
NAR
Sbjct: 1154 CENAR 1158
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 57/150 (38%), Gaps = 51/150 (34%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY 770
+E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C
Sbjct: 348 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWSCPHC------------ 395
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILC 828
A G E Q D C LC+ G C C
Sbjct: 396 ----EAEGTQE----------------QDDDEHNEFCRLCKDGGELLC-----------C 424
Query: 829 DQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
D C YH+ CL L E+P G W C
Sbjct: 425 DSCTSAYHIFCLN----PPLSEIPDGDWKC 450
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+++ C +C DGG+LLCCDSC A+HI C++ P IP G W C C
Sbjct: 407 EHNEFCRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 893 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 950
Query: 773 N 773
+
Sbjct: 951 D 951
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 33/147 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE +++
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEG-IQWE 371
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG + R+ + GG +LC CD C
Sbjct: 372 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 409
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 410 PSSYHLHCLN----PPLPEIPNGEWLC 432
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|449269479|gb|EMC80242.1| Transcription intermediary factor 1-alpha, partial [Columba livia]
Length = 914
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 698 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLRSFPSGEWICTFCRDLSKPE--VEYDC 755
Query: 773 NARA 776
+ A
Sbjct: 756 DKPA 759
>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
Length = 1016
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 NA 774
+A
Sbjct: 849 DA 850
>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
Length = 1050
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 NA 774
+A
Sbjct: 883 DA 884
>gi|300797500|ref|NP_001178768.1| tripartite motif-containing protein 66 [Rattus norvegicus]
Length = 1348
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1100 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1157
Query: 772 A-NAR 775
NAR
Sbjct: 1158 CENAR 1162
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 984 RIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKF 1043
RI G +LAE+P + T + QG L IE L LNV+ LV+P+ E Q WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 1044 GFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1073
GF + ++ K ++ ++I GT +L+K
Sbjct: 61 GFKPVEPSKRQKIKS-LNILIIHGTGLLEK 89
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 60/158 (37%), Gaps = 43/158 (27%)
Query: 710 FSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 760
F S+E D D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 402 FPSEEGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHC-- 459
Query: 761 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 820
+ E +A D ++ C + GG +LC
Sbjct: 460 --EGEGITAATVTEKAGRNAADDDEHSEFCRIC---------KDGGELLC---------- 498
Query: 821 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C YH CL L E+P G W C
Sbjct: 499 -------CDSCTSAYHTFCLN----PPLSEIPDGDWKC 525
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ + C IC DGG+LLCCDSC A+H C++ P IP G W C C
Sbjct: 482 EHSEFCRICKDGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEE 376
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D + C + GG +LC CD C
Sbjct: 377 EEEEEGGCEEEEDDHMEFCRVC---------KDGGELLC-----------------CDAC 410
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 411 PSSYHLHCLN----PPLPEIPNGEWLC 433
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 393 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 436
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 64/147 (43%), Gaps = 33/147 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 332 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC-----EKEGIQWE 386
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG + R+ + GG +LC CD C
Sbjct: 387 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 424
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 425 PSSYHLHCLN----PPLPEIPNGEWLC 447
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|189238110|ref|XP_001814014.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270008739|gb|EFA05187.1| hypothetical protein TcasGA2_TC015317 [Tribolium castaneum]
Length = 706
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 77/209 (36%), Gaps = 62/209 (29%)
Query: 705 SLERPFSSK----ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCR 756
+ E P+ + N D C C +GGDL+CCD CP +FH+ C P IPSG W C
Sbjct: 38 ATEHPYFKRPGKGHNHDSCDACGEGGDLICCDKCPSSFHLQCHDPPLEEKDIPSGEWLCH 97
Query: 757 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT-----------PDTELG 805
C + K+ N R+A G P VS ++Q P +
Sbjct: 98 SC--KYAKKSNAPQTRNKRSASTPAGKAPKKAKVSPMDLLIQAASSMNPKQFDLPRSMSV 155
Query: 806 GCVLCRGRDFCK---SRFGRRT---------------------------------VILCD 829
C+ G D + SR GR++ +I CD
Sbjct: 156 PCIF-PGTDKVEAPYSRNGRKSMKVHKDHEKKQDATIFLPAKKCHECRKSCRVAPLISCD 214
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C +H+ CL L P G+W+C
Sbjct: 215 FCSLYFHMDCLD----PPLTTPPSGRWMC 239
>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
Length = 607
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 382 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 439
Query: 773 NARA 776
+A +
Sbjct: 440 DAPS 443
>gi|449504115|ref|XP_004174568.1| PREDICTED: tripartite motif-containing protein 66 [Taeniopygia
guttata]
Length = 958
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 7/82 (8%)
Query: 692 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 749
SNG++ + K + + S EN+D C +C++ G+LLCCD CP+ FH+ C +L P
Sbjct: 692 SNGIATMK---KSPVTQEVSPIENEDFCAVCLNVGELLCCDHCPKVFHLSCHVPALLSFP 748
Query: 750 SGTWHCRYCMNTFQKEKFVEYN 771
G W C C N + E VEY+
Sbjct: 749 VGEWVCSLCRNPVKPE--VEYD 768
>gi|301093217|ref|XP_002997457.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110713|gb|EEY68765.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 248
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 42/61 (68%), Gaps = 3/61 (4%)
Query: 703 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMN 760
+++ +RP ++ ++ + CGIC+ GG LLCCD C RA+H++CV +L +P G W C +C +
Sbjct: 175 RIAQQRP-AADQHLEFCGICLKGGKLLCCDGCERAYHLNCVRPALLDVPEGDWFCSHCRD 233
Query: 761 T 761
Sbjct: 234 A 234
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 77/179 (43%), Gaps = 36/179 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE +++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC----EKEG-IQWE 398
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG + R+ + GG +LC CD C
Sbjct: 399 PKDDDEEDEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 436
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVI 888
YH+ CL L E+P G+WLC C C + +Q+++ + P + V+
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQRILHWKWTEPPAPFITVL 490
>gi|238502471|ref|XP_002382469.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
gi|220691279|gb|EED47627.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
Length = 875
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 762
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 468 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 527
Query: 763 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 798
+ Y A + G+ + V PF ++ R R +++
Sbjct: 528 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 587
Query: 799 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 588 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEE 424
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D + C + GG +LC CD C
Sbjct: 425 EEEEEGGCEEEEDDHMEFCRVC---------KDGGELLC-----------------CDAC 458
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 459 PSSYHLHCLN----PPLPEIPNGEWLC 481
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 441 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 484
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 713 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+E DD C +C DGG+LLCCD+CP A+H+ C++ P IP G WHC C
Sbjct: 377 QEEDDHMEFCRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRC 427
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 56/149 (37%), Gaps = 51/149 (34%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C L P G W C +C KE
Sbjct: 323 EHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHCEGEGIKE------ 376
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
Q D + C +C+ G C CD
Sbjct: 377 --------------------------QEEDDHMEFCRVCKDGGELLC-----------CD 399
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YHV CL ++ +P G+W C
Sbjct: 400 TCPSAYHVHCLN----PPMKMIPDGEWHC 424
>gi|317147911|ref|XP_001822376.2| PHD finger domain protein [Aspergillus oryzae RIB40]
Length = 857
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 762
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 468 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 527
Query: 763 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 798
+ Y A + G+ + V PF ++ R R +++
Sbjct: 528 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 587
Query: 799 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 588 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
garnettii]
Length = 1014
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 789 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 846
Query: 773 NARA 776
+ +
Sbjct: 847 DTSS 850
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 59/152 (38%), Gaps = 47/152 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTF--QKEKFV 768
++ D C +C GG+++ CD+CPRA+H+ C+ P + P G W C +C N +
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLD-PDMEEPPGGKWSCPHCENDLVNDNDAVT 395
Query: 769 EYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVI 826
A AG +E C LCR G C
Sbjct: 396 SKEAAPAKAGNME------------------------FCRLCRDGGELLC---------- 421
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C YH CL + L +P+G W C
Sbjct: 422 -CDSCPSSYHRYCL----IPPLTTIPEGDWHC 448
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
N + C +C DGG+LLCCDSCP ++H C+ P IP G WHC C
Sbjct: 406 NMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|83771111|dbj|BAE61243.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 730
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 762
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 341 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 400
Query: 763 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 798
+ Y A + G+ + V PF ++ R R +++
Sbjct: 401 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 460
Query: 799 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 461 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|47230454|emb|CAF99647.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2153
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 39/68 (57%), Gaps = 10/68 (14%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 771
E++D C +C+ GGDLLCCD CP+ FH+ C P PSG W C C + VEY+
Sbjct: 1877 ESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLC-------REVEYD 1929
Query: 772 A-NARAAG 778
N R +G
Sbjct: 1930 CENERTSG 1937
>gi|391871058|gb|EIT80224.1| PHD Zn-finger protein [Aspergillus oryzae 3.042]
Length = 730
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 762
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 341 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 400
Query: 763 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 798
+ Y A + G+ + V PF ++ R R +++
Sbjct: 401 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 460
Query: 799 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 461 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Bos taurus]
Length = 1047
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 822 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 879
Query: 773 N 773
+
Sbjct: 880 D 880
>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
taurus]
gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
Length = 1050
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 773 N 773
+
Sbjct: 883 D 883
>gi|345322044|ref|XP_001508010.2| PREDICTED: transcription intermediary factor 1-alpha
[Ornithorhynchus anatinus]
Length = 800
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 575 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSKPE--VEYD 631
>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
mutus]
Length = 940
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 715 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 772
Query: 773 N 773
+
Sbjct: 773 D 773
>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
taurus]
gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
Length = 1016
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 773 N 773
+
Sbjct: 849 D 849
>gi|363728093|ref|XP_416340.3| PREDICTED: transcription intermediary factor 1-alpha [Gallus
gallus]
Length = 939
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 714 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRDLSKPE--VEYDC 771
Query: 773 N 773
+
Sbjct: 772 D 772
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 62/147 (42%), Gaps = 30/147 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 385
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ Q + + E +FC+ ++ CD C
Sbjct: 386 ---------EGI----QWEPKEDDEDEEDGAEEEEEEEDDHMEFCRVCKDGGELLCCDTC 432
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 415 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 864
+D+C+ +ILCD C R YH+ CL +L++ P+GKW C C++
Sbjct: 339 QDYCEVCQQGGEIILCDTCPRAYHLVCLD----PELEKAPEGKW-SCPHCEK 385
>gi|327272268|ref|XP_003220907.1| PREDICTED: transcription intermediary factor 1-alpha-like [Anolis
carolinensis]
Length = 1038
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + E VEY+
Sbjct: 812 NEDWCAVCQNGGELLCCDRCPKVFHLSCHVPTLRNFPSGEWICTFCRDLCDPE--VEYDC 869
Query: 773 N 773
+
Sbjct: 870 D 870
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 65.5 bits (158), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKD 496
L G++ G G C C C ++V++ FE HAG+ + P +IYLENGK + I+ K
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 497 SPLETLEKAVRMVLGSS------SMKKANFCLNC 524
+PL ++ ++ V GSS + K + C C
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFFRVWKGDICFIC 100
>gi|403215174|emb|CCK69674.1| hypothetical protein KNAG_0C05760 [Kazachstania naganishii CBS
8797]
Length = 731
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 97/246 (39%), Gaps = 79/246 (32%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHC-------RYCMNTF 762
ENDD C C+ G LCCD+CP++FH C+ P +P G W C +Y NT
Sbjct: 308 ENDDFCSACLQSGSFLCCDTCPKSFHFLCLDPPIDPSNLPEGDWSCPSCEFKSQYATNTQ 367
Query: 763 QK---EKFVEYNAN-------ARAAGRIEGVDPFAQMVSRCIR----------------- 795
K ++F++ A+ ++ +++G +P + + ++
Sbjct: 368 YKKGQKQFIQEAADNGTNKLFSKLLFQLQGFNPKQFNLPQSVKESFPEVSTGDRGQYTDD 427
Query: 796 IVQTPDTELG------GCVLCRGRDFC------------------------KSRFGR--- 822
++TP TE G + + +C ++FG
Sbjct: 428 TMRTPLTEKQMFNSPYGQSITKLDSYCPENHYIDDNSDTTPDRFLLCYKCKTTKFGTWDH 487
Query: 823 ----RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 878
R +I CD C+ +H+ C+ D L+ L KW C N+ Q+ +++ ++
Sbjct: 488 PGNARLLIKCDYCQTPWHLDCIPDVPRASLKIL-GYKWRCPL---HANVKQQRRLNKRQQ 543
Query: 879 KLPETS 884
K E S
Sbjct: 544 KYLEPS 549
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 429
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 430 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 470
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 471 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPMLKGRVQKIL 511
>gi|334331581|ref|XP_001379177.2| PREDICTED: tripartite motif-containing protein 66-like [Monodelphis
domestica]
Length = 1162
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + + E VEY+
Sbjct: 911 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTKPE--VEYD 968
Query: 772 ANARAAGR 779
+ G
Sbjct: 969 CDNTRYGH 976
>gi|326912229|ref|XP_003202456.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Meleagris gallopavo]
Length = 1051
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 826 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRDLSKPE--VEYDC 883
Query: 773 N 773
+
Sbjct: 884 D 884
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+D C C GG ++CCD+CP+ FH C+ L +P G W+C C++ F+++
Sbjct: 844 EDRCNKCGQGGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNFERQ 893
>gi|301609145|ref|XP_002934140.1| PREDICTED: PHD finger protein 21A-like [Xenopus (Silurana)
tropicalis]
Length = 681
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 702 IKLSLERPFSSKEN--DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRY 757
+ +SL PFS+ + +D C IC G LL CD+C R +H+DC+ P IP G W C
Sbjct: 475 LPVSLPSPFSTDSDIHEDFCSICRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPK 534
Query: 758 CMNTFQKE 765
C + K+
Sbjct: 535 CQDQMMKK 542
Score = 44.3 bits (103), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 814 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CAD 861
DFC +++CD C R YH+ CL+ L+ +PKG W+C C D
Sbjct: 492 DFCSICRKSGQLLMCDTCSRVYHLDCLEP----PLKTIPKGMWICPKCQD 537
>gi|170579494|ref|XP_001894854.1| PHD-finger family protein [Brugia malayi]
gi|158598405|gb|EDP36306.1| PHD-finger family protein [Brugia malayi]
Length = 598
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 77/203 (37%), Gaps = 62/203 (30%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKE----- 765
N C CM+GG+LLCCD CP +FH+ C P IPSG W C C + +
Sbjct: 55 NSQYCSSCMEGGELLCCDRCPASFHLMCHEPPIERSSIPSGKWLCNRCTHAVANDTSPKA 114
Query: 766 --KFVEYNA----------NARAAGRIE--------------------GVDPFAQMVSRC 793
K + +A NA +G +E + F+
Sbjct: 115 IKKVPKLDAECSSVDARRNNATISGIMEQNTDSGLSALAVLADAALATNAEQFSIPYDLS 174
Query: 794 IRIVQTP------DTEL-----GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKD 842
+I P T+L C LC R+ + T+I CD C YH+ CL
Sbjct: 175 CQIPPLPFDERLTSTQLVPPPKAVCHLCASRN------EKSTMIQCDFCSLCYHLDCL-- 226
Query: 843 HGMEDLQELPKGKWLCCADCKRI 865
L +PK KW+C A + I
Sbjct: 227 --TPPLPSVPKDKWMCPAHVEHI 247
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 72/179 (40%), Gaps = 39/179 (21%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM--NTFQKEKFVEY 770
N D C C +GG+L+CCD CP +FH++C++ LP +P G W CR C+ +T + + +E
Sbjct: 26 NHDNCDSCGEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQDTPFEPQSMEM 85
Query: 771 NANARAAGRIEGVDPFA-QMVSRCIRIVQTPDTELG------------------------ 805
R +EG + A + I+ V+ D G
Sbjct: 86 TIMGRLLQHLEGRNTVAFSLPLGVIKAVEPEDYADGQDDADAPGESNLFDSDQSDSDRGL 145
Query: 806 --GCVLCRGRDFCK----SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
G R +C R + C +C YH+ CL L P KWLC
Sbjct: 146 RQGKRKRRHDSYCSVCSLPSPSRDDLAQCTRCPHSYHLWCLD----PPLLAKPTVKWLC 200
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 31/49 (63%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
S N C +C GG L+CC+SCP AFH DC+ IP G+W+CR C N
Sbjct: 1082 SHVNVSWCFMCSKGGSLMCCESCPAAFHPDCIGYDEIPDGSWYCRDCTN 1130
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 714 ENDDLCGICMDGGDLLCCD--SCPRAFHIDCVSLPGIPSGTWHCRY 757
E++D C C +GG+L+ CD +CP+A+H+ C+ L P G W C +
Sbjct: 1498 EHEDDCFRCGEGGELVMCDKKTCPKAYHVKCLGLSKRPYGKWECPW 1543
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 711 SSKENDDLCGICMDGGDLLCCD-SCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
S KEN +C +C G+LL C+ C AFH+DC+ L +PSG + C C++
Sbjct: 919 SRKEN--VCQVCERPGELLLCEGQCCGAFHLDCIGLQQMPSGVFKCDECIS 967
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKF 767
C IC DGG LLCCD CPRAFH+ C+ S+ IP W+C+ C + +
Sbjct: 161 CNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMCTECLDRRRL 211
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 708 RPFSSKENDDL-CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
RP + ++ D+ C +C GG+LLCCD CPRAFH++C+ L IP W C C
Sbjct: 1230 RPKAPEDQWDVDCSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC 1281
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQ 763
C IC +GG+LLCCD CP FH C+ L IP G C C T +
Sbjct: 1117 CYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVK 1161
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 15/103 (14%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C +
Sbjct: 442 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC-------------S 488
Query: 773 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF 815
G+ E + + V R + P T G CR R++
Sbjct: 489 CPPLTGKAEKIITWRWAVHRNAAGDEQPSTSKGAAKPCRIREY 531
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 381 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 440
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 441 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 461
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 462 PSAYHTFCLN----PPLDTIPDGDWRC 484
>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
harrisii]
Length = 880
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 655 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 712
Query: 773 N 773
+
Sbjct: 713 D 713
>gi|395543452|ref|XP_003773631.1| PREDICTED: tripartite motif-containing protein 66 [Sarcophilus
harrisii]
Length = 1496
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + + E VEY+
Sbjct: 1258 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTKPE--VEYD 1315
Query: 772 AN 773
+
Sbjct: 1316 CD 1317
>gi|302915931|ref|XP_003051776.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
gi|256732715|gb|EEU46063.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
Length = 1194
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 67/165 (40%), Gaps = 43/165 (26%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS----GTWHCRYCM---------- 759
+ND+ C C GD++CCD CPR+FH +CV + +PS W+C C+
Sbjct: 754 DNDEDCSACGAAGDVVCCDGCPRSFHFECVGM--VPSEDLPDEWYCNECLFKRYPSRVPV 811
Query: 760 ---------NTFQKEKFVEYNANARAAGRIEGVDP----------FAQMVSRCIRIVQTP 800
N +K ++ + R EGV A+ R + P
Sbjct: 812 HKGVFGPALNNLEKSIPRAFSLPKKLQTRFEGVKAGPDGEYEEVTTAKTTKRKNGYEEVP 871
Query: 801 D----TELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
D + G VLC G C K+ R +I C C R +H+ CL
Sbjct: 872 DFFRQRDDGQPVLCHG---CQKAATDVRAIIPCSVCPRYWHIDCL 913
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 339 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 387
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 388 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 432
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 415 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
E++D C +C++ L+ C SCPR+FH+ C+ + P W C C +K Y
Sbjct: 990 ESNDFCEVCLNDEQLISCGSCPRSFHLICIQMKRAPRRDWRCLACTAGVKK-----YKQE 1044
Query: 774 ARAAGR-IEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 832
+ + IE + F S D + C+ C R I C C
Sbjct: 1045 LKDLKKIIEEKEAFEAKDS------NEEDFSINQCLKC-------GELLSRGHIECIGCG 1091
Query: 833 REYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 874
R+YH+ C DL + PKG W C C+ ++L + ++
Sbjct: 1092 RKYHLAC------ADLTKRPKGDWYCKKRCEPGYVSLNQTMN 1127
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 429
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 430 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 470
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L ++P G+WLC
Sbjct: 471 ACISSYHIHCLN----PPLPDIPNGEWLC 495
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|47206539|emb|CAF92235.1| unnamed protein product [Tetraodon nigroviridis]
Length = 993
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C CM
Sbjct: 2 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCM 46
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + D + C +C+ G C CD
Sbjct: 432 AKEEEE---------EYEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 471
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 472 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 512
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF 767
C IC DGG+LLCCD CPRAFH++C+ + IP W+C+ C + +
Sbjct: 161 CNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMCSECLDRRRL 211
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 26/40 (65%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
C +C GG+LLCCD CPRAFH+ C+ L IP W C C
Sbjct: 1195 CSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC 1234
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
C IC +GG+L+CCD CP FH C+ L +P G C C
Sbjct: 1062 CYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 430 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 484
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + D + C +C+ G C CD
Sbjct: 485 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 524
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 525 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 565
>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Monodelphis domestica]
Length = 1061
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 836 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 893
Query: 773 N----------ARAAGRIEGVD 784
+ + GR+ +D
Sbjct: 894 DDPSYSSEKRKSEGHGRLTPID 915
>gi|405960336|gb|EKC26267.1| E3 ubiquitin-protein ligase TRIM33 [Crassostrea gigas]
Length = 899
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNT 761
N+D C +C +GGDLLCCD CP+ +H+ C L PS W C C NT
Sbjct: 672 NEDYCAVCQNGGDLLCCDKCPKVYHLKCHIPELKEFPSDEWQCTMCTNT 720
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 437
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 438 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 478
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 479 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 519
>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
[Monodelphis domestica]
Length = 1060
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 835 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 892
Query: 773 N----------ARAAGRIEGVD 784
+ + GR+ +D
Sbjct: 893 DDPSYSSEKRKSEGHGRLTPID 914
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 391
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 436
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 29/186 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD CPRA+H+ C+ P P G+W C C+
Sbjct: 342 DHQDYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTCVKN---------- 391
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G+ P + + + E + +FC +++CD C
Sbjct: 392 ----------GIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDGGDLLICDTC 441
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 889
YH+ CL ++++P+G+W C C C + Q+++ P T + +
Sbjct: 442 PHSYHLNCLN----PPVEKVPEGEWSCPRCT-CPMLKGKCQRILAWKWRDAPYTEVADNR 496
Query: 890 KKHEES 895
HE S
Sbjct: 497 PGHEGS 502
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+D C C GG ++CCD+CP+ FH C++L +P G W+C C+ F+++
Sbjct: 865 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQ 914
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 428
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 429 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 469
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 TCISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 510
>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Monodelphis domestica]
Length = 1027
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 802 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 859
Query: 773 N----------ARAAGRIEGVD 784
+ + GR+ +D
Sbjct: 860 DDPSYSSEKRKSEGHGRLTPID 881
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 33/149 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 254
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + D + C +C+ G C CD
Sbjct: 255 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 294
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L ++P G+WLC
Sbjct: 295 ACISSYHIHCLN----PPLPDIPNGEWLC 319
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 279 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|297791319|ref|XP_002863544.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
gi|297309379|gb|EFH39803.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
Length = 2251
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 44/76 (57%), Gaps = 7/76 (9%)
Query: 690 YTSNG-VSLHELSIKLSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS 744
YTS+ + H L L+ ER S ++ +D C IC GGDLLCCDSCPR +H C++
Sbjct: 44 YTSSAKPTKHRLKTDLTPERNSSKRKGNDGNYFECVICDLGGDLLCCDSCPRTYHTACLN 103
Query: 745 LP--GIPSGTWHCRYC 758
P IP+G W C C
Sbjct: 104 PPLKRIPNGKWICPKC 119
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 315 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 363
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 364 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 408
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 409 PSSYHLHCLN----PPLPEIPNGEWLC 431
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 391 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 434
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 760
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C++
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRCLS 503
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 68/149 (45%), Gaps = 27/149 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHC-----EKEGIQWE 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A ++ E + + D + C +C+ G C CD
Sbjct: 428 AREESSEGEEEN---DDGRRDDGDVEEEDDHHMEFCRVCKDGGELLC-----------CD 473
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L ++P G+W+C
Sbjct: 474 TCPSSYHLHCLN----PPLPDIPNGEWIC 498
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 400
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 401 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 445
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 446 PSSYHLHCLN----PPLPEIPNGEWLC 468
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 428 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 471
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
Length = 764
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 539 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYD 595
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 428 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 468
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 469 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 509
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 252 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 306
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 307 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 347
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 348 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 388
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 394 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 448
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + D + C +C+ G C CD
Sbjct: 449 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 488
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 489 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 529
>gi|159162701|pdb|1MM2|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b
gi|319443275|pdb|2L75|A Chain A, Solution Structure Of Chd4-Phd2 In Complex With H3k9me3
Length = 61
Score = 64.7 bits (156), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 35/52 (67%), Gaps = 2/52 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
P S + + C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 2 PLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 53
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|241998816|ref|XP_002434051.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
gi|215495810|gb|EEC05451.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
Length = 907
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQ 763
SS N+D C +C DGG+LLCC SCPR +H+ C SL PS W C C++ +
Sbjct: 689 SSDPNEDWCSVCHDGGELLCCGSCPRVYHLQCHVPSLSATPSEDWTCLLCLDILK 743
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 391
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 436
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 406
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 407 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 447
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 448 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 488
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 375
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 376 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 420
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 421 PSSYHLHCLN----PPLPEIPNGEWLC 443
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 403 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 446
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 428
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 429 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 469
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 TCISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 510
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 3 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 60
Query: 773 NARA 776
+A +
Sbjct: 61 DAPS 64
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 422 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 476
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 477 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 517
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 518 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 558
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 471 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 525
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 526 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 566
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 567 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 607
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 283 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 337
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 338 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 378
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L ++P G+WLC
Sbjct: 379 ACISSYHIHCLN----PPLPDIPNGEWLC 403
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 363 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 406
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 435 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 489
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 490 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 530
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 531 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 571
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+D C C GG ++CCD+CP+ FH C++L +P G W+C C+ F+++
Sbjct: 860 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQ 909
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 436 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 490
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 491 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 531
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 532 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 572
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 426
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 427 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 471
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 371 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 425
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 426 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 466
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 467 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 507
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 426
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 427 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 471
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 398
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 399 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 439
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 440 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 480
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 417
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 418 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 458
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 459 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 499
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 528 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 582
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 583 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 623
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 624 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 664
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 33/50 (66%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
+D C C GG ++CCD+CP+ FH C+ L IP G W+C C++ F+++
Sbjct: 844 EDQCKECGKGGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNFERQ 893
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 373
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 374 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 418
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 419 PSSYHLHCLN----PPLPEIPNGEWLC 441
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 401 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 444
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E +
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDD 402
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
+ E D + C + GG +LC CD
Sbjct: 403 DEEEEGGCEEEEEDDHMEFCRVC---------KDGGELLC-----------------CDA 436
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L E+P G+WLC
Sbjct: 437 CPSSYHLHCLN----PPLPEIPNGEWLC 460
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 420 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 463
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 479 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 527
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 528 ---------EGIQWEPKEDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 572
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 573 PSSYHLHCLN----PPLAEIPNGEWLC 595
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ P IP+G W C C
Sbjct: 555 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLAEIPNGEWLCPRC 598
>gi|5834582|emb|CAB55313.1| rfg7 protein [Homo sapiens]
Length = 1052
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GGDLLCC+ CP+ H+ C +L PSG W C +C + + E VEY+
Sbjct: 811 NEDWCAVCQNGGDLLCCEKCPKVSHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 868
Query: 773 N----ARAAGRIEGVDPFAQMVSRCIRIV 797
+ ++ +G+ P Q +C R++
Sbjct: 869 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 895
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 416
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 417 ---------EGIQWEPKDDEDDEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 461
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 462 PSSYHLHCLN----PPLPEIPNGEWLC 484
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 444 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 487
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 477
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 522
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 392 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 446
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 447 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 487
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 488 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 528
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 397 EHQDFCEVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWSCPTC------------- 443
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D Q R + GG +LC CD C
Sbjct: 444 --EADGGVAEDDDDEHQEFCRICKD--------GGELLC-----------------CDMC 476
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH CL L ++P G W C C+ C + +QK++
Sbjct: 477 PSAYHTFCL----TPPLDDIPDGDWRCPRCS-CPPLQYKVQKIL 515
>gi|260792165|ref|XP_002591087.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
gi|229276287|gb|EEN47098.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
Length = 1005
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 758
N+D C +C +GGDLLCCD+CP+ +H+ C ++P +PSG + C C
Sbjct: 789 NEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 834
>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
Length = 1017
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 792 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 849
Query: 773 N 773
+
Sbjct: 850 D 850
>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase Trim24; AltName: Full=Tripartite motif-containing
protein 24
gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
Length = 1051
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 826 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 883
Query: 773 N 773
+
Sbjct: 884 D 884
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 80/199 (40%), Gaps = 45/199 (22%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP---GIPSGTWHCRYC---------- 758
+ +ND+ C C + GD+LCCD CPR+FH +CV+L +P W+C C
Sbjct: 892 ATDNDEFCSACGNAGDVLCCDGCPRSFHFECVNLTQSEDLPD-DWYCSECIMRRFPSRVP 950
Query: 759 ---------MNTFQKEKFVEYNANARAAGRIEGVDP-----FAQMVSRCIR----IVQTP 800
+N +K ++ R EGV + ++V + ++ +TP
Sbjct: 951 IHKGAFAPALNALEKSIPRAFSLPKHIQNRFEGVKAGPGGDYEEIVGKTVKRRTGFDETP 1010
Query: 801 DT----ELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 855
D + VLC C KS R ++ C C +H+ CL L P K
Sbjct: 1011 DLFKQRDENQPVLCHA---CQKSSNDTRAILPCSLCSYYWHLDCLD----PPLAVPPVLK 1063
Query: 856 -WLCCADCKRINLALQKLV 873
W C A I + +L
Sbjct: 1064 SWRCPAHVDHILADVPRLA 1082
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 477
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 522
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 710 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 762
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 102 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 158
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 814
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 159 QKNDLP-----LDATSYLDTISKRARTKVISAKCKNGIKSSDTEKVSRIFGSSILAKKRS 213
Query: 815 FCKSR---------FGRRTV 825
K + FGR++V
Sbjct: 214 SNKRKSILAHKVKTFGRKSV 233
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 421
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 829
A + + D + C +C+ G C CD
Sbjct: 422 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 462
Query: 830 QCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C YH+ CL L ++P G+WLC
Sbjct: 463 ACISSYHIHCLN----PPLPDIPNGEWLC 487
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 447 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|168051599|ref|XP_001778241.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162670338|gb|EDQ56908.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1245
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 4/71 (5%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMNTF-QKEKFVEY 770
+++D C IC GGDLLCCD+C FH+ C+ P +P G W C C+N EKF++
Sbjct: 51 QSEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKCVNPLGDVEKFIDT 110
Query: 771 NANA-RAAGRI 780
+ GRI
Sbjct: 111 QMRPMKVPGRI 121
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 67 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 115
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 116 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 160
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 161 PSSYHLHCLN----PPLPEIPNGEWLC 183
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 143 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 186
>gi|242024147|ref|XP_002432491.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212517929|gb|EEB19753.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1049
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 35/50 (70%), Gaps = 4/50 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMN 760
N+D C +CMDGG+L+CCD+CP+ FH++C +L +P + TW C C +
Sbjct: 847 NEDWCAVCMDGGELICCDNCPKVFHVNCHIPALKAMPGETETWQCLLCTD 896
>gi|111226798|ref|XP_642746.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|90970774|gb|EAL68857.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 914
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
NDD C C DGGDLLCCD+C +FH+ C++ P IPSG W C C
Sbjct: 60 NDDYCSGCGDGGDLLCCDNCQASFHLICLNPPLNEIPSGDWFCDSC 105
>gi|50290691|ref|XP_447778.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527089|emb|CAG60725.1| unnamed protein product [Candida glabrata]
Length = 671
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 44/109 (40%), Gaps = 36/109 (33%)
Query: 687 QHIYTSNGVSLHELSIKLSLE--RPFSSK------------------------------E 714
+H+ G++LH IK L+ F SK E
Sbjct: 212 EHVLVPKGINLHNPHIKTKLKDTHAFDSKTKLISQNPVGQMNDIAMAKKKSSDGTTQEIE 271
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
NDD C C G LCCD+CP++FH C++ P +P G W C CM
Sbjct: 272 NDDYCSACFQTGSFLCCDTCPKSFHFLCLNPPLDPDHLPEGDWSCPQCM 320
>gi|395538667|ref|XP_003771296.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Sarcophilus harrisii]
Length = 2011
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 523 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 566
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 32/52 (61%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
E DD C IC +GG+L+CCDSCPR H C+ L IP G ++C C KE
Sbjct: 1757 EWDDYCLICFEGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDCERERAKE 1808
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF 767
ND++C +C GG L+CCD+CP +H+DC P +P G W C+ C K K
Sbjct: 1226 NDNVCAVCSTGGMLVCCDTCPLVYHLDCAVPPLKKVPRGKWQCQLCTGVTTKGKI 1280
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 8/54 (14%)
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 863
CR C+ + ++LCD+C+R +H+ CLK L+ +PKG W C DCK
Sbjct: 1132 CR---ICRRKGDAEQMLLCDKCDRGHHMYCLKPR----LKHVPKGDWF-CPDCK 1177
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 732 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 780
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 781 ---------EGIQWEPKDDEDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 825
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 826 PSSYHLHCLN----PPLPEIPNGEWLC 848
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 808 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 851
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 326 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 374
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 375 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 419
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 420 PSSYHLHCLN----PPLPEIPNGEWLC 442
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 402 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 445
>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
Length = 981
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 756 NEDWCAVCQNGGELLCCEKCPKVFHLTCQVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 813
Query: 773 N 773
+
Sbjct: 814 D 814
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 492
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 493 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 537
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 538 PSSYHLHCLN----PPLPEIPNGEWLC 560
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 520 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 563
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
+NDDLC IC GGDL+CC++CP FH C+ L P G +HC C
Sbjct: 117 DNDDLCHICGLGGDLMCCETCPGVFHAACLGLAAPPEGDYHCPLC 161
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 858 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA 917
C + R + ++ R +LP SL V++ G+ A D R + D
Sbjct: 366 CMVEGARATPSAGEVSSRAFAELP-LSLLVVR------GAAAAAPRDCR-GIAPAYTPDQ 417
Query: 918 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRG--QDYHGMYCAILTVNQ 975
R LS A+ + H ++P+ +S + D++P ++ G G DY GM+ A+L
Sbjct: 418 RQELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGP 477
Query: 976 VVVSAGIFRIFGQELAELPLVATSNDCQGQ-GYFQSLFCCIEKLLGFLNVKTLVLPS 1031
V+ +FR FG +LAE+P++A + Q + G + L +E+LL K + P+
Sbjct: 478 AAVAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPA 533
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ + C +C DGG+LLCCDSCP AFH C++ P +P+G W+C C
Sbjct: 473 EHMEFCRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPRC 519
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 54/159 (33%)
Query: 709 PFSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCM 759
P S END D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 403 PNSDAENDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWSCPHCE 462
Query: 760 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 819
+E+ + + + C R+ + GG +LC
Sbjct: 463 GEGIQEQEED------------------EHMEFC-RVCKD-----GGELLC--------- 489
Query: 820 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C +H CL L+ +P GKW C
Sbjct: 490 --------CDSCPAAFHTFCLN----PPLKNVPTGKWNC 516
>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
Length = 981
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 756 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 813
Query: 773 N 773
+
Sbjct: 814 D 814
>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
Length = 516
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 291 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 348
Query: 773 N 773
+
Sbjct: 349 D 349
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 411 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 459
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 460 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 504
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 505 PSSYHLHCLN----PPLPEIPNGEWLC 527
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 487 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 530
>gi|224096253|ref|XP_002191494.1| PREDICTED: transcription intermediary factor 1-alpha [Taeniopygia
guttata]
Length = 1061
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C +GG LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 836 NEDWCAVCQNGGKLLCCEKCPKVFHLSCHVPTLMSFPSGEWICTFCRDLSKPE--VEYDC 893
Query: 773 N 773
+
Sbjct: 894 D 894
>gi|170058471|ref|XP_001864936.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877568|gb|EDS40951.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 830
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 75/213 (35%), Gaps = 76/213 (35%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQ------ 763
N+D C C +GGDL+CCD CP +FH+ C P IP+G W C C T Q
Sbjct: 43 HNNDTCDACGEGGDLICCDRCPSSFHLGCHDPPLSEQDIPNGLWICHTCKMTDQGSATQE 102
Query: 764 --------------KEKFVEYNANAR-----AAG----------RIE-------GVDPFA 787
K K + +N+R +AG R+E + P
Sbjct: 103 TRPDAAMAESSQISKIKKMRNRSNSRKNSANSAGGAEKGTIVDERVEKELEDLSSLSPLD 162
Query: 788 QMVSRCIRIVQTPDTEL----------------------GGCVLCRGRDFCKSRFGRRTV 825
Q++ R RI+ EL G C C R +
Sbjct: 163 QLI-RAARILNPRQFELPREINSHFPFPGTEKHDSGQKNGNAKTCHT---CGKSCRRAPL 218
Query: 826 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
I CD C+ +H CL L LP W+C
Sbjct: 219 IACDYCDLFFHQDCLD----PPLTALPTSMWMC 247
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 468 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 511
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 801 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 374 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 427
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 361 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 404
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 66/155 (42%), Gaps = 33/155 (21%)
Query: 725 GGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 782
GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N+ G
Sbjct: 290 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVG 349
Query: 783 VDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCDQCEREYHVGCL 840
DP + D + C +C+ G C CD C YH+ CL
Sbjct: 350 GDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CDTCPSSYHIHCL 387
Query: 841 KDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
L E+P G+WLC C C + +QK++
Sbjct: 388 N----PPLPEIPNGEWLCPRCT-CPSLKGKVQKIL 417
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 710 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 762
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 109 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 165
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 814
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 166 QKNDLP-----LDATSYLDTISKRARTKVVSAKCKNGIKSSDTEKVSRIFGSSILAKKRS 220
Query: 815 FCKSR---------FGRRTV 825
K + FGR++V
Sbjct: 221 SNKRKSILAHKVKTFGRKSV 240
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 710 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 762
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 102 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 158
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 814
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 159 QKNDLP-----LDATSYLDTISKRARTKVISAKCKNGIKSSDTEKVSRIFGSSILAKKRS 213
Query: 815 FCKSR---------FGRRTV 825
K + FGR++V
Sbjct: 214 SNKRKSILAHKVKTFGRKSV 233
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 333 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 392
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + + C R+ + GG +LC CD C
Sbjct: 393 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 429
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 430 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 468
>gi|47220585|emb|CAG05611.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 758
+DD C +C++GG+LLCCD CP+ +H+ C L G P G W C C
Sbjct: 1 SDDFCAVCLNGGELLCCDRCPKVYHLSCHLPPLSGFPQGDWVCTLC 46
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 336 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 395
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + + C R+ + GG +LC CD C
Sbjct: 396 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 432
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 433 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 471
>gi|260817522|ref|XP_002603635.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
gi|229288956|gb|EEN59646.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
Length = 190
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 758
N+D C +C +GGDLLCCD+CP+ +H+ C ++P +PSG + C C
Sbjct: 13 NEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 58
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + + C R+ + GG +LC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 469
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + + C R+ + GG +LC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 469
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 389
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + R+ + GG +LC CD C
Sbjct: 390 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 427
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 428 PSSYHLHCLN----PPLPEIPNGEWLC 450
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 410 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 453
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 591 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 646
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + R+ + GG +LC CD C
Sbjct: 647 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 684
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 685 PSSYHLHCLN----PPLPEIPNGEWLC 707
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 667 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 710
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 503
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + + C R+ + GG +LC CD C
Sbjct: 504 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 540
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 541 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 579
>gi|156402732|ref|XP_001639744.1| predicted protein [Nematostella vectensis]
gi|156226874|gb|EDO47681.1| predicted protein [Nematostella vectensis]
Length = 180
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 758
NDD C +C +GG+LLCCD+CPR FH+ C S+ PS W C C
Sbjct: 4 NDDWCAVCRNGGELLCCDTCPRVFHLQCHIPSVNNTPSDKWSCGLC 49
>gi|363755388|ref|XP_003647909.1| hypothetical protein Ecym_7248 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891945|gb|AET41092.1| hypothetical protein Ecym_7248 [Eremothecium cymbalariae
DBVPG#7215]
Length = 617
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 11/74 (14%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 769
ENDD C C+ G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 230 ENDDFCSSCLQTGIFLCCDTCPKSFHFACLNPPLDPDNLPEGDWSCYEC-------RFKQ 282
Query: 770 YNANARAAGRIEGV 783
N N R E +
Sbjct: 283 MNPNKSVMARNEKL 296
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 51 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 99
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
EG+ + + E CR CK ++ CD C
Sbjct: 100 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 144
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 145 PSSYHLHCLN----PPLPEIPNGEWLC 167
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 127 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 170
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 812 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 867
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + R+ + GG +LC CD C
Sbjct: 868 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 905
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 906 PSSYHLHCLN----PPLPEIPNGEWLC 928
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 888 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 931
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 37 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 83
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 395 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 449
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 450 AKEEEEDYEEDGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 492
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 493 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 531
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%)
Query: 711 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
S +E D +C +C+DGG+LL CD CP AFH CV L P G W C C
Sbjct: 460 SEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLC 507
>gi|213408481|ref|XP_002175011.1| transcriptional regulatory protein RCO1 [Schizosaccharomyces
japonicus yFS275]
gi|212003058|gb|EEB08718.1| transcriptional regulatory protein RCO1 [Schizosaccharomyces
japonicus yFS275]
Length = 335
Score = 63.2 bits (152), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 4/50 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
+N+D CG C G LCC+SCPR+FH+ C++ P IP G+W+C CM
Sbjct: 62 QNEDFCGACGGQGLFLCCESCPRSFHLSCLNPPLSRNDIPEGSWYCNKCM 111
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
D C +C DGG LLCCDSCP ++H+ C++ P IP G W C C+
Sbjct: 1 DFCHVCKDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRCL 45
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 432 AKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 474
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 81/218 (37%), Gaps = 57/218 (26%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C C GG+L+ CD+CPRA+H C+ ++ P G W C +CM
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCM------------ 300
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
I +P Q DFCK ++LCD C
Sbjct: 301 ---EHGPEIVKEEPAKQ-----------------------NDDFCKICKETENLLLCDNC 334
Query: 832 EREYHVGCLKDHGMEDLQELPKG--KWLC--CADCKRINLALQKLVDRGEE-----KLPE 882
+H C+ L ELP W C C K A + L R +E LPE
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWACPRCELAKPEQKAEKILCWRWKEIPYADPLPE 390
Query: 883 ----TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVD 916
T D++ K + +F ++W+ L + D
Sbjct: 391 GTEPTEDDLLLKPPRKMAPRREREFFIKWKYLSYWQCD 428
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSG--TWHCRYC 758
+K+NDD C IC + +LL CD+C +FH C+ P +P +W C C
Sbjct: 312 AKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
>gi|168012021|ref|XP_001758701.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162690311|gb|EDQ76679.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1220
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMNTF-QKEKFVE 769
+++D C IC GGDLLCCD+C FH+ C+ P +P G W C C+N EKF++
Sbjct: 51 QSEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKCVNPLGDVEKFID 109
>gi|281205840|gb|EFA80029.1| hypothetical protein PPL_06850 [Polysphondylium pallidum PN500]
Length = 704
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMN 760
N+D C C DGGDLLCC++C +FH+ C++ P +P G W+C C N
Sbjct: 87 NNDFCYSCRDGGDLLCCENCELSFHLLCLNPPNPEVPDGDWYCTRCTN 134
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 771
N+D C +C +GGDLL C+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 392 NEDWCAVCQNGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 449
Query: 772 ---ANARAAGRIEGVDPFAQMVSRCIRIV 797
+++ +G+ P Q +C R++
Sbjct: 450 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 476
>gi|348690302|gb|EGZ30116.1| hypothetical protein PHYSODRAFT_474458 [Phytophthora sojae]
Length = 239
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 703 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMN 760
+++ RP ++ ++ + C +C+ GG LLCCD C RA+H++CV SL +P G W C YC +
Sbjct: 179 RMARMRP-AADQHREYCEMCLKGGKLLCCDGCERAYHLNCVRPSLLDVPEGDWFCPYCRD 237
Query: 761 T 761
Sbjct: 238 A 238
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 432 AKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 474
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 79/212 (37%), Gaps = 57/212 (26%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C C GG+L+ CD+CPRA+H C+ ++ P G W C +CM
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCM------------ 300
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
I +P Q DFCK ++LCD C
Sbjct: 301 ---EHGPEIVKEEPAKQ-----------------------NDDFCKICKETENLLLCDNC 334
Query: 832 EREYHVGCLKDHGMEDLQELPKG--KWLC--CADCKRINLALQKLVDRGEE-----KLPE 882
+H C+ L ELP W C C K A + L R +E LPE
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWACPRCELAKPEQKAEKILCWRWKEIPYADPLPE 390
Query: 883 ----TSLDVIKKKHEESGSDNAVDFDVRWRVL 910
T D++ K + +F ++W+ L
Sbjct: 391 GTEPTEDDLLLKPPRKMAPRREREFFIKWKYL 422
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSG--TWHCRYC 758
+K+NDD C IC + +LL CD+C +FH C+ P +P +W C C
Sbjct: 312 AKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 760
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C++
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRCLS 503
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 64/147 (43%), Gaps = 23/147 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +KE +++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHC----EKEG-IQWE 427
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A ++ D +FC+ ++ CD C
Sbjct: 428 AREESS------------EGEEENDDGRRDDGDVEEEDDHHMEFCRVCKDGGELLCCDTC 475
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L ++P G+W+C
Sbjct: 476 PSSYHLHCLN----PPLPDIPNGEWIC 498
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 64/164 (39%), Gaps = 50/164 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 454 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 500
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A G + D Q R + GG +LC CD C
Sbjct: 501 ---EAEGPADEDDDEHQEFCRICKD--------GGELLC-----------------CDNC 532
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH CL L ++P G W C C+ C + +QK++
Sbjct: 533 PSAYHTFCLN----PPLDDIPDGDWRCPRCS-CPPLPDKVQKIL 571
>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
Length = 209
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 37/55 (67%), Gaps = 4/55 (7%)
Query: 714 ENDDLCGICMDGGDLLCCD-SCPRAFHIDCVSLPGIPSG-TWHCRYCMNTFQKEK 766
END++C IC DGGDLL CD C R +H+ C++L +P G TW C YC QKEK
Sbjct: 53 ENDEICYICGDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLCPYCAR--QKEK 105
>gi|430812048|emb|CCJ30503.1| unnamed protein product [Pneumocystis jirovecii]
Length = 635
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 115/295 (38%), Gaps = 64/295 (21%)
Query: 688 HIYTSNGVSLHELSIKLSLERPFSSK-ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 746
++YT ++H S E P + +++D C C G LCC+SCPR+FH C+ P
Sbjct: 217 NLYTGKATNIHIQQQLSSQEDPTKVRNQSNDFCSACKRHGRFLCCESCPRSFHFSCIEPP 276
Query: 747 ----GIPSGTWHCRYC-------------MNTFQKEKFV-----EYNANARAAGRIEGVD 784
+P +WHC C + F E + E++ EGV
Sbjct: 277 IDENCLPEDSWHCTLCRTKRFPPPKQPRGIFAFLMEYIIRNNPKEFSLPLNIREYFEGVS 336
Query: 785 --PFAQMVSRCIRIVQTPDTELGGCVLCRGRD------------FC----KSRFGRRTVI 826
P + I++ + + G L RD C ++ + ++
Sbjct: 337 TGPNGEYQD----IMEHKNQKFGRQKLMEERDPFQLEDKNGNPILCYKCSQNAMNGKAIV 392
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ-----KLVD----RGE 877
CD C +H+ C+ D M + + KW+C + ++ +++D RG
Sbjct: 393 SCDYCSLHWHIDCI-DPPMSSIPSASR-KWMCPNHAYHVTPKIRQPKKSRILDVCLKRGF 450
Query: 878 EKLPETSLDVI-------KKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRAL 924
+ +D KK+ + S D+ + ++R+R+ G K+D D R L
Sbjct: 451 KNNGNIEIDNTEEEEEEQKKRKKISFDDSYIINNIRYRLPESGVKLDFLDKVRTL 505
>gi|367008710|ref|XP_003678856.1| hypothetical protein TDEL_0A03130 [Torulaspora delbrueckii]
gi|359746513|emb|CCE89645.1| hypothetical protein TDEL_0A03130 [Torulaspora delbrueckii]
Length = 689
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 58/130 (44%), Gaps = 6/130 (4%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQK 764
P ENDD C C+ G LCCD+CP++FH C++ P +P G W C C +
Sbjct: 279 PTQEVENDDFCSSCLQSGSFLCCDTCPKSFHFLCLNPPLDADNLPEGDWSCPQCTFKQKH 338
Query: 765 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 824
+ + + + R E + P A++ + + ++ + + F R G R
Sbjct: 339 QNLTQIRKSEKEFIRTE-LPPSARLFGKLLFQLEASNPKQFSLPPSIKDSFQHVRSGSRG 397
Query: 825 VILCDQCERE 834
CD+ E+E
Sbjct: 398 Q-YCDEREKE 406
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
E+ + C +C DGG++LCCDSCP A+H C++ P IP G W C C
Sbjct: 434 EHQEFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRC 480
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 50/164 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 376 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 422
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A G + D Q R + GG +LC CD C
Sbjct: 423 ---EAEGPADEDDDEHQEFCRVCKD--------GGEMLC-----------------CDSC 454
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH CL L ++P G W C C+ C I + K++
Sbjct: 455 PSAYHTWCL----TPPLDDIPDGDWRCPRCS-CPPIPYKVAKIL 493
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 70/165 (42%), Gaps = 33/165 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 454 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 513
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
G E + C R+ + GG +LC CD
Sbjct: 514 EDYEEERGGKERRREEDDHMEYC-RVCKD-----GGELLC-----------------CDA 550
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 551 CISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 590
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 713 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 527 REEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|397596945|gb|EJK56894.1| hypothetical protein THAOC_23124 [Thalassiosira oceanica]
Length = 1752
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKE 765
+P E+DD C +C GGDLLCCD C +A+H+ C L IP G W C+ C K
Sbjct: 683 KPLFKGEHDDTCYMCYQGGDLLCCDYCSKAYHMKCHLPPLTEIPEGNWKCQECAAVEMKR 742
Query: 766 KF 767
F
Sbjct: 743 MF 744
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 73/172 (42%), Gaps = 43/172 (25%)
Query: 714 ENDDLCGICMDGG-----------------------DLLCCDSCPRAFHIDC--VSLPGI 748
E+D C IC +GG DL+CCD CP+ +H +C + +
Sbjct: 505 EHDAECFICGEGGGKRLITYSHSHHTHSPFTPLTYTDLVCCDGCPKVYHSNCHKPKIREL 564
Query: 749 PSGTWHCRYC--MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 806
P G W C +C +K+K+ + A+ G E VD A+ V +C V+ P+ E
Sbjct: 565 PDGEWLCMHCKPKGADRKKKYQGFRL-AKIPG--ETVDSPARHV-KC--TVRWPEME--- 615
Query: 807 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
C++C G + + C C+ YH C+ L+ P GKW C
Sbjct: 616 CIICEGTEV-TGPLKDNDWVTCATCDDAYHTRCV------GLETRPGGKWRC 660
Score = 47.4 bits (111), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 50/135 (37%), Gaps = 37/135 (27%)
Query: 727 DLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC---MNTFQKEKFVEYNANARAAGRIEGV 783
D + C +C A+H CV L P G W C C K K + NA ++ + E
Sbjct: 632 DWVTCATCDDAYHTRCVGLETRPGGKWRCPTCKERKKLIPKTKPNKENAPSKPLFKGEHD 691
Query: 784 DPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH 843
D C Q GG +LC CD C + YH+ C
Sbjct: 692 D-------TCYMCYQ------GGDLLC-----------------CDYCSKAYHMKC---- 717
Query: 844 GMEDLQELPKGKWLC 858
+ L E+P+G W C
Sbjct: 718 HLPPLTEIPEGNWKC 732
>gi|363745215|ref|XP_003643224.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Gallus gallus]
Length = 137
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/48 (54%), Positives = 35/48 (72%), Gaps = 3/48 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCM 759
+N+D C + DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 1 DNEDECSV-RDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCV 47
>gi|307180228|gb|EFN68261.1| Nucleosome-remodeling factor subunit NURF301 [Camponotus
floridanus]
Length = 3651
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 49/119 (41%), Gaps = 32/119 (26%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 769
S DD C +C GDLLCC++CP FH++CV P +P+ W C C
Sbjct: 361 SMHYDDHCRVCHRLGDLLCCETCPAVFHLECVEPPLVDVPTDDWQCSTC----------- 409
Query: 770 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILC 828
A ++ G V CI PD E G LCR R+GR+ L
Sbjct: 410 ------KANKVMG-------VVDCI-----PDVEKNGS-LCRQEHLGFDRYGRKYWFLA 449
>gi|170051000|ref|XP_001861566.1| nucleosome-remodeling factor subunit NURF301 [Culex
quinquefasciatus]
gi|167872443|gb|EDS35826.1| nucleosome-remodeling factor subunit NURF301 [Culex
quinquefasciatus]
Length = 2337
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
DD C IC GDLLCC++CP FH+DCV P +PS W C C
Sbjct: 366 DDHCRICHRLGDLLCCETCPAVFHLDCVDPPLIDVPSEDWQCNLC 410
>gi|323456424|gb|EGB12291.1| hypothetical protein AURANDRAFT_61296 [Aureococcus anophagefferens]
Length = 757
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 29/44 (65%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N+D C IC GG+LLCCD+C RAFH+ C L +P G W C C
Sbjct: 697 NEDHCFICRSGGELLCCDTCERAFHVTCCGLADVPEGAWSCYVC 740
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 76/204 (37%), Gaps = 51/204 (25%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 766
P ++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C N
Sbjct: 362 PDGEGDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCEND----- 416
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 826
G +E D + C + GG +LC
Sbjct: 417 -----------GALEDDDEHMEFCRVC---------KDGGELLC---------------- 440
Query: 827 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETS 884
CD C YH CL L E+P G W C C+ + L R +E + S
Sbjct: 441 -CDSCTSAYHTHCLN----PPLTEIPDGDWKCPRCSCPPLFGKVAKILTWRWKEVMDPPS 495
Query: 885 LDVIKKKHEESGSDNAV-DFDVRW 907
+ K SG + +F V+W
Sbjct: 496 EEPSTSKASSSGKPRKIREFFVKW 519
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 70/165 (42%), Gaps = 33/165 (20%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 259
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
G E + C R+ + GG +LC CD
Sbjct: 260 EDYEEERGGKERRREEDDHMEYC-RVCKD-----GGELLC-----------------CDA 296
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 297 CISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 336
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 713 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 273 REEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|374109838|gb|AEY98743.1| FAGL075Cp [Ashbya gossypii FDAG1]
Length = 594
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 11/72 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 769
ENDD C CM G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 207 ENDDFCSSCMQTGIFLCCDTCPKSFHFACLNPPLDPDRLPEGDWSCHEC-------RFKQ 259
Query: 770 YNANARAAGRIE 781
AAG+ E
Sbjct: 260 MYPTKAAAGKAE 271
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 426
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 427 AKEEDEEYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 469
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 508
>gi|125812233|ref|XP_001337239.1| PREDICTED: autoimmune regulator-like [Danio rerio]
Length = 640
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP--SGTWHCRYCMNTFQKEKFVEYNA 772
NDD+C C G DL CCD CPRAFH DC LP +P SG W C +C+ + +++ A
Sbjct: 468 NDDVCYACHCGVDLRCCDGCPRAFHSDC-HLPAVPEGSGEWICTFCV----LKTSLQWRA 522
Query: 773 NARAAGRIEGVDPFAQMVSRC 793
++ + P +Q + +C
Sbjct: 523 SSNMTEQEAFDSPVSQYILQC 543
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 398
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 399 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 441
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 442 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 480
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 31/45 (68%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 759
N ++C IC G+L+ CD CP++FH+DC+ L +P GTW C C+
Sbjct: 1064 NQEVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPCV 1108
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 56/145 (38%), Gaps = 23/145 (15%)
Query: 719 CGICMDGGD---LLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNAN 773
C +C + +L CD C R +HI C+ P IP G W C C T Q A
Sbjct: 978 CRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPT-QLSPRKRTKAP 1036
Query: 774 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 833
+ E + + + E+ C +C +ILCD C +
Sbjct: 1037 VEVSSEEEDDNEKVDEDGDEDEEEEDLNQEV--CNICESPG---------ELILCDFCPK 1085
Query: 834 EYHVGCLKDHGMEDLQELPKGKWLC 858
+H+ C+ DL+ LP+G W C
Sbjct: 1086 SFHLDCI------DLKRLPRGTWKC 1104
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 5/56 (8%)
Query: 816 CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 871
C+ + ++LCD C+R YH+ CLK L E+P+G W C+ C L+ +K
Sbjct: 981 CRKKSNPEQMLLCDGCDRGYHIYCLK----PPLSEIPQGDWF-CSQCSPTQLSPRK 1031
>gi|291230097|ref|XP_002735005.1| PREDICTED: tripartite motif-containing 28 protein-like
[Saccoglossus kowalevskii]
Length = 995
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMN 760
N+D C C +GGDLLCCD+CP+ FH+ C SL P TW C C +
Sbjct: 792 NEDYCACCQNGGDLLCCDTCPKVFHLQCHIPSLTATPKETWICGLCQD 839
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 388 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 442
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 443 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 485
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 524
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 209 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 263
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A E + + + + GG +LC CD C
Sbjct: 264 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 306
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 307 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 345
>gi|345482162|ref|XP_001605087.2| PREDICTED: nucleosome-remodeling factor subunit NURF301 [Nasonia
vitripennis]
Length = 2739
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 50/118 (42%), Gaps = 32/118 (27%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 769
+++ DD C +C GDLLCC++CP FH++CV P +P W C C
Sbjct: 353 NRDYDDHCRVCHRLGDLLCCETCPAVFHLECVEPPLVDVPPEDWQCNIC----------- 401
Query: 770 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
A +I G V CI PD E G +LCR R GR+ L
Sbjct: 402 ------KAHKISG-------VMDCI-----PDVEKNG-LLCRQEHLGFDRHGRKYWFL 440
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 65/164 (39%), Gaps = 50/164 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 411 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 457
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A G + D Q R + GG +LC CD C
Sbjct: 458 ---EAEGPADEDDDEHQEFCRVCKD--------GGELLC-----------------CDNC 489
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH CL L ++P G+W C C+ C + +QK++
Sbjct: 490 PSAYHTFCLN----PPLDDIPDGEWRCPRCS-CPPLADKVQKIL 528
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 36/52 (69%), Gaps = 5/52 (9%)
Query: 714 ENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 760
E+DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C++
Sbjct: 464 EDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCLS 515
Score = 40.0 bits (92), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L + P+GKW C
Sbjct: 392 IILCDTCPRAYHLVCLE----PELDKAPEGKWSC 421
>gi|45201021|ref|NP_986591.1| AGL075Cp [Ashbya gossypii ATCC 10895]
gi|44985791|gb|AAS54415.1| AGL075Cp [Ashbya gossypii ATCC 10895]
Length = 594
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 11/72 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 769
ENDD C CM G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 207 ENDDFCSSCMQTGIFLCCDTCPKSFHFACLNPPLDPDRLPEGDWSCHEC-------RFKQ 259
Query: 770 YNANARAAGRIE 781
AAG+ E
Sbjct: 260 MYPTKAAAGKAE 271
>gi|366994322|ref|XP_003676925.1| hypothetical protein NCAS_0F00850 [Naumovozyma castellii CBS 4309]
gi|342302793|emb|CCC70569.1| hypothetical protein NCAS_0F00850 [Naumovozyma castellii CBS 4309]
Length = 742
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
ENDD C C G LCCD+CP++FH C+ P +P G W C C+
Sbjct: 324 ENDDYCSACFQTGSFLCCDTCPKSFHFLCLDPPLDPNNLPEGDWSCHECL 373
>gi|74149099|dbj|BAE32198.1| unnamed protein product [Mus musculus]
Length = 895
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 57 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 100
>gi|449675047|ref|XP_002155989.2| PREDICTED: transcription intermediary factor 1-alpha-like [Hydra
magnipapillata]
Length = 1025
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 42/72 (58%), Gaps = 5/72 (6%)
Query: 690 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPG 747
++ NG++ E + + S+ N+D CG+C +GG LLCCD+CP+ +H+ C L
Sbjct: 805 FSENGINQEEYQCESGFDE---SQNNEDYCGVCRNGGQLLCCDTCPKVYHLQCHIPVLND 861
Query: 748 IPSGTWHCRYCM 759
+P +W C C+
Sbjct: 862 LPRDSWSCGLCV 873
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 81
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 396
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + R+ + GG +LC CD C
Sbjct: 397 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 434
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 81
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 80
>gi|332024007|gb|EGI64225.1| E3 ubiquitin-protein ligase TRIM33 [Acromyrmex echinatior]
Length = 1083
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 875 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHVPSLKSFPDESETWQCMLCTNVL 927
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 82/211 (38%), Gaps = 56/211 (26%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
EN+D C C GG+LL CD+CPRA+H C+ S+ P G W C +C
Sbjct: 249 ENNDFCEECKQGGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHC------------- 295
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ IV+ ++ DFCK ++LCD C
Sbjct: 296 ------------------IEHGPEIVKEEPQKVND-------DFCKICKETENLLLCDTC 330
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI---NLALQKLVDRGEE-----KLPE- 882
+H C+ L ++P+ + C C+ + A + L R +E LPE
Sbjct: 331 VCAFHAYCMD----PPLTQVPQEETWNCPRCELVKPEQKAEKILCWRWKEIPYPDPLPEG 386
Query: 883 ---TSLDVIKKKHEESGSDNAVDFDVRWRVL 910
T D++ K + +F ++W+ L
Sbjct: 387 TEPTEDDLLLKPPRKMAPRREREFFIKWKYL 417
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 713 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPS-GTWHCRYC 758
K NDD C IC + +LL CD+C AFH C+ P +P TW+C C
Sbjct: 309 KVNDDFCKICKETENLLLCDTCVCAFHAYCMDPPLTQVPQEETWNCPRC 357
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 437 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 390 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 419
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 321 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 376
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ G + R+ + GG +LC CD C
Sbjct: 377 KDDDEEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 414
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH+ CL L E+P G+WLC
Sbjct: 415 PSSYHLHCLN----PPLPEIPNGEWLC 437
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 717 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 397 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 440
>gi|380020007|ref|XP_003693890.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase
TRIM33-like [Apis florea]
Length = 1046
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 839 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 891
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 421 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 374 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 403
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 389 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 418
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 389 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 418
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 380 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 409
>gi|403179544|ref|XP_003888549.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165103|gb|EHS62842.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1912
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 111/292 (38%), Gaps = 69/292 (23%)
Query: 716 DDLCGICMDG-----GDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 770
D C IC DG ++ CD C A H DC +P IP G W CR C + + E
Sbjct: 140 DSRCAICEDGDTENSNAIVFCDGCNLAVHQDCYGVPYIPEGQWLCRKCTVSPDRPVTCEL 199
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD----FCKSRFG----- 821
N+ A + + +A +V C + P+T +G + D K R+
Sbjct: 200 CPNSFGAFKQTSENKWAHLV--C--AIHIPETGVGNAMYMEPVDGVRCIPKQRWKLKCYI 255
Query: 822 -RRTVILCDQCERE-----YHVGCLKDHGMEDLQELPKGKWL--------------CCAD 861
++TV C QC YH C ++ G+ ++ P G C AD
Sbjct: 256 CKKTVGACIQCANRSCCVAYHATCAQEVGLY-VKMKPAGSLYPTTGARGTDEGPDSCAAD 314
Query: 862 ---CKRINL----ALQKLVDRGEEKLPETSLDV--IKKKHEESGSDNAVDFDVRWRVLRG 912
I L ALQ P T+ D KK + GSD+ + +
Sbjct: 315 HVGADGIRLRHIEALQAAAVARLSNNPATTTDSSDAKKTFDPQGSDSHL-------AISS 367
Query: 913 KKVDASDGTRAL--------LSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 956
KK+++ D RAL SK+ +H + +S L+P +++ R
Sbjct: 368 KKINSLDDLRALKLLATGSQTSKSARAYHKSY------SSGPPLVPEVIFQR 413
>gi|350414692|ref|XP_003490389.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Bombus
impatiens]
Length = 1036
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 829 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 881
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 423 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 362 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 421
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 422 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 442
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 443 PSAYHTFCLN----PPLDTIPDGDWRC 465
>gi|340715100|ref|XP_003396058.1| PREDICTED: LOW QUALITY PROTEIN: e3 ubiquitin-protein ligase
TRIM33-like [Bombus terrestris]
Length = 1036
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 829 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 881
>gi|307205723|gb|EFN83968.1| E3 ubiquitin-protein ligase TRIM33 [Harpegnathos saltator]
Length = 1101
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 893 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLESFPDESETWQCMLCTNVL 945
>gi|326671403|ref|XP_001919955.3| PREDICTED: PHD finger protein 12 [Danio rerio]
Length = 941
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D+C C +GGDLLCCD CP AFH+ C + P +PSG W C C
Sbjct: 55 NHDICDSCREGGDLLCCDHCPAAFHLQCCNPPLSREMLPSGDWMCHRC 102
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 765
C +C GG +L CD+CPR FH C+ L IP G W C C++ F ++
Sbjct: 824 CKVCGQGGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQ 870
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 6/34 (17%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
V+LCD C R +H CLK L+E+PKGKW C
Sbjct: 833 VLLCDTCPRVFHPRCLK------LKEIPKGKWSC 860
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 380 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 409
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 80
>gi|195130615|ref|XP_002009747.1| GI15065 [Drosophila mojavensis]
gi|193908197|gb|EDW07064.1| GI15065 [Drosophila mojavensis]
Length = 886
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N DLC C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPSGQWLCHSC 99
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 718 LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
C +C DGG+LLCCDSCP A+H C+S P +P G W C C
Sbjct: 434 FCRVCKDGGELLCCDSCPAAYHTFCLSPPITDVPDGDWKCPRC 476
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 57/151 (37%), Gaps = 47/151 (31%)
Query: 710 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKF 767
+ ++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + +E+
Sbjct: 368 YEQTDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCESEGGQEQ- 426
Query: 768 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 827
R+ + GG +LC
Sbjct: 427 ------------------EEDEHQEFCRVCKD-----GGELLC----------------- 446
Query: 828 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
CD C YH CL + ++P G W C
Sbjct: 447 CDSCPAAYHTFCLS----PPITDVPDGDWKC 473
>gi|383860809|ref|XP_003705881.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Megachile rotundata]
Length = 1061
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 854 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 906
>gi|255731658|ref|XP_002550753.1| predicted protein [Candida tropicalis MYA-3404]
gi|240131762|gb|EER31321.1| predicted protein [Candida tropicalis MYA-3404]
Length = 507
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 65/168 (38%), Gaps = 34/168 (20%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
NDD C C G L+CC+SCP++FH C P P W+C+ C + E
Sbjct: 201 NDDFCSSCELPGILICCESCPKSFHFTCCDPPLEEAPEDEWYCQECYSKIHPPILTE-GI 259
Query: 773 NARAAGRIEGVDP----------FAQMVSRCIRIVQTPDTELGG----------CVLCRG 812
+ +E +P + S+ ++ + +L C C G
Sbjct: 260 FGKLLFELEKTNPKEFKLPVKFQLPKNESQLQERIKDKNWDLEALYDENDKPYLCHCCGG 319
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKD--HGMEDLQELPKGKWLC 858
S RRT+I CD C YH+ CL G + L E KW C
Sbjct: 320 -----SGLNRRTLIHCDYCPLVYHIDCLNPPMFGPKTLGE----KWKC 358
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 40.8 bits (94), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 389 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 418
>gi|307180087|gb|EFN68155.1| E3 ubiquitin-protein ligase TRIM33 [Camponotus floridanus]
Length = 1078
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 870 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 922
>gi|146165350|ref|XP_001014826.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila]
gi|146145551|gb|EAR94494.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila SB210]
Length = 1612
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N+ C IC D G L+CC++C + FH+ CV + P+G W C YC
Sbjct: 295 NESWCFICRDQGKLICCENCSKTFHLTCVGIKKPPTGAWECPYC 338
>gi|410899062|ref|XP_003963016.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1148
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 48/86 (55%), Gaps = 8/86 (9%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 771
N+D C +C++GGDLLCCD CP+ FH+ C +P I P G + C +C + E ++Y
Sbjct: 908 NEDWCAVCINGGDLLCCDRCPKVFHMKC-HVPTIKIFPKGEFLCTFCRSLETPE--IKYC 964
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIV 797
+R + + P Q RC R++
Sbjct: 965 EESRKTSTEQSLSPEDQ--RRCERLL 988
>gi|313247391|emb|CBY15642.1| unnamed protein product [Oikopleura dioica]
Length = 1498
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYC 758
E+++LC C GG+LL C+SCPR +H C++ P IP G W C YC
Sbjct: 146 EHNELCQFCKSGGELLACESCPRVYHPKCLNPPQTEIPDGDWFCPYC 192
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
E++D+C C GG+L+CCD+CP+AFH++C L +P G W C C
Sbjct: 1420 EHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENC 1466
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 695 VSLHELSIKLS-LERPFSSKEN--DDLCGICMDGGD---LLCCDSCPRAFHIDCVSLP-- 746
SL ++ + LS LER + ++ C +C GG+ +L CDSC R H+ C+ P
Sbjct: 1247 TSLSQIFLHLSTLERSVTWNKSILKARCRMCRRGGNPEAMLLCDSCNRGHHMFCLKPPLK 1306
Query: 747 GIPSGTWHCRYC 758
+P G W C+ C
Sbjct: 1307 KVPKGEWFCKDC 1318
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 862
CR C+ ++LCD C R +H+ CLK L+++PKG+W C DC
Sbjct: 1274 CR---MCRRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDC 1318
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 431 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 370 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 429
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 430 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 450
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 451 PSAYHTFCLN----PPLDTIPDGDWRC 473
>gi|195402015|ref|XP_002059606.1| GJ14859 [Drosophila virilis]
gi|194147313|gb|EDW63028.1| GJ14859 [Drosophila virilis]
Length = 884
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N DLC C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|307208076|gb|EFN85607.1| PHD finger protein 12 [Harpegnathos saltator]
Length = 633
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 69/179 (38%), Gaps = 41/179 (22%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC----VSLPGIPSGTWHCRYCMNTFQKEKFVE 769
N D C C DGG+L+CCD CP ++H+ C V IP+G W C C ++E +
Sbjct: 54 HNHDFCDACKDGGELICCDRCPASYHLQCHYPAVDPMDIPNGEWLCYTCRCAAKREMDNK 113
Query: 770 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT----- 824
N + +E + A +V+ R + P EL ++ G + GRR
Sbjct: 114 GNDKKKKMSALEILAFAASLVNP--REFELP-KELQLPIMFPGSNKMDYVSGRRVGKSHC 170
Query: 825 -------------------------VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+I CD C +H CL L P G+W+C
Sbjct: 171 LDSNLMVPLPARLCFECGRSCRKAPLIACDYCPLYFHQDCLD----PPLTTFPIGRWMC 225
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 431 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 370 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 429
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 430 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 450
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 451 PSAYHTFCLN----PPLDTIPDGDWRC 473
>gi|322707103|gb|EFY98682.1| Essential subunit of the histone deacetylase Rpd3S complex
[Metarhizium anisopliae ARSEF 23]
Length = 1190
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 69/163 (42%), Gaps = 40/163 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL---PGIPSGTWHCRYC------------ 758
+ND+ C C + GD++CCD CPR+FH +CV + +P W+C C
Sbjct: 754 DNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDDLPD-EWYCNECLIRRYPSRVPIH 812
Query: 759 -------MNTFQKEKFVEYNANARAAGRIEGVDP-----FAQMVS-RCIRI---VQTPD- 801
+N +K ++ R R EGV + ++VS R R PD
Sbjct: 813 KGIFGSALNNLEKSIPRAFSLPKRVQNRFEGVKAGADGDYEEVVSNRTARKRNGTDEPDF 872
Query: 802 ---TELGGCVLCRGRDFCKSRFGR-RTVILCDQCEREYHVGCL 840
E G VLC C+ + R++I C C +H+ CL
Sbjct: 873 FKQREDGQAVLCHS---CQKPATQIRSIIPCSVCSFYWHIDCL 912
>gi|260791426|ref|XP_002590730.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
gi|229275926|gb|EEN46741.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
Length = 1073
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
N D C C +GGDLLCCD CP AFH+ C P +P G W C CM
Sbjct: 49 NHDCCDSCKEGGDLLCCDRCPAAFHLQCCDPPLCEEDLPEGEWLCHRCM 97
>gi|410074363|ref|XP_003954764.1| hypothetical protein KAFR_0A01910 [Kazachstania africana CBS 2517]
gi|372461346|emb|CCF55629.1| hypothetical protein KAFR_0A01910 [Kazachstania africana CBS 2517]
Length = 686
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
+NDD C C+ G LCCD+CPR+FH C+ P +P G W C C
Sbjct: 247 DNDDFCSACLQTGSFLCCDTCPRSFHFLCLDPPIDPNHLPEGDWSCPSC 295
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
N+D C C DGGDLLCC+SC AFH+ C+ P +P G W C C
Sbjct: 88 NNDFCDACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSC 133
>gi|166158078|ref|NP_001107447.1| uncharacterized protein LOC100135295 [Xenopus (Silurana)
tropicalis]
gi|156230557|gb|AAI52125.1| Zgc:173656 protein [Danio rerio]
gi|163916551|gb|AAI57592.1| LOC100135295 protein [Xenopus (Silurana) tropicalis]
Length = 386
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 744
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 197 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 255
Query: 745 LPGIP---SGTWHCRYCM 759
LP +P SG W C C+
Sbjct: 256 LPAVPEDSSGKWSCTICV 273
>gi|297737996|emb|CBI27197.3| unnamed protein product [Vitis vinifera]
Length = 1638
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 765
C IC GG+LLCCDSCPR +H+ C++ P IP+G W C C + +K+
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKCFCSIEKK 126
>gi|156554524|ref|XP_001605295.1| PREDICTED: transcription intermediary factor 1-alpha-like [Nasonia
vitripennis]
Length = 1085
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 715 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 762
N+D C +CMDGGD +LCCD CP+ FH+ C +L P S TW C C N
Sbjct: 878 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPNLKSFPEESETWQCMLCTNVL 930
>gi|357618139|gb|EHJ71233.1| hypothetical protein KGM_08685 [Danaus plexippus]
Length = 792
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFV 768
N D C C +GGDL+CCD CP +FH+ C P IP+G+W CR C +K+ V
Sbjct: 49 HNHDSCDACREGGDLICCDRCPASFHLGCYDPPLEENDIPAGSWLCRECKAGDEKQGVV 107
>gi|348523329|ref|XP_003449176.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1174
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 4/63 (6%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 772
N+D C +C++GGDLLCCD CP+ FH+ C ++ +P G + C +C + E +EY
Sbjct: 934 NEDWCAVCINGGDLLCCDRCPKVFHMKCHVPTIKIVPKGDFLCTFCRSITNPE--IEYCD 991
Query: 773 NAR 775
+R
Sbjct: 992 ESR 994
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
E++D+C C GG+L+CCD+CP+AFH++C L +P G W C C
Sbjct: 1491 EHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENC 1537
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 43/162 (26%)
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM---NTF 762
R + ++++LC C G L+ C CP +H +C+ P + W C C+ T
Sbjct: 1416 RGLGALDHNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLCVMDRTTN 1475
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 822
+ E +N G IE D + SRC GG ++C
Sbjct: 1476 GADSEEEMGSN---DGEIEHED----VCSRCRH---------GGELIC------------ 1507
Query: 823 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 864
CD C + +H+ C K L+++PKG W C +CK+
Sbjct: 1508 -----CDTCPKAFHMECCKPV----LRKVPKGHWE-CENCKK 1539
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 695 VSLHELSIKLS-LERPFSSKEN--DDLCGICMDGGD---LLCCDSCPRAFHIDCVSLP-- 746
SL ++ + LS LER + ++ C +C GG+ +L CDSC R H+ C+ P
Sbjct: 1247 TSLSQIFLHLSTLERSVTWNKSILKARCRMCRRGGNPEAMLLCDSCNRGHHMFCLKPPLK 1306
Query: 747 GIPSGTWHCRYC 758
+P G W C+ C
Sbjct: 1307 KVPKGEWFCKDC 1318
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 810 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 862
CR C+ ++LCD C R +H+ CLK L+++PKG+W C DC
Sbjct: 1274 CR---MCRRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDC 1318
>gi|194874449|ref|XP_001973400.1| GG13363 [Drosophila erecta]
gi|190655183|gb|EDV52426.1| GG13363 [Drosophila erecta]
Length = 711
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 5/67 (7%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN---TFQKEKFVE 769
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C T + EK +
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSCPPLTGKAEKIIT 495
Query: 770 YNANARA 776
+ R+
Sbjct: 496 WRWAQRS 502
Score = 40.4 bits (93), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 825 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 389 IILCDTCPRAYHLVCLE----PELDEPPEGKWSC 418
>gi|254578124|ref|XP_002495048.1| ZYRO0B02156p [Zygosaccharomyces rouxii]
gi|238937938|emb|CAR26115.1| ZYRO0B02156p [Zygosaccharomyces rouxii]
Length = 696
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 759
ENDD C C+ G LCCD+CP++FH C++ P +P G W C C+
Sbjct: 298 ENDDFCSACLQSGSFLCCDTCPKSFHFLCLNPPVDPDELPDGDWSCPQCV 347
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 437 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 40.4 bits (93), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+D+C+ +ILCD C R YH+ CL+ +L E P+GKW C
Sbjct: 378 QDYCEVCQQGGEIILCDTCPRAYHLVCLE----PELDEPPEGKWSC 419
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 4/54 (7%)
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 758
+ P +S E D C +C G+LLCCD CPR +H+DCV+ L +P G W C C
Sbjct: 711 DSPTTSHE--DYCIVCQQSGELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPAC 762
>gi|160773236|gb|AAI55238.1| Zgc:173656 protein [Danio rerio]
Length = 408
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 744
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 745 LPGIP---SGTWHCRYCM 759
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCTICV 295
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 74/198 (37%), Gaps = 48/198 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 376 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 422
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
AG E D + C + GG +LC CD C
Sbjct: 423 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 456
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 889
YH CL L E+P G W C C+ C I + K++ ++ PET +
Sbjct: 457 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKECPETPSEEPS 511
Query: 890 KKHEESGSDNAVDFDVRW 907
+F V+W
Sbjct: 512 TSKAAPKQRRIREFFVKW 529
>gi|312377051|gb|EFR23976.1| hypothetical protein AND_11766 [Anopheles darlingi]
Length = 2960
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
DD C C GDLLCC++CP FH++CV P IP+G W C C
Sbjct: 355 DDHCRACHRVGDLLCCETCPAVFHLECVEPPLVNIPNGDWQCNLC 399
>gi|432864384|ref|XP_004070295.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like, partial
[Oryzias latipes]
Length = 1042
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 52/87 (59%), Gaps = 10/87 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY- 770
N+D C +C++GGDLLCCD+CP+ FH+ C +P I P G + C +C + E +EY
Sbjct: 803 NEDWCAVCINGGDLLCCDNCPKVFHMKC-HVPTIKIFPKGDFLCTFCRSLSVPE--IEYC 859
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIV 797
+ N R G + ++P Q RC R++
Sbjct: 860 DDNKRIKGE-QSLNPEDQ--RRCERVL 883
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 68/174 (39%), Gaps = 48/174 (27%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 413
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
AG E D + C + GG +LC CD C
Sbjct: 414 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 447
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPET 883
YH CL L E+P G W C C+ C I + K++ ++ PET
Sbjct: 448 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKECPET 496
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
++ C +C DGGDLLCCDSCP +H C++ P IP G W C C+
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRCI 53
>gi|351713762|gb|EHB16681.1| Chromodomain-helicase-DNA-binding protein 5 [Heterocephalus glaber]
Length = 2263
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 21/135 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVE-- 769
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C+ +F+ + V
Sbjct: 409 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCVPSFEAQGCVLSV 468
Query: 770 ------YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF-CKSRF-- 820
Y A R+ + Q VSR + Q GG + GR + C ++
Sbjct: 469 LGGALCYPATLRSPSILG-----PQCVSRLMETGQCTRVGPGGSLARGGRGWPCDNKLTA 523
Query: 821 ---GRRTVILCDQCE 832
G RT + C E
Sbjct: 524 KCEGPRTAMSCSGPE 538
Score = 40.0 bits (92), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 813 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 858
+D+C+ +ILCD C R YH+ CL +L++ P+GKW C
Sbjct: 411 QDYCEVCQQGGEIILCDTCPRAYHLVCLD----PELEKAPEGKWSC 452
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C+ +L IP G W C C
Sbjct: 433 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 372 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 431
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 432 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 452
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 453 PSAYHTFCLN----PALDTIPDGDWRC 475
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 758
+ + C +C DGG+LLCCDSCP A+H C+ +L IP G W C C
Sbjct: 291 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 230 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 289
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
+ Q R + GG +LC CD C
Sbjct: 290 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 310
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L +P G W C
Sbjct: 311 PSAYHTFCLN----PALDTIPDGDWRC 333
>gi|225543329|ref|NP_001098590.2| uncharacterized protein LOC550611 isoform 2 [Danio rerio]
Length = 386
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 744
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 197 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 255
Query: 745 LPGIP---SGTWHCRYCM 759
LP +P SG W C C+
Sbjct: 256 LPAVPEDSSGKWSCIICV 273
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 73/198 (36%), Gaps = 48/198 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 412
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
AG E D + C + GG +LC CD C
Sbjct: 413 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 446
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 889
YH CL L E+P G W C C+ C I + K++ + PET +
Sbjct: 447 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKDCPETPSEEPS 501
Query: 890 KKHEESGSDNAVDFDVRW 907
+F V+W
Sbjct: 502 TSKATPKQRRMREFFVKW 519
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 73/198 (36%), Gaps = 48/198 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 413
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
AG E D + C + GG +LC CD C
Sbjct: 414 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 447
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 889
YH CL L E+P G W C C+ C I + K++ + PET +
Sbjct: 448 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKDCPETPSEEPS 502
Query: 890 KKHEESGSDNAVDFDVRW 907
+F V+W
Sbjct: 503 TSKATPKQRRMREFFVKW 520
>gi|157108665|ref|XP_001650336.1| fetal alzheimer antigen, falz [Aedes aegypti]
gi|108868530|gb|EAT32755.1| AAEL015015-PA [Aedes aegypti]
Length = 2722
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
DD C IC GDLLCC++CP FH++CV P +PS W C C
Sbjct: 357 DDHCRICHRLGDLLCCETCPAVFHLECVDPPLVDVPSEDWQCNLC 401
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 718 LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 760
C C GGDL+CC+SCP A+H C+ +P G W CR C+N
Sbjct: 997 WCFTCSLGGDLICCESCPAAYHAKCLGFDSVPDGNWFCRDCVN 1039
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 714 ENDDLCGICMDGGDLLCCD--SCPRAFHIDCVSLPGIPSGTWHCRY 757
+++D C C +GG+L CD +CP+A+H+DC+ L P G W C +
Sbjct: 1411 KHEDYCFRCAEGGELTMCDVKTCPKAYHLDCLGLTKQPYGKWQCPW 1456
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 59/150 (39%), Gaps = 23/150 (15%)
Query: 707 ERPFS---SKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYCMNTF 762
ERP + + +++C +C G LL C+ C AFH+DC+ L PSG++ C C++
Sbjct: 819 ERPAGGGGAAKKENVCQVCERTGQLLLCEGGCCGAFHLDCIGLQVAPSGSFRCDECISGV 878
Query: 763 Q------------KEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 810
+ V G I P + SR L CV C
Sbjct: 879 HSCFVCKLSDQEVRRCHVPVCGKYYHEGCIRRF-PLTRFDSRGFTC------PLHACVAC 931
Query: 811 RGRDFCKSRFGRRTVILCDQCEREYHVGCL 840
+ ++ R ++ C +C YH G L
Sbjct: 932 FADNPKSTKASRGRLMRCVRCPTAYHQGDL 961
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 443
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A + D + C +C+ ++ CD C
Sbjct: 444 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 485
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 524
>gi|356510796|ref|XP_003524120.1| PREDICTED: uncharacterized protein LOC100793933 [Glycine max]
Length = 2325
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 759
C IC GG+LLCCDSCPR +H+ C+ P IP+G W C C
Sbjct: 73 CVICDVGGNLLCCDSCPRTYHLQCLDPPLKRIPNGKWQCPSCF 115
>gi|255544538|ref|XP_002513330.1| conserved hypothetical protein [Ricinus communis]
gi|223547238|gb|EEF48733.1| conserved hypothetical protein [Ricinus communis]
Length = 602
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
C IC +GGDLLCCD+CP +H+ C++ P +PSG W C C
Sbjct: 64 CVICDNGGDLLCCDTCPGTYHLQCLTPPLELVPSGNWQCENC 105
>gi|225543325|ref|NP_001017912.2| uncharacterized protein LOC550611 isoform 1 [Danio rerio]
Length = 408
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 744
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 745 LPGIP---SGTWHCRYCM 759
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCIICV 295
>gi|189236075|ref|XP_972162.2| PREDICTED: similar to Dual specificity
tyrosine-phosphorylation-regulated kinase [Tribolium
castaneum]
Length = 2981
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP---SGTWHCRYCMN 760
N+D C +CMDGG+L+CCD CP+ FH C +P + + TW C C N
Sbjct: 2783 NEDWCAVCMDGGELVCCDKCPKVFHQYC-HIPNLSVEENDTWQCLLCTN 2830
>gi|195048447|ref|XP_001992528.1| GH24800 [Drosophila grimshawi]
gi|193893369|gb|EDV92235.1| GH24800 [Drosophila grimshawi]
Length = 887
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N DLC C +GG+LLCCD CP +FH+ C P IP+G W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPTGQWLCHSC 99
>gi|312373820|gb|EFR21503.1| hypothetical protein AND_16980 [Anopheles darlingi]
Length = 1342
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 37/77 (48%), Gaps = 12/77 (15%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCV-----SLPGIPSGTWHCRYCMNTFQKEKFVE 769
N+D C +CMDGG+L+CCD CP+ FH C SLP S TW C C N F +
Sbjct: 1006 NEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDE-SETWQCLLCYN------FAD 1058
Query: 770 YNANARAAGRIEGVDPF 786
R G+ P
Sbjct: 1059 LPPEPTGEKRNVGITPL 1075
>gi|62471470|gb|AAH93591.1| Zgc:113411 [Danio rerio]
gi|182889002|gb|AAI64509.1| Zgc:113411 protein [Danio rerio]
Length = 408
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 685 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 744
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 745 LPGIP---SGTWHCRYCM 759
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCIICV 295
>gi|159162702|pdb|1MM3|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b With C-
Terminal Loop Replaced By Corresponding Loop From Wstf
Length = 61
Score = 60.5 bits (145), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 709 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 758
P S + + C +C DGG+LLCCD+CP ++HI C+ +L +P G W C C
Sbjct: 2 PLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLRPALYEVPDGEWQCPRC 53
>gi|347965727|ref|XP_321817.5| AGAP001328-PA [Anopheles gambiae str. PEST]
gi|333470378|gb|EAA01183.5| AGAP001328-PA [Anopheles gambiae str. PEST]
Length = 1278
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 37/77 (48%), Gaps = 12/77 (15%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCV-----SLPGIPSGTWHCRYCMNTFQKEKFVE 769
N+D C +CMDGG+L+CCD CP+ FH C SLP S TW C C N F +
Sbjct: 1078 NEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDE-SETWQCLLCYN------FAD 1130
Query: 770 YNANARAAGRIEGVDPF 786
R G+ P
Sbjct: 1131 LPPEPTGEKRNVGITPL 1147
>gi|321453629|gb|EFX64846.1| hypothetical protein DAPPUDRAFT_229741 [Daphnia pulex]
Length = 1431
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 44/72 (61%), Gaps = 6/72 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQ---KEKFVE 769
++++C +C +GG+++CCD+CP +H++C++ P +P G W C C Q + K E
Sbjct: 1149 DNEICPVCQEGGEVICCDTCPAVYHLECINPPLRKVPRGKWSCPQCKTPPQDRERGKLRE 1208
Query: 770 YNANARA-AGRI 780
N++ R + RI
Sbjct: 1209 KNSDGRTGSARI 1220
Score = 47.4 bits (111), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 77/207 (37%), Gaps = 50/207 (24%)
Query: 719 CGICMDGGD---LLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYNAN 773
C IC GD +L CD C + H+ C+ L +P G W C C K E
Sbjct: 1053 CRICRRKGDPEKMLLCDGCDKGHHMYCLKPLLTVVPEGDWFCAEC-------KPREKPRT 1105
Query: 774 ARAAGRI----------------EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 817
R A R+ G + V++ D E+ C +C+
Sbjct: 1106 PRKARRVFSEESEDEAVEEEEADAGEEEAEVEAEEGESDVESVDNEI--CPVCQEGG--- 1160
Query: 818 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 877
VI CD C YH+ C+ L+++P+GKW C CK DR
Sbjct: 1161 ------EVICCDTCPAVYHLECIN----PPLRKVPRGKW-SCPQCK------TPPQDRER 1203
Query: 878 EKLPETSLDVIKKKHEESGSDNAVDFD 904
KL E + D S + +A+DFD
Sbjct: 1204 GKLREKNSDGRTGSARISRTRHAIDFD 1230
>gi|292621498|ref|XP_002664670.1| PREDICTED: PHD finger protein 12 [Danio rerio]
Length = 939
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 54 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSREMLPPGDWMCHRC 101
>gi|325188434|emb|CCA22970.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 609
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 711 SSKEND---DLCGICMDGGDLLCCDSCPRAFHIDCVSL--PGIPSGTWHCRYC 758
SS E+D D C IC+ GD +CCD CPR+FH++C+ L +P G W C C
Sbjct: 431 SSDEHDTNLDYCSICLKDGDFVCCDICPRSFHLECLRLRQEDLPKGEWQCEGC 483
>gi|359472760|ref|XP_003631193.1| PREDICTED: uncharacterized protein LOC100247555 [Vitis vinifera]
Length = 2355
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
C IC GG+LLCCDSCPR +H+ C++ P IP+G W C C
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKC 119
Score = 39.7 bits (91), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 27/62 (43%), Gaps = 21/62 (33%)
Query: 803 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 862
+LGG +LC CD C R YH+ CL L+ +P GKW C C
Sbjct: 82 DLGGNLLC-----------------CDSCPRTYHLQCLN----PPLKRIPNGKWQCPKCC 120
Query: 863 KR 864
++
Sbjct: 121 QK 122
>gi|296420656|ref|XP_002839885.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636091|emb|CAZ84076.1| unnamed protein product [Tuber melanosporum]
Length = 990
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 47/187 (25%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEK----- 766
EN+D+C +C G LCC+ CPR+FH C++ P +P G W C C
Sbjct: 589 ENEDVCAVCNGPGRFLCCERCPRSFHFTCLNPPLEEVPEGMWFCNKCTTQHNPPPKPPRG 648
Query: 767 -FVEY--NANARAAGRIE----------GVD--PFAQMVSRCIRIVQT--------P--- 800
FV+ N N R GV+ P + V +R +T P
Sbjct: 649 LFVDLLDNINRRNPSSFRLPSEIRNYFVGVETGPLGEYVD--VRDNKTQRFGRSGFPEEY 706
Query: 801 DT-----ELGGCVLCRGRDFCK--SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 853
DT + G +LC C+ + G+R +I CD C +H+ C+ M + K
Sbjct: 707 DTHRLKDKSGNIILCHS---CRRSAADGQR-IISCDFCPLNWHLDCVSPMPMPNPPPAQK 762
Query: 854 GKWLCCA 860
KW+C A
Sbjct: 763 -KWMCPA 768
>gi|427786979|gb|JAA58941.1| Putative sin3-type complex [Rhipicephalus pulchellus]
Length = 533
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 67/188 (35%), Gaps = 51/188 (27%)
Query: 707 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---- 758
RP N D C C +GGDL+CCD CP FH+ C P +PSG W C C
Sbjct: 37 HRPTGRAVNHDCCDSCKEGGDLICCDRCPATFHLQCHDPPLDEESLPSGEWICHRCTVMA 96
Query: 759 --------------------------MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 792
+N Q E E + + G + P +R
Sbjct: 97 AASSGPASNSSEASSALHTLVAAASLLNAQQFELPNELSCHIALPGSSKRPRP-----AR 151
Query: 793 CIRIVQTPDTELG--GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQE 850
R P L C CR R+ +I CD C +H CL+ L
Sbjct: 152 GRRHDLEPPVPLPLRVCFTCRR----SCRWA--PLIQCDYCPLLFHADCLE----FPLAS 201
Query: 851 LPKGKWLC 858
LP G+W+C
Sbjct: 202 LPTGRWMC 209
>gi|432109785|gb|ELK33837.1| Autoimmune regulator, partial [Myotis davidii]
Length = 374
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 700 LSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 746
L + LS E P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P
Sbjct: 240 LPLPLSSE-PQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPP 285
>gi|195055097|ref|XP_001994457.1| GH17187 [Drosophila grimshawi]
gi|193892220|gb|EDV91086.1| GH17187 [Drosophila grimshawi]
Length = 1128
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 892 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 941
>gi|195391037|ref|XP_002054172.1| GJ22942 [Drosophila virilis]
gi|194152258|gb|EDW67692.1| GJ22942 [Drosophila virilis]
Length = 1119
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|18858131|ref|NP_572311.1| CG3815 [Drosophila melanogaster]
gi|17861444|gb|AAL39199.1| GH06635p [Drosophila melanogaster]
gi|22831814|gb|AAF46150.2| CG3815 [Drosophila melanogaster]
gi|220944994|gb|ACL85040.1| CG3815-PA [synthetic construct]
gi|220954818|gb|ACL89952.1| CG3815-PA [synthetic construct]
Length = 878
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|281352213|gb|EFB27797.1| hypothetical protein PANDA_013886 [Ailuropoda melanoleuca]
Length = 640
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 68/164 (41%), Gaps = 31/164 (18%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 771 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 830
+ + C R+ + GG +LC CD
Sbjct: 437 EEYEEEGEDEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDA 473
Query: 831 CEREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 873
C YH+ CL L ++P G+WLC C + +QK++
Sbjct: 474 CISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|442620095|ref|NP_001262765.1| bonus, isoform C [Drosophila melanogaster]
gi|440217666|gb|AGB96145.1| bonus, isoform C [Drosophila melanogaster]
Length = 1207
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|198467663|ref|XP_001354469.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
gi|198149350|gb|EAL31522.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
Length = 951
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 50 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 98
>gi|195340386|ref|XP_002036794.1| GM12479 [Drosophila sechellia]
gi|194130910|gb|EDW52953.1| GM12479 [Drosophila sechellia]
Length = 876
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|298707919|emb|CBJ30305.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1534
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCM---NTFQKEKFV 768
+++DLC +C GGDLLCCD+C FH C L +P+G W+C++C+ E
Sbjct: 108 DHNDLCEVCSHGGDLLCCDTCSLVFHTKCHRPELKEVPAGDWNCQFCVADSTHVPPEDRA 167
Query: 769 EYNANARAAGRIE 781
E N A R++
Sbjct: 168 EAQRNVAAMRRVQ 180
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 11/61 (18%)
Query: 801 DTELGGCVLCRGRDFCKSRFGRRT---VILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 857
D E+ C LC C+ +G++ +LC+ C+ EYH CL L E+PKGKW
Sbjct: 341 DDEVMMCPLC----SCRKCYGKQDPELALLCEHCDDEYHTYCLD----PPLTEVPKGKWY 392
Query: 858 C 858
C
Sbjct: 393 C 393
>gi|195569415|ref|XP_002102705.1| GD19361 [Drosophila simulans]
gi|194198632|gb|EDX12208.1| GD19361 [Drosophila simulans]
Length = 1140
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|195113037|ref|XP_002001076.1| GI10584 [Drosophila mojavensis]
gi|193917670|gb|EDW16537.1| GI10584 [Drosophila mojavensis]
Length = 1121
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 886 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 935
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 758
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1529 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1587
>gi|6631000|gb|AAF19646.1|AF210315_1 bonus [Drosophila melanogaster]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|24648457|ref|NP_524724.2| bonus, isoform A [Drosophila melanogaster]
gi|23176008|gb|AAF55786.2| bonus, isoform A [Drosophila melanogaster]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|195498245|ref|XP_002096441.1| GE25673 [Drosophila yakuba]
gi|194182542|gb|EDW96153.1| GE25673 [Drosophila yakuba]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|195353945|ref|XP_002043462.1| GM23122 [Drosophila sechellia]
gi|194127603|gb|EDW49646.1| GM23122 [Drosophila sechellia]
Length = 1131
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|194896219|ref|XP_001978436.1| GG19583 [Drosophila erecta]
gi|190650085|gb|EDV47363.1| GG19583 [Drosophila erecta]
Length = 876
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|194744148|ref|XP_001954557.1| GF16683 [Drosophila ananassae]
gi|190627594|gb|EDV43118.1| GF16683 [Drosophila ananassae]
Length = 1183
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 895 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 944
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 758
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1426 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1484
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|260943552|ref|XP_002616074.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
gi|238849723|gb|EEQ39187.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
Length = 766
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 64/167 (38%), Gaps = 34/167 (20%)
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC---MNTF 762
+P + +NDD C C G+ +CCDSCP++FH+ C P +P W+C C
Sbjct: 227 QPEDTTQNDDYCATCGGTGEFICCDSCPKSFHLLCCGPPLREVPEDNWNCSECRAAQGMT 286
Query: 763 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR------IVQTPDTELGGCVLCRGR--- 813
Q+ + + G +P + R +R + P + L R
Sbjct: 287 QRRSWNSVGVFGALLNALHGRNPREYCLPRRLRENTFIDVSAGPSGQYTDASLKPERTSR 346
Query: 814 ------DFCKSRFGR--------------RTVILCDQCEREYHVGCL 840
D R GR +T++ CD C ++H+ CL
Sbjct: 347 ADELDIDALYDRAGRPFLCHRCRMSGRANKTLVACDYCPLKWHLDCL 393
>gi|113195649|ref|NP_001037823.1| zinc finger protein LOC723799 [Ciona intestinalis]
gi|92081516|dbj|BAE93305.1| zinc finger protein [Ciona intestinalis]
Length = 1298
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNT 761
N + C C +GGDLLCCD CP AFH+ C + P +P G W C C+ T
Sbjct: 53 NHETCDACGEGGDLLCCDFCPAAFHLQCCNPPLDEDKVPHGEWACHRCVVT 103
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 758
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1426 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1484
>gi|281362149|ref|NP_001163665.1| bonus, isoform B [Drosophila melanogaster]
gi|159884207|gb|ABX00782.1| RE48191p [Drosophila melanogaster]
gi|272477073|gb|ACZ94961.1| bonus, isoform B [Drosophila melanogaster]
Length = 1125
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|2660669|gb|AAC79140.1| human Mi-2 autoantigen-like protein [Arabidopsis thaliana]
gi|9758384|dbj|BAB08833.1| helicase-like protein [Arabidopsis thaliana]
Length = 2228
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
C IC GGDLLCCDSCPR +H C++ P IP+G W C C
Sbjct: 64 CVICDLGGDLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKC 105
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1519 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1561
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1927 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1974
>gi|442620101|ref|NP_001262768.1| bonus, isoform F [Drosophila melanogaster]
gi|440217669|gb|AGB96148.1| bonus, isoform F [Drosophila melanogaster]
Length = 1122
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 886 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 935
>gi|442620099|ref|NP_001262767.1| bonus, isoform E [Drosophila melanogaster]
gi|440217668|gb|AGB96147.1| bonus, isoform E [Drosophila melanogaster]
Length = 1125
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|442620097|ref|NP_001262766.1| bonus, isoform D [Drosophila melanogaster]
gi|440217667|gb|AGB96146.1| bonus, isoform D [Drosophila melanogaster]
Length = 1119
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 882 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 931
>gi|405957765|gb|EKC23951.1| PHD finger protein 12 [Crassostrea gigas]
Length = 913
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C P +P G W C C
Sbjct: 49 NHDSCDSCKEGGDLLCCDWCPAAFHLQCHDPPLEEDDVPPGEWRCHRC 96
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|348543602|ref|XP_003459272.1| PREDICTED: PHD finger protein 12-like [Oreochromis niloticus]
Length = 1018
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1422 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1464
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1830 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1877
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1708 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1750
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2116 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2163
>gi|156379809|ref|XP_001631648.1| predicted protein [Nematostella vectensis]
gi|156218692|gb|EDO39585.1| predicted protein [Nematostella vectensis]
Length = 824
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYC 758
+ D C +C DGG LLCCD CP A+H+ C+ P +P+G W C C
Sbjct: 10 HSDYCRVCKDGGQLLCCDKCPMAYHLKCLIPPMMRVPTGEWKCPRC 55
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1398 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1440
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1806 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1853
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 701 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 758
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1220 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1278
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|334188199|ref|NP_199293.3| chromatin remodeling 4 protein [Arabidopsis thaliana]
gi|332007781|gb|AED95164.1| chromatin remodeling 4 protein [Arabidopsis thaliana]
Length = 2223
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 719 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
C IC GGDLLCCDSCPR +H C++ P IP+G W C C
Sbjct: 78 CVICDLGGDLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKC 119
>gi|270005635|gb|EFA02083.1| hypothetical protein TcasGA2_TC007718 [Tribolium castaneum]
Length = 980
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP---SGTWHCRYCMN 760
N+D C +CMDGG+L+CCD CP+ FH C +P + + TW C C N
Sbjct: 782 NEDWCAVCMDGGELVCCDKCPKVFHQYC-HIPNLSVEENDTWQCLLCTN 829
>gi|194763533|ref|XP_001963887.1| GF21258 [Drosophila ananassae]
gi|190618812|gb|EDV34336.1| GF21258 [Drosophila ananassae]
Length = 876
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 29/163 (17%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 307
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A + + D + C +C+ ++ CD C
Sbjct: 308 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 350
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C + +QK++
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|348577397|ref|XP_003474471.1| PREDICTED: nuclear body protein SP140-like [Cavia porcellus]
Length = 652
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 666 SEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---ENDDLCGIC 722
S +SP +FE G AS + ++ +G +L EL K L P S K N + C +C
Sbjct: 424 SLLSPREFEVAGGRASSKN-WKISVRCHGWTLRELMQKEYLPIPPSKKPKLRNSNECEVC 482
Query: 723 MDGGDLLCCDSCPRAFHIDC-VSLPGIPSGTWHCRYC 758
DGG L CCD+C RAFH C + L + W C +C
Sbjct: 483 RDGGTLFCCDTCSRAFHEYCHLPLVEVERNPWSCIFC 519
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1710 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2118 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2165
>gi|194899728|ref|XP_001979410.1| GG24000 [Drosophila erecta]
gi|190651113|gb|EDV48368.1| GG24000 [Drosophila erecta]
Length = 1131
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|357621017|gb|EHJ73003.1| putative fetal alzheimer antigen, falz [Danaus plexippus]
Length = 2482
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 716 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
DD C +C GDLLCC++CP FH++CV P +PS W C C
Sbjct: 345 DDHCRVCHRLGDLLCCETCPAVFHLECVDPPLVNVPSEDWQCALC 389
>gi|223997928|ref|XP_002288637.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975745|gb|EED94073.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 646
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 36/66 (54%), Gaps = 10/66 (15%)
Query: 699 ELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV------SLPGIPSGT 752
EL I L E+P S N + C IC DGG L+CCD+C R +H C+ +LP +
Sbjct: 50 ELGIALKSEKPGSRGGNQNECAICEDGGKLICCDNCDRVYHATCLRIVDVDTLPDV---- 105
Query: 753 WHCRYC 758
WHC C
Sbjct: 106 WHCPKC 111
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1601 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1643
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C DGG L+ C CP+ +H DC++L P+G W C +
Sbjct: 2009 TKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2056
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|195449701|ref|XP_002072186.1| GK22457 [Drosophila willistoni]
gi|194168271|gb|EDW83172.1| GK22457 [Drosophila willistoni]
Length = 1120
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 29/163 (17%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 307
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
A + + D + C +C+ ++ CD C
Sbjct: 308 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 350
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 873
YH+ CL L ++P G+WLC C + +QK++
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1400 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1442
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1808 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1855
>gi|74218224|dbj|BAE43222.1| unnamed protein product [Mus musculus]
Length = 239
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1698 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1740
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2106 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2153
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 30/53 (56%)
Query: 706 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
L F+ NDD C +C GDLLCCD CP +FH C+ + +P G W C C
Sbjct: 1043 LSHEFAEIVNDDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPEC 1095
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1337 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1379
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1745 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1792
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1440 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1482
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1848 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1895
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1603 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2011 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2058
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1350 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1747 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1794
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1603 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2011 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2058
>gi|340522049|gb|EGR52282.1| predicted protein [Trichoderma reesei QM6a]
Length = 1069
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 40/163 (24%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL---PGIPSGTWHCRYCM----------- 759
+ND+ C C + GD++CCD CPR+FH +CV + +P W+C C+
Sbjct: 632 DNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWYCNECLVRRFPSRVPVY 690
Query: 760 --------NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT---------PD- 801
N +K ++ + R EGV A + V+ PD
Sbjct: 691 KGAFASALNALEKSIPRAFSLPKKIQTRFEGVKAGADGDYEEVTTVKAKKRSGYDELPDF 750
Query: 802 ---TELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 840
E G VLC C K+ R +I C C +H+ CL
Sbjct: 751 FKQREDGEAVLCHA---CQKAATEIRAIIPCSACPLHWHIDCL 790
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|255713490|ref|XP_002553027.1| KLTH0D07040p [Lachancea thermotolerans]
gi|238934407|emb|CAR22589.1| KLTH0D07040p [Lachancea thermotolerans CBS 6340]
Length = 674
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 39/80 (48%), Gaps = 12/80 (15%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 769
ENDD C C G LCCD+CP++FH C + P +P G W C C +F
Sbjct: 282 ENDDFCASCRQPGIFLCCDTCPKSFHFACCNPPLDPDNLPEGDWSCAEC-------QFRI 334
Query: 770 YNANARAAGRIEGVDPFAQM 789
N AA R+E D AQ+
Sbjct: 335 RCPNKAAAHRLEK-DYLAQL 353
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1396 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1438
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C DGG L+ C CP+ +H DC++L P+G W C +
Sbjct: 1804 TKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1851
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 32/53 (60%), Gaps = 6/53 (11%)
Query: 712 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 758
+ E+DD C IC DGG+LLCCDSC A+H C++ P IP G W C C
Sbjct: 422 AAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 474
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 54/147 (36%), Gaps = 45/147 (30%)
Query: 714 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 771
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 414
Query: 772 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 831
G E D + C + GG +LC CD C
Sbjct: 415 EGEGITGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 448
Query: 832 EREYHVGCLKDHGMEDLQELPKGKWLC 858
YH CL L E+P G W C
Sbjct: 449 TSAYHTHCLN----PPLSEIPDGDWKC 471
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1350 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
>gi|427788397|gb|JAA59650.1| Putative bromodomain adjacent to zinc finger domain protein 1a
[Rhipicephalus pulchellus]
Length = 1568
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQK 764
N+D C C GG L+CCDSCP A+H++C P +P G W+C C Q+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCTVAKQR 1314
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 860
CR C+ R ++LCD C+R +H+ CLK L+E+PKG W C +
Sbjct: 1158 AYCR---ICRRRRDPERMLLCDGCDRGHHLYCLK----PPLEEIPKGDWYCIS 1203
>gi|427788395|gb|JAA59649.1| Putative atp-dependent chromatin assembly factor large subunit
[Rhipicephalus pulchellus]
Length = 1568
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQK 764
N+D C C GG L+CCDSCP A+H++C P +P G W+C C Q+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCTVAKQR 1314
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 808 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 860
CR C+ R ++LCD C+R +H+ CLK L+E+PKG W C +
Sbjct: 1158 AYCR---ICRRRRDPERMLLCDGCDRGHHLYCLK----PPLEEIPKGDWYCIS 1203
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1436 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1478
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1844 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1891
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1437 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 47.4 bits (111), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1845 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1892
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1437 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 47.4 bits (111), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1845 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1892
>gi|432894203|ref|XP_004075956.1| PREDICTED: PHD finger protein 12-like [Oryzias latipes]
Length = 865
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 51 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 98
Score = 43.1 bits (100), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 47/110 (42%), Gaps = 34/110 (30%)
Query: 667 EVSPSQFE---------AHAGWASRR-------KPF---QHIYTSNGVSLHELSIKLSLE 707
E +P+QF+ A G + RR KPF QH +NG L L +K+
Sbjct: 220 ERNPTQFQLPSELTCTTALPGSSKRRRKEELLGKPFRRPQHELDTNG--LVPLPVKVCF- 276
Query: 708 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHC 755
LC L+ CD CP FH+DC+ P G+P+G W C
Sbjct: 277 ----------LCNKSCRLAPLIQCDYCPLLFHMDCLDPPLTGLPAGRWMC 316
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1441 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1483
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1849 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1896
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1438 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1846 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1893
>gi|198449950|ref|XP_001357792.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
gi|198130826|gb|EAL26927.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
Length = 1128
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|403373758|gb|EJY86802.1| PHD domain containing protein [Oxytricha trifallax]
Length = 988
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGTWHCRYC 758
N D+CGIC G L+CCD CP AFH DC+ P G W C +C
Sbjct: 576 NADVCGICDLPGKLICCDDCPSAFHADCLGYEKQCPRGKWKCYFC 620
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1438 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1846 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1893
>gi|348666072|gb|EGZ05900.1| hypothetical protein PHYSODRAFT_342104 [Phytophthora sojae]
Length = 1134
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 14/86 (16%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 772
N D C +C GDL+CCD CPR++H+ C+ + +P G W C C + +F Y+
Sbjct: 933 NLDFCEVCQRAGDLVCCDKCPRSYHLKCLHMTENDLPEGDWQCAECK---KPSRFDAYSV 989
Query: 773 NARAAGRIEGVDPFAQMVSRCIRIVQ 798
V ++ RC++IV+
Sbjct: 990 ---------AVASEQSLLDRCLKIVE 1006
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%), Gaps = 9/53 (16%)
Query: 814 DFCK--SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 864
DFC+ R G ++ CD+C R YH+ CL H E+ +LP+G W CA+CK+
Sbjct: 935 DFCEVCQRAG--DLVCCDKCPRSYHLKCL--HMTEN--DLPEGDWQ-CAECKK 980
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1710 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2118 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2165
>gi|195158907|ref|XP_002020325.1| GL13567 [Drosophila persimilis]
gi|194117094|gb|EDW39137.1| GL13567 [Drosophila persimilis]
Length = 1053
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 760
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|410930175|ref|XP_003978474.1| PREDICTED: PHD finger protein 12-like [Takifugu rubripes]
Length = 1011
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
Score = 40.0 bits (92), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 53/123 (43%), Gaps = 35/123 (28%)
Query: 654 KKGFGILCTCCNSEVSPSQFE---------AHAGWASRR-------KPF---QHIYTSNG 694
K+ F +L E +P+QF+ A G + RR KPF QH SNG
Sbjct: 227 KRPFQLLIAAA-MERNPTQFQLPSELTCTTALPGSSKRRRKEELLGKPFRRPQHELDSNG 285
Query: 695 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGT 752
L L +K+ FS + L L+ CD CP FH+DC+ P +P+G
Sbjct: 286 --LVPLPVKIC----FSCNRSCRL-------APLIQCDYCPLLFHMDCLDPPLTALPAGK 332
Query: 753 WHC 755
W C
Sbjct: 333 WMC 335
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1401 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1443
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 712 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 757
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1809 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1856
>gi|363741325|ref|XP_415826.3| PREDICTED: PHD finger protein 12 [Gallus gallus]
Length = 1084
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 130 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 177
>gi|294658848|ref|XP_461185.2| DEHA2F19338p [Debaryomyces hansenii CBS767]
gi|202953432|emb|CAG89573.2| DEHA2F19338p [Debaryomyces hansenii CBS767]
Length = 795
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 71/196 (36%), Gaps = 54/196 (27%)
Query: 712 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM---NTFQKEK 766
+K NDD C C G +CCD+CP++FH C + P P WHC+ C+ N K+
Sbjct: 329 TKNNDDFCTNCGGPGVFICCDTCPKSFHFTCCNPPLEECPEDNWHCQECVIKQNPGFKKT 388
Query: 767 FVEYNANARAAGRIEGVDPFAQMVSRCIR------------------------------- 795
+ + + EG +P + R IR
Sbjct: 389 YNHIGIFGQLLNQSEGRNPKEFQLPRKIRDNSFIGVTTGENGVYQDATFKPEVSYTKMNE 448
Query: 796 -----IVQTPDTELGGCVLCRGRDFCKSRFGR-----RTVILCDQCEREYHVGCLKDHGM 845
+ D ++ G G + + G +T+ CD C +H+ CL
Sbjct: 449 YQIIGFNKNTDLDVDGLYDKNGNPYLCHKCGLSGLKGKTLTHCDYCPLVWHIDCL----- 503
Query: 846 EDLQELPK---GKWLC 858
ED +PK KW C
Sbjct: 504 EDPLCIPKTLGSKWRC 519
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 758
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1155 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1197
>gi|75677357|ref|NP_001028733.1| PHD finger protein 12 isoform 1 [Homo sapiens]
gi|71153050|sp|Q96QT6.2|PHF12_HUMAN RecName: Full=PHD finger protein 12; AltName: Full=PHD factor 1;
Short=Pf1
gi|119571550|gb|EAW51165.1| PHD finger protein 12, isoform CRA_b [Homo sapiens]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|158256276|dbj|BAF84109.1| unnamed protein product [Homo sapiens]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|410352467|gb|JAA42837.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|397483106|ref|XP_003812746.1| PREDICTED: PHD finger protein 12 [Pan paniscus]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|410220518|gb|JAA07478.1| PHD finger protein 12 [Pan troglodytes]
gi|410294710|gb|JAA25955.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|426348933|ref|XP_004042075.1| PREDICTED: PHD finger protein 12 [Gorilla gorilla gorilla]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|291405467|ref|XP_002718962.1| PREDICTED: PHD finger protein 12 [Oryctolagus cuniculus]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|410267244|gb|JAA21588.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|297700404|ref|XP_002827236.1| PREDICTED: PHD finger protein 12 [Pongo abelii]
Length = 1004
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|296202152|ref|XP_002748276.1| PREDICTED: PHD finger protein 12 [Callithrix jacchus]
Length = 1003
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|403279921|ref|XP_003931489.1| PREDICTED: PHD finger protein 12 [Saimiri boliviensis boliviensis]
Length = 1004
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 715 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 758
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.133 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,984,060,668
Number of Sequences: 23463169
Number of extensions: 749556555
Number of successful extensions: 2530489
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3174
Number of HSP's successfully gapped in prelim test: 18305
Number of HSP's that attempted gapping in prelim test: 2301327
Number of HSP's gapped (non-prelim): 128216
length of query: 1087
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 933
effective length of database: 8,745,867,341
effective search space: 8159894229153
effective search space used: 8159894229153
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 83 (36.6 bits)