BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001383
(1088 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255565495|ref|XP_002523738.1| protein binding protein, putative [Ricinus communis]
gi|223537042|gb|EEF38678.1| protein binding protein, putative [Ricinus communis]
Length = 1042
Score = 919 bits (2374), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 537/1122 (47%), Positives = 697/1122 (62%), Gaps = 114/1122 (10%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANG D+ + +V +++R GLKREF FA K SEI SLGRTRA + +S GCV P
Sbjct: 1 MANGKDNNKDMLV-AEVRPGLKREFAFAFKALSEISRSLGRTRASRTRSG--GGCV-SPA 56
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERP----IREKESKD 116
K K + +K +K ++ E+K EE V SDV ++ + E + E S
Sbjct: 57 SNNKKKRLKGKKDQK-------ARDLEEKEEENV-SDVVELGSGDEEATIGGLIESVSVS 108
Query: 117 DSENMGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKY 176
D+E G G G ++ V+E S EE EK+NE +EVL N
Sbjct: 109 DTEING-GNNGEIVEVKENGAGSMCLEE-------------TEKRNEHEEVLKN------ 148
Query: 177 GELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENE---HSEVDREKAEND 233
+ EE ++ + VE+E +NE ++ + E D K E+
Sbjct: 149 ---------DQCEERRSRSLPYSMKVEDESKNESDRTCEEIVSGSVPILMEEDSRKLEDV 199
Query: 234 LIGE---VKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRV---QMEM 287
I E +NE EEV+ D+ R A ++ CEE G + + +
Sbjct: 200 TIKEEIPKRNEPEEVLG------NDDLKRYADGNDQ--CEERISGSSPNSMNIDNFENQN 251
Query: 288 DEEKKNDIERELVENGVLESSM---------VGKHSSTLCNGESNVAKSVAVDGNDEGKT 338
E KN++E+ N +LES S L N E S+ + ND K
Sbjct: 252 GEHSKNEMEKVTAMNELLESKSDMNNVNEEGTSMSSVILMNSEGGAIDSLPI--NDSTK- 308
Query: 339 VNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQE---TVMK 395
V E+P+RRFTRSLL+ K+E+ + KD + A D G P T++K
Sbjct: 309 --VEKEKPMRRFTRSLLKPKMEIGQEYAVKD-------SSSAADDAGSPSAASNSGTMLK 359
Query: 396 PRK--VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDC 453
K +KF +KLK+ L+SGILEG V Y+RGSK +G G + L+GV+ GS I CFC C
Sbjct: 360 VWKNDTSKKFPTKLKDLLDSGILEGQQVKYMRGSKARGAGETVLQGVISGSAILCFCRSC 419
Query: 454 KGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS 513
+GN+VVTP++FE+HAGS+NKRPPEYIYLENG TLRD+MN CK++ LETL++A+ + G S
Sbjct: 420 RGNEVVTPSIFEVHAGSANKRPPEYIYLENGNTLRDVMNACKNASLETLDEALWLSTGCS 479
Query: 514 SMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSV 573
S+K + FCL CR + A M LC C+ LK+SQA P +
Sbjct: 480 SLKNSTFCLKCRGKLAEASTGRSMTLCSQCMVLKDSQASI----------------PATT 523
Query: 574 ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMK--SSSVKSHGKITRKDLRMHKLVFEE 631
+ ++ A S +P+ + SS S +K +S KS G++T KDLRMHKLVFEE
Sbjct: 524 DTDKGYAESDVCAYRIVLTPKSHPVSKSSDSVLKCSTSRSKSQGRLTVKDLRMHKLVFEE 583
Query: 632 GGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY 691
L DG EV Y+ RG+ K LVGYKKGFGI C+CCN+EVSPSQFEAHAGWASRRKP+ HIY
Sbjct: 584 DVLPDGTEVAYYSRGQ-KLLVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIY 642
Query: 692 TSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS 751
TSNGVSLHEL+I LS R FS+ +NDDLC IC DGGDLLCCD CPRA+H DC++LP IP+
Sbjct: 643 TSNGVSLHELAISLSKSRKFSTHQNDDLCQICRDGGDLLCCDVCPRAYHKDCLALPEIPT 702
Query: 752 GTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 811
G W+C++C+N FQKEKFVE+NANA AAGR+ GVDP Q+ RCIRIV+T D + GGCV C
Sbjct: 703 GRWYCKFCLNNFQKEKFVEHNANAIAAGRVAGVDPIDQITRRCIRIVKTMDADFGGCVFC 762
Query: 812 RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ 871
RG DF K FG RTV+LCDQCE+E+HVGCLKDH MEDL+ELPKG W CC+DC RI+ AL+
Sbjct: 763 RGHDFDKI-FGPRTVLLCDQCEKEFHVGCLKDHNMEDLKELPKGNWFCCSDCCRIHSALE 821
Query: 872 KLVDRGEEKLPETSLDVIKKK-HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 930
KLV RGEE+L ++SL++I KK E+ + + DVRWR+L K A D T ALLS+A+
Sbjct: 822 KLVLRGEERLLDSSLNLINKKVQEKCAGIDCSNIDVRWRLLNDKINPAGD-TAALLSEAL 880
Query: 931 SIFHDRFDPII----ESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 986
+I H++F+PI+ S + DLI +MV+G + +GQ++ GMYCA+L +NQ VVS I R
Sbjct: 881 AILHEQFNPILVAGTSSKADRDLITSMVFGDNLKGQEFGGMYCAVLMINQAVVSCAIIRF 940
Query: 987 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1046
FG ELAELPLVATS+ QG+GYFQ+LF CIEKLLGFLN+K LVLP+A EA++IW NKFGF
Sbjct: 941 FGLELAELPLVATSSKAQGKGYFQALFTCIEKLLGFLNIKNLVLPAAEEAESIWINKFGF 1000
Query: 1047 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
+T EE K+R DY +M+FQGTSML KPVPK RIVG+S G
Sbjct: 1001 RKLTHEEFLKFRKDYQMMVFQGTSMLHKPVPKIRIVGRSEGG 1042
>gi|224106864|ref|XP_002314310.1| predicted protein [Populus trichocarpa]
gi|222850718|gb|EEE88265.1| predicted protein [Populus trichocarpa]
Length = 955
Score = 904 bits (2337), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/928 (52%), Positives = 616/928 (66%), Gaps = 91/928 (9%)
Query: 199 VVGVEEERRNECNQVLTN-----------VEENEHSEVDREKAENDLIGEVKNEFEEVVA 247
V+ +EE R +L N EE + +EV+ E+ N L+ + +
Sbjct: 81 VIDLEEARVESLAPLLNNYGDGEIVEVKEFEEAKENEVECEEKNNGLVPVLMDGVMAESG 140
Query: 248 VVEEE---KKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIER------- 297
V+E + + E D+V CEE G + +DE+ K + R
Sbjct: 141 VIENKGGGEVKEGDKVHA------CEEGSSG--------LVLIDEDSKPTVNRVLESKSG 186
Query: 298 -ELVENGVLESSMVGKHSSTLCNGESNVAKS----VAVDGNDEGKTVNVVVERPLRRFTR 352
EL ++ E G S ++ N E + V V+G+ + K V E+P RRFTR
Sbjct: 187 CELKKDDACEEGTSGLSSVSVKNDEGGYVNASFQPVVVNGDSKCK---VEEEKPFRRFTR 243
Query: 353 SLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLE 412
S L+ K+E S S DG K V + + V T M ++KF +KLK+ L+
Sbjct: 244 SALKPKIEPLDIS-SSDGVK---VDDTGSSSVAAITTTPTKMFAIDGLKKFPTKLKDLLD 299
Query: 413 SGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSN 472
SGILEG V Y+RG KV+GPG GL GVVK SGI CFCDDCKG +VVTP +FELHAGS+N
Sbjct: 300 SGILEGQKVKYLRGPKVRGPGEKGLHGVVKESGILCFCDDCKGKEVVTPTIFELHAGSAN 359
Query: 473 KRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAG 532
KRPPEYI+LENG TLRD+MN CK+S L+ L++A+R+ +G + KK+NFCL+CR S + AG
Sbjct: 360 KRPPEYIFLENGNTLRDVMNACKNSSLDILDEAIRLSIGFTPSKKSNFCLSCRGSITGAG 419
Query: 533 VEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRS 592
+ +LC C+ELK+SQA + P+ TDT R+
Sbjct: 420 TRKSKVLCSQCLELKDSQA-------------ILAPE----------------TDTKERT 450
Query: 593 PEPNSAQTSSHSKMKSSSVKSH--GKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKF 650
P P+ SS + +KSS +S+ G++T+KD+RMHKLVFEE L DG EVGY+ +G+ K
Sbjct: 451 PRPSPVPESSSALLKSSPSRSNSQGRLTKKDIRMHKLVFEEEVLPDGTEVGYYSQGK-KL 509
Query: 651 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP 710
LVGYKKGFGI C+CCN+EVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHEL+I LS R
Sbjct: 510 LVGYKKGFGIFCSCCNTEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELAISLSKCRR 569
Query: 711 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVE 770
S+KENDDLC IC DGG LLCCD CPRAFH +C+SLP IP G W+C+YC+NTF+KEKFVE
Sbjct: 570 HSTKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPKGKWYCKYCLNTFEKEKFVE 629
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCD 830
NANA AAGR+ G DP Q+ RCIRIV+T + E+GGCV CRG DF ++ FG RTVI+CD
Sbjct: 630 RNANAIAAGRVAGTDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT-FGPRTVIICD 688
Query: 831 QCEREYHVGCLKDHGMEDLQ---ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLD 887
QCE+E+HVGCLK+H M+DL+ ELP GKW CC C+RI+ ALQKLV RGEEKLP++SL+
Sbjct: 689 QCEKEFHVGCLKEHQMQDLKAICELPTGKWFCCTGCERIHSALQKLVIRGEEKLPDSSLN 748
Query: 888 VIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL 947
IKKKHEES S++ D+RWR+L KK D SD T +LLS+AV+IFH+RF PI SK
Sbjct: 749 FIKKKHEESASESGGGDDIRWRLL-SKKTDPSDVTESLLSEAVAIFHERFAPITVDKSKR 807
Query: 948 -----DLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
D IP+MV G +GQD GMYCA+L VN VVSA + RIFGQELAELP+VATS+
Sbjct: 808 KRDDHDFIPSMVKGGDMKGQDLGGMYCAVLLVNHEVVSAAVMRIFGQELAELPIVATSSK 867
Query: 1003 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN--KYRND 1060
QGQGYFQ+LF CIEKLLGFLNVK LVLP+A E ++IWTNKFGFS +T++E +YR
Sbjct: 868 SQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVESIWTNKFGFSTITQDEVRLMEYRKS 927
Query: 1061 YPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
Y +M FQG+ MLQKPVPKCR+VGKS G
Sbjct: 928 YQIMEFQGSLMLQKPVPKCRVVGKSEGG 955
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%), Gaps = 2/40 (5%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLG 40
MANGTD+++ V +K+R G KREFEFA + SEICGSLG
Sbjct: 1 MANGTDAKDAAV--AKVRPGHKREFEFAFRAHSEICGSLG 38
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 875 bits (2262), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/904 (51%), Positives = 612/904 (67%), Gaps = 59/904 (6%)
Query: 203 EEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAM 262
EEE +++ ++++ E S+VD G+ + E++ A+ EE K+E +
Sbjct: 106 EEEAKSDIVDLISD--EEPKSQVDES------TGDTGTKDEKLDAIRIEESKEE----LL 153
Query: 263 DVEEV---KCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVE-NGVLESSMVGKHSSTLC 318
D E+ + ++ + E +V +EE K + E E + + VGK+ S+
Sbjct: 154 DSEDPSSHRTVDLAIHSELVDVKVDPSYEEESKETLRNESEELSTCADLGKVGKNVSS-- 211
Query: 319 NGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGSLSKDGGK 372
+N +KS+ +D N G+ + ++P +RFTRS L+Q VE L+K +
Sbjct: 212 EEAANGSKSI-IDVN--GQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQV 268
Query: 373 RSDVTEVANDGVGGPVKQETV----MKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGS 427
++ TE + + GP+ V K +KV +KF +KLK+ L++GILEG+ V YIRGS
Sbjct: 269 ITNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKKFPAKLKDLLDTGILEGLRVRYIRGS 328
Query: 428 KVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTL 487
K+K G +GL GV+ GSGI CFC++CKG +VV+P +FELHAGSSNKRPPEYIYLE G TL
Sbjct: 329 KIKALGETGLGGVISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTL 388
Query: 488 RDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC--RVSFSNAGVEELMLLCKSCVE 545
RDIMN C++ + E+ ++ +G S +K+ CLNC R+ S+ G+ MLLC SC++
Sbjct: 389 RDIMNACQNFSFDQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMD 446
Query: 546 LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK 605
++ PQ S + + +R+P+PN SS +
Sbjct: 447 SRK-------------------PQVSSSPSPSPSPSPTPIVFSKDRTPKPNVLSKSSDTI 487
Query: 606 MKSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCT 663
KS S K HG+ITRKDLR+HKLVFEE L DG EV Y+ RG+ K LVGYKKG GI C+
Sbjct: 488 TKSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQ-KLLVGYKKGSGIFCS 546
Query: 664 CCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGIC 723
CCNSEVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHELSI LS R FS +NDDLC IC
Sbjct: 547 CCNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSIC 606
Query: 724 MDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 783
DGGDLLCCD CPR+FH DCV LP IP+G W+C+YC N FQKEKFVE+NANA AAGR+ G
Sbjct: 607 ADGGDLLCCDGCPRSFHRDCVPLPCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAG 666
Query: 784 VDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKD 843
VDP Q+ +RCIRIV+T + E+GGC LCR DF KS FG RTVILCDQCE+E+HVGCLK+
Sbjct: 667 VDPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKE 726
Query: 844 HGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVD 903
+ MEDL+ELP+GKW CC +C RI+ AL+KLV G EKLPE+ L ++KK E+ GS + D
Sbjct: 727 NNMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASIND 786
Query: 904 FDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQD 963
++RWRVL K + +SD TR+LLSKAVSIFHD FDPI++SAS D IP+M+YGR+ RGQ+
Sbjct: 787 VEIRWRVLNWKML-SSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQE 845
Query: 964 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1023
+ G+YCA+LTVN+ VVS GIFRIFG E+AELPLVAT + QGQGYFQSL+ CIE+ LGFL
Sbjct: 846 FGGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFL 905
Query: 1024 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVG 1083
NVK LVLP+A EA+++W NKFGFS + EE +++ Y +MIFQGTSMLQK VPK R++
Sbjct: 906 NVKNLVLPAADEAESLWINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPKYRVIN 965
Query: 1084 KSVD 1087
+ +
Sbjct: 966 SAAN 969
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANGT +E FVVLS++R GLKREF FALKVQS ICGSLGRTR+ K+ + + P
Sbjct: 1 MANGTAPDE-FVVLSRVRTGLKREFAFALKVQSTICGSLGRTRSTKLHNAIPESPT--PK 57
Query: 61 EVKKLKTY--------ESRKKRKRQEQSVVVKETEDKR----------EEEVKSDVFDVI 102
+K L T + Q +S V E E + EEE KSD+ D+I
Sbjct: 58 RLKGLGTMEAKEGEEEDEESDEAAQLRSCEVGEVEKVKIMEDMADSMSEEEAKSDIVDLI 117
Query: 103 NERERPIREKESKDDS 118
++ E + ES D+
Sbjct: 118 SDEEPKSQVDESTGDT 133
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 870 bits (2248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/903 (51%), Positives = 606/903 (67%), Gaps = 57/903 (6%)
Query: 203 EEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAM 262
EEE +++ ++++ E S+VD G+ + E++ A+ EE K+E +
Sbjct: 106 EEEAKSDIVDLISD--EEPKSQVDES------TGDTGTKDEKLDAIRIEESKEE----LL 153
Query: 263 DVEEV---KCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGKHSSTLCN 319
D E+ + ++ + E +V +EE K + E E + + +GK + +
Sbjct: 154 DSEDPSSHRTVDLAIHSELVDVKVDPSYEEESKETLRNESEE--LSTCADLGKAGKNVSS 211
Query: 320 GESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGSLSKDGGKR 373
E+ +D N G+ + ++P +RFTRS L+Q VE L+K +
Sbjct: 212 EEAANGSKSIIDVN--GQLGKKMFQQPRKRFTRSALKQNVEPTSLEHLSKCNTGVAMQVI 269
Query: 374 SDVTEVANDGVGGPVKQETV----MKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGSK 428
++ TE + + GP+ V K +KV +KF +KLK+ L++GILEG+ V YIRGSK
Sbjct: 270 TNDTETKPEDIPGPLATPPVKIGKTKLKKVSAKKFPAKLKDLLDTGILEGLRVRYIRGSK 329
Query: 429 VKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLR 488
+K G +GL GV+ GSGI CFC++CKG +VV+P +FELHAGSSNKRPPEYIYLE G TLR
Sbjct: 330 IKALGETGLGGVISGSGIICFCNNCKGKEVVSPTLFELHAGSSNKRPPEYIYLETGNTLR 389
Query: 489 DIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC--RVSFSNAGVEELMLLCKSCVEL 546
DIMN C++ + E+ ++ +G S +K+ CLNC R+ S+ G+ MLLC SC++
Sbjct: 390 DIMNACQNFSFDQTEEFIQSAIGRSLVKRTAICLNCKGRIPESDTGIA--MLLCCSCMDS 447
Query: 547 KESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKM 606
K+ QA LSH E D +P+PN SS +
Sbjct: 448 KKPQAIDLL---SLSHYYMKE----------------FWADHLIITPKPNVLSKSSDTIT 488
Query: 607 KSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC 664
KS S K HG+ITRKDLR+HKLVFEE L DG EV Y+ RG+ K LVGYKKG GI C+C
Sbjct: 489 KSVSTRGKIHGRITRKDLRLHKLVFEEDILPDGTEVAYYARGQ-KLLVGYKKGSGIFCSC 547
Query: 665 CNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICM 724
CNSEVSPSQFEAHAGWASRRKP+ HIYTSNGVSLHELSI LS R FS +NDDLC IC
Sbjct: 548 CNSEVSPSQFEAHAGWASRRKPYLHIYTSNGVSLHELSISLSKGRKFSLTDNDDLCSICA 607
Query: 725 DGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGV 784
DGGDLLCCD CPR+FH DCV L IP+G W+C+YC N FQKEKFVE+NANA AAGR+ GV
Sbjct: 608 DGGDLLCCDGCPRSFHRDCVPLQCIPTGIWYCKYCQNLFQKEKFVEHNANAVAAGRVAGV 667
Query: 785 DPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH 844
DP Q+ +RCIRIV+T + E+GGC LCR DF KS FG RTVILCDQCE+E+HVGCLK++
Sbjct: 668 DPIEQITTRCIRIVKTMEVEVGGCALCRCHDFSKSGFGPRTVILCDQCEKEFHVGCLKEN 727
Query: 845 GMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDF 904
MEDL+ELP+GKW CC +C RI+ AL+KLV G EKLPE+ L ++KK E+ GS + D
Sbjct: 728 NMEDLKELPQGKWFCCPECNRIHSALEKLVVLGGEKLPESILVSVQKKIEDQGSASINDV 787
Query: 905 DVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDY 964
++RWRVL K + +SD TR+LLSKAVSIFHD FDPI++SAS D IP+M+YGR+ RGQ++
Sbjct: 788 EIRWRVLNWKML-SSDETRSLLSKAVSIFHDCFDPIVDSASGRDFIPSMLYGRNIRGQEF 846
Query: 965 HGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLN 1024
G+YCA+LTVN+ VVS GIFRIFG E+AELPLVAT + QGQGYFQSL+ CIE+ LGFLN
Sbjct: 847 GGIYCAVLTVNESVVSVGIFRIFGAEVAELPLVATDTNFQGQGYFQSLYACIERFLGFLN 906
Query: 1025 VKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGK 1084
VK LVLP+A EA+++W NKFGFS + EE +++ Y +MIFQGTSMLQK VPK R++
Sbjct: 907 VKNLVLPAADEAESLWINKFGFSKLPPEEVMEFKRHYQMMIFQGTSMLQKEVPKYRVINS 966
Query: 1085 SVD 1087
+ +
Sbjct: 967 AAN 969
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MANGT +E FVVLS++R GLKREF FALKVQS ICGSLGRTR+ K+ + + P
Sbjct: 1 MANGTAPDE-FVVLSRVRTGLKREFAFALKVQSTICGSLGRTRSTKLHNAIPESPT--PK 57
Query: 61 EVKKLKTY--------ESRKKRKRQEQSVVVKETEDKR----------EEEVKSDVFDVI 102
+K L T + Q +S V E E + EEE KSD+ D+I
Sbjct: 58 RLKGLGTMEAKEGEEEDEESDEAAQLRSCEVGEVEKVKIMEDMADSMSEEEAKSDIVDLI 117
Query: 103 NERERPIREKESKDDS 118
++ E + ES D+
Sbjct: 118 SDEEPKSQVDESTGDT 133
>gi|224118454|ref|XP_002331486.1| predicted protein [Populus trichocarpa]
gi|222873564|gb|EEF10695.1| predicted protein [Populus trichocarpa]
Length = 973
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/813 (57%), Positives = 565/813 (69%), Gaps = 57/813 (7%)
Query: 290 EKKNDIERELVENGVLESSMVGKHSSTLCNGE----SNVAKSVAVDGNDEGKTVNVVVER 345
E+KND E L ++ E G S + NGE +N V VDG+ + K V E+
Sbjct: 204 ERKNDCE--LKKDDAREEGTSGLSSVLVKNGEGGDVNNSLHPVVVDGDIKCK---VEAEK 258
Query: 346 PLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVM----- 400
P RRFTRS L+ K+E S SD +V + G T P K+
Sbjct: 259 PFRRFTRSALKPKIETVDIS-------SSDGVKVDDRGSSSAAAATTTNTPTKMFSIDGS 311
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
+KF +KLK+ L+SGILEG V Y+RG+KV+GPG GL G+V+ SGI CFCDDCKG +VVT
Sbjct: 312 KKFPTKLKDLLDSGILEGQKVKYLRGAKVRGPGEKGLHGMVRESGILCFCDDCKGKEVVT 371
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
PA+F LHAGSSNKRPPEYI LENG TL D+MN CK+S L+TL++A+R+ G S KK+NF
Sbjct: 372 PAIFVLHAGSSNKRPPEYICLENGNTLCDVMNACKNSSLDTLDEAIRLSTGFSPSKKSNF 431
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPA 580
C NCR S + AG + +LC C LK+ QAGSA P + E +P SV
Sbjct: 432 CWNCRGSITGAGSRKSKVLCSQCFGLKDFQAGSA----PKTAKKERTAKPHSV------- 480
Query: 581 PSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEV 640
PE + + K S KS G++T+KD+R HKLVFEE L DG EV
Sbjct: 481 ------------PE----SSCNLLKSSLSGSKSQGRVTKKDIRTHKLVFEEEVLPDGTEV 524
Query: 641 GYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE 700
GY+ +G+ K L GYKKGFGI C+CCNSEVSPSQFEAHAGWASRRKP+ +IYTSNGVSLHE
Sbjct: 525 GYYCQGK-KLLAGYKKGFGIFCSCCNSEVSPSQFEAHAGWASRRKPYLNIYTSNGVSLHE 583
Query: 701 LSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
L+I LS R S KENDDLC IC DGG LLCCD CPRAFH +C+SLP IP G W+C+YC+
Sbjct: 584 LAISLSKGRRHSIKENDDLCQICRDGGKLLCCDVCPRAFHQECLSLPSIPRGKWYCKYCL 643
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
NTF+KEKFVE NANA AAGR+ GVDP Q+ RCIRIV+T + E+GGCV CRG DF ++
Sbjct: 644 NTFEKEKFVERNANAIAAGRVAGVDPIEQITRRCIRIVKTFEAEVGGCVFCRGHDFERT- 702
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 880
FG RTVI+CDQCE+E+HVGCLK+H M+DL+ELPKGKW CC C+RI+ ALQKLV RGEEK
Sbjct: 703 FGPRTVIICDQCEKEFHVGCLKEHKMQDLKELPKGKWFCCTGCERIHSALQKLVIRGEEK 762
Query: 881 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI 940
LP++SL+ I KKHEES S++ DVRWR+L KK D+SD T ALLS AV+IFH+ FDPI
Sbjct: 763 LPDSSLNFI-KKHEESASESGCSDDVRWRLL-SKKTDSSDVTEALLSDAVAIFHECFDPI 820
Query: 941 IESASKL-----DLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELP 995
SK D IP+MV G + +GQD GMYCA+L VN VVVS + RIFGQELAELP
Sbjct: 821 TVDKSKRRRDDHDFIPSMVKGGNMKGQDLGGMYCAVLLVNHVVVSVAVVRIFGQELAELP 880
Query: 996 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1055
+VATS+ QGQGYFQ+LF CIEKLLGFLNVK LVLP+A E +IW NKFGF +T++E
Sbjct: 881 IVATSSRWQGQGYFQTLFTCIEKLLGFLNVKNLVLPAAEEVGSIWKNKFGFGAITQDELM 940
Query: 1056 KYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
+YR Y +M+FQG MLQKPVPKCRIVGKS G
Sbjct: 941 EYRRRYQIMVFQGALMLQKPVPKCRIVGKSEGG 973
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/75 (46%), Positives = 46/75 (61%), Gaps = 9/75 (12%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVL 57
MAN TD+++ ++K+R G KREFEFA + +SEI G LGRTR+ +V S N S
Sbjct: 1 MANSTDAKD--AAMAKVRPGHKREFEFAFRARSEIRGYLGRTRSSRVFSSPGNNGSNSYN 58
Query: 58 GPPEVKKLKTYESRK 72
G KKLK Y +K
Sbjct: 59 G----KKLKGYGIKK 69
>gi|356547147|ref|XP_003541978.1| PREDICTED: uncharacterized protein LOC100804381 [Glycine max]
Length = 1006
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/1127 (45%), Positives = 671/1127 (59%), Gaps = 172/1127 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG-SLGRTRARKVQSNVDSGCVLGP 59
MA G DS+E FVVLS++R GLKREF FA+K QSEICG SLGRTRA K + V P
Sbjct: 1 MAKGADSDE-FVVLSRVRTGLKREFAFAMKAQSEICGGSLGRTRASKNR-------VEAP 52
Query: 60 PEVKKLKTYESRKKRKRQEQSVVVKETEDK-REEEVKSDVFDVINERERPIREKESKDDS 118
++ +RK+ ++ E+ K +ED EEEVKSDV D+ DD+
Sbjct: 53 -----VQPASARKRSRKSEEP---KTSEDAMSEEEVKSDVVDL------------QSDDN 92
Query: 119 ENMGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYGE 178
++G E A+ EE + E + E + EE+ D+V+ N EE
Sbjct: 93 NHVGESESAAMQVCEEEPKMLEPKPEPV---------ISEEEPKVLDDVI-NEEE----- 137
Query: 179 LDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLIGE- 237
+ E K E EP+V E EV E AE L E
Sbjct: 138 ------AVVAETLKGE--EPIVA-----------------ETLKEEVVDEMAEQPLCKEE 172
Query: 238 ----VKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKN 293
V +E E EE +K SD +A E+ CEE E E G V E D
Sbjct: 173 SEKGVSDEMAEQPLCEEESEKGVSDEMA---EQPLCEE-----EPEKGGVSEEKDS---- 220
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPL--RRFT 351
NGV + L N + + +GN + + + +E P RRFT
Sbjct: 221 --------NGV---------ALALVNDDGD-------EGNKKKRRMKKRLEMPQSERRFT 256
Query: 352 RSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET----VMKP-------RKVM 400
RS L+ K E + + V + +DGV G + +M P +
Sbjct: 257 RSALKVKSE------ETNDVEHVGVAGIDDDGVKGETEASAEASLLMTPPSSAKFSNSRL 310
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
+KF SKLK+ L +GILEG+ VMY++G+KV G GL+GV++ SG+ CFC C G +VVT
Sbjct: 311 KKFPSKLKDLLATGILEGLPVMYMKGAKVLFAGEKGLQGVIQDSGVLCFCKICNGVEVVT 370
Query: 461 PAVFELHAGSSNKRPPEYIYLENG---KTLRDIMNVCK--DSPLETLEKAVRMVLGSSSM 515
P VFELHAGS+NKRPPEYIY+ +G KTLRD+MN C D PLE++++AV+ +LG +M
Sbjct: 371 PTVFELHAGSANKRPPEYIYIHDGNCGKTLRDVMNACCCCDFPLESMDEAVQKLLGDFTM 430
Query: 516 KKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE--IKEP----------LSHS 563
KK++ CLNCR + GV +L +C C+ A ++ I +P + S
Sbjct: 431 KKSSICLNCRGACK--GVSKL--VCDLCLASPPQTAMASRKVISQPVQPRSPEPVVIQKS 486
Query: 564 SEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK-----MKSSSV--KSHGK 616
+ E QP S++ E P ++ + +PNS T K MK S+ KS G+
Sbjct: 487 LDNEVQPNSLDNEVPP-------NSLDNEVQPNSLDTGVQPKSFSNGMKHSASRGKSQGR 539
Query: 617 ITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEA 676
+TRKDLR+HKLVFE L DG E+ Y+ G+ K LVGYKKG GI CTCCN +VS SQFEA
Sbjct: 540 LTRKDLRLHKLVFEADVLPDGTELAYYAHGQ-KLLVGYKKGCGIFCTCCNEQVSASQFEA 598
Query: 677 HAGWASRRKPFQHIYTSNGVSLHELSIKLSLE-RPFSSKENDDLCGICMDGGDLLCCDSC 735
HAGWASRRKP+ HIYTSNG+SLHELSI LS + R FS+ +NDDLC IC DGGDLLCCD C
Sbjct: 599 HAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLLCCDGC 658
Query: 736 PRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA-RAAGRIEGVDPFAQMVSRC 794
PRAFHIDCV LP IPSG+W+C+YC N FQK++ ++ NA AAGRI G D M RC
Sbjct: 659 PRAFHIDCVPLPCIPSGSWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILELMNKRC 718
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
IR+V+T + + GGC LC +F KS FG RTVI+CDQCE+EYHVGCLK+H ME+L++LP+
Sbjct: 719 IRVVKTVEVDHGGCALCSRPNFSKS-FGPRTVIICDQCEKEYHVGCLKEHNMENLEKLPE 777
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 914
G W C +C I+ AL LV E+ +P+ L +IKKKHEE + DV+WRV+ K
Sbjct: 778 GNWFCSGNCSHIHTALTDLVASKEKDVPDPLLSLIKKKHEEKSLEIGAGLDVKWRVMNWK 837
Query: 915 KVDASDG---TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAI 971
SD TR LLSKAV+IFH+RFDPI++S S D IP M++GR+ RGQD+ G+YCA+
Sbjct: 838 LDSDSDDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPTMLFGRNIRGQDFSGIYCAV 897
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
LTVN +VSAG+FR+FG E+AELPLVAT+ D QGQGYFQ LF CIE LLG LNVK LVLP
Sbjct: 898 LTVNGDIVSAGVFRVFGSEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLVLP 957
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+A EA++IWT KFGF+ + ++E NKY+ Y +MIFQGTS+LQKPVP+
Sbjct: 958 AADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPVPE 1004
>gi|45935119|gb|AAS79577.1| putative PHD zinc finger protein [Ipomoea trifida]
gi|117165997|dbj|BAF36299.1| hypothetical protein [Ipomoea trifida]
Length = 1047
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 500/1111 (45%), Positives = 658/1111 (59%), Gaps = 102/1111 (9%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MAN T++ E VV+S IR G+KREF +K Q+E S+G+ R + Q+ +G P
Sbjct: 1 MANRTEAGE--VVISSIRTGMKREFAMMMKAQAECGISIGQKRVTRSQNGALNGRASISP 58
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDSEN 120
+ +T S KR+++E +VV + NE+ E SKD+ E
Sbjct: 59 Q-NGSRTKSSTVKRRKKEANVVTADQS-----------LSHSNEKLDKTEELRSKDEGEK 106
Query: 121 MGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG--- 177
VE++ + SE E +D VV G E++KN V + E +
Sbjct: 107 -----------VEKLPLTSEWEEPKSD----VVDGASEDEKNG-TAVDSEIGEQETACVM 150
Query: 178 ---ELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDL 234
EL+ +GGS + + + V V+ E N L + E+ L
Sbjct: 151 EREELNSLGGSQKEKLINGDVAAKPVDVDNSGNGEENNGLVDCGGLFQKEL--------L 202
Query: 235 I-GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYE-PGRVQMEMDEEKK 292
I G+V + +EV D+SD V C G+ KE E G V M K
Sbjct: 203 INGDVAEKPKEV---------DKSDNDKQSDSLVNCGG-GVEKERELNGDVTEAMMNVVK 252
Query: 293 NDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGND--------EGKTVNVVVE 344
D E +L + ++E +M S C +S K+ G+ E V+ E
Sbjct: 253 TDNEEQL--DSLMEPTMEELPMSGDCKTKSEAIKADCAYGSALATVPESVENSKVSTSSE 310
Query: 345 RPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFY 404
+PLRRFTRS L+ K + S ++D + +G K E M + + K
Sbjct: 311 KPLRRFTRSCLKTKQQAMSASPAEDTKAEDALESDEASAIGTTSKLEMKMSKKVALVKIP 370
Query: 405 SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVF 464
+KLK L +G+LEG+ V Y+R +K +G GL+GV++GSGI CFC +C G +VVTP F
Sbjct: 371 TKLKGLLATGLLEGLPVRYVRVTKARGRPEKGLQGVIQGSGILCFCQNCGGTKVVTPNQF 430
Query: 465 ELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC 524
E+HAGSSNKRPPEYIYL+NGKTLRD++ CKD+P + LE A+R G+ +K+ FCLNC
Sbjct: 431 EMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEAAIRNATGAGDARKSTFCLNC 490
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEP-LSHSSEME---------------- 567
+ S A L C SC+ K+SQ +++ + S ++E
Sbjct: 491 KASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRDGQLEFIFLLNYYWADDLYKL 550
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P L+ SP+ + L +S E S+ T SK+ HG++TRKDLRMHKL
Sbjct: 551 GLPDLRGLQWSPSSNSVL-----KSTERMSSGTCPPSKV-------HGRLTRKDLRMHKL 598
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
VFE L DG + Y+VRG+ K L GYKKG I C CC SEVSPSQFEAHAG ASRRKP+
Sbjct: 599 VFEGDVLPDGTALAYYVRGK-KLLEGYKKGGAIFCYCCQSEVSPSQFEAHAGCASRRKPY 657
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
HIYTSNGVSLHELSIKLS+ER SS ENDDLC IC DGGDLLCCD+CPRAFH +CVSLP
Sbjct: 658 SHIYTSNGVSLHELSIKLSMERRSSSDENDDLCSICADGGDLLCCDNCPRAFHTECVSLP 717
Query: 748 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 807
IP GTW+C+YC N F KEKF + +ANA AAGR+ G+D Q+ IRIV T E+G
Sbjct: 718 NIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQITKCSIRIVDTLHAEVGV 776
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
CVLCR DF S FG +TVI+CDQCE+EYHV CL++H M+DL+ELPK KW CC +C I+
Sbjct: 777 CVLCRSHDFSTSGFGPQTVIICDQCEKEYHVKCLEEHNMDDLKELPKDKWFCCKECNSIH 836
Query: 868 LALQKLVDRGEEKLPETSLDVIKKK-HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
ALQKLV GE+ LP++ + +I +K ++ DN+++ DV+WR+L GK ++++ TR L
Sbjct: 837 YALQKLVSDGEQSLPDSLMGIINEKIKAKNLEDNSIN-DVKWRLLSGK--NSTEETRVWL 893
Query: 927 SKAVSIFHDRFDPIIESA-SKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFR 985
S AVSIFHD FDPI +S+ S+LDLIP MVYGR+ + QD+ GM CAIL VN +VVSAG+ R
Sbjct: 894 SGAVSIFHDSFDPIADSSTSRLDLIPTMVYGRNFKDQDFGGMLCAILMVNSLVVSAGVIR 953
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
IFG+E+AELPLVATS DCQG+GYFQSLF IE LL L VK LVLP+A EA++IWT KFG
Sbjct: 954 IFGKEVAELPLVATSLDCQGKGYFQSLFYSIENLLKSLGVKYLVLPAAEEAESIWTKKFG 1013
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
F +T EE Y+++Y LMIFQGT+MLQK V
Sbjct: 1014 FQHITPEELKHYKDNYQLMIFQGTAMLQKQV 1044
>gi|334184527|ref|NP_180365.6| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330252972|gb|AEC08066.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1072
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/816 (52%), Positives = 565/816 (69%), Gaps = 28/816 (3%)
Query: 295 IERELVENGVLESSMVGKHSSTLCNGESNV----AKSVAVDGNDEG---KTVNVVVERPL 347
+E EL+E +E + S+ NG S ++S + G+ +G K VN V++PL
Sbjct: 254 VEEELLEQNNVEICLGLPSRSSQMNGHSLCLGLPSRSFQMSGHSQGVDKKAVNDTVDKPL 313
Query: 348 RRFTRSLLQQKVELAKGSL--SKDGGKRSDVTEVANDGVGGPVKQETVM------KPRKV 399
RRFTRSL++Q+ + +L + + DV AND + +V +P+K
Sbjct: 314 RRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRGRPKKF 373
Query: 400 MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
+R F +KLK+ + GILEG+ V Y+RG+KV+ G GL+GV+KGSG+ CFC C G QVV
Sbjct: 374 LRNFPAKLKDIFDCGILEGLIVYYVRGAKVREAGTRGLKGVIKGSGVLCFCSACIGIQVV 433
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
+PA+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +KK++
Sbjct: 434 SPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LKKSS 492
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV--EL 575
CL+C+ + +++CKSC+E KE + ++ K + L+ SS P S+
Sbjct: 493 LCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSILRRS 552
Query: 576 EESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGG 633
+ SP S + +S EP + S SK S SHGK+TRKDLR+HKLVFE+
Sbjct: 553 KSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDI 612
Query: 634 LEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTS 693
L DG EVGYFV GE K LVGYKKGFGI C+CCN VSPS FEAHAG ASRRKPFQHIYT+
Sbjct: 613 LPDGTEVGYFVAGE-KMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTT 671
Query: 694 NGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 753
NGVSLHELS+ LS+++ FS ENDDLC IC DGG+L+CCD+CPR++H C SLP +PS
Sbjct: 672 NGVSLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSER 731
Query: 754 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCR 812
W C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CVLCR
Sbjct: 732 WSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCR 791
Query: 813 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
G FC+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN L
Sbjct: 792 GHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGN 851
Query: 873 LVDRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
L+ RGEEKL L+ ++KK +EE+ D D+RWRVL GK + +SD T+ LL+KA
Sbjct: 852 LIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGK-LTSSDDTKILLAKA 910
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 989
+SI H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+FG
Sbjct: 911 LSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGS 970
Query: 990 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1049
ELAELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ M
Sbjct: 971 ELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKM 1030
Query: 1050 TEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1085
T+EE +YR DY +MIF GTSML+K VP V K+
Sbjct: 1031 TDEEVKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 1066
>gi|356541962|ref|XP_003539441.1| PREDICTED: uncharacterized protein LOC100803825 [Glycine max]
Length = 981
Score = 796 bits (2056), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/887 (49%), Positives = 576/887 (64%), Gaps = 76/887 (8%)
Query: 234 LIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKN 293
LI E + + E +V+ EE+ D V + E + E + KE EP + E K
Sbjct: 123 LISEEQPKMLEPESVISEEEPKVLDDVINEEEAIVAETL---KEQEP-----IVPETLKE 174
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPL--RRFT 351
++ E+ E + K S NG VA ++ DG K++ +ERP RRFT
Sbjct: 175 EVVDEMAEQPLCIEESEEKDS----NG---VALALVNDGAKGKKSMKKRLERPQSERRFT 227
Query: 352 RSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET-------VMKPRKV----- 399
RS L+ K E + +D V G+ VK+ET + P V
Sbjct: 228 RSALKVKSE-----------ETNDGEHVGVAGISDGVKRETEAGASLVMTTPSSVKFSNR 276
Query: 400 --MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++KF +KL++ L +GILEG+ VMY++G KV G GL+GV++ SG+ CFC CKG +
Sbjct: 277 GKLKKFPAKLRDLLATGILEGLPVMYMKGVKVLFDGEKGLQGVIQDSGVLCFCKICKGVE 336
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENG---KTLRDIMNVCK--DSPLETLEKAVRMVLGS 512
VVTP VFELHAGS+NKRPPEYIY+ +G KTLRD+MN C D PLE++++AV+ +LG
Sbjct: 337 VVTPTVFELHAGSANKRPPEYIYIHDGNSGKTLRDVMNACCCCDFPLESMDEAVQKLLGD 396
Query: 513 SSMKKANFCLNCRVSFSNAGVEELMLLCKSCV--ELKESQAGSAEIKEPL---------- 560
+MKK++ CLNCR + GV L +C SC+ + + A + I +P+
Sbjct: 397 FTMKKSSICLNCRGACK--GVSRL--VCDSCLVSPAQTAVASNKGISQPVQPRSPEPVVI 452
Query: 561 --SHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK-----MKSSSV-- 611
S +E++P E++ + +G ++ + EP+S S K MK S+
Sbjct: 453 QKSLDNEVQPNSLHNEVQPNKLDTGMQPNSLDNGMEPDSLNNSMKPKSFSNGMKHSASRG 512
Query: 612 KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSP 671
KS G++TRKDLR+HKLVFE L DG E+ Y+ G+ K LVGYKKG+GI CTCCN +VS
Sbjct: 513 KSQGRLTRKDLRLHKLVFEADVLPDGTELAYYAHGQ-KLLVGYKKGYGIFCTCCNEQVSA 571
Query: 672 SQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE-RPFSSKENDDLCGICMDGGDLL 730
SQFEAHAGWASRRKP+ HIYTSNG+SLHELSI LS + R FS+ +NDDLC IC DGGDLL
Sbjct: 572 SQFEAHAGWASRRKPYLHIYTSNGISLHELSISLSKDHRRFSNNDNDDLCIICEDGGDLL 631
Query: 731 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA-RAAGRIEGVDPFAQ 789
CCD CPRAFHIDCV LP IPSGTW+C+YC N FQK++ ++ NA AAGRI G D
Sbjct: 632 CCDGCPRAFHIDCVPLPCIPSGTWYCKYCQNVFQKDRHGQHEVNALAAAGRIAGPDILEL 691
Query: 790 MVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDL 849
M RCIR+V+T + + GGC LC +F KS FG +TVI+CDQCE+EYHVGCLKDH ME+L
Sbjct: 692 MNKRCIRVVRTLEVDHGGCALCSRPNFSKS-FGPQTVIICDQCEKEYHVGCLKDHNMENL 750
Query: 850 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 909
+ELP G W C +C +I+ AL LV E+ +P+ L++IKKKHEE D DV+WR
Sbjct: 751 EELPVGNWFCSGNCSQIHTALMDLVASKEKDVPDPLLNLIKKKHEEKSLDIGAGLDVKWR 810
Query: 910 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYC 969
V+ K S TR LLSKAV+IFH+RFDPI++S S D IPAM++GR+ RGQD+ G+YC
Sbjct: 811 VINWKLDSDSVETRKLLSKAVAIFHERFDPIVDSTSGRDFIPAMLFGRNIRGQDFSGIYC 870
Query: 970 AILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLV 1029
A+LTVN +VSAG+FR+FG E+AELPLVAT+ D QGQGYFQ LF CIE LLG LNVK LV
Sbjct: 871 AVLTVNGDIVSAGVFRVFGLEIAELPLVATTADHQGQGYFQCLFSCIETLLGSLNVKNLV 930
Query: 1030 LPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
LP+A EA++IWT KFGF+ + ++E NKY+ Y +MIFQGTS+LQKPV
Sbjct: 931 LPAADEAESIWTGKFGFTKLPQDEINKYKKFYRMMIFQGTSVLQKPV 977
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 79/146 (54%), Gaps = 22/146 (15%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEIC-GSLGRTRARK--VQSNVDSGCVL 57
MA G DS+E FVVLS++R GLKREF FA+K QSEIC GSLGRTRA K V++ V
Sbjct: 1 MAKGADSDE-FVVLSRVRTGLKREFAFAMKAQSEICGGSLGRTRASKNRVEAPVQ----- 54
Query: 58 GPPEVKKLKTYESRKKRKRQEQSVVVKETEDK-REEEVKSDVFDVINERERPIREKESKD 116
PP RKR +S K ED EEEVKSDV D+ ++ E +
Sbjct: 55 -PPSA-----------RKRSRKSEEAKTLEDAMSEEEVKSDVVDLQSDDEPKNNNHVGES 102
Query: 117 DSENMGVGERGALMNVEEVKVVSERR 142
+S M V E +V ++SE +
Sbjct: 103 ESAAMQVCEEEPKSDVVLETLISEEQ 128
>gi|359481940|ref|XP_002264975.2| PREDICTED: uncharacterized protein LOC100248757 [Vitis vinifera]
Length = 2411
Score = 788 bits (2034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/762 (52%), Positives = 514/762 (67%), Gaps = 64/762 (8%)
Query: 334 DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
DE K +++ +E L +RFTRS L+ K + + SL D + V ++ G V+ +
Sbjct: 1697 DESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SLESDYNFCNSVAIGVDEKTNGAVR--S 1753
Query: 393 VMKPRKVMRKFYSKL---------KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKG 443
+ P+K+ K K+ ++ LE+G+LEG V Y + KG L+G +KG
Sbjct: 1754 LTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTY--DGRKKG---YRLQGTIKG 1808
Query: 444 SGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE 503
+GI C C CKG++VV P+ FELHA S + +YIYL+NGK L D+++VCKD+PLETLE
Sbjct: 1809 NGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLE 1868
Query: 504 KAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHS 563
++ +GS +K++ P +
Sbjct: 1869 ATIQSAIGSFPVKRS---------------------------------------LPADEA 1889
Query: 564 SEMEPQPPS-VELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSS-VKSHGKITRKD 621
++M+P S ++ SPA S T R +P SS S + +SS KS GKIT+KD
Sbjct: 1890 AKMDPLGNSCIKRNNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSENKSLGKITKKD 1949
Query: 622 LRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWA 681
R+H+LVFEEGGL DG EV Y+ G+ K L GYKKGFGI C CC+ EVS SQFEAHAGWA
Sbjct: 1950 QRLHRLVFEEGGLPDGTEVAYYAGGK-KLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWA 2008
Query: 682 SRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHI 741
SR+KP+ +IYTSNGVSLHEL+I LS R +S+++NDDLC IC DGG+LL CD CPRAFH
Sbjct: 2009 SRKKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHR 2068
Query: 742 DCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTP 801
C SLP IP W+CRYC N FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P
Sbjct: 2069 VCASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NP 2127
Query: 802 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 861
+ E+ CVLCRG DF KS FG RT+ILCDQCE+E+H+GCL+DH M+DL+ELP GKW CC
Sbjct: 2128 EAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKEFHIGCLRDHKMQDLKELPSGKWFCCL 2187
Query: 862 DCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDG 921
+C RI+ ALQKL RGEEKLP++ L+VIK+KHE G ++ D++VRWR+L GK AS
Sbjct: 2188 ECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKL--ASPE 2245
Query: 922 TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSA 981
TR LLS+AV+IFHDRFDPII+S + DLIPAMVYGR+ RGQD+ G+YCA++TVN VVSA
Sbjct: 2246 TRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSA 2305
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
GI R+FGQE+AELPLVATS D QG+GYFQ LF CIEKLL FLNV++ VLP+A EA+ IWT
Sbjct: 2306 GILRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWT 2365
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIV 1082
KFGF +T ++ ++YR + MI FQGT ML+K VP+ R +
Sbjct: 2366 KKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWRRI 2407
Score = 39.7 bits (91), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 13 VLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVLGPPEVKKLKT 67
V+S G+KRE FAL+V S+ CG +GRTR+ K Q+ ++ V KKLKT
Sbjct: 1525 VVSGFWKGMKRELAFALEVHSQ-CGLIGRTRSTKAQTQNETINCNGVAWNESNKKLKT 1581
>gi|297740008|emb|CBI30190.3| unnamed protein product [Vitis vinifera]
Length = 879
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/908 (47%), Positives = 562/908 (61%), Gaps = 102/908 (11%)
Query: 210 CNQVLTNVEENEHSEVDREKAEND---LIGEVKNEFEEVVAV--------VEEEKKDESD 258
CN V N + ++ E ND I E KN ++ V+ +EE K DE
Sbjct: 35 CNGVAWNESNKKLKTLNDELEGNDSSDRIRETKNRWDGVIQYSRNKRLKRLEESKNDERR 94
Query: 259 RVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGKHSSTLC 318
+A EP + DEE+K D +V +VG
Sbjct: 95 TIAE----------------EPKDDESTTDEEQKTDENDPVVVEKPTGGYLVGPICEEEP 138
Query: 319 NGESNVA--KSVAVDGN---------DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSL 366
+S A K + DG+ DE K +++ +E L +RFTRS L+ K + + SL
Sbjct: 139 KSQSQKASIKDESNDGSLKLQTAGLIDESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SL 197
Query: 367 SKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL---------KNFLESGILE 417
D + V ++ G V+ ++ P+K+ K K+ ++ LE+G+LE
Sbjct: 198 ESDYNFCNSVAIGVDEKTNGAVR--SLTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLE 255
Query: 418 GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPE 477
G V Y + KG L+G +KG+GI C C CKG++VV P+ FELHA S + +
Sbjct: 256 GYPVTY--DGRKKG---YRLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHAAK 310
Query: 478 YIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELM 537
YIYL+NGK L D+++VCKD+PLETLE ++ +GS +K++
Sbjct: 311 YIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGSFPVKRS------------------- 351
Query: 538 LLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPS-VELEESPAPSGELTDTSNRSPEPN 596
P +++M+P S ++ SPA S T R +P
Sbjct: 352 --------------------LPADEAAKMDPLGNSCIKRNNSPATSIHRTSERARLLKPI 391
Query: 597 SAQTSSHSKMKSSS-VKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYK 655
SS S + +SS KS GKIT+KD R+H+LVFEEGGL DG EV Y+ G+ K L GYK
Sbjct: 392 PVTKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEGGLPDGTEVAYYAGGK-KLLDGYK 450
Query: 656 KGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKE 715
KGFGI C CC+ EVS SQFEAHAGWASR+KP+ +IYTSNGVSLHEL+I LS R +S+++
Sbjct: 451 KGFGIFCWCCHCEVSASQFEAHAGWASRKKPYSYIYTSNGVSLHELAISLSKGRKYSARD 510
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
NDDLC IC DGG+LL CD CPRAFH C SLP IP W+CRYC N FQ+EKFVE+NANA
Sbjct: 511 NDDLCSICGDGGNLLLCDGCPRAFHRVCASLPSIPQDDWYCRYCQNMFQREKFVEHNANA 570
Query: 776 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 835
AAGR+ GVDP Q+ RCIRIV P+ E+ CVLCRG DF KS FG RT+ILCDQCE+E
Sbjct: 571 VAAGRVSGVDPIEQITKRCIRIV-NPEAEVSACVLCRGYDFSKSGFGPRTIILCDQCEKE 629
Query: 836 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 895
+H+GCL+DH M+DL+ELP GKW CC +C RI+ ALQKL RGEEKLP++ L+VIK+KHE
Sbjct: 630 FHIGCLRDHKMQDLKELPSGKWFCCLECIRIHSALQKLHVRGEEKLPDSLLNVIKEKHER 689
Query: 896 SGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY 955
G ++ D++VRWR+L GK AS TR LLS+AV+IFHDRFDPII+S + DLIPAMVY
Sbjct: 690 KGLESIADYNVRWRLLSGKL--ASPETRVLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVY 747
Query: 956 GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCC 1015
GR+ RGQD+ G+YCA++TVN VVSAGI R+FGQE+AELPLVATS D QG+GYFQ LF C
Sbjct: 748 GRNVRGQDFSGLYCAVITVNSHVVSAGILRVFGQEVAELPLVATSVDNQGRGYFQILFSC 807
Query: 1016 IEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQK 1074
IEKLL FLNV++ VLP+A EA+ IWT KFGF +T ++ ++YR + MI FQGT ML+K
Sbjct: 808 IEKLLAFLNVRSFVLPAAEEAECIWTKKFGFKKITPDQLSEYRKSFYQMISFQGTCMLEK 867
Query: 1075 PVPKCRIV 1082
VP+ R +
Sbjct: 868 GVPEWRRI 875
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 133/558 (23%), Positives = 211/558 (37%), Gaps = 150/558 (26%)
Query: 21 LKREFEFALKVQSEICGSLGRTRARKVQS---NVDSGCVLGPPEVKKLKT---------- 67
+KRE FAL+V S+ CG +GRTR+ K Q+ ++ V KKLKT
Sbjct: 1 MKRELAFALEVHSQ-CGLIGRTRSTKAQTQNETINCNGVAWNESNKKLKTLNDELEGNDS 59
Query: 68 ----------------YESRKKRKRQEQS------VVVKETED---KREEEVKSDVFD-V 101
Y K+ KR E+S + +E +D +EE K+D D V
Sbjct: 60 SDRIRETKNRWDGVIQYSRNKRLKRLEESKNDERRTIAEEPKDDESTTDEEQKTDENDPV 119
Query: 102 INERER------PIREKESKDDSENMGVGERGALMNVEEVKVVSERREEGNDEFGKV-VI 154
+ E+ PI E+E K S+ + ++E ND K+
Sbjct: 120 VVEKPTGGYLVGPICEEEPKSQSQKASI------------------KDESNDGSLKLQTA 161
Query: 155 GVEEEKKNECDEVLMNVEENKYGELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVL 214
G+ +E K E+ + +EE R + E V E N CN V
Sbjct: 162 GLIDESK----EIDIAMEEKL---------PKRFTRSALKSKEDTVESLESDYNFCNSVA 208
Query: 215 TNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGL 274
V+E + V R +G ++ KK ++V + + ++ E G+
Sbjct: 209 IGVDEKTNGAV-RSLTSPKKLG------------LKMSKKIALNKVPLTIRDLL--ETGM 253
Query: 275 GKEYEPGRVQMEMDEEKKN-DIERELVENGVLESSMVGKHSSTL---------CNGESNV 324
+ Y + D KK ++ + NG+L S + K S + C +
Sbjct: 254 LEGYP-----VTYDGRKKGYRLQGTIKGNGILCSCSLCKGSRVVLPSQFELHACKSYRHA 308
Query: 325 AKSVAVD-GNDEGKTVNVVVERPLRRFTRSLLQQKVEL-AKGSLSKDGGKRSDVTEVA-- 380
AK + +D G + ++V + PL ++ K SL D + D +
Sbjct: 309 AKYIYLDNGKNLHDVLHVCKDAPLETLEATIQSAIGSFPVKRSLPADEAAKMDPLGNSCI 368
Query: 381 --NDGVGGPVKQET----VMKPRKVMRKFYSKLKN---------------------FLES 413
N+ + + + ++KP V + S L N F E
Sbjct: 369 KRNNSPATSIHRTSERARLLKPIPVTKSSGSALYNSSENKSLGKITKKDQRLHRLVFEEG 428
Query: 414 GILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG-SSN 472
G+ +G V Y G K L G KG GI C+C C+ V+ + FE HAG +S
Sbjct: 429 GLPDGTEVAYYAGGK------KLLDGYKKGFGIFCWCCHCE----VSASQFEAHAGWASR 478
Query: 473 KRPPEYIYLENGKTLRDI 490
K+P YIY NG +L ++
Sbjct: 479 KKPYSYIYTSNGVSLHEL 496
>gi|297822481|ref|XP_002879123.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297324962|gb|EFH55382.1| protein binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1026
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/1107 (44%), Positives = 650/1107 (58%), Gaps = 125/1107 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
MA GT + E FVV R G KRE +F LK QSEICG SLGRTR + N +S V
Sbjct: 1 MAKGTATGE-FVV----RTGCKRELQFVLKSQSEICGGESLGRTRGSR-NLNGESRSV-- 52
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDS 118
EVKK R +S + K K EE + SD V + E E K D
Sbjct: 53 --EVKK-------SVRGSGLKSGIKKMRFSKDEEVLMSDTVGV------GLEEDEGKSD- 96
Query: 119 ENMGVGERGALMNVEEVK-VVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG 177
+++VEE K V + +E N E G N D V+ N E+
Sbjct: 97 ----------VVDVEETKGFVKDEGQEENVEIG-----------NGDDNVVTN--EDHLL 133
Query: 178 ELDGMGGSARTEEEK--NECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLI 235
E G TE EK + G+ V+ + E C L+ V E S + K E L+
Sbjct: 134 EQPEKGSLEVTEAEKVCSIGGDSVI--DREIVVACPAGLS-VLEKMASRSCKVKLERGLV 190
Query: 236 -----GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEE 290
+ +V + E DE D V + + + E+ +G G V EM+EE
Sbjct: 191 YAKPCKRLTRSMLKVEGIKSEVNVDE-DHVNPEKDAIDSEDNCVGVS---GSVACEMEEE 246
Query: 291 KKNDIERELVENGVLESSMVGKHSSTLCNG-ESNVAKSVAVDGNDEGKTVNVVVERPLRR 349
E+ SS + HS LC G S + + + K VN VV +PLRR
Sbjct: 247 LHEQNHAEICLGLPSRSSQMSGHS--LCLGLPSRSFQMSGLSQRVDKKAVNDVVAKPLRR 304
Query: 350 FTRSLLQQKVELAKGSLSKDGGKRSDVTEV---ANDGVGGPVKQETVM------KPRKVM 400
FTRSL++Q+ + +L + + +D+ EV AND +V +P+K +
Sbjct: 305 FTRSLVKQESDSDNPNLGNNT-ELADLGEVDMHANDVEMDDFHSPSVTTHNRRGRPKKFL 363
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
R F +KLK GILEG++V Y+RG+KV+ G GL+GV+KGSG+ CFC CKG QVV+
Sbjct: 364 RNFPAKLKEIFNCGILEGLTVYYLRGAKVREAGTRGLKGVIKGSGVLCFCCACKGIQVVS 423
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
A++E+HA S+NKRPPEYI LE+G TLRD+MN CK++P TLE+ +R+V+G + +KK++
Sbjct: 424 TAMYEVHASSANKRPPEYILLESGFTLRDVMNACKETPSATLEEKLRVVVGPN-LKKSSL 482
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKE--PLSHSSEMEPQPPSV--ELE 576
CLNC+ + +++CKSC+E KE + ++ K L+ SS P S+ +
Sbjct: 483 CLNCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKGNGALNGSSRPSVDPKSILSRSK 542
Query: 577 ESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGL 634
SP S + +S EP + S SK S S GK+TRKD+R+HKLVFE+ L
Sbjct: 543 SSPRQSNRQEQPTRKSTEPGVVPGTILSESKSSSIKSNSQGKLTRKDVRLHKLVFEDDIL 602
Query: 635 EDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSN 694
DG EVGYFV GEV SPS FEAHAG ASRRKPFQHIYT+N
Sbjct: 603 PDGTEVGYFVAGEV---------------------SPSSFEAHAGCASRRKPFQHIYTTN 641
Query: 695 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
GVSLHELS+ LS+++ FS ENDDLC IC DG C SL +PS W
Sbjct: 642 GVSLHELSVALSMDQRFSIHENDDLCSICRDGV---------------CASLSSLPSERW 686
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCRG 813
C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIR+V + TEL CVLCRG
Sbjct: 687 SCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRVVSSFGTELPSVCVLCRG 746
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 873
FC+ F RTVI+CDQCE+E+HVGCLK+H + DL+ELP+ KW C DC++IN L L
Sbjct: 747 HSFCRLGFNSRTVIICDQCEKEFHVGCLKEHNIADLKELPEEKWFCSVDCEKINTTLGNL 806
Query: 874 VDRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 930
+ RGEEKL L+ I+ K +EES D+ D+RWRVL GK + +SD T+ LL+KAV
Sbjct: 807 IIRGEEKLTNNILNFIRTKEKPNEESCPDDNTTPDIRWRVLSGK-LTSSDETKILLAKAV 865
Query: 931 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
SI H+RFDPI E+ ++ DLIPAMVYGR +GQD+ GMYC +L V++V+VS GIFR+FG E
Sbjct: 866 SILHERFDPISETGTRGDLIPAMVYGRQAKGQDFSGMYCTMLAVDEVIVSVGIFRVFGSE 925
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
LAELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ MT
Sbjct: 926 LAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMT 985
Query: 1051 EEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
+EE +YR DY +MIF GTSML+K VP
Sbjct: 986 DEEVKEYRKDYSVMIFHGTSMLRKSVP 1012
>gi|147843889|emb|CAN79441.1| hypothetical protein VITISV_017668 [Vitis vinifera]
Length = 848
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/683 (55%), Positives = 487/683 (71%), Gaps = 52/683 (7%)
Query: 375 DVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
+++ V N K+ + + PR V R+F +KLK L++GILE + V YIRGS+ +G G
Sbjct: 181 EISAVNNGEENTGTKRSSGLVPR-VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGE 239
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
SGLRGV+KGSGI C C+ CKG +VVTP +FELHAGSSNKRPPEYIYLENG +LR +MN
Sbjct: 240 SGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAW 299
Query: 495 KDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSA 554
K++ L++L++A+R+ +G S +KK+ FCLNC+ S AG+ +LC SC++LKESQA
Sbjct: 300 KNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNSKVLCLSCLQLKESQA--- 356
Query: 555 EIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS- 613
SP+ +D+ RSP+P++ S+ S K SS S
Sbjct: 357 -----------------------SPSQVTGSSDSHLRSPKPSTISRSAESVSKCSSSGSK 393
Query: 614 -HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEV--KFLVGYKKGFGILCTCCNSEVS 670
+G++T+KDL +HKLVF E GL +G EVGY+VRG+V + LVGYK+G GI CTCCNSEVS
Sbjct: 394 SYGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQVVTQLLVGYKRGSGIXCTCCNSEVS 453
Query: 671 PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLL 730
PSQFEAHAGWASRRKP+ HIYTSNGVSLHE SI LS R S +NDDLC IC+DGG+LL
Sbjct: 454 PSQFEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLL 513
Query: 731 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM 790
CCD CPR FH +CVSL IP G W C++C N QKEKFVE+NANA AAGR+ GVDP Q+
Sbjct: 514 CCDGCPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQI 573
Query: 791 VSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQ 850
RCIRIV T E+GGC LCR +F +S FG RTV+LCDQCE+E+HVGCL++H M+DL+
Sbjct: 574 TKRCIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLK 633
Query: 851 ELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 910
E+PKGKW CC DCKRIN +LQKLV GEE+LP L IK+K+ +GS + D D++WR+
Sbjct: 634 EVPKGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRL 693
Query: 911 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 970
+ G++ + + +LLS+A+SIFH++FDPI ++A + DL+P MV+G
Sbjct: 694 ICGRRASSIEAG-SLLSQALSIFHEQFDPIADAAGR-DLLPDMVHGSQ------------ 739
Query: 971 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1030
VVSA FRIFG+E+AELPLVAT +DCQGQGYFQ+LF C+E LLG L V++LVL
Sbjct: 740 -------VVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVL 792
Query: 1031 PSASEAQAIWTNKFGFSMMTEEE 1053
P+A A++IWTNKFGF+ +T+E+
Sbjct: 793 PAAEGAESIWTNKFGFNKVTQEQ 815
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/45 (73%), Positives = 36/45 (80%), Gaps = 1/45 (2%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRAR 45
MA GTDSEE FVVLS++R G KREF FA+K QS I GSLGRTR R
Sbjct: 1 MAKGTDSEE-FVVLSRVRPGCKREFAFAVKAQSAIAGSLGRTRTR 44
>gi|359479418|ref|XP_002272497.2| PREDICTED: uncharacterized protein LOC100255152 [Vitis vinifera]
Length = 863
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/737 (51%), Positives = 497/737 (67%), Gaps = 51/737 (6%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 170 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSSALVPSTLKGFPTKL 216
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 217 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 276
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 277 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 335
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A A QP T
Sbjct: 336 ISEVGTSDSLVLCESCVGLKESHASPA--------------QP---------------TG 366
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
TS+RS +T++ SK SS K++G++T+KD+ +HKL F E L +G+EV Y+VRGE
Sbjct: 367 TSDRS-----TKTTTVSKCSSSGSKNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRGE 421
Query: 648 VKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL 707
+ L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHELS+ L
Sbjct: 422 -RLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 480
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+NT QK K
Sbjct: 481 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 540
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
FVE NANA AAGR+ GVDP Q+ RCIRIV+ E GGC LCR +F S FG TV+
Sbjct: 541 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 600
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLD 887
+CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+LP+ L
Sbjct: 601 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 660
Query: 888 VIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL 947
+IKK++ GS + + D++WR+L G++ A++ +LLS+A+S+FH++F+PI ++ +
Sbjct: 661 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAG-SLLSQALSLFHEQFNPIADAEGR- 718
Query: 948 DLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQG 1007
DL+ MV+ S ++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +DCQGQG
Sbjct: 719 DLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQG 778
Query: 1008 YFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQ 1067
YFQ+L+ CIE+LL FL V +LVLP+A A+++W NKF F M +EE N D+ +M FQ
Sbjct: 779 YFQALYTCIERLLCFLQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQ 838
Query: 1068 GTSMLQKPVPKCRIVGK 1084
GTSMLQKPVP+ R + +
Sbjct: 839 GTSMLQKPVPEYRRISQ 855
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 1 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 54
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKR--EEEVKSD 97
P + L+ S KKR++ K ++R E+ V+S+
Sbjct: 55 PSRNRVLEN--SIKKRQKSSSLDSQKNNVEERFPEDRVRSN 93
>gi|449524528|ref|XP_004169274.1| PREDICTED: uncharacterized protein LOC101231774 [Cucumis sativus]
Length = 937
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/815 (48%), Positives = 509/815 (62%), Gaps = 75/815 (9%)
Query: 324 VAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGS-----LSKDGGK 372
+AK + V+GN G + +RFTRS L KVE LA GS +S GG
Sbjct: 140 IAKDIKVEGNLPGWEI--------KRFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGG- 190
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGP 432
+ +E N K E M + + K ++ E+G+LEG+ V+Y+ VK
Sbjct: 191 --ETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGLLEGVPVIYM---GVKKA 245
Query: 433 GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
GLRG +K SGI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D++
Sbjct: 246 DDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQYICLENGKSLLDLLK 305
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAG 552
CK S +TLE V+ ++ SS +K C +C+ F ++ V ++ LC SC E K S+
Sbjct: 306 ACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSS-VGQVGPLCPSCEESKRSKWM 363
Query: 553 SAEIKEPLSH-SSEMEPQPPSVE------------------LEESPAPSGELTDTSNRSP 593
P S + P+ + ++ + S E T S RSP
Sbjct: 364 LTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKSKSSEYTSIS-RSP 422
Query: 594 EPNSAQTSSHSK----MKSSSVK--------------------SHGKITRKDLRMHKLVF 629
+ S +K M+ S+K + KIT KD R+HKLVF
Sbjct: 423 RSAPMRIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKLVF 482
Query: 630 EEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 689
EE GL DG EV YF RG+ K L GYKKG GILC CCN VSPSQFE HAGW+SR+KP+ +
Sbjct: 483 EEDGLPDGTEVAYFARGQ-KLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAY 541
Query: 690 IYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 749
IYTSNGVSLHEL+I LS R +S+K+NDDLC IC+DGG+LL CD CPRAFH +C SL
Sbjct: 542 IYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSST 601
Query: 750 PSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCV 809
P G W+C++C N FQ+EKFVE+N NA AAGR+ GVDP Q+ RCIRIV+ +T+L GCV
Sbjct: 602 PRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGCV 661
Query: 810 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLA 869
LCRG DF KS FG RT+ILCDQCE+E+HVGCLKDH M L+ELP+GKW C C RI+ A
Sbjct: 662 LCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHSA 721
Query: 870 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
LQKL+ RG EKLP + L + +K E+ SD VD DV WR++ GK AS TR LLS+A
Sbjct: 722 LQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGK--IASPETRLLLSEA 779
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 989
++IFHDRFDPI++ S DLIPAMVYGR GQ++ GMYCAIL VN VVSA + R+FGQ
Sbjct: 780 IAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQ 839
Query: 990 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1049
++AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IWT KFGF +
Sbjct: 840 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 899
Query: 1050 TEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1083
++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 900 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 934
>gi|255556782|ref|XP_002519424.1| DNA binding protein, putative [Ricinus communis]
gi|223541287|gb|EEF42838.1| DNA binding protein, putative [Ricinus communis]
Length = 855
Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/752 (48%), Positives = 491/752 (65%), Gaps = 49/752 (6%)
Query: 347 LRRFTRSLLQQKVELAK------GSLSKDGGKRSDVTEVANDGVGGPVKQ--ETVMKPRK 398
LR+ TRS KVE + ++ + + DV +A P K+ E M +
Sbjct: 137 LRQLTRSNFTLKVEPVEVKVNGLETIDSEMISKVDVEMIAEGSALTPPKKNLELKMSKKI 196
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
+ +K E+G+LEG+ V+Y+ G K LRG +K GI C+C CKG +V
Sbjct: 197 ALDNIPMTVKELFETGLLEGVPVVYMGGKKA-----FCLRGTIKDVGILCYCSFCKGCRV 251
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
+ P+ FE+HA +R +YI ENGK+L D++N C++SPL++LE ++ + +K
Sbjct: 252 IPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRNSPLDSLEATIQSAISGLPKEKT 311
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEES 578
C C+ ++ V ++ LC SCVE KES P+ E
Sbjct: 312 FTCKRCKGTYPTILVGKVGPLCSSCVESKESNGS------------------PACE---- 349
Query: 579 PAPSGELTDTSNRSPEPNSAQTSSHSKMK--SSSVKSHGKITRKDLRMHKLVFEEGGLED 636
T+ +RS +P + S +S ++ SS K KIT KD R+HKLVFE+GGL D
Sbjct: 350 -------TNIKSRSSKPATVSKSLNSALEGVSSENKCQWKITTKDQRLHKLVFEDGGLPD 402
Query: 637 GAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 696
G EV Y+ RG+ K L+GYK+GFGILC CCN EVSPS FEAHAGWA+R+KP+ +IYTSNGV
Sbjct: 403 GTEVAYYARGQ-KLLMGYKRGFGILCCCCNCEVSPSTFEAHAGWATRKKPYAYIYTSNGV 461
Query: 697 SLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 756
SLHEL+I LS R +S+++NDDLC +C DGG L+ CD CPRAFH C SL IP G W C
Sbjct: 462 SLHELAISLSKGRKYSARDNDDLCIVCADGGSLILCDGCPRAFHKGCASLSSIPRGKWFC 521
Query: 757 RYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF 816
++C N FQ+EKFVE+NANA AAGRI GVDP Q+ RCIRIV+ + EL GCVLCRG DF
Sbjct: 522 QFCQNMFQREKFVEHNANAVAAGRISGVDPIEQITQRCIRIVKNIEAELTGCVLCRGYDF 581
Query: 817 CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR 876
+S FG RT+ILCDQC +E+HVGCL+ H + +L+ELPKGKW CC DC RI+ AL+KL+ R
Sbjct: 582 SRSGFGPRTIILCDQCGKEFHVGCLRSHKIANLKELPKGKWFCCPDCGRIHSALKKLLAR 641
Query: 877 GEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDR 936
E +P L+V+ KK+EE G + + DVRW++L GK AS T+ LLS+A++IF +
Sbjct: 642 EAEIIPNKLLEVVMKKNEEKGLETVNNIDVRWKLLTGK--SASPETKLLLSQALAIFQEC 699
Query: 937 FDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPL 996
FDPI+++ + DLIP MVYG++ +GQDY GMYCA+L VN VVSA I RIFGQE+AELPL
Sbjct: 700 FDPIVDTTGR-DLIPLMVYGKNSKGQDYGGMYCAVLMVNSFVVSAAIVRIFGQEVAELPL 758
Query: 997 VATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNK 1056
VATSN G+GYFQ LF IEKLL +L V ++VLP+A EA++IWT+KFGF + ++ +K
Sbjct: 759 VATSNGNHGKGYFQLLFSFIEKLLAYLKVHSIVLPAAEEAESIWTDKFGFQKIKPDQLSK 818
Query: 1057 YRND-YPLMIFQGTSMLQKPVPKCRIVGKSVD 1087
YR ++ F+GTSMLQK VP CRIV ++ +
Sbjct: 819 YRKSCCQILTFKGTSMLQKAVPPCRIVNQNTE 850
>gi|110741771|dbj|BAE98830.1| hypothetical protein [Arabidopsis thaliana]
Length = 636
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/633 (57%), Positives = 460/633 (72%), Gaps = 13/633 (2%)
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +KK++ CL
Sbjct: 1 MFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LKKSSLCL 59
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV--ELEES 578
+C+ + +++CKSC+E KE + ++ K + L+ SS P S+ + S
Sbjct: 60 SCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSILRRSKSS 119
Query: 579 PAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
P S + +S EP + S SK S SHGK+TRKDLR+HKLVFE+ L D
Sbjct: 120 PRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFEDDILPD 179
Query: 637 GAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 696
G EVGYFV GE K LVGYKKGFGI C+CCN VSPS FEAHAG ASRRKPFQHIYT+NGV
Sbjct: 180 GTEVGYFVAGE-KMLVGYKKGFGIHCSCCNKVVSPSTFEAHAGCASRRKPFQHIYTTNGV 238
Query: 697 SLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 756
SLHELS+ LS+++ FS ENDDLC IC DGG+L+CCD+CPR++H C SLP +PS W C
Sbjct: 239 SLHELSVALSMDQRFSIHENDDLCSICRDGGELVCCDTCPRSYHKVCASLPSLPSERWSC 298
Query: 757 RYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CVLCRGRD 815
+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CVLCRG
Sbjct: 299 KYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCVLCRGHS 358
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
FC+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN L L+
Sbjct: 359 FCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTTLGNLIV 418
Query: 876 RGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 932
RGEEKL L+ ++KK +EE+ D D+RWRVL GK + +SD T+ LL+KA+SI
Sbjct: 419 RGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGK-LTSSDDTKILLAKALSI 477
Query: 933 FHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELA 992
H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+FG ELA
Sbjct: 478 LHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELA 537
Query: 993 ELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEE 1052
ELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF+ MT+E
Sbjct: 538 ELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDE 597
Query: 1053 EQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1085
E +YR DY +MIF GTSML+K VP V K+
Sbjct: 598 EVKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 630
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 51/96 (53%), Gaps = 14/96 (14%)
Query: 398 KVMRKFYSKLKNFLESGIL-EGMSVMY-IRGSKVKGPGVSGLRGVVKGSGISCFCDDCKG 455
K+ RK K E IL +G V Y + G K+ L G KG GI C C
Sbjct: 159 KLTRKDLRLHKLVFEDDILPDGTEVGYFVAGEKM-------LVGYKKGFGIHCSC----C 207
Query: 456 NQVVTPAVFELHAGSSNKRPP-EYIYLENGKTLRDI 490
N+VV+P+ FE HAG +++R P ++IY NG +L ++
Sbjct: 208 NKVVSPSTFEAHAGCASRRKPFQHIYTTNGVSLHEL 243
>gi|147861524|emb|CAN83583.1| hypothetical protein VITISV_009664 [Vitis vinifera]
Length = 2427
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/760 (48%), Positives = 483/760 (63%), Gaps = 109/760 (14%)
Query: 334 DEGKTVNVVVERPL-RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
DE K +++ +E L +RFTRS L+ K + + SL D + V ++ G V+ +
Sbjct: 1762 DESKEIDIAMEEKLPKRFTRSALKSKEDTVE-SLESDYNFCNSVAIGVDEKTNGAVR--S 1818
Query: 393 VMKPRKVMRKFYSKL---------KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKG 443
+ P+K+ K K+ ++ LE+G+LEG V Y + KG L+G +KG
Sbjct: 1819 LTSPKKLGLKMSKKIALNKVPLTIRDLLETGMLEGYPVTY--DGRKKG---YRLQGTIKG 1873
Query: 444 SGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE 503
+GI C C CKG++VV P+ FELHA S + +YIYL+NGK L D+++VCKD+PLETLE
Sbjct: 1874 NGILCSCSLCKGSRVVLPSQFELHACKSYRHAAKYIYLDNGKNLHDVLHVCKDAPLETLE 1933
Query: 504 KAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHS 563
++ +GS +K++ P +
Sbjct: 1934 ATIQSAIGSFPVKRS---------------------------------------LPADEA 1954
Query: 564 SEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLR 623
++M+P S ++ + +P+ + TS R+ +D R
Sbjct: 1955 AKMDPLGNSC-IKRNNSPATSIHRTSERA---------------------------RDQR 1986
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+H+LVFEEGGL DG EV Y+ G+ K L GYKKGFGI C CC+ EVS SQFEAHAGWASR
Sbjct: 1987 LHRLVFEEGGLPDGTEVAYYAGGK-KLLDGYKKGFGIFCWCCHCEVSASQFEAHAGWASR 2045
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
+KP+ +IYTSNGVSLHEL+I LS R +S+++NDDLC IC DGG+LL CD CPRAFH C
Sbjct: 2046 KKPYSYIYTSNGVSLHELAISLSKGRKYSARDNDDLCSICGDGGNLLLCDGCPRAFHRVC 2105
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
SLP IP W+CRYC N FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P+
Sbjct: 2106 ASLPSIPQDDWYCRYCQNMFQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIV-NPEA 2164
Query: 804 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
E+ CVLCRG DF KS FG RT+ILCD QELP GKW CC +C
Sbjct: 2165 EVSACVLCRGYDFSKSGFGPRTIILCD-------------------QELPSGKWFCCLEC 2205
Query: 864 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 923
RI+ ALQKL RGEEKLP++ L+VIK+KHE G ++ D++VRWR+L GK AS TR
Sbjct: 2206 IRIHSALQKLHVRGEEKLPDSLLNVIKEKHERKGLESIADYNVRWRLLSGKL--ASPETR 2263
Query: 924 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGI 983
LLS+AV+IFHDRFDPII+S + DLIPAMVYGR+ RGQD+ G+YCA++TVN VVSAGI
Sbjct: 2264 VLLSEAVAIFHDRFDPIIDSVTGRDLIPAMVYGRNVRGQDFSGLYCAVITVNSHVVSAGI 2323
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
R+FGQE+AELPLVATS D QG+GYFQ LF CIEKLL FLNV++ VLP+A EA+ IWT K
Sbjct: 2324 LRVFGQEVAELPLVATSVDNQGRGYFQILFSCIEKLLAFLNVRSFVLPAAEEAECIWTKK 2383
Query: 1044 FGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIV 1082
FGF +T ++ ++YR + MI FQGT ML+K VP+ R +
Sbjct: 2384 FGFKKITPDQLSEYRKSFYQMISFQGTCMLEKGVPEWRRI 2423
>gi|449440157|ref|XP_004137851.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101203549
[Cucumis sativus]
Length = 946
Score = 717 bits (1852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/824 (48%), Positives = 510/824 (61%), Gaps = 84/824 (10%)
Query: 324 VAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVE------LAKGS-----LSKDGGK 372
+AK + V+GN G + +RFTRS L KVE LA GS +S GG
Sbjct: 140 IAKDIKVEGNLPGWEI--------KRFTRSSLGPKVEPMDITPLAIGSVKEEVISDVGG- 190
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGP 432
+ +E N K E M + + K ++ E+G+LEG+ V+Y+ VK
Sbjct: 191 --ETSETVNSLSTPKNKLELKMSKKIALNKRPMTVRELFETGLLEGVPVIYM---GVKKA 245
Query: 433 GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
GLRG +K SGI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D++
Sbjct: 246 DDFGLRGTIKDSGILCTCSSCNGCRVIPPSQFEIHACNQYKRAAQYICLENGKSLLDLLK 305
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAG 552
CK S +TLE V+ ++ SS +K C +C+ F ++ V ++ LC SC E K S+
Sbjct: 306 ACKGS-RQTLEATVQSLISSSPEEKHFTCRDCKGCFPSS-VGQVGPLCPSCEESKRSKWM 363
Query: 553 SAEIKEPLSH-SSEMEPQPPSVE------------------LEESPAPSGELTDTSNRSP 593
P S + P+ + ++ + S E T S RSP
Sbjct: 364 LTLPAPPTSGIGKRLRLAEPTTSKSSGSASVSISSRYKRKWVTKAKSKSSEYTSIS-RSP 422
Query: 594 EPNSAQTSSHSK----MKSSSVK--------------------SHGKITRKDLRMHKLVF 629
+ S +K M+ S+K + KIT KD R+HKLVF
Sbjct: 423 RSAPMRIPSKNKSALKMRKKSLKPALMLKSSQSASKCSSSLAKNQWKITTKDQRLHKLVF 482
Query: 630 EEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 689
EE GL DG EV YF RG+ K L GYKKG GILC CCN VSPSQFE HAGW+SR+KP+ +
Sbjct: 483 EEDGLPDGTEVAYFARGQ-KLLQGYKKGSGILCCCCNCVVSPSQFEVHAGWSSRKKPYAY 541
Query: 690 IYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 749
IYTSNGVSLHEL+I LS R +S+K+NDDLC IC+DGG+LL CD CPRAFH +C SL I
Sbjct: 542 IYTSNGVSLHELAISLSKGRKYSAKDNDDLCIICLDGGNLLLCDGCPRAFHKECASLSSI 601
Query: 750 PSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCV 809
P G W+C++C N FQ+EKFVE+N NA AAGR+ GVDP Q+ RCIRIV+ +T+L GCV
Sbjct: 602 PRGDWYCKFCQNMFQREKFVEHNVNAVAAGRVHGVDPIEQITKRCIRIVRNIETDLSGCV 661
Query: 810 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLA 869
LCRG DF KS FG RT+ILCDQCE+E+HVGCLKDH M L+ELP+GKW C C RI+ A
Sbjct: 662 LCRGSDFSKSGFGPRTIILCDQCEKEFHVGCLKDHKMAFLKELPRGKWFCSIVCTRIHSA 721
Query: 870 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
LQKL+ RG EKLP + L + +K E+ SD VD DV WR++ GK AS TR LLS+A
Sbjct: 722 LQKLLIRGPEKLPNSLLGAVNRKLGENCSDIQVDVDVSWRLISGK--IASPETRLLLSEA 779
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQ 989
++IFHDRFDPI++ S DLIPAMVYGR GQ++ GMYCAIL VN VVSA + R+FGQ
Sbjct: 780 IAIFHDRFDPIVDITSGRDLIPAMVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQ 839
Query: 990 ---------ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1040
++AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IW
Sbjct: 840 YCRAAIGCXDIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIW 899
Query: 1041 TNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1083
T KFGF + ++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 900 TEKFGFERIKPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 943
>gi|4510418|gb|AAD21504.1| hypothetical protein [Arabidopsis thaliana]
Length = 1008
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/819 (48%), Positives = 516/819 (63%), Gaps = 98/819 (11%)
Query: 295 IERELVENGVLESSMVGKHSSTLCNGESNV----AKSVAVDGNDEG---KTVNVVVERPL 347
+E EL+E +E + S+ NG S ++S + G+ +G K VN V++PL
Sbjct: 254 VEEELLEQNNVEICLGLPSRSSQMNGHSLCLGLPSRSFQMSGHSQGVDKKAVNDTVDKPL 313
Query: 348 RRFTRSLLQQKVELAKGSL--SKDGGKRSDVTEVAND---------GVGGPVKQETVMKP 396
RRFTRSL++Q+ + +L + + DV AND V P K+ +P
Sbjct: 314 RRFTRSLVKQESDSDNPNLGNTTEPADLVDVDMHANDVEMDGFQSPSVTTPNKRG---RP 370
Query: 397 RKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
+K +R F +KLK+ + GILEG+ V Y+RG+KV
Sbjct: 371 KKFLRNFPAKLKDIFDCGILEGLIVYYVRGAKV--------------------------- 403
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
V+PA+FELHA S+NKRPPEYI LE+G TLRD+MN CK++PL TLE+ +R+V+G +K
Sbjct: 404 --VSPAMFELHASSNNKRPPEYILLESGFTLRDVMNACKENPLATLEEKLRVVVGPI-LK 460
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIK--EPLSHSSEMEPQPPSV- 573
K++ CL+C+ + +++CKSC+E KE + ++ K + L+ SS P S+
Sbjct: 461 KSSLCLSCQGPMIEPCDTKSLVVCKSCLESKEPEFHNSPSKANDALNGSSRPSVDPKSIL 520
Query: 574 -ELEESPAPSGELTDTSNRSPEPNSAQTS--SHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+ SP S + +S EP + S SK S SHGK+TRKDLR+HKLVFE
Sbjct: 521 RRSKSSPRQSNRREQPTRKSTEPGVVPGTILSESKNSSIKSNSHGKLTRKDLRLHKLVFE 580
Query: 631 EGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
+ L DG EVGYFV GEV SPS FEAHAG ASRRKPFQHI
Sbjct: 581 DDILPDGTEVGYFVAGEV---------------------SPSTFEAHAGCASRRKPFQHI 619
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 750
YT+NGVSLHELS+ LS+++ FS ENDDLC IC DG C SLP +P
Sbjct: 620 YTTNGVSLHELSVALSMDQRFSIHENDDLCSICRDGV---------------CASLPSLP 664
Query: 751 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG-CV 809
S W C+YC+N ++EKFV+ N NA AAGR++GVD A++ +RCIRIV + TEL CV
Sbjct: 665 SERWSCKYCVNMVEREKFVDSNLNAIAAGRVQGVDAIAEITNRCIRIVSSFVTELPSVCV 724
Query: 810 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLA 869
LCRG FC+ F RTVI+CDQCE+E+HVGCLK+ + DL+ELP+ KW C C+ IN
Sbjct: 725 LCRGHSFCRLGFNARTVIICDQCEKEFHVGCLKERDIADLKELPEEKWFCSLGCEEINTT 784
Query: 870 LQKLVDRGEEKLPETSLDVIKKK---HEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
L L+ RGEEKL L+ ++KK +EE+ D D+RWRVL GK + +SD T+ LL
Sbjct: 785 LGNLIVRGEEKLSNNILNFLRKKEQPNEENCPDYKTTPDIRWRVLSGK-LTSSDDTKILL 843
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 986
+KA+SI H+RFDPI ES +K DLIPAMVYGR + QD+ GMYC +L V++V+VS GIFR+
Sbjct: 844 AKALSILHERFDPISESGTKGDLIPAMVYGRQTKAQDFSGMYCTMLAVDEVIVSVGIFRV 903
Query: 987 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1046
FG ELAELPLVATS DCQGQGYFQ LF CIE+LLGFLNVK +VLP+A EA++IWT+KFGF
Sbjct: 904 FGSELAELPLVATSKDCQGQGYFQCLFACIERLLGFLNVKHIVLPAADEAKSIWTDKFGF 963
Query: 1047 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKS 1085
+ MT+EE +YR DY +MIF GTSML+K VP V K+
Sbjct: 964 TKMTDEEVKEYRKDYSVMIFHGTSMLRKSVPAPSAVSKT 1002
>gi|356570792|ref|XP_003553568.1| PREDICTED: uncharacterized protein LOC100802562 [Glycine max]
Length = 796
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/745 (50%), Positives = 500/745 (67%), Gaps = 31/745 (4%)
Query: 342 VVERPLRRFTRSLLQQKVELA-KGSLSKDGGKRSDVTEVANDGVGGPV-KQETVMKPRKV 399
VV R +R RS L+ KVE + +S+ ++ N +G P K E M + V
Sbjct: 75 VVARTRKRSRRSALEAKVECCDQMVVSETEQVVANGGSGINGALGAPRNKMELKMSKKIV 134
Query: 400 MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
+ + +K ++G L+G+SV+Y+ G K SGLRGV++ GI C C C G +V+
Sbjct: 135 VNRKPMTVKKLFDTGFLDGVSVVYMGGIK----KASGLRGVIRDGGILCSCCLCNGRRVI 190
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
P+ FE+HA +R +YI LENGK+L D++ C+ + L TLE V+ + S ++
Sbjct: 191 PPSQFEIHACKQYRRAAQYICLENGKSLLDLLRACRGATLHTLEVTVQNFVCSPHEERYF 250
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
C C+ F ++ VE + +C+SCVE ++S+ S + S P+P
Sbjct: 251 TCKRCKGCFPSSFVERVGPICRSCVESRKSEESSNNVVGKRVRS----PRPVV------- 299
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSV-KSHGKITRKDLRMHKLVFEEGGLEDGA 638
L++ S+ S S+Q H K ++ V S + D R+HKLVFEE GL DG
Sbjct: 300 -----LSNPSSTSELSVSSQVKRHRKKRTKLVFISISSVL--DQRLHKLVFEENGLPDGT 352
Query: 639 EVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL 698
EV Y+ RG+ K L G+K G GI+C CCN+E+SPSQFE HAGWASR+KP+ +IYTSNGVSL
Sbjct: 353 EVAYYARGQ-KLLEGFKMGSGIVCRCCNTEISPSQFEVHAGWASRKKPYAYIYTSNGVSL 411
Query: 699 HELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 758
HEL+I LS +R +S+K+NDDLC +C DGG+LL CD CPRAFH +C +L IP G W+C++
Sbjct: 412 HELAISLSKDRKYSAKDNDDLCIVCWDGGNLLLCDGCPRAFHKECAALSSIPRGDWYCQF 471
Query: 759 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 818
C N FQ+EKFV +NANA AAGR+EGVDP Q+ +RCIRIV+ + +L C LCRG DF +
Sbjct: 472 CQNMFQREKFVAHNANAVAAGRVEGVDPIEQIANRCIRIVKDIEADLSSCALCRGVDFSR 531
Query: 819 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
S FG RT+ILCDQCE+EYHVGCL+DH M L+ELP+G WLCC DC RI+ L+ L+ +G
Sbjct: 532 SGFGPRTIILCDQCEKEYHVGCLRDHKMAYLKELPEGNWLCCNDCTRIHSTLENLLVKGA 591
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
E+LPE+ L VIKKK EE G + + DVRWR+L GK AS TR LL +AVSIFH+ F+
Sbjct: 592 ERLPESLLGVIKKKQEEKGLEPII--DVRWRLLNGK--IASPETRPLLLEAVSIFHECFN 647
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVA 998
PI+++AS DLIPAMVYGR+ RGQ++ GMYCA+L VN VVSAG+ RIFG ++AELPLVA
Sbjct: 648 PIVDAASGRDLIPAMVYGRNVRGQEFGGMYCALLIVNSSVVSAGMLRIFGSDVAELPLVA 707
Query: 999 TSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR 1058
TSN G+GYFQ+LF CIE+LL FLNVK LVLP+A EA++IWT+KFGFS M +E YR
Sbjct: 708 TSNGNHGKGYFQTLFSCIERLLAFLNVKNLVLPAAEEAESIWTDKFGFSKMNPDELTNYR 767
Query: 1059 -NDYPLMIFQGTSMLQKPVPKCRIV 1082
N + ++ F+GT+ML K VP CR++
Sbjct: 768 KNCHQMVSFKGTNMLHKMVPSCRVI 792
>gi|224140243|ref|XP_002323493.1| predicted protein [Populus trichocarpa]
gi|222868123|gb|EEF05254.1| predicted protein [Populus trichocarpa]
Length = 741
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/726 (49%), Positives = 471/726 (64%), Gaps = 51/726 (7%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K E+G+LEG+ V+Y+ G K + GLRG +K GI C C C G +V+ P+ FE+
Sbjct: 14 VKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDVGILCSCAFCNGRRVIPPSQFEI 70
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N C+ +PL++LE ++ + +++ C C+
Sbjct: 71 HAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVERTFTCKRCKG 130
Query: 527 SFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVE--LEESPAPSGE 584
F + V ++ LC C E KES +S + P + + SG+
Sbjct: 131 IFPSICVGKIGPLCNLCAESKESHPTLTIGSSIISRYCQNLPSLILISWIINLKTITSGQ 190
Query: 585 ----LTDTSNR----SPE---------PNSAQTSSH--------------------SKMK 607
L S R SPE P S T S SK+
Sbjct: 191 FLLMLAHCSFRLSFLSPEQVLALEYFKPASLSTFSQDNTLRKKKRKPEEPDLIAKPSKVA 250
Query: 608 SSSV----KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCT 663
S + + + KI+ +D R+H+LVFEEGGL DG E+ Y+ RG+ K L GYK+GFGILC
Sbjct: 251 SVHLSPRKRKYKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQ-KLLGGYKRGFGILCH 309
Query: 664 CCNSEV-SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGI 722
CCN EV SPS FEAHAGWA+R+KP+ IYTSNGVSLH+L+I LS R +SS++NDDLC I
Sbjct: 310 CCNCEVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHDLAISLSKSRKYSSQDNDDLCII 369
Query: 723 CMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIE 782
C DGGDLL CD CPRAFH C SL +PSG W+C++C NTFQ+EKFVE+NANA AAGR+
Sbjct: 370 CADGGDLLLCDGCPRAFHKGCASLSTVPSGDWYCQHCQNTFQREKFVEHNANAFAAGRVS 429
Query: 783 GVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLK 842
+D Q+ RC RIV+ + EL GC LCRG DF +S FG RT+ILCDQCE+E+HVGCL+
Sbjct: 430 EIDSIEQITKRCFRIVKNVEAELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLR 489
Query: 843 DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAV 902
H M +L+ELPKG W CC DC RI+ LQKL+ RG EKLP++ L+ IKKKHEE G + +
Sbjct: 490 SHKMANLKELPKGNWFCCMDCSRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEEKGLNISN 549
Query: 903 DFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ 962
DVRW +L GK AS + LLS+A+SIF + FDPI++S DLIP MVYG++ +GQ
Sbjct: 550 SIDVRWTLLSGK--IASPENKLLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQ 607
Query: 963 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1022
DY GMYCA+L VN +VSAGI R+FG+E+AELPLVAT N G+GYFQ LF CIEKLL F
Sbjct: 608 DYGGMYCAVLIVNSCIVSAGILRVFGEEVAELPLVATRNGDHGKGYFQLLFSCIEKLLAF 667
Query: 1023 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRI 1081
LNV+ LVLP+A EA++IW KFGF + E+ +KYR + M+ F+GTSMLQK VP C+I
Sbjct: 668 LNVQNLVLPAAEEAESIWIEKFGFQKIKPEQLSKYRKNCCQMVRFEGTSMLQKAVPTCKI 727
Query: 1082 VGKSVD 1087
V +S++
Sbjct: 728 VNQSIE 733
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 25/121 (20%)
Query: 382 DGVGGPVKQETV-MKPRKVMRKFYSKLKN---------FLESGILEGMSVMYI-RGSKVK 430
D + P K +V + PRK RK Y K+ F E G+ +G + Y RG K+
Sbjct: 241 DLIAKPSKVASVHLSPRK--RK-YKKISPRDQRLHRLVFEEGGLPDGTELAYYARGQKLL 297
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG-SSNKRPPEYIYLENGKTLRD 489
G G +G GI C C +C +VV+P+ FE HAG ++ K+P IY NG +L D
Sbjct: 298 G-------GYKRGFGILCHCCNC---EVVSPSTFEAHAGWATRKKPYACIYTSNGVSLHD 347
Query: 490 I 490
+
Sbjct: 348 L 348
>gi|357510879|ref|XP_003625728.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500743|gb|AES81946.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 730
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/697 (50%), Positives = 454/697 (65%), Gaps = 50/697 (7%)
Query: 434 VSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV 493
VSGLRGV++ GI C C C+G +V++P+ FE+HA +R EYI ENGK+L D++
Sbjct: 32 VSGLRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRA 91
Query: 494 CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL-----MLLCKSCVELKE 548
C+ +PL LE ++ ++ S +K C C+ F ++ +E + + S E
Sbjct: 92 CRGAPLHDLEATIQNIVCSPPEEKYFTCKRCKGRFPSSCMERVGPICSSCVESSKSEESS 151
Query: 549 SQAGSAEIKEPL-------SHSSEMEPQPPSVELEESPA--------------------P 581
S I+ P S +SEM P P
Sbjct: 152 KNVVSKRIRSPRPVLVSKSSCASEMSISPKIKRRGRKRRKSSKRVNSSNSSKSASVPILP 211
Query: 582 SGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRK----------DLRMHKLVFEE 631
++T + + +T+S+S S +KS KIT+K D R+HKLVFEE
Sbjct: 212 RRKVTPKTKKKSLSVKLKTTSNSNCLSPQIKSEWKITKKLVPYSFPTCGDNRLHKLVFEE 271
Query: 632 GGLEDGAEVGYFVRGEV-----KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
GL DG+E+ Y+ G++ K L G+KKG GI+C CCN+E+SPSQFE HAGWASR+KP
Sbjct: 272 NGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRCCNTEISPSQFEVHAGWASRKKP 331
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
+ +IYTSNGVSLHELSI LS +R +S+ +NDDLC +C DGG+LL CD CPRAFH +C SL
Sbjct: 332 YAYIYTSNGVSLHELSISLSKDRKYSANDNDDLCVVCWDGGNLLLCDGCPRAFHKECASL 391
Query: 747 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
IP G W+C++C N FQ+EKFV YN NA AAGR+EGVDP Q+ RCIRIV+ D EL
Sbjct: 392 SSIPRGDWYCQFCQNMFQREKFVAYNVNAFAAGRVEGVDPIEQITKRCIRIVKDIDAELS 451
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
C LCRG DF KS FG RT+ILCDQCE+EYHVGCL+DH M L+ELPKG WLCC DC RI
Sbjct: 452 ACALCRGVDFSKSGFGPRTIILCDQCEKEYHVGCLRDHKMTFLKELPKGNWLCCNDCTRI 511
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
+ L+ ++ RG E+LP++ L VIKKK EE G D D +VRWR+L GKK AS TR LL
Sbjct: 512 HSTLENVLVRGAERLPKSLLAVIKKKQEEKGLDPINDINVRWRLLSGKK--ASPETRPLL 569
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 986
+AVSIFH+ FDPI+++ S DLI AMVYG+S RGQ++ GMYCA+L VN VVSAG+ RI
Sbjct: 570 LEAVSIFHECFDPIVDAVSGRDLIRAMVYGKSVRGQEFGGMYCALLIVNSSVVSAGMLRI 629
Query: 987 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1046
FG ++AELPLVATSN G+GYFQ+LF CIE+LL F+ VK LVLP+A EAQ+IWT+KFGF
Sbjct: 630 FGTDIAELPLVATSNSQHGKGYFQALFSCIERLLAFMKVKNLVLPAAEEAQSIWTDKFGF 689
Query: 1047 SMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIV 1082
S + +E Y RN + FQGT+ML K VP CR++
Sbjct: 690 SKIKPDELANYRRNCNQFVTFQGTNMLHKMVPPCRVI 726
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 46/82 (56%), Gaps = 5/82 (6%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ +G + Y G ++ L G KGSGI C C N ++P+ FE+HAG
Sbjct: 269 FEENGLPDGSELAYYAGGQLYSDRQKLLEGFKKGSGIVCRC----CNTEISPSQFEVHAG 324
Query: 470 -SSNKRPPEYIYLENGKTLRDI 490
+S K+P YIY NG +L ++
Sbjct: 325 WASRKKPYAYIYTSNGVSLHEL 346
>gi|356533354|ref|XP_003535230.1| PREDICTED: uncharacterized protein LOC100798276 [Glycine max]
Length = 745
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/689 (49%), Positives = 460/689 (66%), Gaps = 34/689 (4%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
V+ K + +K ++G+L+G+ V+Y+ K + LRG +K GI C C C G +V
Sbjct: 90 VVHKKPATVKELFQTGLLDGVPVVYV---GCKKDSTTELRGEIKDGGILCSCSLCNGRRV 146
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
+ P+ FE+HA + KR +YI LENGK++ ++M C+ +PL TLE ++ + S +K
Sbjct: 147 IPPSQFEIHACNIYKRAAQYICLENGKSMLELMRACRAAPLHTLEATIQNFINSPPEEKY 206
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEES 578
C NCR F ++ VE + LLC SCVE ++S+ S SS++ SV+L+ +
Sbjct: 207 FTCKNCRGCFPSSNVERVGLLCLSCVESRKSEKSSIHAVGKRIRSSKL-----SVKLKTA 261
Query: 579 PAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGA 638
P S K S KS +I+++ R+HKL+FEE GL +GA
Sbjct: 262 PITS----------------------KCLSPQNKSQWRISKRYQRLHKLIFEEDGLPNGA 299
Query: 639 EVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL 698
EV Y+ RG+ K L G K GI+C CCN+E+SPSQFE HAGWASRRKP+ IYTSNGVSL
Sbjct: 300 EVAYYARGQ-KLLEGIKTCSGIVCRCCNTEISPSQFEVHAGWASRRKPYAFIYTSNGVSL 358
Query: 699 HELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 758
HEL+I LS + ++K+ND +C +C DGG+LL CD CPRAFH +C S+ IP G W+C+
Sbjct: 359 HELAIFLSKDHKCTTKQNDYVCVVCWDGGNLLLCDGCPRAFHKECASVSSIPRGEWYCQI 418
Query: 759 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 818
C +TF +E+ V +NA+A AAGR+EGVDP Q+ RCIRIV+ E+GGCVLCR DF +
Sbjct: 419 CQHTFLRERPVLHNADAVAAGRVEGVDPIEQIAKRCIRIVKDIGAEMGGCVLCRSSDFSR 478
Query: 819 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
S FG RT+I+CDQCE+EYHVGCL+DH M L+ELP+G W CC DC RI+ L+ L+ R
Sbjct: 479 SGFGPRTIIICDQCEKEYHVGCLRDHKMAYLKELPEGDWFCCNDCTRIHSTLENLLIRVA 538
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
E+LPE+ LDVIKKK + + DVRW++L GK AS TR LL +AVS+FH+ FD
Sbjct: 539 ERLPESLLDVIKKKQVGRCLEPLNEIDVRWKLLNGK--IASPETRPLLLEAVSMFHECFD 596
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVA 998
PI++ A+ DLIPAMVYGR+ + QD+ GMYCA+L VN VVSAG+ RIFG+++AELPLVA
Sbjct: 597 PIVDPAAGRDLIPAMVYGRNLQTQDFGGMYCALLIVNSSVVSAGMVRIFGRDIAELPLVA 656
Query: 999 TSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR 1058
T +G+GYFQ+LF CIE+LL FLNVK LVLP+A EA +IWT KFGFS M + YR
Sbjct: 657 TRYKNRGKGYFQTLFACIERLLAFLNVKNLVLPAAEEAASIWTEKFGFSKMKPNQLTNYR 716
Query: 1059 -NDYPLMIFQGTSMLQKPVPKCRIVGKSV 1086
N + +M F+GT+ML K VP+CR++ +
Sbjct: 717 MNCHQIMAFKGTNMLHKTVPQCRVINTQL 745
>gi|147857667|emb|CAN78676.1| hypothetical protein VITISV_001802 [Vitis vinifera]
Length = 844
Score = 677 bits (1748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/743 (48%), Positives = 472/743 (63%), Gaps = 82/743 (11%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 170 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSXAXVPSTLKGFPTKL 216
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 217 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 276
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 277 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 335
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A P P+ + ++ PS
Sbjct: 336 ISEVGTSDSLVLCESCVGLKESHA---------------SPAQPTGQQKQRLCPS----- 375
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
++H +++ I D+ +HKL F E L +G+EV Y+VRGE
Sbjct: 376 -------------AAHQEVR---------IMGGDVGLHKLAFGENDLPEGSEVSYYVRGE 413
Query: 648 V------KFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 701
V + L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHEL
Sbjct: 414 VGTMRSKRLLSGHKKGCRILCDCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHEL 473
Query: 702 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
S+ L R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+N
Sbjct: 474 SLSLLRGREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLN 533
Query: 762 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 821
T QK KFVE NANA AAGR+ GVDP Q+ RCIRIV++ E GGC LCR +F S F
Sbjct: 534 TLQKGKFVERNANAVAAGRMGGVDPIEQIRKRCIRIVKSQTDEAGGCALCRRHEFSTSGF 593
Query: 822 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 881
G TV++CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+L
Sbjct: 594 GPHTVMICDQCEKEFHVGCLKAHNIDDLKAVPKGKWFCCRDCKDINSSLRKIVVRREEEL 653
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
P+ L +IKK++ GS + + D++WR+L G+ A++ +LLS+A+S+FH++F+PI
Sbjct: 654 PDDVLRIIKKRYGRKGSVCSGNPDIKWRLLHGRXASATEAG-SLLSQALSLFHEQFNPIA 712
Query: 942 ESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1001
++ + DL+ MV+ S ++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +
Sbjct: 713 DAEGR-DLLLDMVHSNSTGELEFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRS 771
Query: 1002 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1061
DCQGQ V +LVLP+A A+++W NKF F M +EE N D+
Sbjct: 772 DCQGQ------------------VNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDF 813
Query: 1062 PLMIFQGTSMLQKPVPKCRIVGK 1084
+M FQGTSMLQKPVP+ R + +
Sbjct: 814 QMMTFQGTSMLQKPVPEYRRISQ 836
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/78 (47%), Positives = 47/78 (60%), Gaps = 10/78 (12%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 1 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 54
Query: 59 PPEVKKLKTYESRKKRKR 76
P + L+ S KKR++
Sbjct: 55 PSRNRVLEN--SIKKRQK 70
>gi|297827161|ref|XP_002881463.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
gi|297327302|gb|EFH57722.1| hypothetical protein ARALYDRAFT_482652 [Arabidopsis lyrata subsp.
lyrata]
Length = 1007
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/735 (46%), Positives = 463/735 (62%), Gaps = 52/735 (7%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
++R+ ++ E+GIL+G+SV+Y+ K + GLRG++K GI C C C V
Sbjct: 256 LIRRRPETVRELFETGILDGLSVVYMGTVKSQA---FGLRGIIKDGGILCSCSSCDWAHV 312
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
++ + FE+HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKC 372
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEI----KEPLSHSSEME-----PQ 569
C C+ +F + + LC SC E++ SQA A + P +S ++ +
Sbjct: 373 FTCKRCKGAFPFSSLGHRGFLCMSCSEVETSQASPAAMWTSTSSPACIASPVKSRLKITR 432
Query: 570 PPSVELEESP---APSGELTDTSNR-------------SPEPN-SAQTSSHSKMK----- 607
PS + SP +P G T R S N S+Q SK K
Sbjct: 433 KPSESMSISPVFMSPLGNSTRNITRKALRQALVGKAYLSASTNISSQNKCRSKFKKMLTQ 492
Query: 608 -SSSVKSHG-------------KITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVG 653
S + K+ ++TRKD +HKLVFE GGL +G E+GY+ RG+ K L G
Sbjct: 493 YSVTPKAVKSVSLSVSSKKRSYRLTRKDQGLHKLVFERGGLPEGTELGYYARGQ-KLLGG 551
Query: 654 YKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSS 713
YK G GI C CC SEVSPS FEAHAGWASRRKP+ +IYTSNGVSLHE + S R +S+
Sbjct: 552 YKMGAGIYCYCCKSEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSQGRKYSA 611
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
+N+DLC IC DGG+LL CDSCPRAFHI+CVSLP IP G WHC+YC N F E EYN
Sbjct: 612 NDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNV 671
Query: 774 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 833
N+ A G++EGVDP Q RCIR+V+ + E GCVLC G DFC+S FG RT+I+CDQCE
Sbjct: 672 NSSAVGQLEGVDPVDQSAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCE 731
Query: 834 REYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKH 893
+EYH+GCL + DL+ELPKG W C DC RIN LQKL+ G E L ++SL +I++K
Sbjct: 732 KEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAETLSDSSLGIIQRKQ 791
Query: 894 EESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAM 953
E + + D D+RWR++ GK S +R LLS+A++IFHD FDPI++ S +LIP M
Sbjct: 792 ERTDVYSISDLDIRWRLISGKV--TSPESRMLLSQALAIFHDCFDPIVDPLSGRNLIPRM 849
Query: 954 VYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLF 1013
VYG++ +GQDY G+ CA+LTVN VVSAG+ R+FG+E+AELPLVAT + +GYFQ LF
Sbjct: 850 VYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLF 909
Query: 1014 CCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSML 1072
CIEKLL LNV+++V+P+A EA+ +W NKFGF + E+ +KY + Y ++ F+G SML
Sbjct: 910 SCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASML 969
Query: 1073 QKPVPKCRIVGKSVD 1087
QKPV +I+ K ++
Sbjct: 970 QKPVHAHQIIDKKIE 984
>gi|356546024|ref|XP_003541432.1| PREDICTED: uncharacterized protein LOC100816654 [Glycine max]
Length = 753
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/717 (48%), Positives = 464/717 (64%), Gaps = 52/717 (7%)
Query: 374 SDVTEVANDGVGGPVKQETVMKPRK---VMRKFYSKLKNFLESGILEGMSVMYIRGSKVK 430
S++TEV +K T +K K V+ K +K ++G+L+G+ V+Y+ K
Sbjct: 77 SELTEVE-------LKTATSLKTTKKIIVVHKKPVTVKELFQTGLLDGVPVVYV---GCK 126
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDI 490
+ LRG +K GI C C C G +V+ P+ FE+HA + KR +YI LENGK+L D+
Sbjct: 127 KDSTTELRGEIKDGGILCSCRLCNGRRVIPPSQFEIHACNIYKRAAQYICLENGKSLLDL 186
Query: 491 MNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCR----VSFSNAGVEELMLLCKSCVEL 546
M C+ +PL TLE ++ + S +K C +CR +S V ++L S L
Sbjct: 187 MRACRAAPLHTLEATIQNFINSPPEEKYFTCKSCRGPLGQYYSPIHVHVVLLNLNSVSLL 246
Query: 547 KESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKM 606
K +G E S SS++ SV+L+ P S K
Sbjct: 247 KLRNSGRQE----QSWSSKL-----SVKLKTVPITS----------------------KC 275
Query: 607 KSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCN 666
S KS +I+++ R+HKL+FEE GL +GAEV Y+ RG+ K L G K GI+C CCN
Sbjct: 276 LSPQNKSQWRISKRYQRLHKLIFEEDGLPNGAEVAYYARGQ-KLLEGIKTRCGIVCRCCN 334
Query: 667 SEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDG 726
+EVSPSQFE HAGWASRRKP+ +IYTSNGVSLHEL+I LS + ++K+ND +C +C DG
Sbjct: 335 TEVSPSQFEVHAGWASRRKPYAYIYTSNGVSLHELAIFLSKDHKCTTKQNDYVCVVCWDG 394
Query: 727 GDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDP 786
G+LL CD CPRAFH +C S+ IP G W+C+ C +TF +E+ V YNA+A AAGR+EGVDP
Sbjct: 395 GNLLLCDGCPRAFHKECASVSSIPRGEWYCQICQHTFLRERPVLYNADAVAAGRVEGVDP 454
Query: 787 FAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGM 846
Q+ RCIRIV+ E+GGCVLCR DF +S FG RT+I+CDQCE+EYHVGCL+DH
Sbjct: 455 IEQIAKRCIRIVKDIGAEMGGCVLCRSSDFSRSGFGPRTIIICDQCEKEYHVGCLRDHKK 514
Query: 847 EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDV 906
L+ELP+G W CC DC I+ L+ L+ R E+LPE LDVIKKK E + + DV
Sbjct: 515 AYLKELPEGDWFCCNDCTIIHSTLENLLIRVAERLPEALLDVIKKKQVERCLEPLNEIDV 574
Query: 907 RWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHG 966
RW++L GK AS TR LL +AVS+FH+ FDPI++ A+ DLIPAMVYGR+ + QD+ G
Sbjct: 575 RWKLLNGK--IASPETRPLLLEAVSMFHECFDPIVDPAAGRDLIPAMVYGRNLQTQDFGG 632
Query: 967 MYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVK 1026
MYCA+L VN VVSAG+ RIFG+++AELPLVAT +G+GYFQ+LF CIE+LL FLNVK
Sbjct: 633 MYCALLIVNSSVVSAGMVRIFGRDIAELPLVATRYKNRGKGYFQTLFACIERLLAFLNVK 692
Query: 1027 TLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR-NDYPLMIFQGTSMLQKPVPKCRIV 1082
LVLP+A EA++IWT KFGFS M ++ YR N + +M F+GT+ML K VP+CR+
Sbjct: 693 NLVLPAAEEAESIWTEKFGFSKMKLDQLTNYRMNCHQIMAFKGTNMLHKTVPRCRVT 749
>gi|30686882|ref|NP_850270.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
gi|31711790|gb|AAP68251.1| At2g36720 [Arabidopsis thaliana]
gi|330254196|gb|AEC09290.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 1007
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/735 (45%), Positives = 457/735 (62%), Gaps = 52/735 (7%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQV 458
++R +++ E+G+L+G+SV+Y+ K + LRG+++ GI C C C V
Sbjct: 256 LIRSRPETVRDLFETGLLDGLSVVYMGTVKSQA---FPLRGIIRDGGILCSCSSCDWANV 312
Query: 459 VTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
++ + FE+HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K
Sbjct: 313 ISTSKFEIHACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKR 372
Query: 519 NFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEME---------PQ 569
C C+ F + + LCKSC E++ SQA A + S + + +
Sbjct: 373 FTCKRCKGPFPFSSLGHRGFLCKSCSEVETSQASLAATRTSTSAPACITSPVKSRLKITR 432
Query: 570 PPSVELEESPAPSGELTDTSNR----------------SPEPN-SAQTSSHSKMK----S 608
PS SP L +++ + S N S+Q SK K
Sbjct: 433 KPSESTSISPVFMSSLGNSTRKITRKALRQALVGKAYLSASTNVSSQKKCRSKFKKMLTQ 492
Query: 609 SSVKSHG---------------KITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVG 653
SV ++ RKD +HKLVF+ GGL +G E+GY+ RG+ K L G
Sbjct: 493 HSVTPKALKSVSLSVSSKKRSYRLARKDQGLHKLVFDRGGLPEGTELGYYARGQ-KLLGG 551
Query: 654 YKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSS 713
YK G GI C CC EVSPS FEAHAGWASRRKP+ +IYTSNGVSLHE + S R +S+
Sbjct: 552 YKMGAGIYCYCCKCEVSPSLFEAHAGWASRRKPYFYIYTSNGVSLHEWATTFSHGRKYSA 611
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
+N+DLC IC DGG+LL CDSCPRAFHI+CVSLP IP G WHC+YC N F E EYN
Sbjct: 612 NDNNDLCVICADGGNLLLCDSCPRAFHIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNV 671
Query: 774 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 833
N+ A G++EGVDP Q+ RCIR+V+ + E GCVLC G DFC+S FG RT+I+CDQCE
Sbjct: 672 NSSAVGQLEGVDPVDQLAGRCIRVVKNMEAETNGCVLCSGSDFCRSGFGPRTIIICDQCE 731
Query: 834 REYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKH 893
+EYH+GCL + DL+ELPKG W C DC RIN LQKL+ G EKL ++SL +I+ K
Sbjct: 732 KEYHIGCLSSQNIVDLKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQ 791
Query: 894 EESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAM 953
E + + D D+RWR++ GK S +R LLS+A++IFHD FDPI++ S +LIP M
Sbjct: 792 ERNDVYSISDLDIRWRLISGKV--TSPESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRM 849
Query: 954 VYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLF 1013
VYG++ +GQDY G+ CA+LTVN VVSAG+ R+FG+E+AELPLVAT + +GYFQ LF
Sbjct: 850 VYGKTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLF 909
Query: 1014 CCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSML 1072
CIEKLL LNV+++V+P+A EA+ +W NKFGF + E+ +KY + Y ++ F+G SML
Sbjct: 910 SCIEKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASML 969
Query: 1073 QKPVPKCRIVGKSVD 1087
QKPV +I+ K+++
Sbjct: 970 QKPVDSHQIIDKTIE 984
>gi|297734888|emb|CBI17122.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 640 bits (1651), Expect = e-180, Method: Compositional matrix adjust.
Identities = 291/465 (62%), Positives = 369/465 (79%), Gaps = 3/465 (0%)
Query: 614 HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQ 673
+G++T+KDL +HKLVF E GL +G EVGY+VRG+ + LVGYK+G GI CTCCNSEVSPSQ
Sbjct: 363 YGRVTKKDLSLHKLVFGENGLPEGTEVGYYVRGQ-QLLVGYKRGSGIFCTCCNSEVSPSQ 421
Query: 674 FEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCD 733
FEAHAGWASRRKP+ HIYTSNGVSLHE SI LS R S +NDDLC IC+DGG+LLCCD
Sbjct: 422 FEAHAGWASRRKPYLHIYTSNGVSLHEFSISLSRGREISVSDNDDLCSICLDGGNLLCCD 481
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CPR FH +CVSL IP G W C++C N QKEKFVE+NANA AAGR+ GVDP Q+ R
Sbjct: 482 GCPRVFHKECVSLANIPKGKWFCKFCNNMLQKEKFVEHNANAVAAGRVAGVDPIEQITKR 541
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
CIRIV T E+GGC LCR +F +S FG RTV+LCDQCE+E+HVGCL++H M+DL+E+P
Sbjct: 542 CIRIVNTQVDEMGGCALCRRHEFSRSGFGPRTVMLCDQCEKEFHVGCLREHDMDDLKEVP 601
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
KGKW CC DCKRIN +LQKLV GEE+LP L IK+K+ +GS + D D++WR++ G
Sbjct: 602 KGKWFCCHDCKRINSSLQKLVVHGEEELPHNVLTTIKEKYGRNGSACSKDPDIKWRLICG 661
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILT 973
++ + + +LLS+A+SIFH++FDPI ++A + DL+P MV+G+S R D+ GMYCAILT
Sbjct: 662 RRASSIEAG-SLLSQALSIFHEQFDPIADAAGR-DLLPDMVHGKSTREWDFGGMYCAILT 719
Query: 974 VNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSA 1033
++ VVSA FRIFG+E+AELPLVAT +DCQGQGYFQ+LF C+E LLG L V++LVLP+A
Sbjct: 720 ISSQVVSAAAFRIFGKEVAELPLVATRSDCQGQGYFQTLFSCLEGLLGVLEVRSLVLPAA 779
Query: 1034 SEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
A++IWTNKFGF+ +T+E++N +R DY ++ FQGT MLQK VP+
Sbjct: 780 EGAESIWTNKFGFNKVTQEQRNNFRRDYQMVTFQGTLMLQKLVPR 824
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/166 (53%), Positives = 121/166 (72%), Gaps = 1/166 (0%)
Query: 375 DVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
+++ V N K+ + + PR V R+F +KLK L++GILE + V YIRGS+ +G G
Sbjct: 181 EISAVNNGEENTGTKRSSGLVPR-VPRRFPAKLKELLDTGILEDLPVQYIRGSRTRGSGE 239
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
SGLRGV+KGSGI C C+ CKG +VVTP +FELHAGSSNKRPPEYIYLENG +LR +MN
Sbjct: 240 SGLRGVIKGSGILCSCNSCKGTKVVTPNLFELHAGSSNKRPPEYIYLENGTSLRGVMNAW 299
Query: 495 KDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLC 540
K++ L++L++A+R+ +G S +KK+ FCLNC+ S AG+ +LC
Sbjct: 300 KNAALDSLDEAIRVAIGCSMIKKSTFCLNCKGRISEAGIGNSKVLC 345
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/45 (73%), Positives = 36/45 (80%), Gaps = 1/45 (2%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRAR 45
MA GTDSEE FVVLS++R G KREF FA+K QS I GSLGRTR R
Sbjct: 1 MAKGTDSEE-FVVLSRVRPGCKREFAFAVKAQSAIAGSLGRTRTR 44
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 13/83 (15%)
Query: 410 FLESGILEGMSV-MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHA 468
F E+G+ EG V Y+RG ++ L G +GSGI C C C N V+P+ FE HA
Sbjct: 378 FGENGLPEGTEVGYYVRGQQL-------LVGYKRGSGIFCTC--C--NSEVSPSQFEAHA 426
Query: 469 GSSNKRPPE-YIYLENGKTLRDI 490
G +++R P +IY NG +L +
Sbjct: 427 GWASRRKPYLHIYTSNGVSLHEF 449
Score = 43.9 bits (102), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 651 LVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSL 698
L G KG GILC+C + + V+P+ FE HAG +S ++P ++IY NG SL
Sbjct: 242 LRGVIKGSGILCSCNSCKGTKVVTPNLFELHAG-SSNKRPPEYIYLENGTSL 292
>gi|224068881|ref|XP_002326222.1| predicted protein [Populus trichocarpa]
gi|222833415|gb|EEE71892.1| predicted protein [Populus trichocarpa]
Length = 697
Score = 626 bits (1615), Expect = e-176, Method: Compositional matrix adjust.
Identities = 334/702 (47%), Positives = 441/702 (62%), Gaps = 46/702 (6%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K E+G+LEG+ V+Y+ G K + GLRG +K +GI C C C G++V+ P+ FE+
Sbjct: 14 VKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDAGILCSCAFCNGHRVIPPSQFEI 70
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N C+ +PL++LE ++ + +++ C C+
Sbjct: 71 HAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVERTFTCKRCK- 129
Query: 527 SFSNAGVEELMLL--CKSCVELKESQAGSAEIKE--PLSHSSEMEP-QPPSVELEE---- 577
E+++ L K SQ + K+ P S +P + SV L
Sbjct: 130 -------EQVLALEYFKPASLSTSSQDNTPRKKKRKPEEQDSITKPSKSASVYLSSRKRK 182
Query: 578 ----SP------APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS----HGKITRKDLR 623
SP P L SP P ++ + S +D R
Sbjct: 183 YKKISPRLVCFFYPIDILFGLVMLSPFPFLWLVKIFVFIRKYAYLSPFCPFSGYQSQDQR 242
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+H+LVFEEGGL DG E+ Y+ RG+V + Y F L N S SQ + G+A
Sbjct: 243 LHRLVFEEGGLPDGTELAYYARGQV-INITYSYPFTFLLLIVNKINSSSQ-KLLGGYA-- 298
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
+IYTSNGVSLHEL+I LS R +SS++NDDLC IC DGG+LL CD CPRAFH C
Sbjct: 299 -----YIYTSNGVSLHELAISLSKSRKYSSRDNDDLCIICADGGNLLLCDGCPRAFHKGC 353
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
S+P +PSG W+C+YC NTF++EK VE+NANA AAGR G+D Q+ RC RIV+ +
Sbjct: 354 ASIPTVPSGDWYCQYCQNTFEREKLVEHNANASAAGRDSGIDSIEQITKRCFRIVKNIEA 413
Query: 804 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
EL GC LCRG DF +S FG RT+ILCDQCE+E+HVGCL+ H M +L+ELPKG W CC DC
Sbjct: 414 ELTGCALCRGYDFMRSGFGPRTIILCDQCEKEFHVGCLRSHKMTNLKELPKGNWFCCMDC 473
Query: 864 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 923
RI+ LQKL+ RG EKLP++ L+ IKKKHEE G + + + DVRW +L GK AS +
Sbjct: 474 SRIHSTLQKLLIRGAEKLPDSLLNDIKKKHEERGLNISNNIDVRWTLLSGK--IASPENK 531
Query: 924 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGI 983
LLS+A+SIF + FDPI++S DLIP MVYG++ +GQDY GMYCA+LT+N +VSAGI
Sbjct: 532 LLLSRALSIFQECFDPIVDSTIGRDLIPLMVYGKNSKGQDYGGMYCAVLTINSSIVSAGI 591
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
R+FG+E+AELPLVAT N G+GYFQ LF CIEKLL FLNV+ LVLP+A EA++IWT K
Sbjct: 592 LRVFGEEVAELPLVATRNGEHGKGYFQLLFSCIEKLLAFLNVQNLVLPAAEEAESIWTEK 651
Query: 1044 FGFSMMTEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVGK 1084
FGF + E+ NKYR M+ F+GTSMLQK VP CRIV +
Sbjct: 652 FGFQKIKPEQLNKYRKSCCQMVRFEGTSMLQKAVPTCRIVNQ 693
>gi|55296653|dbj|BAD69373.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 1025
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 336/755 (44%), Positives = 442/755 (58%), Gaps = 46/755 (6%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ D N+ + + +P RRFTRSLL+ KVE + + D T ++ V
Sbjct: 310 ISEDNNEASVDLALAPGKPQRRFTRSLLKVKVEARSTNNLLQSKEAIDSTSDSSRSVK-- 367
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + K + K S ++ L +G+LEGM V YI S K + L+GV+ G I
Sbjct: 368 KMEMKMSKKVACLTKHPSNIRELLNTGLLEGMPVRYIIPSSKK----AVLKGVITGCNIR 423
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G++ V FE HAGS+ K P ++IYL NG +LRD++ C+ SPLE+LEK +R
Sbjct: 424 CFCLSCNGSKDVCSYFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIR 483
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ + + CLNC S++ E LC+ C+E K+ Q
Sbjct: 484 SSIDPIAKRSYVNCLNCNEHLSSSQTEIFGSFLCQRCLEPKQHQ---------------- 527
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHK 626
+ P+PS SN S P+S ++ S GK+T KD +HK
Sbjct: 528 ----------DPPSPS--YACKSNSSLIPSSKDFLLKKTPLNTKGGSAGKVTTKDTGLHK 575
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
LVF+ L DG EV Y+V G+ K + GY K I C CN VSPS FEAHAG +RRKP
Sbjct: 576 LVFKV--LLDGTEVAYYVDGQRK-VDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKP 632
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
+ +I+TSNGVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L
Sbjct: 633 YDNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGL 692
Query: 747 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
G+PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LG
Sbjct: 693 SGVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLG 750
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
GC LC+ +DF S F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I
Sbjct: 751 GCALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEI 810
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
+L K++ G L E+ +D+I+KKHE G + D+RWR+L G+K AS+ LL
Sbjct: 811 RSSLDKIISDGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLL 868
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAG 982
S AV I H FDPIIE S DLIP MV GR + GQDY GMYCA+LT+ VVSA
Sbjct: 869 SAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAA 928
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
+ R+ G E+AELPLVATS D QG GYFQ+LF CIE++L L +K +LP+A EA+ IW N
Sbjct: 929 LLRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMN 988
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
KFGF+ + +E+ + Y N L IF GTS L K +P
Sbjct: 989 KFGFTKIPQEQSDAYLNGAHLTIFHGTSNLYKAIP 1023
>gi|222634801|gb|EEE64933.1| hypothetical protein OsJ_19794 [Oryza sativa Japonica Group]
Length = 1016
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 336/755 (44%), Positives = 441/755 (58%), Gaps = 44/755 (5%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ D N+ + + +P RRFTRSLL+ KVE + + D T ++ V
Sbjct: 299 ISEDNNEASVDLALAPGKPQRRFTRSLLKVKVEARSTNNLLQSKEAIDSTSDSSRSVK-- 356
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + K + K S ++ L +G+LEGM V YI S V L+GV+ G I
Sbjct: 357 KMEMKMSKKVACLTKHPSNIRELLNTGLLEGMPVRYIIPSSKLQKAV--LKGVITGCNIR 414
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G++ V FE HAGS+ K P ++IYL NG +LRD++ C+ SPLE+LEK +R
Sbjct: 415 CFCLSCNGSKDVCSYFFEQHAGSNKKHPADHIYLGNGNSLRDVLRACESSPLESLEKTIR 474
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ + + CLNC S++ E LC+ C+E K+ Q
Sbjct: 475 SSIDPIAKRSYVNCLNCNEHLSSSQTEIFGSFLCQRCLEPKQHQ---------------- 518
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHK 626
+ P+PS SN S P+S ++ S GK+T KD +HK
Sbjct: 519 ----------DPPSPS--YACKSNSSLIPSSKDFLLKKTPLNTKGGSAGKVTTKDTGLHK 566
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
LVF+ L DG EV Y+V G+ K + GY K I C CN VSPS FEAHAG +RRKP
Sbjct: 567 LVFKV--LLDGTEVAYYVDGQRK-VDGYIKDQRIYCNHCNRVVSPSAFEAHAGEGTRRKP 623
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
+ +I+TSNGVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L
Sbjct: 624 YDNIFTSNGVSLHELSMKISKDMELSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGL 683
Query: 747 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
G+PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LG
Sbjct: 684 SGVPS-EWYCDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLG 741
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
GC LC+ +DF S F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I
Sbjct: 742 GCALCKQKDFNNSVFDERTVILCDQCEKEYHVGCLRSQWQVDLKELPEGEWFCCNSCSEI 801
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
+L K++ G L E+ +D+I+KKHE G + D+RWR+L G+K AS+ LL
Sbjct: 802 RSSLDKIISDGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLL 859
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAG 982
S AV I H FDPIIE S DLIP MV GR + GQDY GMYCA+LT+ VVSA
Sbjct: 860 SAAVPIIHQSFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAA 919
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
+ R+ G E+AELPLVATS D QG GYFQ+LF CIE++L L +K +LP+A EA+ IW N
Sbjct: 920 LLRVMGGEVAELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMN 979
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
KFGF+ + +E+ + Y N L IF GTS L K +P
Sbjct: 980 KFGFTKIPQEQSDAYLNGAHLTIFHGTSNLYKAIP 1014
>gi|222636273|gb|EEE66405.1| hypothetical protein OsJ_22748 [Oryza sativa Japonica Group]
Length = 800
Score = 583 bits (1504), Expect = e-163, Method: Compositional matrix adjust.
Identities = 321/722 (44%), Positives = 414/722 (57%), Gaps = 99/722 (13%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGI 446
K E M + + LK+ L +G+LEG V YI +G + LRGV+K GI
Sbjct: 143 KMELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKRAV------LRGVIKRVGI 196
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+ C D+ L+ L+ A+
Sbjct: 197 LCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAI 256
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ +G + K+ C C+ SF+ + LLC SC+E K SQ + K
Sbjct: 257 QNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQNSTRTSK--------- 307
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEP---NSAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ +P S + N SP NS+ S + S G+ITRKD
Sbjct: 308 --------IGRNPTSSAR--RSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKG 357
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+HKL F G L +G +VGY+V G+ + L GY K FGI C CCN+ VSPSQFEAHAG A+R
Sbjct: 358 LHKLAFMSGVLPEGTDVGYYVGGK-RLLDGYIKEFGIYCHCCNTVVSPSQFEAHAGRAAR 416
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
RKP+ +IY SNGVSLHELS+ LS R S++++DDLC IC DGG+LL CDSCPRAFH +C
Sbjct: 417 RKPYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHREC 476
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
V IP GTW CRYC N Q+E + YN NA AAGRI+G+DP Q+ +R IRI TP T
Sbjct: 477 VGFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVT 536
Query: 804 ELGGCVLC-------------RGR--------------DFCKSRFGRRTVILCDQCEREY 836
GGC LC R R DF K +F RTV+LCD
Sbjct: 537 GFGGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCD------ 590
Query: 837 HVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 896
Q LP+G W C ADC RI+ L+ L+ RG E + +++IK+K+E+
Sbjct: 591 -------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQK 637
Query: 897 GSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG 956
+ D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ DLIPAMVYG
Sbjct: 638 ALNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG 694
Query: 957 RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCI 1016
VVSAG+FR+ G E+AELPLVATS D QG GYFQ+LF CI
Sbjct: 695 -------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCI 735
Query: 1017 EKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
E+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQGTS L K V
Sbjct: 736 ERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLV 795
Query: 1077 PK 1078
PK
Sbjct: 796 PK 797
>gi|293331977|ref|NP_001168115.1| uncharacterized protein LOC100381857 [Zea mays]
gi|223946087|gb|ACN27127.1| unknown [Zea mays]
gi|413942541|gb|AFW75190.1| hypothetical protein ZEAMMB73_711939 [Zea mays]
Length = 849
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 335/756 (44%), Positives = 441/756 (58%), Gaps = 60/756 (7%)
Query: 331 DGN-DEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVK 389
DG+ D +++E+P RRFTRSLL+ KVE SL DV + A+D VK
Sbjct: 143 DGSKDASVGFTLLLEKPQRRFTRSLLKTKVE---SSLVGS----DDVLDSASDSPPS-VK 194
Query: 390 QETVMKPRKV--MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
+ + +KV + K ++ L +G+LEGM VMYI K + L+GV+ G I
Sbjct: 195 KMEMKMSKKVACLTKHPGNIRELLNTGMLEGMPVMYIIPHSKK----AVLKGVITGCNIR 250
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
CFC C G + ++ FE HAGS+ K P +YIYL NG +LRD++ SPLE LE+ +R
Sbjct: 251 CFCLSCNGAKAISAYYFEQHAGSTKKHPADYIYLGNGNSLRDVLRASDRSPLEALEETIR 310
Query: 508 MVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEME 567
+ + CLNC + E +LC+ C+E K+ Q +PL+ S
Sbjct: 311 SSIDPVVKRSRINCLNCNELVLPSSHEN--VLCQVCLESKQPQ-------DPLTASY--- 358
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVK--SHGKITRKDLRMH 625
T N S + + + + SS K S GK+T KD R+H
Sbjct: 359 --------------------TCNGSSSLSRSSKEALLRNISSGKKGGSAGKVTNKDNRLH 398
Query: 626 KLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
KLVF L DG EV Y+V G+ K + GY K I C CN VSPS FEAHAG SRRK
Sbjct: 399 KLVFNV--LLDGTEVAYYVDGQRK-VDGYIKDHRIYCNHCNRVVSPSAFEAHAGEGSRRK 455
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P+ +I+TSNGVSLHEL++K+S + S +E DDLC C GGD+ C CPR+FH CV
Sbjct: 456 PYDNIFTSNGVSLHELAMKISKDMELSERETDDLCRECGQGGDIFPCKICPRSFHPACVG 515
Query: 746 LPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTEL 805
L +P+ W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +L
Sbjct: 516 LSKVPA-EWYCDSCRNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DL 573
Query: 806 GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 865
GGC LC+ +DF + F RTVILCDQCE+EYHVGCL+ +L+ELP+ +W CC+ C
Sbjct: 574 GGCALCKQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEEEWFCCSSCSE 633
Query: 866 INLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRAL 925
+L K++ G + L + L++IKKKHE G D++W++L GK+ A++ L
Sbjct: 634 TRSSLDKIISDGAQLLADPDLEIIKKKHETRGLCMDTSKDLKWQLLSGKR--ATEDGSIL 691
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSA 981
LS AV IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA
Sbjct: 692 LSAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSA 751
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
+ R+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW
Sbjct: 752 ALLRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWM 811
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
NKFGFS ++ EE Y N L IF GTS + K VP
Sbjct: 812 NKFGFSRISPEELEAYLNGAHLTIFHGTSYMYKAVP 847
>gi|326516960|dbj|BAJ96472.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1163
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 330/739 (44%), Positives = 431/739 (58%), Gaps = 51/739 (6%)
Query: 346 PLRRFTRSLLQQKVELAKGS-LSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKV--MRK 402
P RRFTRSLL+ KVE S L + D T V K E M +KV + K
Sbjct: 465 PGRRFTRSLLKPKVEAPPASSLVVVPEEPVDSTPETPPSV---TKMEMKMS-KKVAFLSK 520
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
++ L +G+LEGM VMYI + K L+GV+ G I CFC C G++ +T
Sbjct: 521 HPGNTRDLLSTGLLEGMPVMYIIPNSKK----PVLKGVIAGCNIRCFCVKCDGSKTITTY 576
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
FELHAGSS K P EYIYL NG LRD++ C+ SPL++L+K ++ + ++ CL
Sbjct: 577 FFELHAGSSKKHPAEYIYLANGNRLRDVLRACESSPLDSLDKTIQSCIDPMLIRTRMNCL 636
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPS 582
NC + E+ LC C ES +PQ P+
Sbjct: 637 NCNGELPSQTEEQF--LCHDCC--PESN----------------QPQ----------DPT 666
Query: 583 GELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGY 642
L + + S S++ S +M +S S GK+T KD +HKLVF+ L DG EV Y
Sbjct: 667 SPLACSKSSSSLTPSSKESLLKRMSASKGASTGKVTTKDTGLHKLVFKV--LLDGTEVNY 724
Query: 643 FVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELS 702
+V G+ K + GY K I C CN VSPS FEAHAG SRRKP+ +I+TSNGVSLHELS
Sbjct: 725 YVDGQKK-IDGYIKDQRIYCNHCNKVVSPSAFEAHAGEGSRRKPYDNIFTSNGVSLHELS 783
Query: 703 IKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 762
+ +S + S +E DDLC C GGD+ C CPR+FH CV LP +PS W C C
Sbjct: 784 MSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWFCDNCTIL 843
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+ +DF + F
Sbjct: 844 VQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKDFNNAVFD 902
Query: 823 RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLP 882
RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++ G L
Sbjct: 903 ERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMISEGAHPLS 962
Query: 883 ETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIE 942
E+ +D+I+KKHE G + ++RW+++ G+ A++ +LLS AV + H FDPIIE
Sbjct: 963 ESDVDIIRKKHESKGLVMDANTEIRWQLVAGRS--ATEDGNSLLSSAVPVIHQSFDPIIE 1020
Query: 943 SASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVA 998
+ + DLIP MV+GR + GQDY GMYCA+LTV VVSA + R+ G ++AELPLVA
Sbjct: 1021 AHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDVAELPLVA 1080
Query: 999 TSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYR 1058
TS D QG GYFQ LF CIE+LL L VK +LP+A EA+AIW KFGFS + +++ Y
Sbjct: 1081 TSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQDQMEAYL 1140
Query: 1059 NDYPLMIFQGTSMLQKPVP 1077
N L +F GT L K +P
Sbjct: 1141 NGGHLTVFHGTLNLYKAIP 1159
>gi|297734889|emb|CBI17123.3| unnamed protein product [Vitis vinifera]
Length = 772
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 301/587 (51%), Positives = 388/587 (66%), Gaps = 50/587 (8%)
Query: 348 RRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
RR +R L+ K K S V N+G VK + ++ F +KL
Sbjct: 232 RRVSRVTLKPKANAMKIS-------------VVNNGEKNVVKMGSSALVPSTLKGFPTKL 278
Query: 408 KNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELH 467
K L++GILE + V YIRG + K G SGL GV+KGSGI C+CD CKG VVTP VFELH
Sbjct: 279 KELLDTGILEDLPVQYIRGLRRKENGESGLHGVIKGSGILCYCDTCKGTNVVTPNVFELH 338
Query: 468 AGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVS 527
AGSSNKRPPEYIYLENG TLR +M C + L+ L++ +R+ +GSS +KK+ FC NC+ S
Sbjct: 339 AGSSNKRPPEYIYLENGNTLRSVMTACSKATLKALDEDIRVAIGSS-IKKSTFCFNCKGS 397
Query: 528 FSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTD 587
S G + ++LC+SCV LKES A A QP T
Sbjct: 398 ISEVGTSDSLVLCESCVGLKESHASPA--------------QP---------------TG 428
Query: 588 TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGE 647
TS+RS +T++ SK SS K++G++T+KD+ +HKL F E L +G+EV Y+VRGE
Sbjct: 429 TSDRS-----TKTTTVSKCSSSGSKNYGRVTKKDVGLHKLAFGENDLPEGSEVSYYVRGE 483
Query: 648 VKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL 707
+ L G+KKG ILC CCNSEVSPSQFEAH+GWASRRKP+ HIYTSNGVSLHELS+ L
Sbjct: 484 -RLLSGHKKGCRILCGCCNSEVSPSQFEAHSGWASRRKPYLHIYTSNGVSLHELSLSLLR 542
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
R S ND++C IC+DGG LLCCD CPR FH +CVSL IP G W C++C+NT QK K
Sbjct: 543 GREPSINTNDEICSICLDGGTLLCCDGCPRVFHKECVSLENIPKGKWFCKFCLNTLQKGK 602
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
FVE NANA AAGR+ GVDP Q+ RCIRIV+ E GGC LCR +F S FG TV+
Sbjct: 603 FVERNANAVAAGRMGGVDPIEQIRKRCIRIVKNQTDEAGGCALCRRHEFSTSGFGPHTVM 662
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLD 887
+CDQCE+E+HVGCLK H ++DL+ +PKGKW CC DCK IN +L+K+V R EE+LP+ L
Sbjct: 663 ICDQCEKEFHVGCLKAHNIDDLKVVPKGKWFCCRDCKDINSSLRKIVVRQEEELPDDVLR 722
Query: 888 VIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 934
+IKK++ GS + + D++WR+L G++ A++ +LLS+A+S+FH
Sbjct: 723 IIKKRYGRKGSVCSGNPDIKWRLLHGRRASATEAG-SLLSQALSLFH 768
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 12/101 (11%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICG--SLGRTRARKVQSNVDSGCVLG 58
M G+DSEE V+S+IR G KREF F +K QS I G SLGRTR + Q+ D G G
Sbjct: 63 MEKGSDSEE--FVMSRIRPGFKREFAFVVKAQSMIGGSLSLGRTRT-QTQTKKDRG---G 116
Query: 59 PPEVKKLKTYESRKKRKRQEQSVVVKETEDKR--EEEVKSD 97
P + L+ S KKR++ K ++R E+ V+S+
Sbjct: 117 PSRNRVLE--NSIKKRQKSSSLDSQKNNVEERFPEDRVRSN 155
>gi|357117034|ref|XP_003560281.1| PREDICTED: uncharacterized protein LOC100835479 [Brachypodium
distachyon]
Length = 807
Score = 564 bits (1453), Expect = e-157, Method: Compositional matrix adjust.
Identities = 276/510 (54%), Positives = 351/510 (68%), Gaps = 6/510 (1%)
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P+ ++L + P EL+ RSP ++ + S S+ S G++TRKD +HKL
Sbjct: 234 PKRNKIKLTKIPKNLKELSPKIARSPTSSARVPKNFSPGAKST--SAGRLTRKDHGLHKL 291
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
VF G L +G +VGY+V G+ + L GY K GI C CCN+ VSPSQFE HAG A+RRKP+
Sbjct: 292 VFLSGILPEGTDVGYYVGGK-RLLDGYIKEPGIHCHCCNTVVSPSQFEGHAGRAARRKPY 350
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
+IY SNGVSLHELS+ LS R S +++DDLC IC DGG+LL CD+CPRAFH +CV L
Sbjct: 351 HNIYMSNGVSLHELSVSLSRGRKTSDRQSDDLCSICSDGGELLLCDTCPRAFHRECVDLT 410
Query: 748 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 807
+P GTW CRYC Q+E + YN NA AAGRI+G+D Q+ +R IRI TP+T GG
Sbjct: 411 AVPKGTWCCRYCETRQQRESSLAYNHNAIAAGRIDGIDSMEQIFTRSIRIATTPETGFGG 470
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
C LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+G W C +DC RI+
Sbjct: 471 CALCKLHDFGKKKFSARTVLLCDQCGREYHVGCLKEHSMADLTALPEGAWYCSSDCVRIS 530
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
++ L+ G E +P D+IKKK E+ G + D DVRWRVLR K +S+ ++ +LS
Sbjct: 531 ETMKDLLSGGAEPVPAMDADLIKKKREDKGLNEDGDLDVRWRVLRDK---SSEDSKLVLS 587
Query: 928 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 987
KAV+IFH+ FDPII++ + DLIPAMVYGRS R QDY GMYCA+LTV VVSAG+FRI
Sbjct: 588 KAVAIFHESFDPIIQTTTGRDLIPAMVYGRSVRDQDYTGMYCAVLTVGNTVVSAGLFRIM 647
Query: 988 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1047
G+E AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A EA +IWT +FGFS
Sbjct: 648 GREAAELPLVATSRDNQGFGYFQALFGCIERLLASLKVKYFVLPAADEAVSIWTQRFGFS 707
Query: 1048 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
++ +E ++ +FQGTS L K +P
Sbjct: 708 KISRDELLEHLKGARTTVFQGTSTLHKLIP 737
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 50/95 (52%), Gaps = 12/95 (12%)
Query: 398 KVMRKFYSKLKNFLESGIL-EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
++ RK + K SGIL EG V Y G K L G +K GI C C C N
Sbjct: 280 RLTRKDHGLHKLVFLSGILPEGTDVGYYVGGK------RLLDGYIKEPGIHCHC--C--N 329
Query: 457 QVVTPAVFELHAGSSNKRPPEY-IYLENGKTLRDI 490
VV+P+ FE HAG + +R P + IY+ NG +L ++
Sbjct: 330 TVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLHEL 364
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 26/35 (74%)
Query: 9 EKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTR 43
+ FV+ S +RVGLKREF FA+ Q+ + SLGRTR
Sbjct: 2 DDFVLRSGVRVGLKREFAFAIASQAALAPSLGRTR 36
>gi|297741475|emb|CBI32607.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 560 bits (1443), Expect = e-156, Method: Compositional matrix adjust.
Identities = 339/880 (38%), Positives = 501/880 (56%), Gaps = 92/880 (10%)
Query: 227 REKAENDLIGEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVG---LGKEYEPGRV 283
+ ++EN E+K + E + +E E V + E C EV L +Y V
Sbjct: 21 KTESENSSRMELKRDHECIAGNIETEASPRKKPVKEALNEEGCSEVSNPILSPKYNASSV 80
Query: 284 QMEMDEEKKNDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVA-----VDGNDEG-K 337
Q I ++ E +++G+ +ST + ++V +S++ +G+ +G
Sbjct: 81 QT---------ITSQVAELASTNQAVLGEITST--SSGNSVPESLSDEEHSRNGSSDGVS 129
Query: 338 TVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDV-------TEVANDGVGGPVKQ 390
T VV+E P + S +K+ K S SK+ R+++ E N P
Sbjct: 130 TTQVVLEIP--KHVSSTGIRKITF-KFSKSKEAYNRTNMRVNTCWNLETRNLHFRAP-NM 185
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + + +K L +GIL+G V YI S+ K L+GV++ SG C C
Sbjct: 186 ELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE-----LQGVIRESGYLCGC 240
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C +V+T FE HAG + P +IYLENGK + I+ K +PL L++ ++ +
Sbjct: 241 SACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIA 300
Query: 511 GSS------SMKKANFCLNCRVSFSNAGVEELMLLCKSCV------ELKESQAGSAEIKE 558
GSS KA+F N V+ ++ +L + +++S GS +
Sbjct: 301 GSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGS---RL 357
Query: 559 PLSHSSEMEPQPPSVELEESPAPSGELT-DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKI 617
PL M+ E+T + + + +P+S S + K SS G I
Sbjct: 358 PLKQKELMK----------------EMTQERKHAAKKPSSYIYGSGLQHKKSS---EGAI 398
Query: 618 TRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAH 677
++D +H+L+F GL DGAE+ Y+V+G+ + L GYK+G GI+C+ C+SEVSPSQFEAH
Sbjct: 399 KKRDNDLHRLLFMPNGLPDGAELAYYVKGQ-RILGGYKQGNGIVCSHCDSEVSPSQFEAH 457
Query: 678 AGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPR 737
AGWA+RR+P++HIYTSNG++LH+++I L+ + ++ ++DD+C +C DGGDL+ CD CPR
Sbjct: 458 AGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPR 517
Query: 738 AFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRI 797
AFH C+ L +P G W C C+ F ++ V AR RI+ R
Sbjct: 518 AFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----ARPI-RIQ-----------LTRA 560
Query: 798 VQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKW 857
V+ P++E+GGCV+CR DF S+F RTV+LCDQCE+E+HVGCL+D G+ DL+ELPK KW
Sbjct: 561 VKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKW 620
Query: 858 LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVD 917
CC DC R+++ALQ L RG E +P + +I +K+ E G + D++W +L GK
Sbjct: 621 FCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCY 680
Query: 918 ASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 977
LLS+ +IF + FDPI+ S+ + DLIP MVYGR+ GQ++ GMYC +L
Sbjct: 681 KEH--LPLLSRTTAIFRECFDPIVASSGR-DLIPVMVYGRNISGQEFGGMYCVVLLAKST 737
Query: 978 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1037
VVSAG+ R+FGQE+AELP+VATS + QG+G+F++LF CIE+LL L VKTLVLP+A EA+
Sbjct: 738 VVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAE 797
Query: 1038 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
AIWTNK GF M+EE KY + L IF+GTSML+K VP
Sbjct: 798 AIWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVP 837
>gi|225439735|ref|XP_002273013.1| PREDICTED: uncharacterized protein LOC100246491 [Vitis vinifera]
Length = 896
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 299/700 (42%), Positives = 428/700 (61%), Gaps = 61/700 (8%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + + +K L +GIL+G V YI S+ K L+GV++ SG C C
Sbjct: 241 ELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKE-----LQGVIRESGYLCGC 295
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C +V+T FE HAG + P +IYLENGK + I+ K +PL L++ ++ +
Sbjct: 296 SACNFTKVLTAYEFEQHAGGRTRHPNNHIYLENGKPIYSIIQQLKTAPLSDLDEVIKNIA 355
Query: 511 GSS------SMKKANFCLNCRVSFSNAGVEELMLLCKSCV------ELKESQAGSAEIKE 558
GSS KA+F N V+ ++ +L + +++S GS +
Sbjct: 356 GSSVNMECFKAWKASFHQNNGVTEADENYHAQLLNHPQSIVSFPVQAVEDSFTGS---RL 412
Query: 559 PLSHSSEMEPQPPSVELEESPAPSGELT-DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKI 617
PL M+ E+T + + + +P+S S + K SS G I
Sbjct: 413 PLKQKELMK----------------EMTQERKHAAKKPSSYIYGSGLQHKKSS---EGAI 453
Query: 618 TRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAH 677
++D +H+L+F GL DGAE+ Y+V+G+ + L GYK+G GI+C+ C+SEVSPSQFEAH
Sbjct: 454 KKRDNDLHRLLFMPNGLPDGAELAYYVKGQ-RILGGYKQGNGIVCSHCDSEVSPSQFEAH 512
Query: 678 AGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPR 737
AGWA+RR+P++HIYTSNG++LH+++I L+ + ++ ++DD+C +C DGGDL+ CD CPR
Sbjct: 513 AGWAARRQPYRHIYTSNGLTLHDIAISLANGQNCTTGDSDDMCTLCGDGGDLILCDGCPR 572
Query: 738 AFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRI 797
AFH C+ L +P G W C C+ F ++ V AR RI+ R
Sbjct: 573 AFHPACLELQCLPEGDWRCPCCVENFCPDRKV-----ARPI-RIQ-----------LTRA 615
Query: 798 VQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKW 857
V+ P++E+GGCV+CR DF S+F RTV+LCDQCE+E+HVGCL+D G+ DL+ELPK KW
Sbjct: 616 VKAPESEIGGCVVCRAHDFSVSKFDDRTVMLCDQCEKEFHVGCLRDSGLCDLKELPKDKW 675
Query: 858 LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVD 917
CC DC R+++ALQ L RG E +P + +I +K+ E G + D++W +L GK
Sbjct: 676 FCCDDCSRVHVALQNLASRGPEMIPASVSSMINRKNLEKGLIDGAADDIQWCILSGKSCY 735
Query: 918 ASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 977
LLS+ +IF + FDPI+ S+ + DLIP MVYGR+ GQ++ GMYC +L
Sbjct: 736 KEH--LPLLSRTTAIFRECFDPIVASSGR-DLIPVMVYGRNISGQEFGGMYCVVLLAKST 792
Query: 978 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1037
VVSAG+ R+FGQE+AELP+VATS + QG+G+F++LF CIE+LL L VKTLVLP+A EA+
Sbjct: 793 VVSAGLIRVFGQEVAELPIVATSKEHQGKGFFRALFSCIEELLSSLGVKTLVLPAAEEAE 852
Query: 1038 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
AIWTNK GF M+EE KY + L IF+GTSML+K VP
Sbjct: 853 AIWTNKLGFQKMSEERMLKYTRELQLTIFKGTSMLEKEVP 892
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 46/107 (42%), Gaps = 5/107 (4%)
Query: 603 HSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILC 662
H + + +K K+ K + G+ DGA V Y K L G + G LC
Sbjct: 234 HFRAPNMELKMSKKVVPKSYPTNVKKLLSTGILDGALVKYISTSREKELQGVIRESGYLC 293
Query: 663 TC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 705
C CN ++ +FE HAG R P HIY NG ++ + +L
Sbjct: 294 GCSACNFTKVLTAYEFEQHAG-GRTRHPNNHIYLENGKPIYSIIQQL 339
>gi|54291565|dbj|BAD62489.1| PHD zinc finger protein-like [Oryza sativa Japonica Group]
Length = 779
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 310/722 (42%), Positives = 402/722 (55%), Gaps = 119/722 (16%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGI 446
K E M + + LK+ L +G+LEG V YI +G + + LRGV+K GI
Sbjct: 142 KMELKMSKKISFTRIPRNLKDLLATGLLEGHPVKYIMRKGKR------AVLRGVIKRVGI 195
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+ C D+ L+ L+ A+
Sbjct: 196 LCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDILRACSDATLDMLQSAI 255
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEM 566
+ +G + K+ C C+ SF+ + LLC SC+E K SQ + K
Sbjct: 256 QNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQNSTRTSK--------- 306
Query: 567 EPQPPSVELEESPAPSGELTDTSNRSPEP---NSAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ +P S + N SP NS+ S + S G+ITRKD
Sbjct: 307 --------IGRNPTSSAR--RSKNESPGSKYCNSSARGSKNAFPGVKTTSTGRITRKDKG 356
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+HKL F G L +G +VGY+V G+V SPSQFEAHAG A+R
Sbjct: 357 LHKLAFMSGVLPEGTDVGYYVGGKV---------------------SPSQFEAHAGRAAR 395
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
RKP+ +IY SNGVSLHELS+ LS R S++++DDLC IC DGG+LL CDSCPRAFH +C
Sbjct: 396 RKPYHNIYMSNGVSLHELSVSLSKGRNMSNRQSDDLCSICSDGGELLLCDSCPRAFHREC 455
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
V IP GTW CRYC N Q+E + YN NA AAGRI+G+DP Q+ +R IRI TP T
Sbjct: 456 VGFTTIPRGTWCCRYCENRQQRESSLAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVT 515
Query: 804 ELGGCVLC-------------RGR--------------DFCKSRFGRRTVILCDQCEREY 836
GGC LC R R DF K +F RTV+LCD
Sbjct: 516 GFGGCALCSMSGFMDKQSVLSRSRPDYDDELAVLDQLHDFSKKKFSARTVLLCD------ 569
Query: 837 HVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 896
Q LP+G W C ADC RI+ L+ L+ RG E + +++IK+K+E+
Sbjct: 570 -------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEIIKRKYEQK 616
Query: 897 GSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG 956
+ D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ DLIPAMVYG
Sbjct: 617 ALNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRDLIPAMVYG 673
Query: 957 RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCI 1016
VVSAG+FR+ G E+AELPLVATS D QG GYFQ+LF CI
Sbjct: 674 -------------------NTVVSAGLFRVMGSEIAELPLVATSRDSQGLGYFQALFGCI 714
Query: 1017 EKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
E+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQGTS L K V
Sbjct: 715 ERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQGTSTLHKLV 774
Query: 1077 PK 1078
PK
Sbjct: 775 PK 776
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 8 EEKFVVLSKIRVGLKREFEFALKVQSEICGSLG 40
E+KFV+ S +R GLKREF FA+ Q+ + +LG
Sbjct: 10 EDKFVLRSGVRTGLKREFAFAIASQAALSSTLG 42
>gi|357119016|ref|XP_003561242.1| PREDICTED: uncharacterized protein LOC100842921 [Brachypodium
distachyon]
Length = 1190
Score = 550 bits (1418), Expect = e-153, Method: Compositional matrix adjust.
Identities = 331/808 (40%), Positives = 446/808 (55%), Gaps = 81/808 (10%)
Query: 307 SSMVGKHS----STLCNGES--NVAKSVAVD----GNDEGKTVNVVVERPLRRFTRSLLQ 356
++++G+H + C+G + NV +V D + V+ +P RRFTRSLL+
Sbjct: 421 TTVIGQHQREELTATCHGSNRPNVLDNVLADPPLVTDGAATPVSTSGLKPRRRFTRSLLK 480
Query: 357 QKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQE---TVMKPRK--------------- 398
K E + G SD + A+ + P ++ +++KP
Sbjct: 481 SKPEEDAVTSEVQG---SDDNKDASVDLAPPSERRFTRSLLKPPADTASPPSVKKMEMKM 537
Query: 399 -----VMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDC 453
K +K L++G+LEGM VMYI + K + ++GV+ G I CFC C
Sbjct: 538 SKKVACFTKHPGNVKELLQTGLLEGMPVMYIIPNSKK----AVVKGVITGCNIRCFCIKC 593
Query: 454 KGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS 513
G++ ++ FELHAGS+ K P E+IYL NG +LRD++ C S LE+LE+ R +
Sbjct: 594 NGSRALSTYFFELHAGSNKKHPAEHIYLGNGNSLRDVLRACCGSSLESLEETFRSSIDPM 653
Query: 514 SMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSV 573
++ CLNC ++ E LC C++ K+ Q + S M P
Sbjct: 654 VIRSRPNCLNCGGHLPSSETEHF--LCHCCLDSKQPQDPPSPSYSCSKSDSSMTPS---- 707
Query: 574 ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGG 633
++SS S GK+T KD +HKLVF+
Sbjct: 708 ------------------------FKSSSVKISSIKKAGSSGKVTTKDTGLHKLVFKV-- 741
Query: 634 LEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTS 693
L DG EV Y+V G+ K + GY K I C C+ VSPS FEAHAG SRRKP+ +I+TS
Sbjct: 742 LLDGTEVAYYVDGQRK-VDGYIKDQRIYCNHCSRVVSPSAFEAHAGEGSRRKPYDNIFTS 800
Query: 694 NGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGT 753
NGVSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L PS
Sbjct: 801 NGVSLHELSMKISKDMELSERETDDLCRECGLGGDIFPCKMCPRSFHPACVRLSEFPS-E 859
Query: 754 WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 813
W C C N QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+
Sbjct: 860 WFCDNCSNLVQKEKALAANKNAKAAGRQAGVDSIEQIMKRAIRIVPICD-DLGGCALCKK 918
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 873
+DF + F RTVILCDQCE+EYHVGCL+ DL+ELP G+W CC+ C I L K+
Sbjct: 919 KDFNNAVFDERTVILCDQCEKEYHVGCLRTQWQVDLKELPDGEWFCCSSCSEIRSCLDKM 978
Query: 874 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 933
+ G + L + L++I+KKHE G D D+RW++L G+ A++ LLS AV I
Sbjct: 979 ISDGAQPLSGSDLEIIRKKHESRGLSMDTDIDIRWQLLAGRS--ATEDGSLLLSSAVPII 1036
Query: 934 HDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQ 989
H FDPIIE+ + DLIP MV GR + GQDY GMYCA++T+ VVSA + RI G
Sbjct: 1037 HQSFDPIIEANTGRDLIPEMVNGRRPKEGMPGQDYSGMYCAVITLGSTVVSAALLRIMGG 1096
Query: 990 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1049
++AELPLVATS D QG GYFQ LF C+E++L L +K +LP+A EA+AIW KFGFS +
Sbjct: 1097 DVAELPLVATSMDLQGLGYFQVLFSCMERMLISLKIKHFMLPAAQEAEAIWMKKFGFSRI 1156
Query: 1050 TEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
+E+ Y N L +F GTS L K VP
Sbjct: 1157 PQEQLEAYLNGAHLTVFHGTSNLYKAVP 1184
>gi|449440345|ref|XP_004137945.1| PREDICTED: uncharacterized protein LOC101207817 [Cucumis sativus]
Length = 842
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 287/695 (41%), Positives = 411/695 (59%), Gaps = 71/695 (10%)
Query: 386 GPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRG-SKVKGPGVSGLRGVVKGS 444
GP K E M + + + S +K L +GIL+G V Y+ S++K L+G++ G
Sbjct: 210 GPNKMELKMSKKVLPNNYPSNVKKLLSTGILDGARVKYVSTTSEMK------LQGIINGG 263
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C +++ FE HAG + P +IYLENG+ + ++ K +PL L++
Sbjct: 264 GYMCGCSTCNFTAILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDE 323
Query: 505 AVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSS 564
+ V GSS V + +A A + S+
Sbjct: 324 VIMEVAGSS------------------------------VNMNSFEAWKASFHQ---DSA 350
Query: 565 EMEPQPPSVELEESPAPSGELTDTSNRSPEPN-SAQTSSHSKMKSSSVKSHGKITRKDLR 623
+ + V+L + P P PN S H K K R+D
Sbjct: 351 NIVVENHDVKLPKLSHPV--------ERPNPNFSNAVLQHKKTAEKGTK------RRDND 396
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+H+L+F GL DGAE+ YFV+G+ + L G+K+G GILC+ CN E+SPSQFEAHAG A+R
Sbjct: 397 LHRLLFMPNGLPDGAELAYFVKGQ-RILGGFKQGNGILCSHCNREISPSQFEAHAGMAAR 455
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
R+P++HIYT+NG++LH+++I L+ + ++ ++DD+C C +GGDL+ CD CPRA+H C
Sbjct: 456 RQPYRHIYTTNGLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGC 515
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
+ L +P G W C C + V N+ A + G + P +V R R+V+ P+
Sbjct: 516 LHLQNVPEGVWSCPNCRDK------VGSNSKAISGGSLSFSKP---IVFRLTRVVKAPEY 566
Query: 804 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
E+GGCV+CR DF ++F RTV+LCDQCERE+HVGCL+D G+ DL+ELPK KW CC +C
Sbjct: 567 EIGGCVVCRRHDFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDEC 626
Query: 864 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDG 921
I++ALQ V G + +P++ D+I +KH G D A++ DVRW++L GK D
Sbjct: 627 SNIHVALQNTVLNGAQIIPDSLSDLIIRKHVGKGLLVDEALN-DVRWQILSGKSRFPED- 684
Query: 922 TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSA 981
LS+A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V +VVSA
Sbjct: 685 -LPFLSRATAIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSA 742
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
G+ RIFG+E+AELP+VATS + QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT
Sbjct: 743 GLLRIFGREVAELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWT 802
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
K GF M+EE+ KY + L IF GTSML+K V
Sbjct: 803 KKLGFRKMSEEQLIKYMREVQLTIFNGTSMLEKVV 837
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 59/131 (45%), Gaps = 24/131 (18%)
Query: 593 PEPN-----SAQTSSHSKMKS--SSVKSHGKITRKDLRMHKLVFEEG-----------GL 634
PEP+ +TS+HS S SS S G + +L+M K V G+
Sbjct: 181 PEPSLLDDACTETSAHSCEGSAESSRYSLGP-NKMELKMSKKVLPNNYPSNVKKLLSTGI 239
Query: 635 EDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CN--SEVSPSQFEAHAGWASRRKPFQHI 690
DGA V Y L G G G +C C CN + +S +FE HAG+ +R P HI
Sbjct: 240 LDGARVKYVSTTSEMKLQGIINGGGYMCGCSTCNFTAILSAYEFEQHAGFKTRH-PNNHI 298
Query: 691 YTSNGVSLHEL 701
Y NG ++ +
Sbjct: 299 YLENGRPIYSV 309
>gi|242097188|ref|XP_002439084.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
gi|241917307|gb|EER90451.1| hypothetical protein SORBIDRAFT_10g031300 [Sorghum bicolor]
Length = 880
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 262/464 (56%), Positives = 320/464 (68%), Gaps = 24/464 (5%)
Query: 615 GKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQF 674
K +D MHK+VF G L++G +VGY+V G+V SPSQF
Sbjct: 438 AKNNSRDKGMHKVVFMSGVLQEGTDVGYYVGGKV---------------------SPSQF 476
Query: 675 EAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDS 734
EAHAG A+RRKP+ +IY SNGVSLHELSI L + S++++DDLC IC DGG LL CD+
Sbjct: 477 EAHAGRAARRKPYHNIYMSNGVSLHELSISLLKGQKMSNRQSDDLCSICSDGGQLLLCDT 536
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 794
CPRAFH +CVSL P GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R
Sbjct: 537 CPRAFHRECVSLSSAPKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRS 596
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
IRI TP+T GGC LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+
Sbjct: 597 IRIATTPETGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPE 656
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 914
G W C DC RIN LQ L++ G E +P LDVIKKK E G + D DVRWRVL+ K
Sbjct: 657 GAWYCSTDCVRINQTLQDLLNHGGEPVPTMDLDVIKKKREVKGFNEDADLDVRWRVLKDK 716
Query: 915 KVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTV 974
+SD ++ +LSKAV+IFH+ FDPII+ ++ DLIPAMVYGRS R QDY GMYCA+LTV
Sbjct: 717 ---SSDDSKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCAVLTV 773
Query: 975 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1034
N VVSAG+FRI G E+AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A
Sbjct: 774 NNTVVSAGLFRIMGNEIAELPLVATSRDSQGLGYFQALFSCIERLLASLEVKHFVLPAAE 833
Query: 1035 EAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
EA++IWT +FGF+ ++++E +Y +FQGTS L K V K
Sbjct: 834 EAESIWTERFGFTKISQDELREYLKGGRTTVFQGTSNLHKLVAK 877
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 58/97 (59%)
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
QVV+P FE+HAGS+ K P +YI+LENG L D++ C + L+ LE AVR +G + K
Sbjct: 346 QVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNVTLDMLESAVRKAIGPAPQK 405
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGS 553
+ C C+ SFS + L C SC+E K ++ S
Sbjct: 406 RTFRCKGCKSSFSTLCSGKFALFCDSCLESKGAKNNS 442
>gi|297827261|ref|XP_002881513.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327352|gb|EFH57772.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 862
Score = 537 bits (1383), Expect = e-149, Method: Compositional matrix adjust.
Identities = 296/732 (40%), Positives = 433/732 (59%), Gaps = 73/732 (9%)
Query: 393 VMKPRKVMRKFY-SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCD 451
V P+K++ Y S +K LE+GILEG V YI P V L+G++ G C C
Sbjct: 159 VKMPKKIVALSYPSNVKKLLETGILEGAPVKYI-----STPPVRELQGIIHSGGYLCGCT 213
Query: 452 DCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
C ++V++ FELHAG+ + P +I+LENG+ + +I+ K +P + LE+ +R V G
Sbjct: 214 TCSFSKVLSAYEFELHAGAKTRHPNNHIFLENGRAVYNIVQELKTAPRDVLEEVIRNVAG 273
Query: 512 SS------------SMKKANFCLNC-----------RVSFSNAGVEELMLL------CKS 542
S+ + +N + C V + + V +L C+S
Sbjct: 274 SALNEEGLQAWKGDGQQVSNRVIACLTGTISRNIPLSVIYLDPVVTYHFVLLSETDSCES 333
Query: 543 CVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPN--SAQT 600
C L + + L+ P SVE P E T + EP + +
Sbjct: 334 CSYLGSGTGPGLDESQSLT--------PCSVENHYFP----EKTYAKDTLDEPKRIAKKL 381
Query: 601 SSHSKMKSSSVK-SHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFG 659
+SH K S G ++D +H+L+F GL DG E+ Y+V+ + K L GYK+G G
Sbjct: 382 TSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQ-KLLHGYKQGSG 440
Query: 660 ILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDL 719
I+C+CC+ E+SPSQFEAHAG A+RR+P++HI+ S+G+SLH++++ L+ ++ ++DD+
Sbjct: 441 IVCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDM 500
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN---TFQKEKFVEYNANAR 776
C IC DGGDLL C CP+AFH C+ +P GTW+C C + + +K + + NAR
Sbjct: 501 CSICGDGGDLLLCAGCPQAFHTACLKFQSVPEGTWYCSSCNDGPISSKKATATDPSGNAR 560
Query: 777 AAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREY 836
+V R R+V+ P++E+GGCV CR DF +F RTVILCDQCE+EY
Sbjct: 561 P------------IVIRLSRVVKAPESEIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEY 608
Query: 837 HVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEES 896
HVGCL+++G+ DL+E+P+ KW CC+DC RI+ A+Q V G + +P LD+I +K E
Sbjct: 609 HVGCLRENGLCDLKEIPQEKWFCCSDCSRIHTAVQNSVSCGPQTIPTPLLDMICRKDREK 668
Query: 897 G--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 954
G +DN D V WR+L GK LLS+A IF + FDPI+ + + DLIP MV
Sbjct: 669 GIFTDNG-DI-VEWRILSGKSRYPEH--LPLLSRAAVIFRECFDPIVAKSGR-DLIPVMV 723
Query: 955 YGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFC 1014
YGR+ GQ++ GMYC +L VN +VVSA + RIFGQ++AELP+VATS + QG+GYFQ L+
Sbjct: 724 YGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQQVAELPIVATSREYQGRGYFQGLYA 783
Query: 1015 CIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
C+E LL LNV+ LVLP+A EA++IWT KFGF+ M++++ +Y+ + L IF+GTSML+K
Sbjct: 784 CVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEK 843
Query: 1075 PVPKCRIVGKSV 1086
VPK + +S
Sbjct: 844 KVPKTTSLSEST 855
>gi|334184778|ref|NP_181288.4| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|330254317|gb|AEC09411.1| acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 829
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/805 (37%), Positives = 455/805 (56%), Gaps = 77/805 (9%)
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERP------- 346
D+ + V++G++E S S + +++V S + + + V+E P
Sbjct: 70 DVSSQPVKSGLVECSGSDFGSEETVSDDASVVGSSQTEQSSDVLPSRFVLEIPKHLSSTG 129
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSK 406
+ + T L + K E L KD + V ++ K + V + S
Sbjct: 130 ITKITFKLSKPKKEFDDLPLIKDHTWDAGVVKMP--------------KKKIVSLSYPSN 175
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K LE+GILEG V YI P V L G++ G C C C ++V++ FE
Sbjct: 176 VKKLLETGILEGARVKYI-----STPPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQ 230
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HAG+ + P +I+LEN + + +I+ K +P LE+ +R V GS+ LN
Sbjct: 231 HAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSA--------LN--- 279
Query: 527 SFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSG--- 583
EE + K+ + Q+ S + ++ S + P ++ +S P
Sbjct: 280 -------EEGLRAWKASFQ----QSNSMSDRNYITDHSTVSYLGPGLDESQSLTPCSVEN 328
Query: 584 ----ELTDTSNRSPEPN--SAQTSSHSKMKSSSVK-SHGKITRKDLRMHKLVFEEGGLED 636
E T + EP + + +SH K S G ++D +H+L+F GL D
Sbjct: 329 HYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLFMPNGLPD 388
Query: 637 GAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 696
G E+ Y+V+ + K L GYK+G GI+C+CC+ E+SPSQFEAHAG A+RR+P++HI+ S+G+
Sbjct: 389 GTELAYYVKTQ-KLLQGYKQGSGIVCSCCSREISPSQFEAHAGMAARRQPYRHIFISSGL 447
Query: 697 SLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 756
SLH++++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +P GTW+C
Sbjct: 448 SLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYC 507
Query: 757 RYCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG 813
C + + +K + + NAR +V R R+V+ P++++GGCV CR
Sbjct: 508 SSCNDGPISSKKATTTDPSGNARP------------IVIRLSRVVKAPESDIGGCVFCRS 555
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL 873
DF +F RTVILCDQCE+EYHVGCL+++G DL+E+P+ KW CC++C RI+ A+Q
Sbjct: 556 HDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRIHTAVQNS 615
Query: 874 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 933
V G + LP LD+I +K E G + V WR+L GK LLS+A IF
Sbjct: 616 VSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEH--LPLLSRAAVIF 673
Query: 934 HDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
+ FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIFGQE+AE
Sbjct: 674 RECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQEVAE 732
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
LP+VATS + QG+GYFQ L+ C+E LL LNV+ LVLP+A EA++IWT KFGF+ M++++
Sbjct: 733 LPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQ 792
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+Y+ + L IF+GTSML+K VPK
Sbjct: 793 LQEYQKEVQLTIFKGTSMLEKKVPK 817
>gi|255581042|ref|XP_002531337.1| DNA binding protein, putative [Ricinus communis]
gi|223529059|gb|EEF31044.1| DNA binding protein, putative [Ricinus communis]
Length = 856
Score = 530 bits (1366), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/700 (43%), Positives = 417/700 (59%), Gaps = 63/700 (9%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + + F S +K L +GIL+G V YI + L G++ G G C C
Sbjct: 203 ELKMSKKVLPNTFPSNVKKLLSTGILDGARVKYISPQRE-------LYGIIDGGGYLCGC 255
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V+T FELHAG+ + P +IYLENGK + I+ K +PL +++ ++
Sbjct: 256 PSCNFSRVLTAYEFELHAGAKTRHPNNHIYLENGKPICSIIQELKAAPLGAVDEVIKDAA 315
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE-IKEPLSHSSEMEPQ 569
GSS +N EE + K+ + G+ E L +S
Sbjct: 316 GSS--------IN----------EEFFQVWKASLHQCNGIIGADEKCYSMLPYSPHSLGS 357
Query: 570 PPSVELEESPAPSGELTDTSN---RSPEPNSAQTSSHSKMKSSSVKSHGKIT------RK 620
S LEES P SN R +S++ + + SS+ SH K T R+
Sbjct: 358 YSSQGLEESGCPPCSSFVHSNPFRRQKYMDSSEEHKRAFRRPSSL-SHPKKTNEGGTRRR 416
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGW 680
D +H+L+F GL DGAE+ Y+++G+ K L GYK+G GI+C+CC+ E+SPSQFEAHAG
Sbjct: 417 DNDLHRLLFMPNGLPDGAELAYYIKGQ-KMLAGYKQGNGIVCSCCDREISPSQFEAHAGM 475
Query: 681 ASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFH 740
A+RR+P++HIYTSNG++LH+++ L+ + ++ +DD+C C DGGDL+ C+SCPRAFH
Sbjct: 476 AARRQPYRHIYTSNGLTLHDIATSLANGQNLTTGLSDDMCAECGDGGDLIFCESCPRAFH 535
Query: 741 IDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT 800
+ C+ L +PS WHC C KF +R+ +V R R+V+T
Sbjct: 536 LVCLGLKYVPSDVWHCPNC------NKFGHGGNFSRS------------IVIRLTRVVKT 577
Query: 801 PDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCC 860
P+ E+GGCV CR DF F RTVILCDQCERE+HVGCL+D+G+ DL+E+PK W C
Sbjct: 578 PEYEVGGCVFCRAHDFSTHTFNDRTVILCDQCEREFHVGCLRDNGLCDLKEIPKDNWFCS 637
Query: 861 ADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG---SDNAVDFDVRWRVLRGKKVD 917
DC RI ALQ V G + +P L++I KH E G A DF +WR+L GK
Sbjct: 638 NDCNRIYEALQNFVSSGVQMIPSLQLNIITGKHAEKGLYIDGQANDF--QWRILMGKSRY 695
Query: 918 ASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 977
D +LLS A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V V
Sbjct: 696 QED--LSLLSAAAAIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVLLLVKNV 752
Query: 978 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1037
VVSAG+ RIFG+++AELPLVATS + QG+GYFQ+LF CIE+LL LNV LVLP+A EA+
Sbjct: 753 VVSAGLLRIFGRDVAELPLVATSREHQGKGYFQALFSCIERLLCSLNVVKLVLPAAEEAE 812
Query: 1038 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
+IWT +FGF MTEE+ ++Y + L IF+GTSML+K VP
Sbjct: 813 SIWTRRFGFRKMTEEQLSQYTRELQLTIFKGTSMLEKEVP 852
>gi|356548148|ref|XP_003542465.1| PREDICTED: uncharacterized protein LOC100808999 [Glycine max]
Length = 852
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 282/688 (40%), Positives = 416/688 (60%), Gaps = 72/688 (10%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+G V YI PG L+G++ G G C C
Sbjct: 230 ELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYN-----PGKVELQGIIDGGGYLCGC 284
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENG+ + I+ K +PL L++ ++ V
Sbjct: 285 SMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSLLDEVIKNVA 344
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS ++ SF A E L+ + +++ S+S+++ P
Sbjct: 345 GSSVNEE---------SF-QAWKESLL-------------QSNGKVQAHKSYSTKLVGMP 381
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+N P S+ TS+ ++ S + G R+D +H+L+F
Sbjct: 382 -----------------HTNIRP---SSYTSNSGVLQKRS--ADGCTKRRDNDLHRLLFM 419
Query: 631 EGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
GL DGAE+ Y+V+G+ K L GYK+G GI+C CC+ E+SPSQFEAHAG A+RR+P++HI
Sbjct: 420 PNGLPDGAELAYYVKGQ-KLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHI 478
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 750
YTSNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C+ L +P
Sbjct: 479 YTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVP 538
Query: 751 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 810
W C C++ N R + + + + R R+ +TP+ E+GGCV+
Sbjct: 539 DSGWQCLNCIDN---------AGNGRESSIVRPI------MIRLTRVDKTPEVEMGGCVV 583
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 870
CR DF ++F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC RI AL
Sbjct: 584 CREHDFSVAKFDERTVIICDQCEKEYHVGCLRDMGLCELEELPKDKWFCCDDCNRIYAAL 643
Query: 871 QKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSK 928
Q V G E +P + ++I +KHE+ G + A++ D++WR+L GK LLS+
Sbjct: 644 QNSVSAGAEIIPASFSELIIRKHEDKGLCTYGAMN-DIQWRILSGKSRYPEH--LPLLSR 700
Query: 929 AVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
A +IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN VVVSAG+ RIFG
Sbjct: 701 AAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNYVVVSAGLLRIFG 759
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+ +AELPLVATS QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF
Sbjct: 760 RNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRK 819
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
M+E++ +K+ + L +F TSML+K V
Sbjct: 820 MSEDQLSKHLREVQLTLFNKTSMLEKTV 847
>gi|38230506|gb|AAR14274.1| predicted protein [Populus tremula x Populus alba]
Length = 868
Score = 524 bits (1350), Expect = e-145, Method: Compositional matrix adjust.
Identities = 285/691 (41%), Positives = 408/691 (59%), Gaps = 39/691 (5%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+ V YI S + L G++ G G C C
Sbjct: 206 ELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERE-----LDGIIDGGGYLCGC 260
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +IYLENGK + I+ K +PL ++ ++ V
Sbjct: 261 SSCSFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 320
Query: 511 GSSSMKKANFCLNCRVSFSNA--GVEELMLLCKSCVELKESQAGSAEIKEPLSH-SSEME 567
GSS ++ ++ SNA G ++ C+ S +KE SS
Sbjct: 321 GSSINEEFFRVWKASLNQSNALVGADKKSYSELPCLPHSHVSYASQALKESFCPISSSFL 380
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
V + + SG TS R P+ S +K K ++ + ++D +H+L
Sbjct: 381 YNNNFVSQQTNMETSGVNKQTSKR---PSFYVPGSATKQKKTA---ESGVRKRDNDLHRL 434
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
+F GL DG E+ Y+V+G+ K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+P+
Sbjct: 435 LFMPNGLPDGTELAYYVKGQ-KILGGYKQGNGIVCSCCEIEISPSQFESHAGMSARRQPY 493
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
+HIYTSN ++LH+++I L+ + ++ DD+C C DGGDL+ C SCPRAFH C+ L
Sbjct: 494 RHIYTSNRLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLH 553
Query: 748 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 807
P G WHC C F +V R R+V+TP+ ++GG
Sbjct: 554 DTPEGAWHCPNCNKLGHGGNFAR------------------PIVIRLTRVVKTPEYDVGG 595
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
C +CR DF F RTVILCDQCE+E+HVGCL++ G+ DL+E+PK W CC DC I
Sbjct: 596 CAVCRAHDFSGDTFDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIY 655
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRAL 925
+AL+ V G + +P + L+ I +KH E G D A +DV+W++L GK + D +L
Sbjct: 656 VALRNSVSTGVQTIPVSLLNTINRKHVEKGLLVDEAA-YDVQWQILMGKSRNRED--LSL 712
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFR 985
LS A +IF + FDPI+ + DLIP MVYGR+ GQ++ GMYC +LTV VVVSAG+ R
Sbjct: 713 LSGAAAIFRECFDPIVAKTGR-DLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLR 771
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
IFG+E+AELPLVAT+ + QG+GYFQ+LF CIE+LL LNV+ LVLP+A EA++IWT +FG
Sbjct: 772 IFGREVAELPLVATNREHQGKGYFQALFSCIERLLCSLNVEQLVLPAAEEAESIWTRRFG 831
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
F M+E + KY ++ L IF+GTSML+K V
Sbjct: 832 FRKMSEGQLLKYTREFQLTIFKGTSMLEKEV 862
>gi|356536874|ref|XP_003536958.1| PREDICTED: uncharacterized protein LOC100794242 [Glycine max]
Length = 855
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 282/688 (40%), Positives = 416/688 (60%), Gaps = 72/688 (10%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+G V YI PG L+G++ G G C C
Sbjct: 233 ELKMSKKVVPNCYPTNVKKLLSTGILDGAVVKYIYN-----PGKVELQGIIDGGGYLCGC 287
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENG+ + I+ K +PL L++ ++ V
Sbjct: 288 SMCNYSRVLSAYEFEQHAGAKTRHPNNHIFLENGRPIYSIIQEIKTAPLSILDEVIKNVA 347
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS ++ SF A E L+ + +++ S+S+++ P
Sbjct: 348 GSSVNEE---------SF-QAWKESLL-------------QSNGKVQAHKSYSTKLVGMP 384
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFE 630
+N P S+ TS+ ++ S + G R+D +H+L+F
Sbjct: 385 -----------------HTNIRP---SSYTSNTGVLQKRS--ADGCTKRRDNDLHRLLFM 422
Query: 631 EGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
GL DGAE+ Y+V+G+ K L GYK+G GI+C CC+ E+SPSQFEAHAG A+RR+P++HI
Sbjct: 423 PNGLPDGAELAYYVKGQ-KLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAARRQPYRHI 481
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 750
YTSNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C+ L +P
Sbjct: 482 YTSNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAACLGLQCVP 541
Query: 751 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 810
W C C + N R + + + + R R+ +TP+ E+GGCV+
Sbjct: 542 DSGWQCLNCRDN---------AGNGRESSIVRPI------MIRLTRVDKTPEFEMGGCVV 586
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 870
CR DF ++F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC RI +AL
Sbjct: 587 CREHDFSVAKFDERTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDCNRIYVAL 646
Query: 871 QKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSK 928
Q V G E +P + ++I +KHE+ G + A++ D++WR+L GK LLS+
Sbjct: 647 QNSVAAGAEIIPASVSELIIRKHEDKGLCTYGAMN-DIQWRILSGKSRYPEH--LPLLSR 703
Query: 929 AVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
A +IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN VVVSAG+ RIFG
Sbjct: 704 AAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSVVVSAGLLRIFG 762
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+ +AELPLVATS QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF
Sbjct: 763 RNVAELPLVATSRAHQGKGYFQVLFSCIERLLSSLNVEKLVLPAAGDAESIWTKKLGFRK 822
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
M+E++ +K+ + L +F TSML+K V
Sbjct: 823 MSEDQLSKHLREVQLTLFNKTSMLEKTV 850
>gi|358346906|ref|XP_003637505.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355503440|gb|AES84643.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 897
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 279/694 (40%), Positives = 411/694 (59%), Gaps = 50/694 (7%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V F + +K L +GIL+G +V YI PG L G++ G C C
Sbjct: 242 ELKMSKKVVPNAFPNNVKKLLSTGILDGAAVKYIYN-----PGKVELDGIIGDGGYLCGC 296
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +I+LENGK + I++ K + T ++ ++ V
Sbjct: 297 SMCSYSRVLSAYEFEQHAGAKTRHPNNHIFLENGKPIYSIIHEIKTATNSTPDEVIKNVA 356
Query: 511 GSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
GSS + + +F E L+ K K+ S I P +++S+
Sbjct: 357 GSS-INEGSF---------QVWKESLLQSNKKVPTQKKYSTKSTGI--PHTYNSQSIESA 404
Query: 571 PSV-------ELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLR 623
S E+ + + +P++ S K S+ G ++D
Sbjct: 405 SSFSSLRVRNHFEQQMYVNQTADEWKRVVKKPSTYTYYSGIPQKRSA---DGCTKKRDND 461
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASR 683
+H+L+F GL DGAE+ Y+V+G+ K L GYK+G GI+C CC+ E+SPSQFEAHAG A+R
Sbjct: 462 LHRLLFMPNGLPDGAELAYYVKGQ-KLLGGYKQGNGIVCGCCDIEISPSQFEAHAGMAAR 520
Query: 684 RKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
R+P++HIY SNG++LH++++ L+ + ++ ++DD+C +C DGGDL+ C+ CPRAFH C
Sbjct: 521 RQPYRHIYASNGLTLHDIALSLANGQNLTTGDSDDMCAVCGDGGDLILCNGCPRAFHAAC 580
Query: 744 VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT 803
+ L +P WHC C + E+ G P ++ R R+ + P+
Sbjct: 581 LGLHSVPESGWHCLNCEDNTGDER---------------GARP---IMIRLTRVDKEPEY 622
Query: 804 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
E+GGCV+CR DF +F RTVI+CDQCE+EYHVGCL+D G+ +L+ELPK KW CC DC
Sbjct: 623 EVGGCVVCRANDFSVDKFDDRTVIICDQCEKEYHVGCLRDIGLCELEELPKDKWFCCDDC 682
Query: 864 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDF-DVRWRVLRGKKVDASDGT 922
RI +ALQ V G + +P + ++I +KHE+ G D D++WR+L GK A
Sbjct: 683 NRIYVALQNSVSAGADTIPSSLSELIIRKHEDRGLCTYGDMNDIQWRILSGKSRYAEH-- 740
Query: 923 RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAG 982
LLS+A +IF + FDPI+ + S DLIP MVYGR+ GQ++ GMYC +L VN +VVSAG
Sbjct: 741 LPLLSRAAAIFRECFDPIV-AISGRDLIPVMVYGRNISGQEFGGMYCIVLIVNSIVVSAG 799
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
+ RIFG+ +AELPLVATS + QG+GYFQ+LF CIE+LL LNV+ LVLP+A +A++IWT
Sbjct: 800 LLRIFGRNIAELPLVATSREHQGKGYFQALFSCIERLLSSLNVEKLVLPAAGDAESIWTK 859
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
K GF M+E++ K+ + L +F TS+L+K V
Sbjct: 860 KLGFHKMSEDQLTKHLKEVQLTLFNKTSVLEKMV 893
>gi|334185956|ref|NP_190936.2| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
gi|225898713|dbj|BAH30487.1| hypothetical protein [Arabidopsis thaliana]
gi|332645606|gb|AEE79127.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain-containing protein [Arabidopsis thaliana]
Length = 841
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 271/690 (39%), Positives = 397/690 (57%), Gaps = 54/690 (7%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 182 FLSNVKKLLGTGILDGARVKYLSTSAARE-----LQGIIHSGGYLCGCTACDFSKVLGAY 236
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS---------- 512
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS
Sbjct: 237 EFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIAPPDVLEEVIRKVAGSALSEEGFQAW 296
Query: 513 --SSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
S + N + + + L+ S L ESQ+ + E ++
Sbjct: 297 KGSFQQDKNMTEDDSNHIMDHSFQSLVSYPGSGWSLDESQSSTPCFPEDNYFREKI---- 352
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVF 629
T + + +P + + +SH M S G ++D +H+L+F
Sbjct: 353 --------------CTKDTRHAHKPKAKKLTSHMFGMGCHKKVSGGGKWKRDNDLHRLLF 398
Query: 630 EEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 689
GL DG E+ Y+V+ + K L GYK+G GI+C+CC++++SPSQFEAHAG A RR+P++
Sbjct: 399 LPNGLPDGTELAYYVKSQ-KLLQGYKQGSGIVCSCCDTKISPSQFEAHAGMAGRRQPYRR 457
Query: 690 IYTSNGVSLHELSIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 748
I+ S+G+SLH++++ L+ ++ ++DD+C IC +GGDLL C CP+AFH C+
Sbjct: 458 IHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAFHTACLKFQS 517
Query: 749 IPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDP-FAQMVSRCIRIVQTPDTELGG 807
+P GTW+C C N + DP +V R R+V+ P++E+GG
Sbjct: 518 MPEGTWYCSSC------------NDGPTSCKIATASDPNLKPIVIRLTRVVKAPESEIGG 565
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
CV CR DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW CC+DC RI+
Sbjct: 566 CVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFCCSDCSRIH 625
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
LQ G + +P LD I +K+ E G V WR+L GK LLS
Sbjct: 626 RVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPEH--LPLLS 683
Query: 928 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIF 987
+A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIF
Sbjct: 684 RAATIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIF 742
Query: 988 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1047
GQ++AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++IWTNKFGF+
Sbjct: 743 GQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTNKFGFT 802
Query: 1048 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
MTE +Y+ + L IF+GTSML+K VP
Sbjct: 803 KMTEHRLQRYQREVQLTIFKGTSMLEKKVP 832
>gi|449483630|ref|XP_004156643.1| PREDICTED: uncharacterized protein LOC101223245 [Cucumis sativus]
Length = 781
Score = 504 bits (1297), Expect = e-139, Method: Compositional matrix adjust.
Identities = 264/624 (42%), Positives = 377/624 (60%), Gaps = 64/624 (10%)
Query: 456 NQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSM 515
++++ FE HAG + P +IYLENG+ + ++ K +PL L++ + V GSS
Sbjct: 214 TEILSAYEFEQHAGFKTRHPNNHIYLENGRPIYSVIQEIKSAPLSILDEVIMEVAGSS-- 271
Query: 516 KKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVEL 575
V + +A A + S+ + + V+L
Sbjct: 272 ----------------------------VNMNSFEAWKASFHQ---DSANIVVENHDVKL 300
Query: 576 EESPAPSGELTDTSNRSPEPN-SAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGL 634
+ P P PN S H K K R+D +H+L+F GL
Sbjct: 301 PKLSHPV--------ERPNPNFSNAVLQHKKTAEKGTK------RRDNDLHRLLFMPNGL 346
Query: 635 EDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSN 694
DGAE+ YFV+G+ + L G+K+G GILC+ CN E+SPSQFEAHAG A+RR+P++HIYT+N
Sbjct: 347 PDGAELAYFVKGQ-RILGGFKQGNGILCSHCNREISPSQFEAHAGMAARRQPYRHIYTTN 405
Query: 695 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
G++LH+++I L+ + ++ ++DD+C C +GGDL+ CD CPRA+H C+ L +P G W
Sbjct: 406 GLTLHDIAISLASGQKLTTGDSDDMCAACGNGGDLIFCDRCPRAYHTGCLHLQNVPEGVW 465
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 814
C C + V N+ A + G + P +V R R+V+ P+ E+GGCV+CR
Sbjct: 466 SCPNCRDK------VGSNSKAISGGSLSFSKP---IVFRLTRVVKAPEYEIGGCVVCRRH 516
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 874
DF ++F RTV+LCDQCERE+HVGCL+D G+ DL+ELPK KW CC +C I++ALQ V
Sbjct: 517 DFSAAKFDDRTVLLCDQCEREFHVGCLRDSGLCDLKELPKDKWFCCDECSNIHVALQNTV 576
Query: 875 DRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 932
G + +P++ D+I +KH G D A++ DVRW++L GK D LS+A +I
Sbjct: 577 LNGAQIIPDSLSDLIIRKHVGKGLLVDEALN-DVRWQILSGKSRFPED--LPFLSRATAI 633
Query: 933 FHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELA 992
F + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L V +VVSAG+ RIFG+E+A
Sbjct: 634 FRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCVVLIVRSIVVSAGLLRIFGREVA 692
Query: 993 ELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEE 1052
ELP+VATS + QG+GYFQ LF CIE+LL LNV+ LVLP+A +A++IWT K GF M+EE
Sbjct: 693 ELPIVATSREHQGKGYFQVLFSCIERLLSSLNVQNLVLPAAEDAESIWTKKLGFRKMSEE 752
Query: 1053 EQNKYRNDYPLMIFQGTSMLQKPV 1076
+ KY + L IF GTSML+K V
Sbjct: 753 QLIKYMREVQLTIFNGTSMLEKVV 776
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 66/149 (44%), Gaps = 44/149 (29%)
Query: 343 VERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK 402
VERP F+ ++LQ K KG+ +D +D+ +
Sbjct: 307 VERPNPNFSNAVLQHKKTAEKGTKRRD----NDLHRLL---------------------- 340
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F+ +G+ +G + Y VKG + G G +G+GI C C N+ ++P+
Sbjct: 341 -------FMPNGLPDGAELAYF----VKGQRILG--GFKQGNGI--LCSHC--NREISPS 383
Query: 463 VFELHAGSSNKRPP-EYIYLENGKTLRDI 490
FE HAG + +R P +IY NG TL DI
Sbjct: 384 QFEAHAGMAARRQPYRHIYTTNGLTLHDI 412
>gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis thaliana]
gi|20197471|gb|AAM15090.1| unknown protein [Arabidopsis thaliana]
Length = 825
Score = 491 bits (1264), Expect = e-135, Method: Compositional matrix adjust.
Identities = 288/812 (35%), Positives = 438/812 (53%), Gaps = 86/812 (10%)
Query: 294 DIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERP------- 346
D+ + V++G++E S S + +++V S + + + V+E P
Sbjct: 61 DVSSQPVKSGLVECSGSDFGSEETVSDDASVVGSSQTEQSSDVLPSRFVLEIPKHLSSTG 120
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSK 406
+ + T L + K E L KD + V ++ K + V + S
Sbjct: 121 ITKITFKLSKPKKEFDDLPLIKDHTWDAGVVKMP--------------KKKIVSLSYPSN 166
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+K LE+GILEG V YI P V L G++ G C C C ++V++ FE
Sbjct: 167 VKKLLETGILEGARVKYI-----STPPVRQLLGIIHSGGYLCGCTTCNFSKVLSAYEFEQ 221
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSS------------S 514
HAG+ + P +I+LEN + + +I+ K +P LE+ +R V GS+
Sbjct: 222 HAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIRNVAGSALNEEGLRAWKGDG 281
Query: 515 MKKANFCLNCR--VSFSNAGVEELMLLCKSCVELKESQA--GSAEIKEPLSHSSEMEPQP 570
+ +N + C + + LL V L E+ + + + L S + P
Sbjct: 282 QQVSNRVIACLTGIILRIIPLSVNALLSYHFVLLSETDSCESCSYLGPGLDESQSLTP-- 339
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVK-SHGKITRKDLRMHKLVF 629
+E + P+ + + +SH K S G ++D +H+L+F
Sbjct: 340 --CSVENHYFSEKTYAKDTLDEPKRIAKKLTSHVSGTGCHKKVSEGSNRKRDNDLHRLLF 397
Query: 630 EEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 689
GL DG E+ Y+V+ +++SPSQFEAHAG A+RR+P++H
Sbjct: 398 MPNGLPDGTELAYYVK---------------------TQISPSQFEAHAGMAARRQPYRH 436
Query: 690 IYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI 749
I+ S+G+SLH++++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +
Sbjct: 437 IFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSM 496
Query: 750 PSGTWHCRYCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
P GTW+C C + + +K + + NAR +V R R+V+ P++++G
Sbjct: 497 PEGTWYCSSCNDGPISSKKATTTDPSGNARP------------IVIRLSRVVKAPESDIG 544
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
GCV CR DF +F RTVILCDQCE+EYHVGCL+++G DL+E+P+ KW CC++C RI
Sbjct: 545 GCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDLKEIPQEKWFCCSNCSRI 604
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
+ A+Q V G + LP LD+I +K E G + V WR+L GK LL
Sbjct: 605 HTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWRILSGKSRYPEH--LPLL 662
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRI 986
S+A IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RI
Sbjct: 663 SRAAVIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLIVNSLVVSAALLRI 721
Query: 987 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1046
FGQE+AELP+VATS + QG+GYFQ L+ C+E LL LNV+ LVLP+A EA++IWT KFGF
Sbjct: 722 FGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLVLPAAEEAESIWTKKFGF 781
Query: 1047 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ M++++ +Y+ + L IF+GTSML+K VPK
Sbjct: 782 TKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 813
>gi|297820102|ref|XP_002877934.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323772|gb|EFH54193.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 834
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 271/688 (39%), Positives = 397/688 (57%), Gaps = 48/688 (6%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 175 FPSNVKKLLATGILDGARVKYLSISPARE-----LQGIIHSGGYLCGCTVCDFSKVLGAY 229
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS+ + + F
Sbjct: 230 EFERHAGGKTKHPNNHIYLENGRPVYNMIQELRVAPPDVLEEVIRKVAGSA-LSEEGF-Q 287
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPS 582
+ SF +M + + + P S S E Q + E+
Sbjct: 288 AWKESFQQDDSNHIM-----------DYSFQSLVSYPGSGWSIDESQSSTPYFPENNYFR 336
Query: 583 GELTDTSNRS-PEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEV 640
+++ R +P + + +SH M + G ++D +H+L+F GL DG E+
Sbjct: 337 KKISTKDTRHEHKPKAKKVTSHMFGMGCHKKAAGGGKWKRDNDLHRLLFLPNGLPDGTEL 396
Query: 641 GYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE 700
YFV+ + K L GYK+G GI+C+CC++E+SPSQFEAHAG A RR+P++HI+ S+G+SLH+
Sbjct: 397 AYFVKSQ-KLLQGYKQGSGIVCSCCDTEISPSQFEAHAGMAGRRQPYRHIHISSGLSLHD 455
Query: 701 LSIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
+++ L+ ++ ++DD+C IC DGGDLL C CP+AFH C+ +P GTW+C C
Sbjct: 456 IAMSLADGGHVITTGDSDDMCSICGDGGDLLLCAGCPQAFHTACLKFQSMPEGTWYCSSC 515
Query: 760 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQ-TPDTELGGCVLC------R 812
N + DP + + I I + + C R
Sbjct: 516 ------------NDGPTSCKTATATDPNLKSIVGSIAIFSLSAHIRVLHSAYCFSPISDR 563
Query: 813 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW CC+DC RI+ ALQ
Sbjct: 564 SLDFSIGKFDDRTVILCDQCEKEYHVGCLRENDLCDLKGIPQDKWFCCSDCSRIHTALQS 623
Query: 873 LVDRGEEKLPETSLDVIKKKHEESGS--DNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 930
G + +P LD I +K+ E G DN + V WR+L GK A LLS+A
Sbjct: 624 SASCGPQTIPTVLLDTISRKYREKGICIDNGDN--VEWRMLSGKSRYAEH--LPLLSRAA 679
Query: 931 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
+IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VVSA + RIFGQ+
Sbjct: 680 TIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQK 738
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
+AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++IWT KFGF+ MT
Sbjct: 739 VAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESIWTKKFGFTKMT 798
Query: 1051 EEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
E + KY+ + L IF+GTSML+K VP+
Sbjct: 799 EHQLQKYQREVQLTIFKGTSMLEKKVPR 826
>gi|4415917|gb|AAD20148.1| putative PHD-type zinc finger protein [Arabidopsis thaliana]
Length = 958
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 235/469 (50%), Positives = 312/469 (66%), Gaps = 26/469 (5%)
Query: 620 KDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAG 679
KD +HKLVF+ GGL +G E+GY+ RG+ V C SE + E
Sbjct: 492 KDQGLHKLVFDRGGLPEGTELGYYARGQTYITVDRN---------C-SEATKWALEYIVT 541
Query: 680 WASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAF 739
AS + +IYTSNGVSLHE + S R +S+ +N+DLC IC DGG+LL CDSCPRAF
Sbjct: 542 VAS--ASYFYIYTSNGVSLHEWATTFSHGRKYSANDNNDLCVICADGGNLLLCDSCPRAF 599
Query: 740 HIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQ 799
HI+CVSLP IP G WHC+YC N F E EYN N+ A G++EGVDP Q+ RCIR+V+
Sbjct: 600 HIECVSLPSIPRGNWHCKYCENKFTSEIAGEYNVNSSAVGQLEGVDPVDQLAGRCIRVVK 659
Query: 800 TPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+ E G S FG RT+I+CDQCE+EYH+GCL + DL+ELPKG W C
Sbjct: 660 NMEAETNG-----------SGFGPRTIIICDQCEKEYHIGCLSSQNIVDLKELPKGNWFC 708
Query: 860 CADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDAS 919
DC RIN LQKL+ G EKL ++SL +I+ K E + + D D+RWR++ GK S
Sbjct: 709 SMDCTRINSTLQKLLLGGAEKLSDSSLGIIQTKQERNDVYSISDLDIRWRLISGKV--TS 766
Query: 920 DGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVV 979
+R LLS+A++IFHD FDPI++ S +LIP MVYG++ +GQDY G+ CA+LTVN VV
Sbjct: 767 PESRMLLSQALAIFHDCFDPIVDPLSGSNLIPRMVYGKTMQGQDYGGICCAVLTVNATVV 826
Query: 980 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1039
SAG+ R+FG+E+AELPLVAT + +GYFQ LF CIEKLL LNV+++V+P+A EA+ +
Sbjct: 827 SAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVESIVVPAAEEAEPL 886
Query: 1040 WTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIVGKSVD 1087
W NKFGF + E+ +KY + Y ++ F+G SMLQKPV +I+ K+++
Sbjct: 887 WMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQIIDKTIE 935
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 71/122 (58%), Gaps = 3/122 (2%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
+++ E+G+L+G+SV+Y+ K + LRG+++ GI C C C V++ + FE+
Sbjct: 264 VRDLFETGLLDGLSVVYMGTVKSQA---FPLRGIIRDGGILCSCSSCDWANVISTSKFEI 320
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRV 526
HA +R +YI ENGK+L D++N+ +++PL LE + + +S +K C C+
Sbjct: 321 HACKQYRRASQYICFENGKSLLDVLNISRNTPLHALEATILDAVDYASKEKRFTCKRCKE 380
Query: 527 SF 528
F
Sbjct: 381 GF 382
>gi|6729519|emb|CAB67675.1| putative protein [Arabidopsis thaliana]
Length = 839
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 262/698 (37%), Positives = 385/698 (55%), Gaps = 72/698 (10%)
Query: 403 FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPA 462
F S +K L +GIL+G V Y+ S + L+G++ G C C C ++V+
Sbjct: 182 FLSNVKKLLGTGILDGARVKYLSTSAARE-----LQGIIHSGGYLCGCTACDFSKVLGAY 236
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS---------- 512
FE HAG K P +IYLENG+ + +++ + +P + LE+ +R V GS
Sbjct: 237 EFERHAGGKTKHPNNHIYLENGRPVYNVIQELRIAPPDVLEEVIRKVAGSALSEEGFQAW 296
Query: 513 --SSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQP 570
S + N + + + L+ S L ESQ+ + E ++
Sbjct: 297 KGSFQQDKNMTEDDSNHIMDHSFQSLVSYPGSGWSLDESQSSTPCFPEDNYFREKI---- 352
Query: 571 PSVELEESPAPSGELTDTSNRSPEPNSAQTSSHS-KMKSSSVKSHGKITRKDLRMHKLVF 629
T + + +P + + +SH M S G ++D +H+L+F
Sbjct: 353 --------------CTKDTRHAHKPKAKKLTSHMFGMGCHKKVSGGGKWKRDNDLHRLLF 398
Query: 630 EEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQH 689
GL DG E+ Y+V+ + K L GYK+G GI+C+CC++++SPSQFEAHAG A RR+P++
Sbjct: 399 LPNGLPDGTELAYYVKSQ-KLLQGYKQGSGIVCSCCDTKISPSQFEAHAGMAGRRQPYRR 457
Query: 690 IYTSNGVSLHELSIKLS-LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 748
I+ S+G+SLH++++ L+ ++ ++DD+C IC +GGDLL C CP+AFH C+
Sbjct: 458 IHISSGLSLHDIAVSLADGGHVITTGDSDDMCSICGNGGDLLLCAGCPQAFHTACLKFQS 517
Query: 749 IPSGTWHCRYCMNTFQKEK---------FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQ 799
+P GTW+C C + K + NAN + P +
Sbjct: 518 MPEGTWYCSSCNDGPTSCKIATASWLYTYFNLNANILVLHSAYSLSPISD---------- 567
Query: 800 TPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
R DF +F RTVILCDQCE+EYHVGCL+++ + DL+ +P+ KW C
Sbjct: 568 ------------RSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENELCDLKGIPQDKWFC 615
Query: 860 CADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDAS 919
C+DC RI+ LQ G + +P LD I +K+ E G V WR+L GK
Sbjct: 616 CSDCSRIHRVLQSSASCGPQTIPTLLLDTISRKYREKGIYIDNGNTVEWRMLSGKSRYPE 675
Query: 920 DGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVV 979
LLS+A +IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC +L VN +VV
Sbjct: 676 H--LPLLSRAATIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYCLVLMVNSLVV 732
Query: 980 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1039
SA + RIFGQ++AELP+VATS + QG+GYFQ LF C+E LL LNV+ L+LP+A EA++I
Sbjct: 733 SAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACVENLLSSLNVENLLLPAAEEAESI 792
Query: 1040 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
WTNKFGF+ MTE +Y+ + L IF+GTSML+K VP
Sbjct: 793 WTNKFGFTKMTEHRLQRYQREVQLTIFKGTSMLEKKVP 830
>gi|110737508|dbj|BAF00696.1| hypothetical protein [Arabidopsis thaliana]
Length = 534
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 224/469 (47%), Positives = 319/469 (68%), Gaps = 19/469 (4%)
Query: 613 SHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPS 672
S G ++D +H+L+F GL DG E+ Y+V+ + K L GYK+G GI+C+CC+ E+SPS
Sbjct: 70 SEGSNRKRDNDLHRLLFMPNGLPDGTELAYYVKTQ-KLLQGYKQGSGIVCSCCSREISPS 128
Query: 673 QFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCC 732
QFEAHAG A+RR+P++HI+ S+G+SLH++++ L+ ++ ++DD+C IC DGGDLL C
Sbjct: 129 QFEAHAGMAARRQPYRHIFISSGLSLHDIAMSLANGHVITTGDSDDMCSICGDGGDLLLC 188
Query: 733 DSCPRAFHIDCVSLPGIPSGTWHCRYCMN---TFQKEKFVEYNANARAAGRIEGVDPFAQ 789
CP+AFH C+ +P GTW+C C + + +K + + NAR
Sbjct: 189 AGCPQAFHTACLKFQSMPEGTWYCSSCNDGPISSKKATTTDPSGNARP------------ 236
Query: 790 MVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDL 849
+V R R+V+ P++++GGCV CR DF +F RTVILCDQCE+EYHVGCL+++G DL
Sbjct: 237 IVIRLSRVVKAPESDIGGCVFCRSHDFSIGKFDDRTVILCDQCEKEYHVGCLRENGFCDL 296
Query: 850 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 909
+E+P+ KW CC++C RI+ A+Q V G + LP LD+I +K E G + V WR
Sbjct: 297 KEIPQEKWFCCSNCSRIHTAVQNSVSCGPQTLPTPLLDMICRKDREKGIFTDIGDTVEWR 356
Query: 910 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYC 969
+L GK LLS+A IF + FDPI+ + + DLIP MVYGR+ GQ++ GMYC
Sbjct: 357 ILSGKSRYPEH--LPLLSRAAVIFRECFDPIVAKSGR-DLIPVMVYGRNISGQEFGGMYC 413
Query: 970 AILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLV 1029
+L VN +VVSA + RIFGQE+AELP+VATS + QG+GYFQ L+ C+E LL LNV+ LV
Sbjct: 414 LVLIVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACVENLLSSLNVENLV 473
Query: 1030 LPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
LP+A EA++IWT KFGF+ M++++ +Y+ + L IF+GTSML+K VPK
Sbjct: 474 LPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKVPK 522
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 13/83 (15%)
Query: 410 FLESGILEGMSV-MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHA 468
F+ +G+ +G + Y++ K+ L+G +GSGI C C ++ ++P+ FE HA
Sbjct: 86 FMPNGLPDGTELAYYVKTQKL-------LQGYKQGSGIVCSC----CSREISPSQFEAHA 134
Query: 469 GSSNKRPP-EYIYLENGKTLRDI 490
G + +R P +I++ +G +L DI
Sbjct: 135 GMAARRQPYRHIFISSGLSLHDI 157
>gi|218197387|gb|EEC79814.1| hypothetical protein OsI_21258 [Oryza sativa Indica Group]
Length = 858
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 219/422 (51%), Positives = 272/422 (64%), Gaps = 20/422 (4%)
Query: 643 FVRGEVKF----LVGYKKGFGILCTC--CN-SEVSPSQFEAHAGWASRRKPFQHIYTSNG 695
F R +K L G G I C C CN S+VSPS FEAHAG +RRKP+ +I+TSNG
Sbjct: 317 FTRSLLKLQKAVLKGVIAGCNIRCFCLSCNGSKVSPSAFEAHAGEGTRRKPYDNIFTSNG 376
Query: 696 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
VSLHELS+K+S + S +E DDLC C GGD+ C CPR+FH CV L G+PS W+
Sbjct: 377 VSLHELSMKISKDMQLSERETDDLCRECGQGGDIFPCKMCPRSFHPACVGLSGVPS-EWY 435
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C N QKEK + N NA+AAGR GVD Q++ R IRIV ELG +D
Sbjct: 436 CDNCSNLVQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPNLWIELGQ------KD 489
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
F S F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++
Sbjct: 490 FNNSVFDERTVILCDQCEKEYHVGCLQSQWQVDLKELPEGEWFCCNSCSEIRSSLDKIIS 549
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
G L E+ +D+I+KKHE G + D+RWR+L G+K AS+ LLS AV I H
Sbjct: 550 DGALILAESDIDIIRKKHEMKGLSMDTNTDLRWRLLAGRK--ASEDGDLLLSAAVPIIHQ 607
Query: 936 RFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
FDPIIE S DLIP MV GR + GQDY GMYCA+LT+ VVSA + R+ G E+
Sbjct: 608 SFDPIIEVQSGRDLIPEMVNGRRPKDGMPGQDYSGMYCAVLTLGTSVVSAALLRVMGGEV 667
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AELPLVATS D QG GYFQ+LF CIE++L L +K +LP+A EA+ IW NKFGF+ + +
Sbjct: 668 AELPLVATSKDLQGLGYFQALFSCIERMLISLKIKHFMLPAAQEAEGIWMNKFGFTKIPQ 727
Query: 1052 EE 1053
E+
Sbjct: 728 EQ 729
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 109/253 (43%), Gaps = 64/253 (25%)
Query: 247 AVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLE 306
AV E K + +AM+++E G++ E + +++ ++++N + +
Sbjct: 198 AVAAETAKPD---MAMELQEPPTVTAANGRDVS--------HESFEQNLQHQVLDNALTD 246
Query: 307 SSMVGKHSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVEL-AKGS 365
S++ A+S A + G +P RRFTRSLL+ K E S
Sbjct: 247 PSLL--------------AESTATPASTAGL-------KPARRFTRSLLKNKPEEEPTAS 285
Query: 366 LSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIR 425
S+D V+ ++ D V + + P K R+F L L+ +L
Sbjct: 286 KSQDPA----VSMISEDNNEASV--DLALAPEKPQRRFTRSLLK-LQKAVL--------- 329
Query: 426 GSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPP-EYIYLENG 484
+GV+ G I CFC C G++ V+P+ FE HAG +R P + I+ NG
Sbjct: 330 ------------KGVIAGCNIRCFCLSCNGSK-VSPSAFEAHAGEGTRRKPYDNIFTSNG 376
Query: 485 KTLRDI-MNVCKD 496
+L ++ M + KD
Sbjct: 377 VSLHELSMKISKD 389
>gi|357472095|ref|XP_003606332.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507387|gb|AES88529.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 587
Score = 411 bits (1056), Expect = e-111, Method: Compositional matrix adjust.
Identities = 220/506 (43%), Positives = 302/506 (59%), Gaps = 39/506 (7%)
Query: 591 RSPEPNSAQTS-SHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFV----- 644
RS E + Q S H+ S+S S I +D +HKLVF+E LEDGA VGYFV
Sbjct: 56 RSNESKNLQPSPKHTTQTSNSHASPTTINYRDQCLHKLVFQENVLEDGAAVGYFVYEEKQ 115
Query: 645 -RGEVKFLVGYKKGFGILCTCC---------------NSEVSPSQFEAHAGWASRRKPFQ 688
+GE+ K GILC CC + +VSPS+FEAHAGWASRRKP+
Sbjct: 116 LQGEINI-----KQSGILCDCCKEVILEFFFCQIKKLDEQVSPSKFEAHAGWASRRKPYF 170
Query: 689 HIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 748
HI T++GVSLH+L+I + S +D+ C C G+LLCCD C RAFH+ C+ +
Sbjct: 171 HIRTTDGVSLHQLAINHRI----SISNSDEHCSKCKQRGNLLCCDGCQRAFHLGCIPVES 226
Query: 749 IPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGC 808
P W+C YC N QK+K VE+ N +I DP Q+ C V+ + E C
Sbjct: 227 PPKEKWYCEYCRNKLQKDKNVEHKENVVTTQKIIESDPSEQIAKICTLSVKHKEVEHSSC 286
Query: 809 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 868
LC R F F TV++CDQCE++YHVGCLKDH M +L+++PK W C DC I++
Sbjct: 287 ALCSERHFNNGEFSPWTVMICDQCEKDYHVGCLKDHNMANLKKVPKHYWFCGVDCYDIHM 346
Query: 869 ALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSK 928
L+ + RG+ L ++ L +IK K E+ G + D++W+V + + + T +LLS
Sbjct: 347 KLKNFMARGDVLLSDSLLSLIKNKKEQKGLETEFGLDIKWKVFNRQLIVSKIITSSLLSD 406
Query: 929 AVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYH-GMYCAILTVNQVVVSAGIFRIF 987
V+IFH++FD I+ + +K+DLIPAMV GR + + Y GMYCA+L VNQVVVSAGIFR+F
Sbjct: 407 VVTIFHEQFDSIVVTGTKIDLIPAMVKGRKIKDKYYFGGMYCAVLIVNQVVVSAGIFRVF 466
Query: 988 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1047
G+E+AEL L+AT + Q QG+F+ L CIE +L L V+ LVLP+A EA+++W +KFGF
Sbjct: 467 GKEVAELSLIATKAEYQKQGFFKCLLSCIENVLKELKVERLVLPAAHEAESMWIDKFGF- 525
Query: 1048 MMTEEEQ----NKYRNDYPLMIFQGT 1069
TE Q YR + + +G
Sbjct: 526 --TEPNQGLGRRYYRRSWSFHLNKGV 549
>gi|117166041|dbj|BAF36342.1| hypothetical protein [Ipomoea trifida]
Length = 770
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 307/850 (36%), Positives = 424/850 (49%), Gaps = 127/850 (14%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNVDSGCVLGPP 60
MAN T++ E VV+S IR G+KREF +K Q+E S+G+ R + Q+ +G P
Sbjct: 1 MANRTEAGE--VVISSIRTGMKREFAMMMKAQAECGISIGQKRVTRSQNGSLNGRASISP 58
Query: 61 EVKKLKTYESRKKRKRQEQSVVVKETEDKREEEVKSDVFDVINERERPIREKESKDDSEN 120
+ +T S KR+++E +VV + NE+ E SKD+ E
Sbjct: 59 Q-NGSRTKSSTVKRRKKEANVVTADQS-----------LSHSNEKLDKTEELRSKDEGEK 106
Query: 121 MGVGERGALMNVEEVKVVSERREEGNDEFGKVVIGVEEEKKNECDEVLMNVEENKYG--- 177
VE++ + SE E +D VV G E++KN V + E +
Sbjct: 107 -----------VEKLPLTSEWEEPKSD----VVDGASEDEKNGT-AVDSEIGEQETACVM 150
Query: 178 ---ELDGMGGSARTEEEKNECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDL 234
EL+ +GGS + + + V V+ E N L + E+ L
Sbjct: 151 EREELNSLGGSQKEKLINGDVAAKPVDVDNSGNGEENNGLVDCGGLFQKEL--------L 202
Query: 235 I-GEVKNEFEEVVAVVEEEKKDESDRVAMDVEEVKCEEVGLGKEYE-PGRVQMEMDEEKK 292
I G+V + +EV D+SD V C G+ KE E G V +M + K
Sbjct: 203 INGDVAEKPKEV---------DKSDNDKQSDSLVNCGG-GVEKEKELNGDVTEKMMDVDK 252
Query: 293 NDIERELVENGVLESSMVGKHSSTLCNGESNVAKSVAVDGND--------EGKTVNVVVE 344
D + +L + ++E +M S C +S K+ G+ E V+ E
Sbjct: 253 TDNKEQL--DSLMELAMEELPMSGDCKTKSEAIKADCAYGSALATVPESVENSKVSTSSE 310
Query: 345 RPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRKFY 404
+PLRRFTRS L+ K + S ++D + +G K E M + + K
Sbjct: 311 KPLRRFTRSCLKPKQQAMSASPAEDTKAEDALESDEASAIGTTSKLEMKMSKKVALVKIP 370
Query: 405 SKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVF 464
+KLK L +G+LEG+ V Y RG KG L+GV++GSGI CFC +C G +VVTP F
Sbjct: 371 TKLKGLLATGLLEGLPVRYARGRPEKG-----LQGVIQGSGILCFCQNCGGTKVVTPNQF 425
Query: 465 ELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNC 524
E+HAGSSNKRPPEYIYL+NGKTLRD++ CKD+P + LE A+R G+ +K+ CLNC
Sbjct: 426 EMHAGSSNKRPPEYIYLQNGKTLRDVLVACKDAPADALEAAIRNATGAGDARKSTVCLNC 485
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEP-LSHSSEME---------------- 567
+ S A L C SC+ K+SQ +++ + S ++E
Sbjct: 486 KASLPEASFGRPRLQCDSCMTSKKSQTTPSQVGDANCSRDGQLEFIFLLNYYWADDLYKL 545
Query: 568 PQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKL 627
P L+ SP+ + L +S E S+ T SK+ HG++TRKDLRMHKL
Sbjct: 546 GLPDLRGLQWSPSSNSVL-----KSTERMSSGTCPPSKV-------HGRLTRKDLRMHKL 593
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
VFE L DG + Y+VRG +VSPSQFEAHAG ASRRKP
Sbjct: 594 VFEGDVLPDGTALAYYVRG---------------------KVSPSQFEAHAGCASRRKPG 632
Query: 688 QHIYTSNGVSLHELS--IKLSLER---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHID 742
+ + + + L I LE+ P + DLC IC DGGDLLCCD+CPRAFH +
Sbjct: 633 WYWGKLHTLGVFNLKAVILFGLEKCKDPHLDGKWMDLCSICADGGDLLCCDNCPRAFHTE 692
Query: 743 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 802
CVSLP IP GTW+C+YC N F KEKF + +ANA AAGR+ G+D Q+ R IRIV T
Sbjct: 693 CVSLPNIPRGTWYCKYCENMFLKEKF-DRSANAIAAGRVAGIDALEQITKRSIRIVDTLH 751
Query: 803 TELGGCVLCR 812
E+G CVLCR
Sbjct: 752 AEVGVCVLCR 761
>gi|357472045|ref|XP_003606307.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355507362|gb|AES88504.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 680
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 223/526 (42%), Positives = 302/526 (57%), Gaps = 65/526 (12%)
Query: 541 KSCVELKESQAGSAEIKEPLSHSSEMEPQP----PSVELEESPAPSGELTDTSNRSPEPN 596
K CV+ ++++ S S +P P +++ SP T ++RSPEP
Sbjct: 107 KFCVDYRDAKVVSGLFSSSCEESKNRQPSPRHTIETIDSRVSP------TTVNHRSPEPV 160
Query: 597 SAQTSSHSKMKSSSV--KSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGY 654
QTSS+ MK ++ KS +ITRKD +HKLVF+E LEDGA VGYF RG+V
Sbjct: 161 VPQTSSYKGMKCNTYCDKSPRRITRKDRGLHKLVFQENMLEDGAAVGYFDRGKV------ 214
Query: 655 KKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK 714
SPS+FEAHAG ASRRKP+ +I T++GVSLHEL+ R S
Sbjct: 215 ---------------SPSKFEAHAGRASRRKPYSYIRTADGVSLHELAN----NRRISMS 255
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
++D+ C C G+LL CD C R+FH++C+ L P +C YC N F K+K V++ N
Sbjct: 256 DSDERCSHCEQVGNLLWCDRCQRSFHLECIPLESPPKRKRYCEYCRNKFHKDKNVKHKEN 315
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
A GRI DP Q+ C + + + G C LC RDF + G RTV++C QCE+
Sbjct: 316 DVATGRIAEGDPSEQITEVCTLSEKQKEVKDGPCALCSERDFNNNESGPRTVMICKQCEK 375
Query: 835 EYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE 894
E+HV CLKDH M +L ELPK KW C DC I++ LQKL+ RGE +L
Sbjct: 376 EFHVECLKDHNMANLVELPKDKWFCGIDCDDIHMKLQKLMARGEAEL------------- 422
Query: 895 ESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 954
D++WR+L K + L+SKA +IFH+RF I + +K+DLI AM+
Sbjct: 423 --------GLDIKWRLLNTKLNNPKHNISPLISKANAIFHERFKSIKDPKTKIDLIRAML 474
Query: 955 YGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQ------G 1007
YG GQ + GMYCA+L +V+ AGIFR+ GQE+AELPLVAT+ Q + G
Sbjct: 475 YGMEIEGQYSFEGMYCAVLYFKKVIACAGIFRVLGQEVAELPLVATTTKYQKRVILFTSG 534
Query: 1008 YFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
YF+SLF CIE +L L VKTLVLP+A EA+++W +KFGF+ ++E
Sbjct: 535 YFRSLFSCIENMLRHLKVKTLVLPAAHEAESMWIDKFGFTKPNDKE 580
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 33/51 (64%), Gaps = 1/51 (1%)
Query: 1 MANGTDSEEKFVVLSKIRVGLKREFEFALKVQSEICGSLGRTRARKVQSNV 51
M N DSE FV+ SK+R G KREF FAL SEI SL +TR RK Q+ V
Sbjct: 1 MTNFNDSE-LFVIKSKVRTGHKREFSFALNSYSEIGTSLSKTRPRKNQNMV 50
>gi|413935127|gb|AFW69678.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 341
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 193/337 (57%), Positives = 236/337 (70%), Gaps = 4/337 (1%)
Query: 743 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 802
CV L GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R IRI T +
Sbjct: 5 CVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTLE 64
Query: 803 TELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 862
T GGC LC+ DF K +F RTV+LCDQC REYHVGCLK+H M DL LP+G W C D
Sbjct: 65 TGFGGCALCKLHDFSKKKFSTRTVLLCDQCGREYHVGCLKEHNMADLTALPEGAWYCSTD 124
Query: 863 CKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG-SDNAVDFDVRWRVLRGKKVDASDG 921
C RIN LQ L++ G E + LDVIKKK E G +D+ D DVRWRVL+ K +SD
Sbjct: 125 CVRINQTLQDLLNSGGEPVLAMDLDVIKKKREVKGFNDDDADLDVRWRVLKDK---SSDD 181
Query: 922 TRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSA 981
++ +LSKAV+IFH+ FDPII+ ++ DLIPAMVYGRS R QDY GMYC +LTVN +VVSA
Sbjct: 182 SKLVLSKAVAIFHETFDPIIQVSTGRDLIPAMVYGRSARDQDYTGMYCTVLTVNNIVVSA 241
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
G+FRI G E+AELPLVATS D QG GYFQ+LF CIE+LL L VK VLP+A EA++IWT
Sbjct: 242 GLFRIMGSEIAELPLVATSRDRQGLGYFQALFSCIERLLSSLEVKHFVLPAAEEAESIWT 301
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+FGF+ ++++E +Y +FQGTS L K V K
Sbjct: 302 ERFGFAKISQDELREYLKGGRTTVFQGTSNLHKLVAK 338
>gi|326494740|dbj|BAJ94489.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/386 (50%), Positives = 250/386 (64%), Gaps = 7/386 (1%)
Query: 696 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
VSLHELS+ +S + S +E DDLC C GGD+ C CPR+FH CV LP +PS W
Sbjct: 2 VSLHELSMSISKDMQLSERETDDLCRECGLGGDIFPCRMCPRSFHPACVGLPVVPSEEWF 61
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+ +D
Sbjct: 62 CDNCTILVQKEKALAANKNAKAAGRQAGVDSIEQILKRAIRIVPICD-DLGGCALCKKKD 120
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
F + F RTVILCDQCE+EYHVGCL+ DL+ELP+G+W CC C I +L K++
Sbjct: 121 FNNAVFDERTVILCDQCEKEYHVGCLRSEWQVDLKELPEGEWFCCDSCSEIRSSLDKMIS 180
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
G L E+ +D+I+KKHE G + ++RW+++ G+ A++ +LLS AV + H
Sbjct: 181 GGAHPLSESDVDIIRKKHESKGLVMDANTEIRWQLVAGRS--ATEDGNSLLSSAVPVIHQ 238
Query: 936 RFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
FDPIIE+ + DLIP MV+GR + GQDY GMYCA+LTV VVSA + R+ G ++
Sbjct: 239 SFDPIIEAHTGRDLIPEMVHGRRPKEGMPGQDYSGMYCAVLTVGSTVVSAALLRVMGGDV 298
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AELPLVATS D QG GYFQ LF CIE+LL L VK +LP+A EA+AIW KFGFS + +
Sbjct: 299 AELPLVATSMDLQGLGYFQVLFSCIERLLVSLKVKHFMLPAAHEAEAIWMKKFGFSKIPQ 358
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQKPVP 1077
++ Y N L +F GT L K +P
Sbjct: 359 DQMEAYLNGGHLTVFHGTLNLYKAIP 384
>gi|125556844|gb|EAZ02450.1| hypothetical protein OsI_24553 [Oryza sativa Indica Group]
Length = 565
Score = 369 bits (948), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 185/370 (50%), Positives = 235/370 (63%), Gaps = 41/370 (11%)
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 768
R S++++DDLC IC DGG+LL CDSCPRAFH +CV IP GTW CRYC N Q+E
Sbjct: 234 RNMSNRQSDDLCSICSDGGELLLCDSCPRAFHRECVGFTTIPRGTWCCRYCENRQQRESS 293
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
+ YN NA AAGRI+G+DP Q+ +R IRI TP T GGC LCR DF K +F RTV+L
Sbjct: 294 LAYNHNAIAAGRIDGIDPMEQIFTRSIRIATTPVTGFGGCALCRLHDFSKKKFSARTVLL 353
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDV 888
CD Q LP+G W C ADC RI+ L+ L+ RG E + +++
Sbjct: 354 CD-------------------QALPEGAWYCTADCVRISETLKDLLSRGAEPISSVDVEI 394
Query: 889 IKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLD 948
IK+K+E+ + D DVRWRVL+ K +S ++ +LSKAV+IFH+ FDPII+ A+ D
Sbjct: 395 IKRKYEQKALNKDGDLDVRWRVLKDK---SSADSKLVLSKAVAIFHESFDPIIQIATGRD 451
Query: 949 LIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1008
LIPAMVYG + VVSAG+FR+ G E+AELPLVATS D QG GY
Sbjct: 452 LIPAMVYGNT-------------------VVSAGLFRVMGSEIAELPLVATSRDSQGLGY 492
Query: 1009 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQG 1068
FQ+LF CIE+LL L VK VLP+A EA++IWT +FGF +T++E +Y +FQG
Sbjct: 493 FQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTVFQG 552
Query: 1069 TSMLQKPVPK 1078
TS L K VPK
Sbjct: 553 TSTLHKLVPK 562
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 83/147 (56%), Gaps = 3/147 (2%)
Query: 432 PGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM 491
P + LRGV+K GI C C CKG VV+P FE+HAGS+ K P +YI+LENG L DI+
Sbjct: 70 PPRAVLRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAGSTKKHPSDYIFLENGNNLHDIL 129
Query: 492 NVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQA 551
C D+ L+ L+ A++ +G + K+ C C+ SF+ + LLC SC+E K SQ
Sbjct: 130 RACSDATLDMLQSAIQNAIGPAPKKRTFRCQTCKSSFATLRTGKFALLCDSCLESKGSQN 189
Query: 552 GSAEIK---EPLSHSSEMEPQPPSVEL 575
+ K P S + + + P +L
Sbjct: 190 STRTSKIGRNPTSSARRSKNESPGSKL 216
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 34/55 (61%), Gaps = 5/55 (9%)
Query: 651 LVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 701
L G K GILC+C + + VSP FE HAG ++++ P +I+ NG +LH++
Sbjct: 75 LRGVIKRVGILCSCSSCKGRTVVSPYYFEVHAG-STKKHPSDYIFLENGNNLHDI 128
>gi|224099259|ref|XP_002334497.1| predicted protein [Populus trichocarpa]
gi|222872483|gb|EEF09614.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 355 bits (911), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 174/368 (47%), Positives = 240/368 (65%), Gaps = 27/368 (7%)
Query: 641 GYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE 700
G+F + K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+P++HIYTSNG++LH+
Sbjct: 13 GFFFQ---KILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLTLHD 69
Query: 701 LSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
++I L+ + ++ DD+C C DGGDL+ C SCPRAFH C+ L P G WHC C
Sbjct: 70 IAISLANGQNITTGIGDDMCAECGDGGDLMFCQSCPRAFHAACLDLHDTPEGAWHCPNCN 129
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
F +V R R+V+TP+ ++GGC +CR DF
Sbjct: 130 KLGHGGNFAR------------------PIVIRLTRVVKTPEYDVGGCAVCRAHDFSGDT 171
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 880
F RTVILCDQCE+E+HVGCL++ G+ DL+E+PK W CC DC I +AL+ V G +
Sbjct: 172 FDDRTVILCDQCEKEFHVGCLRESGLCDLKEIPKDNWFCCQDCNNIYVALRNSVSTGVQT 231
Query: 881 LPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
+P + L++I +KH E G D A +DV+W++L GK + D +LLS A +IF + FD
Sbjct: 232 IPASLLNIINRKHVEKGLLVDEAA-YDVQWQILMGKSRNRED--LSLLSGAAAIFRECFD 288
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVA 998
PI+ + DLIP MVYGR+ GQ++ GMYC +LTV VVVSAG+ RIFG+E+AELPLVA
Sbjct: 289 PIVAKTGR-DLIPVMVYGRNISGQEFGGMYCVLLTVRHVVVSAGLLRIFGREVAELPLVA 347
Query: 999 TSNDCQGQ 1006
T+ + QG+
Sbjct: 348 TNREHQGK 355
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 33/55 (60%), Gaps = 5/55 (9%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPP-EYIYLENGKTLRDI 490
L G +G+GI C C + + ++P+ FE HAG S +R P +IY NG TL DI
Sbjct: 20 LGGYKQGNGIVCSCCEVE----ISPSQFESHAGMSARRQPYRHIYTSNGLTLHDI 70
>gi|168035064|ref|XP_001770031.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678752|gb|EDQ65207.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1038
Score = 335 bits (859), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 234/728 (32%), Positives = 375/728 (51%), Gaps = 102/728 (14%)
Query: 377 TEVANDGVGGPVKQ-ETVMKPRKVMRKFYSKLKNFLESG---------ILEGMSVMYIRG 426
+E +DG P ++ + + +P + R KLK+ + G E M+ + G
Sbjct: 245 SEEGSDGQKSPEERGDAIARPSQAPRAAAIKLKSMVPVGDKLLKAPRNAKELMATRLMEG 304
Query: 427 SKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
V+ G+ L G++K G+ C C +C+G+ +V+ + FE H+GS++ P + IYLENG
Sbjct: 305 HHVRCSCRGIQ-LTGMLKDMGVQCDCRNCRGSVIVSISAFEAHSGSTSHHPSDNIYLENG 363
Query: 485 KTLRDIMNV------CKDSPLETLEKAVRMVLG-----SSSMKKAN-------FCLNCRV 526
K LRDI++ C D+ L L+ A+ V G S K N +C R
Sbjct: 364 KNLRDILSAGQEAADCGDNILRALKMAIGDVQGVEKSKSKCAKCGNPEEGDLIYCKGARC 423
Query: 527 SF-SNAGVEELM------LLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
S +++G E+ C C + K+ A S ++K P+S +E E
Sbjct: 424 SVVAHSGCVEIANPHLGDWFCGKCEKTKKPHA-SVKVKRPISSGAEKE------------ 470
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
S+++ +++R D +HK +F GGLE+G E
Sbjct: 471 -----------------------DSRVREKDATVSARLSR-DAHLHKALFLPGGLENGTE 506
Query: 640 VGYFVRGEVKFLVGYKKGFGILCTCCN----SEVSPSQFEAHAGWASRRKPFQHIYT-SN 694
+GY+ + ++K L G K+G GI C+CCN S++S +FE HAG +RR P+ +I +
Sbjct: 507 LGYYTKSQLK-LKGVKRGKGICCSCCNKEASSDISCFEFEQHAGCEARRNPYGNILVLVD 565
Query: 695 GVSLHELSIKLSLERPFSSKEN-------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
G SL ++ L+ + ++N + C C G+L C+ C A+ +C
Sbjct: 566 GRSLKDVCKDLTHKNKLGEQQNCEPLARDVNCCYECSSSGELKTCNGCEEAWCDNCTKGE 625
Query: 748 GIPSGT-WHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDTE 804
+ S + W+CR C N K V N + E ++ RCIR ++ E
Sbjct: 626 EVDSDSKWYCRMCRNDTLK---VAQNGQKVSGKHQEESSSITEIDERGRCIRHLEG-HRE 681
Query: 805 LGGCVLCRGRDFCKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
+GGC +C+ + K+ F T+++CDQC REYHV CLKD GM++L ELP+G+W C C
Sbjct: 682 VGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGMDNLNELPEGEWFCQKGC 741
Query: 864 KRINLALQKLVDRGEEKLPETSL-DVIKKKHEESG---SDNAVDFDVRWRVLRGKKVDAS 919
K I+ L +LV G E L + + ++ + + ++SG ++ W++L GK +S
Sbjct: 742 KVIDEILTQLVAIGPESLSHSIISELPENRQQKSGVIEKAESISPSFEWQILCGK--GSS 799
Query: 920 DGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVV 979
L++AV+IF + DPI ++ + +LIP MV R + D+ G++C +L +N VV
Sbjct: 800 PANIQTLAEAVNIFTECSDPIRDAKTGKNLIPLMVQSRRTKDYDFEGVFCVVLKLNGKVV 859
Query: 980 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1039
SA + +IFG+E AE+PLVATS QGQG+ ++L IE+LLG L+V+ LVLP+A + +++
Sbjct: 860 SAALLQIFGREFAEVPLVATSLPHQGQGFCKALMTTIERLLGVLSVERLVLPTAKDTESL 919
Query: 1040 WTNKFGFS 1047
W NKFGFS
Sbjct: 920 WVNKFGFS 927
>gi|353441170|gb|AEQ94169.1| PHD finger transcription factor [Elaeis guineensis]
Length = 276
Score = 327 bits (839), Expect = 2e-86, Method: Composition-based stats.
Identities = 157/274 (57%), Positives = 198/274 (72%), Gaps = 2/274 (0%)
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 874
DF KS F RTVI+CDQCEREYHVGCLK+H M DL+ELP+G+W C +DC RI+ ALQ L+
Sbjct: 3 DFSKSGFSDRTVIICDQCEREYHVGCLKEHKMADLKELPEGEWFCTSDCCRIHSALQTLL 62
Query: 875 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFH 934
RG + LP +DVI+KK + G + + D+RW++L GK DA +R LLSKAV+IFH
Sbjct: 63 LRGAQPLPLLDVDVIRKKCDIKGFNIGANTDIRWQLLSGKTADAE--SRLLLSKAVAIFH 120
Query: 935 DRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 994
+ FDPI+++ + DLIP MVYGR+ R QDY G+YCA+LTV VVSAGI R+ G E+AEL
Sbjct: 121 ESFDPIVDATTGRDLIPTMVYGRTVRDQDYGGIYCALLTVGSSVVSAGILRVLGSEIAEL 180
Query: 995 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1054
PLVATS + QGQGYFQSLF CIE+LL L VK VLP+A EA++IWT KFGF+ +T +E
Sbjct: 181 PLVATSREHQGQGYFQSLFSCIERLLVTLKVKHFVLPAADEAESIWTKKFGFTKITSDEL 240
Query: 1055 NKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
+KY N +FQGTS L KPV + + G
Sbjct: 241 HKYLNGARTTVFQGTSTLHKPVTVPHVSSRETQG 274
>gi|168015596|ref|XP_001760336.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688350|gb|EDQ74727.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1489
Score = 322 bits (826), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 227/667 (34%), Positives = 340/667 (50%), Gaps = 65/667 (9%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV--- 493
L G+++ G+ C C CKG QVV+ + FE H+GS++ P IYLENGK LRDI++
Sbjct: 570 LTGILQDMGVVCNCRICKGTQVVSISAFEAHSGSTSHHPSHNIYLENGKNLRDILSAGQE 629
Query: 494 ---CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQ 550
C L L+ A+ + G KK C C V SC E K S
Sbjct: 630 SADCGGDILGALKHAIGEIQGIP--KKEGACGKCGKREGGDFV--------SCKEPKCSA 679
Query: 551 AGSAE-IKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSS 609
AE + P H ++ E + P T + P + + + K K
Sbjct: 680 VYHAECVGLPSPH--RVDWFCAKCEKAQVKMPK---TVLKMKRPPAVTDREDTRLKEKEL 734
Query: 610 SVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEV 669
+V+S +D +HK +F GGL DG E+GY+ R + L G K+G GI C CCN E+
Sbjct: 735 TVRS-----ARDAHLHKALFLPGGLADGTELGYYARNQC-ILKGVKQGGGICCKCCNQEI 788
Query: 670 SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDG--- 726
S S FE HAG SRR P+ I ++G SL ++ +L+ + SK D + G
Sbjct: 789 SCSAFEQHAGCESRRNPYGSILLADGRSLKDMCKELA----YQSKLGDRAHQVARTGDVK 844
Query: 727 --------GDLLCCDSCPRAFHIDC---VSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
G L C + I+ S SG C+ C +K VE A+
Sbjct: 845 SSSGSEEQGVLASSQRCESTWCINFGTRFSCQEADSGHPLCKIC------QKNVE-GAHK 897
Query: 776 RAAGRIEGVD--PFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 833
+ R++ P R +R+ Q PD+ GC +C+ K F T+++CDQC
Sbjct: 898 TSKKRVDATANIPATDDTGRNVRLFQAPDSS-SGCAICKKWTLKKCGFDM-TMLVCDQCG 955
Query: 834 REYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL-DVIKKK 892
REYHVGCL++ G+ L ELP+ +W C +C+ I L +LV G E L + + D+++ +
Sbjct: 956 REYHVGCLRESGI--LDELPEAEWYCQPNCQHIVQVLSQLVANGPELLSDNIVNDLLESR 1013
Query: 893 HEESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDL 949
+ G + W++L G + +G L++AV IF + DPI ++ S ++
Sbjct: 1014 QHQQGIVEMAESSSPVFGWQILHGAGENPVNGR--TLAQAVEIFTECSDPIKDAPSGQNM 1071
Query: 950 IPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYF 1009
IP MVY R + D+ G+YC +LT+N+ VVS + +IFG+E+AE+PL+ATS D Q QG+
Sbjct: 1072 IPIMVYSRRFKDYDFDGIYCVVLTLNEKVVSTALLQIFGREVAEVPLIATSVDHQDQGFC 1131
Query: 1010 QSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGT 1069
++L IE+LLG LNV+ LVLP++ A+ +W N+FGFS M + + R+ L++F GT
Sbjct: 1132 KALMTTIERLLGVLNVERLVLPASKNAEFVWVNRFGFSRMEDAQLKHIRSMMGLLVFTGT 1191
Query: 1070 SMLQKPV 1076
+ML K +
Sbjct: 1192 TMLVKHI 1198
>gi|168020788|ref|XP_001762924.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685736|gb|EDQ72129.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1100
Score = 319 bits (818), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 231/731 (31%), Positives = 378/731 (51%), Gaps = 87/731 (11%)
Query: 374 SDVTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESG---------ILEGMSVMYI 424
SD E +G GPV+ P + R +KLK+ + G E M+ +
Sbjct: 350 SDGQESHEEGDTGPVR------PSQAPRAAATKLKSMVPIGDKLLKAPRNAKELMATRLM 403
Query: 425 RGSKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLE 482
G V+ G+ L G++K G+ C C +CK + +V+ + FE H+GS++ P + IYLE
Sbjct: 404 EGHFVRCSCRGIQ-LTGMLKDMGVRCDCRNCKSSVIVSISAFEAHSGSTSHHPSDNIYLE 462
Query: 483 NGKTLRDIMNV------CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL 536
NGK LRDI++ C D+ L L+ A+ + G K C C S E
Sbjct: 463 NGKNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKWKVT--CAKCWNSD-----EGD 515
Query: 537 MLLCKS--CVELKESQA---GSAEIKEPLSHSSEMEPQP-PSVELEESPAPSGELTDTSN 590
++ CK C + S+ + + + E +P +V+++ S + E D
Sbjct: 516 LIYCKGARCSIIAHSRCIGISNPRLGDWFCDKCEKMKKPHATVKVKRSISSGTEKDD--G 573
Query: 591 RSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKF 650
R E ++ +++ +D +HK +F GGLEDG E+GY+ + ++K
Sbjct: 574 RVREKDATESTR---------------LNRDAHLHKALFLPGGLEDGTELGYYTKSQLK- 617
Query: 651 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIY-TSNGVSLHELSIKLSLER 709
L G K+G E++ +FE HAG +RR P+ +I ++G SL ++ +L+ +
Sbjct: 618 LKGVKRGEAFKKVVV--EINCYKFEQHAGCEARRNPYGNILLVADGRSLKDVCKELAHKN 675
Query: 710 PFSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDCV-SLPGIPSGTWHCRYC-- 759
KE + C C G+L C C + C L G W+CR C
Sbjct: 676 KLGEKEKRVARAGKVNSCYECGTRGELKNCHGCVETWCNSCTKGLETDSDGKWYCRMCRQ 735
Query: 760 --MNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDTELGGCVLCRGRD 815
+N Q E+ R+ IEG+ A+ RC+R ++ E+GGC +C+ +
Sbjct: 736 DTLNVAQIEQ-------KRSNKHIEGMSNIAETDERDRCVRHLEG-HREVGGCAICKKWN 787
Query: 816 FCKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLV 874
K+ F T+++CDQC REYHV CLKD G++DL ELP+G+W C DCK I+ L +LV
Sbjct: 788 LSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPEGEWFCQKDCKVIDEILTQLV 847
Query: 875 DRGEEKLPETSL-DVIKKKHEESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAV 930
G E L ++ + ++++ + +++G+ + + W++L GK + ++ L++A+
Sbjct: 848 ANGPELLTDSIISELLESRQQQTGAKDKAESSCPSFAWQILCGKSGNTAN--TQTLAEAI 905
Query: 931 SIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
+IF + DPI ++ + +LIP MV R + D+ G++C +L +N+ VVSA + +IFG E
Sbjct: 906 NIFTECSDPIRDAKTGKNLIPLMVQSRRSKDHDFEGVFCIVLKLNEKVVSAALLQIFGGE 965
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
+AE+PLVATS QGQG+ ++L IE+LLG L+V+ LVLP+A ++IW NKFGFS +
Sbjct: 966 IAEVPLVATSLTHQGQGFCKALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGFSRVP 1025
Query: 1051 EEEQNKYRNDY 1061
++E + R +
Sbjct: 1026 DDEGSFLRCTF 1036
>gi|168066393|ref|XP_001785123.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663302|gb|EDQ50074.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1314
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 251/818 (30%), Positives = 386/818 (47%), Gaps = 148/818 (18%)
Query: 377 TEVANDGVGGPVKQETVM-KPRKVMRKFYSKLKNFLESG---------ILEGMSVMYIRG 426
+E +DG P + + V+ +P + R +KLK+ + G E M+ + G
Sbjct: 522 SEEGSDGQDSPEEGDDVLVRPSQAPRAAATKLKSMVPVGDKLVRAPRNAKELMATRLMEG 581
Query: 427 SKVKGP--GVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
V+ G+ L G++K G+ C C +CKG+ +V+ + FE H+GS++ P + IYLENG
Sbjct: 582 HFVRCSCRGIQ-LTGMLKDMGVQCNCRNCKGSVIVSISAFEAHSGSTSHHPSDNIYLENG 640
Query: 485 KTLRDIMNV------CKDSPLETLEKAVRMVLG------------SSSMKKANFCLNCRV 526
K LRDI++ C D+ L L+ A+ + G SS +C R
Sbjct: 641 KNLRDILSAGQEAADCGDNILRALKMAIGDIQGVEKRKVTCAKCESSQEGDLIYCKGARC 700
Query: 527 SFSN----AGVEELML---LCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
S G+ L C C + K+ A +A++K +S ++ P+ V L
Sbjct: 701 SVVAHSRCVGIANPQLGDWFCGKCEKTKKHHA-AAKVKRSISGGAD--PEDGKVRLL--- 754
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
R + ++ ++S K ++SV+ +D +HK +F GGL DG E
Sbjct: 755 -----------RDLQGYASPSASKDKAATASVR-----LNRDAHLHKALFLPGGLVDGTE 798
Query: 640 VGYFVRGEVKFLVGYKKGFGILCTCCNSEVS------------PSQ-------------- 673
+GY+ + ++K L G K+G GI C+CCN E PSQ
Sbjct: 799 LGYYTKSQLK-LKGVKRGEGICCSCCNEETRHVQGGSRGGLMFPSQQDGTYYVFSQVKEI 857
Query: 674 --------------------------FEAHAGWASRRKP-FQHIY-----TSN------- 694
FE HAG +RR P + H+ TSN
Sbjct: 858 HVTTFNEEMKFLSNDLDDIQQISCYEFEQHAGCEARRNPGYYHVVAKYAETSNEYDSYGN 917
Query: 695 ------GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG 748
G SL ++ L+ + KE G L DS + VSL G
Sbjct: 918 ILLVPDGRSLKDVCKDLAHKNKLDEKEKRAARGGESRFSLELEQDSVFNNSTVAIVSLIG 977
Query: 749 I---PSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM--VSRCIRIVQTPDT 803
I G W+CR C K V N + +EG+ A+ RCIR + P
Sbjct: 978 IEKDSEGRWYCRMCRQDSLK---VAQNGQKGSEKIMEGMSNIAETNEQGRCIRHLDGP-R 1033
Query: 804 ELGGCVLCRGRDFCKSRF-GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCAD 862
E+GGC +C+ + K+ F T+++CDQC REYHV CLKD G++DL ELP G+W C D
Sbjct: 1034 EVGGCAICKKWNLSKTGFVDGMTILVCDQCGREYHVSCLKDSGVDDLNELPDGEWFCHKD 1093
Query: 863 CKRINLALQKLVDRGEEKLPETSLD-VIKKKHEESGSDNAVDFD---VRWRVLRGKKVDA 918
CK I+ L +LV G E L +++ +++ + + S + ++ W++L GK
Sbjct: 1094 CKVIDEILAQLVANGPELLSNSTISGLLESRQQLSSAKEKIESSNPRFEWQILCGKGSSP 1153
Query: 919 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVV 978
+D L++A +IF D DPI + + +LIP MV R + D+ G++C +L +N V
Sbjct: 1154 AD--VQTLAEAENIFTDCSDPIRDVKTGKNLIPLMVQSRRTKDHDFEGVFCVVLKLNGKV 1211
Query: 979 VSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQA 1038
VSA + +IFG+E+AE+PL+ATS GQ + ++L IE+LLG L+V+ LVLP+A ++
Sbjct: 1212 VSAALLQIFGREIAEVPLIATSLSHHGQPFCKALMTTIERLLGVLSVERLVLPTAKSTES 1271
Query: 1039 IWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+W NKFGFS + E++ L F GTSM+ K +
Sbjct: 1272 VWINKFGFSRVQEDQLKSICTTIRLTTFTGTSMVVKAI 1309
>gi|224090665|ref|XP_002309048.1| predicted protein [Populus trichocarpa]
gi|222855024|gb|EEE92571.1| predicted protein [Populus trichocarpa]
Length = 805
Score = 312 bits (799), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 222/702 (31%), Positives = 343/702 (48%), Gaps = 121/702 (17%)
Query: 391 ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
E M + V + + +K L +GIL+ V YI S + L G++ G G C C
Sbjct: 208 ELNMSKKVVPNNYPTNVKKLLATGILDRARVKYICFSSERE-----LDGIIDGGGYLCGC 262
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVL 510
C ++V++ FE HAG+ + P +IYLENGK + I+ K +PL ++ ++ V
Sbjct: 263 SSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKTAPLSMIDGVIKDVA 322
Query: 511 GSSSMKKANFCLNCRVSFSNA--GVEELMLLCKSCVELKESQAGSAEIKE---PLSHSSE 565
GSS ++ ++ SNA G ++ C+ S +KE P+S S
Sbjct: 323 GSSINEEFFRVWKASLNQSNALVGADKKCHSELPCLPHSHVSYASQALKESFCPISSSFL 382
Query: 566 MEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMH 625
S ++ SG TS R P+ S +K K ++ + ++D +H
Sbjct: 383 YNNNFVSQQMYMET--SGVNKQTSKR---PSLYFPGSATKQKKTA---ESGVRKRDNDLH 434
Query: 626 KLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+L+F GL DG E+ Y+V+G+ K L GYK+G GI+C+CC E+SPSQFE+HAG ++RR+
Sbjct: 435 RLLFMPNGLPDGTELAYYVKGQ-KILGGYKQGNGIVCSCCEVEISPSQFESHAGMSARRQ 493
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P++HIYTSNG++LH+++I L+ + ++ DD+C C DGGDL+ +H+
Sbjct: 494 PYRHIYTSNGLTLHDIAISLANGQNITTGIGDDMCAECGDGGDLM--------WHV---- 541
Query: 746 LPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTEL 805
W R + +I+G + V R R+ + P+ +
Sbjct: 542 --------WIYRILLKVL-------------GIVQIDGGNFARPTVIRLTRVGKIPEYNV 580
Query: 806 GGCVLCRGR------DFCKSRFGR-RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 858
G CV+CR DF + + V+L ++E+HVGCL++ G+ DL+E+P+ W
Sbjct: 581 GDCVVCRLNLLKFLIDFTLANSRKCLNVMLSFSAKKEFHVGCLRESGLCDLEEIPEDNWF 640
Query: 859 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESG--SDNAVDFDVRWRVLRGKKV 916
CC DC I +AL+ V G +K+P + L++I +KH E G D A +DV+W++L GK
Sbjct: 641 CCQDCNNIYVALRNSVSTGVQKIPASLLNIINRKHVEKGLLVDEAA-YDVQWQILMGKSR 699
Query: 917 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQ 976
+ D +LLS A +IF + S + H M C
Sbjct: 700 NRED--LSLLSGAAAIFR------VSSCT------------------LHLMKC------- 726
Query: 977 VVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEA 1036
F F L F CIE+LL LNV+ LVLP+ A
Sbjct: 727 -----WFFLAFNSLFNPL------------------FSCIERLLCSLNVEQLVLPA---A 760
Query: 1037 QAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ IWT +FGF M+E + KY ++ L IF+GTSML+K VP+
Sbjct: 761 ETIWTRRFGFRKMSEGQLLKYTREFQLTIFKGTSMLEKEVPR 802
>gi|413953619|gb|AFW86268.1| hypothetical protein ZEAMMB73_978394 [Zea mays]
Length = 283
Score = 285 bits (728), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/272 (52%), Positives = 178/272 (65%), Gaps = 6/272 (2%)
Query: 810 LCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLA 869
LCR +DF + F RTVILCDQCE+EYHVGCLK+ +L+ELP+G+W CC+ C +
Sbjct: 12 LCRQKDFNNAVFDERTVILCDQCEKEYHVGCLKNQWQVELKELPEGEWFCCSSCSETRSS 71
Query: 870 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
L K++ G + L E L++IKKKH G D++W++L GK+ DG+ LLS A
Sbjct: 72 LDKIISDGAQLLVEPDLEIIKKKHVTRGLCMDTSKDLKWQLLSGKRT-TEDGS-ILLSAA 129
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAGIFR 985
V IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA + R
Sbjct: 130 VPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAALLR 189
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW KFG
Sbjct: 190 VMGGDVAELPLVATSQDVQGLGYFQALFSCIERMLVSLKIKHFVLPAAHEAEGIWMKKFG 249
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
FS T EE Y N L IF GTS L K VP
Sbjct: 250 FSRTTPEELEAYLNGAHLTIFHGTSYLYKAVP 281
>gi|168026535|ref|XP_001765787.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682964|gb|EDQ69378.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1334
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 223/741 (30%), Positives = 351/741 (47%), Gaps = 121/741 (16%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV--- 493
L G++K G+ C C +CKG+ +V+ + FE H+GS++ P + IYLENGK LRD+++
Sbjct: 545 LTGMLKDMGVQCNCRNCKGSMIVSISAFEAHSGSTSHHPSDNIYLENGKNLRDVLSAGQE 604
Query: 494 ---CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL-MLLCKSCVELKES 549
C D+ L L+ A+ + G K C C G EE ++ CK
Sbjct: 605 AADCGDNILRALKMAIGDIQGVEKSKVT--CAEC------GGSEEGDLIYCK-------- 648
Query: 550 QAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSS----HSK 605
+SHS + P + G+ T R ++ S S+
Sbjct: 649 ----GARCSVVSHSRCVGSANPQL----GDWFCGKCEKTKKRHAAAKVKRSISAGTEDSE 700
Query: 606 MKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCC 665
++ + + ++ R D + K +F GGL DG E+GY+ + ++K L G K+G GI C+CC
Sbjct: 701 VRDKATTASARLNR-DAHLRKALFLPGGLVDGTELGYYTKSQLK-LKGVKRGEGICCSCC 758
Query: 666 NSEVSPSQFEAHAGWASRRKPFQHIY-TSNGVSLHELSIKLSLERPFSSKENDDL----- 719
N E+S +FE HAG +RR P+ +I ++G SL ++S +L+ + KE D
Sbjct: 759 NKEISCYEFEQHAGCEARRNPYGNILLVADGRSLKDVSKELADKNKLGEKEKRDARAGEV 818
Query: 720 -CGICMDGGDLLCCDSCPRAFHIDCV-SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARA 777
C C + G+L C SC A+ C + G W+CR C Q V N +
Sbjct: 819 CCYECSNSGELKRCHSCEEAWCDKCTKGMETDSEGRWYCRMCR---QDSLKVAQNGHKGT 875
Query: 778 AGRIEGVDPFAQM--VSRCIRIVQTPDTELGGCVLCRGRDFCKSRF-GRRTVILCDQCER 834
IEG+ A+ RC+R ++ P E+GGC +C+ + K+ F T+++CDQ
Sbjct: 876 DKIIEGMSNIAETDEKGRCVRHLEGP-REVGGCAICKKWNLSKTGFVDGMTILVCDQVRS 934
Query: 835 EYHVG-----CLKDHGMED-----------LQELPKGKWLCCADCKRINLALQKLVDRGE 878
+ K +G D QELP+G+W C DCK I+ L +LV G
Sbjct: 935 LNQMLPGTRITWKVNGFTDPNILMQRHCITSQELPEGEWFCQKDCKVIDEILTQLVANGP 994
Query: 879 EKLPETSL-DVIKKKHEESGSDNAVDFD---VRWRVLRGKKVDASDGTRALLSKAVSIF- 933
E L + + ++++ + ++S ++ W++L G+ S L++A +IF
Sbjct: 995 ELLSNSIISELLESRQQQSSVKVKLESSNPRFGWQILCGE--GGSSANVQTLAEAANIFT 1052
Query: 934 ---------------------------------------HDRFDPIIESASKLDL----I 950
FDP S D+ +
Sbjct: 1053 SIDDINLPYLWLVVGNYSPSTHTPVSSTGMFGSNQGCKNWKEFDPSHGSECHEDIHSLNV 1112
Query: 951 PAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQ 1010
+V R + D+ G++C +L +N+ VVSA + +IFG+E+AE+PL+ATS QGQG+ +
Sbjct: 1113 ALVVCSRRAKDHDFEGVFCVVLKLNEKVVSAALLQIFGREIAEVPLIATSLPHQGQGFCK 1172
Query: 1011 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTS 1070
+L IE+LLG L+V+ LVLP+A ++IW NKFGFS + E++ + LM F GT
Sbjct: 1173 ALMTTIERLLGVLSVERLVLPTAKNTESIWINKFGFSRVPEDQLKRICTTIRLMTFTGTR 1232
Query: 1071 MLQK---PVPKCRIVGKSVDG 1088
ML K P+ RI +S DG
Sbjct: 1233 MLGKAITPMTLNRIQRQSRDG 1253
Score = 43.9 bits (102), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 54/105 (51%), Gaps = 12/105 (11%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
V+ + ++ R + + FL G+++G + Y S++K L+GV +G GI
Sbjct: 701 VRDKATTASARLNRDAHLRKALFLPGGLVDGTELGYYTKSQLK------LKGVKRGEGIC 754
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPP--EYIYLENGKTLRDI 490
C C C N+ ++ FE HAG +R P + + +G++L+D+
Sbjct: 755 CSC--C--NKEISCYEFEQHAGCEARRNPYGNILLVADGRSLKDV 795
Score = 43.5 bits (101), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 64/152 (42%), Gaps = 27/152 (17%)
Query: 641 GYFVRGEVKF--LVGYKKGFGILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSN 694
G+FVR + L G K G+ C C N + VS S FEAH+G ++ P +IY N
Sbjct: 533 GHFVRCSCRGMQLTGMLKDMGVQCNCRNCKGSMIVSISAFEAHSG-STSHHPSDNIYLEN 591
Query: 695 GVSLHEL----------------SIKLSLERPFSSKENDDLCGIC--MDGGDLLCCDS-- 734
G +L ++ ++K+++ +++ C C + GDL+ C
Sbjct: 592 GKNLRDVLSAGQEAADCGDNILRALKMAIGDIQGVEKSKVTCAECGGSEEGDLIYCKGAR 651
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
C H CV G W C C T ++
Sbjct: 652 CSVVSHSRCVGSANPQLGDWFCGKCEKTKKRH 683
>gi|242091644|ref|XP_002436312.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
gi|241914535|gb|EER87679.1| hypothetical protein SORBIDRAFT_10g000270 [Sorghum bicolor]
Length = 329
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/275 (50%), Positives = 177/275 (64%), Gaps = 6/275 (2%)
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
G + R +DF + F RTVILCDQCE+EYHVGCL+ +L+ELP+G+W CC+ C
Sbjct: 55 GLLWHRQKDFNNAVFDERTVILCDQCEKEYHVGCLQSQWQVELKELPEGEWFCCSSCSET 114
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALL 926
+L K++ G + L E L++I+KKHE G D++W++L GK+ A++ LL
Sbjct: 115 RSSLDKIISDGAQLLAERDLEIIRKKHETRGLCMDTSKDLKWQLLSGKR--ATEEGSILL 172
Query: 927 SKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHR----GQDYHGMYCAILTVNQVVVSAG 982
S AV IFH FDPI E+ + DLIP MV GR + GQDY GMYCA+LTV VVSA
Sbjct: 173 SAAVPIFHQSFDPIREALTGRDLIPEMVNGRGPKEGMPGQDYSGMYCALLTVGSTVVSAA 232
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
+ R+ G ++AELPLVATS D QG GYFQ+LF CIE++L L +K VLP+A EA+ IW
Sbjct: 233 LMRVMGGDVAELPLVATSQDVQGLGYFQALFSCIERVLVSLKIKHFVLPAAHEAEGIWMK 292
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
KFGFS + EE N L IF GTS L K VP
Sbjct: 293 KFGFSRIPPEELEACLNGAHLTIFHGTSYLYKAVP 327
>gi|168065346|ref|XP_001784614.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663846|gb|EDQ50589.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1510
Score = 253 bits (646), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 168/525 (32%), Positives = 242/525 (46%), Gaps = 137/525 (26%)
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAF 739
+ IY ++G SLH+ + L +E+ + E+ D C C D GDL C CP A+
Sbjct: 961 YTSIYLADGRSLHDAAQSLVVEQNVKQEGVTPAKIEHLDQCVECGDSGDLQLCTGCPNAY 1020
Query: 740 H-------------------------IDCVSLPGIPS-GTWHCRYCMNTFQKEKFVEYN- 772
H +DC+ S G + C C Q+++F
Sbjct: 1021 HQGTVVPGVNHVAEVVVVLLGNEFDDVDCLGSTDSSSFGEFFCPDC----QEQRFGGTKD 1076
Query: 773 -----ANARAAGRIEGVDPFAQMVSRCIRIVQTPDT-ELGGCVLCRGRDFCKSRFGRRTV 826
R+ G + + +M+ RC R++Q P+ LGGCV C+ DF K+ FG +T
Sbjct: 1077 RRRSMTKRRSKGAAKTLLTKDRMIGRCSRLLQVPEAIVLGGCVFCKSGDFAKTGFGPKTT 1136
Query: 827 ILCDQCEREYHVGCLK-------------------------------------DHGMED- 848
+LCDQCEREYHVGCLK D G E
Sbjct: 1137 LLCDQCEREYHVGCLKKHGLEDLKSCSVTGFLGLLGWGEVVFRCVATVRLRDCDRGCESQ 1196
Query: 849 ----------------LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKK- 891
QELP+G+W C DCK I+ L LV G E L ++ + + +
Sbjct: 1197 GIVEMRTYIRMVIREIFQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLRT 1256
Query: 892 ------KHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDR--------- 936
+ E++ + F+ W++L G+ D S+G L++AV IF R
Sbjct: 1257 NQARLERSEDATESSCSGFE--WQLLHGRGGDPSNGKA--LAEAVQIFSVRNLSDPGFPV 1312
Query: 937 -------------------FDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV 977
FDPI + S DLIP MVY RS R QD+ G+YC +L +
Sbjct: 1313 RTVWDSHPCGESIFLLLECFDPIADGVSGGDLIPLMVYRRSLRDQDFGGIYCVVLKYDNR 1372
Query: 978 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1037
VVS + R+FG++LAELPL+AT+ QGQG+ ++L IE+LLG L V+ L LP+A A+
Sbjct: 1373 VVSTALIRVFGRQLAELPLLATNPSHQGQGHCKALLLSIERLLGVLRVERLALPAAEGAE 1432
Query: 1038 AIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1082
IW NKFGF M E + ++ +D +M+F G+ ML+K +P I
Sbjct: 1433 GIWLNKFGFRRMAEGQVKQFHSDLNMMVFTGSFMLEKEIPPLEIT 1477
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 140/304 (46%), Gaps = 85/304 (27%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++++ + K L+SG+L+G V Y+ RG + L G+++ G+ C C CKG Q
Sbjct: 585 LLKEAPASAKLLLQSGLLDGHHVRYMGRGGHIM------LTGIIQEGGVLCDCSSCKGVQ 638
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKK 517
VV + FE HAGSS + P ++I+LENGK L+DI+ + + ++ +K
Sbjct: 639 VVNVSAFEKHAGSSARHPSDFIFLENGKCLKDILEIGWN----------------ANKQK 682
Query: 518 ANFCLNCRVSFSNAGVEELMLLCKSCV-ELKESQAGSAEIKEPLSHSSEMEPQPPSVELE 576
N +M + KS + E+ + + ++ PL + QP L+
Sbjct: 683 MN----------------IMDVLKSAIGEVGGVKVQISSLEHPL-----IAIQP---ALK 718
Query: 577 ESPAPSGELTDTSNRSP---EPNSAQTSSHSKMKSSS-------VKSHGKITRKDL---- 622
+ P P L DT R P +P + Q + KM S + VK+ G R +
Sbjct: 719 KLPQPK-SLLDTKPRVPVDVKPRTPQGDTKPKMPSDTKARFTPEVKARGSDARASMPRLD 777
Query: 623 ---------------------RMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGIL 661
+HK +F GGLED EVGY+V+G+ K L G KKG GIL
Sbjct: 778 RTPREKETASPPVLSRESSGANLHKALFLPGGLEDDTEVGYYVKGQ-KSLAGVKKGAGIL 836
Query: 662 CTCC 665
C+CC
Sbjct: 837 CSCC 840
>gi|413935125|gb|AFW69676.1| hypothetical protein ZEAMMB73_508622, partial [Zea mays]
Length = 527
Score = 229 bits (583), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 144/366 (39%), Positives = 194/366 (53%), Gaps = 54/366 (14%)
Query: 347 LRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP-----VKQETVMKPRKVMR 401
LRRFTRSLL + + LS T +N P + K K
Sbjct: 205 LRRFTRSLLVKDKDSNDDDLS-------GTTTASNASSPSPKTNTTSTSRSSNKGNKNTN 257
Query: 402 KFYSKLKNFLESGILEGMSVMYI--RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVV 459
K + L+ L +G+LEG V YI +G + LRGV+K GI C C CKG +VV
Sbjct: 258 KIPTNLRELLATGMLEGQPVKYIMRKGKRAV------LRGVIKRIGILCSCSSCKGRKVV 311
Query: 460 TPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKAN 519
+P FE+HAGS+ K P +YI+LENG L D++ C ++ L+ LE A+R +G + ++
Sbjct: 312 SPYYFEVHAGSTKKHPSDYIFLENGNNLHDVLRACTNATLDMLEPAIRKAIGPAPQERIF 371
Query: 520 FCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESP 579
C +C+ SFS + L C SC+E K + K +S S Q S ++ +S
Sbjct: 372 RCKSCKSSFSTLRSGKFALFCDSCLESKGA-------KNNISSSKVGRSQTSSAKVYKSA 424
Query: 580 APSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAE 639
+P KSSSV G++TRKD MHK+VF G L +G +
Sbjct: 425 SPGA-----------------------KSSSV---GRLTRKDKGMHKVVFMSGILPEGTD 458
Query: 640 VGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 699
VGY+V G+ + L GY K GI C CC++ VSPSQFE HAG A+RRKP+ +IY SNGVSLH
Sbjct: 459 VGYYVGGK-RLLDGYIKELGIYCHCCSTVVSPSQFEGHAGRAARRKPYHNIYMSNGVSLH 517
Query: 700 ELSIKL 705
ELSI L
Sbjct: 518 ELSISL 523
>gi|222624670|gb|EEE58802.1| hypothetical protein OsJ_10349 [Oryza sativa Japonica Group]
Length = 874
Score = 227 bits (579), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 198/749 (26%), Positives = 335/749 (44%), Gaps = 159/749 (21%)
Query: 384 VGGPVK---QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
+ PVK ET +P +++ ++ L +G+LEG V Y + + G
Sbjct: 224 ITAPVKVRVTETKYRPESILKD----VRGLLSTGLLEGFRVTYKKNEVER-------IGR 272
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
+ G G SC C +C ++ FE H+G S+ +I+L++G +L ++ K + L+
Sbjct: 273 INGQGYSCGCSECGYRNIMNACEFEQHSGESSNNQNNHIFLDSGISLYMVIQGLKYTKLD 332
Query: 501 TLEKAVRMVLG-------------------------------SSSMKKANFCL--NCRVS 527
L + V+ + S +++N L + + S
Sbjct: 333 MLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFDDAPSDPCSTQSSQESNIALTDSLKDS 392
Query: 528 FSNAGVEELMLLCKSCVELKESQ--AGSAEIKEP-LSHSSEME----PQPPSVELEESPA 580
SNA +L S + Q G E P LS S E E S++ EE+P+
Sbjct: 393 TSNASS---ILNWSSFRRRSDRQFKRGGTETSTPILSRSPEKEISDLSTSTSMKSEETPS 449
Query: 581 --PSGELTD--TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
+G LT T + P P+ H+ S+ K G+ +D +H ++F+EGGL D
Sbjct: 450 ENTAGLLTTDVTVIQDPPPD------HNV--DSNSKDLGQPKVRDNTLHPMLFKEGGLPD 501
Query: 637 GAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSN 694
+ Y ++ GYK G GI+C CC+ EV +PSQFE H G RR+P++ IYTS+
Sbjct: 502 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQPYRSIYTSD 561
Query: 695 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
G++LHEL++KL + SS N D LP + SG+
Sbjct: 562 GLTLHELALKL--QDGLSSNVNID--------------------------ELPTLTSGSG 593
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 814
EY+ +R + P + + + V++ C +CR
Sbjct: 594 K--------------EYSTTSRPI-----IVPLKRTLQERVLTVES-------CYMCRKP 627
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMED-------LQELPKGKWLCCADCKRIN 867
++ C+QCER HV C ++G++ L E + ++CC C+ +
Sbjct: 628 HTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLR 686
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
+L + + KK E+ + +++ W++L G + + +
Sbjct: 687 ASLHEGL----------------KKREDIAFLRRIRYNICWQLLNGTNMRSD-----VQH 725
Query: 928 KAVSIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFR 985
+ + IF D F E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I +
Sbjct: 726 QVIEIFKDAF---AETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILK 782
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
+ +E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G
Sbjct: 783 VRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLG 842
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++++++E+++ +PL++F S++QK
Sbjct: 843 YTILSDEQKHSMLMAHPLVMFANLSLVQK 871
>gi|388499466|gb|AFK37799.1| unknown [Lotus japonicus]
Length = 196
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 106/192 (55%), Positives = 140/192 (72%), Gaps = 3/192 (1%)
Query: 891 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 950
KK EE + + DVRWR+L G+ AS T+ LL +AVS+FH+ FDPI++ A+ DLI
Sbjct: 2 KKQEERCLEPLREIDVRWRLLNGRV--ASPETKPLLLEAVSMFHECFDPIVDPATGRDLI 59
Query: 951 PAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQ 1010
PAMV GR+ R QD+ GMYCA+L VN V SA + RIFG ++AELPL+AT N +G+GYFQ
Sbjct: 60 PAMVNGRNLRTQDFGGMYCALLMVNSSVASAAMLRIFGGDIAELPLIATRNKNRGKGYFQ 119
Query: 1011 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY-PLMIFQGT 1069
+LF CIE+LL FL+VK LVLP+A EA++IW +KFGFS M ++ YR +Y +M F+GT
Sbjct: 120 TLFSCIERLLSFLSVKNLVLPAAEEAESIWIHKFGFSKMEPDQLTNYRKNYCQMMAFKGT 179
Query: 1070 SMLQKPVPKCRI 1081
ML K VP+CR+
Sbjct: 180 VMLHKTVPQCRV 191
>gi|168018374|ref|XP_001761721.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687092|gb|EDQ73477.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1635
Score = 216 bits (549), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 167/575 (29%), Positives = 252/575 (43%), Gaps = 153/575 (26%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
++++ + K L+SG+L+G V Y+ RG + L G+++ G+ C C CKG Q
Sbjct: 582 LLKEAPASAKLLLQSGLLDGHHVRYLGRGGHIM------LTGIIQEGGVLCDCSSCKGVQ 635
Query: 458 V---------------------------------------------------VTPAVFEL 466
V V + FE
Sbjct: 636 VTCDRLPGVGKWGLLEGLMLRGALGSIGQCGTCSCTFCHKRVDSGCVVGLKVVNVSAFEK 695
Query: 467 HAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLE--KAVRMVLGSSSMKKANFCLNC 524
HAGSS + P ++I+LENGK L+DI+ + ++ + + ++ +G +K
Sbjct: 696 HAGSSARHPSDFIFLENGKCLKDILEIGWNANKQKMNVMDVLKSAIGEVGGEKVQI---- 751
Query: 525 RVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGE 584
+S + L+ E K Q +P +++P+ P V+++
Sbjct: 752 -ISLDHP------LIAIQPAEKKLPQPRLVLDTKP-RVPVDLKPRMPQVDMKPR-----V 798
Query: 585 LTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD--------------------LRM 624
+ DT +R P A+++S K + V++ I R D +
Sbjct: 799 MLDTRSRMPLDTKAKSTSDVKARGGDVRA--TIPRLDRTTREKEAASPPVPSRESSGANL 856
Query: 625 HKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNS----------------- 667
HK +F GGLED EVGY+V+G+ KFL G K+G GILC+CC
Sbjct: 857 HKALFLPGGLEDDIEVGYYVKGQ-KFLAGLKRGAGILCSCCQQVARNGMSSILMDSVVCG 915
Query: 668 -------EVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKEND--- 717
+S S FE HAGW SRR P+ IY ++G SLH+ + L +E+ + N
Sbjct: 916 LTYEGERMISCSLFEQHAGWGSRRNPYTSIYLADGRSLHDAAQSLVVEQTVKQEGNTPAK 975
Query: 718 ----DLCGICMDGGDLLCCDSCPRAFHIDCVS-LPGIPSGTWHC------RYCMNTFQKE 766
D C C D GDL C CP A+H DC+ + SG + C RY ++
Sbjct: 976 IEHLDQCVECGDRGDLQLCTRCPNAYHQDCLGKVDSYSSGEFFCPDCQEQRYGGTKDRRR 1035
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDT-ELGGCVLCRGRDFCKSRFGRRT 825
V N R+ G + + ++ RC R++Q P+ LGGCV C+ DF K+ FG +T
Sbjct: 1036 SMV----NRRSKGAAKTLLSKDRVTGRCTRLLQVPEAVVLGGCVFCKSGDFAKTGFGPKT 1091
Query: 826 VILCDQ----------CEREYHVGCLKDHGMEDLQ 850
+LCDQ CEREYHVGCLK HG+EDL+
Sbjct: 1092 TLLCDQVSVGDMKVKGCEREYHVGCLKKHGLEDLK 1126
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 52/78 (66%)
Query: 1007 GYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIF 1066
G+ ++L IE+LLG L V+ L LP+A A+ IW N+FGF M EE+ ++ +D +M+F
Sbjct: 1477 GHCKALLLSIERLLGVLRVERLALPAAEGAEGIWLNRFGFRRMAEEQVKQFHSDLNMMVF 1536
Query: 1067 QGTSMLQKPVPKCRIVGK 1084
G+SML+K +P I G+
Sbjct: 1537 TGSSMLEKDIPPLEITGQ 1554
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 50/93 (53%), Gaps = 7/93 (7%)
Query: 849 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE--SGSDNAVDFD- 905
QELP+G+W C DCK I+ L LV G E L ++ + + K ++ GS++A +
Sbjct: 1202 FQELPEGEWFCGQDCKHIHSILSLLVSNGPEPLADSIISKVLKTNQARLEGSEDATESSC 1261
Query: 906 --VRWRVLRGKKVDASDGTRALLSKAVSIFHDR 936
W++L G+ D S+G L++ V IF R
Sbjct: 1262 SGFEWQLLHGRGGDPSNGKA--LAEVVQIFSVR 1292
>gi|108707492|gb|ABF95287.1| expressed protein [Oryza sativa Japonica Group]
Length = 973
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 189/749 (25%), Positives = 325/749 (43%), Gaps = 182/749 (24%)
Query: 384 VGGPVK---QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
+ PVK ET +P +++ ++ L +G+LEG V Y
Sbjct: 346 ITAPVKVRVTETKYRPESILKD----VRGLLSTGLLEGFRVTY----------------- 384
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
K N+++ FE H+G S+ +I+L++G +L ++ K + L+
Sbjct: 385 -------------KKNEIMNACEFEQHSGESSNNQNNHIFLDSGISLYMVIQGLKYTKLD 431
Query: 501 TLEKAVRMVLG-------------------------------SSSMKKANFCLNCRVSFS 529
L + V+ + S +++N L + S
Sbjct: 432 MLGDVIGKVISLPPNMIQYEKWKASFQLEKDYFDDAPSDPCSTQSSQESNIALTDSLKDS 491
Query: 530 NAGVEELMLLC----KSCVELKESQAGSAEIKEP-LSHSSEME----PQPPSVELEESPA 580
+ ++ +S + K G E P LS S E E S++ EE+P+
Sbjct: 492 TSNASSILNWSSFRRRSDRQFKR---GGTETSTPILSRSPEKEISDLSTSTSMKSEETPS 548
Query: 581 --PSGELTD--TSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLED 636
+G LT T + P P+ H+ S+ K G+ +D +H ++F+EGGL D
Sbjct: 549 ENTAGLLTTDVTVIQDPPPD------HNV--DSNSKDLGQPKVRDNTLHPMLFKEGGLPD 600
Query: 637 GAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSN 694
+ Y ++ GYK G GI+C CC+ EV +PSQFE H G RR+P++ IYTS+
Sbjct: 601 YTLLTYKLKNGEVLKQGYKLGTGIICECCSIEVQYTPSQFEKHVGMGRRRQPYRSIYTSD 660
Query: 695 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
G++LHEL++KL + SS N D LP + SG+
Sbjct: 661 GLTLHELALKL--QDGLSSNVNID--------------------------ELPTLTSGSG 692
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 814
EY+ +R + P + + + V++ C +CR
Sbjct: 693 K--------------EYSTTSRPI-----IVPLKRTLQERVLTVES-------CYMCRKP 726
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMED-------LQELPKGKWLCCADCKRIN 867
++ C+QCER HV C ++G++ L E + ++CC C+ +
Sbjct: 727 HTVLGVISVDMIVFCNQCERALHVKCY-NNGLQKPKAPLKVLGEYTQFNFMCCEKCQLLR 785
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
+L + + KK E+ + +++ W++L G + + +
Sbjct: 786 ASLHEGL----------------KKREDIAFLRRIRYNICWQLLNGTNMRSD-----VQH 824
Query: 928 KAVSIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFR 985
+ + IF D F E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I +
Sbjct: 825 QVIEIFKDAF---AETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILK 881
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
+ +E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G
Sbjct: 882 VRTEEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLG 941
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++++++E+++ +PL++F S++QK
Sbjct: 942 YTILSDEQKHSMLMAHPLVMFANLSLVQK 970
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 9/101 (8%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKG----NQVVTPAVFELHAGS 470
+L+G+ V Y R K + L G V G +C C G +V++ FE HAG
Sbjct: 31 LLQGVPVTY-RFEKHN----AKLEGTVAAGGYACACPAYAGCDYRGKVLSALQFEKHAGV 85
Query: 471 SNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
++K +I+L NG++L ++ + ++ P E +A RM G
Sbjct: 86 TSKNQNGHIFLRNGRSLYELFHKLREVPAEKFPEAFRMAAG 126
>gi|414866151|tpg|DAA44708.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
gi|414866152|tpg|DAA44709.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 752
Score = 206 bits (524), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 188/711 (26%), Positives = 306/711 (43%), Gaps = 115/711 (16%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 115 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 167
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG------------SS 513
H G S ++I+L+ G +L ++ K L L + +G +S
Sbjct: 168 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIGFPPNLDEYNKWKAS 227
Query: 514 SMKKANF--------CLNCRVSFSNAG--VEELMLLCKSCVELKESQAG-SAEIKEPLSH 562
K+ ++ CL AG + L K V S SA +
Sbjct: 228 FQKRKDYLDAVASDGCLTQSSQGLAAGEMIYSLRDYLKDSVSNSISNLNWSASKRRSGRR 287
Query: 563 SSEMEPQPPSVELEESPAPSG--ELTDTSNR--SPEPNSAQTSSHSK-----MKSSSVKS 613
+ + + SP G TDTS + + E + SS K +++ S+ S
Sbjct: 288 FRQGDTGTSTPTFSGSPGKGGFGHSTDTSEKKGTEETHRLSLSSPVKITQRPLRNCSIDS 347
Query: 614 HGKITR-KDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPS 672
K ++ +D +H L+F+E GL D + Y ++ GYK+G I+C CCN E SPS
Sbjct: 348 KSKESKTRDTTLHPLIFKEDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPS 407
Query: 673 QFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCC 732
FE HAG RR+P+ +IYT G+SLH+ + L L+ + D+ + L
Sbjct: 408 HFEEHAGMGRRRQPYHNIYTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTS 465
Query: 733 DSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVS 792
C R P SG + P + +
Sbjct: 466 SGCGRE--------PSTTSGP------------------------------IVPLKRTLQ 487
Query: 793 RCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GM 846
R+V+T C C T+I C+QCER H+ C + +
Sbjct: 488 E--RVVETES-----CYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPL 540
Query: 847 EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDV 906
E L+E +LCC +C+ +L+ ++ G EK + +++ + ++
Sbjct: 541 EILKEYMCFHFLCCQECQ----SLRARLEEGLEKC--VGITFLRR----------IRSNI 584
Query: 907 RWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYH 965
WR+L G +DAS + + + + IF D F + S D+I MV G++ + +D+
Sbjct: 585 CWRLLSG--MDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFR 640
Query: 966 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1025
GMYCA+LT + VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV
Sbjct: 641 GMYCALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNV 700
Query: 1026 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
L P E IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 701 SLLTAPVDPEMAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 751
>gi|384253135|gb|EIE26610.1| hypothetical protein COCSUDRAFT_59132 [Coccomyxa subellipsoidea
C-169]
Length = 1231
Score = 201 bits (512), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 108/286 (37%), Positives = 153/286 (53%), Gaps = 38/286 (13%)
Query: 806 GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK--------- 856
GGCVLC+ DF + FG RT+I+CDQCEREYH+GCL +HG L ELP+GK
Sbjct: 575 GGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLAEHGRAHLTELPEGKASLYDILLT 634
Query: 857 ------WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 910
W C +CK I +++ V +S+ V + + W+V
Sbjct: 635 LHLNGEWHCSPECKGIATRMRERV---------SSVPV------------PLQGEYSWQV 673
Query: 911 LRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCA 970
LRGK D + T L A I + FDPI++ + DL+ AMVY + DY GMY A
Sbjct: 674 LRGK--DGTHATTWALKAAQEILTESFDPILDLVTGADLMMAMVYAQELGDWDYTGMYTA 731
Query: 971 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1030
+L V + +FR+FG++LAE+PLVAT + QG+ + L E L V++L L
Sbjct: 732 VLRRRGKAVCSAVFRVFGRQLAEVPLVATRLGARRQGHARVLMAAFEDYFRSLGVQSLCL 791
Query: 1031 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
P+A W + FGF+ +T EEQ ++ ++IF GT +LQKP+
Sbjct: 792 PAAQSTVETWIHGFGFAAITPEEQAATCSELRVLIFPGTELLQKPL 837
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 111/411 (27%), Positives = 165/411 (40%), Gaps = 101/411 (24%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSG-ISCFCDDCKGNQVVTPAV-- 463
L+ L+ G L G V + G L G + G I+C C C+ + TP V
Sbjct: 230 LREVLKGGALRGQPVFFQSRH-----GDLLLNGSITEEGQIACPCKQCRAKK--TPGVSC 282
Query: 464 --FELHAGSSNKRPPEYIYLEN-GKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAGS +RP E IYL N +L++ + D G S+ + +
Sbjct: 283 SEFEEHAGSRERRPGESIYLTNLSISLKEFCALVNDE-------------GRSADRHGSA 329
Query: 521 CLNC--------------RVSFSNAGVEELM---LLCKSCVELKESQAGSAEIK-EPLSH 562
C C V AG+EE+ C +CV A A++K +PL
Sbjct: 330 CGLCMDGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDACV------ARGAQLKAKPLP- 382
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKS-HGKITRKD 621
S +P P+ + E + + + + + M + +++ G ++
Sbjct: 383 -SPQKPSKPAKWRQPKQKAPKEKKHGGSGAAKKAAHAGAPRVHMAAPALRVVSGARRERN 441
Query: 622 LRMHKLVF---EEGGLEDGAEVGY-------FVRGEVKFLVGYKKGFGILCTCCNSEVSP 671
HK +F E GGL DG V Y ++G V+ GILC CCN +S
Sbjct: 442 SNKHKRLFLPDEPGGLTDGEPVSYITSQGEELLKGSVRIDATEAGPSGILCACCNGVISC 501
Query: 672 SQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS---LERPFSSKENDDLCGI------ 722
SQFEAHAG SRR P+ +I+T+ GVSL +L+ + E P S + LC +
Sbjct: 502 SQFEAHAGRGSRRAPYDNIFTAAGVSLRKLACLMPASEAESPISHRPA-ALCAVADRRAL 560
Query: 723 -----------CMDGGDLLC-----------------CDSCPRAFHIDCVS 745
+ GG +LC CD C R +HI C++
Sbjct: 561 EPELVTVSGEAALHGGCVLCKVPDFLRGGFGERTMIICDQCEREYHIGCLA 611
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 61/108 (56%), Gaps = 9/108 (8%)
Query: 660 ILCTC--CNSEVSP----SQFEAHAGWASRRKPFQHIYTSN-GVSLHELSIKLSLERPFS 712
I C C C ++ +P S+FE HAG + R+P + IY +N +SL E ++ E S
Sbjct: 265 IACPCKQCRAKKTPGVSCSEFEEHAG-SRERRPGESIYLTNLSISLKEFCALVNDEG-RS 322
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ + CG+CMDGGDLLCCD CP A H C L +P G W C C+
Sbjct: 323 ADRHGSACGLCMDGGDLLCCDGCPTAVHAYCAGLEEVPEGDWFCDACV 370
>gi|414866150|tpg|DAA44707.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 787
Score = 196 bits (498), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 148/513 (28%), Positives = 236/513 (46%), Gaps = 96/513 (18%)
Query: 592 SPEPNSAQTSSHSKMKSS----SVKSHGKITRKDLR-----------------MHKLVFE 630
SP P +A T++HSK S+ S+ S KIT++ LR +H L+F+
Sbjct: 342 SPLP-TAVTTNHSKHDSTNLGLSLSSPVKITQRPLRNCSIDSKSKESKTRDTTLHPLIFK 400
Query: 631 EGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
E GL D + Y ++ GYK+G I+C CCN E SPS FE HAG RR+P+ +I
Sbjct: 401 EDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNI 460
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 750
YT G+SLH+ + L L+ + D+ + L C R P
Sbjct: 461 YTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGRE--------PSTT 510
Query: 751 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 810
SG + P + + R+V+T C
Sbjct: 511 SGP------------------------------IVPLKRTLQE--RVVETE-----SCYF 533
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQELPKGKWLCCADCK 864
C T+I C+QCER H+ C + +E L+E +LCC +C+
Sbjct: 534 CGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQ 593
Query: 865 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 924
+L+ ++ G EK + +++ + ++ WR+L G +DAS +
Sbjct: 594 ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWRLLSG--MDASRDVKL 635
Query: 925 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMYCAILTVNQVVVSAGI 983
+ + + IF D F + S D+I MV G++ + +D+ GMYCA+LT + VVSA I
Sbjct: 636 YMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTASTHVVSAAI 693
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
++ +++AEL L+AT ++C+ +GYF L IE L NV L P E IW+ K
Sbjct: 694 LKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPEMAQIWSEK 753
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
GF++++ EE+ +PL++F+ ++QK +
Sbjct: 754 LGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 786
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 8/97 (8%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 115 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 167
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETL 502
H G S ++I+L+ G +L ++ K L L
Sbjct: 168 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNML 204
>gi|242041293|ref|XP_002468041.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
gi|241921895|gb|EER95039.1| hypothetical protein SORBIDRAFT_01g038485 [Sorghum bicolor]
Length = 981
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 202/432 (46%), Gaps = 75/432 (17%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL------------- 705
GI C CCN + FE HAG +S P H++ +G+SL + ++
Sbjct: 607 GIKCMCCNEIFTVENFEVHAG-SSTPLPSAHMFLKDGMSLSQCLVEFMGGNKPRDPHPLH 665
Query: 706 -SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQ 764
L+ S E+D +C +C DGGDLL CD+CP ++H DCV L IP G W+C C
Sbjct: 666 ARLKGKNSDLESDSICSVCHDGGDLLLCDNCPSSYHHDCVGLEAIPEGNWYCPSCR---- 721
Query: 765 KEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRR 824
S C PDT S+F +
Sbjct: 722 --------------------------CSICNLSDYDPDT---------------SQFTEK 740
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
T++ CDQCEREYHVGC ++ + L P+G W C C + LQ+L+ + E
Sbjct: 741 TIVYCDQCEREYHVGCTRNSDNQ-LICRPEGCWFCSRGCSNVFQHLQELIGKSVPTPIEG 799
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
I K +GSD+ D+D ++ A + L AV I H+ F IIE
Sbjct: 800 VSWTILKFCSGNGSDHG-DYD--------DEIMADHYGK--LCVAVGILHECFVTIIEPR 848
Query: 945 SKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
++ D+ +V+ R R ++ G Y +L +S G FRI GQ+ AELPL+ TS+
Sbjct: 849 TQSDISEDIVFNRESELRRLNFRGFYTILLQKGGEPISVGTFRICGQKFAELPLIGTSSP 908
Query: 1003 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1062
+ QG + L +EKLL L V+ L+LP+ E WT FGF++M+ ++ + +
Sbjct: 909 YRRQGMCRLLINELEKLLLDLGVERLILPAVPELLETWTCSFGFTIMSNSDRLELAGN-S 967
Query: 1063 LMIFQGTSMLQK 1074
++ FQGT+M QK
Sbjct: 968 ILSFQGTTMCQK 979
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 33/57 (57%), Gaps = 6/57 (10%)
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTL 487
GPG+ G ++ GI C C N++ T FE+HAGSS P +++L++G +L
Sbjct: 595 GPGIK--EGSIRRDGIKCMC----CNEIFTVENFEVHAGSSTPLPSAHMFLKDGMSL 645
>gi|302771369|ref|XP_002969103.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
gi|300163608|gb|EFJ30219.1| hypothetical protein SELMODRAFT_90617 [Selaginella moellendorffii]
Length = 443
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/445 (29%), Positives = 194/445 (43%), Gaps = 99/445 (22%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLE---------- 708
GI+C+CC S S FEAH+G S+R+ I+ + G SL +L ++ + E
Sbjct: 28 GIVCSCCKKLWSLSGFEAHSG-TSQRRACASIFNNKGESLLDLQVQ-AWELLDSKVNPKE 85
Query: 709 --RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ S ENDD CG+C DGG L+CCD CP +H+ C+ L +P G W C C
Sbjct: 86 NVKAAPSDENDDACGVCGDGGRLICCDHCPSTYHLSCLLLKELPEGEWFCPSCR------ 139
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK--SRFGRR 824
C +C G ++ S F
Sbjct: 140 -----------------------------------------CAICGGSEYNADGSSFNEM 158
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
TV+LCDQCEREYHV CL GM + P W C C +I L+KLV
Sbjct: 159 TVLLCDQCEREYHVSCLYSRGMAKMTSCPDDSWFCGDHCDKIFEGLRKLV---------- 208
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGT----------RALLSKAVSIFH 934
G N + + W +LR + D + R+ L+ A+ +
Sbjct: 209 ------------GISNTIGEGLSWTLLRSGEDDLPSASSMNREQMAEHRSKLAVALGVMQ 256
Query: 935 DRFDPIIESASKLDLIPAMVYGRSH---RGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
+ F P+++ +K+DL+ ++Y R ++ G Y +L + V+S RI G L
Sbjct: 257 ECFLPMVDPRTKIDLVTHILYNRGKAEVNRLNFRGFYTVVLEKDDEVISVASIRIHGGLL 316
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AE+PL+ T + QG + L IE LL L +++ VLP+ E W N FGF M
Sbjct: 317 AEMPLIGTRFHHRRQGMCRRLVRAIEGLLQRLGIRSFVLPAVPELLHTWKNAFGFQEMAP 376
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQKPV 1076
++ + ++ F G ++LQKP+
Sbjct: 377 TQRLELVK-LSVVSFPGVTLLQKPL 400
>gi|326525367|dbj|BAK07953.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1292
Score = 192 bits (488), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 143/495 (28%), Positives = 224/495 (45%), Gaps = 84/495 (16%)
Query: 600 TSSHSKMKSSSVKSHGKITRKDLR---MHKLVFEEGGLEDGAEVGYFVRGE-VKFLVGYK 655
T SH+++ SS+ HG+I R + L+ ++ L ++ Y R + + G
Sbjct: 680 TMSHARVLSST---HGQILPYQHRAKTVLSLLVDKNILLPRVKLTYKQRSDGPRLKEGAV 736
Query: 656 KGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL------------SI 703
GI C CCN + FE HAG S R P HI+ +G L + S+
Sbjct: 737 TKDGIKCRCCNELFTLESFEVHAG-CSTRLPAAHIFLKDGRPLSQCLVELMGENKPKESL 795
Query: 704 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
+ L+ +S E+D +C IC +GG++L CD+CP +FH CV L P G+W+C C
Sbjct: 796 HVRLKTNYSDTESDSICSICNEGGEILLCDNCPSSFHHACVGLESTPEGSWYCPSCR--- 852
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFC--KSRF 821
C +C D+ ++F
Sbjct: 853 --------------------------------------------CSICDSSDYDPDTNKF 868
Query: 822 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 881
+T++ CDQCEREYHVGC+++ G + L P+G W C C I LQ L+ +
Sbjct: 869 TEKTIMYCDQCEREYHVGCMRNKG-DQLTCCPEGCWFCSRGCSEIFQHLQGLIGKSIPTP 927
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
E I + E+ S + ++ GK L A+ + H+ F II
Sbjct: 928 VEGLSCTILRFDRENASQHGDFYNEIIAEQYGK-----------LCIALDVLHECFVTII 976
Query: 942 ESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
E +++ DL +V+ R R ++ G Y IL + ++S G FR+ G++ AELPL+ T
Sbjct: 977 EPSTRRDLSEDIVFNRESGLRRLNFRGFYTLILQKDGELISVGTFRVCGKKFAELPLIGT 1036
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ QG + L +EKLL L V+ LVLP+ + WT FGF M+ ++ +
Sbjct: 1037 RVQYRRQGMCRLLMNELEKLLSGLGVERLVLPAIPQLLETWTGSFGFRAMSFSDRFELA- 1095
Query: 1060 DYPLMIFQGTSMLQK 1074
+ ++ FQGT++ QK
Sbjct: 1096 ESSILSFQGTTICQK 1110
Score = 43.9 bits (102), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 15/107 (14%)
Query: 439 GVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN--VCKD 496
G V GI C C N++ T FE+HAG S + P +I+L++G+ L + + ++
Sbjct: 734 GAVTKDGIKCRC----CNELFTLESFEVHAGCSTRLPAAHIFLKDGRPLSQCLVELMGEN 789
Query: 497 SPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSC 543
P E+L VR+ S + + C C N G E +LLC +C
Sbjct: 790 KPKESLH--VRLKTNYSDTESDSICSIC-----NEGGE--ILLCDNC 827
>gi|302784378|ref|XP_002973961.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
gi|300158293|gb|EFJ24916.1| hypothetical protein SELMODRAFT_100329 [Selaginella moellendorffii]
Length = 468
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 137/475 (28%), Positives = 205/475 (43%), Gaps = 96/475 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ E L + +V Y + + L+ G + GI+C+CC S S FEAH+G S+R+
Sbjct: 21 LLETKMLHEFQKVSYVDKKTRQVLLEGIVRFEGIVCSCCKKLWSLSGFEAHSG-TSQRRA 79
Query: 687 FQHIYTSNGVSLHELSIKLSLE------------RPFSSKENDDLCGICMDGGDLLCCDS 734
I+ + G SL +L ++ + E + S ENDD CG+C DGG L+CCD
Sbjct: 80 CASIFNNKGESLLDLQVQ-AWELLDSKVNPKENVKAAPSDENDDACGVCGDGGRLICCDH 138
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 794
CP +H+ C+ L +P G W C C EYNA+
Sbjct: 139 CPSTYHLSCLLLKELPEGEWFCPSCRCAICGGS--EYNADG------------------- 177
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
S F TV+LCDQCEREYHV CL GM + P
Sbjct: 178 ------------------------SSFNEMTVLLCDQCEREYHVSCLYSRGMAKMTSCPD 213
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 914
W C C +I L+KLV G N + + W +LR
Sbjct: 214 DSWFCGDHCDKIFQGLRKLV----------------------GISNNIGEGLSWTLLRSG 251
Query: 915 KVDASDGT----------RALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH---RG 961
+ D R+ L+ A+ + + F P+++ +K+DL+ ++Y R
Sbjct: 252 EDDLPSANSMNREQMAEHRSKLAVALGVMQECFLPMVDPRTKIDLVTHILYNRGKAEVNR 311
Query: 962 QDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLG 1021
++ G Y +L + V+S RI G LAE+PL+ T + QG + L IE LL
Sbjct: 312 LNFRGFYTVVLEKDDEVISVASIRIHGGLLAEMPLIGTRFHHRRQGMCRRLVRAIEGLLQ 371
Query: 1022 FLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
L +++ VLP+ E W N FGF M ++ + ++ F G ++LQKP+
Sbjct: 372 RLGIRSFVLPAVPELLHTWKNAFGFQEMAPTQRLELVK-LSVVSFPGVTLLQKPL 425
>gi|242041377|ref|XP_002468083.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
gi|241921937|gb|EER95081.1| hypothetical protein SORBIDRAFT_01g039292 [Sorghum bicolor]
Length = 503
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/464 (30%), Positives = 211/464 (45%), Gaps = 77/464 (16%)
Query: 620 KDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAG 679
+D +H L+F+EGGL D + Y ++ GYK G GI+C CC+ E +PS FE HAG
Sbjct: 105 RDTTLHPLIFKEGGLADNTLLTYKLKNGEVLKQGYKWGTGIICNCCSQEFAPSHFEEHAG 164
Query: 680 WASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICM-DGGDLLCCDSCPRA 738
RR+P+ +IYT G +LH+L+ L L+ +S D+ D +L C R
Sbjct: 165 MGRRRQPYHNIYTPEGSTLHKLA--LQLQDHLNSNGFDNASVSSFSDYPNLTSASGCGRQ 222
Query: 739 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 798
P SG P + R +
Sbjct: 223 --------PSTTSG---------------------------------PIVPLK----RTL 237
Query: 799 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQEL 852
Q E C C +I C+QCER HV C + +E L+E
Sbjct: 238 QGRVVETESCYFCGYGHTTIGNIDPDMIIFCNQCERPCHVKCYNNRVVKKKVPLEILKEY 297
Query: 853 PKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR 912
++LCC +C Q L DR EE L +K EE + ++ WR+L
Sbjct: 298 VCFRFLCCQEC-------QLLRDRLEEGL---------EKCEEIAFLRRIRSNICWRLLS 341
Query: 913 GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKL-DLIPAMVYGRS-HRGQDYHGMYCA 970
G +DAS + + + + IF D F +ES + D+ MV ++ + +D+ GMYCA
Sbjct: 342 G--MDASRDVKLFMPQVIDIFKDAF---VESTDEHSDIFSDMVNCKNGDQEKDFRGMYCA 396
Query: 971 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1030
+LT + VVSA I ++ +++AEL L+AT +C+ +GYF L IE L NV L
Sbjct: 397 LLTASTHVVSAAILKVRMEQIAELVLIATRRECRKKGYFILLLKSIEANLRAWNVSLLTA 456
Query: 1031 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
P E IW+ K GF++++ EE+ +PL++F+ ++QK
Sbjct: 457 PVDPEMAQIWSEKLGFTILSAEEKESVLESHPLVMFKNLVLVQK 500
>gi|159463412|ref|XP_001689936.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283924|gb|EDP09674.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2449
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/277 (36%), Positives = 152/277 (54%), Gaps = 31/277 (11%)
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
CVLC DF + F +TV++CDQCE+EYH+GCL+ H M D+Q +P+G+W C +C RI
Sbjct: 1512 CVLCHEPDFDREGFSDKTVLICDQCEKEYHIGCLRQHKMVDMQAVPEGEWFCSDECVRIR 1571
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
AL + V GE +P RW++LRGK + T LS
Sbjct: 1572 DALGEDVAAGEVLMPGNPA-------------------YRWQILRGK--NGRQQTWHALS 1610
Query: 928 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV-------VVS 980
++I + FDPII++ S DL+PAMV + D+ GMY +L V
Sbjct: 1611 TVLNILQESFDPIIDTGSGSDLLPAMVNAETAGDYDFQGMYSILLRYRGPDKEARGKPVL 1670
Query: 981 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCI-EKLLGFLNVKTLVLPSASEAQAI 1039
A +FR+ G +AE+PLVAT DC+ QG+ ++L + KLLG L V+ +VLP+ ++AQ
Sbjct: 1671 AALFRVLGSSMAEMPLVATRYDCRRQGHLRALVDAMRHKLLG-LGVRAIVLPATADAQPA 1729
Query: 1040 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
W + F + E R+++ ++IF T++L +P+
Sbjct: 1730 W-RQLQFQDLDEPSTRVARSEHRMVIFPHTTVLARPL 1765
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 37/63 (58%)
Query: 643 FVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELS 702
+RG V+ GILC C++ +S S FEAHAG RR P++ I T +G++L ++
Sbjct: 1376 LLRGVVQLQACGAATSGILCGHCDTIISASAFEAHAGHKHRRNPYEAILTDDGMTLKRIA 1435
Query: 703 IKL 705
+L
Sbjct: 1436 ERL 1438
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 186 bits (473), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 138/471 (29%), Positives = 207/471 (43%), Gaps = 100/471 (21%)
Query: 632 GGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG + R+PFQ+I
Sbjct: 664 GAVQLSQKVRYMNRRRTKVMLEGWVTRDGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNI 722
Query: 691 YTSNGVSLHELSIKL-----SLER-PFSS------KENDDLCGICMDGGDLLCCDSCPRA 738
Y +GVSL E I S+ER F S NDD CGIC DGGDL+CCD CP
Sbjct: 723 YLDSGVSLLECQIDAWNRQESIERIGFHSVNTDGDDPNDDTCGICGDGGDLICCDGCPST 782
Query: 739 FHIDCVSLPGIPSGTWHCRYCMNTF---QKEKFVEYNANARAAGRIEGVDPFAQMVSRCI 795
FH C+ + +P G WHC C F E FV+ +
Sbjct: 783 FHQSCLDIMMLPPGDWHCPNCTCKFCGIASEDFVQEDG---------------------- 820
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 855
T +EL C L C ++YH CL+D +
Sbjct: 821 ----TNVSELLTCSL---------------------CAKKYHKSCLQDVDAPCIDFNNST 855
Query: 856 KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE-ESGSDNAVDFDVRWRVLRGK 914
C C+ + LQK + KHE ESG W ++
Sbjct: 856 PCFCGKTCRELFEQLQKY---------------LGIKHELESG--------FSWSLVHRM 892
Query: 915 KVDASDGTRAL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 963
+D + L L+ A+S+ + F PI++ S +++I ++Y G + +
Sbjct: 893 DIDLDMSLQGLPQRVECNSKLAVALSVMDECFLPIVDRRSGINIIQNVLYNCGSNFNRLN 952
Query: 964 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1023
Y G Y AIL ++SA R G +LAE+P + T + + QG + LF IE L L
Sbjct: 953 YSGFYAAILERGDEIISAASIRFHGTQLAEMPFIGTRHVYRRQGMCRRLFSAIESALCSL 1012
Query: 1024 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
V+ L++P+ SE WT FGF+ +++ + + ++ +++F G MLQK
Sbjct: 1013 KVQKLIIPAISELTHTWTGVFGFTTLSDSLKQELKS-MNMLVFPGIDMLQK 1062
>gi|449526609|ref|XP_004170306.1| PREDICTED: uncharacterized LOC101209468 [Cucumis sativus]
Length = 1169
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 131/441 (29%), Positives = 204/441 (46%), Gaps = 85/441 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK-------------- 704
GI C+CC+ ++ S+FE HAG + +P ++IY G SL + ++
Sbjct: 571 GIHCSCCDEVITISKFEMHAG-SRVGQPLENIYVHTGSSLLQCLLESWNKQNEPQCKGYN 629
Query: 705 ---LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
+ +E P NDD CGIC DGGDL+CCDSCP FH C+ + PSG WHC YC
Sbjct: 630 FVDVDVEDP-----NDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC-- 682
Query: 762 TFQKEKFVEYNANARAAGRIE-GVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
+ + G++ G+ P E VLC+
Sbjct: 683 ------------SCKVCGQVTIGLHPMDD------------HHEAAADVLCK-------- 710
Query: 821 FGRRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
CD CE +YH C++ ++ D P C C+ ++ LQ+L+ ++
Sbjct: 711 --------CDLCEEKYHPICVQMNNASGDDVNNP---LFCGKKCQMLHERLQRLLGVRQD 759
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
S +I++ SD VD DV +K+ + + L+ A+ + + F P
Sbjct: 760 MKEGFSWTLIRR------SD--VDSDVSLCNEVAQKIKCN----SELAVALFVMDECFLP 807
Query: 940 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+I+ S ++LI ++Y G + ++ G Y AIL + V+ A RI G ELAE+P +
Sbjct: 808 VIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFI 867
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
T + QG + IE +L LNV+ LV+P+ SE + W + FGF + E + +
Sbjct: 868 GTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDETTKQRM 927
Query: 1058 RNDYPLMIFQGTSMLQKPVPK 1078
R L++F G MLQK + K
Sbjct: 928 RK-MSLLVFPGVEMLQKLLLK 947
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGIL---EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L ++ GIL E + M R ++VK L G + GI C C D +V+T +
Sbjct: 536 LAWMIDLGILSLDEKVQYMNQRKTRVK------LEGRLTRDGIHCSCCD----EVITISK 585
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P E IY+ G +L
Sbjct: 586 FEMHAGSRVGQPLENIYVHTGSSL 609
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 129/441 (29%), Positives = 194/441 (43%), Gaps = 85/441 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL--------SLERP 710
GI C+CC S S+FEAHAG +S + +I+ +G SL E +++ +
Sbjct: 594 GIKCSCCQEVFSLSRFEAHAG-SSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKES 652
Query: 711 FSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
FS K END +C +C GGDL+ CD CP +FH C+ L G W C C
Sbjct: 653 FSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKVGCFGDWFCPSCC--- 709
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 823
C +C F
Sbjct: 710 --------------------------------------------CGICGENKFDGGSEQD 725
Query: 824 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEE 879
V C QCER+YHVGCL+ G L P G W C CK+I L LQKL+ + G +
Sbjct: 726 NVVFSCYQCERQYHVGCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 785
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
L T L I+ K ++ D+ ++A + L+ A+ + H+ F+P
Sbjct: 786 NLTWTLLKPIRSK--------GLEIDL-------PDIEALTEVYSKLNIALGVMHECFEP 830
Query: 940 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+ E ++ D++ +++ G ++ G Y +L N ++S R++G+++AE+PL+
Sbjct: 831 VKEPHTRRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLI 890
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
T + G L +EK L L V+ LVLP+ WT FGFS MT+ E+ ++
Sbjct: 891 GTRFQYRRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRF 950
Query: 1058 RNDYPLMIFQGTSMLQKPVPK 1078
DY + FQ T M QK + K
Sbjct: 951 L-DYSFLDFQDTVMCQKLLMK 970
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 133/466 (28%), Positives = 215/466 (46%), Gaps = 83/466 (17%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 566 LIDSGTVQLSEKVQYMNRRRTKVMLEGWITRDGIHCRCCSKILTVSKFEIHAG-SKLRQP 624
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSKEN------------DDLCGICMDGGDLLCCDS 734
FQ+I +GVSL + + + S + DD CGIC DGGDL+CCD
Sbjct: 625 FQNIVLDSGVSLLQCQVDAWNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDG 684
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 794
CP FH C+++ +PSG WHC C F G +G +
Sbjct: 685 CPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADGSN--------- 721
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQEL 852
DT + ++ C CE++YH C++ D + D
Sbjct: 722 ----AEDDTTVS------------------ELVTCSLCEKKYHTSCIQGVDAVLSDTNN- 758
Query: 853 PKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR 912
C C+ + LQK + +E S +I + + GSD +V R
Sbjct: 759 -PSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHRT--DPGSDTSV---------R 806
Query: 913 G--KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMY 968
G ++V+++ + L+ A+++ + F I++ S+++LI ++Y R +Y G Y
Sbjct: 807 GFPQRVESN----SKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGFY 862
Query: 969 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1028
AIL ++ A RI G +LAE+P + T + + QG + LFC IE L L V+ L
Sbjct: 863 TAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEML 922
Query: 1029 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++P+ SE WT FGF+ + E + + R+ +++F GT MLQK
Sbjct: 923 IIPAISELMHTWTVGFGFNPLEESHKQELRS-LNMLVFPGTDMLQK 967
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 133/466 (28%), Positives = 215/466 (46%), Gaps = 83/466 (17%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 398 LIDSGTVQLSEKVQYMNRRRTKVMLEGWITRDGIHCRCCSKILTVSKFEIHAG-SKLRQP 456
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSKEN------------DDLCGICMDGGDLLCCDS 734
FQ+I +GVSL + + + S + DD CGIC DGGDL+CCD
Sbjct: 457 FQNIVLDSGVSLLQCQVDAWNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDG 516
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 794
CP FH C+++ +PSG WHC C F G +G +
Sbjct: 517 CPSTFHQSCLNIQMLPSGDWHCPNCTCKF--------------CGMADGSN--------- 553
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQEL 852
DT + ++ C CE++YH C++ D + D
Sbjct: 554 ----AEDDTTVS------------------ELVTCSLCEKKYHTSCIQGVDAVLSDTNN- 590
Query: 853 PKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR 912
C C+ + LQK + +E S +I + + GSD +V R
Sbjct: 591 -PSTSFCGQGCRELFEHLQKFIGVKQELEAGFSWSLIHRT--DPGSDTSV---------R 638
Query: 913 G--KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMY 968
G ++V+++ + L+ A+++ + F I++ S+++LI ++Y R +Y G Y
Sbjct: 639 GFPQRVESN----SKLAIALTVMDECFLSIVDRRSEINLIHNVLYNRGSNFNRLNYSGFY 694
Query: 969 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1028
AIL ++ A RI G +LAE+P + T + + QG + LFC IE L L V+ L
Sbjct: 695 TAILERGDEIICAASIRIHGTQLAEMPFIGTRHIYRRQGMCRRLFCAIESALCSLKVEML 754
Query: 1029 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++P+ SE WT FGF+ + E + + R+ +++F GT MLQK
Sbjct: 755 IIPAISELMHTWTVGFGFNPLEESHKQELRS-LNMLVFPGTDMLQK 799
>gi|242051400|ref|XP_002463444.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
gi|241926821|gb|EER99965.1| hypothetical protein SORBIDRAFT_02g043960 [Sorghum bicolor]
Length = 843
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 133/472 (28%), Positives = 201/472 (42%), Gaps = 105/472 (22%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGW---ASRRKPFQHIYTSNGVSLHELSIKL---- 705
G+ G GI CTCCN ++ ++F AHA + RR+ + ++ +G SL + ++L
Sbjct: 422 GFITGEGIRCTCCNKTLTVAEFAAHATARRGSDRREAWARVFLKDGRSLSQCLVELMRRD 481
Query: 706 -----------SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
++ S E D +C IC DGGDLL CD+CP AFH CV L P G W
Sbjct: 482 VAVVAARNGDVRVKEKCSDPEGDSVCSICNDGGDLLLCDNCPSAFHHACVGLQATPEGDW 541
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 814
C C C +C G
Sbjct: 542 FCPSCR-----------------------------------------------CGVCGGS 554
Query: 815 DFCKSR-----FGRRTVILCDQCEREYHVGCLKDHGM--------------EDLQELPKG 855
DF + F +T+I CDQCEREYHVGC++ G E+ +E P
Sbjct: 555 DFDATAAGGGGFTDKTIIYCDQCEREYHVGCVRRRGSEEEEESAAEWCRRPEEQEEWP-- 612
Query: 856 KWLCCADCKRINLALQKLVDRGEEK---LPET--------SLDVIKKKHEESGSDNAVDF 904
WLC +C + LQ L E+ +P T SL +++++ S A
Sbjct: 613 -WLCSPECGEVFRHLQGLAAVARERSIPIPTTVPTTVEGVSLSILRRRRRRPISMVATGS 671
Query: 905 DVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQ 962
+ ++ + L A+ + H+ F +IE ++ DL +V+ R R
Sbjct: 672 GCQ-----EEEEEEDAAEHGQLCSALDVLHECFVTLIEPRTQTDLTADIVFNRESELRRL 726
Query: 963 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1022
++ G Y L +++ G R+ G E+AELPLV T + QG L +EK+L
Sbjct: 727 NFRGYYVVGLEKAGELITVGTLRVLGTEVAELPLVGTRFAHRRQGMCHLLVTELEKVLRQ 786
Query: 1023 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
+ V+ LVLP+ E +WT GF MT + + ++ ++ FQGT+M K
Sbjct: 787 VGVRRLVLPAVPELLPMWTASLGFHPMTRSDVMEIAAEHAILSFQGTTMCHK 838
>gi|449458532|ref|XP_004147001.1| PREDICTED: uncharacterized protein LOC101209468 [Cucumis sativus]
Length = 1329
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 202/437 (46%), Gaps = 85/437 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK-------------- 704
GI C+CC+ ++ S+FE HAG + +P ++IY G SL + ++
Sbjct: 598 GIHCSCCDEVITISKFEMHAG-SRVGQPLENIYVHTGSSLLQCLLESWNKQNEPQCKGYN 656
Query: 705 ---LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
+ +E P NDD CGIC DGGDL+CCDSCP FH C+ + PSG WHC YC
Sbjct: 657 FVDVDVEDP-----NDDTCGICGDGGDLICCDSCPSTFHQSCLDIKKFPSGPWHCLYC-- 709
Query: 762 TFQKEKFVEYNANARAAGRIE-GVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
+ + G++ G+ P E VLC+
Sbjct: 710 ------------SCKVCGQVTIGLHPMDD------------HHEAAADVLCK-------- 737
Query: 821 FGRRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
CD CE +YH C++ ++ D P C C+ ++ LQ+L+ ++
Sbjct: 738 --------CDLCEEKYHPICVQMNNASGDDVNNP---LFCGKKCQMLHERLQRLLGVRQD 786
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
S +I++ SD VD DV +K+ + + L+ A+ + + F P
Sbjct: 787 MKEGFSWTLIRR------SD--VDSDVSLCNEVAQKIKCN----SELAVALFVMDECFLP 834
Query: 940 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+I+ S ++LI ++Y G + ++ G Y AIL + V+ A RI G ELAE+P +
Sbjct: 835 VIDHRSGINLIHNILYNCGSNFTRLNFSGFYTAILEKDDEVICAASLRIHGNELAEMPFI 894
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
T + QG + IE +L LNV+ LV+P+ SE + W + FGF + E + +
Sbjct: 895 GTRYMYRRQGMCRRFLSAIESVLSSLNVEKLVIPAISEVRDTWISVFGFKPLDETTKQRM 954
Query: 1058 RNDYPLMIFQGTSMLQK 1074
R L++F G MLQK
Sbjct: 955 RK-MSLLVFPGVEMLQK 970
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGIL---EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L ++ GIL E + M R ++VK L G + GI C C D +V+T +
Sbjct: 563 LAWMIDLGILSLDEKVQYMNQRKTRVK------LEGRLTRDGIHCSCCD----EVITISK 612
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P E IY+ G +L
Sbjct: 613 FEMHAGSRVGQPLENIYVHTGSSL 636
>gi|414872769|tpg|DAA51326.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 1370
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 131/446 (29%), Positives = 208/446 (46%), Gaps = 78/446 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 702
L G+ GI C+CCN S +F HAG + KP+++I +G+ + L +
Sbjct: 922 LLEGFATRDGINCSCCNEVYSVLEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 979
Query: 703 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
++ +ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 980 MQSDVERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEVLPSDYWC 1039
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F E + G + A + D
Sbjct: 1040 CANCSCKFCHEHSSD------------GAEDTADV------------------------D 1063
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
+ ++ C QCE +YH C + G C C+ + LQ L+
Sbjct: 1064 Y--------SLHTCSQCEEQYHEACSPETDSITNLSSQTGNLFCQQSCRLLFEELQNLLA 1115
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRALLSKAVSIFH 934
++ PE S V+++ HE DV VL K+V+ + + ++ A+S+
Sbjct: 1116 VKKDLEPEYSCRVVQRIHE----------DVPEEVLALDKRVECN----SRIAVALSLMD 1161
Query: 935 DRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELA 992
+ F PII+ + ++LI +VY G + D+ G Y IL +++A RI G +LA
Sbjct: 1162 ECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLA 1221
Query: 993 ELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEE 1052
E+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + +
Sbjct: 1222 EMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDS 1281
Query: 1053 EQNKYRNDYPLMIFQGTSMLQKPVPK 1078
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1282 EKQEVKS-ISMLVFPGTGLLQKPLLK 1306
>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 135/464 (29%), Positives = 207/464 (44%), Gaps = 77/464 (16%)
Query: 632 GGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
G + A+V Y R + + L+ G+ GI C CC+ + S+FE HAG +P Q+I
Sbjct: 152 GTVPLNAKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAG-MKLCEPSQNI 210
Query: 691 YTSNGVSLHELSIK------------LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRA 738
G+SL + + L + NDD CGIC DGGDL+CCD CP
Sbjct: 211 ILETGISLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPST 270
Query: 739 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 798
FH C+ + PSG WHC YC F F+ + +
Sbjct: 271 FHQSCLDIQKFPSGDWHCIYCSCKF--------------------CGMFSGNTDQMNYNL 310
Query: 799 QTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 858
D+ L + C CE +YH C + G + + +
Sbjct: 311 DVNDSAL---------------------LTCQLCEEKYHHMCTQ--GEDSILDDSSSPSF 347
Query: 859 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKV 916
C C+ + LQ L+ E S ++++ V FD+ L G +KV
Sbjct: 348 CGKTCRELFEQLQMLLGVKHELEDGFSWTLVQR--------TEVGFDIS---LNGIPQKV 396
Query: 917 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTV 974
+ + + L+ A+SI + F PI++ S ++LI ++Y G + +Y G + AIL
Sbjct: 397 ECN----SKLAVALSIMDECFLPIVDQRSGINLIHNVLYNCGSNFNRLNYSGFFTAILER 452
Query: 975 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1034
+ ++SA RI G +LAE+P + T + + QG + L IE L LNV+ LV+P+ S
Sbjct: 453 GEEIISAASIRIHGNKLAEMPFIGTRHIYRRQGMCRRLLNAIESALHSLNVEKLVIPAIS 512
Query: 1035 EAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
E WT+ FGF + + + RN +++F GT MLQKP+ K
Sbjct: 513 ELMQTWTSVFGFKPLEVSSRKEMRN-MNMLVFHGTDMLQKPLLK 555
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 136/492 (27%), Positives = 211/492 (42%), Gaps = 86/492 (17%)
Query: 604 SKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGF----G 659
SK ++ V + I K L + + + + +V Y V+G + + G G
Sbjct: 609 SKKRAQKVSAPSLINHKPLNVLSYLIDNSIILPRCKVYYKVKGRHRKVCTLADGKITRDG 668
Query: 660 ILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP--------- 710
I C CC S FE HA +S +P I+ +G SL + IK+ +
Sbjct: 669 IKCNCCMGIYSFVGFENHASGSSTCRPSARIFLEDGRSLLDCQIKMMHDHKTRETTGKSF 728
Query: 711 --FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 768
S END +C +C GG+L+ CD CP +FH C+ L IP+G W C C ++
Sbjct: 729 SGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR- 787
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
+I+G D Q++
Sbjct: 788 -----------KIDGDDEVGQLLP------------------------------------ 800
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKL----VDRGEEKLPET 884
C QCE +YHV CL ++G D+ G W C DC++I L KL V G + L T
Sbjct: 801 CIQCEHKYHVRCL-ENGAADISTRYLGNWFCGKDCEKIYEGLHKLLGEPVSVGVDNLTWT 859
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
+ I E S K D + + L+ A+S+ H+ F+P+ ES
Sbjct: 860 LVKFINPDSCEHDSS---------------KSDLLAESYSKLNLAISVMHECFEPLKESL 904
Query: 945 SKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
+ DL+ +++ R ++ G Y +L N+ ++S R++G+++AE+PLV T
Sbjct: 905 TNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVATVRVYGKKVAEIPLVGTRLQ 964
Query: 1003 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1062
+ +G L +EK L L V+ LVLP+ WT FGF+ MT E++++ DY
Sbjct: 965 YRRRGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRSFGFAKMTNLERSQFL-DYT 1023
Query: 1063 LMIFQGTSMLQK 1074
+ FQG M QK
Sbjct: 1024 FLDFQGAIMCQK 1035
>gi|388507928|gb|AFK42030.1| unknown [Lotus japonicus]
Length = 135
Score = 176 bits (447), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 81/131 (61%), Positives = 105/131 (80%), Gaps = 1/131 (0%)
Query: 953 MVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1012
MVYGR+ RGQ++ GMYCA+L VN VVSA + RIFG ++AELPLVATSN G+GYFQ+L
Sbjct: 1 MVYGRNVRGQEFGGMYCALLVVNSSVVSAAMLRIFGSDVAELPLVATSNGNHGKGYFQTL 60
Query: 1013 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSM 1071
F CIE+LL F+NVK+LVLP+A EA++IWT+KFGFS M +E + YR ++ M+ F+GT+M
Sbjct: 61 FSCIERLLAFMNVKSLVLPAAEEAESIWTDKFGFSRMKPDELSDYRKNFNQMVTFKGTNM 120
Query: 1072 LQKPVPKCRIV 1082
L K VP CRI+
Sbjct: 121 LHKLVPPCRII 131
>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
Length = 1254
Score = 176 bits (447), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 131/472 (27%), Positives = 206/472 (43%), Gaps = 97/472 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
+ + G +E +V Y R +V L G+ GI C CC+ ++ S+FE HAG + +P+
Sbjct: 610 LIDSGTVELSQKVQYRRRKKV-MLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPY 667
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSC 735
Q+IY +GVSL + I + + K NDD CGIC DGGDL+CCD C
Sbjct: 668 QNIYLESGVSLLQCQIDAWNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDGC 727
Query: 736 PRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCI 795
P FH C+ + +P G WHC C F C
Sbjct: 728 PSTFHQSCLDIQMLPPGEWHCPNCTCKF------------------------------CG 757
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 855
+T D + + RT IL CE++YH C K+ M+ L
Sbjct: 758 IASETSDKDDASVNVL------------RTCIL---CEKKYHDSCTKE--MDTLPNNINS 800
Query: 856 KWL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
L C +CK ++ L+K + G+ + ++ W ++
Sbjct: 801 SSLSFCGKECKELSEYLKKYL----------------------GTKHELEAGFSWCLIHR 838
Query: 914 KKVDASDGTRAL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQ 962
D+ R L L+ A+++ + F P+I+ S ++LI ++Y G +
Sbjct: 839 SDEDSEAACRGLTQRVECNSKLAIALTVMDECFLPVIDRRSGINLIRNILYNSGSNFSRL 898
Query: 963 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1022
Y G Y AIL +++A R G ++AE+P + T + + QG + LF IE L
Sbjct: 899 SYSGFYTAILERGDEIIAAASIRFHGTKIAEMPFIGTRHIYRRQGMCRRLFSAIELALCS 958
Query: 1023 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
L V+ LV+P+ +E WT FGF+ + E + + ++ +M+F G MLQK
Sbjct: 959 LKVEKLVIPAVAELTHTWTTVFGFTYLDESLRQEMKS-LNMMVFPGIDMLQK 1009
Score = 43.5 bits (101), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 607 LAWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 656
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 657 LHAGSKLPQPYQNIYLESGVSL 678
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 879 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 936
Query: 708 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 937 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 996
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F +E + + A I VD
Sbjct: 997 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 1021
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 1022 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 1072
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 1073 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 1119
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 1120 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 1179
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1180 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1239
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 1240 KQEVKS-TSMLVFPGTGLLQKPLLK 1263
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 724 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 781
Query: 708 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 782 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 841
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F +E + + A I VD
Sbjct: 842 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 866
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 867 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 917
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 918 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 964
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 965 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 1024
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1025 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1084
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 1085 KQEVKS-TSMLVFPGTGLLQKPLLK 1108
>gi|302850261|ref|XP_002956658.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
gi|300258019|gb|EFJ42260.1| hypothetical protein VOLCADRAFT_119505 [Volvox carteri f.
nagariensis]
Length = 3077
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 143/272 (52%), Gaps = 29/272 (10%)
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
CVLC +F + F +TV++CDQCE+EYH+GCL+ H M D+Q +P+G+W C +C RI
Sbjct: 1786 CVLCHQPEFDREGFSDQTVLICDQCEKEYHIGCLRKHKMVDMQAVPEGEWFCSDECVRIR 1845
Query: 868 LALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLS 927
L K ++ GE + S N RW+ +RG+ D + T L
Sbjct: 1846 ELLTKSLEEGETTM----------------SGNPA---YRWQFIRGR--DGTKATARALK 1884
Query: 928 KAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQV-------VVS 980
+ I + FDPII++ S DL+P MV+ S D+ GMY +L V
Sbjct: 1885 TVLEILQESFDPIIDNGSGEDLLPRMVHAESAGDYDFQGMYSILLRYRGADKEARGRPVL 1944
Query: 981 AGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIW 1040
AG+ R+ G +AE+PLVAT DC+ QG+ ++L + L L V+ +VLP+ ++A W
Sbjct: 1945 AGLVRVLGSSMAEVPLVATRYDCRRQGHLRALVEGLRHRLIALGVRAMVLPATADALPAW 2004
Query: 1041 TNKFGFSMMTEEEQNKYRNDYPLMIFQGTSML 1072
+ F + E R ++ ++IF TS++
Sbjct: 2005 -RQLAFMDLDEGSVRVARGEHRMIIFPHTSVV 2035
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 9/108 (8%)
Query: 606 MKSSSVKSHGKITRKDLRMHKLVFEEG--GLEDGAEVGYFVRGE-----VKFLVGYKKGF 658
M++S+ ++ R+ +R L F+ G L+DG V Y ++G+ +V G
Sbjct: 1507 MRASAARTSAS-QRRPIRSRTL-FDGGPDSLQDGERVHYTIQGQRLLSGTVVIVQRTAGS 1564
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS 706
GILC CC+ +S S FE+HAG RR P++ I T NG +L ++ +++
Sbjct: 1565 GILCDCCSKVISASAFESHAGHKHRRNPYESIMTDNGTTLKRIAERIT 1612
>gi|359478537|ref|XP_002278840.2| PREDICTED: uncharacterized protein LOC100243375 [Vitis vinifera]
Length = 1332
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 88/441 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL--------SLERP 710
GI C+CC S S+FEAHAG +S + +I+ +G SL E +++ +
Sbjct: 849 GIKCSCCQEVFSLSRFEAHAG-SSYHRSAANIFLEDGRSLLECQMQIIRDITGKGFTKES 907
Query: 711 FSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
FS K END +C +C GGDL+ CD CP +FH C+ L +P G W C C
Sbjct: 908 FSRKKSNERHHENDHICSVCHYGGDLVLCDHCPSSFHKSCLGLKTLPEGDWFCPSCC--- 964
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 823
C +C F
Sbjct: 965 --------------------------------------------CGICGENKFDGGSEQD 980
Query: 824 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEE 879
V C QCER+ CL+ G L P G W C CK+I L LQKL+ + G +
Sbjct: 981 NVVFSCYQCERQ---CCLRKWGHVKLASYPNGTWFCSKQCKKIFLGLQKLLGKSFPVGVD 1037
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
L T L I+ K ++ D+ ++A + L+ A+ + H+ F+P
Sbjct: 1038 NLTWTLLKPIRSK--------GLEIDL-------PDIEALTEVYSKLNIALGVMHECFEP 1082
Query: 940 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+ E ++ D++ +++ G ++ G Y +L N ++S R++G+++AE+PL+
Sbjct: 1083 VKEPHTRRDVVEDVIFCRGSDLNRLNFQGFYTVLLERNDELISVATVRVYGEKVAEVPLI 1142
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
T + G L +EK L L V+ LVLP+ WT FGFS MT+ E+ ++
Sbjct: 1143 GTRFQYRRLGMCHILMNELEKKLMELGVERLVLPAVPSVLNTWTTSFGFSKMTDSERLRF 1202
Query: 1058 RNDYPLMIFQGTSMLQKPVPK 1078
DY + FQ T M QK + K
Sbjct: 1203 L-DYSFLDFQDTVMCQKLLMK 1222
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 133/473 (28%), Positives = 206/473 (43%), Gaps = 99/473 (20%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
+ + G +E +V Y R +V L G+ GI C CC+ ++ S+FE HAG + +P+
Sbjct: 611 LIDSGTVELSQKVQYRRRKKV-MLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPY 668
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSC 735
Q+IY +GVSL + I + + K NDD CGIC DGGDL+CCD C
Sbjct: 669 QNIYLESGVSLLQCQIDAWNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDGC 728
Query: 736 PRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCI 795
P FH C+ + +P G W C C F A+G E D
Sbjct: 729 PSTFHQSCLDIQMLPPGEWRCMNCTCKFC----------GIASGTSEKDDASV------- 771
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 855
CVL +C+ CE++YH C K+ M+ L
Sbjct: 772 ------------CVL----------------HICNLCEKKYHDSCTKE--MDTLPNNINS 801
Query: 856 KWL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHE-ESGSDNAVDFDVRWRVLR 912
L C +CK ++ L+K + KHE ESG W ++
Sbjct: 802 SSLSFCGKECKELSEHLKKY---------------LGTKHELESG--------FSWSLIH 838
Query: 913 GKKVDASDGTRAL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRG 961
D+ R + L+ +++ + F P+I+ S ++LI ++Y G +
Sbjct: 839 RTDDDSEAACRGISQRVECNSKLAITLTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSR 898
Query: 962 QDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLG 1021
Y G Y AIL +++A R G ++AE+P + T + + QG + LF IE L
Sbjct: 899 LSYSGFYTAILERGDEIIAAASIRFHGTQIAEMPFIGTRHIYRRQGMCRRLFSAIESTLC 958
Query: 1022 FLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
L V+ LV+P+ +E WT FGF+ + + + + ++ +M+F G MLQK
Sbjct: 959 SLKVEKLVIPAIAEVTNTWTTVFGFTHLDKSLRQEMKS-LNMMVFPGIDMLQK 1010
Score = 43.1 bits (100), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 608 LAWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 657
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 658 LHAGSKLPQPYQNIYLESGVSL 679
>gi|242038141|ref|XP_002466465.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
gi|241920319|gb|EER93463.1| hypothetical protein SORBIDRAFT_01g008195 [Sorghum bicolor]
Length = 1370
Score = 174 bits (441), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 126/445 (28%), Positives = 203/445 (45%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER 709
L G GI C+CC+ +S +F AHAG + P+++I +G + L ++
Sbjct: 923 LLEGIVTRDGIDCSCCSKVLSVLEFVAHAG-SEVNTPYRNILV-DGQDIDLLHCLINAWN 980
Query: 710 PFSSKE--------------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
S E NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 981 MQSDAEKQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEELPSDYWC 1040
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C KF ++N +G + A + S
Sbjct: 1041 CANCSC-----KFCHEHSN-------DGAEDTADVDS----------------------- 1065
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH C ++ G C C+ + LQ L+
Sbjct: 1066 ---------SLHTCSQCEEQYHEACSPENDSITNLSSQTGNLFCQQSCRLLFEELQNLLA 1116
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S V+++ HE DV VL +D + ++ A+S+ +
Sbjct: 1117 VKKDLEPEYSCRVVQRIHE----------DVPEEVL---PLDTRVECNSKIAVALSLMDE 1163
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+ G Y IL +++A RI G +LAE
Sbjct: 1164 CFLPIVDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAE 1223
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT++FGFS + + E
Sbjct: 1224 MPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSRFGFSPLEDSE 1283
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 1284 KEEVKS-ISMLVFPGTGLLQKPLLK 1307
>gi|414888237|tpg|DAA64251.1| TPA: hypothetical protein ZEAMMB73_186624 [Zea mays]
Length = 771
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 134/514 (26%), Positives = 216/514 (42%), Gaps = 104/514 (20%)
Query: 616 KITRKDLRMHK------LVFEEGGLEDGAEVGYF-VR-GEVKFLVGYKKGFGIL-CTCCN 666
++ R D H+ ++ + G L V Y+ VR G+ G+ G GI+ C CCN
Sbjct: 307 EMVRADGTQHRSRTLLSVLIDSGILAPRGNVTYWAVRDGQPACKYGFITGEGIIRCKCCN 366
Query: 667 SEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKL----------------- 705
+ ++ EAHA G RR+ + ++ +G SL ++L
Sbjct: 367 KTFTVAELEAHATGGIGTDDRREAWARVFVEDGRSLSLCLMELMRRDDVGAAAANRNRNG 426
Query: 706 ---SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 762
++ S +E D +C +C+D G+LL CD CP AFH CV L P G W C C
Sbjct: 427 SVMRVKEKCSEEEGDSVCSVCIDSGELLLCDKCPSAFHHACVGLQATPEGDWCCPLCR-- 484
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF---CKS 819
C +C G D
Sbjct: 485 ---------------------------------------------CGVCGGSDLDDDTAE 499
Query: 820 RFGRRTVILCDQCEREYHVGCLKDHGMED------LQELPKGK--------WLCCADCKR 865
F +T+I C+QCEREYHVGC++ G E+ + L + + WLC +C
Sbjct: 500 GFTDKTIIYCEQCEREYHVGCMRRGGSEEESAAEWCRRLSESEGPEEEWRPWLCSPECGE 559
Query: 866 INLALQKLVDRGEEK-LPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 924
+ LQ LV + +P S + + + R+ R + +
Sbjct: 560 VFQHLQALVASSRARSIPHYS----RGAYHSAPCGRRRYMSTVTRITRWQHEEEDAADHG 615
Query: 925 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAG 982
L A+ + H+ FD ++E ++ DL +V+ + R ++ G Y L +++ G
Sbjct: 616 QLCAALDVLHECFDDMVEPRTQTDLAADIVFNQESGLRRLNFRGYYVVGLEKAGELINVG 675
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
R+FG ++AELPLV T + QG + L +EK+L + V+ LVLP+ E +WT
Sbjct: 676 TLRVFGNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELMPMWTA 735
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
GF MT + + ++ ++ F+GT+M QK +
Sbjct: 736 SLGFHAMTRSDVMEMAVEHAILSFKGTTMCQKTL 769
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 124/445 (27%), Positives = 199/445 (44%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 860 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 917
Query: 708 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 918 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 977
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F +E + + A I VD
Sbjct: 978 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 1002
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 1003 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 1053
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 1054 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 1100
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+ G Y +L +++A RI G +LAE
Sbjct: 1101 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFRGFYIFVLERGDEIIAAASVRIHGTKLAE 1160
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 1161 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 1220
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 1221 KQEVKS-TSMLVFPGTGLLQKPLLK 1244
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 132/447 (29%), Positives = 203/447 (45%), Gaps = 81/447 (18%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + + +F AHAG KP++++ +G+ L +S
Sbjct: 896 LLEGLVTRDGIQCRCCSKDFALLEFVAHAG-GQVSKPYRNVLV-DGLDKDLLHCLISAWD 953
Query: 708 -----ER----PFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 954 KQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWR 1013
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C KF + ++N A A++ S C
Sbjct: 1014 CANCCC-----KFCQEHSNDDAPD-------IAEVDSLCT-------------------- 1041
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK--GKWLCCADCKRINLALQKL 873
C QCE YH C + E+ +P G C C+ + LQ L
Sbjct: 1042 -------------CSQCEENYHPVCSPE--TENPSSVPSQAGDLFCQQSCRLLFEELQNL 1086
Query: 874 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 933
+ ++ PE + +IK HE+ + A+ D R + ++ A+S+
Sbjct: 1087 LAVKKDLEPEFACRIIKCIHEDV-PETALALDERVEC------------NSKIAVALSLM 1133
Query: 934 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
+ F PII+ + ++LI +VY G + D+ G Y IL +VSA RI G +
Sbjct: 1134 DECFLPIIDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKC 1193
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AE+P + T N + QG + L IE +L L V+ L++P+ SE WT+KFGFS +
Sbjct: 1194 AEMPFIGTRNMYRRQGMCRRLLDGIEMILSSLKVQKLIIPAISELVDTWTSKFGFSPLEV 1253
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1254 SEKQEVKS-ISMLVFPGTGLLQKPLLK 1279
>gi|145335136|ref|NP_563736.3| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|186478156|ref|NP_001117233.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|8778713|gb|AAF79721.1|AC005106_2 T25N20.3 [Arabidopsis thaliana]
gi|332189710|gb|AEE27831.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|332189711|gb|AEE27832.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 131/465 (28%), Positives = 209/465 (44%), Gaps = 77/465 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+ E G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG S +
Sbjct: 523 WLIESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-Q 581
Query: 686 PFQHIYTSNGVSLHELSIK----------LSLER--PFSSKENDDLCGICMDGGDLLCCD 733
PFQ+IY +G SL + ++ L+L + NDD CGIC DGGDL+CCD
Sbjct: 582 PFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCD 641
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP +H +C+ + +PSG WHC C F A A+G +G F ++S
Sbjct: 642 GCPSTYHQNCLGMQVLPSGDWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS- 690
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
C CER YH CL D + +Q
Sbjct: 691 -----------------------------------CGMCERRYHQLCLNDEAHK-VQSFG 714
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
C C + LQK + E S +I + +S +++ +
Sbjct: 715 SASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSA 765
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
++++ + L+ ++I + F PI++ S +DLI ++Y G + +Y G Y AI
Sbjct: 766 QRIE----NNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAI 821
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L ++SA R G +LAE+P + T + + QG + LF IE + L V+ LV+P
Sbjct: 822 LERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIP 881
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ + WT FGF+ + + + + R+ ++F G MLQKP+
Sbjct: 882 AIPDFLHAWTGNFGFTPLDDSVRKEMRS-LNTLVFPGIDMLQKPL 925
Score = 44.3 bits (103), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 48/84 (57%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 521 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 570
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 571 FEIHAGSKSCQPFQNIYLESGASL 594
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 332 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 389
Query: 708 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 390 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 449
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F +E + + A I VD
Sbjct: 450 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 474
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 475 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 525
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 526 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 572
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 573 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 632
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 633 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 692
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 693 KQEVKS-TSMLVFPGTGLLQKPLLK 716
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 133/465 (28%), Positives = 212/465 (45%), Gaps = 79/465 (16%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPF 687
+ + G +E +V Y R +V L G+ GI C CC+ ++ S+FE HAG + +P+
Sbjct: 626 LIDSGTVELSQKVQYRRRKKV-MLEGWITRDGIHCGCCSKILTVSKFELHAG-SKLPQPY 683
Query: 688 QHIYTSNGVSLHELSIKLSLERPFSSK------------ENDDLCGICMDGGDLLCCDSC 735
Q+IY +GVSL + I+ + S K NDD CGIC DGGDL+CCD C
Sbjct: 684 QNIYLESGVSLLQCQIEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDGC 743
Query: 736 PRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCI 795
P FH C+ + +P G WHC C F A+G E D +
Sbjct: 744 PSTFHQSCLDIQMLPLGEWHCPNCTCKFC----------GIASGNSEKDDA-------SV 786
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG 855
++Q +C+ CE++YH C K+ M++L
Sbjct: 787 YVLQ----------------------------ICNLCEKKYHDSCTKE--MDNLPNNINT 816
Query: 856 KWL--CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
L C +CK ++ L+K + E S +I + E+S A + RV
Sbjct: 817 SSLSFCGKECKELSEHLKKYLGTKHELEAGFSWSLIHRIDEDS---EAACRGISQRVECN 873
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
K L+ A+++ + F P+I+ S ++LI ++Y G + +Y G Y A
Sbjct: 874 SK----------LAIALTVMDECFLPVIDRRSGINLIRNVLYNSGSNFSRLNYSGFYTAT 923
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L ++++ R G ++AE+P + T + + QG + LF IE L L V+ LV+P
Sbjct: 924 LERGDEIIASASIRFHGTQIAEMPFIGTRHMYRRQGMCRRLFSAIESTLCSLKVEKLVIP 983
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ +E WT FGF+ + E + + ++ +M+F G ML KP+
Sbjct: 984 AIAELTNTWTTVFGFTHLDESLRQEMKS-LNMMVFPGIDMLMKPL 1027
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 11/82 (13%)
Query: 407 LKNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
L ++SG +E V Y R KV L G + GI C C C ++++T + FE
Sbjct: 623 LSWLIDSGTVELSQKVQYRRRKKVM------LEGWITRDGIHCGC--C--SKILTVSKFE 672
Query: 466 LHAGSSNKRPPEYIYLENGKTL 487
LHAGS +P + IYLE+G +L
Sbjct: 673 LHAGSKLPQPYQNIYLESGVSL 694
>gi|110741207|dbj|BAF02154.1| hypothetical protein [Arabidopsis thaliana]
Length = 1138
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/465 (27%), Positives = 208/465 (44%), Gaps = 77/465 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+ E G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG S +
Sbjct: 523 WLIESGVVQLRQKVQYMRRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-Q 581
Query: 686 PFQHIYTSNGVSLHELSIK----------LSLER--PFSSKENDDLCGICMDGGDLLCCD 733
PFQ+IY +G SL + ++ L+L + NDD CGIC DGGDL+CCD
Sbjct: 582 PFQNIYLESGASLLQCQVRAWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCD 641
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP +H +C+ + +PSG WHC C F A A+G +G
Sbjct: 642 GCPSTYHQNCLGMQVLPSGDWHCPNCTCKFCDA--------AVASGGKDG---------- 683
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
+++ C CER YH CL D + +Q
Sbjct: 684 ----------------------------NSISLLSCGMCERRYHQLCLNDEAHK-VQSFG 714
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
C C + LQK + E S +I + +S +++ +
Sbjct: 715 SASSFCGPKCLELFEKLQKYLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSA 765
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
++++ + L+ ++I + F PI++ S +DLI ++Y G + +Y G Y AI
Sbjct: 766 QRIE----NNSKLAVGLAIMDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAI 821
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L ++SA R G +LAE+P + T + + QG + LF IE + L V+ LV+P
Sbjct: 822 LERGDEIISAASLRFHGMQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIP 881
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ + WT FGF+ + + + + R+ ++F G MLQKP+
Sbjct: 882 AIPDFLHAWTGNFGFTPLDDSVRKEMRS-LNTLVFPGIDMLQKPL 925
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 48/84 (57%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 521 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 570
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 571 FEIHAGSKSCQPFQNIYLESGASL 594
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 198/433 (45%), Gaps = 83/433 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS--------LERP 710
GI CTCC + S F HAG +S +P I+ +G SL + +++ E+P
Sbjct: 716 GIRCTCCQNLYGLSGFANHAGGSSNCRPSACIFLKDGRSLLDCMMEVMQDHRTREITEKP 775
Query: 711 FSS---KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
+ END++C +C GG+L+ CD CP A+H +C++L GIP G W C C
Sbjct: 776 HNDLFEGENDNICSVCNYGGELILCDQCPSAYHKNCLNLEGIPDGDWFCPSCR------- 828
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
C +C ++ G +
Sbjct: 829 ----------------------------------------CGICGQNKIEETEDGH--FL 846
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDR----GEEKLPE 883
C QCE +YHV CL++ +D + K W C +C+R+ LQ L+ + G + L
Sbjct: 847 TCIQCEHKYHVECLRNGEKDDSRRCMKN-WFCGEECERVYTGLQNLLGKPVLVGADNLTW 905
Query: 884 TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIES 943
T ++K + E+ + G + D + LS A+S+ H+ F+P+
Sbjct: 906 T---LVKYVNSETCG------------VGGAESDLVVENYSKLSVALSVMHECFEPLHNP 950
Query: 944 ASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1001
S D++ +++ RS + ++ G Y +L N+ ++S RIFG+++AE+PLV T
Sbjct: 951 FSSRDIVEDVIFNQRSELNRLNFQGFYTVLLERNEELISVATVRIFGEKIAEVPLVGTRF 1010
Query: 1002 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1061
+ G + L +EK L L V+ LVLP+ WTN FGF MT E++++ DY
Sbjct: 1011 QYRRLGMCRVLMDELEKKLKQLGVERLVLPAVPGVLDTWTNSFGFEQMTNFERSQFL-DY 1069
Query: 1062 PLMIFQGTSMLQK 1074
+ FQGT M QK
Sbjct: 1070 SFLDFQGTVMCQK 1082
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 194/443 (43%), Gaps = 86/443 (19%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG--VSLHEL--------- 701
G K GI+C CCN +S + F+ HAG+ R P +++ +G +L +L
Sbjct: 790 GLIKKEGIMCKCCNMVLSVTNFKNHAGFKQSR-PCLNVFMKSGKPFTLCQLQAWSAEYKT 848
Query: 702 --SIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
S + + R ENDD CG+C DGG+L+CCD+CP FH C+S +P G+W+C C
Sbjct: 849 RKSRTIKVVRTADDDENDDSCGLCGDGGELICCDNCPSTFHQACLSTEELPEGSWYCPNC 908
Query: 760 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 819
C +C K
Sbjct: 909 T-----------------------------------------------CWICGELVNDKE 921
Query: 820 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKG----KWLCCADCKRINLALQKLVD 875
C QCE +YH C K+ + + KG W C C+ + LQ V
Sbjct: 922 DINSSNAFKCSQCEHKYHDSCWKN------KTIGKGGASDTWFCGGSCQAVYFGLQSRVG 975
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++K HE+ +V +++ + L+ A++I +
Sbjct: 976 IINHIADGVCWTLLKCIHEDQ------------KVHSAQRLALKAECNSKLAVALTIMEE 1023
Query: 936 RFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F +++ + +D+IP ++Y RS + ++HG Y +L + V++S RI G +AE
Sbjct: 1024 CFQSMVDPRTGIDMIPHVLYNWRSEFARLNFHGFYTVVLEKDDVLLSVASIRIHGATVAE 1083
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+PL+AT ++ + QG + L IE++L V+ LV+ + + WT FGF+ M+ +E
Sbjct: 1084 MPLIATCSNYRRQGMCRRLMTAIEEMLISFKVEKLVVSAIPDLVETWTEGFGFTPMSNDE 1143
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPV 1076
+ N LM+F GT +L+KP+
Sbjct: 1144 KQSL-NKINLMVFPGTILLKKPL 1165
>gi|224067206|ref|XP_002302408.1| predicted protein [Populus trichocarpa]
gi|222844134|gb|EEE81681.1| predicted protein [Populus trichocarpa]
Length = 923
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 192/435 (44%), Gaps = 83/435 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK---------LSLER 709
GI C CC + + FE+HAG ++ +P +I +G SL + K ++ E
Sbjct: 550 GIECNCCQKIFTLTGFESHAG-STNHRPAANIILEDGRSLLDCQRKKKPRIKMQRVTREA 608
Query: 710 PFSSKEN------DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
+ ++N D +C +C DGGDL+ CD CP FH +CV L IP G W C C
Sbjct: 609 KWKGRQNQHQGETDYICSVCHDGGDLIVCDHCPSTFHKNCVGLEDIPEGEWFCPPCCCGI 668
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTP-DTELGGCVLCRGRDFCKSRFG 822
E +YN VQ P D+ L
Sbjct: 669 CGENKFKYN-------------------------VQEPKDSRL----------------- 686
Query: 823 RRTVILCDQCEREYHVGCLKDHGMEDLQEL-PKGKWLCCADCKRINLALQKLVDRGEEKL 881
+ CDQCER+YH+GCL++ G+ L+ PK W C C+ I + LQ L+ +
Sbjct: 687 ----LSCDQCERKYHIGCLRNKGVVKLKRKDPKDSWFCSNKCEDIFIGLQTLLGKSVVVG 742
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
P+ + K + D V+A G + L AV + H+ F+P
Sbjct: 743 PDNLTWTLWKFMDSDSCD----------------VEAPTGKHSKLDLAVEVIHECFEPAT 786
Query: 942 ESASKLDLIPAMVYGR--SHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
E+ + D+ +++ R + ++ G Y +L N +++ R+FG ++AE+PLV T
Sbjct: 787 ETYTGRDIAEDVIFSRECNLNRLNFRGFYTVLLERNDELIAVANVRVFGDKVAEIPLVGT 846
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ G + L +EK L L V+ L+LP+ W N FGFS +T+ E+ +Y
Sbjct: 847 RFLFRRLGMCKILMDELEKQLMNLGVERLMLPAVPSVLYTWINGFGFSKLTDAEKMQYL- 905
Query: 1060 DYPLMIFQGTSMLQK 1074
D+ + F GT QK
Sbjct: 906 DHTFLDFPGTIKCQK 920
Score = 46.6 bits (109), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 71/159 (44%), Gaps = 16/159 (10%)
Query: 340 NVVVERPLRRFTRSLLQQKVELAKGSLSKDGG----KRSDVTEVANDGVGGPVKQ--ETV 393
N + RP R T L++ + K ++ +R V E G +Q T
Sbjct: 439 NASLPRPRREKTIETLKKLRDYQKSHQEQNASPLKQRRGKVIETLEKPRDGQKRQSSRTA 498
Query: 394 MK---PRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFC 450
M+ PR K L +++ ++ + RGSK GPG RG + GI C C
Sbjct: 499 MQGVTPRSSKCKPRCALSWMIDNNLVSPGEKVSYRGSK--GPG-ELTRGRITREGIECNC 555
Query: 451 DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRD 489
C+ ++ T FE HAGS+N RP I LE+G++L D
Sbjct: 556 --CQ--KIFTLTGFESHAGSTNHRPAANIILEDGRSLLD 590
>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
Length = 1314
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 135/477 (28%), Positives = 218/477 (45%), Gaps = 89/477 (18%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G ++ +V Y R + + ++ G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 603 LIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEIHAG-SKLRQP 661
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSK-------------ENDDLCGICMDGGDLLCCD 733
FQ+I+ +G+SL + + + R SK NDD CGIC DGGDL+CCD
Sbjct: 662 FQNIFLESGLSLLQCQ-RDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCD 720
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP FH C+ + P G WHC C +Y A
Sbjct: 721 GCPSTFHQSCLDILIPPPGDWHCPNCT--------CKYCGVA------------------ 754
Query: 794 CIRIVQTPDT---ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DL 849
I I Q +T E+ C+L CE+++H C +E D
Sbjct: 755 SIDICQGDNTSVSEISTCIL---------------------CEKKFHESC----NLEMDT 789
Query: 850 QELPKG--KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVR 907
G C C+ + +LQK + E S +I++ E+S DV
Sbjct: 790 PVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRRTSEDS--------DVS 841
Query: 908 WRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYH 965
R L ++++++ + L+ A+++ + F PI++ S ++LI ++Y G + +Y
Sbjct: 842 VRGL-SQRIESN----SKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYS 896
Query: 966 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1025
G Y AIL ++SA R G +LAE+P + T + + QG + LFC IE L V
Sbjct: 897 GFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKV 956
Query: 1026 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1082
+ L++P+ +E W FGFS + + + R +++F GT MLQK + + IV
Sbjct: 957 EKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRL-MNMLVFPGTDMLQKLLIQETIV 1012
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 200/445 (44%), Gaps = 76/445 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSL-- 707
L G GI C CC+ + +F AHAG KP++++ +G+ L ++
Sbjct: 195 LLEGVTTRDGINCRCCSKVFTVLEFVAHAG-GPVSKPYRNVLV-DGLDTDLLHCLINAWD 252
Query: 708 -----ER----PFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
ER P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 253 KQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEALPSDDWR 312
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F +E + + A I VD
Sbjct: 313 CAKCSCKFCQEH------SRQDAQDIAEVDS----------------------------- 337
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
++ C QCE +YH GC + C C+ + L+ L+
Sbjct: 338 ---------SLCTCSQCEEKYHPGCSPETTNTSNVSSQACDLFCQQSCRLLFEGLRNLLA 388
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
++ PE S +I++ HE + V D R + ++ A+S+ +
Sbjct: 389 VKKDLEPEFSCRIIQRIHENV-PETVVALDERVEC------------NSKIAVALSLMDE 435
Query: 936 RFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F PI++ + ++LI +VY G + D+HG Y +L +++A RI G +LAE
Sbjct: 436 CFLPIVDQRTGINLIRNVVYNCGSNFVRMDFHGFYIFVLERGDEIIAAASVRIHGTKLAE 495
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + E
Sbjct: 496 MPFIGTRNMYRRQGMCRRLLDGIEMILSSLNVEKLIIPAIAELVDTWTSKFGFSSLDVSE 555
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ + ++ +++F GT +LQKP+ K
Sbjct: 556 KQEVKS-TSMLVFPGTGLLQKPLLK 579
>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
Length = 1213
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 135/477 (28%), Positives = 218/477 (45%), Gaps = 89/477 (18%)
Query: 628 VFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G ++ +V Y R + + ++ G+ GI C CC+ ++ S+FE HAG + R+P
Sbjct: 485 LIDSGTVQLSQKVRYMNRRQTRVMLEGWITRDGIHCGCCSKILTVSKFEIHAG-SKLRQP 543
Query: 687 FQHIYTSNGVSLHELSIKLSLERPFSSK-------------ENDDLCGICMDGGDLLCCD 733
FQ+I+ +G+SL + + + R SK NDD CGIC DGGDL+CCD
Sbjct: 544 FQNIFLESGLSLLQCQ-RDAWNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCD 602
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP FH C+ + P G WHC C +Y A
Sbjct: 603 GCPSTFHQSCLDILIPPPGDWHCPNCT--------CKYCGVA------------------ 636
Query: 794 CIRIVQTPDT---ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DL 849
I I Q +T E+ C+L CE+++H C +E D
Sbjct: 637 SIDICQGDNTSVSEISTCIL---------------------CEKKFHESC----NLEMDT 671
Query: 850 QELPKG--KWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVR 907
G C C+ + +LQK + E S +I++ E+S DV
Sbjct: 672 PVHSSGLVTSFCGKSCRELFESLQKNLGVKHELDAGFSWSLIRRTSEDS--------DVS 723
Query: 908 WRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYH 965
R L ++++++ + L+ A+++ + F PI++ S ++LI ++Y G + +Y
Sbjct: 724 VRGL-SQRIESN----SKLAVALTVMDECFLPIVDRRSGINLIHNVLYNCGSNFYRLNYS 778
Query: 966 GMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNV 1025
G Y AIL ++SA R G +LAE+P + T + + QG + LFC IE L V
Sbjct: 779 GFYTAILERGDEIISAATIRFHGTKLAEMPFIGTRHIYRRQGMCRRLFCAIESALRVFKV 838
Query: 1026 KTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1082
+ L++P+ +E W FGFS + + + R +++F GT MLQK + + IV
Sbjct: 839 EKLIIPAIAELMHTWNVIFGFSPLEPSLKQEMRL-MNMLVFPGTDMLQKLLIQETIV 894
>gi|255571928|ref|XP_002526906.1| conserved hypothetical protein [Ricinus communis]
gi|223533745|gb|EEF35478.1| conserved hypothetical protein [Ricinus communis]
Length = 853
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 133/457 (29%), Positives = 200/457 (43%), Gaps = 85/457 (18%)
Query: 645 RGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP---FQHIYTSNGVSLHEL 701
+G+ G G GI C CC + F+ HA R F +I+ +G SL +
Sbjct: 341 KGQHSIAEGRISGSGIKCNCCGKVYTLCGFDYHASGKQGRTATSIFSNIFLEDGRSLLDC 400
Query: 702 SIKL-----------SLERPFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
+++ LER SSK END +C +C GG+L+ CD CP +FH C+ L
Sbjct: 401 QMQIMHDHTKNLGEEPLERWQSSKDQVENDHICSVCHYGGELILCDQCPSSFHKSCLGLM 460
Query: 748 GIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 807
+P G W C C C +I G
Sbjct: 461 DVPDGDWFCSSC----------------------------------CCKIC-------GQ 479
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRIN 867
C L R D G V+ C QCER+YHV CL + E L+ PK W C C++I
Sbjct: 480 C-LKRDSDLSMEDDG---VLDCTQCERKYHVVCLGNKREECLEYFPKEHWFCSKRCQQIF 535
Query: 868 LALQKLVDR----GEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 923
L L +L+ + G L T L I+ + SD ++A
Sbjct: 536 LGLHELLGKKIPVGLHNLTWTLLKSIQFNDQCEASD----------------IEALSENY 579
Query: 924 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSA 981
++L+ A+ + H+ FDP+ E +K DL+ +++ RS + ++HG Y +L + +S
Sbjct: 580 SMLNIALDMMHEFFDPVEEPHTKRDLLKDVIFSKRSELNRLNFHGFYTVLLQKDDEFISV 639
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
R++G+++AE+PLV T + G L +EK L L V+ L+LP+ A W
Sbjct: 640 ATVRVYGEKVAEIPLVGTRFQYRRLGMCCILMNVLEKKLRELGVQRLILPAVPSALNTWI 699
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
FGFS +TE ++ + DY + FQ T M K + K
Sbjct: 700 GSFGFSKLTELDRLQLL-DYTFLDFQDTIMCHKLLTK 735
>gi|449533614|ref|XP_004173768.1| PREDICTED: uncharacterized protein LOC101226716 [Cucumis sativus]
Length = 135
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 81/132 (61%), Positives = 100/132 (75%), Gaps = 1/132 (0%)
Query: 953 MVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1012
MVYGR GQ++ GMYCAIL VN VVSA + R+FGQ++AELPLVATSN G+GYFQ+L
Sbjct: 1 MVYGRDVGGQEFGGMYCAILIVNSFVVSAAMLRVFGQDIAELPLVATSNGNHGKGYFQTL 60
Query: 1013 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI-FQGTSM 1071
F CIE+LL FL VK LVLP+A EA++IWT KFGF + ++ + YR M+ F+GTSM
Sbjct: 61 FSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERIKPDQLSSYRRSCCQMVTFKGTSM 120
Query: 1072 LQKPVPKCRIVG 1083
LQK VP CR+VG
Sbjct: 121 LQKTVPSCRVVG 132
>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
Length = 1376
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/447 (27%), Positives = 205/447 (45%), Gaps = 79/447 (17%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 702
L G+ GI C+CC+ +S +F HAG + KP+++I +G+ + L +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 984
Query: 703 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
++ ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +P+ W
Sbjct: 985 MQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWC 1044
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C F E + +A ++
Sbjct: 1045 CSNCSCKFCHEHSSD---DAEDTADVDS-------------------------------- 1069
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKD-HGMEDLQELPKGKWLCCADCKRINLALQKLV 874
++ C QCE + C D + G C C+ + LQ L+
Sbjct: 1070 ---------SLHTCSQCEEQCTEACSPDIDSIATNLSSQTGNLFCQQSCRLLFEELQNLL 1120
Query: 875 DRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRALLSKAVSIF 933
++ PE S V+++ HEE V VL K+V+ + + ++ A+S+
Sbjct: 1121 AVKKDLEPEYSCRVVQRIHEE----------VPEEVLALDKRVECN----SKIAVALSLM 1166
Query: 934 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
+ F PI++ + ++LI +VY G + D+ G Y IL +++A RI G +L
Sbjct: 1167 DECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRIHGTKL 1226
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AE+P + T N + QG + L IE +L LN++ L++P+ +E WT+KFGFS + +
Sbjct: 1227 AEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDD 1286
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
E+ + ++ +++F GT +LQKP+ K
Sbjct: 1287 SEKQEVKS-VSMLVFPGTGLLQKPLLK 1312
>gi|302760729|ref|XP_002963787.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
gi|300169055|gb|EFJ35658.1| hypothetical protein SELMODRAFT_80470 [Selaginella moellendorffii]
Length = 461
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 128/479 (26%), Positives = 199/479 (41%), Gaps = 104/479 (21%)
Query: 634 LEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR--------- 684
L +GA V Y + + G GILC CCN S + F+ HAG R
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 685 ---------------------KPFQHIYTSNGVSLHELSIKLS--LERPFSSKENDDLCG 721
+P T + +L + +K S + ENDD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 722 ICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRI 781
+C DGG L+CCD CP FH+ C+ L +P G W C C + GR
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRCC--------------CASCGR- 177
Query: 782 EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL 841
DP Q TE ++ CDQCEREYH C+
Sbjct: 178 SLYDPTIQ-------------TE---------------------ILYCDQCEREYHSNCV 203
Query: 842 KDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNA 901
M + C C +I L+KLV R + S +++ +H + +N+
Sbjct: 204 PGSAM---KYESSDNQFCSRKCLKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSEENS 260
Query: 902 VDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRG 961
K+++ L+ A+++ + F P+I+ S +D++ ++Y R RG
Sbjct: 261 -------------KLESVADLNTRLALALTVIQECFRPMIDPRSNIDMVSHILYNR--RG 305
Query: 962 QD----YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIE 1017
+D + G Y +L Q ++S R+ G AE+P + T + + QG + L I+
Sbjct: 306 EDKRMDFRGFYTVVLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQ 365
Query: 1018 KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
++L L V+TLVLP+ +E WT+ FGF +T Q + ++ F G+S+LQKP+
Sbjct: 366 QVLHTLEVQTLVLPAIAEFIETWTSAFGFQKLT-AAQGIQLMELNIVTFPGSSVLQKPL 423
>gi|297843332|ref|XP_002889547.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297335389|gb|EFH65806.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1121
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 129/465 (27%), Positives = 204/465 (43%), Gaps = 77/465 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+ E G ++ +V Y R K ++ G+ GI C CC+ ++ S+FE HAG + +
Sbjct: 506 WLIESGVVQLRQKVQYMKRRGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAG-SKACQ 564
Query: 686 PFQHIYTSNGVSLHELSIK------------LSLERPFSSKENDDLCGICMDGGDLLCCD 733
PFQ+IY +G SL + ++ L NDD CGIC DGGDL+CCD
Sbjct: 565 PFQNIYLESGASLLQCQVRAWNMQKDATNVGLHQVDTDGDDPNDDACGICGDGGDLICCD 624
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP +H C+ + +PSG WHC C F A A+G +G F ++S
Sbjct: 625 GCPSTYHQTCLGMQVLPSGDWHCPNCTCKFCDA--------AVASGGKDG--NFLSLLS- 673
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
C CER YH CL D + +Q
Sbjct: 674 -----------------------------------CSMCERRYHQLCLSDEA-QKVQSFG 697
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
C C + LQK + E S +I + +S ++ L
Sbjct: 698 SASSFCGPKCLELFEKLQKYLGVKNEIEGGYSWSLIHRVDTDSDINSQ---------LSA 748
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
++++ + L+ ++I + F PI++ S ++LI ++Y G + +Y G Y AI
Sbjct: 749 QRIE----NNSKLAVGLAIMDECFLPIVDRRSGVNLIRNVLYNCGSNFNRINYTGFYTAI 804
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L ++SA R G +LAE+P + T + + QG + LF IE + L V+ LV+P
Sbjct: 805 LERGDEIISAASLRFHGTQLAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIP 864
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ + WT FGF+ + + + + R+ ++F G MLQKP+
Sbjct: 865 AIPDFLHAWTGNFGFTPLDDSVRKEMRS-LNTLVFPGIDMLQKPL 908
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 47/84 (55%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C C ++++T +
Sbjct: 504 LSWLIESGVVQLRQKVQYMKRRGAKVM------LEGWITREGIHCDC--C--SKILTVSR 553
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS +P + IYLE+G +L
Sbjct: 554 FEIHAGSKACQPFQNIYLESGASL 577
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 127/445 (28%), Positives = 201/445 (45%), Gaps = 81/445 (18%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS---LHELSIKLS 706
L G GI C+CC+ V+ +F AHAG KP+++I +G+ LH L I
Sbjct: 950 LLEGLITRDGINCSCCSKVVTVLEFVAHAG-GQLSKPYRNILV-DGLDNDLLHCLIIAWD 1007
Query: 707 LER--------PFSSK---ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
+ P S++ NDD CGIC DGG+L+CCD CP FH+ C+ L +PS W
Sbjct: 1008 KQSDSERQAFFPVSTEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLELEELPSDDWR 1067
Query: 756 CRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD 815
C C K + N A E +DP
Sbjct: 1068 CTNCSC-----KLCHEHLNHDAPDNAE-IDPLHS-------------------------- 1095
Query: 816 FCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK--GKWLCCADCKRINLALQKL 873
C QCE++YH C + E L + G C C+ + LQ L
Sbjct: 1096 -------------CSQCEKKYHPSCSPE--TEKLSSVSSQAGNHFCQQSCRLLFEELQNL 1140
Query: 874 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIF 933
+ ++ PE + +I+ HE++ + +D D R + ++ A+S+
Sbjct: 1141 LAVEKDLGPEYACRIIQCIHEDA-PETVLDLDGRVEC------------NSKIAVALSLM 1187
Query: 934 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
+ F PI++ + ++LI +VY G + D+ G Y IL +VSA RI G +L
Sbjct: 1188 DECFLPIVDQRTGINLIRNVVYNCGSNFLRLDFRGFYIFILERGDEIVSAASVRIHGTKL 1247
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AE+P + T + + QG + L IE +L L V+ L++P+ +E WT+KFGFS +
Sbjct: 1248 AEMPFIGTRHMYRRQGMCRRLLDGIEMILSSLKVEKLIIPAINELVDTWTSKFGFSPLEV 1307
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQKPV 1076
++ + ++ +++F GT +LQK +
Sbjct: 1308 SDKQEVKS-INMLVFPGTGLLQKSL 1331
>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
Length = 1393
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/439 (27%), Positives = 191/439 (43%), Gaps = 79/439 (17%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRR----------KPFQHIYTSNGVSLHELS 702
G G GI+C CC+ +S S+F++HAG+ R +PF + E
Sbjct: 870 GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGRPFMLCQLQAWST--EYK 927
Query: 703 IKLSLERPFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
+ S R E NDD CGIC DGG+L+CCD+CP FH C+S+ +P G W+C
Sbjct: 928 TRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNWYC--- 984
Query: 760 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 819
N R G + + I + D
Sbjct: 985 -----------LNCTCRICGDLVNFE----------EISSSSD----------------- 1006
Query: 820 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
+ C QCE++YH CLK ++ E W C C++I ALQ + +
Sbjct: 1007 ------ALKCFQCEQKYHGQCLKQRDIDSGVE--SHIWFCSGSCQKIYAALQSQLGLTNQ 1058
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
S +++ H + ++L ++ + L A++I + F
Sbjct: 1059 FANGFSWTLLRCIHYDQ------------KILSTARLAMMAECNSRLVVALTIMEECFLS 1106
Query: 940 IIESASKLDLIPAMVYG--RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+++ + +D+IP +VY S D+HG Y IL + V++ R+ G ELAE+PL+
Sbjct: 1107 MVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLI 1166
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
AT + + QG + L IE++L VK LV+ + WT FGF + E E+ +
Sbjct: 1167 ATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGF-VTVENEEKQS 1225
Query: 1058 RNDYPLMIFQGTSMLQKPV 1076
+ + LM+F GT +L+K +
Sbjct: 1226 LHKFNLMVFPGTVLLKKAL 1244
>gi|30793945|gb|AAP40424.1| unknown protein [Arabidopsis thaliana]
Length = 600
Score = 167 bits (422), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/446 (28%), Positives = 200/446 (44%), Gaps = 76/446 (17%)
Query: 645 RGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK 704
RG L G+ GI C CC+ ++ S+FE HAG S +PFQ+IY +G SL + ++
Sbjct: 4 RGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAGSKSC-QPFQNIYLESGASLLQCQVR 62
Query: 705 ----------LSLER--PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 752
L+L + NDD CGIC DGGDL+CCD CP +H +C+ + +PSG
Sbjct: 63 AWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSG 122
Query: 753 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 812
WHC C F A A+G +G F ++S
Sbjct: 123 DWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS-------------------- 152
Query: 813 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
C CER YH CL D + +Q C C + LQK
Sbjct: 153 ----------------CGMCERRYHQLCLNDEAHK-VQSFGSASSFCGPKCLELFEKLQK 195
Query: 873 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 932
+ E S +I + +S +++ + ++++ + L+ ++I
Sbjct: 196 YLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSAQRIE----NNSKLAVGLAI 242
Query: 933 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
+ F PI++ S +DLI ++Y G + +Y G Y AIL ++SA R G +
Sbjct: 243 MDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQ 302
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
LAE+P + T + + QG + LF IE + L V+ LV+P+ + WT FGF+ +
Sbjct: 303 LAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD 362
Query: 1051 EEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ + + R+ ++F G MLQKP+
Sbjct: 363 DSVRKEMRS-LNTLVFPGIDMLQKPL 387
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 10/66 (15%)
Query: 422 MYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYL 481
M RG+KV L G + GI C C ++++T + FE+HAGS + +P + IYL
Sbjct: 1 MRRRGAKVM------LEGWITREGIHCDC----CSKILTVSRFEIHAGSKSCQPFQNIYL 50
Query: 482 ENGKTL 487
E+G +L
Sbjct: 51 ESGASL 56
>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
[Cucumis sativus]
Length = 1403
Score = 167 bits (422), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 120/439 (27%), Positives = 190/439 (43%), Gaps = 79/439 (17%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRR----------KPFQHIYTSNGVSLHELS 702
G G GI+C CC+ +S S+F++HAG+ R +PF + E
Sbjct: 870 GRITGDGIICNCCSDILSISEFKSHAGFKFNRACSNLFLDSGRPFMLCQLQAWST--EYK 927
Query: 703 IKLSLERPFSSKE---NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
+ S R E NDD CGIC DGG+L+CCD+CP FH C+S+ +P G W+C
Sbjct: 928 TRKSKTRTVEVDEDDRNDDSCGICGDGGELICCDNCPSTFHHSCLSIQELPEGNWYC--- 984
Query: 760 MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKS 819
N R G + + I + D
Sbjct: 985 -----------LNCTCRICGDLVNFE----------EISSSSD----------------- 1006
Query: 820 RFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
+ C QCE++YH CLK + E W C C++I ALQ + +
Sbjct: 1007 ------ALKCFQCEQKYHGQCLKQRDINSGVE--SHIWFCSGSCQKIYAALQSQLGLTNQ 1058
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
S +++ H + ++L ++ + L A++I + F
Sbjct: 1059 FANGFSWTLLRCIHYDQ------------KILSTARLAMMAECNSRLVVALTIMEECFLS 1106
Query: 940 IIESASKLDLIPAMVYG--RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+++ + +D+IP +VY S D+HG Y IL + V++ R+ G ELAE+PL+
Sbjct: 1107 MVDPRTGIDMIPHLVYSWKSSFPRLDFHGFYTVILEKDDVLLCVASIRVHGSELAEMPLI 1166
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
AT + + QG + L IE++L VK LV+ + WT FGF + E E+ +
Sbjct: 1167 ATCSKYRRQGMCRRLLNAIEEMLMSFKVKKLVIAAIPSLVETWTEGFGF-VTVENEEKQS 1225
Query: 1058 RNDYPLMIFQGTSMLQKPV 1076
+ + LM+F GT +L+K +
Sbjct: 1226 LHKFNLMVFPGTVLLKKAL 1244
>gi|21450874|gb|AAK59489.2| unknown protein [Arabidopsis thaliana]
Length = 620
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 127/446 (28%), Positives = 200/446 (44%), Gaps = 76/446 (17%)
Query: 645 RGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK 704
RG L G+ GI C CC+ ++ S+FE HAG + +PFQ+IY +G SL + ++
Sbjct: 24 RGAKVMLEGWITREGIHCDCCSKILTVSRFEIHAG-SKSCQPFQNIYLESGASLLQCQVR 82
Query: 705 ----------LSLER--PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 752
L+L + NDD CGIC DGGDL+CCD CP +H +C+ + +PSG
Sbjct: 83 AWNMQKDATNLALHQVDTDGDDPNDDACGICGDGGDLICCDGCPSTYHQNCLGMQVLPSG 142
Query: 753 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 812
WHC C F A A+G +G F ++S
Sbjct: 143 DWHCPNCTCKFCDA--------AVASGGKDG--NFISLLS-------------------- 172
Query: 813 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
C CER YH CL D + +Q C C + LQK
Sbjct: 173 ----------------CGMCERRYHQLCLNDEAHK-VQSFGSASSFCGPKCLELFEKLQK 215
Query: 873 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 932
+ E S +I + +S +++ + ++++ + L+ ++I
Sbjct: 216 YLGVKTEIEGGYSWSLIHRVDTDSDTNSQ---------MSAQRIE----NNSKLAVGLAI 262
Query: 933 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
+ F PI++ S +DLI ++Y G + +Y G Y AIL ++SA R G +
Sbjct: 263 MDECFLPIVDRRSGVDLIRNVLYNCGSNFNRINYTGFYTAILERGDEIISAASLRFHGMQ 322
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
LAE+P + T + + QG + LF IE + L V+ LV+P+ + WT FGF+ +
Sbjct: 323 LAEMPFIGTRHIYRRQGMCRRLFDAIESAMRSLKVEKLVIPAIPDFLHAWTGNFGFTPLD 382
Query: 1051 EEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ + + R+ ++F G MLQKP+
Sbjct: 383 DSVRKEMRS-LNTLVFPGIDMLQKPL 407
Score = 43.5 bits (101), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 13/84 (15%)
Query: 407 LKNFLESGILE---GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV 463
L +ESG+++ + M RG+KV L G + GI C C ++++T +
Sbjct: 3 LSWLIESGVVQLRQKVQYMRRRGAKVM------LEGWITREGIHCDC----CSKILTVSR 52
Query: 464 FELHAGSSNKRPPEYIYLENGKTL 487
FE+HAGS + +P + IYLE+G +L
Sbjct: 53 FEIHAGSKSCQPFQNIYLESGASL 76
>gi|334186543|ref|NP_193228.6| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
gi|225898777|dbj|BAH30519.1| hypothetical protein [Arabidopsis thaliana]
gi|332658123|gb|AEE83523.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein [Arabidopsis thaliana]
Length = 1138
Score = 166 bits (420), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 120/476 (25%), Positives = 207/476 (43%), Gaps = 79/476 (16%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+ + G L+ +V Y + + ++ G+ GI C CC+ ++ S+FE HAG + R+
Sbjct: 583 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAG-SKLRQ 641
Query: 686 PFQHIYTSNGVSLHELSIKLSLERP------------FSSKENDDLCGICMDGGDLLCCD 733
PFQ+I+ ++GVSL + I ++ + NDD CGIC DGGDL+CCD
Sbjct: 642 PFQNIFLNSGVSLLQCQIDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCD 701
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP FH C+ + P G WHC C F
Sbjct: 702 GCPSTFHQRCLDIRMFPLGDWHCPNCTCKF------------------------------ 731
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
C +++ +G C CE++YH C+ +
Sbjct: 732 CKAVIEDVTQTVGANT-------------------CKMCEKKYHKSCMPKANVTPADTTE 772
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
C CK ++ ++K V E S ++ + E + SD + L G
Sbjct: 773 PITSFCGKKCKALSEGVKKYVGVKHELEAGFSWSLVHR--ECTNSDLS---------LSG 821
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
+ ++ L+ A+++ + F PII+ S ++++ ++Y G + ++ G Y A+
Sbjct: 822 HPHIVENNSK--LALALTVMDECFLPIIDRRSGVNIVQNVLYNCGSNFNRLNFGGFYTAL 879
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L +V++ R G LAE+P + T + + QG + LF +E L L VK L++P
Sbjct: 880 LERGDEIVASASIRFHGNRLAEMPFIGTRHVYRHQGMCRRLFSVVESALQHLKVKLLIIP 939
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVD 1087
+ ++ +W +KFGF + + + + R+ L+ F G +LQK + R +VD
Sbjct: 940 ATADFSHVWISKFGFRQVEDSLKKEMRS-MNLLTFPGIDVLQKELLAPRHTESAVD 994
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 66/141 (46%), Gaps = 17/141 (12%)
Query: 353 SLLQQKVELAKGSLSK--DGGKRSD---VTEVANDGVGGPVKQETVMKPRKVMRKFYSKL 407
S L + E + GS S DGGK S T + V G + P R + L
Sbjct: 525 SPLHHQTEKSTGSSSHRVDGGKSSKHGRSTLLVRRSVRGDNSESDGFVPSSEKRTVLAWL 584
Query: 408 KNFLESGILE-GMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFEL 466
++SG L+ VMY+ + + + L G + GI C C C ++++ + FE+
Sbjct: 585 ---IDSGTLQLSEKVMYMNQRRTR----AMLEGWITRDGIHCGC--C--SKILAVSKFEI 633
Query: 467 HAGSSNKRPPEYIYLENGKTL 487
HAGS ++P + I+L +G +L
Sbjct: 634 HAGSKLRQPFQNIFLNSGVSL 654
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 166 bits (419), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 193/432 (44%), Gaps = 79/432 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK------------LS 706
G++CTCCN VS S+F+ HAG+ ++ P +++ +G ++
Sbjct: 613 GVVCTCCNKTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 671
Query: 707 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
LE+ NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W+C C E
Sbjct: 672 LEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWICSE 731
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
V NA R +DF
Sbjct: 732 -LVSDNAE-------------------------------------RSQDF---------- 743
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 744 -KCSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSW 800
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
++K E+ V +++ + L+ A+SI + F +++ +
Sbjct: 801 SILKCFQEDG------------MVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 848
Query: 947 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+D+IP ++Y G + D+ G Y ++ + V++S R+ G +AE+PLVAT + +
Sbjct: 849 IDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYR 908
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 909 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 967
Query: 1065 IFQGTSMLQKPV 1076
+F GT++L+K +
Sbjct: 968 VFPGTTLLKKTL 979
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1189
Score = 166 bits (419), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 193/432 (44%), Gaps = 79/432 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK------------LS 706
G++CTCCN VS S+F+ HAG+ ++ P +++ +G ++
Sbjct: 657 GVVCTCCNKTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 715
Query: 707 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
LE+ NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W+C C E
Sbjct: 716 LEKASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSWYCSSCTCWICSE 775
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
V NA R +DF
Sbjct: 776 -LVSDNAE-------------------------------------RSQDF---------- 787
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 788 -KCSQCAHKYHGTCLQ--GISKRRKLFPETYFCGKNCEKVYNGLSSRVGIINPNADGLSW 844
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
++K E+ V +++ + L+ A+SI + F +++ +
Sbjct: 845 SILKCFQEDG------------MVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 892
Query: 947 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+D+IP ++Y G + D+ G Y ++ + V++S R+ G +AE+PLVAT + +
Sbjct: 893 IDMIPHVLYNWGSTFARLDFDGFYTVVVEKDDVMISVASIRVHGVTIAEMPLVATCSKYR 952
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 953 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 1011
Query: 1065 IFQGTSMLQKPV 1076
+F GT++L+K +
Sbjct: 1012 VFPGTTLLKKTL 1023
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 188/446 (42%), Gaps = 108/446 (24%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE-LSIKLSLERPFSSK--- 714
GI+C CC +S S F AHAG + H +S G+ L S L L +S++
Sbjct: 889 GIVCNCCKKNLSVSDFMAHAGRS-------HPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 715 --------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C G K
Sbjct: 1002 -----------------------------------------------CQVCGGPFSEKEV 1014
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 880
+ C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1015 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV------ 1067
Query: 881 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAV 930
G+DN +D D+ W +LR G+K+ + L L+ A+
Sbjct: 1068 ----------------GTDNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVAL 1111
Query: 931 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
++ + F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G
Sbjct: 1112 TLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHG 1171
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+ AELP +ATS D + QG + L IEK+L NVK LVL + E + W + FGF
Sbjct: 1172 TKAAELPFIATSVDFRRQGMCRILMSIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKP 1231
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQK 1074
+ + E+ + N LM+F GTS+L K
Sbjct: 1232 IEDAERKQLHN-VNLMLFPGTSLLTK 1256
>gi|449450934|ref|XP_004143217.1| PREDICTED: uncharacterized protein LOC101206451 [Cucumis sativus]
gi|449525537|ref|XP_004169773.1| PREDICTED: uncharacterized LOC101206451 [Cucumis sativus]
Length = 1317
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/455 (27%), Positives = 194/455 (42%), Gaps = 96/455 (21%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER--------- 709
GI C CCN + + FE H S R HI +G SL + I + +
Sbjct: 829 GIKCCCCNKLYTINGFEIHVSGTSSRSA-AHILLEDGKSLLDCQILWNKKTRSFKNQAST 887
Query: 710 ----PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQK 765
+S END +C IC GG L+ CD CP +FH C+ L +P G W C C
Sbjct: 888 CGKGDYSKDENDYICSICHFGGTLILCDQCPSSFHQSCLGLKDVPEGDWFCPSCCCGICG 947
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 825
+ + +AN IV P
Sbjct: 948 QNKLSEHAN----------------------IVDGP------------------------ 961
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETS 885
+ C QCE +YHV CL+ G + K W C CK+I LQKL+ + S
Sbjct: 962 FLTCYQCECKYHVQCLR--GTKKFGSCSKPHWFCNKHCKQIYWGLQKLLGK--------S 1011
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDAS----------DGTRALLSKAVSIFHD 935
+ V G DN + W +L+ D + ++ L+ A+ + H+
Sbjct: 1012 IPV--------GGDN-----LTWSLLKSPSSDTNYFNPPHLETLTENQSKLNVALRVMHE 1058
Query: 936 RFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F+P+ E ++ D++ +++ R + ++ G Y +L N+ +++ R++G+++AE
Sbjct: 1059 CFEPVREQHTRRDIVEDVIFSRRSELKRLNFQGFYTVLLERNEELIAVAAIRVYGEKVAE 1118
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+PLV T + G L +E+ L L V+ LVLP+ WT FGFS MT+ E
Sbjct: 1119 VPLVGTRFQYRRLGMCHILMNELEERLRGLGVQRLVLPAVPSVLKAWTTSFGFSKMTDSE 1178
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
++++ N Y + FQ T M QK + K +V S+ G
Sbjct: 1179 RSEFLN-YTFLNFQETVMCQKFLLKNTVVPSSLSG 1212
>gi|357511385|ref|XP_003625981.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355500996|gb|AES82199.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 796
Score = 162 bits (411), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 186/433 (42%), Gaps = 78/433 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLER-------PF 711
GI+C CC + S FEAHAG +R +P I +G SL + + ++
Sbjct: 418 GIVCNCCRVNFTVSGFEAHAG-CTRHRPSISILLEDGRSLFKCQREARDQKGSHCIGEAN 476
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 771
S ND++C IC GGDL+ CD CP AFH+ C+ L +P G W C C
Sbjct: 477 SEANNDNVCSICGFGGDLVLCDRCPSAFHLGCLGLDRVPDGDWFCPTC------------ 524
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG---RRTVIL 828
C +I P CK ++
Sbjct: 525 ----------------------CCKICYRPK--------------CKQECADGNENNFLV 548
Query: 829 CDQCEREYHVGCLKDHGM---EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETS 885
C QCE+++H GC+K + K W C C + L L+KL+ + + +
Sbjct: 549 CVQCEQKFHFGCVKTTRFGSSHTESNIKKKNWFCSVVCGNMFLCLKKLLGKPIKVADNIN 608
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 945
++K S D+ DF K++A A+ + ++ F+P I++ S
Sbjct: 609 WTLLK---NVSSDDDGGDFTSNEFSQEKHKLNA----------ALGVLYEGFNPTIDALS 655
Query: 946 KLDLIPAMVYGRS--HRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1003
+LI +V+ R H+ ++ G Y IL V+S RIFGQ++AE+ VAT
Sbjct: 656 GRELIKDLVFSRDSEHKRLNFRGFYTVILEKMGEVISVATIRIFGQKVAEIVFVATKEQH 715
Query: 1004 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1063
+G+G + L +E+ L L V LVL S+ +A WT FGF+ MT E++ K D
Sbjct: 716 RGRGMCRLLMDELEEQLTRLGVGRLVLHSSEDAINTWTKSFGFARMTSEDKCKL-IDNTF 774
Query: 1064 MIFQGTSMLQKPV 1076
+ F + M KP+
Sbjct: 775 LEFHNSIMCLKPL 787
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 22/124 (17%)
Query: 371 GKRSDVTEVA-----NDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIR 425
G+R V +++ ++G G K+ V+K R +R+ YS + +E+ +L + ++ R
Sbjct: 348 GRRGKVLKMSIMKKNSEGFG---KRGKVLK-RGGIRESYSIVSWLIENKVLVSGTHVFCR 403
Query: 426 GSK--VKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLEN 483
GS+ VK RG + GI C C C+ N V+ FE HAG + RP I LE+
Sbjct: 404 GSENIVK-------RGSIFSDGIVCNC--CRVNFTVSG--FEAHAGCTRHRPSISILLED 452
Query: 484 GKTL 487
G++L
Sbjct: 453 GRSL 456
>gi|255559400|ref|XP_002520720.1| conserved hypothetical protein [Ricinus communis]
gi|223540105|gb|EEF41682.1| conserved hypothetical protein [Ricinus communis]
Length = 1700
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 126/456 (27%), Positives = 194/456 (42%), Gaps = 91/456 (19%)
Query: 638 AEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 696
+V Y R + +F+ G GI C CCN + ++FEAHAG S +PF++IY G
Sbjct: 778 GKVQYLKRRKARFVTKGSITTDGIQCDCCNKTFTSAEFEAHAGGKSC-QPFENIYLETGS 836
Query: 697 SLHELSIKLSLERPFSSKE------------NDDLCGICMDGGDLLCCDSCPRAFHIDCV 744
SL + + + S+ + NDD CGIC DGGDL+CCDSCP FH C+
Sbjct: 837 SLLQCQLDSWYKEDDSAHKGFHFIDIDGEDPNDDTCGICGDGGDLICCDSCPSTFHQSCL 896
Query: 745 SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE 804
+ PSG WHC YC+ F G + G + C R D
Sbjct: 897 EIRKFPSGLWHCMYCLCKF--------------CGMVGG--------NTCQR-----DGN 929
Query: 805 LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 864
+ ++ C CE +YH C ++ + + P C +C+
Sbjct: 930 MAAV--------------SHALVTCHLCEDKYHHSCFQEKDI--INADPGSPSFCGNNCQ 973
Query: 865 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 924
+ LQ L +E S +++ + SD +V + W+V KV
Sbjct: 974 ELYERLQMLFGVKQELEAGFSWTFVRRF--DVSSDISVS-GMSWKVDCNSKV-------- 1022
Query: 925 LLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAG 982
+ A+ I + F P+++ S ++LI +VY G + +Y G + A+L +++A
Sbjct: 1023 --AVALQIMDECFVPMVDHKSGVNLIRNIVYSFGSNFNRLNYSGFFNAVLERGDEMIAAA 1080
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
R F +P+ S + L LNV LV+P+ SE WT+
Sbjct: 1081 SIRYF----YSMPV--------------SFHSSLSMGLCSLNVGKLVIPAISELTGTWTS 1122
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
FGF + ++ RN +M+F G MLQKP+ K
Sbjct: 1123 VFGFKHLEGSDKQIMRN-MNMMVFPGVDMLQKPLLK 1157
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 160 bits (405), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 122/446 (27%), Positives = 187/446 (41%), Gaps = 108/446 (24%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL-----HELSIKLSLERPFSS 713
G++C CC +S S F AHAG F H +S G+ L + L + F S
Sbjct: 913 GVVCNCCKKTLSVSGFMAHAG-------FSHPQSSLGLFLESGKSYTLCQVEAWSAEFMS 965
Query: 714 K-------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 966 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHPACLSAKELPEGSWYCHNCT 1025
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C G K
Sbjct: 1026 -----------------------------------------------CQICGGPVSEKEV 1038
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 880
+ C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1039 STFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV------ 1091
Query: 881 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAV 930
G++N +D ++ W +LR G+K+ + L L+ A+
Sbjct: 1092 ----------------GTENILDSELSWSILRCNNDGQKLHSVQKIACLAECNMKLAVAL 1135
Query: 931 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
++ + F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G
Sbjct: 1136 TLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVILEKGDEILCVASIRVHG 1195
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+ AELP +ATS D + QG + L IEK+L NVK LVL + E + W + FGF
Sbjct: 1196 TKAAELPFIATSVDYRRQGMCRILMNIIEKMLCSFNVKMLVLSAIPELVSTWVSGFGFKP 1255
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQK 1074
+ + E+ + N LM+F GTS+L K
Sbjct: 1256 IEDAERKQLHN-VNLMLFPGTSLLTK 1280
>gi|224111178|ref|XP_002315772.1| predicted protein [Populus trichocarpa]
gi|222864812|gb|EEF01943.1| predicted protein [Populus trichocarpa]
Length = 390
Score = 159 bits (403), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 117/442 (26%), Positives = 196/442 (44%), Gaps = 77/442 (17%)
Query: 651 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIK-LSLER 709
L G+ GI C CC+ ++ S+FE HAG + R+PFQ+IY +G SL + I+ + +
Sbjct: 10 LEGWVTRDGIHCGCCSKILTVSKFEIHAG-SKLRQPFQNIYLESGASLLDCQIEAWNRQE 68
Query: 710 PF-----------SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 758
P + NDD CG+C DGGDL+CCD CP FH C+ + +P G WHC
Sbjct: 69 PVKRLGFQAVDVDGNDPNDDTCGLCGDGGDLICCDGCPSTFHQSCLDIKMLPPGDWHCPN 128
Query: 759 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 818
C F C + ++F +
Sbjct: 129 CSCKF--------------------------------------------CGVASDKNFQR 144
Query: 819 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
++ C C ++YH C+++ + C C+ + LQK +
Sbjct: 145 DDTTVSKLLTCSLCVKKYHKSCMQEINTLSIDTNNSVASFCGKKCRELFEQLQKYLGVKH 204
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG--KKVDASDGTRALLSKAVSIFHDR 936
E S +I + +S + L+G ++V+ + + L+ ++S+ +
Sbjct: 205 ELEAGFSWSLIHRTDADSDTS-----------LQGLPQRVECN----SKLAVSLSVMDEC 249
Query: 937 FDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 994
F PI++ S ++LI ++Y G + ++ G Y IL ++SA R G LAE+
Sbjct: 250 FLPIVDRRSGINLIQNVLYNCGSNFNRLNFGGFYALILERGDEIISAASIRFHGTRLAEM 309
Query: 995 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1054
P + T + + QG + LF IE L L V+ L++P+ SE WT FGF+ + E +
Sbjct: 310 PFIGTRHMYRRQGMCRRLFYAIESTLCSLKVEKLIIPAISELMHTWTEVFGFTTLDESLK 369
Query: 1055 NKYRNDYPLMIFQGTSMLQKPV 1076
+ ++ +++F G MLQK +
Sbjct: 370 QELKS-MNMLVFPGIDMLQKQL 390
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 4/53 (7%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRD 489
L G V GI C C ++++T + FE+HAGS ++P + IYLE+G +L D
Sbjct: 10 LEGWVTRDGIHCGC----CSKILTVSKFEIHAGSKLRQPFQNIYLESGASLLD 58
>gi|297734890|emb|CBI17124.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 159 bits (402), Expect = 7e-36, Method: Composition-based stats.
Identities = 73/122 (59%), Positives = 93/122 (76%)
Query: 963 DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGF 1022
++ GMYCAILTV VVSA FR+ G+E+AELPLVAT +DCQGQGYFQ+L+ CIE+LL F
Sbjct: 17 EFGGMYCAILTVGCQVVSAATFRVLGKEVAELPLVATRSDCQGQGYFQALYTCIERLLCF 76
Query: 1023 LNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1082
L V +LVLP+A A+++W NKF F M +EE N D+ +M FQGTSMLQKPVP+ R +
Sbjct: 77 LQVNSLVLPAAEGAESLWINKFKFHKMEQEELNHLCRDFQMMTFQGTSMLQKPVPEYRRI 136
Query: 1083 GK 1084
+
Sbjct: 137 SQ 138
>gi|356542320|ref|XP_003539616.1| PREDICTED: uncharacterized protein LOC100777440 [Glycine max]
Length = 1311
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/433 (25%), Positives = 186/433 (42%), Gaps = 79/433 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKLSLERPFS- 712
GI+CTCC+ ++ S+F+ HAG+ R P +I+ +G L S + R +
Sbjct: 729 GIICTCCDKVLTLSEFKFHAGFTVNR-PCLNIFMESGEPFTLCLLQAWSAEYKARRSQNQ 787
Query: 713 ------SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ +NDD CG+C +GG+L+CCD+CP FH+ C+S IP G W+C C
Sbjct: 788 AVHADDNDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------ 841
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
C +C K
Sbjct: 842 -----------------------------------------CRICGNLVIDKDTSDAHDS 860
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELP-KGKWLCCADCKRINLALQKLVDRGEEKLPETS 885
+ C QCE +YH CL+D D QE+ W C C+ + LQ V + S
Sbjct: 861 LQCSQCEHKYHEKCLEDR---DKQEVAISDTWFCGQSCQEVYSGLQTQVGLVNQVADGIS 917
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 945
+++ H++ +A +W L+ L+ A++I + F + + +
Sbjct: 918 WTLLRCIHDDQKVHSA-----QWFALKAV-------CNTKLAVALTIMEECFVSMFDPRT 965
Query: 946 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1003
+ +IP ++Y G ++ G Y +L V++S R+ G +AE+PL+AT +
Sbjct: 966 GIHMIPQVLYNWGSEFARLNFQGFYTIVLEKKDVLISVASIRVHGTTVAEMPLIATCSQY 1025
Query: 1004 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1063
+ QG + L IE++L V+ LV+ + + WT FGF + + E+ + N L
Sbjct: 1026 RRQGMCRLLVSAIEQMLISFKVEKLVVSAIPDLVETWTKGFGFITVDDIERQRL-NKINL 1084
Query: 1064 MIFQGTSMLQKPV 1076
M+F GT +L K +
Sbjct: 1085 MVFPGTVLLVKSL 1097
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 187/444 (42%), Gaps = 81/444 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 709
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 710 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 765
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK 699
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 825
++ ++ + + L C L
Sbjct: 700 DEAAKHETSTLPS--------------------------LSSCRL--------------- 718
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET- 884
CE +YH C+ G + C C+ + LQ + + LPE
Sbjct: 719 ------CEEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGF 769
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
S +++ S D D+ ++ A ++ A S+ + F P+++
Sbjct: 770 SWSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHR 816
Query: 945 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
S ++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 817 SGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYM 876
Query: 1003 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1062
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + + E+ +N
Sbjct: 877 YRRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKN-LN 935
Query: 1063 LMIFQGTSMLQKPVPKCRIVGKSV 1086
L++F G ML K + K +I V
Sbjct: 936 LLVFPGVDMLGKSLVKEKITDSVV 959
>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
Length = 1442
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 140/535 (26%), Positives = 224/535 (41%), Gaps = 125/535 (23%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 573 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMI 632
Query: 630 EEGGLEDGAEVGYF-VRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 688
+ G + + A+V Y +G L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 633 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 691
Query: 689 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 736
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 692 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 751
Query: 737 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 796
FH+ C+ + +PSG WHC C
Sbjct: 752 STFHLACLGIK-MPSGDWHCSSC------------------------------------- 773
Query: 797 IVQTPDTELGGCVLCRGRDFCKSRFGRRT----VILCDQCEREYHVGCLKDHGMEDLQEL 852
+CR FC S T ++ C QC R+YH C + ++
Sbjct: 774 -------------ICR---FCGSTQEITTSSAELLSCLQCSRKYHQVCAPGTMKDSVKAE 817
Query: 853 PKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 910
C C++I L+KL+ G NA++ W +
Sbjct: 818 SNSSTDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSL 855
Query: 911 LR--GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSH 959
+R K+ A +A L + A S+ + F P I+ S +++I ++Y G
Sbjct: 856 VRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDF 915
Query: 960 RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1019
++ Y IL V+SA RI G +LAE+P + T + QG L IE
Sbjct: 916 NRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESA 975
Query: 1020 LGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
L LNV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 976 LSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 1029
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 655 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 698
>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
Length = 1441
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 140/535 (26%), Positives = 224/535 (41%), Gaps = 125/535 (23%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 572 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYFPYRWKRTVLSWMI 631
Query: 630 EEGGLEDGAEVGYF-VRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 688
+ G + + A+V Y +G L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 632 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 690
Query: 689 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 736
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 691 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 750
Query: 737 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 796
FH+ C+ + +PSG WHC C
Sbjct: 751 STFHLACLGIK-MPSGDWHCSSC------------------------------------- 772
Query: 797 IVQTPDTELGGCVLCRGRDFCKSRFGRRT----VILCDQCEREYHVGCLKDHGMEDLQEL 852
+CR FC S T ++ C QC R+YH C + ++
Sbjct: 773 -------------ICR---FCGSTQEITTSSAELLSCLQCSRKYHQVCAPGTMKDSVKAE 816
Query: 853 PKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRV 910
C C++I L+KL+ G NA++ W +
Sbjct: 817 SNSSTDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSL 854
Query: 911 LR--GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSH 959
+R K+ A +A L + A S+ + F P I+ S +++I ++Y G
Sbjct: 855 VRCFPDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDF 914
Query: 960 RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1019
++ Y IL V+SA RI G +LAE+P + T + QG L IE
Sbjct: 915 NRLNFSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESA 974
Query: 1020 LGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
L LNV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 975 LSSLNVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 1028
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 654 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 697
>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
Length = 1019
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 140/531 (26%), Positives = 225/531 (42%), Gaps = 117/531 (22%)
Query: 587 DTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKD-------------LRMHKLVF---- 629
D+S+R+ + N+A SS +K +SS V+ + R R + V
Sbjct: 150 DSSSRNSKDNNAGRSSGNKHQSSGVRGCALLVRGSTHSMEGNVDGYVPYRWKRTVLSWMI 209
Query: 630 EEGGLEDGAEVGYF-VRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQ 688
+ G + + A+V Y +G L G GI C CC+ ++ ++FE HAG + ++P++
Sbjct: 210 DMGVVSEDAKVKYMNKKGTRARLEGRITRDGIHCGCCSKILTVAKFELHAG-SKEQQPYE 268
Query: 689 HIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDSCP 736
+I+ +G +L + + ++ S K+ DD CGIC DGGDLLCCD+CP
Sbjct: 269 NIFLEDGGATLSQCLVDAWKKQSQSEKKGFYKVDPGDDPDDDTCGICGDGGDLLCCDNCP 328
Query: 737 RAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR 796
FH+ C+ + +PSG WHCR C+ R G +
Sbjct: 329 STFHLACLGIK-MPSGDWHCRSCI--------------CRFCGSTQE------------- 360
Query: 797 IVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 856
+ T EL C+ QC R+YH C + ++
Sbjct: 361 -ITTSSAELLSCL---------------------QCSRKYHQVCAPGTMKDSVKAESNSS 398
Query: 857 W--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR-- 912
C C++I L+KL+ G NA++ W ++R
Sbjct: 399 TDCFCSPGCRKIYKHLRKLL----------------------GVKNAIEAGFSWSLVRCF 436
Query: 913 GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 963
K+ A +A L + A S+ + F P I+ S +++I ++Y G +
Sbjct: 437 PDKLAAPPKGKAHLIHCNSKTAVAFSVMDECFLPRIDERSGINIIHNVIYNCGSDFNRLN 496
Query: 964 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1023
+ Y IL V+SA RI G +LAE+P + T + QG L IE L L
Sbjct: 497 FSKFYTFILERGDEVISAAAVRIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSL 556
Query: 1024 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
NV+ LV+P+ E Q WT FGF + ++ K ++ ++I GT +L+K
Sbjct: 557 NVRRLVIPAIPELQNTWTTVFGFKPVEPSKRQKIKS-LNILIIHGTGLLEK 606
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
L G + GI C C C ++++T A FELHAGS ++P E I+LE+G
Sbjct: 232 LEGRITRDGIHCGC--C--SKILTVAKFELHAGSKEQQPYENIFLEDG 275
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 156 bits (394), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 110/432 (25%), Positives = 183/432 (42%), Gaps = 77/432 (17%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH---------ELSIKLSLER 709
GI+C CC ++ S+F+ HAG+ R P +I+ +G E + S +
Sbjct: 729 GIICICCGKVLTLSEFKFHAGFTLNR-PCLNIFMESGEPFTLCLLQAWSTEYKARKSQNQ 787
Query: 710 PFSSKEND---DLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ END D CG+C +GG+L+CCD+CP FH+ C+S IP G W+C C
Sbjct: 788 AVHADENDKNDDSCGLCGEGGELICCDNCPSTFHLACLSTQEIPDGDWYCTNCT------ 841
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
C +C K
Sbjct: 842 -----------------------------------------CRICGNLVIDKDTLDAHDS 860
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C QCE +YH CL+D ++ L W C C+ + LQ V + S
Sbjct: 861 LQCSQCEHKYHEKCLEDRDKQEGAIL--DTWFCGQSCQEVYSGLQSQVGLVNQVADGISW 918
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
+++ H++ +A +W L+ L+ A++I + F + + +
Sbjct: 919 TLLRCIHDDQKVHSA-----QWFALKAV-------CNTKLAVALTIMEECFVSMFDPRTG 966
Query: 947 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+ LIP ++Y G ++ G Y +L + V++S R+ G +AE+PL+AT + +
Sbjct: 967 IHLIPQVLYNWGSEFARLNFQGFYTIVLEKDDVLISVASIRVHGTTVAEMPLIATCSQYR 1026
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L V+ LV+ + + WT FGF + + E+ + N LM
Sbjct: 1027 RQGMCRLLVTAIEQVLISFKVEKLVISAIPDLVETWTKGFGFIPVDDIERQRL-NKINLM 1085
Query: 1065 IFQGTSMLQKPV 1076
+F GT +L K +
Sbjct: 1086 VFPGTVLLVKSL 1097
>gi|224082648|ref|XP_002306779.1| predicted protein [Populus trichocarpa]
gi|222856228|gb|EEE93775.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 156 bits (394), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 124/437 (28%), Positives = 187/437 (42%), Gaps = 82/437 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKE--- 715
GI C CC + S FEAHAG + +P ++I+ NG SL ++ + S ++
Sbjct: 18 GIQCDCCGETFAISDFEAHAG-SKSCQPLKNIFLENGPSLLHCQLESWHRQDESDRKGFH 76
Query: 716 ---------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
NDD CGIC DGG+L+CCDSCP FH C+ + +PSG W+C YC F
Sbjct: 77 FVDIDGQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKLPSGVWNCTYCSCKF--- 133
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
A G +D R +
Sbjct: 134 -------CGMAGGDACQMDE-------------------------------NDAAARPAL 155
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C CE +YH C+ + + + C C+ ++ LQ L+ E +
Sbjct: 156 LTCCLCEEKYHHSCIP--AEDTINDYHSSLSFCGKKCQELHDKLQALLGVKHEMEEGFAW 213
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRAL-----LSKAVSIFHDRFDPII 941
V+++ FDV G + S R + ++ A+ I + F P+
Sbjct: 214 TVVRR------------FDV------GSDITLSGMHRKVECNSKVAVALHIMDECFLPMP 255
Query: 942 ESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
+ S ++LI +VY G + +Y G AIL V+SA RI G +LAE+P + T
Sbjct: 256 DHRSGVNLIRNIVYNFGSNFNRLNYCGFLTAILERGDEVISAASIRIHGNQLAEMPFIGT 315
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ + QG + L IE L LNV+ LV+P+ SE + WT+ FGF + + K R
Sbjct: 316 RHMYRRQGMCRRLLGAIETALCSLNVEKLVIPAISELRETWTSVFGFKQLEGLSKQKMRY 375
Query: 1060 DYPLMIFQGTSMLQKPV 1076
++ F G MLQKP+
Sbjct: 376 -MKMVAFPGVDMLQKPL 391
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 155 bits (393), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 118/443 (26%), Positives = 184/443 (41%), Gaps = 102/443 (23%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---- 714
GILC CC +S S F+AHAG R ++ +G S ++ S +
Sbjct: 967 GILCNCCTKTLSISDFKAHAGCRLRLSSLG-LFLQSGKSYTLCQVEAWSAELMSRRSDAY 1025
Query: 715 --------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C
Sbjct: 1026 GRKVEAVDENDDTCGFCGDGGELLCCDNCPSTYHEACLSSQELPEGSWYC---------- 1075
Query: 767 KFVEYNANARAAGRI---EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 823
+N R+ G + V F+ ++ +C+
Sbjct: 1076 ----HNCTCRSCGNPVNEKEVSSFSDIL-KCL---------------------------- 1102
Query: 824 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
QC YH C+ D M W C CK I + L
Sbjct: 1103 -------QCGDAYHNTCI-DRVMLPSDGKRSDTWFCGRYCKEIFMGL------------- 1141
Query: 884 TSLDVIKKKHEESGSDNAVDFDVRWRVLR----------GKKVDASDGTRALLSKAVSIF 933
H + G +N ++ D+ W +LR +K+ L+ A+++
Sbjct: 1142 ---------HSQVGVENVINNDLSWTILRCNSDGQRLHSAQKIGLMTECNTKLAVALTLL 1192
Query: 934 HDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
+ F +++ + +D+IP ++Y G + DY G Y IL ++ R+ G +
Sbjct: 1193 EECFIRMVDPRTGVDMIPHVLYNKGSNFARLDYKGFYTVILEKGDEILCVASIRLHGTKA 1252
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AELP +ATS D + QG + L IEK+L +V+ LVL + E W + FGF + +
Sbjct: 1253 AELPFIATSVDYRRQGMCRRLLDIIEKMLRSFHVEMLVLSAIPELVNTWVSGFGFKPIED 1312
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQK 1074
+E+ + RN LM+F G S+L K
Sbjct: 1313 DEKKQLRN-VNLMLFPGASLLTK 1334
>gi|414587171|tpg|DAA37742.1| TPA: hypothetical protein ZEAMMB73_064783 [Zea mays]
Length = 1316
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/438 (27%), Positives = 184/438 (42%), Gaps = 95/438 (21%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSLER---- 709
GI C CC + + S+F+ HAG + P +++ G S L SI+ + +
Sbjct: 914 GIQCQCCGTTFTMSKFKCHAG-LRQEVPSLNLFLDTGKSYSLCQLQAWSIEQKVRKEHAK 972
Query: 710 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ +NDD CG C DGG+L+CCD+CP ++H C+S IP G W+C C+
Sbjct: 973 DTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNWYCSSCL------ 1026
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
C +C K
Sbjct: 1027 -----------------------------------------CDICGEVIDSKELVTSLPA 1045
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C QCER+YHV C+ E G W C C I + + V
Sbjct: 1046 LDCSQCERQYHVKCVS--AKVPCNEDGSGTWFCGRKCHEIYMTFRSRV------------ 1091
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRG---KKVDASDGTRAL------LSKAVSIFHDRF 937
G + +D D+ + VLR KKV ++ + L A SI + F
Sbjct: 1092 ----------GVPDHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECF 1141
Query: 938 DPIIESASKLDLIPAMVYG-RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPL 996
PI++ + +D+IP+++Y RS +Y G Y +L + +VS R+ G LAE+PL
Sbjct: 1142 LPILDPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPL 1201
Query: 997 VATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNK 1056
VATS + + QG + L IE++L L V+ L+L + WT+ FGF + E ++ K
Sbjct: 1202 VATSKENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDK-K 1260
Query: 1057 YRNDYPLMIFQGTSMLQK 1074
+ L GT +L+K
Sbjct: 1261 RLSKVRLAAVPGTVLLKK 1278
>gi|297834364|ref|XP_002885064.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
gi|297330904|gb|EFH61323.1| hypothetical protein ARALYDRAFT_478922 [Arabidopsis lyrata subsp.
lyrata]
Length = 1173
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 115/432 (26%), Positives = 194/432 (44%), Gaps = 79/432 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKLSLERP-FS 712
G++CTCCN VS S+F+ HAG+ ++ P +++ +G L S + R +
Sbjct: 645 GVVCTCCNRTVSLSEFKNHAGF-NQNCPCLNLFMGSGKPFASCQLEAWSAEYKARRNGWR 703
Query: 713 SKE------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
S+E NDD CG+C DGG+L+CCD+CP FH C+S+ +P G+W
Sbjct: 704 SEEASDDDPNDDSCGVCGDGGELICCDNCPSTFHQACLSMQVLPEGSW------------ 751
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
Y ++ E V + R +DF
Sbjct: 752 ----YCSSCSCQICSELVSDNGE----------------------RSQDF---------- 775
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
C QC +YH CL+ G+ ++L + C +C+++ L V S
Sbjct: 776 -KCSQCAHKYHGICLQ--GISKRRKLFPETYFCGKNCEKVYTGLSSRVGVINPNADGLSW 832
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
++K E+ +V +++ + L+ A+SI + F +++ +
Sbjct: 833 SILKCFQEDG------------KVHSARRLALKAECNSKLAVALSIMEESFLSMVDPRTG 880
Query: 947 LDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+D+IP ++Y G + D+ G Y +L + V++S R+ G +AE+PLVAT + +
Sbjct: 881 IDMIPHVLYNWGSNFARLDFDGFYTMVLEKDDVMISVASIRVHGVTVAEMPLVATCSKYR 940
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L L V+ LV+ + WT FGF M +EE++ + LM
Sbjct: 941 RQGMCRILVAAIEEMLMSLKVEKLVVAALPSLVETWTEGFGFKPMDDEERDALKR-INLM 999
Query: 1065 IFQGTSMLQKPV 1076
+F GT +L K +
Sbjct: 1000 VFPGTILLMKTL 1011
>gi|242077879|ref|XP_002443708.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
gi|241940058|gb|EES13203.1| hypothetical protein SORBIDRAFT_07g000645 [Sorghum bicolor]
Length = 1020
Score = 153 bits (386), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 118/460 (25%), Positives = 197/460 (42%), Gaps = 96/460 (20%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL------- 705
G+ GILC+CC+ +S S F+AHA + R ++ +G S I+
Sbjct: 580 GWVTWDGILCSCCSKTLSISDFKAHAMISLPRSSL-NLCLQSGKSFTLCQIEAWNAEYMD 638
Query: 706 ----SLERPFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ R + + NDD CG C DGG+LLCCD+CP +H C+S+ +P +W+C C+
Sbjct: 639 RRSNACRRKVEAADGNDDTCGFCGDGGELLCCDNCPSTYHQSCLSVKELPDDSWYCHNCI 698
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
RI P TE K
Sbjct: 699 ----------------------------------CRICGCPVTE-------------KEI 711
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEK 880
+I C QC +H C+ + G +E+ +W C CK I L L V
Sbjct: 712 SSFSAIIKCLQCGAAHHDTCV-EMGATAFEEMDSDEWFCGTHCKEIYLGLHGCV------ 764
Query: 881 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG----------KKVDASDGTRALLSKAV 930
G ++++ + W +LR +K+ + + L+ A+
Sbjct: 765 ----------------GVESSLGDGLSWTILRCNSGGQKMHSVQKIAHAIECNSKLAVAL 808
Query: 931 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
++ + F ++++ + +++IP ++Y G + +Y G Y IL + ++ A R+ G
Sbjct: 809 TLMEECFAQMVDTRTGINMIPHVLYNQGSKYARLNYQGFYTVILEKGEEILCAASIRVHG 868
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+ AELP +AT + + +G + L IE++L +VK LVL + E + W + FGF
Sbjct: 869 MKAAELPFIATCREHRRKGMCRRLINTIEEMLKSFHVKMLVLSAIPELVSTWVSGFGFKP 928
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVDG 1088
+ EE + K + LM+F GTS+L K + + KS G
Sbjct: 929 I-EEYERKQLDTINLMLFPGTSLLIKSLEDGTLTEKSGAG 967
>gi|242036739|ref|XP_002465764.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
gi|241919618|gb|EER92762.1| hypothetical protein SORBIDRAFT_01g045406 [Sorghum bicolor]
Length = 331
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 154/289 (53%), Gaps = 34/289 (11%)
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC-------LKDHGMED 848
R+VQT C CR D + T+ C+QCER HV C +K +E
Sbjct: 68 RVVQTE-----SCYFCRYGDTEFGKLDPNTIFFCNQCERPCHVRCYNSRDRDVKKVPLEI 122
Query: 849 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRW 908
L+E ++LCC +C+ + L+ V++GEE + +++ + ++ W
Sbjct: 123 LKEYMCFRFLCCEECQSLRARLEG-VEKGEE------IAFLRQ----------IRSNICW 165
Query: 909 RVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGM 967
R+L K DAS + +S+A+ IF D F ++ S D+ MVYG++ G+ D+ GM
Sbjct: 166 RLL--SKADASRDVKLYMSQAIDIFKDAFVESTDAHS--DIFSDMVYGKNGAGEKDFRGM 221
Query: 968 YCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKT 1027
YC +LT + VVSA I ++ ++ AEL L+AT ++C+ +GYF+ L IE L NV
Sbjct: 222 YCVVLTASTHVVSAAILKVRVEQFAELVLIATRSECRKKGYFRLLLKSIEANLRACNVSL 281
Query: 1028 LVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
L+ P E IW+ K GF++++ EE+ +PL++FQ ++QKP+
Sbjct: 282 LMAPVDPEMAQIWSEKLGFTILSAEEKKSMLESHPLVMFQDLVLVQKPL 330
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 185/443 (41%), Gaps = 106/443 (23%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---- 714
GILC CC S S F+ H G S K ++ +G S ++ S K
Sbjct: 956 GILCNCCTKTFSISDFKVHGG-CSLPKSSLGLFLQSGKSYTLCQVEAWSAEFLSRKCDAS 1014
Query: 715 --------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C
Sbjct: 1015 GRKVEAMDENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC---------- 1064
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
+N R+ G +P ++ K +
Sbjct: 1065 ----HNCTCRSCG-----NPLSE----------------------------KEVSTFSAI 1087
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKL 881
+ C QC YH C+ D + LP G W C CK I + L
Sbjct: 1088 LKCLQCGDSYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL----------- 1130
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVS 931
H G +N +D ++ W +L+ G+K+ +S + L+ A++
Sbjct: 1131 -----------HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALT 1179
Query: 932 IFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQEL 991
I + F +++ + +D+IP ++ + DY G Y IL ++ R+ G +
Sbjct: 1180 ILEECFVRMVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKA 1237
Query: 992 AELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTE 1051
AELP +ATS D + QG + L IE +L +V+TLVL + E W + FGF + +
Sbjct: 1238 AELPFIATSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIED 1297
Query: 1052 EEQNKYRNDYPLMIFQGTSMLQK 1074
E+ + RN LM+F GTS+L K
Sbjct: 1298 NEKKQLRN-VNLMLFPGTSLLTK 1319
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 190/445 (42%), Gaps = 86/445 (19%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 707
GI C+CC + S FE HA S R +I+ +G SL E ++ L
Sbjct: 635 GIKCSCCRRIFTISGFEVHANGGSCRAA-ANIFLDDGRSLLECQVEAYETRKKAQPPDIL 693
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
+ END +C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 694 KMKLRQGENDVICSVCHYGGKLILCDGCPSAFHANCLGLEEVPDGDWFCESCC------- 746
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK--SRFGRR- 824
G C G+ F K S++ +
Sbjct: 747 -------------------------------------CGAC----GQFFLKATSKYAKEE 765
Query: 825 TVILCDQCEREYHVGCLKDHGMED-LQELPKGKWLCCADCKRINLALQKLVDR----GEE 879
I C QCE +YH CL+ G D L KW C DC+ I + L +L+ + G E
Sbjct: 766 KFISCKQCELKYHPSCLRYDGAGDSLDTFLGEKWFCSKDCEEIFVNLCELIGKPREVGVE 825
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDP 939
KL + +++ D+A K++A LS A+ + H+ F+P
Sbjct: 826 KL---TWRLVQSFEPNMYGDDAY------------KIEAVAENHCKLSVALDVMHELFEP 870
Query: 940 IIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+ DL +++ R + + ++ G Y +L N+ +V+ RI G+++AE+P +
Sbjct: 871 VKRPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNEELVTVATVRILGKKVAEMPFI 930
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
T + +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ ++
Sbjct: 931 GTRFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKEF 990
Query: 1058 RNDYPLMIFQGTSMLQKPVPKCRIV 1082
+ L+ F T + QK + K +V
Sbjct: 991 LK-FTLLEFGRTILCQKILIKSSVV 1014
>gi|18421570|ref|NP_568540.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|332006726|gb|AED94109.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 1179
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 124/443 (27%), Positives = 186/443 (41%), Gaps = 69/443 (15%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 709
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 710 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W YC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAW---YC------- 689
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
YN + + + E + L C LC + C +
Sbjct: 690 ----YNCSCKFCEKDEAAKHETSTLP-----------SLSSCRLCEEK--CSKHYPH--- 729
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET-S 885
L D H C+ G + C C+ + LQ + + LPE S
Sbjct: 730 TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGFS 780
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 945
+++ S D D+ ++ A ++ A S+ + F P+++ S
Sbjct: 781 WSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHRS 827
Query: 946 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1003
++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 828 GVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMY 887
Query: 1004 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1063
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + + E+ +N L
Sbjct: 888 RRQGMCRRLMDGIESALGSLKVDKLVIPAVPELIDTWTSGFGFAPVNDSEKKTIKN-LNL 946
Query: 1064 MIFQGTSMLQKPVPKCRIVGKSV 1086
++F G ML K + K +I V
Sbjct: 947 LVFPGVDMLGKSLVKEKITDSVV 969
>gi|449447297|ref|XP_004141405.1| PREDICTED: uncharacterized protein LOC101209931 [Cucumis sativus]
Length = 671
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 142/541 (26%), Positives = 230/541 (42%), Gaps = 108/541 (19%)
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQ--TSSHSKMKSSSVKSHGKITRK 620
+S E EL +SP E D N P+P A TS ++ + S ++
Sbjct: 168 ASSFEKPKDKYELVKSPVLPSE-QDVWNEKPQPRKAAKGTSRRNQRTALSYLVDRELISP 226
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGW 680
R+H V +G L V + RG + +GF I C CC++ S+FEAH G
Sbjct: 227 GDRVHCNVTRDGRL-----VTW--RGSIT-----NEGF-IKCDCCSNLFPISKFEAHTG- 272
Query: 681 ASRRKPFQHIYTSNGVSLHELSIKL------------SLERPFSSKENDD---------- 718
+++ +P +I+ +G SL + +L + E+ + +N D
Sbjct: 273 STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHNDNADSDTHGLDKND 332
Query: 719 -LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARA 777
+C +C GG+L+ CD CP AFH C+ + GIPSG W+C C + +++
Sbjct: 333 CICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCCCKICGQVTYDFD----- 387
Query: 778 AGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYH 837
+ V F RC+ QCE+ H
Sbjct: 388 ----DQVSSFDTSFVRCV-----------------------------------QCEQNVH 408
Query: 838 VGCLKD-HGMEDL-QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 895
+GC+K +ED Q + + W C C+ I++ LQ L+ +++P + E
Sbjct: 409 IGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLL---WKQIP------VGDARE- 458
Query: 896 SGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY 955
++ W +++ S+ R L+KA+ + H F P+ + +K DLI +
Sbjct: 459 ---------NLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFL 509
Query: 956 GRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLF 1013
+ + ++ G Y AIL VV+ R++G E+AE+PLVAT + G + L
Sbjct: 510 SKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLL 569
Query: 1014 CCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQ 1073
+E L + VK L LP+ EA WT FGF+ MT+ ++ Y + FQ T Q
Sbjct: 570 NELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK-YTFLGFQHTVRCQ 628
Query: 1074 K 1074
K
Sbjct: 629 K 629
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/438 (25%), Positives = 188/438 (42%), Gaps = 81/438 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-----VSLHELSIKL-------S 706
GI+C CCN +S ++F++HAG+ R P +++ +G L S +
Sbjct: 150 GIMCKCCNMVLSVTKFKSHAGFKLNR-PCSNLFMESGKPFTLCQLQAWSAEYKSRKSGTQ 208
Query: 707 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ R +NDD CG+C DGG+L+CCD+CP FH C+ +P G+W YC N
Sbjct: 209 VVRADEDDKNDDSCGLCGDGGELICCDNCPSTFHQACLCTEDLPEGSW---YCPNC---- 261
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
C +C D + +V
Sbjct: 262 ----------------------------------------TCWICG--DLVNDKEASSSV 279
Query: 827 --ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
C QCE +YH C G + + L W C C+ + L V
Sbjct: 280 GAYKCLQCEHKYHGAC--QQGKQTHEGLVSDAWFCSGSCQEVYSGLHSRVGINNPIADGF 337
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
+++ HE+ +VL +++ + L+ A++I + F +++
Sbjct: 338 CWTLLRCIHEDQ------------KVLSAQRLALKAECNSKLAVALTIMEECFQSMVDPR 385
Query: 945 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
+ +D+IP +Y G ++ G Y +L + V+VSA R+ G +AE+PL+AT ++
Sbjct: 386 TGIDMIPHALYNWGSDFARLNFFGFYTVVLEKDDVLVSAASVRVHGVTVAEMPLIATCSN 445
Query: 1003 CQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYP 1062
+ QG + L IE++L V+ LV+ + + WT FGF ++++E+ N
Sbjct: 446 YRRQGMCRHLMTAIEEMLISYKVEKLVISAIPDLVETWTKGFGFIPVSKDEKQSL-NKIN 504
Query: 1063 LMIFQGTSMLQKPVPKCR 1080
M+F GT +L+K + K +
Sbjct: 505 FMVFPGTILLKKQLYKTK 522
>gi|297801176|ref|XP_002868472.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314308|gb|EFH44731.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 1232
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 123/441 (27%), Positives = 187/441 (42%), Gaps = 73/441 (16%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLERP---- 710
GI C CC+ S FE HAG R +PF+ +Y G SL H+ K S +
Sbjct: 678 GIRCNCCDEVFSVLDFEVHAG-GKRNQPFKSLYLEGGNSLLQCLHDFMNKQSESQHKGYH 736
Query: 711 ----FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 765
S NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 737 FVDFCSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCCNCSCKFCEK 796
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 825
+ ++ +A + L C LC + C + +
Sbjct: 797 VEAAIHDTSALHS--------------------------LSSCRLCEEK--CSNHYPH-- 826
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET- 884
L D H C+ G + C C+ + LQ L+ + LPE
Sbjct: 827 -TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLLIGV-KHPLPEGF 876
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
S +++ S D D+ ++ K ++ A S+ + F P+++
Sbjct: 877 SWSFLRRFELPS---EVADCDISEKIAYNAK----------MAVAFSVMDECFSPLVDHR 923
Query: 945 SKLDLIPAMVYG---RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1001
S ++L+ +VY HR ++ A+L +++ RI G +LAE+P + T
Sbjct: 924 SGVNLLQNIVYNFWSNFHR-LNFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRY 982
Query: 1002 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1061
+ QG + L IE LG L V LV+P+ E WT+ FGF+ + E E+ +N
Sbjct: 983 MYRRQGMCRRLMDGIESALGSLKVAKLVIPAVPELIDTWTSGFGFTPVNESEKKTIKN-L 1041
Query: 1062 PLMIFQGTSMLQKPVPKCRIV 1082
L++F G ML K + K +I
Sbjct: 1042 NLLVFPGVDMLGKSLVKEQIT 1062
>gi|449511699|ref|XP_004164030.1| PREDICTED: uncharacterized LOC101209931 [Cucumis sativus]
Length = 694
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 146/541 (26%), Positives = 232/541 (42%), Gaps = 108/541 (19%)
Query: 563 SSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQ--TSSHSKMKSSSVKSHGKITRK 620
+S E EL +SP E D N P+P A TS ++ + S ++
Sbjct: 191 ASSFEKPKDKYELVKSPVLPSE-QDVRNEKPQPRKAAKGTSRRNQRTALSYLVDRELISP 249
Query: 621 DLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGW 680
R+H V +G L V + RG + +GF I C CC++ S+FEAH G
Sbjct: 250 GDRVHCNVTRDGRL-----VTW--RGSIT-----NEGF-IKCDCCSNLFPISKFEAHTG- 295
Query: 681 ASRRKPFQHIYTSNGVSLHELSIKL------------SLERPFSSKENDD---------- 718
+++ +P +I+ +G SL + +L + E+ + +N D
Sbjct: 296 STKHRPAANIFLEDGRSLLDCQKQLVQNNDQIQKETKATEKKVNHNDNADSDTHGLDKND 355
Query: 719 -LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARA 777
+C +C GG+L+ CD CP AFH C+ + GIPSG W+C C +
Sbjct: 356 CICSVCHFGGELILCDLCPAAFHGSCLGIKGIPSGNWYCPSCC--------------CKI 401
Query: 778 AGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYH 837
G++ F VS + DT CV QCE+ H
Sbjct: 402 CGQV--TYDFDDQVS-------SFDTSFVRCV---------------------QCEQNVH 431
Query: 838 VGCLKD-HGMEDL-QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 895
+GC+K +ED Q + + W C C+ I++ LQ L+ +++P + E
Sbjct: 432 IGCVKSIQVLEDSNQTIDRENWFCTRRCEDIHMGLQNLL---WKQIP------VGDARE- 481
Query: 896 SGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY 955
++ W +++ S+ R L+KA+ + H F P+ + +K DLI +
Sbjct: 482 ---------NLTWTLMKHCPYKVSEHNRKKLNKALGVMHKSFRPVKDPITKNDLIEDVFL 532
Query: 956 GRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLF 1013
+ + ++ G Y AIL VV+ R++G E+AE+PLVAT + G + L
Sbjct: 533 SKRSESKRLNFEGFYTAILERKNTVVTVATVRVYGDEVAEIPLVATRLKYRRHGMCRRLL 592
Query: 1014 CCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQ 1073
+E L + VK L LP+ EA WT FGF+ MT+ ++ Y + FQ T Q
Sbjct: 593 NELEHQLIEMGVKRLTLPAVPEALNTWTKGFGFTKMTDSDRLDLIK-YTFLGFQHTVRCQ 651
Query: 1074 K 1074
K
Sbjct: 652 K 652
>gi|357440715|ref|XP_003590635.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
gi|355479683|gb|AES60886.1| hypothetical protein MTR_1g072130 [Medicago truncatula]
Length = 1672
Score = 149 bits (377), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 129/502 (25%), Positives = 199/502 (39%), Gaps = 119/502 (23%)
Query: 645 RGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL--- 701
R E L G G GI C CCN V+ S F AHAG + + P ++IYT SL +
Sbjct: 834 RLENASLEGKITGDGIHCGCCNEIVTISDFGAHAG-SKQSDPLKNIYTEEETSLLQCLLD 892
Query: 702 SIKLSLERPFSS---------KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSG 752
S E S NDD CG+C DGGDL+CCD CP FH C+ + PSG
Sbjct: 893 SWNKQDESELKSFHFFDVAGEDPNDDTCGVCGDGGDLICCDGCPSTFHKSCLDIKKFPSG 952
Query: 753 TWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 812
WHC YC F R+V GG
Sbjct: 953 DWHCAYCCCKF-------------------------------CRLV-------GG---SS 971
Query: 813 GRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
+ F ++ C CE ++H+ C++ +G + + K C C+ ++ L+
Sbjct: 972 NQSVVNDEFTMPALLTCHLCEEKFHISCVEANGGK--TDDSKDALFCGNKCQELSERLEM 1029
Query: 873 LVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSI 932
L+ E S I++ SD D + L V+ + + L+ A+SI
Sbjct: 1030 LLGVKHEIEDGFSWSFIRR------SDVGCDLSLTNPQL----VECN----SKLAVALSI 1075
Query: 933 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
++ F P I+ S +L+ +++Y G + + DY G IL ++ R+ G
Sbjct: 1076 MNECFMPYIDHRSGTNLLRSILYNCGSNFKRLDYSGFITVILERGDEIICVASIRVHGNR 1135
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIE--------------------------------- 1017
LAE+P + T + QG + L IE
Sbjct: 1136 LAEMPYIGTRYMYRRQGMCRRLLNAIESEAVYGLSVNLGACPMVVTVRVCNYSMNGSRGF 1195
Query: 1018 -------------KLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
K L L+V+ LV+P+ SE + WT+ FGF + + + + N+ L+
Sbjct: 1196 VFDVLRGNDWGGPKALSSLDVELLVIPAISELRETWTSVFGFEPLKQTSK-QITNNMNLL 1254
Query: 1065 IFQGTSMLQKPVPKCRIVGKSV 1086
+F +LQK + K I +++
Sbjct: 1255 VFPHVDLLQKKISKHAIANENL 1276
>gi|242075844|ref|XP_002447858.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
gi|241939041|gb|EES12186.1| hypothetical protein SORBIDRAFT_06g017030 [Sorghum bicolor]
Length = 1340
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 190/444 (42%), Gaps = 96/444 (21%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 709
GI C CC + + S+F+ HAG + P +++ G S L SI K+ ER
Sbjct: 903 GIRCRCCGTTFTMSKFKCHAG-LRQEVPSLNLFLGTGKSYSLCLLQAWSIEQKVRKERVK 961
Query: 710 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+ +NDD CG C DGG+L+CCD+CP ++H C+ IP G W+C C+
Sbjct: 962 DTMSLQADQNDDTCGSCGDGGELICCDNCPASYHQACLPCQDIPDGNWYCSSCL------ 1015
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
C +C K
Sbjct: 1016 -----------------------------------------CNICGEVITSKELRTSLPA 1034
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C QCER+YHV C+ E G W C C++I + + V
Sbjct: 1035 LECSQCERQYHVKCVS--AKVSCNEDGPGTWFCGRKCQQIYMIFRSRV------------ 1080
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLR--GKKVDASDGTRALLSK-------AVSIFHDRF 937
G + VD D+ +LR G K + G AL+++ A+SI + F
Sbjct: 1081 ----------GVPDHVDNDLSCTILRNNGDKKVRTAGEIALMAECNMKLMIALSIMEECF 1130
Query: 938 DPIIESASKLDLIPAMVYG-RSHRGQDYH-GMYCAILTVNQVVVSAGIFRIFGQELAELP 995
PI++ + +D+IP+++Y RS H G Y +L + +VS R+ G +AE+P
Sbjct: 1131 LPILDPRTGIDIIPSILYNWRSDFIHFNHKGFYTVVLENDDSMVSVASIRLHGTIVAEMP 1190
Query: 996 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1055
LVATS + + QG + L IE++L L V+ L+L + WT+ FGF + + ++
Sbjct: 1191 LVATSTENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLVETWTSTFGFREIDDSDK- 1249
Query: 1056 KYRNDYPLMIFQGTSMLQKPVPKC 1079
K + L GT +L+K + +C
Sbjct: 1250 KRLSMVRLAAVPGTVLLKKNLCEC 1273
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 707
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 956 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 1014
Query: 708 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 1015 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 1074
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C D K R
Sbjct: 1075 -----------------------------------------------CRICG--DLVKDR 1085
Query: 821 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 1086 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 1142
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 1143 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1190
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 993
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1191 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1247
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1248 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1307
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPV 1076
+ + LM+F GT +L+K +
Sbjct: 1308 KASLKK-INLMVFPGTILLKKSL 1329
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 707
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 892 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 950
Query: 708 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 951 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 1010
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C D K R
Sbjct: 1011 -----------------------------------------------CRICG--DLVKDR 1021
Query: 821 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 1022 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 1078
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 1079 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1126
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 993
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1127 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1183
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1184 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1243
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPV 1076
+ + LM+F GT +L+K +
Sbjct: 1244 KASLKK-INLMVFPGTILLKKSL 1265
>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
Length = 1437
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/479 (25%), Positives = 198/479 (41%), Gaps = 87/479 (18%)
Query: 624 MHKLVFEEGGLEDGAEVGYFVRGEVK-FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWAS 682
++ + + G + + +V Y + L G GI C CC+ ++ +FE HAG +
Sbjct: 629 VYSWMIDLGVVSEDMQVKYMNNNRTREMLAGKITREGIFCGCCSKILTVGKFELHAG-SK 687
Query: 683 RRKPFQHIYTSNG-VSLHELSI------KLSLERPF-----SSKENDDLCGICMDGGDLL 730
+KP+ +I+ G VSL + + L + F E+DD C IC DGGDL+
Sbjct: 688 EKKPYANIFLEGGKVSLLQCLLDAWEKHTLCENKGFYKIDKGEDEHDDTCAICGDGGDLV 747
Query: 731 CCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQM 790
CCD C FH+DC+ + +PSG W+CR C+ F
Sbjct: 748 CCDHCASTFHLDCLGIK-LPSGDWYCRSCLCRF--------------------------- 779
Query: 791 VSRCIRIVQTPDTE---LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME 847
C + P + L C+ C R+YH C G +
Sbjct: 780 ---CGFPQEKPSSSPELLLSCLQC---------------------SRKYHQTCSSGTGTD 815
Query: 848 DLQELPKGKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFD 905
+P C C++I L KL+ IK E S + V
Sbjct: 816 FDCTIPGTSIDCFCSPGCRKIYKRLNKLLG-------------IKNHMEAGFSWSLVHCF 862
Query: 906 VRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQD 963
+ + K + + ++ A ++ + F P I+ S +++I + Y G D
Sbjct: 863 PNDQAMPPKNKEKMAQCNSKIALAFTVLDECFQPHIDERSGINMIHNVAYNCGSDFSRLD 922
Query: 964 YHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFL 1023
+ G Y IL V+SA RI G +LAE+P + T + QG + L IE L L
Sbjct: 923 FSGFYAFILERGDEVISAASVRIHGTDLAEMPFIGTRGMYRHQGMLRRLLNGIESALCSL 982
Query: 1024 NVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIV 1082
NV+ LV+ + +E + WT FGF + ++ + ++ L+I GT +L+K + + + V
Sbjct: 983 NVQKLVVSAVTEMENTWTTVFGFKPVQPSKKQRIKS-LNLLIMNGTGLLEKRLLQTQTV 1040
Score = 42.7 bits (99), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 12/91 (13%)
Query: 396 PRKVMRKFYSKLKNFLESGIL-EGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCK 454
P + R YS + ++ G++ E M V Y+ ++ + L G + GI C C C
Sbjct: 622 PYEWKRTVYSWM---IDLGVVSEDMQVKYMNNNRTR----EMLAGKITREGIFCGC--C- 671
Query: 455 GNQVVTPAVFELHAGSSNKRPPEYIYLENGK 485
++++T FELHAGS K+P I+LE GK
Sbjct: 672 -SKILTVGKFELHAGSKEKKPYANIFLEGGK 701
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 192/443 (43%), Gaps = 88/443 (19%)
Query: 653 GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSIKLSL 707
GY GI+C CC S F+ HAG+ R P ++++ +G S L S + +
Sbjct: 795 GYVTRDGIVCKCCTELFSVCNFKIHAGFKLNR-PCRNLFMESGKSFTLCQLQAWSTEYKV 853
Query: 708 ERPFSSK-------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
+ +NDD CG+C DGG+L+CCD+CP FH C+S +P G W+C C
Sbjct: 854 RKGGIKNVQIDEIDQNDDSCGLCGDGGELICCDNCPSTFHQACLSAKELPEGNWYCPNCT 913
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C D K R
Sbjct: 914 -----------------------------------------------CRICG--DLVKDR 924
Query: 821 FGRRTVIL--CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
+ + C QCE +YH+ CLK+ ++E+ C +C+ I LQ L+
Sbjct: 925 EASSSFLALKCSQCEHKYHMPCLKEKC---VKEVGGDARFCGENCQEIYSGLQGLLGFVN 981
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFD 938
+ +++ H++ +V +K+ + L+ A++I + F
Sbjct: 982 HIADGFTWTLLRCIHDDQ------------KVHSSQKLALKAECNSKLAVALTIMEECFL 1029
Query: 939 PIIESASKLDLIPAMVYGRSHRGQDY-----HGMYCAILTVNQVVVSAGIFRIFGQELAE 993
+++ + +D+IP ++Y +RG D+ +G Y +L + +VS R+ G +AE
Sbjct: 1030 SMVDPRTGIDMIPHVLY---NRGSDFARLNFNGFYTVVLEKDDALVSVASIRVHGVTVAE 1086
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEE 1053
+PL+AT + +G + L IEK+L + V+ +V+ + WT FGF + ++E
Sbjct: 1087 MPLIATYEKFRSKGMCRLLMNAIEKMLKSVKVEKIVVAAIPSLVETWTLGFGFKPVEDDE 1146
Query: 1054 QNKYRNDYPLMIFQGTSMLQKPV 1076
+ + LM+F GT +L+K +
Sbjct: 1147 KASLKK-INLMVFPGTILLKKSL 1168
>gi|343172436|gb|AEL98922.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 147 bits (370), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/463 (26%), Positives = 197/463 (42%), Gaps = 92/463 (19%)
Query: 642 YFVRG--EVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 699
Y+++G +V G GI C CC S S F+AH + +P ++++ NG SL
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLSGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 700 ELSI-------------------------KLSLERPFSSKE-NDDLCGICMDGGDLLCCD 733
+ K L P S+ ND +C IC GGDL+CCD
Sbjct: 109 SCQVELMRKKIMRFNQEPVVRATGTGSRSKFRLLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP +FH C+N IE V P
Sbjct: 169 RCPSSFHAT----------------CLN-------------------IERV-PEGDWFCP 192
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
C D++ D +F +++ C QCER++H C K+ GM +E
Sbjct: 193 CCCCGICGDSQF---------DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE-- 241
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
W CC C+ + LQ+L+ G+ L +L K + +++ VD+D+
Sbjct: 242 --HWFCCKTCEMMQWGLQQLL--GKPILVGHNLTCTLIKPMQYQAEDRVDYDL------- 290
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
A + LS A+ + H+ FDP+ + +K DL+ +++ G + ++ G Y +
Sbjct: 291 ---AAMAENYSKLSVALEVMHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVL 347
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L N +++ + RI+G ++AE+PL+ T + G + L IEK L L V+ LVLP
Sbjct: 348 LERNDELIAVALLRIYGDKVAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLP 407
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++ WT FGF+ MTE ++ + + + F T M +K
Sbjct: 408 ASRSVLNTWTTSFGFTPMTESDRLDFLG-FTFLDFHDTIMCKK 449
>gi|21741218|emb|CAD41029.1| OSJNBb0086G13.1 [Oryza sativa Japonica Group]
gi|38345370|emb|CAE03210.2| OSJNBa0088K19.9 [Oryza sativa Japonica Group]
Length = 1456
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 117/435 (26%), Positives = 189/435 (43%), Gaps = 77/435 (17%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 709
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 975 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 1033
Query: 710 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
P + ENDD CG+C DGG+L+CCD+CP ++H DC+ IP G+W+C
Sbjct: 1034 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY--------- 1084
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
RC+ C +C K
Sbjct: 1085 --------------------------RCL------------CDICGEVINLKELRSSLPA 1106
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C QCER+YH C+ + + + P W C C++I + L+ V + S
Sbjct: 1107 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1165
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
V++ ++ RV + L A+SI + F PII++ +
Sbjct: 1166 TVLRNNGDQ-------------RVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTG 1212
Query: 947 LDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+D+IP ++Y RS DY G Y +L + ++S R+ G +AE+PL+AT + +
Sbjct: 1213 IDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1272
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L L V+ L+L + WT FGF + + ++ K + L+
Sbjct: 1273 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDR-KNLSRLRLV 1331
Query: 1065 IFQGTSMLQKPVPKC 1079
GT +L++ + +C
Sbjct: 1332 SVPGTVLLKRNLYEC 1346
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 146 bits (369), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 116/439 (26%), Positives = 186/439 (42%), Gaps = 82/439 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 707
GI C+CC + + FE HA AS +I+ +G SL E ++ L
Sbjct: 635 GIKCSCCRRVFTINGFEVHANGASCSGA-ANIFLDDGRSLLECQVEAYKKRKKAQPPDML 693
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH-----CRYCMNT 762
+ END C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 694 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 753
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
F K + NA+ + +F
Sbjct: 754 FLKTT----STNAK-----------------------------------------EEKF- 767
Query: 823 RRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 881
I C QCE +YH CL+ D + L ++ KW C DC+ I + L L+ + E
Sbjct: 768 ----ISCKQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDCEEIFVILYDLIGKPRE-- 821
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
S++ + + +S N D K++A+ +LS A+ + H+ F+P+
Sbjct: 822 --VSVEKLTWRLVQSLEPNMYGDD-------ASKIEAAAENHCILSVALDVMHELFEPVK 872
Query: 942 ESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
DL +++ R + + ++ G Y +L N +VS RI G+++AE+P + T
Sbjct: 873 RPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 932
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ +
Sbjct: 933 RFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK 992
Query: 1060 DYPLMIFQGTSMLQKPVPK 1078
+ L+ F T + +K + K
Sbjct: 993 -FTLLEFGRTILCEKILIK 1010
>gi|218194880|gb|EEC77307.1| hypothetical protein OsI_15961 [Oryza sativa Indica Group]
Length = 2505
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 117/435 (26%), Positives = 189/435 (43%), Gaps = 77/435 (17%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 709
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 924 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 982
Query: 710 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
P + ENDD CG+C DGG+L+CCD+CP ++H DC+ IP G+W+C
Sbjct: 983 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCLPCQDIPDGSWYCY--------- 1033
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
RC+ C +C K
Sbjct: 1034 --------------------------RCL------------CDICGEVINLKELRSSLPA 1055
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSL 886
+ C QCER+YH C+ + + + P W C C++I + L+ V + S
Sbjct: 1056 LECAQCERQYHAKCIYGKLLCNEEGGPCA-WFCGRRCQQIYMNLRSRVGIPIHTIDGFSC 1114
Query: 887 DVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASK 946
V++ ++ RV + L A+SI + F PII++ +
Sbjct: 1115 TVLRNNGDQ-------------RVSTAADIAILAECNMKLVIALSIMEECFLPIIDARTG 1161
Query: 947 LDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQ 1004
+D+IP ++Y RS DY G Y +L + ++S R+ G +AE+PL+AT + +
Sbjct: 1162 IDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRLHGTVVAEMPLIATCLENR 1221
Query: 1005 GQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLM 1064
QG + L IE++L L V+ L+L + WT FGF + + ++ K + L+
Sbjct: 1222 QQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGFVPIDDLDR-KNLSRLRLV 1280
Query: 1065 IFQGTSMLQKPVPKC 1079
GT +L++ + +C
Sbjct: 1281 SVPGTVLLKRNLYEC 1295
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 164/377 (43%), Gaps = 93/377 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 835 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 889
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 890 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 939
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 940 IIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLV 997
+++ + +D+IP ++Y G + DY G Y IL ++ R+ G + AELP +
Sbjct: 168 MVDPRTGVDMIPHVLYNKGSNFARLDYQGFYTVILEKGDEILCVASIRVHGTKAAELPFI 227
Query: 998 ATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKY 1057
ATS D + QG + L IE +L +V+TLVL + E W + FGF + + E+ +
Sbjct: 228 ATSVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQL 287
Query: 1058 RNDYPLMIFQGTSMLQK 1074
RN LM+F GTS+L K
Sbjct: 288 RN-VNLMLFPGTSLLTK 303
>gi|297745879|emb|CBI15935.3| unnamed protein product [Vitis vinifera]
Length = 687
Score = 144 bits (363), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/448 (28%), Positives = 189/448 (42%), Gaps = 99/448 (22%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL--------HELSIKLSLERP 710
GI +CC SP FEAHAG +S + +I+ + SL H ++ K +
Sbjct: 279 GIKNSCCQEVFSPRGFEAHAG-SSFHQSDANIFLEDEGSLLEGQRQMVHRITGKSFTKES 337
Query: 711 FSSKE-------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
K+ NDD+C +C GGDL+ CD CP FH C+ L +P G W C C
Sbjct: 338 SHGKKSNGDQCNNDDICSVCHYGGDLVLCDQCPSCFHQSCLGLKELPEGDWFCPSCCCRI 397
Query: 764 QKE-KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
E +F EY+ D F +F
Sbjct: 398 CGENRFDEYSEE----------DNF--------------------------------KFS 415
Query: 823 RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLP 882
C QCE +YHVGCL+ L+ P G C C++I L+ + +P
Sbjct: 416 ------CHQCELQYHVGCLRKQRHVKLETYPDGTRFCSTQCEKI---FLGLLKLLGKPIP 466
Query: 883 ETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD----GTRAL------LSKAVSI 932
G DN + W +L+ + D +AL L+ A+++
Sbjct: 467 -------------VGVDN-----LTWTLLKPTISEWFDMDVPDNKALTEVYSKLNIALNV 508
Query: 933 FHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQE 990
H+ F+PI E + DL+ +++ G + ++ G Y +L N ++S R+ G++
Sbjct: 509 MHECFEPIKEPHTGRDLVEDVIFCRGSDLKRLNFRGFYIVLLERNDELISVATIRVHGEK 568
Query: 991 LAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMT 1050
+AE+PLV T + + G + L IEK L L V+ L LP+A W FGFS MT
Sbjct: 569 VAEVPLVGTRSQYRRLGMCRILINEIEKKLVELGVERLTLPAAPSVLDTWVTSFGFSKMT 628
Query: 1051 EEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+ E+ + DY + FQ T M QK + K
Sbjct: 629 DSERLTFL-DYTFLDFQDTVMCQKLLMK 655
>gi|357484203|ref|XP_003612389.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513724|gb|AES95347.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 122/462 (26%), Positives = 196/462 (42%), Gaps = 113/462 (24%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----------------HEL- 701
I+C CC+ + + FE+HAG +R +P I +G SL H +
Sbjct: 37 AIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVV 95
Query: 702 --------SI-------KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
SI + + S +ND++C IC GGDL CD CP AFH+ C+ L
Sbjct: 96 NENQKKNHSIVKENRKKNHCVVKENSEAKNDNVCSICGFGGDLALCDRCPSAFHLGCLGL 155
Query: 747 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
+P G W C C C +I P
Sbjct: 156 NRVPIGEWFCPTC----------------------------------CCKICYRP----- 176
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
+ + CK +++C QCE++YH GC+K G+E + W C C +
Sbjct: 177 -----KCKQECKDH-KDNNILVCVQCEQKYHFGCVKAVGIE-FNHMEN--WFCSVVCGNM 227
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG-KKVDASDGTR-- 923
L L+KL+ G V ++ W +++ VD + +
Sbjct: 228 FLCLKKLL----------------------GKPIKVADNLTWTLVKNVSSVDDKEFNQKE 265
Query: 924 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS--HRGQDYHGMYCAILTVNQVVVSA 981
+ L+ A+ + ++ F+P ++ S +LI +V+ R H ++ G Y IL V+S
Sbjct: 266 SKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISV 325
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
RI+GQ++AE+ VAT + QG L IEK L L V+ L+L S+ +A IWT
Sbjct: 326 ATVRIYGQKVAEVVFVATKEQYRRQGICHLLMDEIEKQLTRLGVEKLLLHSSEDAMNIWT 385
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSM----LQKPVPKC 1079
FGF+ M +++ ++ D+ + FQ ++M L+ P+ C
Sbjct: 386 KSFGFARMASKDKCQFI-DHTFLEFQNSTMCLKALKTPIWSC 426
>gi|343172434|gb|AEL98921.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
protein, partial [Silene latifolia]
Length = 450
Score = 142 bits (359), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 119/463 (25%), Positives = 198/463 (42%), Gaps = 92/463 (19%)
Query: 642 YFVRG--EVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 699
Y+++G +V G GI C CC S + F+AH + +P ++++ NG SL
Sbjct: 49 YYLKGNSDVPLNEGRISSDGIKCNCCQKLFSLTGFQAHVTGNNICRPAENLFLGNGKSLV 108
Query: 700 ELSIKLSLER-------------------------PFSSKE-NDDLCGICMDGGDLLCCD 733
++L ++ P S+ ND +C IC GGDL+CCD
Sbjct: 109 SCQVELMRKKIMMFDQGPAVRAAGTGSRSKFRSLAPLGSENCNDYVCSICHYGGDLICCD 168
Query: 734 SCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
CP +FH C+N IE V P
Sbjct: 169 RCPSSFHAA----------------CLN-------------------IESV-PEGDWFCP 192
Query: 794 CIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELP 853
C D++ D +F +++ C QCER++H C K+ GM +E
Sbjct: 193 CCCCGICGDSQF---------DKMAEQFADDSLLRCHQCERQFHARCKKEGGMVSSEE-- 241
Query: 854 KGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG 913
W CC C+ + LQ+L+ G+ L +L K + +++ D+D+
Sbjct: 242 --HWFCCKTCEMMQWGLQQLL--GKPILVGQNLTCTLIKPMQYQAEDREDYDLA------ 291
Query: 914 KKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAI 971
A + LS A+ + H+ FDP+ + +K DL+ +++ G + ++ G Y +
Sbjct: 292 ----AMAENYSKLSVALEVMHECFDPVKDPKTKRDLVEDVLFCRGSNLNRLNFRGFYTVL 347
Query: 972 LTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLP 1031
L N +++ + RI+G ++AE+PL+ T + G + L IEK L L V+ LVLP
Sbjct: 348 LERNDELIAVALLRIYGDKVAEMPLIGTRFQHRRLGMCRILVNEIEKTLLNLGVQKLVLP 407
Query: 1032 SASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
++ WT FGF+ MTE ++ + + + F T M +K
Sbjct: 408 ASRSVLNTWTTSFGFTPMTESDRLDFLG-FTFLDFHDTIMCKK 449
>gi|29837188|dbj|BAC75570.1| PHD-type zinc finger protein-like [Oryza sativa Japonica Group]
gi|125601616|gb|EAZ41192.1| hypothetical protein OsJ_25694 [Oryza sativa Japonica Group]
Length = 744
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/442 (26%), Positives = 187/442 (42%), Gaps = 92/442 (20%)
Query: 667 SEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKLSLE-------------- 708
+E + ++F H +S R+P+ ++ +G SL + ++L +
Sbjct: 332 AEYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKK 391
Query: 709 -RPFSSK---------ENDD-LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 757
P+ + E+DD +C +C D G+LL CD CP FH CV L P G W C
Sbjct: 392 YLPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCP 451
Query: 758 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFC 817
C G + DP A ++ F
Sbjct: 452 ACTCAI--------------CGSSDLDDPPATTTTQ---------------------GFS 476
Query: 818 KSRFGRRTVILCDQCEREYHVGCLK--DHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
R VI C+QC REYHVGC++ D+G+ + +G WLC C +I L L++L
Sbjct: 477 SDRM----VISCEQCRREYHVGCMRERDNGLWYPEADGEGPWLCSEACSKIYLRLEELA- 531
Query: 876 RGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHD 935
V++ S ++ + RG D + A L A+ + +
Sbjct: 532 ------------VVQAPCRSVASGLSLVV-----LRRGAARDGEEEEHAKLCMALDVLRE 574
Query: 936 RFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAE 993
F +IE ++ DL +V+ R D+ G Y L +++ R++G+E+AE
Sbjct: 575 CFVTLIEPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAE 634
Query: 994 LPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN-KFGFSMMTEE 1052
+PLV T + QG + L I+KLLG + V+ LVLP+ E A WT FGF M +
Sbjct: 635 VPLVGTRFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGFREMGQA 694
Query: 1053 EQNKYRNDYPLMIFQGTSMLQK 1074
++ + + ++ FQGT M K
Sbjct: 695 DRQDVAH-HAILRFQGTIMCHK 715
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 125/473 (26%), Positives = 195/473 (41%), Gaps = 100/473 (21%)
Query: 628 VFEEGGLEDGAEVGYF-VRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G + A+V Y +G LVG GI C CC+ ++ ++FE HAG + ++P
Sbjct: 748 MIDLGVVSKDAKVKYMNKKGTRARLVGRVTREGICCDCCSKILTVAKFELHAG-SKEQQP 806
Query: 687 FQHIYTSNG-VSLHELSIKLSLERPFSSKEN-----------DDLCGICMDGGDLLCCDS 734
+ +I+ +G +SL + + ++ + K+ DD CGIC DGGDLLCCD
Sbjct: 807 YANIFLEDGGLSLFQCLLNAWDKQAQNEKKGFYKIDPADDPDDDTCGICGDGGDLLCCDR 866
Query: 735 CPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRC 794
C FH+ C+ + +PSG W CR C+ F G E
Sbjct: 867 CTSTFHVACLGIE-MPSGDWFCRNCICKF--------------CGSAE------------ 899
Query: 795 IRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
+P EL C+ C R+YH C + E + P
Sbjct: 900 -ERTSSP-AELLSCLQC---------------------SRKYHQVCAQGIEREFVSTTPS 936
Query: 855 GKW--LCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR 912
C C +I L++L+ G N ++ W ++R
Sbjct: 937 ASIDCFCSPGCTKIYKRLKRLL----------------------GLKNDLEAGFSWSLVR 974
Query: 913 --GKKVDASDGTRALL-------SKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRG 961
S +A L + A S+ + F P I+ S +++I +VY G
Sbjct: 975 CFADTEATSTKKKAQLVHCNSKTALAFSVLDECFLPRIDERSGINIIHNVVYNCGSDFSR 1034
Query: 962 QDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLG 1021
++ G Y IL V+SA RI G + AE+P + T + QG L IE L
Sbjct: 1035 LNFSGFYTFILERGDEVISAATVRIHGTDFAEMPFIGTRGMYRHQGMCHRLLDAIESALC 1094
Query: 1022 FLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
LNV+ LV+P+ E Q W+ FGF + ++ K ++ L+I GT +L+K
Sbjct: 1095 SLNVRRLVIPAIPELQNTWSTVFGFKPVGPTKKQKIKS-VNLLIIHGTGLLEK 1146
>gi|224066495|ref|XP_002302109.1| predicted protein [Populus trichocarpa]
gi|222843835|gb|EEE81382.1| predicted protein [Populus trichocarpa]
Length = 392
Score = 141 bits (355), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 125/442 (28%), Positives = 185/442 (41%), Gaps = 92/442 (20%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKEND- 717
GI C CC + FE+HAG + +P ++I NG SL ++ LE E+D
Sbjct: 18 GIQCDCCGETFAILDFESHAG-SKSCQPLKNICLENGHSL----LQCQLESWNKQDESDR 72
Query: 718 ---------------DLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 762
D CGIC DGG+L+CCDSCP FH C+ + PSG W+C YC
Sbjct: 73 KGFHFVDTDDQDPNDDTCGICGDGGNLICCDSCPSTFHQSCLEIKKFPSGVWNCTYCSCK 132
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
F A G +D DT
Sbjct: 133 F----------CGMAGGDTCQMDE--------------NDTA-----------------A 151
Query: 823 RRTVILCDQCEREYHVGC-LKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 881
+ ++ C CE +YH C L ++ + D C C+ + LQ L+ E
Sbjct: 152 QPALLACCLCEEKYHHSCILAENTVNDGYSSVS---FCGKKCQELYDKLQALLGVKHEME 208
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRAL-----LSKAVSIFHDR 936
+ ++++ FDV G + S R + ++ A+ I +
Sbjct: 209 EGFAWTLVRR------------FDV------GSDISLSGMHRKVECNSKVAVALHIMDEC 250
Query: 937 FDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 994
F P+ + S ++LI +VY G + +Y G AIL ++SA RI G LAE+
Sbjct: 251 FLPMPDHRSGVNLIRNIVYNFGSNFNRLNYSGFLTAILERGDEIISAASIRIHGNHLAEM 310
Query: 995 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1054
P + T + + QG + L IE L LNV+ LV+P+ SE + WT+ FGF + +
Sbjct: 311 PFIGTRHMYRRQGMCRRLLSAIETALCSLNVEKLVIPAISELRETWTSVFGFKPLEGSSK 370
Query: 1055 NKYRNDYPLMIFQGTSMLQKPV 1076
K RN ++ F G MLQKP+
Sbjct: 371 QKMRN-MKMVAFPGIDMLQKPL 391
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 93/375 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 835 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 889
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 890 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 939
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGRKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 940 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
+++ + +D+IP ++ + DY G Y IL ++ R+ G + AELP +AT
Sbjct: 168 MVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIAT 225
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
S D + QG + L IE +L +V+TLVL + E W + FGF + + E+ + RN
Sbjct: 226 SVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN 285
Query: 1060 DYPLMIFQGTSMLQK 1074
LM+F GTS+L K
Sbjct: 286 -VNLMLFPGTSLLTK 299
>gi|212721124|ref|NP_001132249.1| uncharacterized protein LOC100193685 [Zea mays]
gi|194693876|gb|ACF81022.1| unknown [Zea mays]
gi|414865116|tpg|DAA43673.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 565
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 143/288 (49%), Gaps = 34/288 (11%)
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL------KDHGMEDL 849
R+VQT C C + ++ C+QCER HV C K +E L
Sbjct: 304 RVVQTES-----CYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEIL 358
Query: 850 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 909
++ +LCC +C+ + + L+ + +K EE + ++ WR
Sbjct: 359 KDYLCFHFLCCQECQSLRVRLEGM-----------------EKCEEIAFLGRIRSNICWR 401
Query: 910 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGMY 968
+L DAS + LS+ + IF D F +ES I MVYG++ G+ D+ GMY
Sbjct: 402 LL--SSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMY 456
Query: 969 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1028
C +LT + VVSA I ++ + AEL L+AT ++C+ +GYF+ L IE L NV L
Sbjct: 457 CVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLL 516
Query: 1029 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ P E IW++K GF++++ +E+ +PL++F+ ++QKP+
Sbjct: 517 MAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 564
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 50/107 (46%), Positives = 75/107 (70%), Gaps = 1/107 (0%)
Query: 600 TSSHSKMKSSSVKSHGKITRK-DLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGF 658
TS+ + SSSV S+ + ++K D +H L+F+E GL D + Y ++ L GYK+G
Sbjct: 149 TSAQRPLPSSSVGSNSRESKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGA 208
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 705
GI+C CCN EVSPS+FE HAG RR+P+Q+IYTS G++LH+++++L
Sbjct: 209 GIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYTSQGLTLHDVALQL 255
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 10/82 (12%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ + + Y K+K G + L+G +G+GI C C C NQ V+P+ FE HAG
Sbjct: 179 FKEAGLPDNTLLTY----KLKN-GEALLQGYKQGAGIVCNC--C--NQEVSPSEFEKHAG 229
Query: 470 SSNKRPP-EYIYLENGKTLRDI 490
+R P + IY G TL D+
Sbjct: 230 MGKRRQPYQNIYTSQGLTLHDV 251
>gi|414865117|tpg|DAA43674.1| TPA: hypothetical protein ZEAMMB73_902866 [Zea mays]
Length = 534
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 143/288 (49%), Gaps = 34/288 (11%)
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCL------KDHGMEDL 849
R+VQT C C + ++ C+QCER HV C K +E L
Sbjct: 273 RVVQTES-----CYFCGYGHTELGKIDPNMIVFCNQCERPCHVKCYNSRVVKKKVPLEIL 327
Query: 850 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 909
++ +LCC +C+ + + L+ + +K EE + ++ WR
Sbjct: 328 KDYLCFHFLCCQECQSLRVRLEGM-----------------EKCEEIAFLGRIRSNICWR 370
Query: 910 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYHGMY 968
+L DAS + LS+ + IF D F +ES I MVYG++ G+ D+ GMY
Sbjct: 371 LL--SSADASRDVKLYLSQVIDIFKDAF---LESTDAHSDISDMVYGKNREGEKDFRGMY 425
Query: 969 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1028
C +LT + VVSA I ++ + AEL L+AT ++C+ +GYF+ L IE L NV L
Sbjct: 426 CVVLTASTHVVSAAILKVRVEHAAELVLIATRSECRKKGYFRLLLESIETNLRACNVSLL 485
Query: 1029 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
+ P E IW++K GF++++ +E+ +PL++F+ ++QKP+
Sbjct: 486 MAPVDPEMAQIWSDKLGFTILSADEKKSMLESHPLVMFKNLVLVQKPL 533
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 50/107 (46%), Positives = 75/107 (70%), Gaps = 1/107 (0%)
Query: 600 TSSHSKMKSSSVKSHGKITRK-DLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGF 658
TS+ + SSSV S+ + ++K D +H L+F+E GL D + Y ++ L GYK+G
Sbjct: 118 TSAQRPLPSSSVGSNSRESKKRDTALHPLIFKEAGLPDNTLLTYKLKNGEALLQGYKQGA 177
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKL 705
GI+C CCN EVSPS+FE HAG RR+P+Q+IYTS G++LH+++++L
Sbjct: 178 GIVCNCCNQEVSPSEFEKHAGMGKRRQPYQNIYTSQGLTLHDVALQL 224
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 10/82 (12%)
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
F E+G+ + + Y K+K G + L+G +G+GI C C C NQ V+P+ FE HAG
Sbjct: 148 FKEAGLPDNTLLTY----KLKN-GEALLQGYKQGAGIVCNC--C--NQEVSPSEFEKHAG 198
Query: 470 SSNKRPP-EYIYLENGKTLRDI 490
+R P + IY G TL D+
Sbjct: 199 MGKRRQPYQNIYTSQGLTLHDV 220
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 93/375 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
ENDD CG C DGG+LLCCD+CP +H C+S +P G+W+C +N
Sbjct: 3 ENDDTCGFCGDGGELLCCDNCPSTYHQTCLSDQELPEGSWYC--------------HNCT 48
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
R+ G +P ++ K ++ C QC
Sbjct: 49 CRSCG-----NPLSE----------------------------KEVSTFSAILKCLQCGD 75
Query: 835 EYHVGCLKDHGMEDLQELPKGK-----WLCCADCKRINLALQKLVDRGEEKLPETSLDVI 889
YH C+ D + LP G W C CK I + L
Sbjct: 76 SYHDTCI------DQEMLPCGDKQSNIWFCGRYCKEIFIGL------------------- 110
Query: 890 KKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKAVSIFHDRFDP 939
H G +N +D ++ W +L+ G+K+ +S + L+ A++I + F
Sbjct: 111 ---HNHVGIENFLDNELSWSILKCNTDGQKLHSSKKIAHMTECNTKLAVALTILEECFVR 167
Query: 940 IIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
+++ + +D+IP ++ + DY G Y IL ++ R+ G + AELP +AT
Sbjct: 168 MVDPRTGVDMIPHVLSNFARL--DYQGFYTVILEKGDEILCVASIRVHGTKAAELPFIAT 225
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
S D + QG + L IE +L +V+TLVL + E W + FGF + + E+ + RN
Sbjct: 226 SVDYRRQGMCRRLMDTIEMMLRSFHVETLVLSAIPELVNTWVSGFGFKPIEDNEKKQLRN 285
Query: 1060 DYPLMIFQGTSMLQK 1074
LM+F GTS+L K
Sbjct: 286 -VNLMLFPGTSLLTK 299
>gi|356502805|ref|XP_003520206.1| PREDICTED: uncharacterized protein LOC100784172 [Glycine max]
Length = 1180
Score = 139 bits (351), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 122/465 (26%), Positives = 201/465 (43%), Gaps = 101/465 (21%)
Query: 651 LVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP 710
L G G GI C CC+ + S FE HAG + P ++IY G SL + + ++
Sbjct: 488 LDGEITGNGIHCGCCDKIFTISDFELHAG-SKLADPLKNIYVGEGTSLLQCLLDSWNKQD 546
Query: 711 FSSKE------------NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRY 758
S ++ NDD CG+C DGGDL+CCD
Sbjct: 547 ESERKGFHFVDVAGEDPNDDTCGVCGDGGDLICCDG------------------------ 582
Query: 759 CMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 818
C +TF + C+ I + P + C+ C + FC
Sbjct: 583 CPSTFHQG---------------------------CLDIKKFPSGDWH-CIYCCCK-FCG 613
Query: 819 SRFGRRT------------VILCDQCEREYHVGCLK--DHGMEDLQELPKGKWLCCADCK 864
S G ++ C CE +YH C++ D +D +++ + C C+
Sbjct: 614 SVSGSSNQRDDNDELIVSKLLTCQLCEEKYHRSCIEANDANTDDSRDV----FFCGNRCQ 669
Query: 865 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 924
++ L+ L+ E S I++ + V FD ++ + V+ + +
Sbjct: 670 ELSERLEMLLGVKHEMEDGYSWTFIRR--------SDVGFDAS--QIKPQMVECN----S 715
Query: 925 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAG 982
L+ AVSI + F P I+ S ++LI +++Y R +Y G AIL ++SA
Sbjct: 716 KLAVAVSIMDECFMPYIDHRSGINLIHSILYNRGSNFNRLNYSGFVTAILERGDEIISAA 775
Query: 983 IFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN 1042
RI G +LAE+P + T + QG + L +E LG LNV+ LV+P+ SE + WT+
Sbjct: 776 SIRIRGNQLAEMPFIGTRYMYRRQGMCRRLLNAVEWGLGSLNVELLVIPAISELRETWTS 835
Query: 1043 KFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSVD 1087
FGF + E + ++ L++F MLQK + K ++ G++++
Sbjct: 836 VFGFESL-ESTSKQILHNKNLLVFPHVDMLQKKISKHKLAGQNLN 879
>gi|357484183|ref|XP_003612379.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
gi|355513714|gb|AES95337.1| Chromodomain-helicase-DNA-binding protein [Medicago truncatula]
Length = 428
Score = 139 bits (350), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 119/455 (26%), Positives = 191/455 (41%), Gaps = 109/455 (23%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----------------HEL- 701
I+C CC+ + + FE+HAG +R +P I +G SL H +
Sbjct: 37 AIVCDCCHVTFTITGFESHAG-CTRHRPSTSILLEDGRSLLDCQREALSSSDHKGNHSVV 95
Query: 702 --------SI-------KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
SI + + S ND++C IC GGDL CD CP AFH+ C+ L
Sbjct: 96 NENQKKNHSIVKENRKKNHCVVKEKSEANNDNVCSICGFGGDLALCDRCPSAFHLGCLGL 155
Query: 747 PGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELG 806
+P G W C C C +I P
Sbjct: 156 NRVPIGEWFCPTC----------------------------------CCKICYRP----- 176
Query: 807 GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
+ + CK +++C QCE++YH GC+K G+E + W C C +
Sbjct: 177 -----KCKQECKDH-KDNNILVCVQCEQKYHFGCVKAVGIE-FNHMEN--WFCSVVCGNM 227
Query: 867 NLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRG-KKVDASDGTR-- 923
L L+KL+ G V ++ W +++ VD + +
Sbjct: 228 FLCLKKLL----------------------GKPIKVADNLTWTLVKNVSSVDDKEFNQKE 265
Query: 924 ALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS--HRGQDYHGMYCAILTVNQVVVSA 981
+ L+ A+ + ++ F+P ++ S +LI +V+ R H ++ G Y IL V+S
Sbjct: 266 SKLNMALGVLYEGFNPTFDALSGRELIKDVVFSRESEHNRLNFCGFYNVILEKMGEVISV 325
Query: 982 GIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWT 1041
RI+GQ++AE+ VAT + QG L IEK L L V+ L+L S+ +A WT
Sbjct: 326 ATVRIYGQKVAEVVFVATKEQYRRQGMCHLLMDEIEKQLTRLGVEKLLLHSSEDAMNTWT 385
Query: 1042 NKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
FGF+ M +++ ++ D+ + FQ ++M K +
Sbjct: 386 RSFGFARMASKDKCQFI-DHTFLEFQNSTMCLKAL 419
>gi|147773656|emb|CAN63176.1| hypothetical protein VITISV_029947 [Vitis vinifera]
Length = 626
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/495 (24%), Positives = 206/495 (41%), Gaps = 96/495 (19%)
Query: 603 HSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILC 662
H +SSS ++K + L+ + +E+ A++ Y + L G G GI C
Sbjct: 136 HYSTRSSSKVDSESPSQKRTILSWLI-DSRVVENNAKIVYKNEAGEQILQGVLTGDGIWC 194
Query: 663 TCCNSEVSPSQFEAHAGWASRRKPFQHIYTS-NGVSL-------------HELS-IKLSL 707
+CCN+ ++ S+F+ HAG R P+Q I+ S G+SL EL L
Sbjct: 195 SCCNTVITVSEFQLHAGDEPNR-PYQRIFISETGLSLLTCQAEAWNQQGIPELQGYHLIE 253
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
R S + DD C +C DGG+L+CCD CP +HI C+ + P G W C C
Sbjct: 254 PREDVSDKYDDACVVCADGGNLICCDKCPSTYHISCLQMEDEPQGEWRCPAC-------- 305
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
C C F S V
Sbjct: 306 ---------------------------------------ACKFCHTHAFDIS------VF 320
Query: 828 LCDQCEREYHVGCLKDH-GM---EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
C QC+++YH C +++ GM ++ C + C +I L++LV E
Sbjct: 321 TCSQCDKKYHWECFRENEGMLIDLNMDGPSTSTPFCSSICSQIYEKLERLVGVRNELDEG 380
Query: 884 TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIES 943
+ ++++ E+G +D R L K+ + AV++ + F+P+I+
Sbjct: 381 LTWTLLRRMDPEAGVYLEESYD---RTLCNSKI----------AVAVAVMEECFEPVIDR 427
Query: 944 ASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSN 1001
++++++ +++Y G + + G Y AIL +S RI G +LAE+P +AT
Sbjct: 428 HTQINVVRSVIYNCGANFPRISFEGFYTAILEKGDETISVASMRIHGNKLAEMPFIAT-- 485
Query: 1002 DCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDY 1061
+ ++ L C + L+ +V+ LV+PS + W +GF + + +
Sbjct: 486 ----RPSYRRLGMCHKLLVAIESVQYLVIPSIEQRVRRWEESYGFQAIENKVMGELIKVK 541
Query: 1062 PLMIFQGTSMLQKPV 1076
LM F LQKP+
Sbjct: 542 SLM-FHCAIRLQKPL 555
>gi|414878580|tpg|DAA55711.1| TPA: hypothetical protein ZEAMMB73_837050, partial [Zea mays]
Length = 817
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 157/385 (40%), Gaps = 97/385 (25%)
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKF 768
RP + +DD CG+C DGG+LLCCDSCP FH C+++ +P G W C YC
Sbjct: 445 RPGAKDSSDDACGVCGDGGELLCCDSCPSTFHPACLAM-KVPQGWWACHYCR-------- 495
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
CVLC D + +
Sbjct: 496 ---------------------------------------CVLCMAND-------DQGLST 509
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWL---CCADCKRINLALQKLVDRGEEKLPETS 885
C C +YH C + L G+ + C CK+++ L +V
Sbjct: 510 CQHCSLKYHEVCRRP-------SLSNGRGIGAYCSETCKKVSARLSDMV----------- 551
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA--SDGTRAL------LSKAVSIFHDRF 937
G N + W +L+ +K +A S T A+ L+ A+ + ++ F
Sbjct: 552 -----------GVTNHTEDGFSWALLKIQKDEAVSSQDTAAVLECNVKLAVALGVLNECF 600
Query: 938 DPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELP 995
+P + +K+D++ VY G + Y G Y +L + ++A + RI G ++AE+P
Sbjct: 601 NPAKDRRTKIDMLHQAVYSLGSEFKRVSYEGFYTMVLDKDGETIAAALLRIHGTKVAEMP 660
Query: 996 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1055
AT + QG + L +E++L + V LV+P+ + WT F F + + E
Sbjct: 661 FAATLPAYRKQGMMRRLVNAVEQVLASVQVDKLVIPAIAALVDTWTRSFSFRPLLDPESR 720
Query: 1056 KYRNDYPLMIFQGTSMLQKPVPKCR 1080
+ L++ GT++L KPV R
Sbjct: 721 EEIRRRSLVVIAGTTLLHKPVAAAR 745
>gi|224137900|ref|XP_002326468.1| predicted protein [Populus trichocarpa]
gi|222833790|gb|EEE72267.1| predicted protein [Populus trichocarpa]
Length = 554
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 108/453 (23%), Positives = 181/453 (39%), Gaps = 107/453 (23%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTS---------------N 694
+ G + G+ C CCN ++ FE HAG + ++P++HI+ + +
Sbjct: 129 LMKGRARREGVRCECCNEVMTVWDFETHAG-SVLQRPYEHIHVARSNSSLLQCQFEVWQS 187
Query: 695 GVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTW 754
V + + + R +S ++DD C IC DGGDL+CC+ C H+ C+ L IP G W
Sbjct: 188 NVEVERRTFNEIVPRNGASDKHDDACLICADGGDLICCEKCWSTSHLKCMGLERIPQGDW 247
Query: 755 HCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGR 814
C YC+ + D +L CV
Sbjct: 248 ICPYCV----------------------------------CKHCNKNDKDLQTCV----- 268
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKL 873
QC+++YH CL + DL G+ L C + C + LQ L
Sbjct: 269 ----------------QCDKKYHCQCLVSNKELDLN--ASGETLACDSHCGEVYEKLQSL 310
Query: 874 VDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDG--------TRAL 925
V KHE G W +L+ + D D +
Sbjct: 311 V---------------GVKHELEGG-------FCWTLLQRMEPDNLDFKDLHLITECNSK 348
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGI 983
++ A + + F II+ ++++++ ++ Y R ++ G Y AIL N ++SA
Sbjct: 349 IALAWEVLDECFTTIIDRHTQINVVQSVAYSRGSNLNRINFRGFYTAILEKNDDIISAAT 408
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
R+ G +LAE+P + T + + G + L +E + + V+ L++PS E +W K
Sbjct: 409 IRVHGTDLAEMPFIGTRHLYRQNGMSRMLLVTLESIFSVMGVEHLIIPSVQELTEMWEGK 468
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
GFS + + K N + + F LQK +
Sbjct: 469 CGFSPIEDAVSQKITN-WNTLTFPSAVRLQKAL 500
>gi|168038096|ref|XP_001771538.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677265|gb|EDQ63738.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 660
Score = 132 bits (333), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 108/336 (32%), Positives = 157/336 (46%), Gaps = 60/336 (17%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVM-YIRGSKVKGPGVSGLRGVVKGSG 445
P+ + PR K + +G+LEG V RG + L G+ + G
Sbjct: 303 PIGNKLTKPPRNA--------KELMATGLLEGHYVHCSCRGEQ--------LTGIFQDMG 346
Query: 446 ISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV------CKDSPL 499
+ C C CKG QVV+ + FE H+GS++ P + IYLENGK LRDI++ C D+ L
Sbjct: 347 VVCNCRICKGTQVVSISAFEAHSGSTSHHPSDNIYLENGKNLRDILSAGQESADCGDNIL 406
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAE---I 556
L+ A+ + G + K C+ C + G E + SC K S A AE +
Sbjct: 407 RALQHAIGEIQG---ISKEMTCVKC---GKHEGGEFI-----SCKGAKCSAAYHAECVGV 455
Query: 557 KEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGK 616
K P LE+ E T R P+P S + +S GK
Sbjct: 456 KSP--------------HLEDWFCAKCEKTQA--RKPQPLLKVKRSPAGTDKEDARSKGK 499
Query: 617 ----ITR--KDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVS 670
I R +D +HK +F GGL DG E+GY+ R + L G K+G GI C+CCN E++
Sbjct: 500 EQTMIARSARDAHLHKALFLPGGLADGTELGYYARNQC-ILKGVKQGGGICCSCCNQEIT 558
Query: 671 PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS 706
S FE HA +R+ P+ I ++G SL ++ +L+
Sbjct: 559 CSAFERHARCEARQNPYGSILLADGRSLKDMCKELA 594
Score = 44.7 bits (104), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 82/186 (44%), Gaps = 31/186 (16%)
Query: 601 SSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGI 660
SS SK+K+ V K+T+ +L+ GL +G V RGE L G + G+
Sbjct: 293 SSASKLKAM-VPIGNKLTKPPRNAKELMAT--GLLEGHYVHCSCRGEQ--LTGIFQDMGV 347
Query: 661 LCTC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL------------SIK 704
+C C C VS S FEAH+G ++ P +IY NG +L ++ +I
Sbjct: 348 VCNCRICKGTQVVSISAFEAHSG-STSHHPSDNIYLENGKNLRDILSAGQESADCGDNIL 406
Query: 705 LSLERPFS-----SKENDDL-CGICMDGGDLLCCDS--CPRAFHIDCVSLPGIPSGTWHC 756
+L+ SKE + CG +GG+ + C C A+H +CV + W C
Sbjct: 407 RALQHAIGEIQGISKEMTCVKCGK-HEGGEFISCKGAKCSAAYHAECVGVKSPHLEDWFC 465
Query: 757 RYCMNT 762
C T
Sbjct: 466 AKCEKT 471
>gi|242091642|ref|XP_002436311.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
gi|241914534|gb|EER87678.1| hypothetical protein SORBIDRAFT_10g000260 [Sorghum bicolor]
Length = 704
Score = 132 bits (333), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 145/565 (25%), Positives = 229/565 (40%), Gaps = 76/565 (13%)
Query: 20 GLKREFEFALKVQSEICGSLGRTRA-RKVQSNVDSGCVLGPPEVKKLKTYESRKKRKRQE 78
+KRE FAL+ SEI S GRTR+ R + S D +K+ K +
Sbjct: 26 AMKRELAFALQSLSEISASPGRTRSGRPISSLPDPSASAS---LKRRKRSDPPAPAPAPA 82
Query: 79 QSVVVKETEDKREEEVKSDVFDVINE---RERPIREKESKDDSENMGVGERGALMNVEEV 135
+V T E + D+I P D + + V + L +
Sbjct: 83 ADLVSPPTPPIDAEPPNQPLRDIIQPLHGSNPPTVSDHRSDSNADQEVVPQSMLEAPQPF 142
Query: 136 KVVSERREEGNDEFGKVVIGVEEEKKNEC---DEVLMNVEENKYGELDGMGGSARTEEEK 192
+ + + K + E+ E D + + + DG S
Sbjct: 143 HAETATEDSAMEVIAKPNLSAAEDSAAETATEDSAMEVIAKPNVAAEDGASKSN------ 196
Query: 193 NECGEPVVGVEEERRNECNQVLTNVEENEHSEVDREKAENDLIGEVKNEFEEVVAVVEEE 252
V E+ + T + + S A D G E ++A +
Sbjct: 197 -------VATEDTAAVAPTLLQTTIAD---SASKPNVATEDTAGVPPTLLETILAAGDHH 246
Query: 253 KKDESDRVAMDVEEVKCEEVGLGKEYEPGRVQMEMDEEKKNDIERELVENGVLESSMVGK 312
D + +K + V + G VQ + ++++ V+N +L ++M+ +
Sbjct: 247 ----CDNTNSNGASLKQQAVTIAT----GNVQCDNSNSNGARLQQQAVDNVLLSNTMLAE 298
Query: 313 HS-----------------STLCNGESNVAKSVAV------DG-NDEGKTVNVVVERPLR 348
+ S L N + + +S A DG ND +++E+P R
Sbjct: 299 DAATPAAAADLKPARRFTRSLLKNNKPDKEESAACESQATPDGSNDASFDFPLLLEKPQR 358
Query: 349 RFTRSLLQQKVELAK-GSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKV--MRKFYS 405
RFTRSLL+ KVE + GS D + A+D PVK+ + +KV + K
Sbjct: 359 RFTRSLLKTKVESSLVGS--------DDALDSASDS--PPVKKMEMKMSKKVACLTKHPG 408
Query: 406 KLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFE 465
++ L +G+LEGM VMYI K + L+GV+ G I CFC C G++ V+ FE
Sbjct: 409 NIRELLNTGLLEGMPVMYIIPHSKK----AVLKGVITGCNIRCFCLSCNGSKAVSAYYFE 464
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCR 525
HAGS+ K P +YIYL NG +LRD++ SPLE LEK +R + + + CLNC
Sbjct: 465 QHAGSTKKHPADYIYLGNGNSLRDVLRASDGSPLEALEKTIRSSIDPVIKRSSVNCLNCN 524
Query: 526 VSFSNAGVEELMLLCKSCVELKESQ 550
+ E +LC+ C+E K+ Q
Sbjct: 525 EPVLPSSQSE-NVLCQVCLESKQPQ 548
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 44/70 (62%), Gaps = 2/70 (2%)
Query: 743 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 802
CV L +PS W+C C N QKEK + N NA+AAGR GVD Q++ R IRIV D
Sbjct: 637 CVGLRKVPS-EWYCDNCHNLVQKEKALAKNKNAKAAGRQAGVDSIEQIMKRAIRIVPISD 695
Query: 803 TELGGCVLCR 812
+LGGC LC+
Sbjct: 696 -DLGGCALCK 704
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 132 bits (333), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/453 (25%), Positives = 174/453 (38%), Gaps = 117/453 (25%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLS-----------L 707
GI C+CC + + FE HA AS +I+ +G SL E ++ L
Sbjct: 626 GIKCSCCRRVFTINGFEVHANGASCSGA-ANIFLDDGRSLLECQVEAYKKRKKAQPPDML 684
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH-----CRYCMNT 762
+ END C +C GG L+ CD CP AFH +C+ L +P G W C C
Sbjct: 685 KMKLRQGENDVFCSVCHYGGKLILCDGCPSAFHANCLGLEDVPDGDWFCQSCCCGACGQF 744
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
F K + NA+
Sbjct: 745 FLKTT----STNAK---------------------------------------------- 754
Query: 823 RRTVILCDQCEREYHVGCLK-DHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKL 881
I C QCE +YH CL+ D + L ++ KW C DC
Sbjct: 755 EEKFISCKQCELKYHPSCLRYDGACDSLDKILGEKWFCSKDC------------------ 796
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
EES N D K++A+ +LS A+ + H+ F+P+
Sbjct: 797 ------------EESLEPNMYGDD-------ASKIEAAAENHCILSVALDVMHELFEPVK 837
Query: 942 ESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
DL +++ R + + ++ G Y +L N +VS RI G+++AE+P + T
Sbjct: 838 RPHGGRDLAEDVIFSRWSKFKRLNFSGFYTVLLERNNELVSVATVRILGKKVAEMPFIGT 897
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ +G + L +EK+L L V+ LVLP+ W N FGF+ MT E+ +
Sbjct: 898 RFQHRQRGMCRVLINELEKVLIDLGVERLVLPAVPCVLNTWINSFGFTKMTISERKNFLK 957
Query: 1060 DYPLMIFQGTSMLQK---------PVPKCRIVG 1083
+ L+ F T + +K P+P +G
Sbjct: 958 -FTLLEFGRTILCEKILIKSGVADPIPSIASLG 989
>gi|242055711|ref|XP_002457001.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
gi|241928976|gb|EES02121.1| hypothetical protein SORBIDRAFT_03g046970 [Sorghum bicolor]
Length = 904
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 150/364 (41%), Gaps = 79/364 (21%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
+DD CG+C DGG+LLCCDSCP FH C+++ +P G W C YC
Sbjct: 511 SDDACGVCADGGELLCCDSCPSTFHPACLAM-KVPEGLWACHYCR--------------- 554
Query: 776 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 835
CVLC D + + C C +
Sbjct: 555 --------------------------------CVLCMAND-------DQGLSRCQHCTLK 575
Query: 836 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK-KKHE 894
YH C L C CK+++ L ++ S ++K +K E
Sbjct: 576 YHEIC-----RPSLSNGRGNGAYCSETCKKVSAQLSDMIGITNHTEDGFSWALLKIQKDE 630
Query: 895 ESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMV 954
S N+ D VL L+ A+ + ++ F+P+ + +K+D++ V
Sbjct: 631 PVSSQNSPD------VLE---------CNVKLAVALGVLNECFNPVKDRRTKIDMLHQAV 675
Query: 955 Y--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSL 1012
Y G + Y G Y +L N ++SA + RI G ++AE+P T + QG + L
Sbjct: 676 YSLGSEFKRVSYEGFYTMVLEKNGEIISAALLRIHGTKVAEMPFAGTLPAYRKQGMMRRL 735
Query: 1013 FCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSML 1072
+E++L + V+ LV+P+ + W F F + E + + R L++ GT++L
Sbjct: 736 VNAVEQVLASVQVEKLVIPAIAALVDTWKKSFSFKALDPELKEEIRRRS-LVVITGTTLL 794
Query: 1073 QKPV 1076
QKPV
Sbjct: 795 QKPV 798
>gi|238014598|gb|ACR38334.1| unknown [Zea mays]
Length = 338
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 145/288 (50%), Gaps = 27/288 (9%)
Query: 796 RIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDL 849
R +Q E C C T+I C+QCER H+ C + +E L
Sbjct: 70 RTLQERVVETESCYFCGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEIL 129
Query: 850 QELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWR 909
+E +LCC +C+ +L+ ++ G EK + +++ + ++ WR
Sbjct: 130 KEYMCFHFLCCQECQ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWR 173
Query: 910 VLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMY 968
+L G +DAS + + + + IF D F + S D+I MV G++ + +D+ GMY
Sbjct: 174 LLSG--MDASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMY 229
Query: 969 CAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTL 1028
CA+LT + VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV L
Sbjct: 230 CALLTASTHVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLL 289
Query: 1029 VLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
P E IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 290 TAPVDPEMAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSL 337
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 129 bits (325), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 164/366 (44%), Gaps = 64/366 (17%)
Query: 711 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVE 770
+ +NDD CG+C DGG+L+CCD+CP ++H+ C+ IP G+W+C C
Sbjct: 995 LQADQNDDTCGLCGDGGELICCDNCPASYHVACLPSQEIPDGSWYCSSCR---------- 1044
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCD 830
D ++VS + P T L C
Sbjct: 1045 -------------CDVCGEVVSS-----KEPRTPLHA-------------------FECS 1067
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 890
QCER+YH+ C+ + + + P G W C C++I +L+ V +P+ D
Sbjct: 1068 QCERQYHIKCISGKVLCNEESGP-GTWFCGRRCQQIYTSLRSRV-----GIPDHLDDGFS 1121
Query: 891 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 950
+ D V R+ + A + ++ A+SI + F PI + + +D++
Sbjct: 1122 CTILHNNGDQKV------RMAADIALLAECNMKLII--ALSILEECFLPIFDPRTGMDIM 1173
Query: 951 PAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1008
P ++Y RS+ DY G Y +L + ++S R+ G +AE+PL+AT + + QG
Sbjct: 1174 PLILYNWRSNFVHLDYKGFYTIVLEKDDSIISVASIRLHGAVVAEMPLIATCTENRQQGM 1233
Query: 1009 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQG 1068
+ + IE++L L V+ L+L + WT+ FGF + + ++ K + L G
Sbjct: 1234 CRRIVDYIEQMLKSLKVEMLLLSAIPSLVDTWTSAFGFRPIEDCDKKKL-SKIRLASVPG 1292
Query: 1069 TSMLQK 1074
T +L+K
Sbjct: 1293 TVLLKK 1298
>gi|224104617|ref|XP_002313502.1| predicted protein [Populus trichocarpa]
gi|222849910|gb|EEE87457.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 129 bits (325), Expect = 6e-27, Method: Composition-based stats.
Identities = 52/97 (53%), Positives = 72/97 (74%)
Query: 660 ILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDL 719
I+C+CC E+SPSQFE+HAG ++RR+P++HIYTSNG+SLH+++I L+ + ++ DD+
Sbjct: 4 IVCSCCEVEISPSQFESHAGMSARRQPYRHIYTSNGLSLHDIAISLANGQNITTGIGDDM 63
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 756
C DGGDL+ C SCPRAFH C+ L P G WHC
Sbjct: 64 CAEGGDGGDLMFCQSCPRAFHAACLDLQDTPEGAWHC 100
>gi|8885619|dbj|BAA97549.1| unnamed protein product [Arabidopsis thaliana]
Length = 1030
Score = 129 bits (325), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/460 (25%), Positives = 185/460 (40%), Gaps = 99/460 (21%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 709
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 428 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 486
Query: 710 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTF-QK 765
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W+C C F +K
Sbjct: 487 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAWYCYNCSCKFCEK 546
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 825
++ ++ + + L C L
Sbjct: 547 DEAAKHETSTLPS--------------------------LSSCRL--------------- 565
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE-T 884
CE +YH C+ G + C C+ + LQ + + LPE
Sbjct: 566 ------CEEKYHQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGF 616
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESA 944
S +++ S D D+ ++ A ++ A S+ + F P+++
Sbjct: 617 SWSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHR 663
Query: 945 SKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSND 1002
S ++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 664 SGVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYM 723
Query: 1003 CQGQG--------------YFQSLFCCIEKLLGFLNVKTLVLPSA--SEAQAIWTNKFGF 1046
+ QG YF +F I ++L L+V A + + + FGF
Sbjct: 724 YRRQGMCRRLMDGIESFVAYFSQMFLAISEVL--LDVWQFCCYPACFGDGPFCFFSGFGF 781
Query: 1047 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSV 1086
+ + + E+ +N L++F G ML K + K +I V
Sbjct: 782 APVNDSEKKTIKN-LNLLVFPGVDMLGKSLVKEKITDSVV 820
>gi|259490304|ref|NP_001159184.1| uncharacterized protein LOC100304269 [Zea mays]
gi|223942513|gb|ACN25340.1| unknown [Zea mays]
Length = 342
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 158/374 (42%), Gaps = 82/374 (21%)
Query: 711 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVE 770
+ +NDD CG C DGG+L+CCD+CP ++H C+S IP G W YC
Sbjct: 3 LQADQNDDTCGSCGDGGELICCDNCPASYHQACLSCQDIPDGNW---YC----------- 48
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCD 830
S C+ C +C K + C
Sbjct: 49 ---------------------SSCL------------CDICGEVIDSKELVTSLPALDCS 75
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 890
QCER+YHV C+ E G W C C I + + V
Sbjct: 76 QCERQYHVKCVS--AKVPCNEDGSGTWFCGRKCHEIYMTFRSRV---------------- 117
Query: 891 KKHEESGSDNAVDFDVRWRVLRG---KKVDASDGTRAL------LSKAVSIFHDRFDPII 941
G + +D D+ + VLR KKV ++ + L A SI + F PI+
Sbjct: 118 ------GVPDHMDDDLCFTVLRNNGDKKVRTAEEIALMAECNMKLMIATSIMEECFLPIL 171
Query: 942 ESASKLDLIPAMVYG-RSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATS 1000
+ + +D+IP+++Y RS +Y G Y +L + +VS R+ G LAE+PLVATS
Sbjct: 172 DPRTGIDIIPSILYNWRSDLHFNYKGFYTVVLESDDSMVSVASIRLHGAILAEMPLVATS 231
Query: 1001 NDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRND 1060
+ + QG + L IE++L L V+ L+L + WT+ FGF + E ++ + +
Sbjct: 232 KENRQQGMCRRLMDYIEEMLKSLKVEMLLLSAIPHLAETWTSTFGFREIDESDKKRL-SK 290
Query: 1061 YPLMIFQGTSMLQK 1074
L GT +L+K
Sbjct: 291 VRLAAVPGTVLLKK 304
>gi|413944529|gb|AFW77178.1| hypothetical protein ZEAMMB73_842631 [Zea mays]
Length = 947
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 125/543 (23%), Positives = 208/543 (38%), Gaps = 160/543 (29%)
Query: 607 KSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEV-----KFLVGYKKGFGIL 661
+ S + T+K + L+ + G + DG V Y G+ K + G G+
Sbjct: 355 RRSGTDAVAPATKKHTILTWLI-DGGFVSDGETVLYVPGGDGGAGAEKVVSGAVTRAGVH 413
Query: 662 CTCCNSEVSPSQFEAHAGWASR------RKPFQHIYTSNGVSL----------------- 698
C+CC+ V FEAHAG R R+P++ + +G SL
Sbjct: 414 CSCCDGVVPLPVFEAHAGARRRDPGPGQRQPWEKLLLVSGNSLLRCMQEAWEMEKVRTFH 473
Query: 699 -------------------------HELSIKLSLERPFSSK----------ENDDLCGIC 723
L + +ER S + +DD CG+C
Sbjct: 474 AQAKVRAALEQEEDKCSQAKRRLLAKHLKKGVVVERIMSPRMEKIKAGEKDSSDDACGVC 533
Query: 724 MDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 783
DGG+LLCCDSC FH +C+++ +P G+W C YC
Sbjct: 534 ADGGELLCCDSCTSTFHPECLAI-KVPEGSWSCHYCR----------------------- 569
Query: 784 VDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC--L 841
CVLC D + + C QC R+YH C L
Sbjct: 570 ------------------------CVLCMSNDDLQG------LSTCQQCARKYHESCRPL 599
Query: 842 KDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNA 901
+G D+ C CK++ L ++ +G N
Sbjct: 600 PGNGC-DIGTY------CGETCKKLFSQLAQV----------------------TGVTNP 630
Query: 902 VDFDVRWRVLRGKKVD--ASDGTRALLSK------AVSIFHDRFDPIIESASKLDLIPAM 953
W +LR +K + +S+ A+L + A+ +F++ F+P+ + +K+D++
Sbjct: 631 TGDGFWWALLRIQKDEPASSEEMPAVLERNVKLAVALGVFNECFNPVKDRRTKIDMLHQA 690
Query: 954 VY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQS 1011
VY G + Y G Y +L + +VSA + RI G ++AE+P T + QG +
Sbjct: 691 VYSLGSQFKRLSYEGFYTMVLEKDGEIVSAALLRIHGTQVAEMPFAGTLPAYRKQGMMRR 750
Query: 1012 LFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSM 1071
L +E++L + V+ LV+P+ W F F + + + + + L++ GT++
Sbjct: 751 LVSAVEQVLASVQVEKLVIPAIDSLVDTWKRSFFFRPVDPQLREELKR-LSLVVITGTTL 809
Query: 1072 LQK 1074
L K
Sbjct: 810 LHK 812
>gi|18421557|ref|NP_568537.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
gi|332006713|gb|AED94096.1| RING/FYVE/PHD zinc finger-containing protein [Arabidopsis thaliana]
Length = 1193
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 184/459 (40%), Gaps = 87/459 (18%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL----HELSIKLSLER----- 709
GI C CC+ S FE HAG +R +PF+ +Y G SL HE K S +
Sbjct: 581 GIRCNCCDEVFSVLDFEVHAG-GNRNQPFKSLYLEGGNSLLQCLHESMNKQSESQLKGYH 639
Query: 710 --PFSSKE-NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
F S + NDD CGIC DGGDL+CCD CP FH C+ + PSG W YC
Sbjct: 640 FVDFGSGDPNDDTCGICGDGGDLICCDGCPSTFHQSCLDIKKFPSGAW---YC------- 689
Query: 767 KFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTV 826
YN + + + E + L C LC + C +
Sbjct: 690 ----YNCSCKFCEKDEAAKHETSTLP-----------SLSSCRLCEEK--CSKHYPH--- 729
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET-S 885
L D H C+ G + C C+ + LQ + + LPE S
Sbjct: 730 TLAD------HQACINQDGTVPGERSTDS--FCGKYCQELFEELQLFIGV-KHPLPEGFS 780
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 945
+++ S D D+ ++ A ++ A S+ + F P+++ S
Sbjct: 781 WSFLRRFELPS---EVADCDISEKI----------AYNAKMAVAFSVMDECFSPLVDHRS 827
Query: 946 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1003
++L+ +VY G + D+ A+L +++ RI G +LAE+P + T
Sbjct: 828 GVNLLQNIVYNFGSNFHRLDFSSFLTAVLERGDEIIAVASIRIHGNQLAEMPFIGTRYMY 887
Query: 1004 QGQG--------------YFQSLFCCIEKLLGFLNVKTLVLPSA--SEAQAIWTNKFGFS 1047
+ QG YF +F I ++L L+V A + + + FGF+
Sbjct: 888 RRQGMCRRLMDGIESFVAYFSQMFLAISEVL--LDVWQFCCYPACFGDGPFCFFSGFGFA 945
Query: 1048 MMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKCRIVGKSV 1086
+ + E+ +N L++F G ML K + K +I V
Sbjct: 946 PVNDSEKKTIKN-LNLLVFPGVDMLGKSLVKEKITDSVV 983
>gi|224105951|ref|XP_002313990.1| predicted protein [Populus trichocarpa]
gi|222850398|gb|EEE87945.1| predicted protein [Populus trichocarpa]
Length = 978
Score = 120 bits (301), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 113/446 (25%), Positives = 176/446 (39%), Gaps = 120/446 (26%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRR-------KPFQHIYTSNGVSLHELSIKL------ 705
GI C+CC + S FE HAG S R KP I+ +G SL E I++
Sbjct: 619 GIKCSCCGKVYTLSGFELHAGIKSCRSGASKYCKPAASIFLDDGRSLLECQIQMMRDKEM 678
Query: 706 ---------SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHC 756
SL+ + ND +C +C GG+L+ CD
Sbjct: 679 SNHKAETPDSLKGSWDRDGNDHVCSVCHYGGELILCD----------------------- 715
Query: 757 RYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF 816
+C ++F K RC+ + PD + C C +
Sbjct: 716 -HCPSSFHK---------------------------RCLGMKDVPDGDWF-CPSCCCKIC 746
Query: 817 CKSRFGRRT------VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLAL 870
+++ + T V+ C QCE +YH+ CL + + ++ PK C C+ +
Sbjct: 747 GQNKLKKDTKDFIDGVLNCTQCEHQYHIMCLSNSWTDKWKDHPKENSFCSKKCEVYMQSD 806
Query: 871 QKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAV 930
Q +D ++ ET ++ K L A+
Sbjct: 807 QHKLDAFDD---ETLVETYSK----------------------------------LKIAL 829
Query: 931 SIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFG 988
+ H+ F+PI E + DL+ +++ G ++ G Y +L N +VS RI G
Sbjct: 830 DVVHECFEPIEEPRTGRDLMKDVIFSNGSELNRLNFQGFYTILLEKNDELVSVATVRIHG 889
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
++AE+PLV T + G + L +EK L L V+ LVLP+ WT FGFS
Sbjct: 890 DKVAEIPLVGTRFQFRQLGMCRILMDVLEKKLMELGVQRLVLPAVPGVLNTWTGSFGFSK 949
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQK 1074
MT+ E+ ++ DY + FQ T M QK
Sbjct: 950 MTDSERLQFV-DYTFLDFQDTVMCQK 974
>gi|302790536|ref|XP_002977035.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
gi|300155011|gb|EFJ21644.1| hypothetical protein SELMODRAFT_416997 [Selaginella moellendorffii]
Length = 592
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 155/369 (42%), Gaps = 95/369 (25%)
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 771
++ +D CGIC +GG+L+CC++ C TF E
Sbjct: 264 AAANSDQCCGICNEGGELVCCET------------------------CPLTFHME----- 294
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTP-DTELGGCVLCRGRDFCKSRFGRRTVILCD 830
C+ +++ P D LC C RT C+
Sbjct: 295 ----------------------CVSLLEVPKDAWFCFRCLC-----CHCGEPLRTQP-CE 326
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 890
QCER +H GC D L + C + C I L ++V +++ +
Sbjct: 327 QCERCFHPGCCDD------AILAGDFFFCSSGCWNIFQRLAEMV---------ATVNPLG 371
Query: 891 KKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLI 950
+ ++ W +LR + D LL++A+ + RFDP+++ ++LD +
Sbjct: 372 RS------------ELSWSLLRRGRCDDK-----LLAEALQVISSRFDPVLDCWTQLDYL 414
Query: 951 PAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGY 1008
AMV+ RSH D+ G Y A+L VV + RI G LAE+P +AT +GQG
Sbjct: 415 DAMVFSRSHHSPRLDFSGFYTAVLQRGAEVVGVAVLRIHGAWLAEMPFIATKAGMEGQGI 474
Query: 1009 FQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN-DYPLMIFQ 1067
+SLF +E++L L V+ +VL +A + + +W N F F M + + K R L+
Sbjct: 475 CRSLFTAVEEMLARLGVEMMVLLAAKDTEKMWKNSFEFHAM--DRKLKARTVALGLVALN 532
Query: 1068 GTSMLQKPV 1076
G LQK V
Sbjct: 533 GAGFLQKSV 541
>gi|449440451|ref|XP_004137998.1| PREDICTED: uncharacterized protein LOC101221048 [Cucumis sativus]
Length = 233
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/95 (60%), Positives = 72/95 (75%), Gaps = 1/95 (1%)
Query: 990 ELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMM 1049
++AELPLVATSN G+GYFQ+LF CIE+LL FL VK LVLP+A EA++IWT KFGF +
Sbjct: 136 DIAELPLVATSNGNHGKGYFQTLFSCIERLLAFLKVKCLVLPAAEEAESIWTEKFGFERI 195
Query: 1050 TEEEQNKYRNDYPLMI-FQGTSMLQKPVPKCRIVG 1083
++ + YR M+ F+GTSMLQK VP CR+VG
Sbjct: 196 KPDQLSSYRRSCCQMVTFKGTSMLQKTVPSCRVVG 230
>gi|357120109|ref|XP_003561772.1| PREDICTED: uncharacterized protein LOC100828050 [Brachypodium
distachyon]
Length = 910
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 146/290 (50%), Gaps = 37/290 (12%)
Query: 796 RIVQTPDTELGGCVLCRGRDFCKS--RFGRRTVILCDQCEREYHVGCLKDHGMED----- 848
R +Q + C LC D C + ++ C+QCER HV C ++G++
Sbjct: 622 RTLQQIADKTESCRLCG--DACTTIGTISEDMIVFCNQCERPCHVKCY-NNGLQKQKGPL 678
Query: 849 --LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDV 906
L E + + CC C+ + +L +++++ E+ I++K V
Sbjct: 679 NVLAEYMQFHFFCCQKCQLLRASLHEVLNKREK---------IRQKRSY----------V 719
Query: 907 RWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ-DYH 965
W++L G ++ + + + + IF F +AS +I MV + G+ D+
Sbjct: 720 FWQILNG--MNPGINVQKYIHQVIEIFKVAFPKT--AASDFGVIQDMVNAKDVGGEKDFR 775
Query: 966 GMYCAILTVN-QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLN 1024
GMYCA+LT + ++VVSA + ++ +E+AEL +VAT N + +GYF L IE L +N
Sbjct: 776 GMYCAVLTTSSKLVVSAAVLKVRTEEVAELVIVATCNQFRKKGYFTLLLRQIEAHLKAMN 835
Query: 1025 VKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
V+ L E ++IW+ K GF++++ EE+ +PL++F+ +++QK
Sbjct: 836 VRLLTALVDPEMESIWSKKLGFTILSGEEKETLLEAHPLVMFEDLTLMQK 885
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 153/364 (42%), Gaps = 87/364 (23%)
Query: 387 PVKQ-ETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSG 445
PV+ ET +P +++ ++ L +G+LEG SV Y + K+
Sbjct: 257 PVRMAETKYRPETLLKD----VRGLLSTGLLEGFSVTYKKNGKMNA-------------- 298
Query: 446 ISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKA 505
C+ FE HAG S+ ++I+L++G +L ++ K L L
Sbjct: 299 ----CE------------FEQHAGQSSNNQNDHIFLDSGISLYKLIQALKYKKLHLLADL 342
Query: 506 VRMVLG---------------------------------SSSMKKANFCLNCRVSFSNAG 532
+ G SS A ++ + S SN G
Sbjct: 343 IEEQTGLPPNLIEYGKWKASFEVQNDDLEDAASDHCSTQSSQDSDAGVTISMKESTSN-G 401
Query: 533 VEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPPS------VELEESPA--PSGE 584
+ L + + G+A + LS S E S + EE+P+ +G
Sbjct: 402 ISNLNWSAFRRPRWQYKRGGTATSTQTLSRSPEKGISGLSTGTSMKINTEETPSENTAGP 461
Query: 585 LTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFV 644
L EP + +S + K +D +H+LVF+EGG+ + + Y +
Sbjct: 462 LHSEVTIVQEPPRGHSVGPKSKESRTSKV------RDNSLHQLVFKEGGVPELTILTYKL 515
Query: 645 R-GEVKFLVGYKKGFGILCTCCNSEV--SPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 701
+ GEV GYK+G ILC CC+ EV +PS FE HAG RR+P+++IYT G++LHEL
Sbjct: 516 KHGEV-LKQGYKQGTCILCDCCSEEVQFTPSHFEEHAGMGKRRQPYRNIYTPEGLTLHEL 574
Query: 702 SIKL 705
++KL
Sbjct: 575 ALKL 578
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 122/317 (38%), Gaps = 67/317 (21%)
Query: 437 LRGVVKGSGISCFC----DDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMN 492
L G V G C C D +G + ++ FE HAG+ +K +I+L NG +L + +
Sbjct: 26 LEGFVAEGGYVCACTSPCDYRRGGKALSALQFEKHAGAVSKNQNGHIFLRNGTSLYALFH 85
Query: 493 VCKDSPLETLEKAVRMVLGSSSMKKANFCLNC------RVSFSNAGVEELMLLCKSCVE- 545
++ P + + RM G A R+ S+ E+ C + E
Sbjct: 86 ALREVPAQRFAEDFRMAAGVQMTVPAATGAPLAAREPDRLQASDLTAEQAP--CAAAQEA 143
Query: 546 -----LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQT 600
L E + S + + +HSS E P S PS L D + E +A
Sbjct: 144 QTLDNLTEEEKASLSLLDLRAHSSVTESDPMDGIEGSSIHPSLSLRDVAE---EIENAAR 200
Query: 601 SSHSKMKSSSVK-----SHGKIT------------RKDLRMHKLVFEEGGLEDG------ 637
HS V+ S G +T + +RM +++ +E
Sbjct: 201 CDHSMPDVGGVRDAPRDSGGLLTDTLVTVPVMEFKYQPVRMADTKYDQIMMESKYQPVRM 260
Query: 638 AEVGYF-------VRGEVK------FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR 684
AE Y VRG + F V YKK N +++ +FE HAG +S
Sbjct: 261 AETKYRPETLLKDVRGLLSTGLLEGFSVTYKK---------NGKMNACEFEQHAGQSSNN 311
Query: 685 KPFQHIYTSNGVSLHEL 701
+ HI+ +G+SL++L
Sbjct: 312 QN-DHIFLDSGISLYKL 327
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 117 bits (292), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 144/383 (37%), Gaps = 92/383 (24%)
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 771
+S +D +C C GGDL+ CD CP FH+ C+ L +P W C C
Sbjct: 138 NSSLSDTICSFCHYGGDLILCDKCPSTFHLGCLELKDVPLENWFCPSC------------ 185
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPD--TELGGCVLCRGRDFCKSRFGRRTVILC 829
C + D T C+ C
Sbjct: 186 ----------------------CCELCGKGDSSTSTNACLQC------------------ 205
Query: 830 DQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVI 889
R YHV CL G + P + C C + L +L+
Sbjct: 206 ---ARAYHVHCLTKDGCLLPTDYPSENF-CSKSCYELCAQLHQLL--------------- 246
Query: 890 KKKHEESGSDNAVDFD-VRWRVLRGKKVDASD-------GTRALLSKAVSIFHDRFDPII 941
G N D + W + R K D + T + + + H+ F +
Sbjct: 247 -------GISNPTSVDGLTWTLTRSSK-DVYNFPGMPRSSTHVKSFQILRVMHECFRSVK 298
Query: 942 ESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
E ++ D++ ++Y G + ++HG Y +L +VS RI G + AE+PLVAT
Sbjct: 299 EPHTQKDMVTDLIYNSGSKFKRLNFHGFYAVVLNRGDQIVSVATLRIHGLKAAEMPLVAT 358
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRN 1059
+ + QG + L + KLL V+ L+LP+ + + +W FGFS M E+ + +
Sbjct: 359 PFNFRRQGMCRLLMQEVLKLLNKFRVERLILPAIPQLRKMWEASFGFSEMPLSERQQL-S 417
Query: 1060 DYPLMIFQGTSMLQKPVPKCRIV 1082
Y + FQGT MLQ + RI
Sbjct: 418 GYSFVGFQGTMMLQNVLTSSRIT 440
>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
Length = 999
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 153/382 (40%), Gaps = 87/382 (22%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNG-VSLHELSIKLSLERPFSSK--- 714
GI C CC+ ++ ++FE HAG + +KP+ +I+ G VSL + + + E+ S+
Sbjct: 685 GIFCGCCSKILTIAKFELHAG-SKEKKPYANIFLEGGKVSLLQCLLD-AWEKHTQSENKG 742
Query: 715 ---------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQK 765
E+DD C IC DGGDL+CCD C FH+DC+ + +PSG W+CR C+ F
Sbjct: 743 FYKIDKGDDEHDDTCAICGDGGDLVCCDHCASTFHLDCLGIK-LPSGDWYCRSCLCRF-- 799
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTE---LGGCVLCRGRDFCKSRFG 822
C + P + L C+ C
Sbjct: 800 ----------------------------CGFPQEKPSSSPELLLSCLQC----------- 820
Query: 823 RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKW--LCCADCKRINLALQKLVDRGEEK 880
R+YH C G + +P C C++I L KL+
Sbjct: 821 ----------SRKYHQTCSSGTGTDSGCTMPGTSIDCFCSPGCRKIYKRLNKLLG----- 865
Query: 881 LPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI 940
IK E S + V + + K + + + A ++ + F P
Sbjct: 866 --------IKNHMEAGFSWSLVHCFANDQAMPNKNKEKLAQCNSKTALAFTVLDECFQPH 917
Query: 941 IESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVA 998
I+ S +++I + Y G D+ G Y IL V++A RI G +LAE+P +
Sbjct: 918 IDDRSGINMIHNVAYNCGSDFSRLDFSGFYAFILERGDEVIAAASVRIHGTDLAEMPFIG 977
Query: 999 TSNDCQGQGYFQSLFCCIEKLL 1020
T + QG + L IE ++
Sbjct: 978 TRGMYRHQGMCRRLLNGIESVI 999
Score = 43.5 bits (101), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 8/71 (11%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKR 474
I E M V Y+ ++ + L G + GI C C C ++++T A FELHAGS K+
Sbjct: 659 ISEDMHVKYMNSNRTR----EMLAGKITREGIFCGC--C--SKILTIAKFELHAGSKEKK 710
Query: 475 PPEYIYLENGK 485
P I+LE GK
Sbjct: 711 PYANIFLEGGK 721
>gi|302763069|ref|XP_002964956.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
gi|300167189|gb|EFJ33794.1| hypothetical protein SELMODRAFT_23264 [Selaginella moellendorffii]
Length = 363
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 111/432 (25%), Positives = 170/432 (39%), Gaps = 135/432 (31%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSL-----HELSIKL-SLE---- 708
G++C+CC+ + S FEAH G + R+P +I+ + L ++ K+ +LE
Sbjct: 11 GLVCSCCDELFNLSGFEAHTG-SKLRRPAANIFVGDEAQLSIADCQNVAFKMETLESLPG 69
Query: 709 ------RPFSS------------------------KENDDLCGICMDGGDLLCCDSCPRA 738
R F S +D CGIC +GG+L+CC++
Sbjct: 70 LPVARRRKFDSYCQSDEDSGLTTVSSGSDVDYEAAANSDQCCGICNEGGELVCCET---- 125
Query: 739 FHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIV 798
C TF E C+ ++
Sbjct: 126 --------------------CPLTFHME---------------------------CVSLL 138
Query: 799 QTP-DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGME-DLQELPKGK 856
+ P D LC C RT C+QCER +H GC D + D G
Sbjct: 139 EVPKDAWFCFRCLC-----CHCGEPLRTQP-CEQCERCFHPGCCDDAILAGDFFFCSSGC 192
Query: 857 WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKV 916
W NL ++L E V E + W +LR +
Sbjct: 193 W---------NLF---------QRLAEMVATVNPLGRSE----------LSWSLLRRGRC 224
Query: 917 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTV 974
D LL++A+ + RFDP+++ ++LD + AMV+ RSH D+ G Y A+L
Sbjct: 225 DDK-----LLAEALQLISSRFDPVLDCWTQLDYLDAMVFSRSHHSPRLDFSGFYTAVLQR 279
Query: 975 NQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSAS 1034
VV + RI LAE+P +AT +GQG +SLF +E++L L V+T+ L +A
Sbjct: 280 GAEVVGVAVLRIHAAWLAEMPFIATKAGMEGQGICRSLFTAVEEMLARLGVETMALLAAK 339
Query: 1035 EAQAIWTNKFGF 1046
+ + +W N F F
Sbjct: 340 DTEKMWKNSFEF 351
>gi|297809221|ref|XP_002872494.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
gi|297318331|gb|EFH48753.1| hypothetical protein ARALYDRAFT_911302 [Arabidopsis lyrata subsp.
lyrata]
Length = 213
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 80/114 (70%), Gaps = 1/114 (0%)
Query: 970 AILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLV 1029
I+ N VV+AG+ R+FG+E+AELPLVAT + +GYFQ LF CIEKLL LNV ++V
Sbjct: 79 GIIQTNATVVAAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCIEKLLSSLNVGSIV 138
Query: 1030 LPSASEAQAIWTNKFGFSMMTEEEQNKY-RNDYPLMIFQGTSMLQKPVPKCRIV 1082
+P+A E + +W NKFGF + E+ +KY + Y ++ F+G SMLQKPV +I+
Sbjct: 139 VPAAEEEEHLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKPVDSHQII 192
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 28/42 (66%)
Query: 849 LQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 890
L+ELPKG W C DC RIN LQKL+ G EKL ++ +I+
Sbjct: 41 LKELPKGNWFCSMDCTRINSTLQKLLLGGAEKLSDSCFGIIQ 82
>gi|226505438|ref|NP_001145707.1| uncharacterized protein LOC100279211 [Zea mays]
gi|219884103|gb|ACL52426.1| unknown [Zea mays]
Length = 635
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/393 (26%), Positives = 163/393 (41%), Gaps = 95/393 (24%)
Query: 592 SPEPNSAQTSSHSKMKSS----SVKSHGKITRKDLR-----------------MHKLVFE 630
SP P +A T++HSK S+ S+ S KIT++ LR +H L+F+
Sbjct: 296 SPLP-TAVTTNHSKHDSTNLGLSLSSPVKITQRPLRNCSIDSKSKESKTRDTTLHPLIFK 354
Query: 631 EGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHI 690
E GL D + Y ++ GYK+G I+C CCN E SPS FE HAG RR+P+ +I
Sbjct: 355 EDGLADNTLLTYKLKNGEALKQGYKRGTCIICNCCNQEFSPSHFEEHAGMGRRRQPYHNI 414
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP 750
YT G+SLH+ + L L+ + D+ + L C R P
Sbjct: 415 YTLEGLSLHK--LALQLQDHLNPNGFDNASVSSVSDYHNLTSSGCGRE--------PSTT 464
Query: 751 SGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVL 810
SG + P + + R+V+T C
Sbjct: 465 SGP------------------------------IVPLKRTLQE--RVVETE-----SCYF 487
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH------GMEDLQELPKGKWLCCADCK 864
C T+I C+QCER H+ C + +E L+E +LCC +C+
Sbjct: 488 CGYGHTTIGNINPDTIIFCNQCERPCHIKCYNNRVVKKKVPLEILKEYMCFHFLCCQECQ 547
Query: 865 RINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRA 924
+L+ ++ G EK + +++ + ++ WR+L G +DAS +
Sbjct: 548 ----SLRARLEEGLEKC--VGITFLRR----------IRSNICWRLLSG--MDASRDVKL 589
Query: 925 LLSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 957
+ + + IF D F + S D+I MV G+
Sbjct: 590 YMPQVIDIFKDAFMDSTDEHS--DIISDMVNGK 620
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 8/106 (7%)
Query: 407 LKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAV-FE 465
++ L +G+LEG V Y++ + G + G G SC C C N + A FE
Sbjct: 69 VRGLLSTGLLEGFRVTYMKDEVEE-------VGRINGQGYSCGCSKCNYNSNIMNACEFE 121
Query: 466 LHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLG 511
H G S ++I+L+ G +L ++ K L L + +G
Sbjct: 122 EHYGQSFDNQIDHIFLDTGISLFRVVEALKPCKLNMLGDFIEEKIG 167
>gi|302786210|ref|XP_002974876.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
gi|300157771|gb|EFJ24396.1| hypothetical protein SELMODRAFT_101919 [Selaginella moellendorffii]
Length = 454
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 128/286 (44%), Gaps = 44/286 (15%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKR---------------- 865
++ CD C +H+ CL+ L+ +P+G W C CA C R
Sbjct: 140 LVCCDHCPSTFHLKCLR------LENVPEGDWFCPRCCCASCGRSLYDPTIQTEILYYHS 193
Query: 866 --INLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 923
+ K + + + + G N VD W +LR + D S
Sbjct: 194 NCVPGCAMKYESSDNQFCSRKCFKIFRGLRKLVGRVNKVDDMYSWTLLRSEHYDQSAENS 253
Query: 924 AL---------LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQD----YHGMYCA 970
L L+ A+++ + F P+I+ S +D++ ++Y R RG+D + G Y
Sbjct: 254 KLESVADLNTRLALALTVIQECFRPMIDPRSNIDMVSHILYNR--RGEDKRMDFRGFYTV 311
Query: 971 ILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVL 1030
+L Q ++S R+ G AE+P + T + + QG + L I+++L L V+TLVL
Sbjct: 312 VLEKEQELISVASMRVHGSHAAEIPFIGTRSQYRKQGMCRRLINVIQQVLHTLEVQTLVL 371
Query: 1031 PSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
P+ +E WT+ FGF +T Q + ++ F G+S+LQKP+
Sbjct: 372 PAIAEFIETWTSAFGFQKLT-AAQGIQLMELNIVTFPGSSVLQKPL 416
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 64/158 (40%), Gaps = 32/158 (20%)
Query: 634 LEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRR--------- 684
L +GA V Y + + G GILC CCN S + F+ HAG R
Sbjct: 13 LSEGAAVSYVNKDSNQVASGVISRDGILCKCCNEVFSMTSFQVHAGDEVHRTAALLTLED 72
Query: 685 ---------------------KPFQHIYTSNGVSLHELSIKLS--LERPFSSKENDDLCG 721
+P T + +L + +K S + ENDD C
Sbjct: 73 GRSVLECQKQALKKIEQAKCDEPANGQLTVDETALKAMELKESELVVDDVEMDENDDTCA 132
Query: 722 ICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
+C DGG L+CCD CP FH+ C+ L +P G W C C
Sbjct: 133 VCGDGGQLVCCDHCPSTFHLKCLRLENVPEGDWFCPRC 170
>gi|15237559|ref|NP_201195.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
gi|10177678|dbj|BAB11038.1| unnamed protein product [Arabidopsis thaliana]
gi|225879156|dbj|BAH30648.1| hypothetical protein [Arabidopsis thaliana]
gi|332010430|gb|AED97813.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain
[Arabidopsis thaliana]
Length = 557
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 153/371 (41%), Gaps = 87/371 (23%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
N D+C +C GGDLL CD CP AFH C+ L +P +++ +
Sbjct: 255 NCDVCCVCHWGGDLLLCDGCPSAFHHACLGLSSLP-------------EEDLWFCPCCCC 301
Query: 776 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 835
G +E +++L C +QC+R
Sbjct: 302 DICGSMES----------------PANSKLMAC---------------------EQCQRR 324
Query: 836 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 895
+H+ CLK ED + W C + C R+ AL+ L+
Sbjct: 325 FHLTCLK----EDSCIVSSRGWFCSSQCNRVFSALENLL--------------------- 359
Query: 896 SGSDNAV--DFDVRWRVLR----GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDL 949
GS AV D D+ W ++R G+ D D + L AV I H F+P + S DL
Sbjct: 360 -GSKIAVGNDGDLVWTLMRAPNEGEHYD--DEQISKLESAVEILHQGFEPTNDVFSGRDL 416
Query: 950 IPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYF 1009
+ ++Y + G G Y ++ ++ R+ +++ E+PLVAT + + G
Sbjct: 417 VEELIYRKDRTGVG-RGFYTVLIERKNEPITVAAVRV-DKDVVEIPLVATLSSYRRSGMC 474
Query: 1010 QSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGT 1069
+ L +EK + + V LVLP+A E WT +FGFS+M E+ + + ++ F GT
Sbjct: 475 RVLMDELEKQMSQMGVCRLVLPAAKEVVTTWTERFGFSVMNSSERLELVK-HGMLDFVGT 533
Query: 1070 SMLQKPVPKCR 1080
M K + K R
Sbjct: 534 IMCHKFLQKER 544
>gi|414866149|tpg|DAA44706.1| TPA: hypothetical protein ZEAMMB73_046351 [Zea mays]
Length = 206
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 118/222 (53%), Gaps = 21/222 (9%)
Query: 857 WLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKV 916
+LCC +C Q L R EE L E + + + S ++ WR+L G +
Sbjct: 5 FLCCQEC-------QSLRARLEEGL-EKCVGITFLRRIRS--------NICWRLLSG--M 46
Query: 917 DASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRS-HRGQDYHGMYCAILTVN 975
DAS + + + + IF D F + S D+I MV G++ + +D+ GMYCA+LT +
Sbjct: 47 DASRDVKLYMPQVIDIFKDAFMDSTDEHS--DIISDMVNGKNGDQEKDFRGMYCALLTAS 104
Query: 976 QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASE 1035
VVSA I ++ +++AEL L+AT ++C+ +GYF L IE L NV L P E
Sbjct: 105 THVVSAAILKVRIEQIAELVLIATRSECRKKGYFILLLKSIEANLRAWNVSLLTAPVDPE 164
Query: 1036 AQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVP 1077
IW+ K GF++++ EE+ +PL++F+ ++QK +
Sbjct: 165 MAQIWSEKLGFTILSAEEKESMLESHPLVMFKNLVLVQKSLA 206
>gi|224080293|ref|XP_002335635.1| predicted protein [Populus trichocarpa]
gi|222834490|gb|EEE72967.1| predicted protein [Populus trichocarpa]
Length = 240
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 112/214 (52%), Gaps = 12/214 (5%)
Query: 345 RPLRRFTRSLLQQKVELAKGS------LSKDGGKRSDVTEVANDGVGGPVKQ--ETVMKP 396
RP +R T+S L+ K++ + + + + R DV +A P K+ E M
Sbjct: 17 RP-KRVTKSKLKIKLQAVEVTVEGPEAIEGEALSRVDVEMIAEGSALTPPKKNLELKMSK 75
Query: 397 RKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGN 456
+ + +K E+G+LEG+ V+Y+ G K + GLRG +K GI C C C G
Sbjct: 76 KIALDNVPLTVKELFETGLLEGVPVVYMGGKKFQA---FGLRGTIKDVGILCSCAFCNGR 132
Query: 457 QVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMK 516
+V+ P+ FE+HA +R +YI ENGK+L D++N C+ +PL++LE ++ + ++
Sbjct: 133 RVIPPSQFEIHAIKQYRRAAQYICFENGKSLLDVLNACRTAPLDSLETTIQSAISGLPVE 192
Query: 517 KANFCLNCRVSFSNAGVEELMLLCKSCVELKESQ 550
+ C C+ F + V ++ LC C E KES
Sbjct: 193 RTFTCKRCKGIFPSICVGKIGPLCNLCAESKESH 226
>gi|414872770|tpg|DAA51327.1| TPA: hypothetical protein ZEAMMB73_851441 [Zea mays]
Length = 299
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 119/226 (52%), Gaps = 16/226 (7%)
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGK 914
G C C+ + LQ L+ ++ PE S V+++ HE DV VL
Sbjct: 24 GNLFCQQSCRLLFEELQNLLAVKKDLEPEYSCRVVQRIHE----------DVPEEVL--- 70
Query: 915 KVDASDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAIL 972
+D + ++ A+S+ + F PII+ + ++LI +VY G + D+ G Y IL
Sbjct: 71 ALDKRVECNSRIAVALSLMDECFLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFIL 130
Query: 973 TVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPS 1032
+++A RI G +LAE+P + T N + QG + L IE +L LNV+ L++P+
Sbjct: 131 ERGDEIIAAASVRIHGTKLAEMPFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPA 190
Query: 1033 ASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPK 1078
+E WT+KFGFS + + E+ + ++ +++F GT +LQKP+ K
Sbjct: 191 ITELVDTWTSKFGFSPLEDSEKQEVKS-ISMLVFPGTGLLQKPLLK 235
>gi|298204724|emb|CBI25222.3| unnamed protein product [Vitis vinifera]
Length = 248
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 71/94 (75%)
Query: 463 VFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCL 522
V++ HA SSNKRPPEYIYLENG +LR +MN K++ L +L++A+R+ +G S +K++ FCL
Sbjct: 139 VWKSHADSSNKRPPEYIYLENGTSLRGVMNAWKNAALYSLDEAIRVAIGCSMIKRSTFCL 198
Query: 523 NCRVSFSNAGVEELMLLCKSCVELKESQAGSAEI 556
NC+ AG+ +LC SC++LKESQAG +++
Sbjct: 199 NCKGRIFGAGIGNSKVLCLSCLQLKESQAGPSQV 232
>gi|255538062|ref|XP_002510096.1| DNA binding protein, putative [Ricinus communis]
gi|223550797|gb|EEF52283.1| DNA binding protein, putative [Ricinus communis]
Length = 290
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 125/259 (48%), Gaps = 34/259 (13%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKG---KWLCCADCKRINLALQKLVDRGEEKLP 882
++ CDQC R++HV C + G+ L+ KG W C C+ + LQ L+
Sbjct: 47 ILSCDQCPRKFHVACARSRGLIKLER--KGTCYSWFCSDKCEYVFSGLQHLLG------- 97
Query: 883 ETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASD-----GTRALLSKAVSIFHDRF 937
K G+DN + W +L+ + D D + L A+ + H+ F
Sbjct: 98 ---------KSVPVGTDN-----LTWTLLKRVEPDCFDLEVLSANNSKLKLALEVMHECF 143
Query: 938 DPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELP 995
+P ++ + DL+ +++ G + ++ G Y +L N + + R+FG ++AE+P
Sbjct: 144 EPAKDAFTGKDLVEDVIFSSGSNLNRLNFLGFYTVLLERNNELTTVANVRVFGDKVAEVP 203
Query: 996 LVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQN 1055
VAT + G + L +E+ L L V+ LVLP+A W FGFS+MT ++
Sbjct: 204 FVATKFQYRRLGMCRVLMNELERQLLNLGVEKLVLPAAFSTLETWIKGFGFSVMTYSDK- 262
Query: 1056 KYRNDYPLMIFQGTSMLQK 1074
K +DYP++ FQGT + QK
Sbjct: 263 KAHSDYPILFFQGTVLCQK 281
>gi|297793979|ref|XP_002864874.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297310709|gb|EFH41133.1| PHD finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 555
Score = 106 bits (264), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 156/371 (42%), Gaps = 87/371 (23%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
N D+C +C GGDLL CD CP AFH C+ L +P +++ +
Sbjct: 253 NCDVCCVCHWGGDLLLCDGCPSAFHHTCLGLSSLP-------------EEDLWFCPCCCC 299
Query: 776 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCERE 835
G +E +++L C +QC+R
Sbjct: 300 DICGSMES----------------PVNSKLMAC---------------------EQCQRR 322
Query: 836 YHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEE 895
+H+ CLK E+ + W C + C R++ AL+ L+
Sbjct: 323 FHLKCLK----EEPGIVSCRGWFCSSQCNRVSSALENLI--------------------- 357
Query: 896 SGSDNAV--DFDVRWRVLR----GKKVDASDGTRALLSKAVSIFHDRFDPIIESASKLDL 949
G AV + D+ W ++R G+ D D + L AV I H F+P + S DL
Sbjct: 358 -GCKIAVGNNGDLVWTLMRAPNEGEHYD--DEQISKLESAVEILHQGFEPTKDVFSGRDL 414
Query: 950 IPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDCQGQGYF 1009
+ +++ + G G Y ++ + ++ R+ +++ E+PLVAT ++ + G
Sbjct: 415 VEELIFRKDRTGVG-RGFYTVLIERKKEPITVAAVRV-DKDVVEIPLVATLSNYRRSGMC 472
Query: 1010 QSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGT 1069
+ L +EK + + V LVLP+A E + WT +FGFS+M E+ + + ++ F GT
Sbjct: 473 RVLVDELEKQMSQMGVCRLVLPAAKEVVSTWTQRFGFSVMESSERLELVK-HGMLDFVGT 531
Query: 1070 SMLQKPVPKCR 1080
M K + K R
Sbjct: 532 VMCHKFLVKER 542
>gi|218192546|gb|EEC74973.1| hypothetical protein OsI_11003 [Oryza sativa Indica Group]
Length = 234
Score = 106 bits (264), Expect = 8e-20, Method: Composition-based stats.
Identities = 54/146 (36%), Positives = 88/146 (60%), Gaps = 2/146 (1%)
Query: 931 SIFHDRFDPIIESASK-LDLIPAMVYGRSHRGQ-DYHGMYCAILTVNQVVVSAGIFRIFG 988
+I D D E+A + +D I MV + G+ D+ G+YCA+LT + VVSA I ++
Sbjct: 86 TILSDEQDAFAETAPQDIDDIRNMVNSKDTTGEKDFRGIYCAVLTTSTFVVSAAILKVRT 145
Query: 989 QELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSM 1048
+E+AEL L+AT N+C+ +GYF L IE L NV+ L P E IW+ K G+++
Sbjct: 146 EEVAELVLIATHNECRKKGYFSLLLSLIEAHLKAWNVRLLTAPVDPEMAPIWSEKLGYTI 205
Query: 1049 MTEEEQNKYRNDYPLMIFQGTSMLQK 1074
+++E+++ +PL++F S++QK
Sbjct: 206 LSDEQKHSMLMAHPLVMFANLSLVQK 231
>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
Length = 1003
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 124/255 (48%), Gaps = 37/255 (14%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETS 885
+I CD C +H CL E LQ L L ++ ++ G S
Sbjct: 298 LICCDGCPSTFHQSCLDIQLFEQLQML---------------LGVKHELEDG------FS 336
Query: 886 LDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPIIESAS 945
++++ V FD+ + +KV+ + + L+ A+SI + F PI++ S
Sbjct: 337 WTLVQRTE--------VGFDISLNGIP-QKVECN----SKLAVALSIMDECFLPIVDQRS 383
Query: 946 KLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATSNDC 1003
++LI ++Y G + +Y G + AIL + ++SA RI G +LAE+P + T +
Sbjct: 384 GINLIHNVLYNCGSNFNRLNYSGFFTAILERGEEIISAASIRIHGNKLAEMPFIGTRHIY 443
Query: 1004 QGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPL 1063
+ QG + L IE L LNV+ LV+P+ SE WT+ FGF + + + RN +
Sbjct: 444 RRQGMCRRLLNAIESALHSLNVEKLVIPAISELMQTWTSVFGFKPLEVSSRKEMRN-MNM 502
Query: 1064 MIFQGTSMLQKPVPK 1078
++F GT MLQKP+ K
Sbjct: 503 LVFHGTDMLQKPLLK 517
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 61/122 (50%), Gaps = 14/122 (11%)
Query: 638 AEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGV 696
A+V Y R + + L+ G+ GI C CC+ + S+FE HAG +P Q+I G+
Sbjct: 195 AKVQYMNRRKTRALLEGWISRDGIRCGCCSEIFTISKFEIHAGM-KLCEPSQNIILETGI 253
Query: 697 SLHELSIK------------LSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 744
SL + + L + NDD CGIC DGGDL+CCD CP FH C+
Sbjct: 254 SLLQCQLDSWNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDGCPSTFHQSCL 313
Query: 745 SL 746
+
Sbjct: 314 DI 315
>gi|413953618|gb|AFW86267.1| hypothetical protein ZEAMMB73_807634 [Zea mays]
Length = 108
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/110 (49%), Positives = 68/110 (61%), Gaps = 2/110 (1%)
Query: 703 IKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNT 762
+K+S + S +E DDLC C GGD+ C CPR+FH CV L +PS W+C C N
Sbjct: 1 MKISKDMELSERETDDLCRECGQGGDIFPCKICPRSFHPACVGLSKVPS-EWYCDNCRNL 59
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR 812
QKEK + N NA+AAGR GVD Q++ R IRIV D +LGGC LC+
Sbjct: 60 VQKEKALAENKNAKAAGRQAGVDSIEQIMKRAIRIVPISD-DLGGCALCK 108
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 103 bits (256), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 135/345 (39%), Gaps = 109/345 (31%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHE-LSIKLSLERPFSSK--- 714
GI+C CC +S S F AHAG + H +S G+ L S L L +S++
Sbjct: 889 GIVCNCCKKNLSVSDFMAHAGRS-------HPQSSLGLFLESGKSYTLCLVEAWSAESMS 941
Query: 715 --------------ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
E+DD CG C DGG+LLCCD+CP +H C+S +P G+W+C C
Sbjct: 942 RRSNAWGRKVEAIDESDDTCGFCGDGGELLCCDNCPSTYHQACLSAKELPEGSWYCHNCT 1001
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
C +C G F +
Sbjct: 1002 -----------------------------------------------CQVCGGP-FSEKE 1013
Query: 821 FGRRTVIL-CDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
+ I C QC YH C++ + L++ W C CK I + L+ V
Sbjct: 1014 VSTFSAIFKCFQCGDAYHDTCIEQEKLP-LEDQISQTWFCGKYCKEIFIGLRSHV----- 1067
Query: 880 KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLR----GKKVDASDGTRAL------LSKA 929
G+DN +D D+ W +LR G+K+ + L L+ A
Sbjct: 1068 -----------------GTDNILDSDLSWSILRCNNDGQKLHSVQKIACLAECNMKLAVA 1110
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAIL 972
+++ + F +++ + +D+IP ++Y G + DY G Y IL
Sbjct: 1111 LTLLEECFIRMVDPRTGVDMIPHVLYNKGSNFARVDYQGFYTVIL 1155
>gi|413916050|gb|AFW55982.1| hypothetical protein ZEAMMB73_283196 [Zea mays]
Length = 831
Score = 100 bits (250), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/267 (27%), Positives = 127/267 (47%), Gaps = 33/267 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKRINL---ALQKLVDRG- 877
++LCD+C +H C+ LQ P+G W C C C +L + D+
Sbjct: 576 LLLCDKCPSAFHHACV------GLQATPEGDWCCPLCRCGVCGGSDLDDDTAEGFTDKTI 629
Query: 878 ----EEKLPET----SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
+P T SL ++++ S V RW+ ++ DA+D + L A
Sbjct: 630 IYCEARSIPTTVEGVSLSTLRRRRYMS----TVTRITRWQ---HEEEDAADHGQ--LCAA 680
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVYGRSHR--GQDYHGMYCAILTVNQVVVSAGIFRIF 987
+ + H+ FD ++E ++ DL +V+ + ++ G Y L +++ G R+F
Sbjct: 681 LDVLHECFDDMVEPRTQTDLAADIVFNQESGLCRLNFRGYYVVGLEKAGELITVGTLRVF 740
Query: 988 GQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFS 1047
G ++AELPLV T + QG + L +EK+L + V+ LVLP+ E +WT GF
Sbjct: 741 GNQVAELPLVGTRFAHRRQGMCRLLVTELEKMLRQVGVRRLVLPAVPELLPMWTASLGFH 800
Query: 1048 MMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
MT + + ++ ++ F+GT+M QK
Sbjct: 801 AMTRSDVMEMAVEHAILSFKGTTMCQK 827
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%)
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
S +E D +C +C+DGG+LL CD CP AFH CV L P G W C C
Sbjct: 559 SEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLC 606
>gi|296088061|emb|CBI35420.3| unnamed protein product [Vitis vinifera]
Length = 104
Score = 100 bits (248), Expect = 5e-18, Method: Composition-based stats.
Identities = 45/69 (65%), Positives = 53/69 (76%), Gaps = 1/69 (1%)
Query: 763 FQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFG 822
FQ+EKFVE+NANA AAGR+ GVDP Q+ RCIRIV P+ E+ CVLCRG DF KS FG
Sbjct: 2 FQREKFVEHNANAVAAGRVSGVDPIEQITKRCIRIVN-PEAEVSACVLCRGYDFSKSGFG 60
Query: 823 RRTVILCDQ 831
R +I+CDQ
Sbjct: 61 PRMIIMCDQ 69
>gi|242087023|ref|XP_002439344.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
gi|241944629|gb|EES17774.1| hypothetical protein SORBIDRAFT_09g004810 [Sorghum bicolor]
Length = 872
Score = 100 bits (248), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 149/376 (39%), Gaps = 103/376 (27%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANA 775
+DD CG+C DGG+LLCCD +C +TF E
Sbjct: 436 SDDACGVCADGGELLCCD------------------------FCTSTFHPE--------- 462
Query: 776 RAAGRIEGVDPFAQMVSRCIRIVQTPDTELG----GCVLCRGRDFCKSRFGRRTVILCDQ 831
C+ I + PD C LC D + + C +
Sbjct: 463 ------------------CLAI-EVPDGSWSCHYCRCTLCMSND-------DQDLSTCQE 496
Query: 832 CEREYHVGCLKDHGM-EDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPETSLDVIK 890
C +YH C G D+ C CK+++ L +++
Sbjct: 497 CACKYHESCRPLLGNGRDIGAY------CGEICKKLSAKLSEVI---------------- 534
Query: 891 KKHEESGSDNAVDFDVRWRVLRGKKVD--ASDGTRALLSK------AVSIFHDRFDPIIE 942
G N+ + W +LR + + +S G A+L + A+ + + F+P+ +
Sbjct: 535 ------GVMNSTEDGFSWSLLRIHEDEPASSQGMPAVLERNVKLAVALGVLNQCFNPVKD 588
Query: 943 SASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVATS 1000
+K+D++ VY G + Y G Y IL + +VS + RI G+++AE+P T
Sbjct: 589 RRTKIDMLHQAVYSLGSQFKRLSYEGFYTMILEKDGEIVSTALLRIHGRKVAEMPFAGTL 648
Query: 1001 NDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRND 1060
+ QG + +E++L + V+TL++P+ + W F F + + + + +
Sbjct: 649 PAYRKQGMMHRVVSAVEQVLASVQVETLIIPAIASMVDTWKRSFSFRPVDPQLREELKR- 707
Query: 1061 YPLMIFQGTSMLQKPV 1076
L++ GT+ML KPV
Sbjct: 708 LSLVVITGTTMLHKPV 723
>gi|226533395|ref|NP_001140625.1| uncharacterized protein LOC100272699 [Zea mays]
gi|194700228|gb|ACF84198.1| unknown [Zea mays]
Length = 211
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 85/144 (59%), Gaps = 3/144 (2%)
Query: 937 FDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAEL 994
F PII+ + ++LI +VY G + D+ G Y IL +++A RI G +LAE+
Sbjct: 5 FLPIIDQRTGINLIRNVVYSCGSNFARLDFRGFYIFILERGDEIIAAASVRIHGTKLAEM 64
Query: 995 PLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQ 1054
P + T N + QG + L IE +L LNV+ L++P+ +E WT+KFGFS + + E+
Sbjct: 65 PFIGTRNMYRRQGMCRRLVDGIEMILSSLNVEKLIIPAITELVDTWTSKFGFSPLEDSEK 124
Query: 1055 NKYRNDYPLMIFQGTSMLQKPVPK 1078
+ ++ +++F GT +LQKP+ K
Sbjct: 125 QEVKS-ISMLVFPGTGLLQKPLLK 147
>gi|293331683|ref|NP_001170374.1| uncharacterized protein LOC100384354 [Zea mays]
gi|224035435|gb|ACN36793.1| unknown [Zea mays]
Length = 336
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 85/151 (56%), Gaps = 3/151 (1%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 983
L AV I H+ F IIE ++ D+ +V+ R R ++ G Y +L +VS G
Sbjct: 8 LCIAVDILHECFVTIIEPRTQSDISEDIVFNRESELRRLNFRGFYIILLQKGGELVSVGT 67
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
FRI GQ+ AELPL+ T + + QG + L +EKLL L V+ L+LP+ E WT
Sbjct: 68 FRICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCS 127
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
FGF++M+ E+ + + ++ FQGT+M QK
Sbjct: 128 FGFTVMSNSERLELAGNS-ILSFQGTTMCQK 157
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 83/151 (54%), Gaps = 3/151 (1%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQ--DYHGMYCAILTVNQVVVSAGI 983
L A+S+ H+ F+P+ ES S DL+ +++ R ++ G Y +L N+ ++S
Sbjct: 979 LHLAISVMHECFEPLKESLSNRDLVEDVIFSRWSELNRLNFQGFYTVLLERNEELISVAT 1038
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
R++G+++AE+PLV T + G L +EK L L V+ LVLP+ WT
Sbjct: 1039 VRVYGKKVAEIPLVGTRLQYRRLGMCHILIEELEKKLKQLGVERLVLPAVPSVLETWTRS 1098
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
FGF+ MT E++++ DY + FQG M QK
Sbjct: 1099 FGFAKMTNLERSQFL-DYTFLDFQGAIMCQK 1128
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 87/219 (39%), Gaps = 60/219 (27%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERP-------- 710
GI C CC S FE HA S +P I+ +G SL + IK+ +
Sbjct: 572 GIKCNCCMGIYSFVGFENHASGNSTCRPSASIFLEDGRSLLDCLIKMMHDHKTMETSGKS 631
Query: 711 ---FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEK 767
S END +C +C GG+L+ CD CP +FH C+ L IP+G W C C ++
Sbjct: 632 FSGLSLVENDYICSVCHYGGELILCDKCPSSFHKTCLGLEDIPNGDWFCPSCCCGICGQR 691
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
+I+ D Q++ CI
Sbjct: 692 ------------KIDRDDEVEQLLP-CI-------------------------------- 706
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
QCE +YHV CL++ G D+ G W C DC+++
Sbjct: 707 ---QCEHKYHVRCLEN-GAADISTRYLGNWFCGKDCEKL 741
>gi|125529239|gb|EAY77353.1| hypothetical protein OsI_05335 [Oryza sativa Indica Group]
Length = 895
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 27/277 (9%)
Query: 826 VILCDQCEREYHVGCL------KDHGMEDLQELP----KGKWLCCADCKRINLALQ---K 872
++ CD C +H CL K + D Q+L + WL A I+ Q
Sbjct: 538 LLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRP 597
Query: 873 LVDRGEE---KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK---VDASDGTRAL- 925
L G E ET + + G N + W +L+ +K V + D L
Sbjct: 598 LQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQKDELVTSEDMPVILE 657
Query: 926 ----LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVV 979
L+ A+ + ++ F+P+ + +K+D++ VY G + +Y G Y +L + ++
Sbjct: 658 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 717
Query: 980 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1039
S + R G++LAE+P T Q QG + L +EK+L L V+ LV+P+ ++
Sbjct: 718 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 777
Query: 1040 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
W F F M E +++ + L+ GT++LQKP+
Sbjct: 778 WKRSFSFRPMQAEVRDEAKK-LSLVAITGTTLLQKPI 813
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
+DD CG+C DGG LLCCD+CP FH DC+++
Sbjct: 525 SDDACGVCADGGQLLCCDTCPSTFHPDCLAI 555
>gi|57900165|dbj|BAD88250.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
gi|125573433|gb|EAZ14948.1| hypothetical protein OsJ_04879 [Oryza sativa Japonica Group]
Length = 897
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 27/277 (9%)
Query: 826 VILCDQCEREYHVGCL------KDHGMEDLQELP----KGKWLCCADCKRINLALQ---K 872
++ CD C +H CL K + D Q+L + WL A I+ Q
Sbjct: 540 LLCCDTCPSTFHPDCLAIQFMIKSWLLFDRQQLTTIYGQQPWLQTAPGAAISADHQYCRP 599
Query: 873 LVDRGEE---KLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKK---VDASDGTRAL- 925
L G E ET + + G N + W +L+ +K V + D L
Sbjct: 600 LQSPGFEIGAYCSETCKKMSSHLSDMIGVMNHTEDGFSWALLKIQKDELVTSEDMPVILE 659
Query: 926 ----LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVV 979
L+ A+ + ++ F+P+ + +K+D++ VY G + +Y G Y +L + ++
Sbjct: 660 SNVKLAVALGVLNECFNPVQDRRTKIDMLHQAVYSLGSEFKRVNYEGFYTMVLEKDGEII 719
Query: 980 SAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAI 1039
S + R G++LAE+P T Q QG + L +EK+L L V+ LV+P+ ++
Sbjct: 720 SVALLRFHGRKLAEMPFAGTLPAYQKQGMMRRLVKAVEKVLASLQVENLVIPAVADLVET 779
Query: 1040 WTNKFGFSMMTEEEQNKYRNDYPLMIFQGTSMLQKPV 1076
W F F M E +++ + L+ GT++LQKP+
Sbjct: 780 WKRSFSFRPMQAEVRDEAKK-LSLVAITGTTLLQKPI 815
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 19/31 (61%), Positives = 25/31 (80%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL 746
+DD CG+C DGG LLCCD+CP FH DC+++
Sbjct: 527 SDDACGVCADGGQLLCCDTCPSTFHPDCLAI 557
>gi|297800714|ref|XP_002868241.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
gi|297314077|gb|EFH44500.1| hypothetical protein ARALYDRAFT_330015 [Arabidopsis lyrata subsp.
lyrata]
Length = 1008
Score = 90.1 bits (222), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 90/185 (48%), Gaps = 17/185 (9%)
Query: 628 VFEEGGLEDGAEVGYF-VRGEVKFLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKP 686
+ + G L+ +V Y R L G+ GI C CC+ +S S+FE HAG + R+P
Sbjct: 537 LIDSGTLKLSEKVMYMNQRRTHAMLEGWITRDGIHCGCCSKILSVSKFEIHAG-SKLRQP 595
Query: 687 FQHIYTSNGVSLHELSIKLSLERP------------FSSKENDDLCGICMDGGDLLCCDS 734
FQ+I+ + GVSL + I ++ + NDD CGIC DGGDL+CCD
Sbjct: 596 FQNIFLNTGVSLFQCQIDAWDKQKGAGNIGFCSVDVIADDPNDDACGICGDGGDLVCCDG 655
Query: 735 CPRAFHIDCVSLPGIPSGTW-HCRYCMNTFQKEKFVEYNANARAAGRIEG--VDPFAQMV 791
CP FH C+ + G W R+ F + +A + G++ + P Q+
Sbjct: 656 CPSTFHQRCLDIRGHLMPDWIFLRFNYRCFLLVIGIAPIVHANSVGQLLKMLLRPRMQIP 715
Query: 792 SRCIR 796
++C+R
Sbjct: 716 AKCVR 720
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 52/96 (54%), Gaps = 2/96 (2%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGI 983
L+ A+++ + F PI++ S ++++ ++Y G + ++ G Y A+L VV++
Sbjct: 770 LALALTVMDECFLPIVDRRSGVNIVRNVLYNCGSNFNRLNFGGFYTALLERGDEVVASAS 829
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKL 1019
R G LAE+P + T + + QG + LF IE +
Sbjct: 830 IRFHGNHLAEMPFIGTRHVYRHQGMCRRLFSVIESV 865
>gi|125559705|gb|EAZ05241.1| hypothetical protein OsI_27443 [Oryza sativa Indica Group]
Length = 681
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 120/268 (44%), Gaps = 33/268 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC----CADCKRINLALQKLVDRGEEKL 881
+++CD+C +H C+ L+ P+G W C CA C +L +
Sbjct: 423 LLMCDRCPSMFHHACV------GLESTPQGDWFCPACTCAICGSSDL----------DDP 466
Query: 882 PETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPII 941
P T+ + SD V + R + D + A L A+ + + F +I
Sbjct: 467 PATTTT------QGFSSDRMV---ISCEQCRRESRDGEEEEHAKLCMALDVLRECFVTLI 517
Query: 942 ESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGIFRIFGQELAELPLVAT 999
E ++ DL +V+ R D+ G Y L +++ R++G+E+AE+PLV T
Sbjct: 518 EPRTQTDLTADIVFNTESELRRLDFRGFYVVGLEKAGELIAVATLRVYGEEVAEVPLVGT 577
Query: 1000 SNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTN-KFGFSMMTEEEQNKYR 1058
+ QG + L I+KLLG + V+ LVLP+ E A WT FG M + ++
Sbjct: 578 RFARRRQGMCRLLMDEIQKLLGEMGVERLVLPAVPEMVATWTGPSFGIREMGQADRQDVA 637
Query: 1059 NDYPLMIFQGTSMLQKPVPKCRIVGKSV 1086
+ + ++ FQGT M K +P +G +
Sbjct: 638 H-HAILRFQGTIMCHKQLPPQPQLGHTT 664
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 54/122 (44%), Gaps = 29/122 (23%)
Query: 667 SEVSPSQFEAHA----GWASRRKPFQHIYTSNGVSLHELSIKLSLER------------- 709
+E + ++F H +S R+P+ ++ +G SL + ++L +
Sbjct: 332 AEYTVAEFAVHGDGDVASSSSRQPWARMFVGDGRSLSQCLVQLMMADDEAGSGRKKKKKK 391
Query: 710 --PFSSK---------ENDD-LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCR 757
P+ + E+DD +C +C D G+LL CD CP FH CV L P G W C
Sbjct: 392 YLPYVWRGARVKRKWEEDDDYVCSVCHDCGELLMCDRCPSMFHHACVGLESTPQGDWFCP 451
Query: 758 YC 759
C
Sbjct: 452 AC 453
>gi|357503057|ref|XP_003621817.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
gi|355496832|gb|AES78035.1| hypothetical protein MTR_7g023750 [Medicago truncatula]
Length = 537
Score = 88.2 bits (217), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 81/140 (57%), Gaps = 13/140 (9%)
Query: 388 VKQETVMKPR--KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGV 440
K ETV+K + K +K F S +K+ L +GI +G+ V Y S+ K L+GV
Sbjct: 379 AKNETVVKNKEPKTAKKPSTNSFPSNVKSLLSTGIFDGIPVKYCTWSREKN-----LQGV 433
Query: 441 VKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLE 500
+KG+G C CD CKG + + FE HAG+ +K P +I+ ENGK++ ++ K+SP E
Sbjct: 434 IKGTGYLCSCDICKGQKALNAYEFERHAGAKSKHPNSHIFFENGKSVYAVVQELKNSPQE 493
Query: 501 TLEKAVRMVLGSSSMKKANF 520
L A++ V G +++ + NF
Sbjct: 494 MLFDAIQTVTG-ATINQRNF 512
Score = 41.6 bits (96), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 85/194 (43%), Gaps = 33/194 (17%)
Query: 512 SSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVELKESQAGSAEIKEPLSHSSEMEPQPP 571
SSS+ F +N +++ S M+ + ++ A E ++ LS SS P PP
Sbjct: 319 SSSISFGGFHVNPQLNDSTG-----MINSSDPLMGNQNSAQGVESQKDLSESS---PNPP 370
Query: 572 SVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEE 631
+ +P S +T ++ EP +A+ S + S+ VKS
Sbjct: 371 ---VNSTPK-SNAKNETVVKNKEPKTAKKPSTNSFPSN-VKS---------------LLS 410
Query: 632 GGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPF 687
G+ DG V Y K L G KG G LC+C C + ++ +FE HAG A + P
Sbjct: 411 TGIFDGIPVKYCTWSREKNLQGVIKGTGYLCSCDICKGQKALNAYEFERHAG-AKSKHPN 469
Query: 688 QHIYTSNGVSLHEL 701
HI+ NG S++ +
Sbjct: 470 SHIFFENGKSVYAV 483
>gi|356499663|ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
Length = 581
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 71/120 (59%), Gaps = 6/120 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +GI +G+ V Y+ S+ K L+G++KG+G C CD+C ++ +
Sbjct: 441 NNFPSNVKSLLSTGIFDGVQVKYVSWSREKS-----LKGIIKGTGYLCSCDNCNQSKALN 495
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAG+ K P +IY ENGKT+ ++ K++P + L A++ V G S++ + NF
Sbjct: 496 AYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVTG-STINQKNF 554
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG +V Y K L G KG G LC+C CN ++ +FE HAG A + P
Sbjct: 454 GIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTKHPNN 512
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 513 HIYFENGKTIYAV 525
>gi|297606567|ref|NP_001058661.2| Os06g0731100 [Oryza sativa Japonica Group]
gi|255677428|dbj|BAF20575.2| Os06g0731100, partial [Oryza sativa Japonica Group]
Length = 78
Score = 87.0 bits (214), Expect = 5e-14, Method: Composition-based stats.
Identities = 40/73 (54%), Positives = 51/73 (69%)
Query: 1006 QGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMI 1065
QGYFQ+LF CIE+LL L VK VLP+A EA++IWT +FGF +T++E +Y +
Sbjct: 3 QGYFQALFGCIERLLASLKVKHFVLPAADEAESIWTQRFGFVKITQDELREYLKGGRTTV 62
Query: 1066 FQGTSMLQKPVPK 1078
FQGTS L K VPK
Sbjct: 63 FQGTSTLHKLVPK 75
>gi|255584782|ref|XP_002533109.1| DNA binding protein, putative [Ricinus communis]
gi|223527100|gb|EEF29281.1| DNA binding protein, putative [Ricinus communis]
Length = 422
Score = 86.3 bits (212), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/254 (27%), Positives = 114/254 (44%), Gaps = 32/254 (12%)
Query: 290 EKKNDIERELVENGVLESSMVG---KHSSTLC----NGESNVAKSVAVDGNDEGKTVN-- 340
E +DI + N ESS + H C G+ N G +EG ++
Sbjct: 159 ENNSDIPTDQAFNRENESSFISIGDAHIVATCPTYIKGDDNAISISDAYGKEEGNMISFG 218
Query: 341 --------VVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQET 392
+ V RP+ + +S + V+ + K+ SD A++ K E+
Sbjct: 219 EFHDAHDMIAVGRPISSYAQSYDESSVQTPEAVQQKEFDA-SDAHATASNTRVAKSKTES 277
Query: 393 VMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGS 444
V + + K RK F S +++ + +G+L+G+ V YI S+ + LRGV+KGS
Sbjct: 278 VSRNKPEVKTGRKEAPNSFPSNVRSLISTGMLDGVPVKYIALSREE------LRGVIKGS 331
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C ++V+ FE HAG K P +IY ENGKT+ I+ + +P L
Sbjct: 332 GYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESMLFD 391
Query: 505 AVRMVLGSSSMKKA 518
++ V G+ +K+
Sbjct: 392 VIQTVFGAPINQKS 405
>gi|356544590|ref|XP_003540732.1| PREDICTED: uncharacterized protein LOC100819317 [Glycine max]
Length = 502
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 84/167 (50%), Gaps = 24/167 (14%)
Query: 388 VKQETVMKPRKVMR--------KFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
VK ETV K ++ ++ F S +++ + +GIL+G+ V YI S+ + LRG
Sbjct: 344 VKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSREE------LRG 397
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C +V+ FE HAG K P +IY ENGKT+ I+ + +P
Sbjct: 398 IIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 457
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNA--------GVEELML 538
L ++ V G+ +KA N + SF A G EEL L
Sbjct: 458 SLLFDTIQTVFGAPIHQKA--FRNWKESFQAATRELQRIYGKEELNL 502
>gi|297793537|ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp.
lyrata]
Length = 415
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K ET ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG C
Sbjct: 266 KAETKSSKKEASTSFPSNVRSLISTGMLDGVPVTYVSISREE------LRGVIKGSGYLC 319
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C+ +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 320 GCQTCEFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQT 379
Query: 509 VLGSSSMKKA 518
V GS +KA
Sbjct: 380 VFGSPINQKA 389
>gi|356541246|ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max]
Length = 463
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 101/214 (47%), Gaps = 25/214 (11%)
Query: 341 VVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVM 400
+ V RP + + Q V ++ + K+ S VA+ VK ETV K ++ +
Sbjct: 259 ISVGRPAAEYDQLYNQSSVHVSTTAHEKELDVSSS-DAVASTLQVAKVKSETVSKNKQEL 317
Query: 401 R--------KFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDD 452
+ F S +++ + +GIL+G+ V Y+ S+ + LRG++KGSG C C
Sbjct: 318 KTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSREE------LRGIIKGSGYLCGCQS 371
Query: 453 CKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGS 512
C +V+ FE HAG K P +IY ENGKT+ I+ + +P L ++ V G+
Sbjct: 372 CNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTVFGA 431
Query: 513 SSMKKANFCLNCRVSFSNA--------GVEELML 538
+KA N + SF A G EEL L
Sbjct: 432 PINQKA--FRNWKESFQAATRELQRIYGKEELNL 463
>gi|224059526|ref|XP_002299890.1| predicted protein [Populus trichocarpa]
gi|222847148|gb|EEE84695.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 99/199 (49%), Gaps = 16/199 (8%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ G D+ + + V+RPL + S Q V + ++ + + + VA++
Sbjct: 178 LSFGGFDDAHDI-IPVDRPLSSYDHSYDQSSVR-TREAVDEKELRTTTAKAVASNTQATK 235
Query: 388 VKQETVMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
+ E V K R K RK F S +++ + +G+L+G+ V Y+ S+ + LRG
Sbjct: 236 SRTEPVSKNRPELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYVSLSREE------LRG 289
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C ++V+ FE HAG K P +IY ENGKT+ I+ + +P
Sbjct: 290 IIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPE 349
Query: 500 ETLEKAVRMVLGSSSMKKA 518
L ++ V G+ +K+
Sbjct: 350 SMLFDVIQTVFGAPINQKS 368
>gi|357510883|ref|XP_003625730.1| PHD zinc finger protein-like protein [Medicago truncatula]
gi|355500745|gb|AES81948.1| PHD zinc finger protein-like protein [Medicago truncatula]
Length = 171
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 64/103 (62%)
Query: 434 VSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNV 493
VSGLRGV++ GI C C C+G +V++P+ FE+HA +R EYI ENGK+L D++
Sbjct: 32 VSGLRGVIRDEGILCSCCLCEGRRVISPSQFEIHACKQYRRAVEYICFENGKSLLDLLRA 91
Query: 494 CKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFSNAGVEEL 536
C+ +PL LE ++ ++ S +K C C+ F ++ +E +
Sbjct: 92 CRGAPLHDLEATIQNIVCSPPEEKYFTCKRCKGRFPSSCMERV 134
>gi|449459968|ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
Length = 582
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 7/145 (4%)
Query: 390 QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+E M + F S +K+ L +G+L+G+ V Y+ S+ K L+G++KG+G C
Sbjct: 431 KEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKN-----LKGIIKGTGYLCS 485
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C++C ++ + FE HAG K P +IY ENGKT+ ++ K++P E L A++ V
Sbjct: 486 CENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNV 545
Query: 510 LGSSSMKKANFCLNCRVSFSNAGVE 534
G S + + NF + + S+ A +E
Sbjct: 546 TG-SPINQKNFRI-WKASYQAATLE 568
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG V Y K L G KG G LC+C CN ++ +FE HAG ++ P
Sbjct: 455 GMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKH-PNN 513
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 514 HIYFENGKTIYAV 526
>gi|449521523|ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
Length = 561
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 81/145 (55%), Gaps = 7/145 (4%)
Query: 390 QETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+E M + F S +K+ L +G+L+G+ V Y+ S+ K L+G++KG+G C
Sbjct: 410 KEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKN-----LKGIIKGTGYLCS 464
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C++C ++ + FE HAG K P +IY ENGKT+ ++ K++P E L A++ V
Sbjct: 465 CENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNV 524
Query: 510 LGSSSMKKANFCLNCRVSFSNAGVE 534
G S + + NF + + S+ A +E
Sbjct: 525 TG-SPINQKNFRI-WKASYQAATLE 547
Score = 43.1 bits (100), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG V Y K L G KG G LC+C CN ++ +FE HAG ++ P
Sbjct: 434 GMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKH-PNN 492
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 493 HIYFENGKTIYAV 505
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 68/131 (51%), Gaps = 16/131 (12%)
Query: 650 FLVGYKKGFGILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL-------S 702
L G+ GI C+CC+ +S +F HAG + KP+++I +G+ + L +
Sbjct: 927 LLEGFVTRDGINCSCCSEVISVPEFVTHAG-SEVNKPYRNILV-DGLDIDLLHCLINAWN 984
Query: 703 IKLSLER----PFS---SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWH 755
++ ER P S NDD CGIC DGG+L+CCD CP FH+ C+ L +P+ W
Sbjct: 985 MQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDGCPSTFHMSCLGLEALPTDYWC 1044
Query: 756 CRYCMNTFQKE 766
C C F E
Sbjct: 1045 CSNCSCKFCHE 1055
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 117/252 (46%), Gaps = 57/252 (22%)
Query: 870 LQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKA 929
LQ L+ ++ PE S V+++ HEE + + D K+V+ + + ++ A
Sbjct: 1242 LQNLLAVKKDLEPEYSCRVVQRIHEEV-PEEVLALD--------KRVECN----SKIAVA 1288
Query: 930 VSIFHDRFDPIIESASKLDLIPAMVY--GRSHRGQDYHGMYCAILTVNQVVVSAGIFR-- 985
+S+ + F PI++ + ++LI +VY G + D+ G Y IL +++A R
Sbjct: 1289 LSLMDECFLPIVDQRTGINLIRNVVYNCGSNFARLDFRGFYIIILERGDEIIAAASVRLK 1348
Query: 986 ---------------------------------------IFGQELAELPLVATSNDCQGQ 1006
I G +LAE+P + T N + Q
Sbjct: 1349 EKNILTGMPSILVYRVQSHGGKPPFIFLKLLRSFECFLSIHGTKLAEMPFIGTRNMYRRQ 1408
Query: 1007 GYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIF 1066
G + L IE +L LN++ L++P+ +E WT+KFGFS + + E+ + ++ +++F
Sbjct: 1409 GMCRRLVDGIEMILSSLNIEKLIIPAITELVDTWTSKFGFSPLDDSEKQEVKS-VSMLVF 1467
Query: 1067 QGTSMLQKPVPK 1078
GT +LQKP+ K
Sbjct: 1468 PGTGLLQKPLLK 1479
>gi|224112831|ref|XP_002316304.1| predicted protein [Populus trichocarpa]
gi|222865344|gb|EEF02475.1| predicted protein [Populus trichocarpa]
Length = 560
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 8/146 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +G+L+G++V Y+ S+ K LRG +KG+G C C C GN+V+
Sbjct: 421 NNFPSNVKSLLSTGLLDGVAVKYVSWSREKT-----LRGTIKGTGYLCSCKVC-GNKVLN 474
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HA K P +IY ENGKT+ ++ K++P E L A+ V G S++ + NF
Sbjct: 475 AYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFNAIETVTG-SAINQKNF 533
Query: 521 CLNCRVSFSNAGVEELMLLCKSCVEL 546
L+ + S+ A E + K V L
Sbjct: 534 -LSWKASYEAATRELQRIYGKEEVTL 558
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 55/115 (47%), Gaps = 4/115 (3%)
Query: 590 NRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVK 649
N +P+ +T+ S K+ +K+ KI + + GL DG V Y K
Sbjct: 391 NAAPKNKEPKTTIDSASKNKELKTSKKIPPNNFPSNVKSLLSTGLLDGVAVKYVSWSREK 450
Query: 650 FLVGYKKGFGILCTC--CNSEV-SPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 701
L G KG G LC+C C ++V + +FE HA ++ P HIY NG +++ +
Sbjct: 451 TLRGTIKGTGYLCSCKVCGNKVLNAYEFERHANCKTKH-PNNHIYFENGKTIYAV 504
>gi|145357978|ref|NP_196870.3| uncharacterized protein [Arabidopsis thaliana]
gi|9758032|dbj|BAB08693.1| unnamed protein product [Arabidopsis thaliana]
gi|110737280|dbj|BAF00587.1| hypothetical protein [Arabidopsis thaliana]
gi|332004541|gb|AED91924.1| uncharacterized protein [Arabidopsis thaliana]
Length = 536
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 5/126 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +GI +G++V Y S+ + L+G++KG+G
Sbjct: 382 PKIKDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSRERN-----LKGMIKGTGY 436
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +CK N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 437 LCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 496
Query: 507 RMVLGS 512
+ V GS
Sbjct: 497 QNVTGS 502
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 5/71 (7%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC----CNSEVSPSQFEAHAGWASRRKPFQ 688
G+ DG V Y+ + L G KG G LC C N ++ +FE HA ++ P
Sbjct: 409 GIFDGVTVKYYSWSRERNLKGMIKGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKH-PNN 467
Query: 689 HIYTSNGVSLH 699
HIY NG +++
Sbjct: 468 HIYFENGKTIY 478
>gi|2244849|emb|CAB10271.1| hypothetical protein [Arabidopsis thaliana]
gi|7268238|emb|CAB78534.1| hypothetical protein [Arabidopsis thaliana]
Length = 1040
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 73/131 (55%), Gaps = 9/131 (6%)
Query: 627 LVFEEGGLEDGAEVGYFVRGEVKFLV-GYKKGFGILCTCCNSEVSPSQFEAHAGWASRRK 685
+ + G L+ +V Y + + ++ G+ GI C CC+ ++ S+FE HAG + R+
Sbjct: 550 WLIDSGTLQLSEKVMYMNQRRTRAMLEGWITRDGIHCGCCSKILAVSKFEIHAG-SKLRQ 608
Query: 686 PFQHIYTSNGVS--LHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC 743
PFQ+I+ ++G + + S+ + + P NDD CGIC DGGDL+CCD CP FH C
Sbjct: 609 PFQNIFLNSGGAGNIGFCSVDVIADDP-----NDDACGICGDGGDLVCCDGCPSTFHQRC 663
Query: 744 VSLPGIPSGTW 754
+ + G W
Sbjct: 664 LDIRGHLMPDW 674
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 81/178 (45%), Gaps = 17/178 (9%)
Query: 313 HSSTLCNGESNVAKSVAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSK--DG 370
HS++ +G++ A++ N+ G + + + S L + E + GS S DG
Sbjct: 452 HSASDSDGKATFARNFLAIKNEVGNDDSRDSSQGTTSKSESPLHHQTEKSTGSSSHRVDG 511
Query: 371 GKRSD---VTEVANDGVGGPVKQETVMKPRKVMRKFYSKLKNFLESGILE-GMSVMYIRG 426
GK S T + V G + P R + L ++SG L+ VMY+
Sbjct: 512 GKSSKHGRSTLLVRRSVRGDNSESDGFVPSSEKRTVLAWL---IDSGTLQLSEKVMYMNQ 568
Query: 427 SKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENG 484
+ + + L G + GI C C C ++++ + FE+HAGS ++P + I+L +G
Sbjct: 569 RRTR----AMLEGWITRDGIHCGC--C--SKILAVSKFEIHAGSKLRQPFQNIFLNSG 618
>gi|297807391|ref|XP_002871579.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
gi|297317416|gb|EFH47838.1| hypothetical protein ARALYDRAFT_488185 [Arabidopsis lyrata subsp.
lyrata]
Length = 521
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 6/160 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +G+ +G++V Y S+ V L+G++KG+G
Sbjct: 366 PKSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSR----EVRNLKGIIKGTGY 421
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +C N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 422 LCGCGNCNFNRVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 481
Query: 507 RMVLGSSSMKKANFCLNCRVSFSNAGVEELMLLCKSCVEL 546
+ V G S + NF + S+ A +E + K V L
Sbjct: 482 QNVTG-SDINHKNFN-TWKASYHVASLELQRIYGKDDVTL 519
Score = 46.6 bits (109), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 70/162 (43%), Gaps = 17/162 (10%)
Query: 547 KESQAGSA----EIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSS 602
KES GSA E SH + PP +E EE+ S E R+P S + +
Sbjct: 310 KESTIGSAVRVINSYENFSHDPAIAKDPPHMEAEENM--SFEF-----RNPPYASPRVDT 362
Query: 603 HSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRG-EVKFLVGYKKGFGIL 661
KS K+ K + + G+ DG V Y+ EV+ L G KG G L
Sbjct: 363 LLVPKSKDTKTAKKGSTNTFPSNVKSLLSTGMFDGVTVKYYSWSREVRNLKGIIKGTGYL 422
Query: 662 CTC--CNSE--VSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 699
C C CN ++ +FE HA ++ P HIY NG +++
Sbjct: 423 CGCGNCNFNRVLNAYEFEQHANCKTKH-PNNHIYFENGKTIY 463
>gi|186522614|ref|NP_001119218.1| uncharacterized protein [Arabidopsis thaliana]
gi|332004542|gb|AED91925.1| uncharacterized protein [Arabidopsis thaliana]
Length = 537
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 4/126 (3%)
Query: 387 PVKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGI 446
P ++T + F S +K+ L +GI +G++V Y S+ + L+G++KG+G
Sbjct: 382 PKIKDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSREQ----RNLKGMIKGTGY 437
Query: 447 SCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAV 506
C C +CK N+V+ FE HA K P +IY ENGKT+ ++ K++P E L A+
Sbjct: 438 LCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEKLFDAI 497
Query: 507 RMVLGS 512
+ V GS
Sbjct: 498 QNVTGS 503
>gi|403369443|gb|EJY84565.1| Putative PHD zinc finger protein [Oxytricha trifallax]
Length = 1373
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 40/67 (59%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
++N D C IC DGGDLLCCD+CPR+FH CV L IP W+C+ C+ K K +
Sbjct: 118 RQNRDHCNICKDGGDLLCCDNCPRSFHTKCVGLKSIPEDDWYCKRCVPIMDKRKHTDEEK 177
Query: 774 NARAAGR 780
RA R
Sbjct: 178 RKRAQER 184
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
K+ D+ C C GG +LCC+SC H+ C L P WHC C+
Sbjct: 1299 KQWDEACYKCNKGGKVLCCESCNHVCHLSCSGLLKKPLDEWHCEECL 1345
>gi|357115944|ref|XP_003559745.1| PREDICTED: uncharacterized protein LOC100837323 [Brachypodium
distachyon]
Length = 178
Score = 83.6 bits (205), Expect = 6e-13, Method: Composition-based stats.
Identities = 50/151 (33%), Positives = 78/151 (51%), Gaps = 3/151 (1%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 983
L A + H+ F ++E ++ DL +V+ R R + G Y L +++ G
Sbjct: 9 LCMAFDVLHECFVTLVEPHTQSDLSQDIVFNRESWLRRLYFRGFYIIGLEKGGELITVGT 68
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
R++G+++AELPLV T + QG L +E LLG V+ LVLP+ E WT
Sbjct: 69 LRVYGKKVAELPLVGTRFTHRRQGMCHLLMNQLEMLLGEWGVERLVLPAVPELLQTWTGS 128
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
FGF +MT+ ++ + +M FQGT+M K
Sbjct: 129 FGFQVMTQSQKLDIAQ-HTIMCFQGTTMCHK 158
>gi|55819802|gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]
Length = 425
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|9757903|dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana]
Length = 415
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 265 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 318
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 319 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 378
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 379 TVFGSPINQKA 389
>gi|30697285|ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana]
gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009855|gb|AED97238.1| uncharacterized protein [Arabidopsis thaliana]
gi|332009856|gb|AED97239.1| uncharacterized protein [Arabidopsis thaliana]
Length = 425
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENGKT+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|356568973|ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
Length = 582
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 70/120 (58%), Gaps = 6/120 (5%)
Query: 401 RKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVT 460
F S +K+ L +GI +G+ V Y+ S+ K L+G++KG+G C CD+C ++ +
Sbjct: 442 NNFPSNVKSLLSTGIFDGVQVKYVSWSREKS-----LKGIIKGTGYLCSCDNCNQSKALN 496
Query: 461 PAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
FE HAG+ K P +IY ENGKT+ ++ K++ + L A++ V G S++ + NF
Sbjct: 497 AYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVTG-STINQKNF 555
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 39/73 (53%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG +V Y K L G KG G LC+C CN ++ +FE HAG A + P
Sbjct: 455 GIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAYEFERHAG-AKTKHPNN 513
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 514 HIYFENGKTIYAV 526
>gi|224106527|ref|XP_002314197.1| predicted protein [Populus trichocarpa]
gi|222850605|gb|EEE88152.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 111/227 (48%), Gaps = 29/227 (12%)
Query: 328 VAVDGNDEGKTVNVVVERPLRRFTRSLLQQKVELAKGSLSKDGGKRSDVTEVANDGVGGP 387
++ G D+ + + V RP+ + Q V+ + K+ G ++ VA++
Sbjct: 244 LSFGGFDDAHDI-IPVCRPINNYDHPYDQSSVKTREAVDQKELGAKA----VASNTRATK 298
Query: 388 VKQETVMKPR---KVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRG 439
K E V K R K RK F S +++ + +G+L+G+ V YI S+ + LRG
Sbjct: 299 SKSEPVSKNRQELKTTRKEAPNSFPSNVRSLISTGMLDGVPVKYISLSRKE------LRG 352
Query: 440 VVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPL 499
++KGSG C C C ++V+ FE HAG K P +I ENGKT+ I+ +++P
Sbjct: 353 IIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHPNNHICFENGKTIYQIVQELRNTPE 412
Query: 500 ETLEKAVRMVLGSSSMKKANFCLNCRVSFSNA--------GVEELML 538
L A++ V G+ +K+ F + + SF A G EELML
Sbjct: 413 SMLFDAIQTVFGAPINQKS-FRI-WKESFKAATRELQRIYGKEELML 457
>gi|297746129|emb|CBI16185.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGSG C
Sbjct: 267 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREE------LHGIIKGSGYLC 320
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L A++
Sbjct: 321 GCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQT 380
Query: 509 VLGSSSMKKA 518
V GS +K+
Sbjct: 381 VTGSPINQKS 390
>gi|225435060|ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera]
Length = 486
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGSG C
Sbjct: 337 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREE------LHGIIKGSGYLC 390
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L A++
Sbjct: 391 GCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQT 450
Query: 509 VLGSSSMKKA 518
V GS +K+
Sbjct: 451 VTGSPINQKS 460
>gi|302836808|ref|XP_002949964.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
gi|300264873|gb|EFJ49067.1| hypothetical protein VOLCADRAFT_117404 [Volvox carteri f.
nagariensis]
Length = 2728
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 113/258 (43%), Gaps = 35/258 (13%)
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQC-EREYHVGCLKDHGMEDLQ-ELPKGK--WLCCADC 863
C C G D GRR ++LC C H GC +D E L E+ G + C +C
Sbjct: 16 CTHCGGGDVEPE--GRRVLVLCSACFAAGTHTGCHEDVTGEPLSSEITHGDGLYFCGKEC 73
Query: 864 KRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTR 923
+R AL+ R I+ + E+ + ++ K+ D + R
Sbjct: 74 QRSYEALEAATGRRSR---------IRDEPEQ----------YTFELVHYKQDDRT--VR 112
Query: 924 ALLSKAVSIFHDRFDPIIESASK--LDLI------PAMVYGRSHRGQDYHGMYCAILTVN 975
+ + A+ +F F P+I + L+++ P G ++ AIL +
Sbjct: 113 SAVETAMRMFRTSFAPLIMENGRDLLEMVCTAYETPDEEVEEEGGGHNFSAFRLAILRMG 172
Query: 976 QVVVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASE 1035
+++A R+FG + AE+P V+T + G+ + L +E LL V LV+PS +E
Sbjct: 173 GTIITAATLRVFGNKFAEMPFVSTREGHRRSGHCKRLMKAVEDLLLAGGVHCLVIPSINE 232
Query: 1036 AQAIWTNKFGFSMMTEEE 1053
+WTNKFGF+ + +E
Sbjct: 233 LLPMWTNKFGFAKIATDE 250
>gi|359481508|ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
Length = 599
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 71/130 (54%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
K+E + + F S +++ L +G+L+G+ V YI S+ + LRG++KGSG C
Sbjct: 448 KEEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSREE------LRGIIKGSGYLC 501
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C ++V+ FE HAG K P +IY ENGKT+ I+ K +P +L ++
Sbjct: 502 GCQSCNFSKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQT 561
Query: 509 VLGSSSMKKA 518
+ GS +K+
Sbjct: 562 ITGSPINQKS 571
>gi|110738016|dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]
Length = 425
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 388 VKQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGIS 447
K E ++ F S +++ + +G+L+G+ V Y+ S+ + LRGV+KGSG
Sbjct: 275 TKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREE------LRGVIKGSGYL 328
Query: 448 CFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVR 507
C C C +V+ FE HAG K P +IY ENG+T+ I+ +++P L ++
Sbjct: 329 CGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGRTIYQIVQELRNTPESILFDVIQ 388
Query: 508 MVLGSSSMKKA 518
V GS +KA
Sbjct: 389 TVFGSPINQKA 399
>gi|255575126|ref|XP_002528468.1| DNA binding protein, putative [Ricinus communis]
gi|223532144|gb|EEF33951.1| DNA binding protein, putative [Ricinus communis]
Length = 492
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 78/150 (52%), Gaps = 19/150 (12%)
Query: 373 RSDVTEVANDGVGGPVKQETVMKPRK--VMRKFYSKLKNFLESGILEGMSVMYIRGSKVK 430
++DVT DG +P K F S +K+ L +G+L+G+ V YI S+ K
Sbjct: 333 KTDVTPKNKDG-----------RPSKKVAPNNFPSNVKSLLSTGMLDGVPVKYISWSREK 381
Query: 431 GPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDI 490
L+G++KG+G C C +C + + FE HA K P +IY ENGKT+ +
Sbjct: 382 N-----LKGLIKGAGYLCGCQECNFTKALNAYEFERHANCKTKHPNNHIYFENGKTIYGV 436
Query: 491 MNVCKDSPLETLEKAVRMVLGSSSMKKANF 520
+ K++P E L +A++ V G S + + NF
Sbjct: 437 VQELKNTPQEMLFEAIQTVTG-SPINQKNF 465
>gi|296086276|emb|CBI31717.3| unnamed protein product [Vitis vinifera]
Length = 612
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 94/185 (50%), Gaps = 13/185 (7%)
Query: 356 QQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK------FYSKLKN 409
QQ+V+ ++ LS+ +S+ A+ K +TV K +++ K F +K+
Sbjct: 419 QQEVQTSEQVLSQKYLAQSNADPAASCTPKATSKTDTVSKNKQINTKKDPSSHFPLNVKS 478
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
L +G+ +G+ V Y+ ++ K +RGV+KGSG C C DC + + FE HA
Sbjct: 479 LLSTGMFDGVPVKYVSWTREKS-----VRGVIKGSGYLCSCKDCNSSNCLNAYEFERHAN 533
Query: 470 SSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFS 529
K P +IY ENGKT+ ++ K++P + L + ++ V G + + NF + S+
Sbjct: 534 CKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTG-CPINQKNF-QTWKASYQ 591
Query: 530 NAGVE 534
A VE
Sbjct: 592 AATVE 596
Score = 40.4 bits (93), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG V Y K + G KG G LC+C CNS ++ +FE HA ++ P
Sbjct: 483 GMFDGVPVKYVSWTREKSVRGVIKGSGYLCSCKDCNSSNCLNAYEFERHANCKTKH-PNN 541
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 542 HIYFENGKTIYAV 554
>gi|413935128|gb|AFW69679.1| hypothetical protein ZEAMMB73_570325 [Zea mays]
Length = 74
Score = 79.7 bits (195), Expect = 9e-12, Method: Composition-based stats.
Identities = 37/70 (52%), Positives = 44/70 (62%)
Query: 743 CVSLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPD 802
CV L GTW CRYC N Q+E + YN NA AAGR+EGVD Q+ +R IRI T +
Sbjct: 5 CVGLSSATKGTWCCRYCENRQQRESCLAYNNNAIAAGRVEGVDALEQIFTRSIRIATTLE 64
Query: 803 TELGGCVLCR 812
T GGC LC+
Sbjct: 65 TGFGGCALCK 74
>gi|147783309|emb|CAN64128.1| hypothetical protein VITISV_022422 [Vitis vinifera]
Length = 647
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 69/134 (51%), Gaps = 4/134 (2%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSG----LRGVVKGS 444
K E M ++ F S ++ + +G+L+G+ V Y+ S+ + L G++KGS
Sbjct: 448 KSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSRECHGYICAHKQELHGIIKGS 507
Query: 445 GISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEK 504
G C C C N+V+ FE HAG K P +IY ENGKT+ I+ + +P L
Sbjct: 508 GYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFB 567
Query: 505 AVRMVLGSSSMKKA 518
A++ V GS +K+
Sbjct: 568 AIQTVTGSPINQKS 581
>gi|359486643|ref|XP_002279348.2| PREDICTED: uncharacterized protein LOC100249637 [Vitis vinifera]
Length = 587
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 94/185 (50%), Gaps = 13/185 (7%)
Query: 356 QQKVELAKGSLSKDGGKRSDVTEVANDGVGGPVKQETVMKPRKVMRK------FYSKLKN 409
QQ+V+ ++ LS+ +S+ A+ K +TV K +++ K F +K+
Sbjct: 394 QQEVQTSEQVLSQKYLAQSNADPAASCTPKATSKTDTVSKNKQINTKKDPSSHFPLNVKS 453
Query: 410 FLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAG 469
L +G+ +G+ V Y+ ++ K +RGV+KGSG C C DC + + FE HA
Sbjct: 454 LLSTGMFDGVPVKYVSWTREKS-----VRGVIKGSGYLCSCKDCNSSNCLNAYEFERHAN 508
Query: 470 SSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKANFCLNCRVSFS 529
K P +IY ENGKT+ ++ K++P + L + ++ V G + + NF + S+
Sbjct: 509 CKTKHPNNHIYFENGKTIYAVVQELKNTPQDKLFEVIQNVTG-CPINQKNF-QTWKASYQ 566
Query: 530 NAGVE 534
A VE
Sbjct: 567 AATVE 571
Score = 40.0 bits (92), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 633 GLEDGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQ 688
G+ DG V Y K + G KG G LC+C CNS ++ +FE HA ++ P
Sbjct: 458 GMFDGVPVKYVSWTREKSVRGVIKGSGYLCSCKDCNSSNCLNAYEFERHANCKTKH-PNN 516
Query: 689 HIYTSNGVSLHEL 701
HIY NG +++ +
Sbjct: 517 HIYFENGKTIYAV 529
>gi|196010575|ref|XP_002115152.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
gi|190582535|gb|EDV22608.1| hypothetical protein TRIADDRAFT_59019 [Trichoplax adhaerens]
Length = 709
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 77/180 (42%), Gaps = 38/180 (21%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVEY 771
N+D C C +GGDLLCC+ CP AFH+ C P GIP+G W C+ C EY
Sbjct: 43 NNDTCDSCNEGGDLLCCERCPSAFHLHCCDPPLDEDGIPAGEWICKKCK--------AEY 94
Query: 772 NANARA-AGRIEGVDPFAQMVSRCIRIVQTPD-----TELGGCVLCRGRDFCK------- 818
N R A + E PFA +V ++ Q P E+ + G K
Sbjct: 95 QDNTRPIAKQSELRSPFAVLVKDNVK--QNPSEFRLPKEMQYHIPLPGSKKRKKSNGHGS 152
Query: 819 -SRFGRR------TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ 871
++ GRR +I CD C +H+ C+ L P G W+C + A+Q
Sbjct: 153 ATKGGRRGNSFLKPLIHCDYCPLSFHLDCMD----PPLTTTPSGLWMCPNHAEHFLPAMQ 208
>gi|363737037|ref|XP_427220.3| PREDICTED: autoimmune regulator [Gallus gallus]
Length = 553
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 73/165 (44%), Gaps = 25/165 (15%)
Query: 661 LCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLC 720
C +E SP +H+G + T++G H + + + P ++N+D C
Sbjct: 193 FCVPAAAEGSPKHLVSHSG--------EMCVTTHG---HLPAPPVHSQEPALYQDNEDEC 241
Query: 721 GICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAA 778
+C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+ + + + A A
Sbjct: 242 AVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKLGRLREADTAAEQLPA 301
Query: 779 ---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 811
G P RC + P D + GG LC
Sbjct: 302 VPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 346
>gi|449510083|ref|XP_002188592.2| PREDICTED: autoimmune regulator-like [Taeniopygia guttata]
Length = 434
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 46/70 (65%), Gaps = 3/70 (4%)
Query: 706 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCM-NT 762
SL+ P KEN+D C C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+ N
Sbjct: 236 SLKPPAQPKENEDECAACGDGGELICCDGCPRAFHLACLVPPLPHVPSGTWRCGSCVENV 295
Query: 763 FQKEKFVEYN 772
+ + +E +
Sbjct: 296 TEPGQLLEAD 305
>gi|255587619|ref|XP_002534332.1| DNA binding protein, putative [Ricinus communis]
gi|223525478|gb|EEF28050.1| DNA binding protein, putative [Ricinus communis]
Length = 417
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 74/144 (51%), Gaps = 7/144 (4%)
Query: 376 VTEVANDGVGGPVKQETVMKPRKV-MRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGV 434
T G+ K+E + +KV F S +++ L +G+L+G+ V YI S+ +
Sbjct: 253 ATHTTASGIENAKKKEDLKTCKKVPSNNFPSNVRSLLSTGMLDGVPVKYIAWSREE---- 308
Query: 435 SGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVC 494
LRGV+KGSG C C C ++V+ FE HA K P +IY ENGKT+ I+
Sbjct: 309 --LRGVIKGSGYLCGCQTCNFSKVINAYEFERHADCKTKHPNNHIYFENGKTVYGIVQEL 366
Query: 495 KDSPLETLEKAVRMVLGSSSMKKA 518
+ P L + ++ + GS +K+
Sbjct: 367 RSIPQNMLFEVIQTITGSPINQKS 390
>gi|224098320|ref|XP_002311151.1| predicted protein [Populus trichocarpa]
gi|222850971|gb|EEE88518.1| predicted protein [Populus trichocarpa]
Length = 590
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 11/139 (7%)
Query: 387 PVKQETVMKPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVV 441
P+ + K K +K F S +K+ L +G+L+G+ V Y+ S+ K L G++
Sbjct: 431 PIDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREKT-----LEGII 485
Query: 442 KGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLET 501
KG+G C C +C N+ + FE HA K P +I+ ENGKT+ ++ K++P
Sbjct: 486 KGTGYLCGCKECGSNKALNAYEFERHANCKTKHPNNHIFFENGKTIYAVVQELKNTPQGV 545
Query: 502 LEKAVRMVLGSSSMKKANF 520
L A++ V G S + + NF
Sbjct: 546 LFNAIQTVTG-SHINQKNF 563
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 51/116 (43%), Gaps = 5/116 (4%)
Query: 590 NRSPEPNSAQTSSHSKMKSSSVKSHGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVK 649
N +P+ +T S K+ +K+ K+ + + GL DG V Y K
Sbjct: 420 NAAPKNKEPKTPIDSASKNKELKTSKKVPANNFPSNVKSLLSTGLLDGVPVKYVSWSREK 479
Query: 650 FLVGYKKGFGILCTC----CNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHEL 701
L G KG G LC C N ++ +FE HA ++ P HI+ NG +++ +
Sbjct: 480 TLEGIIKGTGYLCGCKECGSNKALNAYEFERHANCKTKH-PNNHIFFENGKTIYAV 534
>gi|449528089|ref|XP_004171039.1| PREDICTED: uncharacterized LOC101211282 [Cucumis sativus]
Length = 461
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 70/129 (54%), Gaps = 11/129 (8%)
Query: 395 KPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+P MRK F S +++ + +G+L+G+ V Y+ ++ + LRG++KGSG C
Sbjct: 313 RPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE------LRGIIKGSGYLCG 366
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C C ++++ FE HAG K P +IY ENGKT+ I+ + +P L ++ +
Sbjct: 367 CQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTI 426
Query: 510 LGSSSMKKA 518
G+ +K+
Sbjct: 427 FGAPINQKS 435
>gi|449460965|ref|XP_004148214.1| PREDICTED: uncharacterized protein LOC101211282 [Cucumis sativus]
Length = 467
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 70/129 (54%), Gaps = 11/129 (8%)
Query: 395 KPRKVMRK-----FYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCF 449
+P MRK F S +++ + +G+L+G+ V Y+ ++ + LRG++KGSG C
Sbjct: 319 RPEYKMRKEAPNSFPSNVRSLISTGMLDGVPVKYVSVTREE------LRGIIKGSGYLCG 372
Query: 450 CDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMV 509
C C ++++ FE HAG K P +IY ENGKT+ I+ + +P L ++ +
Sbjct: 373 CQSCNFSKMLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDTIQTI 432
Query: 510 LGSSSMKKA 518
G+ +K+
Sbjct: 433 FGAPINQKS 441
>gi|326925645|ref|XP_003209021.1| PREDICTED: autoimmune regulator-like [Meleagris gallopavo]
Length = 444
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 699 HELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHC 756
H ++ + + P ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C
Sbjct: 247 HLPAVPVHSQDPAPYQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQC 306
Query: 757 RYCMNTFQKEKFVEYNANARAA---GRIEGVDPFAQMVSRCIR 796
C+ + + + A A G P S C+R
Sbjct: 307 SSCVAELGRLREADTAAEQLPAVPDKEEHGAQPGGSHWSTCVR 349
>gi|222628902|gb|EEE61034.1| hypothetical protein OsJ_14872 [Oryza sativa Japonica Group]
Length = 2486
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 84/153 (54%), Gaps = 3/153 (1%)
Query: 929 AVSIFHDRFDPIIESASKLDLIPAMVYG-RSHRGQ-DYHGMYCAILTVNQVVVSAGIFRI 986
A+SI + F PII++ + +D+IP ++Y RS DY G Y +L + ++S R+
Sbjct: 1125 ALSIMEECFLPIIDARTGIDIIPPILYNWRSDFVHLDYKGFYTVVLENDDRIISVASIRL 1184
Query: 987 FGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGF 1046
G +AE+PL+AT + + QG + L IE++L L V+ L+L + WT FGF
Sbjct: 1185 HGTVVAEMPLIATCLENRQQGMCRRLMDYIEQMLKSLKVEMLLLSAIPSLVDTWTMAFGF 1244
Query: 1047 SMMTEEEQNKYRNDYPLMIFQGTSMLQKPVPKC 1079
+ + ++ K + L+ GT +L++ + +C
Sbjct: 1245 VPIDDLDR-KNLSRLRLVSVPGTVLLKRNLYEC 1276
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 55/98 (56%), Gaps = 13/98 (13%)
Query: 659 GILCTCCNSEVSPSQFEAHAGWASRRKPFQHIYTSNGVS-----LHELSI--KLSLER-- 709
GI C CC+ + S F+ HAG + P +++ +G S L SI K ER
Sbjct: 975 GIRCRCCDMVFTMSMFKYHAG-LRQEIPSLNLFLGSGKSYTLCQLQAWSIEHKARKERAK 1033
Query: 710 ---PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV 744
P + ENDD CG+C DGG+L+CCD+CP ++H DC+
Sbjct: 1034 CTMPLQADENDDTCGLCGDGGELICCDNCPASYHQDCL 1071
>gi|113678140|ref|NP_001038323.1| chromodomain helicase DNA binding protein 4 [Danio rerio]
Length = 1929
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 411
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
E + A G E D + C + GG +LC
Sbjct: 412 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 445
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 446 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 488
>gi|189521245|ref|XP_696641.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Danio
rerio]
Length = 2063
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 75/166 (45%), Gaps = 27/166 (16%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 385 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC-----EKEGIQWE 439
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A E + V + + D + C +C+ G C CD
Sbjct: 440 AKEEDFEEFEEECDDVRDVESGLGGEEEEDDHMEFCRVCKDGGELLC-----------CD 488
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 489 SCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 529
>gi|158517931|ref|NP_001103484.1| autoimmune regulator [Danio rerio]
gi|158024564|gb|ABW08119.1| autoimmune regulator [Danio rerio]
Length = 511
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 62/146 (42%), Gaps = 20/146 (13%)
Query: 711 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM-NTFQKEK 767
F+ + NDD C +C DGG+L+CCD CPRAFH+ C+ P IP GTW C+ C N +
Sbjct: 286 FAVEHNDDECAVCKDGGELICCDGCPRAFHLSCLVPPLTSIPRGTWRCQLCQSNRLKDRT 345
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRI-------------VQTPDTELGGCVLCRGR 814
+ A VD + +Q+ D E G + G
Sbjct: 346 YTHVQPPATETSSGSAVDFSFFSSLSSTSLSTVSASSSAQSIGLQSSDGERVGIRMACGI 405
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGC 840
+ R +I C QC + YH C
Sbjct: 406 CY----LTRGELITCPQCLQAYHALC 427
>gi|281340666|gb|EFB16250.1| hypothetical protein PANDA_015510 [Ailuropoda melanoleuca]
Length = 449
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 100/229 (43%), Gaps = 35/229 (15%)
Query: 546 LKESQAGSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSK 605
L+E +A P + SS P + + P+ E + R P N QT S S
Sbjct: 134 LEEPRATPPAALSPRATSS------PGSQAKAKPSKKPESSAEPQRLPLGNGIQTMSTSV 187
Query: 606 MKSSSVKS---HGKITRKDLRMHKLVFEEGGLEDGAEVG--YFVRGEVKFLVGYK----- 655
++ +V S G + + + VFE GG + +VG ++ + + +G K
Sbjct: 188 QRAVTVSSGDVPGACGAVEGILIQQVFESGGSKKCIQVGGEFYTPSKFEDPIGGKNKTRS 247
Query: 656 KGFGILCTCCNSEVS-PSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK 714
G L ++ S P E AG R + + + LH+
Sbjct: 248 SGLKTLVRAKGTQASAPGGGELRAGQQGRLQAPPALPSEP--QLHQ-------------- 291
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 292 KNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQ 340
>gi|403271756|ref|XP_003927774.1| PREDICTED: autoimmune regulator [Saimiri boliviensis boliviensis]
Length = 570
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNT 762
P ++N+D C +C DGG+LLCCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 287 PQPHQKNEDECAVCRDGGELLCCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQA 341
>gi|147865915|emb|CAN78845.1| hypothetical protein VITISV_013035 [Vitis vinifera]
Length = 243
Score = 75.1 bits (183), Expect = 2e-10, Method: Composition-based stats.
Identities = 47/128 (36%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 963 DYHGMYCAILTVNQVVVSAGI------------FRIFGQELAELPLVATSNDCQGQGYFQ 1010
D+ G Y L + V A RI G ++AE+PLVAT+ + QG Q
Sbjct: 62 DFRGFYIMALQKDDEFVCAATAFMNCVYEYLHGLRIHGHKVAEMPLVATAFKYRRQGMCQ 121
Query: 1011 SLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFGFSMMTEEEQNKYRNDYPLMIFQGTS 1070
L +EK+L L+V+ LVLP+ SE +W + FGFS M+ E+ + +P + FQGT+
Sbjct: 122 VLVHELEKMLSQLHVERLVLPAISERSELWQSLFGFSEMSSAERLELLR-FPFLGFQGTT 180
Query: 1071 MLQKPVPK 1078
M QK + K
Sbjct: 181 MFQKILSK 188
>gi|158263559|gb|ABW24495.1| autoimmune regulator isoform 1 [Gallus gallus]
Length = 412
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 56/120 (46%), Gaps = 14/120 (11%)
Query: 706 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTF 763
S E ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 214 SQEPALYQQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL 273
Query: 764 QKEKFVEYNANARAA---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 811
+ + + A A G P RC + P D + GG LC
Sbjct: 274 GRLREADTAAEQLPAVPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 333
>gi|2696617|dbj|BAA23989.1| AIRE-2 [Homo sapiens]
gi|2696620|dbj|BAA23991.1| AIRE-2 [Homo sapiens]
gi|119629848|gb|EAX09443.1| hCG401300, isoform CRA_c [Homo sapiens]
gi|187950581|gb|AAI37271.1| AIRE protein [Homo sapiens]
gi|187953509|gb|AAI37269.1| AIRE protein [Homo sapiens]
Length = 348
Score = 74.7 bits (182), Expect = 3e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 150
Score = 40.8 bits (94), Expect = 3.8, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 796 RIVQTPDTELGGCVLC-RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
R++ +G LC + D C +I CD C R +H+ CL L+E+P
Sbjct: 80 RVLHPSPVCMGVSCLCQKNEDECAVCRDGGELICCDGCPRAFHLACLS----PPLREIPS 135
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPE 883
G W C+ C + +Q++ R EE P+
Sbjct: 136 GTWR-CSSC--LQATVQEVQPRAEEPRPQ 161
>gi|301781030|ref|XP_002925935.1| PREDICTED: autoimmune regulator-like [Ailuropoda melanoleuca]
Length = 559
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 330 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHEIPSGTWRCSSCLQ 379
>gi|256052547|ref|XP_002569826.1| chromodomain helicase DNA binding protein [Schistosoma mansoni]
gi|353229870|emb|CCD76041.1| putative chromodomain helicase DNA binding protein [Schistosoma
mansoni]
Length = 1966
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 68/153 (44%), Gaps = 26/153 (16%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY- 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P GTW C +C +KE
Sbjct: 327 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGTWSCPHC----EKEGITSVS 382
Query: 772 --NANARAAGRIEGV--DPFAQMVSRCIRIVQTPDTELGGCVLCR-GRDFCKSRFGRRTV 826
N +G V D ++VS D C CR G D +
Sbjct: 383 KGNKECEDSGSEPAVTSDKDGKIVSAVHPTSPEKDEHQEFCTECRDGGD----------L 432
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
I CD C YH+ CL + L +P+G WLC
Sbjct: 433 ICCDNCPASYHIACL----IPPLANIPEGVWLC 461
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ + C C DGGDL+CCD+CP ++HI C+ P IP G W C C
Sbjct: 418 EHQEFCTECRDGGDLICCDNCPASYHIACLIPPLANIPEGVWLCPRC 464
>gi|119629849|gb|EAX09444.1| hCG401300, isoform CRA_d [Homo sapiens]
Length = 514
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|158263561|gb|ABW24496.1| autoimmune regulator isoform 2 [Gallus gallus]
Length = 367
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 56/120 (46%), Gaps = 14/120 (11%)
Query: 706 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTF 763
S E ++N+D C +C DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 203 SQEPALYQQDNEDECAVCGDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCVAKL 262
Query: 764 QKEKFVEYNANARAA---GRIEGVDP---FAQMVSRCIRIVQTP------DTELGGCVLC 811
+ + + A A G P RC + P D + GG LC
Sbjct: 263 GRLREADTAAEQLPAVPDKEEHGAQPGGGHGSTCGRCFSSISAPQRCPTRDGDPGGLWLC 322
>gi|335300800|ref|XP_003359037.1| PREDICTED: autoimmune regulator-like [Sus scrofa]
Length = 578
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 41/66 (62%), Gaps = 2/66 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEK 767
P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ ++
Sbjct: 301 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLRDIPSGTWRCSSCLQGRSQQD 360
Query: 768 FVEYNA 773
A
Sbjct: 361 LPRAEA 366
>gi|3392940|emb|CAA08759.1| AIRE [Homo sapiens]
Length = 515
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|194226307|ref|XP_001490547.2| PREDICTED: autoimmune regulator-like [Equus caballus]
Length = 479
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 36/49 (73%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 298 QNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCTSCLQ 346
>gi|2696621|dbj|BAA23992.1| AIRE-3 [Homo sapiens]
gi|2696623|dbj|BAA23993.1| AIRE-3 [Homo sapiens]
gi|119629847|gb|EAX09442.1| hCG401300, isoform CRA_b [Homo sapiens]
Length = 254
Score = 74.3 bits (181), Expect = 4e-10, Method: Composition-based stats.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 96 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 150
Score = 40.4 bits (93), Expect = 5.1, Method: Composition-based stats.
Identities = 26/89 (29%), Positives = 41/89 (46%), Gaps = 8/89 (8%)
Query: 796 RIVQTPDTELGGCVLC-RGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
R++ +G LC + D C +I CD C R +H+ CL L+E+P
Sbjct: 80 RVLHPSPVCMGVSCLCQKNEDECAVCRDGGELICCDGCPRAFHLACLS----PPLREIPS 135
Query: 855 GKWLCCADCKRINLALQKLVDRGEEKLPE 883
G W C+ C + +Q++ R EE P+
Sbjct: 136 GTWR-CSSC--LQATVQEVQPRAEEPRPQ 161
>gi|410969921|ref|XP_003991440.1| PREDICTED: autoimmune regulator [Felis catus]
Length = 626
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/53 (54%), Positives = 38/53 (71%), Gaps = 2/53 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 403 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCYSCL 455
>gi|345307058|ref|XP_001513786.2| PREDICTED: autoimmune regulator-like [Ornithorhynchus anatinus]
Length = 552
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 55/97 (56%), Gaps = 11/97 (11%)
Query: 670 SPSQFEAHAGWASRRKPFQHIYT-SNGVSLHELSIKLSLERPFSS---KENDDLCGICMD 725
+P +FE G + R P + ++ V H S+ FSS ++NDD C +C D
Sbjct: 236 TPGKFEDAGGKSKNRSPKPPVRPKASQVPQHVRSLSR-----FSSLWGQKNDDECAVCRD 290
Query: 726 GGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
GG+L+CCD CPRAFH+ C+ P IPSGTW C C+
Sbjct: 291 GGELICCDGCPRAFHLTCLVPPLTEIPSGTWRCVRCV 327
>gi|355747326|gb|EHH51823.1| hypothetical protein EGM_12122, partial [Macaca fascicularis]
Length = 447
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 250 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 304
>gi|357120035|ref|XP_003561736.1| PREDICTED: uncharacterized protein LOC100841702 [Brachypodium
distachyon]
Length = 292
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 17/151 (11%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSH--RGQDYHGMYCAILTVNQVVVSAGI 983
L A+ + H+ F IIE ++ DL +V+ R R ++ G Y +
Sbjct: 8 LCIALDVLHEWFVTIIEPRTRRDLSEDIVFTRQSELRQLNFRGFYTIL------------ 55
Query: 984 FRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNK 1043
+ G++ AELPL+ T + QG + L +EKLL L V+ L+LP+ + WT
Sbjct: 56 --VCGKKFAELPLIGTRVQYRRQGMCRLLMNEVEKLLSGLGVERLLLPTVPQLLETWTGS 113
Query: 1044 FGFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
FGF+ M+ ++ +Y + ++ FQGT+M QK
Sbjct: 114 FGFTEMSYSDRFQYAANI-ILSFQGTTMCQK 143
>gi|397507134|ref|XP_003824063.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator [Pan paniscus]
Length = 630
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 378 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 432
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 393 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPQPQ 443
>gi|297287420|ref|XP_001103602.2| PREDICTED: autoimmune regulator [Macaca mulatta]
Length = 526
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 347
>gi|302754362|ref|XP_002960605.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
gi|300171544|gb|EFJ38144.1| hypothetical protein SELMODRAFT_437662 [Selaginella moellendorffii]
Length = 645
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 12/116 (10%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
V+++F K LESG+LEG++V Y+ R +V G GVVK I C C CK +Q
Sbjct: 530 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLG------SGVVKSGVILCNCRHCKSHQ 583
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM---NVCKDSP--LETLEKAVRM 508
+ FE H GS+ + P ++I+L+NG+ LR+++ + +D P + L KAV M
Sbjct: 584 GFNASSFEKHVGSTARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVSM 639
>gi|426393247|ref|XP_004062941.1| PREDICTED: autoimmune regulator [Gorilla gorilla gorilla]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|160773130|gb|AAI55053.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 586
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 410
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
E + A G E D + C + GG +LC
Sbjct: 411 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 444
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 445 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 487
>gi|426219513|ref|XP_004003966.1| PREDICTED: autoimmune regulator [Ovis aries]
Length = 612
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 2/54 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
P ++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+
Sbjct: 426 PHLHQKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCIQ 479
>gi|55656225|ref|XP_531580.1| PREDICTED: autoimmune regulator [Pan troglodytes]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPQPQ 358
>gi|302771658|ref|XP_002969247.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
gi|300162723|gb|EFJ29335.1| hypothetical protein SELMODRAFT_440724 [Selaginella moellendorffii]
Length = 618
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 12/116 (10%)
Query: 399 VMRKFYSKLKNFLESGILEGMSVMYI-RGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQ 457
V+++F K LESG+LEG++V Y+ R +V G GVVK I C C CK +Q
Sbjct: 503 VVKEFTVSAKWLLESGVLEGLTVRYMPRPGEVLG------SGVVKSGVILCNCRHCKSHQ 556
Query: 458 VVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIM---NVCKDSP--LETLEKAVRM 508
+ FE H GS+ + P ++I+L+NG+ LR+++ + +D P + L KAV M
Sbjct: 557 GFNASSFEKHVGSTARHPSDFIFLDNGRRLREVLEDGSRFRDKPNMMGALRKAVSM 612
>gi|4557291|ref|NP_000374.1| autoimmune regulator [Homo sapiens]
gi|3334119|sp|O43918.1|AIRE_HUMAN RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein; Short=APECED protein
gi|2665371|emb|CAB10790.1| AIRE protein [Homo sapiens]
gi|2696615|dbj|BAA23988.1| AIRE-1 [Homo sapiens]
gi|2696619|dbj|BAA23990.1| AIRE-1 [Homo sapiens]
gi|7768776|dbj|BAA95560.1| autoimmune regulator (APECED protein) [Homo sapiens]
gi|119629846|gb|EAX09441.1| hCG401300, isoform CRA_a [Homo sapiens]
Length = 545
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 293 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 347
Score = 40.8 bits (94), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL L+E+P G W C+ C + +Q++ R EE P+
Sbjct: 308 LICCDGCPRAFHLACLS----PPLREIPSGTWR-CSSC--LQATVQEVQPRAEEPRPQ 358
>gi|134026322|gb|AAI34984.1| Si:ch211-51m24.3 protein [Danio rerio]
Length = 584
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 351 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKMGIQWEAREDA 410
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
E + A G E D + C + GG +LC
Sbjct: 411 SEGEEDNEAGGEAEEDDHHMEFCRVC---------KDGGELLC----------------- 444
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 445 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 487
>gi|432883650|ref|XP_004074311.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oryzias latipes]
Length = 1974
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 73/168 (43%), Gaps = 39/168 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDV 437
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
+ + G +E D + C + GG +LC
Sbjct: 438 SDAEEDNGETGEMEEDDHHMEFCRVC---------KDGGELLC----------------- 471
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 472 CDSCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPPMKGKVQKIL 514
>gi|292606963|gb|ADE34162.1| chromodomain helicase DNA-binding protein 4 [Schmidtea
mediterranea]
Length = 1868
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 69/147 (46%), Gaps = 19/147 (12%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRAFH+ C+ L P G+W C +C +KE V
Sbjct: 363 DHQDYCEVCQQGGEIMLCDTCPRAFHLVCLDPELEEAPEGSWSCPHC----EKEGVV--- 415
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A +R+ G D + IR P+ E ++FC +I C +C
Sbjct: 416 AASRSTTPATGGD--MSQNPQNIRKSAQPNEEEKD----EHQEFCNECKDGGDLICCAKC 469
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L E+P+G WLC
Sbjct: 470 PVSYHPECL----YPPLSEIPEGPWLC 492
>gi|390478260|ref|XP_003735458.1| PREDICTED: LOW QUALITY PROTEIN: autoimmune regulator, partial
[Callithrix jacchus]
Length = 330
Score = 73.2 bits (178), Expect = 8e-10, Method: Composition-based stats.
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 2/59 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
P ++N+D C +C DGG+L+CCD CPRAFH+ C++ P IPSGTW C C+ E
Sbjct: 71 PQPHQKNEDECAVCRDGGELICCDGCPRAFHLACLTPPLREIPSGTWRCSSCLQATVSE 129
>gi|359323504|ref|XP_544921.3| PREDICTED: autoimmune regulator [Canis lupus familiaris]
Length = 551
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 297 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHDIPSGTWRCSSCLQ 346
>gi|348556287|ref|XP_003463954.1| PREDICTED: autoimmune regulator [Cavia porcellus]
Length = 551
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 300 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLHKIPSGTWRCSCCLQ 349
>gi|301625544|ref|XP_002941963.1| PREDICTED: hypothetical protein LOC100495769 [Xenopus (Silurana)
tropicalis]
Length = 868
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 30/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ V++
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC-----EKEGVQWE 413
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A + +++ V P+ E CR CK ++ CD C
Sbjct: 414 AK----------EDNSEIDDDMDDTVGDPEEEDHHMEFCR---VCKDG---GELLCCDAC 457
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 458 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKIL 496
>gi|402862209|ref|XP_003895460.1| PREDICTED: autoimmune regulator [Papio anubis]
Length = 527
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 39/55 (70%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 272 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSGCLQVTVQE 326
>gi|348500810|ref|XP_003437965.1| PREDICTED: autoimmune regulator-like [Oreochromis niloticus]
Length = 485
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 75/165 (45%), Gaps = 26/165 (15%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM-NTFQKEK----F 768
NDD C +C DGG+L+CCD CPRAFH+ C+ P IPSG+W C +C + +KEK
Sbjct: 259 NDDECTVCKDGGELICCDGCPRAFHLACLDPPLSSIPSGSWQCEWCRGHRVKKEKAQLPL 318
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRG------RDFCK-SRF 821
N + I V ++ + S + T GG C G R+ C
Sbjct: 319 QPQQTNTNCSNSITDVSFYSSLSSSLTSVTATMSAS-GGRNQCSGGELVGVREVCGVCHL 377
Query: 822 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI 866
G + C QC + +HV C KG+ +C + C R+
Sbjct: 378 GGGDLTHCFQCLKHFHVNC----------HFSKGRSICLS-CSRL 411
>gi|156717248|ref|NP_001096166.1| chromodomain helicase DNA binding protein 4 [Xenopus (Silurana)
tropicalis]
gi|126631946|gb|AAI33720.1| chd4 protein [Xenopus (Silurana) tropicalis]
Length = 1888
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 72/164 (43%), Gaps = 30/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHC------------- 404
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + E + +++ V P+ E CR CK ++ CD C
Sbjct: 405 --EKEGVQWEAKEDNSEIDDDMDDTVGDPEEEDHHMEFCR---VCKDG---GELLCCDAC 456
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 457 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKIQKIL 495
>gi|432852260|ref|XP_004067159.1| PREDICTED: uncharacterized protein LOC101164387 [Oryzias latipes]
Length = 1310
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 772
E++D C +C+ GG+LLCCD CP+ FH+ C P PSG W C C + Q E V+YN
Sbjct: 1104 ESEDFCAVCLIGGELLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDAIQPE--VQYN 1161
Query: 773 A-NARAAGR--IEGVDPFAQMVSRCIRIVQTP 801
N RA+G + G+ Q +I++ P
Sbjct: 1162 CENERASGANPLHGLSACDQRARHYYQIIKRP 1193
>gi|149043614|gb|EDL97065.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_c [Rattus
norvegicus]
Length = 404
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 95/229 (41%), Gaps = 55/229 (24%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSE 668
S G + + + VFE GG + +VG GE YK
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEF-----YK------------- 233
Query: 669 VSPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSK 714
PS+FE +G S KP + G ++S + + P +
Sbjct: 234 --PSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQ 291
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 292 KNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|351705306|gb|EHB08225.1| Autoimmune regulator [Heterocephalus glaber]
Length = 485
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 40/59 (67%), Gaps = 2/59 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 770
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ ++ +
Sbjct: 240 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSCCLQGRGQQGLAQ 298
>gi|348523828|ref|XP_003449425.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Oreochromis
niloticus]
Length = 2125
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 76/170 (44%), Gaps = 31/170 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 376 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 430
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIV----QTPDTELGGCVLCR--GRDFCKSRFGRRTV 826
A E + +R+ + D + C +C+ G C
Sbjct: 431 AKDEDFEDFEEDSEDRVISEVGVRVATGAEEEDDDHMEFCRVCKDGGELLC--------- 481
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 482 --CDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 524
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 36/53 (67%), Gaps = 5/53 (9%)
Query: 712 SSKENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ +E+DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 459 AEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRC 511
>gi|148232826|ref|NP_001089077.1| E3 ubiquitin-protein ligase TRIM33 [Xenopus laevis]
gi|82122015|sp|Q56R14.1|TRI33_XENLA RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|59891841|gb|AAX10105.1| ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 65/127 (51%), Gaps = 25/127 (19%)
Query: 689 HIYTSNGVSLHELSIKLSLERP-----------FSSKENDDLCGICMDGGDLLCCDSCPR 737
H+ NG S +++ S+ RP N+D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGGGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 738 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 791
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 792 SRCIRIV 798
+C R++
Sbjct: 928 -KCERLL 933
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 69/127 (54%), Gaps = 25/127 (19%)
Query: 689 HIYTSNGVSLHELSIKLSLERPF--------SSKEND---DLCGICMDGGDLLCCDSCPR 737
H+ NG S +++ S+ RP S+K++D D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGVGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 738 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 791
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 792 SRCIRIV 798
+C R++
Sbjct: 928 -KCERLL 933
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 69/127 (54%), Gaps = 25/127 (19%)
Query: 689 HIYTSNGVSLHELSIKLSLERPF--------SSKEND---DLCGICMDGGDLLCCDSCPR 737
H+ NG S +++ S+ RP S+K++D D C +C +GGDLLCC+ CP+
Sbjct: 815 HVSLVNGKS----AVRNSMHRPPRGGVGGDGSNKDDDPNEDWCAVCQNGGDLLCCEKCPK 870
Query: 738 AFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN----ARAAGRIEGVDPFAQMV 791
FH+ C +L PSG W C +C + + E VEY+ + ++ ++G+ P QM
Sbjct: 871 VFHLTCHVPTLLSFPSGEWICTFCRDLNKPE--VEYDCDNSQHSKKGKTVQGLSPVDQM- 927
Query: 792 SRCIRIV 798
+C R++
Sbjct: 928 -KCERLL 933
>gi|441672936|ref|XP_003277460.2| PREDICTED: uncharacterized protein LOC100599316 [Nomascus
leucogenys]
Length = 699
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNT 762
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 424 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCSSCLQA 474
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPE 883
+I CD C R +H+ CL LQE+P G W C+ C + +Q + R EE P+
Sbjct: 439 LICCDGCPRAFHLACLSP----PLQEIPSGTWR-CSSC--LQATVQDMRPRAEEPRPQ 489
>gi|363734262|ref|XP_420989.3| PREDICTED: tripartite motif-containing protein 66 [Gallus gallus]
Length = 1166
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 13/112 (11%)
Query: 693 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 750
+NG S+ + K + + ++ EN+D C +C++GG+LLCCD CP+ FH+ C +L P
Sbjct: 900 TNGTSI---TKKSPVTQEVNTIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFP 956
Query: 751 SGTWHCRYCMNTFQKEKFVEYNA-NARAAGRIE---GVDPFAQMVSRCIRIV 798
G W C C N + E VEY+ N R A G+D + Q +C ++V
Sbjct: 957 VGEWVCTLCRNPMKPE--VEYDCENTRYAHSYNAQYGLDDYDQ--KKCEKLV 1004
>gi|159466240|ref|XP_001691317.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279289|gb|EDP05050.1| predicted protein [Chlamydomonas reinhardtii]
Length = 624
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 67/136 (49%), Gaps = 8/136 (5%)
Query: 926 LSKAVSIFHDRFDPIIESASK--LDLI------PAMVYGRSHRGQDYHGMYCAILTVNQV 977
+ +A+ +F F P++ + LD++ P + G ++ G + A+L
Sbjct: 221 IDQALRLFKSSFSPLLMDNGRDLLDMVCTGWETPDEQLTETEPGHNFSGFHLAVLRQRGA 280
Query: 978 VVSAGIFRIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQ 1037
VV+A R+FG+ AELP VAT + G + L +E LL V LV+PS
Sbjct: 281 VVTAATLRVFGRRFAELPFVATREGYRRAGNCRRLVKAVEDLLLSAGVGQLVMPSIKPLL 340
Query: 1038 AIWTNKFGFSMMTEEE 1053
+W KFGF+ +TE+E
Sbjct: 341 PMWAAKFGFTPLTEQE 356
>gi|297741548|emb|CBI32680.3| unnamed protein product [Vitis vinifera]
Length = 127
Score = 72.4 bits (176), Expect = 1e-09, Method: Composition-based stats.
Identities = 39/104 (37%), Positives = 58/104 (55%), Gaps = 5/104 (4%)
Query: 415 ILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKR 474
+L+G+ V YI S+ K LRG++KGSG C C C ++V+ FE HAG K
Sbjct: 1 MLDGVPVKYIAWSREKE-----LRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKH 55
Query: 475 PPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRMVLGSSSMKKA 518
P +IY ENGKT+ I+ K +P +L ++ + GS +K+
Sbjct: 56 PNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKS 99
Score = 39.7 bits (91), Expect = 8.4, Method: Composition-based stats.
Identities = 26/68 (38%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 636 DGAEVGYFVRGEVKFLVGYKKGFGILCTC--CNSE--VSPSQFEAHAGWASRRKPFQHIY 691
DG V Y K L G KG G LC C CN ++ +FE HAG ++ P HIY
Sbjct: 3 DGVPVKYIAWSREKELRGIIKGSGYLCGCQSCNFSKVINAYEFERHAGCKTKH-PNNHIY 61
Query: 692 TSNGVSLH 699
NG +++
Sbjct: 62 FENGKTIY 69
>gi|291413605|ref|XP_002723061.1| PREDICTED: transcriptional intermediary factor 1 alpha [Oryctolagus
cuniculus]
Length = 903
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 42/64 (65%), Gaps = 4/64 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 678 NEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 735
Query: 774 NARA 777
+AR+
Sbjct: 736 DARS 739
>gi|392340124|ref|XP_001063352.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
Length = 384
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
NDD C +C DGG+L+CCD CPRAFH+ C++ P IPSG+W C C
Sbjct: 156 NDDECAVCKDGGELICCDGCPRAFHLTCLNPPLISIPSGSWQCERC 201
>gi|392347634|ref|XP_232354.5| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Rattus
norvegicus]
Length = 1921
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|395851249|ref|XP_003798178.1| PREDICTED: autoimmune regulator [Otolemur garnettii]
Length = 544
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 37/50 (74%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+
Sbjct: 291 QKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLQEIPSGTWRCCSCLQ 340
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 7/57 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLP 882
+I CD C R +H+ CL LQE+P G W CC+ C + +Q+ + EE P
Sbjct: 306 LICCDGCPRAFHLACLS----PPLQEIPSGTWRCCS-C--LQGGVQQGLPHAEEPQP 355
>gi|291392737|ref|XP_002712922.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
1 [Oryctolagus cuniculus]
Length = 1905
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|358339541|dbj|GAA47583.1| chromodomain-helicase-DNA-binding protein 4 [Clonorchis sinensis]
Length = 1670
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 78/165 (47%), Gaps = 26/165 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKF-VEY 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G+W C +C +KE +
Sbjct: 48 DHQDYCEVCQQGGEIMLCDTCPRAYHLVCLDPELEEAPEGSWSCPHC----EKEGISMGS 103
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
+A G D A+ V+ +P+ + ++FC +I C+
Sbjct: 104 QVEGKATGTKMAPDKSAKQVA-----AASPEKD-------EHQEFCTECHDGGDLICCEN 151
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL + L +P+G WLC C CK + + K++
Sbjct: 152 CPVSYHLDCL----IPPLTNIPEGVWLCPRCG-CKPLKARVSKIL 191
>gi|39204553|ref|NP_666091.1| chromodomain-helicase-DNA-binding protein 4 [Mus musculus]
gi|51701319|sp|Q6PDQ2.1|CHD4_MOUSE RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4
gi|35193271|gb|AAH58578.1| Chromodomain helicase DNA binding protein 4 [Mus musculus]
Length = 1915
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|348530512|ref|XP_003452755.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Oreochromis
niloticus]
Length = 1950
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 36/151 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C + +E
Sbjct: 364 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHCEKEGIQWEAREDG 423
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
+ + AG +E D + C + GG +LC
Sbjct: 424 SDGEEDNGDAGDMEEDDHHMEFCRVC---------KDGGELLC----------------- 457
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C YH+ CL L E+P G+W+C
Sbjct: 458 CDSCPSSYHIHCLN----PPLPEIPNGEWIC 484
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM 760
+ C +C DGG+LLCCDSCP ++HI C++ LP IP+G W C CM
Sbjct: 444 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRCM 488
>gi|301773764|ref|XP_002922290.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4-like [Ailuropoda melanoleuca]
Length = 1906
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 421
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 422 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 459
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 460 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 500
>gi|417414010|gb|JAA53313.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1916
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 432 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 469
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 470 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 510
>gi|410963637|ref|XP_003988370.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Felis catus]
Length = 1905
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|354467283|ref|XP_003496099.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Cricetulus
griseus]
Length = 1902
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|344242425|gb|EGV98528.1| Chromodomain-helicase-DNA-binding protein 4 [Cricetulus griseus]
Length = 1930
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|348555034|ref|XP_003463329.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Cavia
porcellus]
Length = 1893
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 408
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 409 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 446
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 447 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 487
>gi|47211547|emb|CAF96112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 886
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 6/75 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCCD CP+ FH+ C +P + PSG W C +C + E +EYN
Sbjct: 705 NEDWCAVCQNGGELLCCDRCPKVFHLSC-HIPALHEPPSGEWFCSFCRDLVSPE--MEYN 761
Query: 773 ANARAAGRIEGVDPF 787
N+ + +G P
Sbjct: 762 CNSNDSPVSDGFPPI 776
>gi|417413954|gb|JAA53286.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1766
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 222 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 281
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 282 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 319
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 320 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 360
>gi|60360510|dbj|BAD90499.1| mKIAA4075 protein [Mus musculus]
Length = 1945
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 448
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 449 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 486
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 487 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 527
>gi|440895655|gb|ELR47793.1| Chromodomain-helicase-DNA-binding protein 4 [Bos grunniens mutus]
Length = 1945
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|395847597|ref|XP_003796455.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Otolemur
garnettii]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|326920032|ref|XP_003206280.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Meleagris gallopavo]
Length = 1167
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 62/112 (55%), Gaps = 13/112 (11%)
Query: 693 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 750
+NG S+ + K + + ++ EN+D C +C++GG+LLCCD CP+ FH+ C +L P
Sbjct: 901 TNGTSIMK---KSPVTQEVNTIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFP 957
Query: 751 SGTWHCRYCMNTFQKEKFVEYNA-NARAAGRIE---GVDPFAQMVSRCIRIV 798
G W C C N + E VEY+ N R A G+D + Q +C ++V
Sbjct: 958 VGEWVCTLCRNPMKPE--VEYDCENTRYAHSYNAQYGLDDYDQ--KKCEKLV 1005
>gi|426227030|ref|XP_004007632.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Ovis aries]
Length = 1963
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 419 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 478
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 479 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 516
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 517 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 557
>gi|410963635|ref|XP_003988369.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Felis catus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|291392739|ref|XP_002712923.1| PREDICTED: chromodomain helicase DNA binding protein 4-like isoform
2 [Oryctolagus cuniculus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|351715692|gb|EHB18611.1| Chromodomain-helicase-DNA-binding protein 4 [Heterocephalus glaber]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|432111850|gb|ELK34892.1| Chromodomain-helicase-DNA-binding protein 4 [Myotis davidii]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 729 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 786
Query: 774 NARAAG----RIEGVDPFAQMVSR-CIRIV 798
+AR+ +IEG+ + R C R++
Sbjct: 787 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 816
>gi|350584424|ref|XP_003126577.3| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
gi|417515864|gb|JAA53737.1| chromodomain-helicase-DNA-binding protein 4 [Sus scrofa]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|297471380|ref|XP_002685182.1| PREDICTED: autoimmune regulator [Bos taurus]
gi|296490907|tpg|DAA33020.1| TPA: autoimmune regulator (autoimmune polyendocrinopathy
candidiasis ectodermal dystrophy)-like [Bos taurus]
Length = 620
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 4/57 (7%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM--NTFQKE 766
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+ T Q++
Sbjct: 383 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRD 439
>gi|194663786|ref|XP_001252993.2| PREDICTED: autoimmune regulator [Bos taurus]
Length = 628
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%), Gaps = 4/57 (7%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM--NTFQKE 766
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+ T Q++
Sbjct: 383 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQGTTAQRD 439
>gi|330417956|ref|NP_001193430.1| chromodomain-helicase-DNA-binding protein 4 [Bos taurus]
gi|296487143|tpg|DAA29256.1| TPA: chromodomain helicase DNA binding protein 4 [Bos taurus]
Length = 1912
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|345791649|ref|XP_867754.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 3
[Canis lupus familiaris]
Length = 1912
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|67514537|ref|NP_001002870.2| tripartite motif-containing 24 [Danio rerio]
gi|66910275|gb|AAH96849.1| Tripartite motif-containing 24 [Danio rerio]
gi|182888610|gb|AAI63977.1| Trim24 protein [Danio rerio]
Length = 961
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W+C +C + E ++YN
Sbjct: 696 NEDWCAVCQNGGELLCCDKCPKVFHLSCHVPTLTASPSGEWYCTFCRDLNSPE--MQYNV 753
Query: 774 NA 775
NA
Sbjct: 754 NA 755
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 53/90 (58%), Gaps = 9/90 (10%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 268 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 325
Query: 774 NARAAG----RIEGVDPFAQMVSR-CIRIV 798
+AR+ +IEG+ + R C R++
Sbjct: 326 DARSHNSDKRKIEGLSKLTPIDKRKCERLL 355
>gi|148227774|ref|NP_001080504.1| chromodomain helicase DNA binding protein 4 [Xenopus laevis]
gi|28422180|gb|AAH46866.1| B230399n07 protein [Xenopus laevis]
Length = 1893
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 34/166 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 359 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMDKAPEGKWSCPHCEKEGVQWEAKEDN 418
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ DP + D + C +C+ G C CD
Sbjct: 419 SELDDDLDDAVGDP------------EEEDHHMEFCRVCKDGGELLC-----------CD 455
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 456 VCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKIQKIL 496
>gi|149043613|gb|EDL97064.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_b [Rattus
norvegicus]
Length = 488
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 95/229 (41%), Gaps = 55/229 (24%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSE 668
S G + + + VFE GG + +VG GE YK
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEF-----YK------------- 233
Query: 669 VSPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSK 714
PS+FE +G S KP + G ++S + + P +
Sbjct: 234 --PSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQ 291
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 292 KNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|82085579|sp|Q6E2N3.1|TRI33_DANRE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Protein
moonshine; AltName: Full=Transcription intermediary
factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|50235052|gb|AAT70732.1| transcriptional intermediary factor 1 gamma [Danio rerio]
Length = 1163
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 920 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYN 976
>gi|440895583|gb|ELR47735.1| Autoimmune regulator [Bos grunniens mutus]
Length = 543
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C C DGG+LLCCD CPRAFH+ C++ P IPSGTW C C+
Sbjct: 301 QKNEDECAACRDGGELLCCDGCPRAFHLACLTPPLSEIPSGTWRCSNCVQ 350
>gi|444510914|gb|ELV09761.1| Chromodomain-helicase-DNA-binding protein 4 [Tupaia chinensis]
Length = 1875
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|347300253|ref|NP_001002871.2| E3 ubiquitin-protein ligase TRIM33 [Danio rerio]
Length = 1176
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 37/59 (62%), Gaps = 4/59 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 933 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYN 989
>gi|157823915|ref|NP_001099849.1| autoimmune regulator [Rattus norvegicus]
gi|149043612|gb|EDL97063.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) (predicted), isoform CRA_a [Rattus
norvegicus]
Length = 547
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 95/229 (41%), Gaps = 55/229 (24%)
Query: 552 GSAEIKEPLSHSSEMEPQPPSVELEESPAPSGELTDTSNRSPEPNSAQTSSHSKMKSSSV 611
S I P SHS P+ P LE S R P N QT + S ++ +V
Sbjct: 148 ASKSISSPGSHSKAKPPKKPEGNLE------------SQRLPLGNGIQTMAASVQRAVTV 195
Query: 612 KS---HGKITRKDLRMHKLVFEEGGLEDGAEVGYFVRGEVKFLVGYKKGFGILCTCCNSE 668
S G + + + VFE GG + +VG GE YK
Sbjct: 196 SSGDVPGSRGAVEGILIQQVFESGGSKKCIQVG----GEF-----YK------------- 233
Query: 669 VSPSQFEAHAG-------WASRRKPFQHIYTSNGVSLHELSIKLSL-------ERPFSSK 714
PS+FE +G S KP + G ++S + + P +
Sbjct: 234 --PSKFEDPSGNLKNKTRGGSSLKPVIRGKGTQGRDEQKVSQQCGVPPLPPLPNEPQVHQ 291
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 292 KNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|345787769|ref|XP_542497.3| PREDICTED: tripartite motif-containing protein 66 [Canis lupus
familiaris]
Length = 1207
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 959 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSLTQPE--MEYD 1016
Query: 773 A-NAR 776
NAR
Sbjct: 1017 CENAR 1021
>gi|281341697|gb|EFB17281.1| hypothetical protein PANDA_004268 [Ailuropoda melanoleuca]
Length = 1217
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 988 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSPTQPE--MEYD 1045
Query: 773 A-NAR 776
NAR
Sbjct: 1046 CENAR 1050
>gi|158024570|gb|ABW08121.1| autoimmune regulator [Xenopus laevis]
Length = 380
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/48 (56%), Positives = 34/48 (70%), Gaps = 2/48 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ NDD C +C DGG+L+CCD CPR+FH+ C+ P IPSGTW C C
Sbjct: 26 QSNDDECSVCRDGGELICCDGCPRSFHLSCLVPPLTHIPSGTWRCDAC 73
>gi|355678680|gb|AER96183.1| chromodomain helicase DNA binding protein 4 [Mustela putorius furo]
Length = 1457
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 287 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 346
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 347 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 384
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 385 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 425
>gi|291241106|ref|XP_002740458.1| PREDICTED: CHromoDomain protein family member (chd-3)-like
[Saccoglossus kowalevskii]
Length = 1294
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 70/170 (41%), Gaps = 46/170 (27%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
+ D C C GG+L+ CDSCP +FH+DCV P G+P W C+ C+ +
Sbjct: 1000 HSDRCARCRRGGELILCDSCPLSFHLDCVDPPLLGVPPDIWLCQLCVLEAESSPL----- 1054
Query: 774 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 833
G +G D + +RC + Q +ILCD C
Sbjct: 1055 ----EGCSDGTDSHCDVCARCYKHGQ--------------------------LILCDVCP 1084
Query: 834 REYHVGCLKDHGMEDLQELPKGKW---LCCADCKRINLALQKLVDRGEEK 880
+H+ C L ++P GKW +C DC+ ++ V+RG K
Sbjct: 1085 LAFHLRCTD----PPLLKVPSGKWTCQICVKDCQPVSSG--GFVNRGPAK 1128
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 62/145 (42%), Gaps = 17/145 (11%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF-VEYNAN 774
D C C G L+ C++CP A+H+ C + P IP+G W C C + QK+ +++
Sbjct: 912 DECAKCGREGQLILCETCPSAYHLKCANPPLKKIPAGKWICEVCTDKSQKKPTGIKFKGK 971
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
R G + + + S + D C CR R G +ILCD C
Sbjct: 972 HR-KGLLPTSSSPSSLSSDLETLGIVADGHSDRCARCR-------RGGE--LILCDSCPL 1021
Query: 835 EYHVGCLKDHGMEDLQELPKGKWLC 859
+H+ C+ L +P WLC
Sbjct: 1022 SFHLDCVD----PPLLGVPPDIWLC 1042
Score = 47.0 bits (110), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 34/71 (47%), Gaps = 10/71 (14%)
Query: 720 CGICMDGGD---LLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNAN 774
C IC GD +L CD C R H+ C+ P IPSG W+C C + K V+ N+
Sbjct: 778 CRICRRKGDAERMLLCDGCDRGHHMYCLKPPVKSIPSGDWYCVDC-----RPKIVKQNSR 832
Query: 775 ARAAGRIEGVD 785
R +E D
Sbjct: 833 RRRKSTLEDYD 843
>gi|89130583|gb|AAI14246.1| Trim33 protein [Danio rerio]
Length = 1058
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCCD CP+ FHI C +L PSG W C +C N E +EYN
Sbjct: 815 NEDWCAVCQNGGELLCCDHCPKVFHITCHIPTLKSSPSGDWMCTFCRNLANPE--IEYNC 872
Query: 774 N 774
+
Sbjct: 873 D 873
>gi|224043897|ref|XP_002197085.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Taeniopygia
guttata]
Length = 1919
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 362 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 416
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + EG + +V + D + C +C+ G C CD
Sbjct: 417 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 459
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 460 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 500
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 435 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 487
>gi|410925745|ref|XP_003976340.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1955
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 33/149 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 410 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC-----EKEGIQWE 464
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + EG D + D + C +C+ G C CD
Sbjct: 465 AREEGS---EGEDDNGDVGE------MEDDHHMEFCRVCKDGGELLC-----------CD 504
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+W+C
Sbjct: 505 SCPSSYHIHCLN----PPLPEIPNGEWIC 529
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCDSCP ++HI C++ LP IP+G W C C
Sbjct: 489 EFCRVCKDGGELLCCDSCPSSYHIHCLNPPLPEIPNGEWICPRC 532
>gi|344294674|ref|XP_003419041.1| PREDICTED: autoimmune regulator-like [Loxodonta africana]
Length = 470
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/54 (51%), Positives = 37/54 (68%), Gaps = 2/54 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
P ++N+D C +C DGG+L+CCD CPRAFH+ C+ P IPSGTW C C+
Sbjct: 297 PQLQQKNEDECAVCRDGGELICCDGCPRAFHLACLCPPLREIPSGTWRCSSCLQ 350
>gi|301761576|ref|XP_002916215.1| PREDICTED: tripartite motif-containing protein 66-like [Ailuropoda
melanoleuca]
Length = 1398
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 42/65 (64%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ FH+ C +L G P G W C C + Q E +EY+
Sbjct: 1150 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHLPALLGFPGGDWVCTLCRSPTQPE--MEYD 1207
Query: 773 A-NAR 776
NAR
Sbjct: 1208 CENAR 1212
>gi|363728319|ref|XP_003640489.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Gallus gallus]
Length = 1924
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 423
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + EG + +V + D + C +C+ G C CD
Sbjct: 424 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 466
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 467 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 507
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 442 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|357472675|ref|XP_003606622.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
gi|355507677|gb|AES88819.1| hypothetical protein MTR_4g063150 [Medicago truncatula]
Length = 444
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 6/130 (4%)
Query: 389 KQETVMKPRKVMRKFYSKLKNFLESGILEGMSVMYIRGSKVKGPGVSGLRGVVKGSGISC 448
KQ+ ++ F + +++ + +G+L+G+ V Y+ ++ + LRG++KG+ C
Sbjct: 295 KQDIKSTRKESPNTFPTNVRSLISTGMLDGVPVKYVSVAREE------LRGIIKGTTYLC 348
Query: 449 FCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKDSPLETLEKAVRM 508
C C + + FE HAG +K P +IY ENGKT+ I+ + +P +L ++
Sbjct: 349 GCQSCNYAKGLNAFEFEKHAGCKSKHPNNHIYFENGKTIYQIVQELRSTPESSLFDTIQT 408
Query: 509 VLGSSSMKKA 518
+ G+ +KA
Sbjct: 409 IFGAPINQKA 418
>gi|326912771|ref|XP_003202720.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Meleagris gallopavo]
Length = 1922
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 77/166 (46%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 369 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 423
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + EG + +V + D + C +C+ G C CD
Sbjct: 424 AKEDNS---EGEEILEDVVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 466
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 467 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 507
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 442 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 494
>gi|449019235|dbj|BAM82637.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 540
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 31/44 (70%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
DD+C IC GGD++CCD CP +H+ C+ LP IPSG W C C+
Sbjct: 177 DDVCSICGLGGDIVCCDECPMGYHLQCIGLPSIPSGEWFCPACV 220
>gi|312080256|ref|XP_003142522.1| PHD-finger family protein [Loa loa]
gi|307762316|gb|EFO21550.1| PHD-finger family protein [Loa loa]
Length = 569
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 96/248 (38%), Gaps = 83/248 (33%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKF--- 768
N C CM+GG+LLCCD CP +FH+ C P IPSG W C CM+ +
Sbjct: 55 NSQYCSACMEGGELLCCDRCPASFHLMCHEPPIERSSIPSGKWLCNRCMHAVAHDTSPKI 114
Query: 769 --------VEYNA-NARAAGRIEGVD---------------------------------- 785
VE+++ + R + I G+
Sbjct: 115 IKKVPKLDVEHSSLDVRRSHAISGIKEQSADRSLSALAVLADAALAANAEQFSLPHDLSC 174
Query: 786 -----PFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 840
PF + ++ ++V P C LC R+ + T+I CD C YH+ C
Sbjct: 175 EIPPLPFDERLTS-TQLVPPPK---AICHLCASRN------EKSTIIQCDFCSLCYHLDC 224
Query: 841 LKDHGMEDLQELPKGKWLCCADCKRI-------NLALQK--LVDRGEEKLPE-----TSL 886
L L +PK KW+C A + I L+L+K L+ R + P +
Sbjct: 225 L----TPPLPSVPKDKWMCPAHVEHILDRKLAKTLSLRKRLLIWRKYARQPRNGPYFSVP 280
Query: 887 DVIKKKHE 894
D IKK +E
Sbjct: 281 DCIKKLYE 288
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 32/46 (69%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
NDD C C DGGDLLCCDSC ++FH+ C++ P IP G W+C C
Sbjct: 65 NDDFCSSCKDGGDLLCCDSCEKSFHLMCLNPPLEEIPEGDWYCNSC 110
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 864
DFC S ++ CD CE+ +H+ CL L+E+P+G W C + CK
Sbjct: 67 DFCSSCKDGGDLLCCDSCEKSFHLMCLN----PPLEEIPEGDWYCNS-CK 111
>gi|410907027|ref|XP_003966993.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Takifugu rubripes]
Length = 2102
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 77/171 (45%), Gaps = 33/171 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 443
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQT-----PDTELGGCVLCR--GRDFCKSRFGRRT 825
A E +++S V T D + C +C+ G C
Sbjct: 444 AKDEEFEDFEEDS-EDRVISEVSLGVPTGAEEEDDDHMEFCRVCKDGGELLC-------- 494
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+WLC C C I +QK++
Sbjct: 495 ---CDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQKIL 537
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%), Gaps = 6/61 (9%)
Query: 705 LSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRY 758
+SL P ++E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C
Sbjct: 464 VSLGVPTGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPR 523
Query: 759 C 759
C
Sbjct: 524 C 524
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 4/59 (6%)
Query: 801 PDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
P E G +D+C+ +ILCD C R YH+ CL+ +L + P+GKW C
Sbjct: 378 PGDEEGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLE----PELDKAPEGKWSC 432
>gi|47211690|emb|CAF91815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1369
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 75/166 (45%), Gaps = 36/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 254 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGTWSCPHC-----EKEGIQWE 308
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + E +M D + C +C+ G C CD
Sbjct: 309 AREEGSEGDEDNGDVGEMED---------DHHMEFCRVCKDGGELLC-----------CD 348
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+W+C C C + +QK++
Sbjct: 349 SCPSSYHIHCLN----PPLPEIPNGEWICPRCT-CPSMKGKVQKIL 389
>gi|327283577|ref|XP_003226517.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like [Anolis
carolinensis]
Length = 1918
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 76/166 (45%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 417
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + EG + V + D + C +C+ G C CD
Sbjct: 418 AKEDNS---EGEETMEDAVGDA---EEEDDHHMEFCRVCKDGGELLC-----------CD 460
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 461 ACPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQKIL 501
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 436 AEEEDDHHMEFCRVCKDGGELLCCDACPSSYHIHCLNPPLPEIPNGEWLCPRC 488
>gi|354476880|ref|XP_003500651.1| PREDICTED: autoimmune regulator [Cricetulus griseus]
Length = 550
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 36/50 (72%), Gaps = 2/50 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 304 QKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLREIPSGLWRCSCCLQ 353
>gi|340371689|ref|XP_003384377.1| PREDICTED: hypothetical protein LOC100637285 [Amphimedon
queenslandica]
Length = 400
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 33/47 (70%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ D C IC DGG+LLCCD CP+A+H+ C+ P IP GTWHC C
Sbjct: 315 EHADYCHICQDGGELLCCDRCPKAYHLQCLYPPLKKIPDGTWHCPRC 361
>gi|409168280|ref|NP_001258480.1| autoimmune regulator isoform 4 [Mus musculus]
gi|7108536|gb|AAF36462.1|AF128117_1 autoimmune regulator [Mus musculus]
gi|148699809|gb|EDL31756.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_g [Mus musculus]
Length = 547
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|159163630|pdb|1XWH|A Chain A, Nmr Structure Of The First Phd Finger Of Autoimmune
Regulator Protein (Aire1): Insights Into Apeced
gi|238537671|pdb|2KE1|A Chain A, Molecular Basis Of Non-Modified Histone H3 Tail
Recognition By The First Phd Finger Of Autoimmune
Regulator
Length = 66
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 40/56 (71%), Gaps = 2/56 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 4 AQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 59
>gi|409168282|ref|NP_001258479.1| autoimmune regulator isoform 3 [Mus musculus]
gi|7108534|gb|AAF36461.1|AF128116_1 autoimmune regulator [Mus musculus]
gi|148699808|gb|EDL31755.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_f [Mus musculus]
Length = 548
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|409168278|ref|NP_001258478.1| autoimmune regulator isoform 2 [Mus musculus]
gi|7108532|gb|AAF36460.1|AF128115_1 autoimmune regulator [Mus musculus]
gi|73695408|gb|AAI03519.1| Autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy) [Mus musculus]
gi|148699803|gb|EDL31750.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_a [Mus musculus]
Length = 551
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|6753020|ref|NP_033776.1| autoimmune regulator isoform 1 [Mus musculus]
gi|22256596|sp|Q9Z0E3.1|AIRE_MOUSE RecName: Full=Autoimmune regulator; AltName: Full=Autoimmune
polyendocrinopathy candidiasis ectodermal dystrophy
protein homolog; Short=APECED protein homolog
gi|5669676|gb|AAD46421.1|AF105002_1 autoimmune regulator [Mus musculus]
gi|7108573|gb|AAF36481.1|AF128772_1 autoimmune regulator [Mus musculus]
gi|7108575|gb|AAF36482.1|AF128773_1 autoimmune regulator [Mus musculus]
gi|3550508|emb|CAA07620.1| autoimmune regulator [Mus musculus]
gi|4426599|gb|AAD20444.1| autoimmune regulator [Mus musculus]
gi|4456675|emb|CAB36909.1| Aire protein [Mus musculus]
gi|6706793|emb|CAB66141.1| APECED protein [Mus musculus]
gi|148699813|gb|EDL31760.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_k [Mus musculus]
gi|212659771|gb|ACC85597.3| autoimmune regulator AIRE1a [Mus musculus]
gi|325983883|gb|ADZ48462.1| AIRE [Mus musculus]
Length = 552
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|409168292|ref|NP_001258485.1| autoimmune regulator isoform 9 [Mus musculus]
gi|7108546|gb|AAF36467.1|AF128122_1 autoimmune regulator [Mus musculus]
gi|148699804|gb|EDL31751.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_b [Mus musculus]
Length = 409
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|380809128|gb|AFE76439.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|383415429|gb|AFH30928.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945024|gb|AFI36117.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1905
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|409168294|ref|NP_001258486.1| autoimmune regulator isoform 10 [Mus musculus]
gi|7108548|gb|AAF36468.1|AF128123_1 autoimmune regulator [Mus musculus]
gi|148699810|gb|EDL31757.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_h [Mus musculus]
Length = 408
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|402894298|ref|XP_003910303.1| PREDICTED: tripartite motif-containing protein 66, partial [Papio
anubis]
Length = 1041
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 956 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1014
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1015 -MEYDCENAR 1023
>gi|397499204|ref|XP_003820349.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Pan paniscus]
gi|410350197|gb|JAA41702.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1905
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|327289025|ref|XP_003229225.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Anolis
carolinensis]
Length = 2037
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 80/178 (44%), Gaps = 38/178 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE +++
Sbjct: 353 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC----EKEG-IQWE 407
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + + C R+ + GG +LC CD C
Sbjct: 408 PKDDDEEDEDLCEEADDHMEFC-RVCKD-----GGELLC-----------------CDTC 444
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVD-RGEEKLPETSLD 887
YH+ CL L E+P G+WLC C C + +Q+++ + +E P SLD
Sbjct: 445 PSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQRILHWKWKESSPLLSLD 497
>gi|355563925|gb|EHH20425.1| hypothetical protein EGK_03279 [Macaca mulatta]
Length = 1899
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|395743837|ref|XP_002822857.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Pongo abelii]
Length = 1898
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 350 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 409
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 410 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 447
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 448 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 488
>gi|380798783|gb|AFE71267.1| chromodomain-helicase-DNA-binding protein 4, partial [Macaca
mulatta]
Length = 1847
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 303 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 362
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 363 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 400
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 401 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 441
>gi|194211609|ref|XP_001496418.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Equus
caballus]
Length = 1912
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 39/169 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +++ +++
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC-----EKEGIQWE 422
Query: 773 A---NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVI 827
A N+ +E V A+ + D + C +C+ G C
Sbjct: 423 AKEDNSEGEEILEEVGGDAE---------EEDDHHMEFCRVCKDGGELLC---------- 463
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 464 -CDTCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 36/53 (67%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
++E DD C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 441 AEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|409168296|ref|NP_001258487.1| autoimmune regulator isoform 11 [Mus musculus]
gi|7108550|gb|AAF36469.1|AF128124_1 autoimmune regulator [Mus musculus]
gi|148699806|gb|EDL31753.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_d [Mus musculus]
Length = 405
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|384945020|gb|AFI36115.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1700
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|409168286|ref|NP_001258482.1| autoimmune regulator isoform 6 [Mus musculus]
gi|7108540|gb|AAF36464.1|AF128119_1 autoimmune regulator [Mus musculus]
gi|148699811|gb|EDL31758.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_i [Mus musculus]
Length = 492
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 295 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 343
>gi|383415431|gb|AFH30929.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1899
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 355 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 414
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 415 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 452
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 453 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 493
>gi|24047226|gb|AAH38596.1| CHD4 protein [Homo sapiens]
gi|167773199|gb|ABZ92034.1| chromodomain helicase DNA binding protein 4 [synthetic construct]
Length = 1937
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 424
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 425 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 462
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 463 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 503
>gi|409168298|ref|NP_001258488.1| autoimmune regulator isoform 12 [Mus musculus]
gi|7108552|gb|AAF36470.1|AF128125_1 autoimmune regulator [Mus musculus]
gi|148699814|gb|EDL31761.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_l [Mus musculus]
Length = 404
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|409168284|ref|NP_001258481.1| autoimmune regulator isoform 5 [Mus musculus]
gi|7108538|gb|AAF36463.1|AF128118_1 autoimmune regulator [Mus musculus]
gi|73695313|gb|AAI03512.1| Aire protein [Mus musculus]
gi|148699807|gb|EDL31754.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_e [Mus musculus]
Length = 493
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 294 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 344
>gi|390467440|ref|XP_002752322.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Callithrix
jacchus]
Length = 1814
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 358 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 417
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 418 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 455
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 456 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 496
>gi|317418651|emb|CBN80689.1| Chromodomain-helicase-DNA-binding protein 5 [Dicentrarchus labrax]
Length = 1981
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 380
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + + C R+ + GG +LC CD C
Sbjct: 381 KDDEEEEEEAPGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 417
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVD--RGEEKLP 882
YH+ CL L E+P G+WLC C C + +QK++ GE LP
Sbjct: 418 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKILHWTWGEPPLP 466
>gi|383415433|gb|AFH30930.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
gi|384945022|gb|AFI36116.1| chromodomain-helicase-DNA-binding protein 4 [Macaca mulatta]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|193785938|dbj|BAG54725.1| unnamed protein product [Homo sapiens]
Length = 1886
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 342 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDSDMEKAPEGKWSCPHCEKEGIQWEAKEDN 401
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 402 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 439
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 440 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 480
>gi|26330021|dbj|BAC28749.1| unnamed protein product [Mus musculus]
Length = 1045
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 113 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 172
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G DP + D + C +C+ G C CD
Sbjct: 173 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CD 210
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 211 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 251
>gi|409168288|ref|NP_001258483.1| autoimmune regulator isoform 7 [Mus musculus]
gi|7108542|gb|AAF36465.1|AF128120_1 autoimmune regulator [Mus musculus]
gi|148699812|gb|EDL31759.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_j [Mus musculus]
Length = 489
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 37/51 (72%), Gaps = 2/51 (3%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+++N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 290 NQKNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 340
>gi|410924319|ref|XP_003975629.1| PREDICTED: autoimmune regulator-like [Takifugu rubripes]
Length = 479
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 2/64 (3%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
NDD C C DGG+L+CCD CP+AFH+ C+ P IPSG W C +C T K + +
Sbjct: 251 NDDECAACKDGGELICCDGCPQAFHLTCLDPPLTSIPSGPWQCDWCCGTRGKRETTQQPP 310
Query: 774 NARA 777
A++
Sbjct: 311 LAKS 314
>gi|410227432|gb|JAA10935.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350199|gb|JAA41703.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|51599156|ref|NP_001264.2| chromodomain-helicase-DNA-binding protein 4 [Homo sapiens]
gi|311033360|sp|Q14839.2|CHD4_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 4;
Short=CHD-4; AltName: Full=ATP-dependent helicase CHD4;
AltName: Full=Mi-2 autoantigen 218 kDa protein; AltName:
Full=Mi2-beta
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|397499206|ref|XP_003820350.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 2
[Pan paniscus]
gi|1107696|emb|CAA60384.1| Mi-2 protein [Homo sapiens]
gi|119609184|gb|EAW88778.1| chromodomain helicase DNA binding protein 4, isoform CRA_b [Homo
sapiens]
gi|410227430|gb|JAA10934.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
gi|410350195|gb|JAA41701.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|426371465|ref|XP_004052667.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Gorilla
gorilla gorilla]
Length = 1759
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 361 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 420
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 421 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 459 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 499
>gi|349580356|dbj|GAA25516.1| K7_Rco1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 684
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 113/321 (35%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPTVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
+++CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMICDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHADYDFNQDF 537
>gi|409168290|ref|NP_001258484.1| autoimmune regulator isoform 8 [Mus musculus]
gi|7108544|gb|AAF36466.1|AF128121_1 autoimmune regulator [Mus musculus]
gi|148699805|gb|EDL31752.1| autoimmune regulator (autoimmune polyendocrinopathy candidiasis
ectodermal dystrophy), isoform CRA_c [Mus musculus]
Length = 488
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 35/49 (71%), Gaps = 2/49 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN 761
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSG W C C+
Sbjct: 291 QNEDECAVCHDGGELICCDGCPRAFHLACLSPPLQEIPSGLWRCSCCLQ 339
>gi|402884886|ref|XP_003905901.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Papio
anubis]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|119609183|gb|EAW88777.1| chromodomain helicase DNA binding protein 4, isoform CRA_a [Homo
sapiens]
Length = 1911
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|441670660|ref|XP_003273866.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Nomascus leucogenys]
Length = 1910
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 425
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 426 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 463
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 464 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 504
>gi|297261645|ref|XP_001107252.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like isoform
8 [Macaca mulatta]
Length = 1912
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|119609185|gb|EAW88779.1| chromodomain helicase DNA binding protein 4, isoform CRA_c [Homo
sapiens]
Length = 1908
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 424
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 425 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 462
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 463 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 503
>gi|403303237|ref|XP_003942247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Saimiri
boliviensis boliviensis]
Length = 1888
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 72/166 (43%), Gaps = 33/166 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPALKGKVQKIL 506
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 640 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 697
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ ++G+ P Q +C R++
Sbjct: 698 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 724
>gi|355697937|gb|EHH28485.1| hypothetical protein EGK_18929, partial [Macaca mulatta]
Length = 1236
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 983 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1041
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1042 -MEYDCENAR 1050
>gi|403254198|ref|XP_003919864.1| PREDICTED: tripartite motif-containing protein 66 [Saimiri
boliviensis boliviensis]
Length = 1207
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 954 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1012
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1013 -MEYDCENAR 1021
>gi|355752346|gb|EHH56466.1| hypothetical protein EGM_05877, partial [Macaca fascicularis]
Length = 1236
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 983 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1041
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1042 -MEYDCENAR 1050
>gi|50235054|gb|AAT70733.1| transcriptional intermediary factor 1 alpha [Danio rerio]
Length = 961
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+L+CCD CP+ FH+ C SL PSG W+C C + E ++YN
Sbjct: 696 NEDWCAVCQNGGELICCDKCPKVFHLSCHVPSLTASPSGEWYCTLCRDLNSPE--MQYNV 753
Query: 774 NA 775
NA
Sbjct: 754 NA 755
>gi|297268466|ref|XP_001097566.2| PREDICTED: tripartite motif-containing protein 66 [Macaca mulatta]
Length = 1326
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 956 PPAPIENEDFCAVCLNGGELLCCDHCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1014
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1015 -MEYDCENAR 1023
>gi|426367401|ref|XP_004050721.1| PREDICTED: tripartite motif-containing protein 66 [Gorilla gorilla
gorilla]
Length = 1085
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 921 PPAPVENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 979
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 980 -MEYDCENAR 988
>gi|397494634|ref|XP_003818179.1| PREDICTED: tripartite motif-containing protein 66 [Pan paniscus]
Length = 1216
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 963 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1021
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1022 -MEYDCENAR 1030
>gi|395536811|ref|XP_003770405.1| PREDICTED: autoimmune regulator-like [Sarcophilus harrisii]
Length = 529
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 50/102 (49%), Gaps = 11/102 (10%)
Query: 670 SPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSS---------KENDDLC 720
+P +FE +G R P T EL + P + ++NDD C
Sbjct: 198 TPGKFEEPSGKNKTRSPKPPTRTKATQGRAELQLSQQSGGPATPAQPPELHLHQKNDDEC 257
Query: 721 GICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+C DGG+L+CCD CPRAFH+ C+ P IPSG W C C+
Sbjct: 258 AVCRDGGELICCDGCPRAFHLACLEPPLTEIPSGMWRCGCCI 299
>gi|395815210|ref|XP_003781127.1| PREDICTED: tripartite motif-containing protein 66 [Otolemur
garnettii]
Length = 1211
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 958 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1016
Query: 768 FVEYNA-NAR 776
VEY+ NAR
Sbjct: 1017 -VEYDCENAR 1025
>gi|441646789|ref|XP_003254588.2| PREDICTED: tripartite motif-containing protein 66 [Nomascus
leucogenys]
Length = 1223
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 970 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1028
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1029 -MEYDCENAR 1037
>gi|410301138|gb|JAA29169.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1912
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 944
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ ++G+ P Q +C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|410301140|gb|JAA29170.1| chromodomain helicase DNA binding protein 4 [Pan troglodytes]
Length = 1914
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 64/149 (42%), Gaps = 30/149 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
+ G D + + D + C +C+ G C CD
Sbjct: 428 SEGEEILEEVGGD-----------LEEEDDHHMEFCRVCKDGGELLC-----------CD 465
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+WLC
Sbjct: 466 TCPSSYHIHCLN----PPLPEIPNGEWLC 490
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 450 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 493
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 52/89 (58%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 887 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 944
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ ++G+ P Q +C R++
Sbjct: 945 DNLQHSKKGKTVQGLSPVDQ--RKCERLL 971
>gi|334321588|ref|XP_001376672.2| PREDICTED: autoimmune regulator-like [Monodelphis domestica]
Length = 538
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+ NDD C +C DGG+L+CCD CPRAFH+ C+ P IPSG W C C+
Sbjct: 284 QRNDDECAVCRDGGELICCDGCPRAFHLACLEPPLTDIPSGMWRCGCCI 332
>gi|410044930|ref|XP_003313014.2| PREDICTED: tripartite motif-containing protein 66 [Pan troglodytes]
Length = 1037
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 784 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 842
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 843 -MEYDCENAR 851
>gi|255956317|ref|XP_002568911.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590622|emb|CAP96817.1| Pc21g19200 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 986
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 71/199 (35%), Gaps = 53/199 (26%)
Query: 706 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-- 759
S+E P +N D C C GGDLLCCD+C ++H DC+ P P G WHC C
Sbjct: 538 SIEPP----DNSDDCHQCGKGGDLLCCDTCINSYHFDCLDPPLDPKNPPQGEWHCPKCSI 593
Query: 760 MNTF-------QKEKFVEYNANARAAGRIEGVDP----------------FAQMVSRCIR 796
N+F + K E+ GVD + + R
Sbjct: 594 RNSFSTLIAHSKHYKKTEFQVPQDIKEHFHGVDEGIVFEDGYARNLKHQRYYKPAPHLPR 653
Query: 797 IVQTPDTE----------------LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 840
+ + P + G C+ C F G R +I CD C +H+ C
Sbjct: 654 LTKPPKQDGPVTYANPMLLRETDNKGDCIRCSKCGFTSQ--GTRPIITCDYCPCRFHLDC 711
Query: 841 LKDHGMEDLQELPKGKWLC 859
L PK W+C
Sbjct: 712 LDPPRANPPN--PKVGWMC 728
>gi|410973162|ref|XP_003993024.1| PREDICTED: tripartite motif-containing protein 66 [Felis catus]
Length = 1211
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ FH+ C LP + P G W C C + Q E +EY
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVFHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--MEY 1019
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1020 DCENAR 1025
>gi|296480218|tpg|DAA22333.1| TPA: hCG23889-like [Bos taurus]
Length = 1213
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 965 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1021
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1022 DCENAR 1027
>gi|444728362|gb|ELW68820.1| Transcription intermediary factor 1-alpha, partial [Tupaia
chinensis]
Length = 869
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 11/95 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 669 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 726
Query: 774 NARAAG----RIEG---VDPFAQMVSRCIRIVQTP 801
+A + + EG + P + V +I++ P
Sbjct: 727 DAPSHNSEKKKTEGLVKLTPIDKRVPDYYKIIKNP 761
>gi|359072874|ref|XP_002693153.2| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1215
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1023
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1024 DCENAR 1029
>gi|440906647|gb|ELR56880.1| Tripartite motif-containing protein 66, partial [Bos grunniens mutus]
Length = 1230
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 982 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1038
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1039 DCENAR 1044
>gi|326671885|ref|XP_003199545.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Danio
rerio]
Length = 1985
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 349 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 404
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + + C R+ + GG +LC CD C
Sbjct: 405 KDDEEEEDEVAGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 441
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV--DRGEEKLP 882
YH+ CL L E+P G+WLC C C + +QK++ G+ LP
Sbjct: 442 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKILHWSWGDPPLP 490
>gi|358415619|ref|XP_599031.6| PREDICTED: tripartite motif-containing protein 66 [Bos taurus]
Length = 1322
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E VEY
Sbjct: 967 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--VEY 1023
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1024 DCENAR 1029
>gi|348553320|ref|XP_003462475.1| PREDICTED: tripartite motif-containing protein 66-like [Cavia
porcellus]
Length = 1392
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E +EY+
Sbjct: 1144 ENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1201
Query: 773 A-NAR 776
NAR
Sbjct: 1202 CENAR 1206
>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
harrisii]
Length = 962
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 721 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 778
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 779 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 805
>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
Length = 912
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 671 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 728
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 729 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 755
>gi|151946233|gb|EDN64464.1| conserved protein [Saccharomyces cerevisiae YJM789]
Length = 684
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHADYDFNQDF 537
>gi|338727101|ref|XP_001500723.2| PREDICTED: tripartite motif-containing protein 66 [Equus caballus]
Length = 1211
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E +EY
Sbjct: 963 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWACTLCRSLTQPE--MEY 1019
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1020 DCENAR 1025
>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
Length = 878
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 654 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 711
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 712 DNSQHSKKGKTAQGLSPVDQ--RKCERLL 738
>gi|115385623|ref|XP_001209358.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187805|gb|EAU29505.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 888
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/186 (26%), Positives = 74/186 (39%), Gaps = 43/186 (23%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------- 759
+EN+D C C G LLCCD C ++H C++ P P G W+C C
Sbjct: 455 QENNDYCRECNGSGQLLCCDGCTDSYHFSCLNPPLDPANPPEGDWYCPRCEIKRPSRKIR 514
Query: 760 --MNTFQKEKFV------EYNANARAA--GRIEGVDPFAQMVSRCIRIVQT---PDTEL- 805
+ +E F +Y A R G+ E PF ++ R +R +T D L
Sbjct: 515 DQLERVSQEDFALPERIRDYFAGVRTGEDGKYEDFIPFPKINPRSMRGTRTGRYDDPHLL 574
Query: 806 ------GGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKDH-GMEDLQ----ELP 853
G + C C ++ G R +I CD C +H+ C M +Q + P
Sbjct: 575 RVHDAKGNLIFCHA---CGRTSNGNRPIIQCDYCPCAFHMDCCDPPLAMPPVQRPGSDKP 631
Query: 854 KGKWLC 859
W+C
Sbjct: 632 HHNWMC 637
>gi|13509324|emb|CAC35389.1| KIAA0298 protein [Homo sapiens]
Length = 1214
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 961 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1019
Query: 768 FVEYN 772
+EY+
Sbjct: 1020 -MEYD 1023
>gi|119589026|gb|EAW68620.1| hCG23889, isoform CRA_a [Homo sapiens]
Length = 1214
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 961 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1019
Query: 768 FVEYN 772
+EY+
Sbjct: 1020 -MEYD 1023
>gi|118344202|ref|NP_001071923.1| zinc finger protein [Ciona intestinalis]
gi|92081546|dbj|BAE93320.1| zinc finger protein [Ciona intestinalis]
Length = 1205
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 32/47 (68%), Gaps = 4/47 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYC 759
N+D CG+C +GGDLLCCDSCP+ FH+ C +P I P G W C C
Sbjct: 933 NEDWCGVCNNGGDLLCCDSCPKVFHVHC-HVPEILAAPPGIWQCMLC 978
>gi|218675692|gb|AAI69321.2| tripartite motif protein 33 isoform 1 [synthetic construct]
Length = 360
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 119 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 176
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 177 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 203
>gi|410905767|ref|XP_003966363.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Takifugu rubripes]
Length = 1967
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 65/149 (43%), Gaps = 28/149 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC----EKEGI---- 434
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + + R + + D + C +C+ G C CD
Sbjct: 435 -QWEAKDELSEGEGEDEEDRRDEGVEEEDDHHIEFCRVCKDGGELLC-----------CD 482
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+W+C
Sbjct: 483 TCPSSYHIHCLN----PPLPEIPNGEWIC 507
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 467 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 510
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 802 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 373 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 426
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 746 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDLSKPE--VEYDC 803
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 804 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 830
>gi|326669387|ref|XP_003199000.1| PREDICTED: hypothetical protein LOC566660 [Danio rerio]
Length = 1062
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 36/60 (60%), Gaps = 4/60 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C+ GGDLLCCD CP+ FH+ C P P G W C C + Q E VEY+
Sbjct: 827 ENEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLHSFPVGDWICTLCRDVEQPE--VEYD 884
>gi|425777912|gb|EKV16064.1| hypothetical protein PDIP_37980 [Penicillium digitatum Pd1]
gi|425779981|gb|EKV18004.1| hypothetical protein PDIG_11750 [Penicillium digitatum PHI26]
Length = 988
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 71/199 (35%), Gaps = 53/199 (26%)
Query: 706 SLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-- 759
S+E P EN D C C GGDLLCCD+C ++H +C+ P P G WHC C
Sbjct: 537 SIEPP----ENSDDCHQCGKGGDLLCCDTCINSYHFECLDPPLDPKNPPQGEWHCPKCSI 592
Query: 760 MNTF-------QKEKFVEYNANARAAGRIEGVDP----------------FAQMVSRCIR 796
N+F K E+ +GVD + + R
Sbjct: 593 RNSFSTLIAHSNHYKKTEFQLPQDIKEHFQGVDEGIVFDDDYARNLKHQRYYKAAPHLPR 652
Query: 797 IVQTPDTE----------------LGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGC 840
+ + P + G C+ C F G R +I CD C +H+ C
Sbjct: 653 LTKAPKQDGPTIYANPMLLREYDNKGDCIRCSKCGFTSQ--GTRPIITCDYCPCRFHLDC 710
Query: 841 LKDHGMEDLQELPKGKWLC 859
L PK W+C
Sbjct: 711 LDPPRAHPPN--PKVGWMC 727
>gi|209977097|ref|NP_055633.1| tripartite motif-containing protein 66 [Homo sapiens]
gi|269849740|sp|O15016.4|TRI66_HUMAN RecName: Full=Tripartite motif-containing protein 66
Length = 1216
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C + Q E
Sbjct: 963 PPAPIENEDFCAVCLNGGELLCCDRCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1021
Query: 768 FVEYN 772
+EY+
Sbjct: 1022 -MEYD 1025
>gi|432942390|ref|XP_004082995.1| PREDICTED: uncharacterized protein LOC101161205 [Oryzias latipes]
Length = 1040
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 18/100 (18%)
Query: 690 IYTSNGVSLHELSIKL-----SLERPFSSKE-------NDDLCGICMDGGDLLCCDSCPR 737
+ +S G SL + S L +P S K+ N+D C +C +GG+LLCCD CP+
Sbjct: 675 VQSSGGSSLPDSSTAAKSKPRQLHKPESEKKAEPEEDPNEDWCAVCQNGGELLCCDKCPK 734
Query: 738 AFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYNAN 774
FH+ C +P + PSG W C +C + E +EY+ N
Sbjct: 735 VFHLSC-HIPALNESPSGEWFCSFCRDLLNPE--MEYDCN 771
>gi|443684710|gb|ELT88567.1| hypothetical protein CAPTEDRAFT_218774, partial [Capitella teleta]
Length = 1064
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 62/150 (41%), Gaps = 33/150 (22%)
Query: 712 SSKENDDL--CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFV 769
S E+D C +C DGGDL+ CD+CP++FH C++L IP G W C C E
Sbjct: 450 SGNESDHFWYCEVCKDGGDLMLCDTCPKSFHQSCINLNEIPDGDWSCPICTGEGLPEDGD 509
Query: 770 EYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILC 829
N+ A+ E Q C R GG VILC
Sbjct: 510 SSNS-AQEEEEGEEETEHDQFCKVCKR---------GG-----------------DVILC 542
Query: 830 DQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
D C +H+ CL L E+P+G W C
Sbjct: 543 DFCSCVFHLRCLN----PPLGEVPEGDWKC 568
>gi|365989444|ref|XP_003671552.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
gi|343770325|emb|CCD26309.1| hypothetical protein NDAI_0H01350 [Naumovozyma dairenensis CBS 421]
Length = 866
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
+ENDD C C+ G LCCD+CPR+FH C+ P +P G W C +C+
Sbjct: 381 QENDDYCSACLQAGSFLCCDTCPRSFHFLCLDPPLDPNNLPEGDWSCHHCL 431
>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
[Monodelphis domestica]
Length = 1143
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 959
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 960 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 986
>gi|432103232|gb|ELK30472.1| Tripartite motif-containing protein 66 [Myotis davidii]
Length = 1324
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 1096 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE 1154
Query: 767 KFVEYNA-NAR 776
+EY+ NAR
Sbjct: 1155 --MEYDCENAR 1163
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 706 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLSKPE--VEYDC 763
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 764 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 790
>gi|357605668|gb|EHJ64730.1| putative Chromodomain helicase-DNA-binding protein Mi-2-like
protein [Danaus plexippus]
Length = 1963
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 60/148 (40%), Gaps = 46/148 (31%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY 771
+E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C YC +E+
Sbjct: 372 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQAEGNQEQ---- 427
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
E D + C + GG +LC CD
Sbjct: 428 ----------EDDDEHQEFCRIC---------KDGGELLC-----------------CDS 451
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH CL L+E+P G+W C
Sbjct: 452 CPSAYHRFCLN----PPLEEVPDGEWKC 475
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ + C IC DGG+LLCCDSCP A+H C++ P +P G W C C
Sbjct: 432 EHQEFCRICKDGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRC 478
>gi|348526504|ref|XP_003450759.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1043
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 6/74 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCCD CP+ FH+ C +P + PSG W C +C + E +EY+
Sbjct: 709 NEDWCAVCQNGGELLCCDKCPKVFHLAC-HIPTLNESPSGEWFCSFCRDLVSPE--MEYD 765
Query: 773 ANARAAGRIEGVDP 786
+++ A E P
Sbjct: 766 CDSKDAPISEKFPP 779
>gi|291384596|ref|XP_002708841.1| PREDICTED: tripartite motif-containing 66 [Oryctolagus cuniculus]
Length = 1211
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 958 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLSQPE- 1016
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1017 -MEYDCENAR 1025
>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
[Monodelphis domestica]
Length = 1126
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 902 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 959
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 960 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 986
>gi|410919013|ref|XP_003972979.1| PREDICTED: transcription intermediary factor 1-alpha-like [Takifugu
rubripes]
Length = 1006
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 41/74 (55%), Gaps = 6/74 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + E +EYN
Sbjct: 690 NEDWCAVCQNGGELLCCDKCPKVFHLTCHIPTLIASPSGEWFCSFCRDLVSPE--MEYNC 747
Query: 774 NARAAGRIEGVDPF 787
A EG P
Sbjct: 748 KEGLAS--EGFPPI 759
>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
[Ornithorhynchus anatinus]
Length = 1050
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 809 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDLGKPE--VEYDC 866
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 867 DNLQHSKKGKTAQGLSPMDQ--RKCERLL 893
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 710 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 767
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 768 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 794
>gi|344280903|ref|XP_003412221.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66-like [Loxodonta africana]
Length = 1343
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P + EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E
Sbjct: 1090 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE- 1148
Query: 768 FVEYNA-NAR 776
+EY+ NAR
Sbjct: 1149 -MEYDCENAR 1157
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 890 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 947
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 948 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 974
>gi|344249575|gb|EGW05679.1| Tripartite motif-containing protein 66 [Cricetulus griseus]
Length = 1297
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 937 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQPE--MEYD 994
Query: 773 A-NAR 776
NAR
Sbjct: 995 CENAR 999
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 967
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 968 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 994
>gi|270013510|gb|EFA09958.1| hypothetical protein TcasGA2_TC012115 [Tribolium castaneum]
Length = 1969
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 59/156 (37%), Gaps = 51/156 (32%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C N
Sbjct: 374 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 422
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D Q R + GG +LC CD C
Sbjct: 423 -----EGPAEQDDDEHQEFCRICKD--------GGELLC-----------------CDSC 452
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC----CADCK 864
YH CL L E+P G W C CA K
Sbjct: 453 PSAYHTHCLN----PPLVEIPDGDWKCPRCSCAPLK 484
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|367000541|ref|XP_003685006.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
gi|357523303|emb|CCE62572.1| hypothetical protein TPHA_0C04220 [Tetrapisispora phaffii CBS 4417]
Length = 669
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 78/219 (35%), Gaps = 80/219 (36%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM---------- 760
+NDD C C G LCCD+CPR+FH C++ P +P G W C C+
Sbjct: 263 DNDDFCSTCQQSGSFLCCDTCPRSFHFLCLNPPLDPDNLPEGDWSCPQCVFKAKNPTIAA 322
Query: 761 -------------------------------NTFQ-----KEKFVEYNANARAAGRIEGV 784
FQ K+ F ANAR + E
Sbjct: 323 YKKSEREFIAELPNKNKLFGKLLFNIQSQNSKQFQLPNTIKDTFENVKANARGQYQDEKE 382
Query: 785 -----------DPFAQMVSRCIRIVQTP----DTELGGCVLCRGRDFC-KSRFG------ 822
P+ Q +++ TP D E G ++C C K++ G
Sbjct: 383 KTPLTERQIFGTPYGQSITKLDGY--TPELHIDPESGEFLICYH---CQKTKMGTWENYE 437
Query: 823 --RRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
R ++ CD C+ +H+ CL D L+ L KW C
Sbjct: 438 TESRLIMRCDYCQTPWHLNCLPDVPRASLKNL-GNKWKC 475
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 767 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 824
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 825 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 851
>gi|401838195|gb|EJT41934.1| RCO1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 703
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 80/216 (37%), Gaps = 78/216 (36%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C +
Sbjct: 277 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPVDPNHLPEGDWHCNECKFKIFINNSLT 336
Query: 762 TFQKEK--FVEYNAN----ARAAGRIEGVDP----FAQMVSRCIRIVQT----------- 800
T +K + F++ N N A+ RIE ++P + V+T
Sbjct: 337 TLKKNESNFIKQNDNVKMFAKLLFRIESLNPRQFQLPNYIKETFPAVKTGSRGQYSDEND 396
Query: 801 -------------------------PDTEL----GGCVLCRGRDFC-KSRFG-------R 823
PD + G ++C C ++R G
Sbjct: 397 KIPLTDRQLFNTSYGQSITKLDSYNPDIHIDSSSGEFLICYK---CHQTRLGSWSHPENS 453
Query: 824 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
R ++ CD C+ +H+ C+ ++L KW C
Sbjct: 454 RLIMTCDYCQTPWHLDCIPRASFKNL----GSKWKC 485
>gi|350588054|ref|XP_003129432.3| PREDICTED: tripartite motif-containing protein 66 [Sus scrofa]
Length = 1250
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 993 PPAPVENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE 1051
Query: 767 KFVEYNA-NAR 776
+EY+ NAR
Sbjct: 1052 --MEYDCENAR 1060
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 758 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 815
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 816 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 842
>gi|6323720|ref|NP_013791.1| Rco1p [Saccharomyces cerevisiae S288c]
gi|2497139|sp|Q04779.1|RCO1_YEAST RecName: Full=Transcriptional regulatory protein RCO1
gi|763022|emb|CAA88800.1| unknown [Saccharomyces cerevisiae]
gi|190408306|gb|EDV11571.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|259148648|emb|CAY81893.1| Rco1p [Saccharomyces cerevisiae EC1118]
gi|285814078|tpg|DAA09973.1| TPA: Rco1p [Saccharomyces cerevisiae S288c]
gi|365763801|gb|EHN05327.1| Rco1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 684
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 739 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 796
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 797 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 823
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|344252625|gb|EGW08729.1| E3 ubiquitin-protein ligase TRIM33 [Cricetulus griseus]
Length = 910
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 669 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 726
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 727 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 753
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 832 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 889
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 890 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 916
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 848
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 849 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 875
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 903 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 960
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 961 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 987
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|392297235|gb|EIW08335.1| Rco1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 684
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 368 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 425
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 426 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 452
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 575
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 576 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 602
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 815 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 872
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 873 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 899
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 901 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 958
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 959 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 985
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 970 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1027
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 1028 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1054
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 956
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 910 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 967
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 968 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 994
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 885 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 942
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 943 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 969
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 518 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 575
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 576 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 602
>gi|189240851|ref|XP_001812556.1| PREDICTED: similar to chromodomain helicase-DNA-binding protein 3
[Tribolium castaneum]
Length = 1966
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 59/156 (37%), Gaps = 51/156 (32%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C N
Sbjct: 371 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCEN----------- 419
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D Q R + GG +LC CD C
Sbjct: 420 -----EGPAEQDDDEHQEFCRICKD--------GGELLC-----------------CDSC 449
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC----CADCK 864
YH CL L E+P G W C CA K
Sbjct: 450 PSAYHTHCLN----PPLVEIPDGDWKCPRCSCAPLK 481
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 899 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 956
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 957 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 983
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 771 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 828
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 829 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 855
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 847 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 904
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 905 DNMQHSKKGKTAQGLSPVDQ--RKCERLL 931
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 860
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 861 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 887
>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
[Desmodus rotundus]
Length = 951
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 726 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 783
Query: 774 NA 775
+A
Sbjct: 784 DA 785
>gi|354495678|ref|XP_003509956.1| PREDICTED: tripartite motif-containing protein 66-like [Cricetulus
griseus]
Length = 1384
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1136 ENEDFCAVCLNGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLMQPE--MEYD 1193
Query: 773 A-NAR 776
NAR
Sbjct: 1194 CENAR 1198
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 962 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1019
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 1020 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1046
>gi|426245702|ref|XP_004016644.1| PREDICTED: LOW QUALITY PROTEIN: tripartite motif-containing protein
66 [Ovis aries]
Length = 1286
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 7/66 (10%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY 771
EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E +EY
Sbjct: 1038 ENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCRSLTQPE--MEY 1094
Query: 772 NA-NAR 776
+ NAR
Sbjct: 1095 DCENAR 1100
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 886 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 943
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 944 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 970
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 803 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 860
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 861 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 887
>gi|47227437|emb|CAG04585.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2248
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 75/169 (44%), Gaps = 29/169 (17%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 272 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDKAPEGKWSCPHC-----EKEGIQWE 326
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD-----FCKSRFGRRTVI 827
A R++ + LG + D FC+ ++
Sbjct: 327 AKDEEFE--------DFEEDSEDRVIS--EVSLGVPMGAEEEDDDHMEFCRVCKDGGELL 376
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
CD C YH+ CL L E+P G+WLC C C I +Q+++
Sbjct: 377 CCDTCTSSYHIHCLN----PPLPEIPNGEWLCPRCT-CPPIKGRVQRIL 420
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%), Gaps = 6/61 (9%)
Query: 705 LSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRY 758
+SL P ++E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C
Sbjct: 347 VSLGVPMGAEEEDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPR 406
Query: 759 C 759
C
Sbjct: 407 C 407
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
AR+AG E + F + + P E G +D+C+ +ILCD C R
Sbjct: 236 ARSAGE-EKEENFKRSCGPGFDLAPVPGDEEGDGYETDHQDYCEVCQQGGEIILCDTCPR 294
Query: 835 EYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL+ +L + P+GKW C
Sbjct: 295 AYHLVCLE----PELDKAPEGKWSC 315
>gi|431919612|gb|ELK18000.1| Tripartite motif-containing protein 66 [Pteropus alecto]
Length = 450
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 44/71 (61%), Gaps = 7/71 (9%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKE 766
P + EN+D C +C++GG+LLCCD CP+ +H+ C LP + P G W C C + Q E
Sbjct: 197 PPAPIENEDFCAVCLNGGELLCCDRCPKVYHLSC-HLPALLSFPGGDWVCTLCCSLTQPE 255
Query: 767 KFVEYNA-NAR 776
+EY+ NAR
Sbjct: 256 --MEYDCENAR 264
>gi|327264985|ref|XP_003217289.1| PREDICTED: tripartite motif-containing protein 66-like [Anolis
carolinensis]
Length = 1265
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ FH+ C +L P G W C C N + E VEY+
Sbjct: 908 ENEDFCAVCLNGGELLCCDYCPKVFHLLCHVPALLSFPVGEWVCTLCSNVEKPE--VEYD 965
Query: 773 A-NAR 776
NAR
Sbjct: 966 CENAR 970
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 885 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 942
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 943 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 969
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 800 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 857
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 858 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 884
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 736 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 793
Query: 774 NA 775
+A
Sbjct: 794 DA 795
>gi|443719422|gb|ELU09603.1| hypothetical protein CAPTEDRAFT_166178 [Capitella teleta]
Length = 1711
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 71/166 (42%), Gaps = 45/166 (27%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C+ + F E
Sbjct: 151 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEQAPEGKWSCPHCVRSSVNIAFEE-- 208
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
G +E Q D + C +C+ G C CD
Sbjct: 209 -----NGPVE----------------QDDDEHMEFCRVCKDGGELLC-----------CD 236
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YHV CL ++ +P G W C C+ C+ + +QK++
Sbjct: 237 TCTSAYHVFCL----APNVANVPDGIWHCPRCS-CQPLKARVQKIL 277
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 39/64 (60%), Gaps = 6/64 (9%)
Query: 702 SIKLSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWH 755
S+ ++ E +++DD C +C DGG+LLCCD+C A+H+ C++ + +P G WH
Sbjct: 201 SVNIAFEENGPVEQDDDEHMEFCRVCKDGGELLCCDTCTSAYHVFCLAPNVANVPDGIWH 260
Query: 756 CRYC 759
C C
Sbjct: 261 CPRC 264
>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
Length = 1138
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 897 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGEWICTFCRDIGKPE--VEYDC 954
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 955 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 981
>gi|156848686|ref|XP_001647224.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
gi|156117909|gb|EDO19366.1| hypothetical protein Kpol_1002p11 [Vanderwaltozyma polyspora DSM
70294]
Length = 682
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 84/223 (37%), Gaps = 70/223 (31%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM-------NT- 762
+NDD C C+ G LCCD+CPR+FH C++ P +P G W C +C+ NT
Sbjct: 272 DNDDFCSTCLQTGSFLCCDTCPRSFHFLCLNPPLDPDQLPEGDWSCPHCIFRMKYPTNTA 331
Query: 763 FQK--EKFV-EYNANARAAGR----IEGVDPFAQMVSRCIRIV----------------- 798
F+K ++FV + R G+ +E +P V + +R
Sbjct: 332 FKKAEKEFVNDLPQQNRLFGKLLFQLESTNPIQYRVPKTVREAFQNVKTGARGQYQDDRE 391
Query: 799 -----------------------QTPDTELGG-------CVLC---RGRDFCKSRFGRRT 825
TPDT + C C + + + +
Sbjct: 392 KEPLTEKLIFGTPYGQSITKLDCYTPDTHINTEDNSFLLCFRCGTTKMGTWDNYKTDSKL 451
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINL 868
++ CD C +H+ C+ + ++ L KW+C + NL
Sbjct: 452 IMKCDYCNTPWHLDCIPEVPRASMKNL-GTKWMCPLHADKANL 493
>gi|47221566|emb|CAF97831.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1989
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 65/153 (42%), Gaps = 36/153 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC----MNTFQKEKF 768
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + ++
Sbjct: 302 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEARDDL 361
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTV 826
+ R EGV+ + D + C +C+ G C
Sbjct: 362 SDGEGEDEEDRRDEGVE-------------EEDDHHIEFCRVCKDGGELLC--------- 399
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C YH+ CL L E+P G+W+C
Sbjct: 400 --CDTCPSSYHIHCLN----PPLPEIPNGEWIC 426
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 386 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 429
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 802 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 292 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 345
>gi|207342360|gb|EDZ70146.1| YMR075Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 671
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDYDFNQDF 537
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 736 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 793
Query: 774 NA 775
+A
Sbjct: 794 DA 795
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 730 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 787
Query: 774 NA 775
+A
Sbjct: 788 DA 789
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 793 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 850
Query: 774 NA 775
+A
Sbjct: 851 DA 852
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 717 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 774
Query: 774 NA 775
+A
Sbjct: 775 DA 776
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 753 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 810
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 811 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 837
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 719 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 776
Query: 774 NA 775
+A
Sbjct: 777 DA 778
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 704 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 761
Query: 774 NA 775
+A
Sbjct: 762 DA 763
>gi|301627494|ref|XP_002942909.1| PREDICTED: tripartite motif-containing protein 66-like [Xenopus
(Silurana) tropicalis]
Length = 1207
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG++LCCD CP+ FH+ C +L P G W C C N + E VEY+
Sbjct: 964 ENEDFCAVCLNGGEMLCCDRCPKVFHLSCHVPALLSFPVGEWLCTLCRNLTKPE--VEYD 1021
Query: 773 AN 774
+
Sbjct: 1022 CD 1023
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 NA 775
+A
Sbjct: 849 DA 850
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 708 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 765
Query: 774 NA 775
+A
Sbjct: 766 DA 767
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 790 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 847
Query: 774 NA 775
+A
Sbjct: 848 DA 849
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|395537374|ref|XP_003770678.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Sarcophilus harrisii]
Length = 386
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 64/147 (43%), Gaps = 26/147 (17%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N
Sbjct: 257 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDN 316
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G DP + D + C +C+ ++ CD C
Sbjct: 317 SEGEEILEEVGGDP-----------EEEDDHHMEFCRVCKDGG---------ELLCCDPC 356
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 357 PSSYHIHCLN----PPLPEIPNGEWLC 379
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD CP ++HI C++ LP IP+G W C +C
Sbjct: 339 EFCRVCKDGGELLCCDPCPSSYHIHCLNPPLPEIPNGEWLCPHC 382
>gi|256273506|gb|EEU08440.1| Rco1p [Saccharomyces cerevisiae JAY291]
Length = 684
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 258 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 317
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 318 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 377
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 378 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 435
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 436 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 469
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 470 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 516
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 517 NGNIQIIPTTSHTDCDFNQDF 537
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 187 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 244
Query: 774 NA 775
+A
Sbjct: 245 DA 246
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 NA 775
+A
Sbjct: 849 DA 850
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 441 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 498
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 499 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 525
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 824 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 881
Query: 774 NA 775
+A
Sbjct: 882 DA 883
>gi|327288760|ref|XP_003229093.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Anolis
carolinensis]
Length = 2059
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CDSCPRA+H+ C+ L P G W C +C + + E
Sbjct: 436 DHQDYCEVCQQGGEIILCDSCPRAYHLVCLDPELDKAPEGKWSCPHCEKEGVQWEPKEEE 495
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + C R+ + GG +LC CD C
Sbjct: 496 DEYEGEMDDAEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 532
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 533 ISSYHIHCLN----PPLPEIPNGEWLCPRCT-CPMLKGRVQKIL 571
>gi|228311800|pdb|2KFT|A Chain A, Nmr Solution Structure Of The First Phd Finger Domain Of
Human Autoimmune Regulator (Aire) In Complex With
Histone H3(1-20cys) Peptide
Length = 56
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 38/55 (69%), Gaps = 2/55 (3%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
+N+D C +C DGG+L+CCD CPRAFH+ C+S P IPSGTW C C+ +E
Sbjct: 2 SKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQE 56
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 1017 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 1074
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 1075 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 1101
>gi|47419909|ref|NP_003843.3| transcription intermediary factor 1-alpha isoform b [Homo sapiens]
gi|114616228|ref|XP_001149035.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3 [Pan
troglodytes]
gi|397484617|ref|XP_003813470.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Pan
paniscus]
gi|4325107|gb|AAD17258.1| transcriptional intermediary factor 1 alpha [Homo sapiens]
gi|51094800|gb|EAL24046.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|119604290|gb|EAW83884.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|119604292|gb|EAW83886.1| tripartite motif-containing 24, isoform CRA_a [Homo sapiens]
gi|410249254|gb|JAA12594.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287762|gb|JAA22481.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1016
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 NA 775
+A
Sbjct: 849 DA 850
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 820 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 877
Query: 774 NA 775
+A
Sbjct: 878 DA 879
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 775 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 832
Query: 774 NA 775
+A
Sbjct: 833 DA 834
>gi|383422495|gb|AFH34461.1| transcription intermediary factor 1-alpha isoform b [Macaca
mulatta]
Length = 1016
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 NA 775
+A
Sbjct: 849 DA 850
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 996 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 1053
Query: 774 NA 775
+A
Sbjct: 1054 DA 1055
>gi|444522352|gb|ELV13370.1| Tripartite motif-containing protein 66 [Tupaia chinensis]
Length = 1183
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 943 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1000
Query: 773 A-NAR 776
NAR
Sbjct: 1001 CENAR 1005
>gi|13992535|emb|CAC38114.1| KIAA0298 protein [Mus musculus]
gi|148684989|gb|EDL16936.1| tripartite motif-containing 66, isoform CRA_b [Mus musculus]
Length = 1209
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 961 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1018
Query: 773 A-NAR 776
NAR
Sbjct: 1019 CENAR 1023
>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
Length = 1615
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 1392 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 1449
Query: 774 NA 775
+A
Sbjct: 1450 DA 1451
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 785 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 842
Query: 774 NA 775
+A
Sbjct: 843 DA 844
>gi|345314790|ref|XP_001520060.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Ornithorhynchus anatinus]
Length = 1760
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 34/147 (23%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ +++
Sbjct: 273 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC-----EKEGIQWE 327
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG + + C R+ + GG +LC CD C
Sbjct: 328 PKEEDEEEEEGGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 364
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 365 PSSYHLHCLN----PPLPEIPNGEWLC 387
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 347 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 390
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 893 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 950
Query: 774 NA 775
+A
Sbjct: 951 DA 952
>gi|348579257|ref|XP_003475397.1| PREDICTED: transcription intermediary factor 1-alpha-like [Cavia
porcellus]
Length = 1165
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 940 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 997
Query: 774 NA 775
+A
Sbjct: 998 DA 999
>gi|432860089|ref|XP_004069385.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Oryzias latipes]
Length = 2111
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 70/164 (42%), Gaps = 36/164 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 502 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEA 557
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + + C R+ + GG +LC CD C
Sbjct: 558 KDEEEDEEEPVGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 594
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 595 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 633
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 250 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 307
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 308 DNLQHSKKGKTAQGLSPVDQR--KCERLL 334
>gi|45861758|gb|AAS78677.1| transcriptional intermediary factor 1 delta short isoform [Mus
musculus]
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 773 A-NAR 776
NAR
Sbjct: 1052 CENAR 1056
>gi|410913151|ref|XP_003970052.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1057
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 41/68 (60%), Gaps = 5/68 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 772
E++D C +C+ GGDLLCCD CP+ FH+ C P PSG W C C ++ E VEY+
Sbjct: 824 ESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLCRDSVLPE--VEYD 881
Query: 773 A-NARAAG 779
N R +G
Sbjct: 882 CENERTSG 889
>gi|189230248|ref|NP_001121448.1| tripartite motif containing 24 [Xenopus (Silurana) tropicalis]
gi|183985692|gb|AAI66206.1| LOC100158542 protein [Xenopus (Silurana) tropicalis]
Length = 1040
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/91 (37%), Positives = 50/91 (54%), Gaps = 12/91 (13%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 819 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSRPE--VEYDC 876
Query: 773 -----ANARAAGRIEGVDPFAQMVSRCIRIV 798
A R G + + P Q +C RI+
Sbjct: 877 DDPSLAEKRKLGGAQSMAPIDQ--RKCERIL 905
>gi|154290947|ref|XP_001546062.1| hypothetical protein BC1G_15537 [Botryotinia fuckeliana B05.10]
Length = 1238
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 70/185 (37%), Gaps = 42/185 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGT---WHCRYCM-NTFQKEKFV 769
ENDD C C G LLCCD C R+FH CV L P P+ T W C C+ + + K
Sbjct: 845 ENDDYCAACGGNGKLLCCDGCTRSFHFKCVDLSPDSPTLTTEEWFCNECLAQSVPRSKDE 904
Query: 770 EYNANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDTEL------- 805
E +E +P A + + IR +VQ P
Sbjct: 905 EIGYFGPLLANMERKNPSAFHLPKAIREYFENVKTGTEGEYEEGVVQKPKNNRGYDEAPD 964
Query: 806 --------GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK- 856
G +LC G R +I C C +H+ CL D + +E P G+
Sbjct: 965 YFKLKDSKGNPILCHGCHLSAVN-PSRAIIPCTYCSLSWHLDCL-DPPL--AKEPPPGRS 1020
Query: 857 WLCCA 861
W C A
Sbjct: 1021 WRCPA 1025
>gi|284004879|ref|NP_862901.3| tripartite motif-containing protein 66 isoform 2 [Mus musculus]
gi|342187123|sp|Q924W6.3|TRI66_MOUSE RecName: Full=Tripartite motif-containing protein 66; AltName:
Full=Transcriptional intermediary factor 1 delta
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 773 A-NAR 776
NAR
Sbjct: 1052 CENAR 1056
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 700 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 757
Query: 774 NA 775
+A
Sbjct: 758 DA 759
>gi|344299817|gb|EGW30170.1| hypothetical protein SPAPADRAFT_144654 [Spathaspora passalidarum
NRRL Y-27907]
Length = 604
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 82/217 (37%), Gaps = 62/217 (28%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 770
+K+NDD C C + G +CC++CP++FH C P P W CR C+ EK
Sbjct: 191 TKDNDDFCSSCGNPGIFICCENCPKSFHFTCCDPPIEQPPEDDWFCRECIAKLHPEKIKT 250
Query: 771 YN---ANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDT------- 803
YN + ++E +P + + +R PD
Sbjct: 251 YNDIGVFGQLLNQLEVKNPKVFQLPKHLREDTFIGVTTGDNGDYSDDTTKPDVPSSKNSQ 310
Query: 804 ----------ELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKD--------- 843
+ G LC C +S RT++ CD C YH+ CL D
Sbjct: 311 DVDIESSLYDKAGNPYLCHK---CGESGMNNRTLMHCDYCPLIYHIDCLNDPICGPKTIG 367
Query: 844 ------HGMEDLQELPKG--KWLCCADCKRINLALQK 872
+ +EDL LP G K DC+ ++ +L K
Sbjct: 368 NKWRCPNHIEDL--LPPGYAKLRQFKDCQVVDSSLHK 402
>gi|187950891|gb|AAI38033.1| Tripartite motif-containing 66 [Mus musculus]
gi|219519295|gb|AAI45107.1| Tripartite motif-containing 66 [Mus musculus]
Length = 1242
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 773 A-NAR 776
NAR
Sbjct: 1052 CENAR 1056
>gi|149068363|gb|EDM17915.1| rCG40452, isoform CRA_b [Rattus norvegicus]
Length = 1210
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 962 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1019
Query: 773 A-NAR 776
NAR
Sbjct: 1020 CENAR 1024
>gi|372467017|pdb|3U5M|A Chain A, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467018|pdb|3U5M|B Chain B, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467019|pdb|3U5M|C Chain C, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467020|pdb|3U5M|D Chain D, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467021|pdb|3U5M|E Chain E, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467022|pdb|3U5M|F Chain F, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467023|pdb|3U5M|G Chain G, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467024|pdb|3U5M|H Chain H, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467025|pdb|3U5M|I Chain I, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467026|pdb|3U5M|J Chain J, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467027|pdb|3U5M|K Chain K, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467028|pdb|3U5M|L Chain L, Crystal Structure Of Trim33 Phd-Bromo In The Free State
gi|372467029|pdb|3U5N|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467031|pdb|3U5N|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-20) K9me3k14ac Histone Peptide
gi|372467033|pdb|3U5O|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467034|pdb|3U5O|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467035|pdb|3U5O|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467036|pdb|3U5O|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467037|pdb|3U5O|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467038|pdb|3U5O|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467039|pdb|3U5O|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467040|pdb|3U5O|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-22) K9me3k14ack18ac Histone Peptide
gi|372467049|pdb|3U5P|A Chain A, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467051|pdb|3U5P|B Chain B, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467053|pdb|3U5P|C Chain C, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467055|pdb|3U5P|D Chain D, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467057|pdb|3U5P|E Chain E, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467059|pdb|3U5P|F Chain F, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467061|pdb|3U5P|G Chain G, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
gi|372467063|pdb|3U5P|H Chain H, Crystal Structure Of The Complex Of Trim33 Phd-Bromo And
H3(1-28) K9me3k14ack18ack23ac Histone Peptide
Length = 207
Score = 67.0 bits (162), Expect = 5e-08, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 6 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 63
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 64 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 90
>gi|284004883|ref|NP_001164384.1| tripartite motif-containing protein 66 isoform 3 [Mus musculus]
Length = 1233
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 773 A-NAR 776
NAR
Sbjct: 1052 CENAR 1056
>gi|347840107|emb|CCD54679.1| hypothetical protein [Botryotinia fuckeliana]
Length = 1267
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 70/185 (37%), Gaps = 42/185 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGT---WHCRYCM-NTFQKEKFV 769
ENDD C C G LLCCD C R+FH CV L P P+ T W C C+ + + K
Sbjct: 874 ENDDYCAACGGNGKLLCCDGCTRSFHFKCVDLSPDSPTLTTEEWFCNECLAQSVPRSKDE 933
Query: 770 EYNANARAAGRIEGVDPFAQMVSRCIR-----------------IVQTPDTEL------- 805
E +E +P A + + IR +VQ P
Sbjct: 934 EIGYFGPLLANMERKNPSAFHLPKAIREYFENVKTGTEGEYEEGVVQKPKNNRGYDEAPD 993
Query: 806 --------GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK- 856
G +LC G R +I C C +H+ CL D + +E P G+
Sbjct: 994 YFKLKDSKGNPILCHGCHLSAVNPS-RAIIPCTYCSLSWHLDCL-DPPL--AKEPPPGRS 1049
Query: 857 WLCCA 861
W C A
Sbjct: 1050 WRCPA 1054
>gi|410919217|ref|XP_003973081.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5-like [Takifugu rubripes]
Length = 1982
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 69/164 (42%), Gaps = 36/164 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEKAPEGKWSCPHC----EKEGIQWEA 382
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ + C R+ + GG +LC CD C
Sbjct: 383 KGEEEEEEEAAGEEEDDHMEFC-RVCKD-----GGELLC-----------------CDTC 419
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L E+P G+WLC C C + +QK++
Sbjct: 420 PSSYHIHCLN----PPLPEIPNGEWLCPRCM-CPPLKGKVQKIL 458
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 787 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 844
Query: 774 NA 775
+A
Sbjct: 845 DA 846
>gi|343961759|dbj|BAK62469.1| transcription intermediary factor 1-alpha [Pan troglodytes]
Length = 375
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 150 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 207
Query: 774 NA 775
+A
Sbjct: 208 DA 209
>gi|323353060|gb|EGA85360.1| Rco1p [Saccharomyces cerevisiae VL3]
Length = 541
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/321 (20%), Positives = 112/321 (34%), Gaps = 112/321 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---------MN 761
EN+D C C G LCCD+CP++FH C+ P +P G WHC C M
Sbjct: 115 ENEDFCSACNQSGSFLCCDTCPKSFHFLCLDPPIDPNNLPKGDWHCNECKFKIFINNSMA 174
Query: 762 TFQK--EKFVEYNANARAAGR----IEGVDP----FAQMVSRCIRIVQT----------- 800
T +K F++ N N + + I+ +P + V+T
Sbjct: 175 TLKKIESNFIKQNNNVKIFAKLLFNIDSHNPKQFQLPNYIKETFPAVKTGSRGQYSDEND 234
Query: 801 -------------------------PDTEL----GGCVLCRGRDFCKSRFG-------RR 824
PDT + G ++C + ++R G R
Sbjct: 235 KIPLTDRQLFNTSYGQSITKLDSYNPDTHIDSNSGKFLICYKCN--QTRLGSWSHPENSR 292
Query: 825 TVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEEKLPET 884
++ CD C+ +H+ C+ ++L KW C P
Sbjct: 293 LIMTCDYCQTPWHLDCVPRASFKNL----GSKWKC----------------------PLH 326
Query: 885 SLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDASDGTRALLSKAVSIFHDRFDPI-IES 943
S + KK DN+V++ V W+ R L++K ++++ I ++
Sbjct: 327 SPTKVYKKIHHCQEDNSVNYKV-WKKQR------------LINKKNQLYYEPLQKIGYQN 373
Query: 944 ASKLDLIPAMVYGRSHRGQDY 964
+ +IP + QD+
Sbjct: 374 NGNIQIIPTTSHTDYDFNQDF 394
>gi|74184798|dbj|BAE27995.1| unnamed protein product [Mus musculus]
Length = 1233
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 994 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1051
Query: 773 A-NAR 776
NAR
Sbjct: 1052 CENAR 1056
>gi|348526369|ref|XP_003450692.1| PREDICTED: chromodomain-helicase-DNA-binding protein 4-like
[Oreochromis niloticus]
Length = 1972
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 64/149 (42%), Gaps = 28/149 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE
Sbjct: 384 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHC----EKEGI---- 435
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + R + + D + C +C+ G C CD
Sbjct: 436 -QWEARDDLSEAEGEDDDDRRDEGMEEEDDHHIEFCRVCKDGGELLC-----------CD 483
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+W+C
Sbjct: 484 TCPSSYHIHCLN----PPLPEIPNGEWIC 508
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 468 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWICPRC 511
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 802 DTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+TE G +D+C+ +ILCD C R YH+ CL D+++ P+GKW C
Sbjct: 374 ETEDGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLD----PDMEKAPEGKWSC 427
>gi|156544115|ref|XP_001605754.1| PREDICTED: PHD finger protein 12-like [Nasonia vitripennis]
Length = 661
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 72/183 (39%), Gaps = 42/183 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC----VSLPGIPSGTWHCRYCMNTFQKEKFV- 769
N D C C DGG+L+CCD CP +FH+ C + L IP+G W C C +KE +
Sbjct: 56 HNRDSCDACHDGGELICCDKCPASFHLQCHDPPLELSDIPNGEWICHACRCAMKKENSIG 115
Query: 770 ----EYNANARAAGRIEGVDPFAQMVSRCIRI-VQTPDTE-LGGCVLCRGRDF------- 816
+ NA A V+P + R ++I + P T+ + RG+
Sbjct: 116 NKRKKKNALEVLALAASLVNPKEFELPRELQIPITFPGTDKIDPVSFKRGKHHNSNNING 175
Query: 817 --------------------CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 856
C+ + +I CD C +H CL L P G+
Sbjct: 176 KIHYHEYGSITPLPARLCFVCRKSCRKAPLIACDYCPLYFHQDCLD----PPLTAFPSGR 231
Query: 857 WLC 859
W+C
Sbjct: 232 WMC 234
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 8/76 (10%)
Query: 719 LCGICMDG---GDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
LC +C L+ CD CP FH DC+ P PSG W C +N F + +
Sbjct: 192 LCFVCRKSCRKAPLIACDYCPLYFHQDCLDPPLTAFPSGRWMCPNHLNHFIDQNLL---T 248
Query: 774 NARAAGRIEGVDPFAQ 789
+ A R++ D FA
Sbjct: 249 SCAATERVKLWDKFAH 264
>gi|335305258|ref|XP_003360173.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Sus scrofa]
Length = 298
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 73 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 130
Query: 774 NA 775
+A
Sbjct: 131 DA 132
>gi|45861756|gb|AAS78676.1| transcriptional intermediary factor 1 delta [Mus musculus]
Length = 1344
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 773 A-NAR 776
NAR
Sbjct: 1154 CENAR 1158
>gi|414866292|tpg|DAA44849.1| TPA: hypothetical protein ZEAMMB73_580600 [Zea mays]
Length = 1013
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 28/149 (18%)
Query: 926 LSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRGQDYHGMYCAILTVNQVVVSAGIFR 985
L AV I H+ F IIE ++ D+ +V+ R
Sbjct: 771 LCIAVDILHECFVTIIEPRTQSDISEDIVFNR---------------------------E 803
Query: 986 IFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKFG 1045
I GQ+ AELPL+ T + + QG + L +EKLL L V+ L+LP+ E WT FG
Sbjct: 804 ICGQKFAELPLIGTRSLYRRQGMCRLLINELEKLLLDLGVERLLLPAVPELLQTWTCSFG 863
Query: 1046 FSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
F++M+ E+ + + ++ FQGT+M QK
Sbjct: 864 FTVMSNSERLELAGN-SILSFQGTTMCQK 891
>gi|242023690|ref|XP_002432264.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
gi|212517673|gb|EEB19526.1| Chromodomain helicase-DNA-binding protein, putative [Pediculus
humanus corporis]
Length = 1999
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 57/150 (38%), Gaps = 51/150 (34%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEY 771
+E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C
Sbjct: 348 QEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGKWSCPHC------------ 395
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILC 829
A G E Q D C LC+ G C C
Sbjct: 396 ----EAEGTQE----------------QDDDEHNEFCRLCKDGGELLC-----------C 424
Query: 830 DQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
D C YH+ CL L E+P G W C
Sbjct: 425 DSCTSAYHIFCLN----PPLSEIPDGDWKC 450
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+++ C +C DGG+LLCCDSC A+HI C++ P IP G W C C
Sbjct: 407 EHNEFCRLCKDGGELLCCDSCTSAYHIFCLNPPLSEIPDGDWKCPRC 453
>gi|284004881|ref|NP_001164383.1| tripartite motif-containing protein 66 isoform 1 [Mus musculus]
Length = 1344
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1096 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1153
Query: 773 A-NAR 776
NAR
Sbjct: 1154 CENAR 1158
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 893 NEDWCAVCQNGGDLLCCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 950
Query: 774 N 774
+
Sbjct: 951 D 951
>gi|326932279|ref|XP_003212247.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Meleagris gallopavo]
Length = 1949
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 33/147 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE +++
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEG-IQWE 371
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG + R+ + GG +LC CD C
Sbjct: 372 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 409
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 410 PSSYHLHCLN----PPLPEIPNGEWLC 432
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 392 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 435
>gi|449269479|gb|EMC80242.1| Transcription intermediary factor 1-alpha, partial [Columba livia]
Length = 914
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 698 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLRSFPSGEWICTFCRDLSKPE--VEYDC 755
Query: 774 NARA 777
+ A
Sbjct: 756 DKPA 759
>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
Length = 1016
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 NA 775
+A
Sbjct: 849 DA 850
>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
Length = 1050
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 NA 775
+A
Sbjct: 883 DA 884
>gi|363741929|ref|XP_003642567.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gallus
gallus]
Length = 1947
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 64/147 (43%), Gaps = 33/147 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE +++
Sbjct: 332 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEG-IQWE 386
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG + R+ + GG +LC CD C
Sbjct: 387 PKEEEDEEEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 424
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 425 PSSYHLHCLN----PPLPEIPNGEWLC 447
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 407 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 450
>gi|300797500|ref|NP_001178768.1| tripartite motif-containing protein 66 [Rattus norvegicus]
Length = 1348
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + Q E +EY+
Sbjct: 1100 ENEDFCAVCINGGELLCCDRCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPE--MEYD 1157
Query: 773 A-NAR 776
NAR
Sbjct: 1158 CENAR 1162
>gi|115470813|ref|NP_001059005.1| Os07g0173400 [Oryza sativa Japonica Group]
gi|113610541|dbj|BAF20919.1| Os07g0173400, partial [Oryza sativa Japonica Group]
Length = 502
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 985 RIFGQELAELPLVATSNDCQGQGYFQSLFCCIEKLLGFLNVKTLVLPSASEAQAIWTNKF 1044
RI G +LAE+P + T + QG L IE L LNV+ LV+P+ E Q WT F
Sbjct: 1 RIHGTDLAEMPFIGTRGIYRRQGMCHRLLNAIESALSSLNVRRLVIPAIPELQNTWTTVF 60
Query: 1045 GFSMMTEEEQNKYRNDYPLMIFQGTSMLQK 1074
GF + ++ K ++ ++I GT +L+K
Sbjct: 61 GFKPVEPSKRQKIKS-LNILIIHGTGLLEK 89
>gi|338722190|ref|XP_001492263.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Equus
caballus]
Length = 1930
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 317 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEE 376
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D + C + GG +LC CD C
Sbjct: 377 EEEEEGGCEEEEDDHMEFCRVC---------KDGGELLC-----------------CDAC 410
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 411 PSSYHLHCLN----PPLPEIPNGEWLC 433
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 393 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 436
>gi|321479460|gb|EFX90416.1| hypothetical protein DAPPUDRAFT_232072 [Daphnia pulex]
Length = 2083
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 60/158 (37%), Gaps = 43/158 (27%)
Query: 711 FSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 761
F S+E D D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 402 FPSEEGDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEAPEGRWSCPHC-- 459
Query: 762 TFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRF 821
+ E +A D ++ C + GG +LC
Sbjct: 460 --EGEGITAATVTEKAGRNAADDDEHSEFCRIC---------KDGGELLC---------- 498
Query: 822 GRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C YH CL L E+P G W C
Sbjct: 499 -------CDSCTSAYHTFCLN----PPLSEIPDGDWKC 525
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ + C IC DGG+LLCCDSC A+H C++ P IP G W C C
Sbjct: 482 EHSEFCRICKDGGELLCCDSCTSAYHTFCLNPPLSEIPDGDWKCPRC 528
>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
Length = 607
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 382 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSKPE--VEYDC 439
Query: 774 NARA 777
+A +
Sbjct: 440 DAPS 443
>gi|189238110|ref|XP_001814014.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270008739|gb|EFA05187.1| hypothetical protein TcasGA2_TC015317 [Tribolium castaneum]
Length = 706
Score = 66.2 bits (160), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 77/209 (36%), Gaps = 62/209 (29%)
Query: 706 SLERPFSSK----ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCR 757
+ E P+ + N D C C +GGDL+CCD CP +FH+ C P IPSG W C
Sbjct: 38 ATEHPYFKRPGKGHNHDSCDACGEGGDLICCDKCPSSFHLQCHDPPLEEKDIPSGEWLCH 97
Query: 758 YCMNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT-----------PDTELG 806
C + K+ N R+A G P VS ++Q P +
Sbjct: 98 SC--KYAKKSNAPQTRNKRSASTPAGKAPKKAKVSPMDLLIQAASSMNPKQFDLPRSMSV 155
Query: 807 GCVLCRGRDFCK---SRFGRRT---------------------------------VILCD 830
C+ G D + SR GR++ +I CD
Sbjct: 156 PCIF-PGTDKVEAPYSRNGRKSMKVHKDHEKKQDATIFLPAKKCHECRKSCRVAPLISCD 214
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C +H+ CL L P G+W+C
Sbjct: 215 FCSLYFHMDCLD----PPLTTPPSGRWMC 239
>gi|449504115|ref|XP_004174568.1| PREDICTED: tripartite motif-containing protein 66 [Taeniopygia
guttata]
Length = 958
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 47/82 (57%), Gaps = 7/82 (8%)
Query: 693 SNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP 750
SNG++ + K + + S EN+D C +C++ G+LLCCD CP+ FH+ C +L P
Sbjct: 692 SNGIATMK---KSPVTQEVSPIENEDFCAVCLNVGELLCCDHCPKVFHLSCHVPALLSFP 748
Query: 751 SGTWHCRYCMNTFQKEKFVEYN 772
G W C C N + E VEY+
Sbjct: 749 VGEWVCSLCRNPVKPE--VEYD 768
>gi|317147911|ref|XP_001822376.2| PHD finger domain protein [Aspergillus oryzae RIB40]
Length = 857
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 763
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 468 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 527
Query: 764 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 799
+ Y A + G+ + V PF ++ R R +++
Sbjct: 528 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 587
Query: 800 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 588 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|238502471|ref|XP_002382469.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
gi|220691279|gb|EED47627.1| PHD finger domain protein, putative [Aspergillus flavus NRRL3357]
Length = 875
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 763
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 468 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 527
Query: 764 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 799
+ Y A + G+ + V PF ++ R R +++
Sbjct: 528 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 587
Query: 800 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 588 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 626
>gi|344282967|ref|XP_003413244.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Loxodonta
africana]
Length = 2101
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 365 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDEE 424
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D + C + GG +LC CD C
Sbjct: 425 EEEEEGGCEEEEDDHMEFCRVC---------KDGGELLC-----------------CDAC 458
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 459 PSSYHLHCLN----PPLPEIPNGEWLC 481
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 441 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 484
>gi|301093217|ref|XP_002997457.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110713|gb|EEY68765.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 248
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 42/61 (68%), Gaps = 3/61 (4%)
Query: 704 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMN 761
+++ +RP ++ ++ + CGIC+ GG LLCCD C RA+H++CV +L +P G W C +C +
Sbjct: 175 RIAQQRP-AADQHLEFCGICLKGGKLLCCDGCERAYHLNCVRPALLDVPEGDWFCSHCRD 233
Query: 762 T 762
Sbjct: 234 A 234
>gi|395526186|ref|XP_003765249.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Sarcophilus
harrisii]
Length = 2043
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 77/179 (43%), Gaps = 36/179 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P G W C +C +KE +++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPEMEKAPEGKWSCPHC----EKEG-IQWE 398
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG + R+ + GG +LC CD C
Sbjct: 399 PKDDDEEDEEGGEEEEDDHMEFCRVCKD-----GGELLC-----------------CDTC 436
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVI 889
YH+ CL L E+P G+WLC C C + +Q+++ + P + V+
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLCPRCT-CPPLKGKVQRILHWKWTEPPAPFITVL 490
>gi|405972247|gb|EKC37026.1| Chromodomain-helicase-DNA-binding protein Mi-2-like protein
[Crassostrea gigas]
Length = 2123
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 714 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+E DD C +C DGG+LLCCD+CP A+H+ C++ P IP G WHC C
Sbjct: 377 QEEDDHMEFCRVCKDGGELLCCDTCPSAYHVHCLNPPMKMIPDGEWHCPRC 427
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 56/149 (37%), Gaps = 51/149 (34%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C L P G W C +C KE
Sbjct: 323 EHQDYCEVCQQGGEIILCDTCPRAYHLVCFDPELEEPPEGKWSCPHCEGEGIKE------ 376
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
Q D + C +C+ G C CD
Sbjct: 377 --------------------------QEEDDHMEFCRVCKDGGELLC-----------CD 399
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YHV CL ++ +P G+W C
Sbjct: 400 TCPSAYHVHCLN----PPMKMIPDGEWHC 424
>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
garnettii]
Length = 1014
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 4/64 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 789 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 846
Query: 774 NARA 777
+ +
Sbjct: 847 DTSS 850
>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Bos taurus]
Length = 1047
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 822 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 879
Query: 774 N 774
+
Sbjct: 880 D 880
>gi|391871058|gb|EIT80224.1| PHD Zn-finger protein [Aspergillus oryzae 3.042]
Length = 730
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 763
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 341 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 400
Query: 764 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 799
+ Y A + G+ + V PF ++ R R +++
Sbjct: 401 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 460
Query: 800 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 461 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|339242107|ref|XP_003376979.1| domain protein, SNF2 family [Trichinella spiralis]
gi|316974280|gb|EFV57776.1| domain protein, SNF2 family [Trichinella spiralis]
Length = 2137
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 59/152 (38%), Gaps = 47/152 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTF--QKEKFV 769
++ D C +C GG+++ CD+CPRA+H+ C+ P + P G W C +C N +
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLD-PDMEEPPGGKWSCPHCENDLVNDNDAVT 395
Query: 770 EYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVI 827
A AG +E C LCR G C
Sbjct: 396 SKEAAPAKAGNME------------------------FCRLCRDGGELLC---------- 421
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C YH CL + L +P+G W C
Sbjct: 422 -CDSCPSSYHRYCL----IPPLTTIPEGDWHC 448
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
N + C +C DGG+LLCCDSCP ++H C+ P IP G WHC C
Sbjct: 406 NMEFCRLCRDGGELLCCDSCPSSYHRYCLIPPLTTIPEGDWHCPRC 451
>gi|47230454|emb|CAF99647.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2153
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 39/68 (57%), Gaps = 10/68 (14%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 772
E++D C +C+ GGDLLCCD CP+ FH+ C P PSG W C C + VEY+
Sbjct: 1877 ESEDFCAVCLIGGDLLCCDRCPKVFHLSCHVPPLLSFPSGDWVCSLC-------REVEYD 1929
Query: 773 A-NARAAG 779
N R +G
Sbjct: 1930 CENERTSG 1937
>gi|83771111|dbj|BAE61243.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 730
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC-----MNTF- 763
+EN+D C C G LLCCD C +FH C++ P P G W C C M T
Sbjct: 341 EENNDFCRECNGSGQLLCCDGCVNSFHFSCLNPPLDPANPPEGDWFCPRCSILKPMGTLL 400
Query: 764 ------------QKEKFVEYNANARAA--GRIEGVDPFAQMVSRCIR----------IVQ 799
+ Y A + G+ + V PF ++ R R +++
Sbjct: 401 VAVDKVSHKDFALPSRLRSYFAGVQTGDKGQYDEVLPFPRINPRSGRNRSGRYDDPFLLR 460
Query: 800 TPDTELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
T D + G +LC G C ++ GRR +I CD C +H+ C+
Sbjct: 461 TVDAK-GKLILCHG---CGRTTNGRRPIIQCDFCPLAFHMDCI 499
>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
taurus]
gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
Length = 1050
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 825 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 882
Query: 774 N 774
+
Sbjct: 883 D 883
>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
mutus]
Length = 940
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 715 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 772
Query: 774 N 774
+
Sbjct: 773 D 773
>gi|345322044|ref|XP_001508010.2| PREDICTED: transcription intermediary factor 1-alpha
[Ornithorhynchus anatinus]
Length = 800
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 575 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLMNFPSGEWICTFCRDLSKPE--VEYD 631
>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
taurus]
gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
Length = 1016
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 791 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLANFPSGEWICTFCRDLSKPE--VEYDC 848
Query: 774 N 774
+
Sbjct: 849 D 849
>gi|363728093|ref|XP_416340.3| PREDICTED: transcription intermediary factor 1-alpha [Gallus
gallus]
Length = 939
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 714 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRDLSKPE--VEYDC 771
Query: 774 N 774
+
Sbjct: 772 D 772
>gi|301616286|ref|XP_002937591.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Xenopus (Silurana) tropicalis]
Length = 1906
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 62/147 (42%), Gaps = 30/147 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 337 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 385
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ Q + + E +FC+ ++ CD C
Sbjct: 386 ---------EGI----QWEPKEDDEDEEDGAEEEEEEEDDHMEFCRVCKDGGELLCCDTC 432
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 415 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPEIPNGEWLCPRC 458
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 865
+D+C+ +ILCD C R YH+ CL +L++ P+GKW C C++
Sbjct: 339 QDYCEVCQQGGEIILCDTCPRAYHLVCLD----PELEKAPEGKW-SCPHCEK 385
>gi|327272268|ref|XP_003220907.1| PREDICTED: transcription intermediary factor 1-alpha-like [Anolis
carolinensis]
Length = 1038
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCCD CP+ FH+ C +L PSG W C +C + E VEY+
Sbjct: 812 NEDWCAVCQNGGELLCCDRCPKVFHLSCHVPTLRNFPSGEWICTFCRDLCDPE--VEYDC 869
Query: 774 N 774
+
Sbjct: 870 D 870
>gi|224099263|ref|XP_002334498.1| predicted protein [Populus trichocarpa]
gi|222872484|gb|EEF09615.1| predicted protein [Populus trichocarpa]
Length = 117
Score = 65.5 bits (158), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/94 (34%), Positives = 50/94 (53%), Gaps = 6/94 (6%)
Query: 437 LRGVVKGSGISCFCDDCKGNQVVTPAVFELHAGSSNKRPPEYIYLENGKTLRDIMNVCKD 496
L G++ G G C C C ++V++ FE HAG+ + P +IYLENGK + I+ K
Sbjct: 7 LDGIIDGGGYLCGCSSCNFSKVLSAYEFEQHAGAKTRHPNNHIYLENGKPIYSIIQELKT 66
Query: 497 SPLETLEKAVRMVLGSS------SMKKANFCLNC 524
+PL ++ ++ V GSS + K + C C
Sbjct: 67 APLSMIDGVIKDVAGSSINEEFFRVWKGDICFIC 100
>gi|403215174|emb|CCK69674.1| hypothetical protein KNAG_0C05760 [Kazachstania naganishii CBS
8797]
Length = 731
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 97/246 (39%), Gaps = 79/246 (32%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHC-------RYCMNTF 763
ENDD C C+ G LCCD+CP++FH C+ P +P G W C +Y NT
Sbjct: 308 ENDDFCSACLQSGSFLCCDTCPKSFHFLCLDPPIDPSNLPEGDWSCPSCEFKSQYATNTQ 367
Query: 764 QK---EKFVEYNAN-------ARAAGRIEGVDPFAQMVSRCIR----------------- 796
K ++F++ A+ ++ +++G +P + + ++
Sbjct: 368 YKKGQKQFIQEAADNGTNKLFSKLLFQLQGFNPKQFNLPQSVKESFPEVSTGDRGQYTDD 427
Query: 797 IVQTPDTELG------GCVLCRGRDFC------------------------KSRFGR--- 823
++TP TE G + + +C ++FG
Sbjct: 428 TMRTPLTEKQMFNSPYGQSITKLDSYCPENHYIDDNSDTTPDRFLLCYKCKTTKFGTWDH 487
Query: 824 ----RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGEE 879
R +I CD C+ +H+ C+ D L+ L KW C N+ Q+ +++ ++
Sbjct: 488 PGNARLLIKCDYCQTPWHLDCIPDVPRASLKIL-GYKWRCPL---HANVKQQRRLNKRQQ 543
Query: 880 KLPETS 885
K E S
Sbjct: 544 KYLEPS 549
>gi|432105627|gb|ELK31821.1| Chromodomain-helicase-DNA-binding protein 3 [Myotis davidii]
Length = 1998
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 429
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 430 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 470
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 471 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPMLKGRVQKIL 511
>gi|326912229|ref|XP_003202456.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Meleagris gallopavo]
Length = 1051
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C SL PSG W C +C + + E VEY+
Sbjct: 826 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPSLMSFPSGEWICTFCRDLSKPE--VEYDC 883
Query: 774 N 774
+
Sbjct: 884 D 884
>gi|334331581|ref|XP_001379177.2| PREDICTED: tripartite motif-containing protein 66-like [Monodelphis
domestica]
Length = 1162
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 4/68 (5%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + + E VEY+
Sbjct: 911 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTKPE--VEYD 968
Query: 773 ANARAAGR 780
+ G
Sbjct: 969 CDNTRYGH 976
>gi|145533979|ref|XP_001452734.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420433|emb|CAK85337.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+D C C GG ++CCD+CP+ FH C+ L +P G W+C C++ F+++
Sbjct: 844 EDRCNKCGQGGKVICCDTCPKVFHTKCLGLKEVPKGKWNCLVCLSNFERQ 893
>gi|301609145|ref|XP_002934140.1| PREDICTED: PHD finger protein 21A-like [Xenopus (Silurana)
tropicalis]
Length = 681
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 703 IKLSLERPFSSKEN--DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRY 758
+ +SL PFS+ + +D C IC G LL CD+C R +H+DC+ P IP G W C
Sbjct: 475 LPVSLPSPFSTDSDIHEDFCSICRKSGQLLMCDTCSRVYHLDCLEPPLKTIPKGMWICPK 534
Query: 759 CMNTFQKE 766
C + K+
Sbjct: 535 CQDQMMKK 542
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 6/50 (12%)
Query: 815 DFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CAD 862
DFC +++CD C R YH+ CL+ L+ +PKG W+C C D
Sbjct: 492 DFCSICRKSGQLLMCDTCSRVYHLDCLEP----PLKTIPKGMWICPKCQD 537
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 31/49 (63%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
S N C +C GG L+CC+SCP AFH DC+ IP G+W+CR C N
Sbjct: 1082 SHVNVSWCFMCSKGGSLMCCESCPAAFHPDCIGYDEIPDGSWYCRDCTN 1130
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 ENDDLCGICMDGGDLLCCD--SCPRAFHIDCVSLPGIPSGTWHCRY 758
E++D C C +GG+L+ CD +CP+A+H+ C+ L P G W C +
Sbjct: 1498 EHEDDCFRCGEGGELVMCDKKTCPKAYHVKCLGLSKRPYGKWECPW 1543
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 712 SSKENDDLCGICMDGGDLLCCD-SCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
S KEN +C +C G+LL C+ C AFH+DC+ L +PSG + C C++
Sbjct: 919 SRKEN--VCQVCERPGELLLCEGQCCGAFHLDCIGLQQMPSGVFKCDECIS 967
>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
Length = 1341
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 31/51 (60%), Gaps = 2/51 (3%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKF 768
C IC DGG LLCCD CPRAFH+ C+ S+ IP W+C+ C + +
Sbjct: 161 CNICKDGGQLLCCDRCPRAFHMSCLGMSVDMIPDSEWYCKMCTECLDRRRL 211
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 709 RPFSSKENDDL-CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
RP + ++ D+ C +C GG+LLCCD CPRAFH++C+ L IP W C C
Sbjct: 1230 RPKAPEDQWDVDCSVCGLGGELLCCDGCPRAFHVNCIGLAEIPDTEWFCNEC 1281
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQ 764
C IC +GG+LLCCD CP FH C+ L IP G C C T +
Sbjct: 1117 CYICTEGGELLCCDGCPHVFHYSCIGLRRIPRGKIFCHECDTTVK 1161
>gi|195428619|ref|XP_002062369.1| GK17504 [Drosophila willistoni]
gi|194158454|gb|EDW73355.1| GK17504 [Drosophila willistoni]
Length = 2023
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 15/103 (14%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C +
Sbjct: 442 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC-------------S 488
Query: 774 NARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF 816
G+ E + + V R + P T G CR R++
Sbjct: 489 CPPLTGKAEKIITWRWAVHRNAAGDEQPSTSKGAAKPCRIREY 531
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 381 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 440
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 441 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 461
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 462 PSAYHTFCLN----PPLDTIPDGDWRC 484
>gi|302915931|ref|XP_003051776.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
gi|256732715|gb|EEU46063.1| hypothetical protein NECHADRAFT_79186 [Nectria haematococca mpVI
77-13-4]
Length = 1194
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 67/165 (40%), Gaps = 43/165 (26%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPS----GTWHCRYCM---------- 760
+ND+ C C GD++CCD CPR+FH +CV + +PS W+C C+
Sbjct: 754 DNDEDCSACGAAGDVVCCDGCPRSFHFECVGM--VPSEDLPDEWYCNECLFKRYPSRVPV 811
Query: 761 ---------NTFQKEKFVEYNANARAAGRIEGVDP----------FAQMVSRCIRIVQTP 801
N +K ++ + R EGV A+ R + P
Sbjct: 812 HKGVFGPALNNLEKSIPRAFSLPKKLQTRFEGVKAGPDGEYEEVTTAKTTKRKNGYEEVP 871
Query: 802 D----TELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
D + G VLC G C K+ R +I C C R +H+ CL
Sbjct: 872 DFFRQRDDGQPVLCHG---CQKAATDVRAIIPCSVCPRYWHIDCL 913
>gi|170579494|ref|XP_001894854.1| PHD-finger family protein [Brugia malayi]
gi|158598405|gb|EDP36306.1| PHD-finger family protein [Brugia malayi]
Length = 598
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 77/203 (37%), Gaps = 62/203 (30%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKE----- 766
N C CM+GG+LLCCD CP +FH+ C P IPSG W C C + +
Sbjct: 55 NSQYCSSCMEGGELLCCDRCPASFHLMCHEPPIERSSIPSGKWLCNRCTHAVANDTSPKA 114
Query: 767 --KFVEYNA----------NARAAGRIE--------------------GVDPFAQMVSRC 794
K + +A NA +G +E + F+
Sbjct: 115 IKKVPKLDAECSSVDARRNNATISGIMEQNTDSGLSALAVLADAALATNAEQFSIPYDLS 174
Query: 795 IRIVQTP------DTEL-----GGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKD 843
+I P T+L C LC R+ + T+I CD C YH+ CL
Sbjct: 175 CQIPPLPFDERLTSTQLVPPPKAVCHLCASRN------EKSTMIQCDFCSLCYHLDCL-- 226
Query: 844 HGMEDLQELPKGKWLCCADCKRI 866
L +PK KW+C A + I
Sbjct: 227 --TPPLPSVPKDKWMCPAHVEHI 247
>gi|440799225|gb|ELR20283.1| PHDfinger domain containing protein [Acanthamoeba castellanii str.
Neff]
Length = 561
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 72/179 (40%), Gaps = 39/179 (21%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM--NTFQKEKFVEY 771
N D C C +GG+L+CCD CP +FH++C++ LP +P G W CR C+ +T + + +E
Sbjct: 26 NHDNCDSCGEGGELICCDRCPASFHLECLNPPLPCVPDGDWFCRACLLQDTPFEPQSMEM 85
Query: 772 NANARAAGRIEGVDPFA-QMVSRCIRIVQTPDTELG------------------------ 806
R +EG + A + I+ V+ D G
Sbjct: 86 TIMGRLLQHLEGRNTVAFSLPLGVIKAVEPEDYADGQDDADAPGESNLFDSDQSDSDRGL 145
Query: 807 --GCVLCRGRDFCK----SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
G R +C R + C +C YH+ CL L P KWLC
Sbjct: 146 RQGKRKRRHDSYCSVCSLPSPSRDDLAQCTRCPHSYHLWCLD----PPLLAKPTVKWLC 200
>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
harrisii]
Length = 880
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 655 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 712
Query: 774 N 774
+
Sbjct: 713 D 713
>gi|391338290|ref|XP_003743492.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1481
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 68/162 (41%), Gaps = 25/162 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
E++D C +C++ L+ C SCPR+FH+ C+ + P W C C +K Y
Sbjct: 990 ESNDFCEVCLNDEQLISCGSCPRSFHLICIQMKRAPRRDWRCLACTAGVKK-----YKQE 1044
Query: 775 ARAAGR-IEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCE 833
+ + IE + F S D + C+ C R I C C
Sbjct: 1045 LKDLKKIIEEKEAFEAKDS------NEEDFSINQCLKC-------GELLSRGHIECIGCG 1091
Query: 834 REYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVD 875
R+YH+ C DL + PKG W C C+ ++L + ++
Sbjct: 1092 RKYHLAC------ADLTKRPKGDWYCKKRCEPGYVSLNQTMN 1127
>gi|395543452|ref|XP_003773631.1| PREDICTED: tripartite motif-containing protein 66 [Sarcophilus
harrisii]
Length = 1496
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C +C++GG+LLCCD CP+ +H+ C +L P G W C C + + E VEY+
Sbjct: 1258 ENEDFCAVCLNGGELLCCDHCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTKPE--VEYD 1315
Query: 773 AN 774
+
Sbjct: 1316 CD 1317
>gi|403420600|ref|NP_001258155.1| chromodomain-helicase-DNA-binding protein 5 [Rattus norvegicus]
Length = 1948
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 339 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 387
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 388 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 432
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 433 PSSYHLHCLN----PPLPEIPNGEWLC 455
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 415 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 458
>gi|355568209|gb|EHH24490.1| hypothetical protein EGK_08151 [Macaca mulatta]
Length = 1931
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 375 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 429
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 430 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 470
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 471 ACISSYHIHCLN----PPLPDIPNGEWLC 495
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 455 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 498
>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Monodelphis domestica]
Length = 1061
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 836 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 893
Query: 774 N----------ARAAGRIEGVD 785
+ + GR+ +D
Sbjct: 894 DDPSYSSEKRKSEGHGRLTPID 915
>gi|47206539|emb|CAF92235.1| unnamed protein product [Tetraodon nigroviridis]
Length = 993
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCM 760
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C CM
Sbjct: 2 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCM 46
>gi|405960336|gb|EKC26267.1| E3 ubiquitin-protein ligase TRIM33 [Crassostrea gigas]
Length = 899
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 2/49 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNT 762
N+D C +C +GGDLLCCD CP+ +H+ C L PS W C C NT
Sbjct: 672 NEDYCAVCQNGGDLLCCDKCPKVYHLKCHIPELKEFPSDEWQCTMCTNT 720
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 2/51 (3%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF 768
C IC DGG+LLCCD CPRAFH++C+ + IP W+C+ C + +
Sbjct: 161 CNICKDGGELLCCDRCPRAFHMNCLGMSEDMIPDSEWYCKMCSECLDRRRL 211
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 26/40 (65%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
C +C GG+LLCCD CPRAFH+ C+ L IP W C C
Sbjct: 1195 CSVCGLGGELLCCDGCPRAFHVTCIGLEKIPETEWFCNEC 1234
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
C IC +GG+L+CCD CP FH C+ L +P G C C
Sbjct: 1062 CYICSEGGELVCCDGCPHVFHYSCIGLRRVPRGKIFCHEC 1101
>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
[Monodelphis domestica]
Length = 1060
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 835 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 892
Query: 774 N----------ARAAGRIEGVD 785
+ + GR+ +D
Sbjct: 893 DDPSYSSEKRKSEGHGRLTPID 914
>gi|148682990|gb|EDL14937.1| mCG131426 [Mus musculus]
Length = 1955
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 400
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 401 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 445
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 446 PSSYHLHCLN----PPLPEIPNGEWLC 468
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 428 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 471
>gi|124487025|ref|NP_001074845.1| chromodomain helicase DNA binding protein 5 isoform 1 [Mus
musculus]
Length = 1952
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 391
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 436
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|431894030|gb|ELK03836.1| Chromodomain-helicase-DNA-binding protein 3 [Pteropus alecto]
Length = 2007
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 383 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 437
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 438 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 478
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 479 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 519
>gi|345800551|ref|XP_536627.3| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Canis lupus familiaris]
Length = 1999
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + D + C +C+ G C CD
Sbjct: 432 AKEEEE---------EYEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 471
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 472 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 512
>gi|189458814|ref|NP_083492.2| chromodomain helicase DNA binding protein 5 isoform 2 [Mus
musculus]
Length = 1915
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 391
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 392 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 436
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 437 PSSYHLHCLN----PPLPEIPNGEWLC 459
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 419 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 462
>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Monodelphis domestica]
Length = 1027
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 45/82 (54%), Gaps = 14/82 (17%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 802 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSKPE--VEYDC 859
Query: 774 N----------ARAAGRIEGVD 785
+ + GR+ +D
Sbjct: 860 DDPSYSSEKRKSEGHGRLTPID 881
>gi|297791319|ref|XP_002863544.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
gi|297309379|gb|EFH39803.1| hypothetical protein ARALYDRAFT_917093 [Arabidopsis lyrata subsp.
lyrata]
Length = 2251
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 44/76 (57%), Gaps = 7/76 (9%)
Query: 691 YTSNG-VSLHELSIKLSLERPFSSKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVS 745
YTS+ + H L L+ ER S ++ +D C IC GGDLLCCDSCPR +H C++
Sbjct: 44 YTSSAKPTKHRLKTDLTPERNSSKRKGNDGNYFECVICDLGGDLLCCDSCPRTYHTACLN 103
Query: 746 LP--GIPSGTWHCRYC 759
P IP+G W C C
Sbjct: 104 PPLKRIPNGKWICPKC 119
>gi|198432555|ref|XP_002131918.1| PREDICTED: similar to chromodomain helicase DNA binding protein 3
[Ciona intestinalis]
Length = 1904
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 76/186 (40%), Gaps = 29/186 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD CPRA+H+ C+ P P G+W C C+
Sbjct: 342 DHQDYCEVCKQGGEIILCDGCPRAYHLVCLEPPLDQPPEGSWPCPTCVKN---------- 391
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G+ P + + + E + +FC +++CD C
Sbjct: 392 ----------GIKPKVRGAEKDEDYDDLEEEEEAEENMDEHMEFCSRCKDGGDLLICDTC 441
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 890
YH+ CL ++++P+G+W C C C + Q+++ P T + +
Sbjct: 442 PHSYHLNCLN----PPVEKVPEGEWSCPRCT-CPMLKGKCQRILAWKWRDAPYTEVADNR 496
Query: 891 KKHEES 896
HE S
Sbjct: 497 PGHEGS 502
>gi|145547050|ref|XP_001459207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427031|emb|CAK91810.1| unnamed protein product [Paramecium tetraurelia]
Length = 927
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+D C C GG ++CCD+CP+ FH C++L +P G W+C C+ F+++
Sbjct: 865 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLRNFERQ 914
>gi|357527416|ref|NP_666131.3| chromodomain helicase DNA binding protein 3 [Mus musculus]
Length = 2055
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 430 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 484
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + D + C +C+ G C CD
Sbjct: 485 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 524
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 525 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 565
>gi|148678548|gb|EDL10495.1| mCG140617 [Mus musculus]
Length = 1826
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 63/149 (42%), Gaps = 33/149 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 254
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + D + C +C+ G C CD
Sbjct: 255 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 294
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 295 ACISSYHIHCLN----PPLPDIPNGEWLC 319
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 279 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 322
>gi|440908595|gb|ELR58598.1| Chromodomain-helicase-DNA-binding protein 5, partial [Bos grunniens
mutus]
Length = 1920
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 315 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 363
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 364 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 408
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 409 PSSYHLHCLN----PPLPEIPNGEWLC 431
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 391 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 434
>gi|395836470|ref|XP_003791177.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Otolemur garnettii]
Length = 1964
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 428
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 429 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 469
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 TCISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 510
>gi|292622418|ref|XP_685699.4| PREDICTED: chromodomain-helicase-DNA-binding protein 4 isoform 1
[Danio rerio]
Length = 1953
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 761
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C++
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRCLS 503
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 68/149 (45%), Gaps = 27/149 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHC-----EKEGIQWE 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A ++ E + + D + C +C+ G C CD
Sbjct: 428 AREESSEGEEEN---DDGRRDDGDVEEEDDHHMEFCRVCKDGGELLC-----------CD 473
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+W+C
Sbjct: 474 TCPSSYHLHCLN----PPLPDIPNGEWIC 498
>gi|355753729|gb|EHH57694.1| hypothetical protein EGM_07385 [Macaca fascicularis]
Length = 1961
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|403274994|ref|XP_003929245.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Saimiri boliviensis boliviensis]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|241998816|ref|XP_002434051.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
gi|215495810|gb|EEC05451.1| transcription intermediary factor 1-alpha, putative [Ixodes
scapularis]
Length = 907
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/55 (47%), Positives = 34/55 (61%), Gaps = 2/55 (3%)
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQ 764
SS N+D C +C DGG+LLCC SCPR +H+ C SL PS W C C++ +
Sbjct: 689 SSDPNEDWCSVCHDGGELLCCGSCPRVYHLQCHVPSLSATPSEDWTCLLCLDILK 743
>gi|403274996|ref|XP_003929246.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Saimiri boliviensis boliviensis]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|426384011|ref|XP_004058570.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Gorilla gorilla gorilla]
Length = 2000
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|159162701|pdb|1MM2|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b
gi|319443275|pdb|2L75|A Chain A, Solution Structure Of Chd4-Phd2 In Complex With H3k9me3
Length = 61
Score = 64.7 bits (156), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/52 (48%), Positives = 35/52 (67%), Gaps = 2/52 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
P S + + C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 2 PLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 53
>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
Length = 764
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 38/59 (64%), Gaps = 4/59 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 539 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYD 595
>gi|332847234|ref|XP_003315413.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Pan troglodytes]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|24308089|ref|NP_056372.1| chromodomain-helicase-DNA-binding protein 5 [Homo sapiens]
gi|51701343|sp|Q8TDI0.1|CHD5_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 5;
Short=CHD-5; AltName: Full=ATP-dependent helicase CHD5
gi|19773960|gb|AAL98962.1|AF425231_1 chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|119591922|gb|EAW71516.1| chromodomain helicase DNA binding protein 5 [Homo sapiens]
gi|148922387|gb|AAI46382.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|151555557|gb|AAI48804.1| Chromodomain helicase DNA binding protein 5 [synthetic construct]
gi|261857536|dbj|BAI45290.1| Chromodomain-helicase-DNA-binding protein 5 [synthetic construct]
Length = 1954
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|426384013|ref|XP_004058571.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Gorilla gorilla gorilla]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|426240369|ref|XP_004014081.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Ovis aries]
Length = 2056
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 325 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 373
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 374 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 418
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 419 PSSYHLHCLN----PPLPEIPNGEWLC 441
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 401 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 444
>gi|52630322|ref|NP_005843.2| chromodomain-helicase-DNA-binding protein 3 isoform 2 [Homo
sapiens]
gi|119610520|gb|EAW90114.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
gi|119610522|gb|EAW90116.1| chromodomain helicase DNA binding protein 3, isoform CRA_a [Homo
sapiens]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|402898652|ref|XP_003912334.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Papio anubis]
Length = 1966
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|397477893|ref|XP_003810301.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pan paniscus]
Length = 2011
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|355678671|gb|AER96180.1| chromodomain helicase DNA binding protein 3 [Mustela putorius furo]
Length = 1740
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 252 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 306
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 307 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 347
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 348 ACISSYHIHCLN----PPLPDIPNGEWLC 372
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 332 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 375
>gi|52630326|ref|NP_001005273.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Homo
sapiens]
gi|88911273|sp|Q12873.3|CHD3_HUMAN RecName: Full=Chromodomain-helicase-DNA-binding protein 3;
Short=CHD-3; AltName: Full=ATP-dependent helicase CHD3;
AltName: Full=Mi-2 autoantigen 240 kDa protein; AltName:
Full=Mi2-alpha; AltName: Full=Zinc finger helicase;
Short=hZFH
gi|119610521|gb|EAW90115.1| chromodomain helicase DNA binding protein 3, isoform CRA_b [Homo
sapiens]
Length = 2000
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|410979901|ref|XP_003996319.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Felis catus]
Length = 2100
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 471 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 525
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 526 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 566
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 567 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 607
>gi|402898650|ref|XP_003912333.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Papio anubis]
Length = 2000
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|354501163|ref|XP_003512662.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like
[Cricetulus griseus]
Length = 1977
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 327 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 375
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 376 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 420
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 421 PSSYHLHCLN----PPLPEIPNGEWLC 443
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 403 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 446
>gi|332847232|ref|XP_003339343.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Pan
troglodytes]
Length = 2000
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|3298562|gb|AAC39923.1| zinc-finger helicase [Homo sapiens]
Length = 2000
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|291405109|ref|XP_002719030.1| PREDICTED: chromodomain helicase DNA binding protein 3-like
[Oryctolagus cuniculus]
Length = 1910
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 371 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 425
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 426 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 466
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 467 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 507
>gi|402852746|ref|XP_003891074.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 1
[Papio anubis]
Length = 1954
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|383415425|gb|AFH30926.1| chromodomain-helicase-DNA-binding protein 3 isoform 1 [Macaca
mulatta]
Length = 1996
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 428 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 468
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 469 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 509
>gi|380787663|gb|AFE65707.1| chromodomain-helicase-DNA-binding protein 5 [Macaca mulatta]
Length = 1954
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|354469736|ref|XP_003497281.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3-like [Cricetulus griseus]
Length = 1959
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 36/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 394 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 448
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + D + C +C+ G C CD
Sbjct: 449 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 488
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 489 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 529
>gi|334323402|ref|XP_001369227.2| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Monodelphis
domestica]
Length = 2114
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 528 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 582
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 583 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 623
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 624 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 664
>gi|315113501|pdb|3O33|A Chain A, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113502|pdb|3O33|B Chain B, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113503|pdb|3O33|C Chain C, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113504|pdb|3O33|D Chain D, Crystal Structure Of Trim24 Phd-Bromo In The Free State
gi|315113505|pdb|3O34|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(13-32)k23ac Peptide
gi|315113507|pdb|3O35|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113508|pdb|3O35|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(23-31)k27ac Peptide
gi|315113511|pdb|3O36|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113512|pdb|3O36|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H4(14-19)k16ac Peptide
gi|315113515|pdb|3O37|A Chain A, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113516|pdb|3O37|B Chain B, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113517|pdb|3O37|C Chain C, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
gi|315113518|pdb|3O37|D Chain D, Crystal Structure Of Trim24 Phd-Bromo Complexed With
H3(1-10)k4 Peptide
Length = 184
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 41/64 (64%), Gaps = 4/64 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 3 NEDWCAVCQNGGELLCCEKCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSKPE--VEYDC 60
Query: 774 NARA 777
+A +
Sbjct: 61 DAPS 64
>gi|2645433|gb|AAB87383.1| CHD3 [Homo sapiens]
Length = 1944
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|395533467|ref|XP_003768781.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Sarcophilus harrisii]
Length = 1971
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 352 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 406
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 407 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 447
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 448 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 488
>gi|397503175|ref|XP_003822207.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Pan
paniscus]
Length = 1957
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 61/148 (41%), Gaps = 33/148 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E +
Sbjct: 343 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCEKEGIQWEPKDDD 402
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
+ E D + C + GG +LC CD
Sbjct: 403 DEEEEGGCEEEEEDDHMEFCRVC---------KDGGELLC-----------------CDA 436
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L E+P G+WLC
Sbjct: 437 CPSSYHLHCLN----PPLPEIPNGEWLC 460
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 420 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 463
>gi|332847230|ref|XP_512012.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 3
[Pan troglodytes]
Length = 2058
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 435 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 489
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 490 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 530
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 531 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 571
>gi|158420731|ref|NP_001005271.2| chromodomain-helicase-DNA-binding protein 3 isoform 3 [Homo
sapiens]
Length = 2059
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 436 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 490
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 491 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 531
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 532 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 572
>gi|145482349|ref|XP_001427197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394277|emb|CAK59799.1| unnamed protein product [Paramecium tetraurelia]
Length = 922
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 33/50 (66%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+D C C GG ++CCD+CP+ FH C++L +P G W+C C+ F+++
Sbjct: 860 EDRCKKCNKGGKVICCDTCPKVFHPKCINLKEVPQGKWNCLNCLTNFERQ 909
>gi|162318864|gb|AAI56473.1| Chromodomain helicase DNA binding protein 3 [synthetic construct]
Length = 2045
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 422 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 476
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 477 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 517
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 518 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 558
>gi|395836468|ref|XP_003791176.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Otolemur garnettii]
Length = 1998
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 374 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 428
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 429 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 469
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 TCISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 510
>gi|109113157|ref|XP_001111066.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
2 [Macaca mulatta]
Length = 1981
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|344290176|ref|XP_003416814.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Loxodonta
africana]
Length = 1863
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 283 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 337
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 338 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 378
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 379 ACISSYHIHCLN----PPLPDIPNGEWLC 403
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 363 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 406
>gi|145513166|ref|XP_001442494.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409847|emb|CAK75097.1| unnamed protein product [Paramecium tetraurelia]
Length = 906
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 33/50 (66%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
+D C C GG ++CCD+CP+ FH C+ L IP G W+C C++ F+++
Sbjct: 844 EDQCKECGKGGKVICCDTCPKVFHAKCLGLKEIPKGRWNCLVCLSNFERQ 893
>gi|109113159|ref|XP_001110923.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like isoform
1 [Macaca mulatta]
Length = 1947
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|332250910|ref|XP_003274592.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Nomascus leucogenys]
Length = 1985
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 432 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 472
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 473 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|345800756|ref|XP_546747.3| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Canis lupus
familiaris]
Length = 1986
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 426
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 427 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 471
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|402852748|ref|XP_003891075.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 isoform 2
[Papio anubis]
Length = 1951
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 389
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 390 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 434
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|358416078|ref|XP_609360.5| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 1991
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 378 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 426
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 427 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 471
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 472 PSSYHLHCLN----PPLPEIPNGEWLC 494
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 454 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 497
>gi|417413984|gb|JAA53300.1| Putative chromatin remodeling complex wstf-iswi small subunit,
partial [Desmodus rotundus]
Length = 1846
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 398
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 399 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 439
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 440 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 480
>gi|395841073|ref|XP_003793373.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Otolemur
garnettii]
Length = 2088
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 479 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 527
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 528 ---------EGIQWEPKEDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 572
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 573 PSSYHLHCLN----PPLAEIPNGEWLC 595
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ P IP+G W C C
Sbjct: 555 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLAEIPNGEWLCPRC 598
>gi|395748521|ref|XP_002827042.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Pongo abelii]
Length = 1993
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 363 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 417
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 418 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 458
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 459 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 499
>gi|351701586|gb|EHB04505.1| Chromodomain-helicase-DNA-binding protein 3 [Heterocephalus glaber]
Length = 1774
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 392 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 446
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 447 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 487
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 488 ACISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 528
>gi|5834582|emb|CAB55313.1| rfg7 protein [Homo sapiens]
Length = 1052
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GGDLLCC+ CP+ H+ C +L PSG W C +C + + E VEY+
Sbjct: 811 NEDWCAVCQNGGDLLCCEKCPKVSHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 868
Query: 774 N----ARAAGRIEGVDPFAQMVSRCIRIV 798
+ ++ +G+ P Q +C R++
Sbjct: 869 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 895
>gi|359074223|ref|XP_002694217.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Bos taurus]
Length = 2042
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 477
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 522
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|260792165|ref|XP_002591087.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
gi|229276287|gb|EEN47098.1| hypothetical protein BRAFLDRAFT_69357 [Branchiostoma floridae]
Length = 1005
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 759
N+D C +C +GGDLLCCD+CP+ +H+ C ++P +PSG + C C
Sbjct: 789 NEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 834
>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
Length = 1017
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 792 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 849
Query: 774 N 774
+
Sbjct: 850 D 850
>gi|410966154|ref|XP_003989600.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Felis
catus]
Length = 2003
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 416
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 417 ---------EGIQWEPKDDEDDEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 461
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 462 PSSYHLHCLN----PPLPEIPNGEWLC 484
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 444 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 487
>gi|170036699|ref|XP_001846200.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
gi|167879513|gb|EDS42896.1| chromodomain helicase-DNA-binding protein 3 [Culex
quinquefasciatus]
Length = 1982
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 49/164 (29%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 397 EHQDFCEVCQQGGEIILCDTCPKAYHLVCLEPELEDTPEGKWSCPTC------------- 443
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D Q R + GG +LC CD C
Sbjct: 444 --EADGGVAEDDDDEHQEFCRICKD--------GGELLC-----------------CDMC 476
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH CL L ++P G W C C+ C + +QK++
Sbjct: 477 PSAYHTFCL----TPPLDDIPDGDWRCPRCS-CPPLQYKVQKIL 515
>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase Trim24; AltName: Full=Tripartite motif-containing
protein 24
gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
Length = 1051
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 826 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 883
Query: 774 N 774
+
Sbjct: 884 D 884
>gi|449433493|ref|XP_004134532.1| PREDICTED: uncharacterized protein LOC101204186 [Cucumis sativus]
Length = 2368
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 711 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 763
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 102 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 158
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 815
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 159 QKNDLP-----LDATSYLDTISKRARTKVISAKCKNGIKSSDTEKVSRIFGSSILAKKRS 213
Query: 816 FCKSR---------FGRRTV 826
K + FGR++V
Sbjct: 214 SNKRKSILAHKVKTFGRKSV 233
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 80/199 (40%), Gaps = 45/199 (22%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP---GIPSGTWHCRYC---------- 759
+ +ND+ C C + GD+LCCD CPR+FH +CV+L +P W+C C
Sbjct: 892 ATDNDEFCSACGNAGDVLCCDGCPRSFHFECVNLTQSEDLPD-DWYCSECIMRRFPSRVP 950
Query: 760 ---------MNTFQKEKFVEYNANARAAGRIEGVDP-----FAQMVSRCIR----IVQTP 801
+N +K ++ R EGV + ++V + ++ +TP
Sbjct: 951 IHKGAFAPALNALEKSIPRAFSLPKHIQNRFEGVKAGPGGDYEEIVGKTVKRRTGFDETP 1010
Query: 802 DT----ELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGK 856
D + VLC C KS R ++ C C +H+ CL L P K
Sbjct: 1011 DLFKQRDENQPVLCHA---CQKSSNDTRAILPCSLCSYYWHLDCLD----PPLAVPPVLK 1063
Query: 857 -WLCCADCKRINLALQKLV 874
W C A I + +L
Sbjct: 1064 SWRCPAHVDHILADVPRLA 1082
>gi|149024737|gb|EDL81234.1| rCG30890, isoform CRA_a [Rattus norvegicus]
Length = 1668
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 67 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 115
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 116 ---------EGIQWEPKDDDEEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 160
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 161 PSSYHLHCLN----PPLPEIPNGEWLC 183
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 143 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 186
>gi|296479122|tpg|DAA21237.1| TPA: chromodomain helicase DNA binding protein 5 [Bos taurus]
Length = 2099
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 429 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 477
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 478 ---------EGIQWEPKDDDDDEDEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 522
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 523 PSSYHLHCLN----PPLPEIPNGEWLC 545
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 505 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 548
>gi|390462993|ref|XP_002806848.2| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3, partial [Callithrix jacchus]
Length = 1943
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 32/149 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 421
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCD 830
A + + D + C +C+ G C CD
Sbjct: 422 AKEEEEE--------YEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------CD 462
Query: 831 QCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 463 ACISSYHIHCLN----PPLPDIPNGEWLC 487
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 447 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 490
>gi|168051599|ref|XP_001778241.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162670338|gb|EDQ56908.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1245
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 40/71 (56%), Gaps = 4/71 (5%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMNTF-QKEKFVEY 771
+++D C IC GGDLLCCD+C FH+ C+ P +P G W C C+N EKF++
Sbjct: 51 QSEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKCVNPLGDVEKFIDT 110
Query: 772 NANA-RAAGRI 781
+ GRI
Sbjct: 111 QMRPMKVPGRI 121
>gi|242024147|ref|XP_002432491.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212517929|gb|EEB19753.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1049
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 35/50 (70%), Gaps = 4/50 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMN 761
N+D C +CMDGG+L+CCD+CP+ FH++C +L +P + TW C C +
Sbjct: 847 NEDWCAVCMDGGELICCDNCPKVFHVNCHIPALKAMPGETETWQCLLCTD 896
>gi|50290691|ref|XP_447778.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527089|emb|CAG60725.1| unnamed protein product [Candida glabrata]
Length = 671
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 44/109 (40%), Gaps = 36/109 (33%)
Query: 688 QHIYTSNGVSLHELSIKLSLE--RPFSSK------------------------------E 715
+H+ G++LH IK L+ F SK E
Sbjct: 212 EHVLVPKGINLHNPHIKTKLKDTHAFDSKTKLISQNPVGQMNDIAMAKKKSSDGTTQEIE 271
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
NDD C C G LCCD+CP++FH C++ P +P G W C CM
Sbjct: 272 NDDYCSACFQTGSFLCCDTCPKSFHFLCLNPPLDPDHLPEGDWSCPQCM 320
>gi|111226798|ref|XP_642746.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|90970774|gb|EAL68857.2| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 914
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 31/46 (67%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
NDD C C DGGDLLCCD+C +FH+ C++ P IPSG W C C
Sbjct: 60 NDDYCSGCGDGGDLLCCDNCQASFHLICLNPPLNEIPSGDWFCDSC 105
>gi|426327635|ref|XP_004024622.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Gorilla
gorilla gorilla]
Length = 2024
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 411 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 459
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 460 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 504
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 505 PSSYHLHCLN----PPLPEIPNGEWLC 527
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 487 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 530
>gi|395538667|ref|XP_003771296.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 4 [Sarcophilus harrisii]
Length = 2011
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 523 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 566
>gi|428182510|gb|EKX51370.1| hypothetical protein GUITHDRAFT_102641 [Guillardia theta CCMP2712]
Length = 1947
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 32/52 (61%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
E DD C IC +GG+L+CCDSCPR H C+ L IP G ++C C KE
Sbjct: 1757 EWDDYCLICFEGGNLICCDSCPRTVHAACLGLSKIPKGDFYCFDCERERAKE 1808
>gi|405959089|gb|EKC25157.1| Bromodomain adjacent to zinc finger domain protein 1A [Crassostrea
gigas]
Length = 1488
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKF 768
ND++C +C GG L+CCD+CP +H+DC P +P G W C+ C K K
Sbjct: 1226 NDNVCAVCSTGGMLVCCDTCPLVYHLDCAVPPLKKVPRGKWQCQLCTGVTTKGKI 1280
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 8/54 (14%)
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCK 864
CR C+ + ++LCD+C+R +H+ CLK L+ +PKG W C DCK
Sbjct: 1132 CR---ICRRKGDAEQMLLCDKCDRGHHMYCLKPR----LKHVPKGDWF-CPDCK 1177
>gi|390465301|ref|XP_003733383.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 5 [Callithrix jacchus]
Length = 1887
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 326 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 374
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 375 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 419
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 420 PSSYHLHCLN----PPLPEIPNGEWLC 442
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 402 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 445
>gi|355557485|gb|EHH14265.1| hypothetical protein EGK_00158 [Macaca mulatta]
Length = 2247
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 492
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 493 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 537
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 538 PSSYHLHCLN----PPLPEIPNGEWLC 560
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 520 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 563
>gi|348571006|ref|XP_003471287.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Cavia
porcellus]
Length = 2442
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 732 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 780
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 781 ---------EGIQWEPKDDEDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 825
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 826 PSSYHLHCLN----PPLPEIPNGEWLC 848
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 808 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 851
>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
Length = 981
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 756 NEDWCAVCQNGGELLCCEKCPKVFHLTCQVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 813
Query: 774 N 774
+
Sbjct: 814 D 814
>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
Length = 981
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 756 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 813
Query: 774 N 774
+
Sbjct: 814 D 814
>gi|242000110|ref|XP_002434698.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
gi|215498028|gb|EEC07522.1| chromodomain helicase DNA binding protein, putative [Ixodes
scapularis]
Length = 1882
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 33/47 (70%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ + C +C DGG+LLCCDSCP AFH C++ P +P+G W+C C
Sbjct: 473 EHMEFCRVCKDGGELLCCDSCPAAFHTFCLNPPLKNVPTGKWNCPRC 519
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 54/159 (33%)
Query: 710 PFSSKEND-------DLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCM 760
P S END D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 403 PNSDAENDGYETDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEEPPEGKWSCPHCE 462
Query: 761 NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSR 820
+E+ + + + C R+ + GG +LC
Sbjct: 463 GEGIQEQEED------------------EHMEFC-RVCKD-----GGELLC--------- 489
Query: 821 FGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C +H CL L+ +P GKW C
Sbjct: 490 --------CDSCPAAFHTFCLN----PPLKNVPTGKWNC 516
>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
Length = 516
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG+LLCC+ CP+ FH+ C +L PSG W C +C + + E V+Y+
Sbjct: 291 NEDWCAVCQNGGELLCCEKCPKVFHLTCHVPTLTNFPSGEWICTFCRDLSKPE--VDYDC 348
Query: 774 N 774
+
Sbjct: 349 D 349
>gi|307107106|gb|EFN55350.1| hypothetical protein CHLNCDRAFT_134366 [Chlorella variabilis]
Length = 884
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 30/45 (66%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
+NDDLC IC GGDL+CC++CP FH C+ L P G +HC C
Sbjct: 117 DNDDLCHICGLGGDLMCCETCPGVFHAACLGLAAPPEGDYHCPLC 161
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 83/177 (46%), Gaps = 12/177 (6%)
Query: 859 CCADCKRINLALQKLVDRGEEKLPETSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVDA 918
C + R + ++ R +LP SL V++ G+ A D R + D
Sbjct: 366 CMVEGARATPSAGEVSSRAFAELP-LSLLVVR------GAAAAAPRDCR-GIAPAYTPDQ 417
Query: 919 SDGTRALLSKAVSIFHDRFDPIIESASKLDLIPAMVYGRSHRG--QDYHGMYCAILTVNQ 976
R LS A+ + H ++P+ +S + D++P ++ G G DY GM+ A+L
Sbjct: 418 RQELRVALSAALHVLHSCYEPLPDSRTGADMLPWLLRGAVLAGGKADYSGMHTAVLFAGP 477
Query: 977 VVVSAGIFRIFGQELAELPLVATSNDCQGQ-GYFQSLFCCIEKLLGFLNVKTLVLPS 1032
V+ +FR FG +LAE+P++A + Q + G + L +E+LL K + P+
Sbjct: 478 AAVAVAVFRSFG-DLAEVPVLAVRPELQRRNGLGRLLLAAVEQLLLLAGAKAIFTPA 533
>gi|224096253|ref|XP_002191494.1| PREDICTED: transcription intermediary factor 1-alpha [Taeniopygia
guttata]
Length = 1061
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C +GG LLCC+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 836 NEDWCAVCQNGGKLLCCEKCPKVFHLSCHVPTLMSFPSGEWICTFCRDLSKPE--VEYDC 893
Query: 774 N 774
+
Sbjct: 894 D 894
>gi|170058471|ref|XP_001864936.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877568|gb|EDS40951.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 830
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 75/213 (35%), Gaps = 76/213 (35%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQ------ 764
N+D C C +GGDL+CCD CP +FH+ C P IP+G W C C T Q
Sbjct: 43 HNNDTCDACGEGGDLICCDRCPSSFHLGCHDPPLSEQDIPNGLWICHTCKMTDQGSATQE 102
Query: 765 --------------KEKFVEYNANAR-----AAG----------RIE-------GVDPFA 788
K K + +N+R +AG R+E + P
Sbjct: 103 TRPDAAMAESSQISKIKKMRNRSNSRKNSANSAGGAEKGTIVDERVEKELEDLSSLSPLD 162
Query: 789 QMVSRCIRIVQTPDTEL----------------------GGCVLCRGRDFCKSRFGRRTV 826
Q++ R RI+ EL G C C R +
Sbjct: 163 QLI-RAARILNPRQFELPREINSHFPFPGTEKHDSGQKNGNAKTCHT---CGKSCRRAPL 218
Query: 827 ILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
I CD C+ +H CL L LP W+C
Sbjct: 219 IACDYCDLFFHQDCLD----PPLTALPTSMWMC 247
>gi|449490665|ref|XP_004158671.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101228553, partial [Cucumis sativus]
Length = 1851
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 711 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 763
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 102 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 158
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 815
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 159 QKNDLP-----LDATSYLDTISKRARTKVISAKCKNGIKSSDTEKVSRIFGSSILAKKRS 213
Query: 816 FCKSR---------FGRRTV 826
K + FGR++V
Sbjct: 214 SNKRKSILAHKVKTFGRKSV 233
>gi|307136401|gb|ADN34210.1| chromatin remodeling complex subunit [Cucumis melo subsp. melo]
Length = 2374
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 67/140 (47%), Gaps = 32/140 (22%)
Query: 711 FSSKE--NDDL---CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTF 763
FSSK+ ND C +C GG+LLCCDSCPR +H+ C++ P IP G WHC C
Sbjct: 109 FSSKKKGNDGYFFECVVCDLGGNLLCCDSCPRTYHLQCLNPPLKRIPMGKWHCPTCN--- 165
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQ---MVSRCIRIVQTPDTE-----LGGCVLCRGRD 815
QK A ++ + A+ + ++C +++ DTE G +L + R
Sbjct: 166 QKNDLP-----LDATSYLDTISKRARTKVVSAKCKNGIKSSDTEKVSRIFGSSILAKKRS 220
Query: 816 FCKSR---------FGRRTV 826
K + FGR++V
Sbjct: 221 SNKRKSILAHKVKTFGRKSV 240
>gi|334348294|ref|XP_001369474.2| PREDICTED: chromodomain-helicase-DNA-binding protein 4 [Monodelphis
domestica]
Length = 1823
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 361 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 404
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 66/155 (42%), Gaps = 33/155 (21%)
Query: 726 GGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYNANARAAGRIEG 783
GG+++ CD+CPRA+H+ C+ + P G W C +C + + E N+ G
Sbjct: 290 GGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQWEAKEDNSEGEEILEEVG 349
Query: 784 VDPFAQMVSRCIRIVQTPDTELGGCVLCR--GRDFCKSRFGRRTVILCDQCEREYHVGCL 841
DP + D + C +C+ G C CD C YH+ CL
Sbjct: 350 GDP-----------EEEDDHHMEFCRVCKDGGELLC-----------CDTCPSSYHIHCL 387
Query: 842 KDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
L E+P G+WLC C C + +QK++
Sbjct: 388 N----PPLPEIPNGEWLCPRCT-CPSLKGKVQKIL 417
>gi|62530240|gb|AAX85377.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1924
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 333 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 392
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + + C R+ + GG +LC CD C
Sbjct: 393 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 429
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 430 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 468
>gi|47220585|emb|CAG05611.1| unnamed protein product [Tetraodon nigroviridis]
Length = 185
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 759
+DD C +C++GG+LLCCD CP+ +H+ C L G P G W C C
Sbjct: 1 SDDFCAVCLNGGELLCCDRCPKVYHLSCHLPPLSGFPQGDWVCTLC 46
>gi|260817522|ref|XP_002603635.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
gi|229288956|gb|EEN59646.1| hypothetical protein BRAFLDRAFT_240330 [Branchiostoma floridae]
Length = 190
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 22/46 (47%), Positives = 33/46 (71%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 759
N+D C +C +GGDLLCCD+CP+ +H+ C ++P +PSG + C C
Sbjct: 13 NEDWCAVCHNGGDLLCCDTCPKVYHLTCHVPNIPAMPSGDFMCTLC 58
>gi|62530244|gb|AAX85379.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530246|gb|AAX85380.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530248|gb|AAX85381.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
gi|62530250|gb|AAX85382.1| chromodomain helicase DNA-binding protein 3 long isoform [Rattus
norvegicus]
Length = 1959
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + + C R+ + GG +LC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 469
>gi|62530236|gb|AAX85375.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
gi|62530238|gb|AAX85376.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1925
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 393
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + + C R+ + GG +LC CD C
Sbjct: 394 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 430
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 431 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 469
>gi|62530242|gb|AAX85378.1| chromodomain helicase DNA-binding protein 3 short isoform [Rattus
norvegicus]
Length = 1927
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 336 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 395
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + + C R+ + GG +LC CD C
Sbjct: 396 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 432
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 433 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 471
>gi|350585547|ref|XP_003481984.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like [Sus
scrofa]
Length = 1865
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 334 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 389
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + R+ + GG +LC CD C
Sbjct: 390 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 427
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 428 PSSYHLHCLN----PPLPEIPNGEWLC 450
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 410 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 453
>gi|403297789|ref|XP_003939734.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Saimiri
boliviensis boliviensis]
Length = 2203
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 591 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 639
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 640 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 684
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 685 PSSYHLHCLN----PPLPEIPNGEWLC 707
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 667 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 710
>gi|392351358|ref|XP_220602.6| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2069
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + E
Sbjct: 444 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 503
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + + C R+ + GG +LC CD C
Sbjct: 504 EDYEEEGEEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDAC 540
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 541 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 579
>gi|156402732|ref|XP_001639744.1| predicted protein [Nematostella vectensis]
gi|156226874|gb|EDO47681.1| predicted protein [Nematostella vectensis]
Length = 180
Score = 63.5 bits (153), Expect = 6e-07, Method: Composition-based stats.
Identities = 24/46 (52%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYC 759
NDD C +C +GG+LLCCD+CPR FH+ C S+ PS W C C
Sbjct: 4 NDDWCAVCRNGGELLCCDTCPRVFHLQCHIPSVNNTPSDKWSCGLC 49
>gi|426238820|ref|XP_004013342.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Ovis aries]
Length = 2020
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 395 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 449
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 450 AKEEEEDYEEDGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 492
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 493 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 531
>gi|395731282|ref|XP_002811619.2| PREDICTED: chromodomain-helicase-DNA-binding protein 5, partial
[Pongo abelii]
Length = 1588
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C EK
Sbjct: 51 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC------EK----- 99
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
EG+ + + E CR CK ++ CD C
Sbjct: 100 ---------EGIQWEPKDDDDEEEEGGCEEEEDDHMEFCR---VCKD---GGELLCCDAC 144
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 145 PSSYHLHCLN----PPLPEIPNGEWLC 167
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 127 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 170
>gi|363755388|ref|XP_003647909.1| hypothetical protein Ecym_7248 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891945|gb|AET41092.1| hypothetical protein Ecym_7248 [Eremothecium cymbalariae
DBVPG#7215]
Length = 617
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 37/74 (50%), Gaps = 11/74 (14%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 770
ENDD C C+ G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 230 ENDDFCSSCLQTGIFLCCDTCPKSFHFACLNPPLDPDNLPEGDWSCYEC-------RFKQ 282
Query: 771 YNANARAAGRIEGV 784
N N R E +
Sbjct: 283 MNPNKSVMARNEKL 296
>gi|2645435|gb|AAB87384.1| CHD3 [Drosophila melanogaster]
Length = 1518
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 37 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 83
>gi|196008026|ref|XP_002113879.1| hypothetical protein TRIADDRAFT_27056 [Trichoplax adhaerens]
gi|190584283|gb|EDV24353.1| hypothetical protein TRIADDRAFT_27056, partial [Trichoplax
adhaerens]
Length = 871
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
D C +C DGG LLCCDSCP ++H+ C++ P IP G W C C+
Sbjct: 1 DFCHVCKDGGQLLCCDSCPLSYHLRCLNPPLEDIPEGDWRCPRCL 45
>gi|332265298|ref|XP_003281663.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5 [Nomascus
leucogenys]
Length = 2435
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 812 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 867
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + R+ + GG +LC CD C
Sbjct: 868 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 905
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 906 PSSYHLHCLN----PPLPEIPNGEWLC 928
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 888 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 931
>gi|311268331|ref|XP_003131999.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 1
[Sus scrofa]
Length = 1968
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 432 AKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 474
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|213408481|ref|XP_002175011.1| transcriptional regulatory protein RCO1 [Schizosaccharomyces
japonicus yFS275]
gi|212003058|gb|EEB08718.1| transcriptional regulatory protein RCO1 [Schizosaccharomyces
japonicus yFS275]
Length = 335
Score = 63.2 bits (152), Expect = 7e-07, Method: Composition-based stats.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 4/50 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
+N+D CG C G LCC+SCPR+FH+ C++ P IP G+W+C CM
Sbjct: 62 QNEDFCGACGGQGLFLCCESCPRSFHLSCLNPPLSRNDIPEGSWYCNKCM 111
>gi|413946875|gb|AFW79524.1| hypothetical protein ZEAMMB73_072548 [Zea mays]
Length = 537
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%)
Query: 712 SSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
S +E D +C +C+DGG+LL CD CP AFH CV L P G W C C
Sbjct: 460 SEEEGDSVCSVCIDGGELLLCDKCPSAFHHACVGLQATPEGDWCCPLC 507
>gi|194378472|dbj|BAG63401.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 10/89 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN- 772
N+D C +C +GGDLL C+ CP+ FH+ C +L PSG W C +C + + E VEY+
Sbjct: 392 NEDWCAVCQNGGDLLRCEKCPKVFHLTCHVPTLLSFPSGDWICTFCRDIGKPE--VEYDC 449
Query: 773 ---ANARAAGRIEGVDPFAQMVSRCIRIV 798
+++ +G+ P Q +C R++
Sbjct: 450 DNLQHSKKGKTAQGLSPVDQ--RKCERLL 476
>gi|168012021|ref|XP_001758701.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
gi|162690311|gb|EDQ76679.1| SNF2 family DNA-dependent ATPase [Physcomitrella patens subsp.
patens]
Length = 1220
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMNTF-QKEKFVE 770
+++D C IC GGDLLCCD+C FH+ C+ P +P G W C C+N EKF++
Sbjct: 51 QSEDFCSICHLGGDLLCCDTCTAVFHLGCLDPPMKVVPRGKWSCPKCVNPLGDVEKFID 109
>gi|341900249|gb|EGT56184.1| hypothetical protein CAEBREN_32223 [Caenorhabditis brenneri]
Length = 1816
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 81/218 (37%), Gaps = 57/218 (26%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C C GG+L+ CD+CPRA+H C+ ++ P G W C +CM
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCM------------ 300
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
I +P Q DFCK ++LCD C
Sbjct: 301 ---EHGPEIVKEEPAKQ-----------------------NDDFCKICKETENLLLCDNC 334
Query: 833 EREYHVGCLKDHGMEDLQELPKG--KWLC--CADCKRINLALQKLVDRGEE-----KLPE 883
+H C+ L ELP W C C K A + L R +E LPE
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWACPRCELAKPEQKAEKILCWRWKEIPYADPLPE 390
Query: 884 ----TSLDVIKKKHEESGSDNAVDFDVRWRVLRGKKVD 917
T D++ K + +F ++W+ L + D
Sbjct: 391 GTEPTEDDLLLKPPRKMAPRREREFFIKWKYLSYWQCD 428
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSG--TWHCRYC 759
+K+NDD C IC + +LL CD+C +FH C+ P +P +W C C
Sbjct: 312 AKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
>gi|281205840|gb|EFA80029.1| hypothetical protein PPL_06850 [Polysphondylium pallidum PN500]
Length = 704
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 32/48 (66%), Gaps = 2/48 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYCMN 761
N+D C C DGGDLLCC++C +FH+ C++ P +P G W+C C N
Sbjct: 87 NNDFCYSCRDGGDLLCCENCELSFHLLCLNPPNPEVPDGDWYCTRCTN 134
>gi|348690302|gb|EGZ30116.1| hypothetical protein PHYSODRAFT_474458 [Phytophthora sojae]
Length = 239
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 704 KLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMN 761
+++ RP ++ ++ + C +C+ GG LLCCD C RA+H++CV SL +P G W C YC +
Sbjct: 179 RMARMRP-AADQHREYCEMCLKGGKLLCCDGCERAYHLNCVRPSLLDVPEGDWFCPYCRD 237
Query: 762 T 762
Sbjct: 238 A 238
>gi|341891282|gb|EGT47217.1| CBN-LET-418 protein [Caenorhabditis brenneri]
Length = 1835
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 79/212 (37%), Gaps = 57/212 (26%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C C GG+L+ CD+CPRA+H C+ ++ P G W C +CM
Sbjct: 253 ENNDYCEECKSGGELILCDTCPRAYHTVCIDANMEEAPEGDWSCPHCM------------ 300
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
I +P Q DFCK ++LCD C
Sbjct: 301 ---EHGPEIVKEEPAKQ-----------------------NDDFCKICKETENLLLCDNC 334
Query: 833 EREYHVGCLKDHGMEDLQELPKG--KWLC--CADCKRINLALQKLVDRGEE-----KLPE 883
+H C+ L ELP W C C K A + L R +E LPE
Sbjct: 335 TCSFHAYCMD----PPLLELPPQDESWACPRCELAKPEQKAEKILCWRWKEIPYADPLPE 390
Query: 884 ----TSLDVIKKKHEESGSDNAVDFDVRWRVL 911
T D++ K + +F ++W+ L
Sbjct: 391 GTEPTEDDLLLKPPRKMAPRREREFFIKWKYL 422
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 4/51 (7%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSG--TWHCRYC 759
+K+NDD C IC + +LL CD+C +FH C+ P +P +W C C
Sbjct: 312 AKQNDDFCKICKETENLLLCDNCTCSFHAYCMDPPLLELPPQDESWACPRC 362
>gi|161611630|gb|AAI55800.1| Wu:fd12d03 protein [Danio rerio]
Length = 1074
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 761
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C++
Sbjct: 458 EFCRVCKDGGELLCCDTCPSSYHLHCLNPPLPDIPNGEWICPRCLS 503
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/147 (23%), Positives = 64/147 (43%), Gaps = 23/147 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ + P GTW C +C +++ +++
Sbjct: 373 DHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMERAPEGTWSCPHC-----EKEGIQWE 427
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A ++ D +FC+ ++ CD C
Sbjct: 428 AREESS------------EGEEENDDGRRDDGDVEEEDDHHMEFCRVCKDGGELLCCDTC 475
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L ++P G+W+C
Sbjct: 476 PSSYHLHCLN----PPLPDIPNGEWIC 498
>gi|367008710|ref|XP_003678856.1| hypothetical protein TDEL_0A03130 [Torulaspora delbrueckii]
gi|359746513|emb|CCE89645.1| hypothetical protein TDEL_0A03130 [Torulaspora delbrueckii]
Length = 689
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 58/130 (44%), Gaps = 6/130 (4%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQK 765
P ENDD C C+ G LCCD+CP++FH C++ P +P G W C C +
Sbjct: 279 PTQEVENDDFCSSCLQSGSFLCCDTCPKSFHFLCLNPPLDADNLPEGDWSCPQCTFKQKH 338
Query: 766 EKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT 825
+ + + + R E + P A++ + + ++ + + F R G R
Sbjct: 339 QNLTQIRKSEKEFIRTE-LPPSARLFGKLLFQLEASNPKQFSLPPSIKDSFQHVRSGSRG 397
Query: 826 VILCDQCERE 835
CD+ E+E
Sbjct: 398 Q-YCDEREKE 406
>gi|311268329|ref|XP_003132000.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 isoform 2
[Sus scrofa]
Length = 2002
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 432 AKEEDDDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 474
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 475 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 513
>gi|430812048|emb|CCJ30503.1| unnamed protein product [Pneumocystis jirovecii]
Length = 635
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 115/295 (38%), Gaps = 64/295 (21%)
Query: 689 HIYTSNGVSLHELSIKLSLERPFSSK-ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
++YT ++H S E P + +++D C C G LCC+SCPR+FH C+ P
Sbjct: 217 NLYTGKATNIHIQQQLSSQEDPTKVRNQSNDFCSACKRHGRFLCCESCPRSFHFSCIEPP 276
Query: 748 ----GIPSGTWHCRYC-------------MNTFQKEKFV-----EYNANARAAGRIEGVD 785
+P +WHC C + F E + E++ EGV
Sbjct: 277 IDENCLPEDSWHCTLCRTKRFPPPKQPRGIFAFLMEYIIRNNPKEFSLPLNIREYFEGVS 336
Query: 786 --PFAQMVSRCIRIVQTPDTELGGCVLCRGRD------------FC----KSRFGRRTVI 827
P + I++ + + G L RD C ++ + ++
Sbjct: 337 TGPNGEYQD----IMEHKNQKFGRQKLMEERDPFQLEDKNGNPILCYKCSQNAMNGKAIV 392
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQ-----KLVD----RGE 878
CD C +H+ C+ D M + + KW+C + ++ +++D RG
Sbjct: 393 SCDYCSLHWHIDCI-DPPMSSIPSASR-KWMCPNHAYHVTPKIRQPKKSRILDVCLKRGF 450
Query: 879 EKLPETSLDVI-------KKKHEESGSDNAVDFDVRWRVLR-GKKVDASDGTRAL 925
+ +D KK+ + S D+ + ++R+R+ G K+D D R L
Sbjct: 451 KNNGNIEIDNTEEEEEEQKKRKKISFDDSYIINNIRYRLPESGVKLDFLDKVRTL 505
>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
Length = 209
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 37/55 (67%), Gaps = 4/55 (7%)
Query: 715 ENDDLCGICMDGGDLLCCD-SCPRAFHIDCVSLPGIPSG-TWHCRYCMNTFQKEK 767
END++C IC DGGDLL CD C R +H+ C++L +P G TW C YC QKEK
Sbjct: 53 ENDEICYICGDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLCPYCAR--QKEK 105
>gi|312372079|gb|EFR20122.1| hypothetical protein AND_20633 [Anopheles darlingi]
Length = 2227
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 64/164 (39%), Gaps = 50/164 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 454 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 500
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A G + D Q R + GG +LC CD C
Sbjct: 501 ---EAEGPADEDDDEHQEFCRICKD--------GGELLC-----------------CDNC 532
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH CL L ++P G W C C+ C + +QK++
Sbjct: 533 PSAYHTFCLN----PPLDDIPDGDWRCPRCS-CPPLPDKVQKIL 571
>gi|157134600|ref|XP_001663323.1| chromodomain helicase DNA binding protein [Aedes aegypti]
gi|108870421|gb|EAT34646.1| AAEL013136-PA [Aedes aegypti]
Length = 1983
Score = 63.2 bits (152), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
E+ + C +C DGG++LCCDSCP A+H C++ P IP G W C C
Sbjct: 434 EHQEFCRVCKDGGEMLCCDSCPSAYHTWCLTPPLDDIPDGDWRCPRC 480
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 50/164 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 376 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 422
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A G + D Q R + GG +LC CD C
Sbjct: 423 ---EAEGPADEDDDEHQEFCRVCKD--------GGEMLC-----------------CDSC 454
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH CL L ++P G W C C+ C I + K++
Sbjct: 455 PSAYHTWCL----TPPLDDIPDGDWRCPRCS-CPPIPYKVAKIL 493
>gi|397596945|gb|EJK56894.1| hypothetical protein THAOC_23124 [Thalassiosira oceanica]
Length = 1752
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 34/62 (54%), Gaps = 2/62 (3%)
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKE 766
+P E+DD C +C GGDLLCCD C +A+H+ C L IP G W C+ C K
Sbjct: 683 KPLFKGEHDDTCYMCYQGGDLLCCDYCSKAYHMKCHLPPLTEIPEGNWKCQECAAVEMKR 742
Query: 767 KF 768
F
Sbjct: 743 MF 744
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 73/172 (42%), Gaps = 43/172 (25%)
Query: 715 ENDDLCGICMDGG-----------------------DLLCCDSCPRAFHIDC--VSLPGI 749
E+D C IC +GG DL+CCD CP+ +H +C + +
Sbjct: 505 EHDAECFICGEGGGKRLITYSHSHHTHSPFTPLTYTDLVCCDGCPKVYHSNCHKPKIREL 564
Query: 750 PSGTWHCRYC--MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGG 807
P G W C +C +K+K+ + A+ G E VD A+ V +C V+ P+ E
Sbjct: 565 PDGEWLCMHCKPKGADRKKKYQGFRL-AKIPG--ETVDSPARHV-KC--TVRWPEME--- 615
Query: 808 CVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
C++C G + + C C+ YH C+ L+ P GKW C
Sbjct: 616 CIICEGTEV-TGPLKDNDWVTCATCDDAYHTRCV------GLETRPGGKWRC 660
Score = 47.4 bits (111), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 50/135 (37%), Gaps = 37/135 (27%)
Query: 728 DLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC---MNTFQKEKFVEYNANARAAGRIEGV 784
D + C +C A+H CV L P G W C C K K + NA ++ + E
Sbjct: 632 DWVTCATCDDAYHTRCVGLETRPGGKWRCPTCKERKKLIPKTKPNKENAPSKPLFKGEHD 691
Query: 785 DPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDH 844
D C Q GG +LC CD C + YH+ C
Sbjct: 692 D-------TCYMCYQ------GGDLLC-----------------CDYCSKAYHMKC---- 717
Query: 845 GMEDLQELPKGKWLC 859
+ L E+P+G W C
Sbjct: 718 HLPPLTEIPEGNWKC 732
>gi|392332091|ref|XP_001079343.3| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Rattus
norvegicus]
Length = 2080
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 70/165 (42%), Gaps = 33/165 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 454 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 513
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
G E + C R+ + GG +LC CD
Sbjct: 514 EDYEEERGGKERRREEDDHMEYC-RVCKD-----GGELLC-----------------CDA 550
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 551 CISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 590
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 714 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 527 REEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 577
>gi|363745215|ref|XP_003643224.1| PREDICTED: chromodomain-helicase-DNA-binding protein 5-like,
partial [Gallus gallus]
Length = 137
Score = 62.8 bits (151), Expect = 1e-06, Method: Composition-based stats.
Identities = 26/48 (54%), Positives = 35/48 (72%), Gaps = 3/48 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCM 760
+N+D C + DGG+L+CCD CPRAFH+ C+ LP +PSGTW C C+
Sbjct: 1 DNEDECSV-RDGGELICCDGCPRAFHLPCLVPPLPRVPSGTWQCSSCV 47
>gi|345481883|ref|XP_001605650.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Nasonia vitripennis]
Length = 2009
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 76/204 (37%), Gaps = 51/204 (25%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEK 767
P ++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C N
Sbjct: 362 PDGEGDHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHCEND----- 416
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVI 827
G +E D + C + GG +LC
Sbjct: 417 -----------GALEDDDEHMEFCRVC---------KDGGELLC---------------- 440
Query: 828 LCDQCEREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETS 885
CD C YH CL L E+P G W C C+ + L R +E + S
Sbjct: 441 -CDSCTSAYHTHCLN----PPLTEIPDGDWKCPRCSCPPLFGKVAKILTWRWKEVMDPPS 495
Query: 886 LDVIKKKHEESGSDNAV-DFDVRW 908
+ K SG + +F V+W
Sbjct: 496 EEPSTSKASSSGKPRKIREFFVKW 519
>gi|323456424|gb|EGB12291.1| hypothetical protein AURANDRAFT_61296 [Aureococcus anophagefferens]
Length = 757
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 29/44 (65%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N+D C IC GG+LLCCD+C RAFH+ C L +P G W C C
Sbjct: 697 NEDHCFICRSGGELLCCDTCERAFHVTCCGLADVPEGAWSCYVC 740
>gi|307180228|gb|EFN68261.1| Nucleosome-remodeling factor subunit NURF301 [Camponotus
floridanus]
Length = 3651
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 49/119 (41%), Gaps = 32/119 (26%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 770
S DD C +C GDLLCC++CP FH++CV P +P+ W C C
Sbjct: 361 SMHYDDHCRVCHRLGDLLCCETCPAVFHLECVEPPLVDVPTDDWQCSTC----------- 409
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILC 829
A ++ G V CI PD E G LCR R+GR+ L
Sbjct: 410 ------KANKVMG-------VVDCI-----PDVEKNGS-LCRQEHLGFDRYGRKYWFLA 449
>gi|170051000|ref|XP_001861566.1| nucleosome-remodeling factor subunit NURF301 [Culex
quinquefasciatus]
gi|167872443|gb|EDS35826.1| nucleosome-remodeling factor subunit NURF301 [Culex
quinquefasciatus]
Length = 2337
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
DD C IC GDLLCC++CP FH+DCV P +PS W C C
Sbjct: 366 DDHCRICHRLGDLLCCETCPAVFHLDCVDPPLIDVPSEDWQCNLC 410
>gi|149053041|gb|EDM04858.1| chromodomain helicase DNA binding protein 3 [Rattus norvegicus]
Length = 1827
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 62/148 (41%), Gaps = 30/148 (20%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 200 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 259
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
G E + C R+ + GG +LC CD
Sbjct: 260 EDYEEERGGKERRREEDDHMEYC-RVCKD-----GGELLC-----------------CDA 296
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLC 859
C YH+ CL L ++P G+WLC
Sbjct: 297 CISSYHIHCLN----PPLPDIPNGEWLC 320
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 34/51 (66%), Gaps = 5/51 (9%)
Query: 714 KENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+E DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 273 REEDDHMEYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 323
>gi|374109838|gb|AEY98743.1| FAGL075Cp [Ashbya gossypii FDAG1]
Length = 594
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 11/72 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 770
ENDD C CM G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 207 ENDDFCSSCMQTGIFLCCDTCPKSFHFACLNPPLDPDRLPEGDWSCHEC-------RFKQ 259
Query: 771 YNANARAAGRIE 782
AAG+ E
Sbjct: 260 MYPTKAAAGKAE 271
>gi|440906808|gb|ELR57029.1| Chromodomain-helicase-DNA-binding protein 3, partial [Bos grunniens
mutus]
Length = 1940
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 344 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 398
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 399 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 441
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 442 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 480
>gi|348560832|ref|XP_003466217.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like [Cavia
porcellus]
Length = 1995
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 372 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 426
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 427 AKEEDEEYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 469
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 470 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 508
>gi|358417347|ref|XP_003583617.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein 3 [Bos taurus]
Length = 2012
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 72/164 (43%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 388 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 442
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 443 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 485
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 524
>gi|125812233|ref|XP_001337239.1| PREDICTED: autoimmune regulator-like [Danio rerio]
Length = 640
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP--SGTWHCRYCMNTFQKEKFVEYNA 773
NDD+C C G DL CCD CPRAFH DC LP +P SG W C +C+ + +++ A
Sbjct: 468 NDDVCYACHCGVDLRCCDGCPRAFHSDC-HLPAVPEGSGEWICTFCV----LKTSLQWRA 522
Query: 774 NARAAGRIEGVDPFAQMVSRC 794
++ + P +Q + +C
Sbjct: 523 SSNMTEQEAFDSPVSQYILQC 543
>gi|291230097|ref|XP_002735005.1| PREDICTED: tripartite motif-containing 28 protein-like
[Saccoglossus kowalevskii]
Length = 995
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 30/48 (62%), Gaps = 2/48 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMN 761
N+D C C +GGDLLCCD+CP+ FH+ C SL P TW C C +
Sbjct: 792 NEDYCACCQNGGDLLCCDTCPKVFHLQCHIPSLTATPKETWICGLCQD 839
>gi|391336322|ref|XP_003742530.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like
[Metaseiulus occidentalis]
Length = 1321
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 31/45 (68%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCM 760
N ++C IC G+L+ CD CP++FH+DC+ L +P GTW C C+
Sbjct: 1064 NQEVCNICESPGELILCDFCPKSFHLDCIDLKRLPRGTWKCPPCV 1108
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 56/145 (38%), Gaps = 23/145 (15%)
Query: 720 CGICMDGGD---LLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNAN 774
C +C + +L CD C R +HI C+ P IP G W C C T Q A
Sbjct: 978 CRVCRKKSNPEQMLLCDGCDRGYHIYCLKPPLSEIPQGDWFCSQCSPT-QLSPRKRTKAP 1036
Query: 775 ARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQCER 834
+ E + + + E+ C +C +ILCD C +
Sbjct: 1037 VEVSSEEEDDNEKVDEDGDEDEEEEDLNQEV--CNICESPG---------ELILCDFCPK 1085
Query: 835 EYHVGCLKDHGMEDLQELPKGKWLC 859
+H+ C+ DL+ LP+G W C
Sbjct: 1086 SFHLDCI------DLKRLPRGTWKC 1104
Score = 44.3 bits (103), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 5/56 (8%)
Query: 817 CKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQK 872
C+ + ++LCD C+R YH+ CLK L E+P+G W C+ C L+ +K
Sbjct: 981 CRKKSNPEQMLLCDGCDRGYHIYCLK----PPLSEIPQGDWF-CSQCSPTQLSPRK 1031
>gi|45201021|ref|NP_986591.1| AGL075Cp [Ashbya gossypii ATCC 10895]
gi|44985791|gb|AAS54415.1| AGL075Cp [Ashbya gossypii ATCC 10895]
Length = 594
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 37/72 (51%), Gaps = 11/72 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 770
ENDD C CM G LCCD+CP++FH C++ P +P G W C C +F +
Sbjct: 207 ENDDFCSSCMQTGIFLCCDTCPKSFHFACLNPPLDPDRLPEGDWSCHEC-------RFKQ 259
Query: 771 YNANARAAGRIE 782
AAG+ E
Sbjct: 260 MYPTKAAAGKAE 271
>gi|158300661|ref|XP_320523.4| AGAP012009-PA [Anopheles gambiae str. PEST]
gi|157013268|gb|EAA00692.4| AGAP012009-PA [Anopheles gambiae str. PEST]
Length = 2037
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 65/164 (39%), Gaps = 50/164 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CP+A+H+ C+ L P G W C C
Sbjct: 411 EHQDYCEVCQQGGEIILCDTCPKAYHLVCLDPELEDTPEGKWSCPTC------------- 457
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A G + D Q R + GG +LC CD C
Sbjct: 458 ---EAEGPADEDDDEHQEFCRVCKD--------GGELLC-----------------CDNC 489
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH CL L ++P G+W C C+ C + +QK++
Sbjct: 490 PSAYHTFCLN----PPLDDIPDGEWRCPRCS-CPPLADKVQKIL 528
>gi|345482162|ref|XP_001605087.2| PREDICTED: nucleosome-remodeling factor subunit NURF301 [Nasonia
vitripennis]
Length = 2739
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 50/118 (42%), Gaps = 32/118 (27%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVE 770
+++ DD C +C GDLLCC++CP FH++CV P +P W C C
Sbjct: 353 NRDYDDHCRVCHRLGDLLCCETCPAVFHLECVEPPLVDVPPEDWQCNIC----------- 401
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
A +I G V CI PD E G +LCR R GR+ L
Sbjct: 402 ------KAHKISG-------VMDCI-----PDVEKNG-LLCRQEHLGFDRHGRKYWFL 440
>gi|359076762|ref|XP_003587462.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Bos taurus]
Length = 1833
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 64/147 (43%), Gaps = 28/147 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 209 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 263
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A E + + + + GG +LC CD C
Sbjct: 264 AKEEEEDYEEEGEEEGEKEEEDDHMEYCRVCKDGGELLC-----------------CDAC 306
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L ++P G+WLC
Sbjct: 307 ISSYHIHCLN----PPLPDIPNGEWLC 329
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 31/44 (70%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C
Sbjct: 289 EYCRVCKDGGELLCCDACISSYHIHCLNPPLPDIPNGEWLCPRC 332
>gi|366994322|ref|XP_003676925.1| hypothetical protein NCAS_0F00850 [Naumovozyma castellii CBS 4309]
gi|342302793|emb|CCC70569.1| hypothetical protein NCAS_0F00850 [Naumovozyma castellii CBS 4309]
Length = 742
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
ENDD C C G LCCD+CP++FH C+ P +P G W C C+
Sbjct: 324 ENDDYCSACFQTGSFLCCDTCPKSFHFLCLDPPLDPNNLPEGDWSCHECL 373
>gi|432920325|ref|XP_004079948.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3-like
[Oryzias latipes]
Length = 1963
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 36/52 (69%), Gaps = 5/52 (9%)
Query: 715 ENDD---LCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMN 761
E+DD C +C DGG+LLCCD+C ++HI C++ LP IP+G W C C++
Sbjct: 464 EDDDHMEFCRVCKDGGELLCCDTCTSSYHIHCLNPPLPEIPNGEWLCPRCLS 515
Score = 40.0 bits (92), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+ILCD C R YH+ CL+ +L + P+GKW C
Sbjct: 392 IILCDTCPRAYHLVCLE----PELDKAPEGKWSC 421
>gi|74149099|dbj|BAE32198.1| unnamed protein product [Mus musculus]
Length = 895
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++HI C++ LP IP+G W C C
Sbjct: 57 EFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRC 100
>gi|449675047|ref|XP_002155989.2| PREDICTED: transcription intermediary factor 1-alpha-like [Hydra
magnipapillata]
Length = 1025
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 42/72 (58%), Gaps = 5/72 (6%)
Query: 691 YTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPG 748
++ NG++ E + + S+ N+D CG+C +GG LLCCD+CP+ +H+ C L
Sbjct: 805 FSENGINQEEYQCESGFDE---SQNNEDYCGVCRNGGQLLCCDTCPKVYHLQCHIPVLND 861
Query: 749 IPSGTWHCRYCM 760
+P +W C C+
Sbjct: 862 LPRDSWSCGLCV 873
>gi|34533780|dbj|BAC86802.1| unnamed protein product [Homo sapiens]
Length = 1225
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 341 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 396
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + R+ + GG +LC CD C
Sbjct: 397 KDDDDEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 434
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 435 PSSYHLHCLN----PPLPEIPNGEWLC 457
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 417 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 460
>gi|195591505|ref|XP_002085481.1| GD14801 [Drosophila simulans]
gi|194197490|gb|EDX11066.1| GD14801 [Drosophila simulans]
Length = 893
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 81
>gi|195354154|ref|XP_002043565.1| GM19418 [Drosophila sechellia]
gi|194127733|gb|EDW49776.1| GM19418 [Drosophila sechellia]
Length = 882
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 35 DEEYCKVCRDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 81
>gi|344256322|gb|EGW12426.1| Chromodomain-helicase-DNA-binding protein 5 [Cricetulus griseus]
Length = 999
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 32/147 (21%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +KE
Sbjct: 321 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHC----EKEGIQWEP 376
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ G + R+ + GG +LC CD C
Sbjct: 377 KDDDEEEEEGGCEEEEDDHMEFCRVCKD-----GGELLC-----------------CDAC 414
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH+ CL L E+P G+WLC
Sbjct: 415 PSSYHLHCLN----PPLPEIPNGEWLC 437
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+CP ++H+ C++ LP IP+G W C C
Sbjct: 397 EFCRVCKDGGELLCCDACPSSYHLHCLNPPLPEIPNGEWLCPRC 440
>gi|24666729|ref|NP_649111.1| Chd3 [Drosophila melanogaster]
gi|25089877|sp|O16102.3|CHD3_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein 3; AltName:
Full=ATP-dependent helicase Chd3
gi|23093148|gb|AAF49162.2| Chd3 [Drosophila melanogaster]
Length = 892
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 80
>gi|308501284|ref|XP_003112827.1| CRE-LET-418 protein [Caenorhabditis remanei]
gi|308267395|gb|EFP11348.1| CRE-LET-418 protein [Caenorhabditis remanei]
Length = 1884
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 82/211 (38%), Gaps = 56/211 (26%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
EN+D C C GG+LL CD+CPRA+H C+ S+ P G W C +C
Sbjct: 249 ENNDFCEECKQGGELLLCDTCPRAYHTPCIDSSMEDPPEGDWSCPHC------------- 295
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ IV+ ++ DFCK ++LCD C
Sbjct: 296 ------------------IEHGPEIVKEEPQKVND-------DFCKICKETENLLLCDTC 330
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLCCADCKRI---NLALQKLVDRGEE-----KLPE- 883
+H C+ L ++P+ + C C+ + A + L R +E LPE
Sbjct: 331 VCAFHAYCMD----PPLTQVPQEETWNCPRCELVKPEQKAEKILCWRWKEIPYPDPLPEG 386
Query: 884 ---TSLDVIKKKHEESGSDNAVDFDVRWRVL 911
T D++ K + +F ++W+ L
Sbjct: 387 TEPTEDDLLLKPPRKMAPRREREFFIKWKYL 417
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 714 KENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPS-GTWHCRYC 759
K NDD C IC + +LL CD+C AFH C+ P +P TW+C C
Sbjct: 309 KVNDDFCKICKETENLLLCDTCVCAFHAYCMDPPLTQVPQEETWNCPRC 357
>gi|332024007|gb|EGI64225.1| E3 ubiquitin-protein ligase TRIM33 [Acromyrmex echinatior]
Length = 1083
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 875 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHVPSLKSFPDESETWQCMLCTNVL 927
>gi|62472261|ref|NP_001014591.1| Mi-2, isoform B [Drosophila melanogaster]
gi|61678453|gb|AAX52739.1| Mi-2, isoform B [Drosophila melanogaster]
Length = 1983
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 437 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 376 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 435
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 436 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 456
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 457 PSAYHTFCLN----PPLDTIPDGDWRC 479
>gi|442633513|ref|NP_001262078.1| Mi-2, isoform D [Drosophila melanogaster]
gi|440216038|gb|AGB94771.1| Mi-2, isoform D [Drosophila melanogaster]
Length = 1973
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 366 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 425
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 426 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 446
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 447 PSAYHTFCLN----PPLDTIPDGDWRC 469
>gi|194751939|ref|XP_001958281.1| GF10842 [Drosophila ananassae]
gi|190625563|gb|EDV41087.1| GF10842 [Drosophila ananassae]
Length = 1971
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 421 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 466
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 360 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 419
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 420 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 440
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 441 PSAYHTFCLN----PPLDTIPDGDWRC 463
>gi|195128581|ref|XP_002008741.1| GI13663 [Drosophila mojavensis]
gi|193920350|gb|EDW19217.1| GI13663 [Drosophila mojavensis]
Length = 1992
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 423 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 468
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 362 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 421
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 422 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 442
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 443 PSAYHTFCLN----PPLDTIPDGDWRC 465
>gi|4325130|gb|AAD17276.1| dMi-2 protein [Drosophila melanogaster]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 375 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 434
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 435 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 455
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 456 PSAYHTFCLN----PPLDTIPDGDWRC 478
>gi|340715100|ref|XP_003396058.1| PREDICTED: LOW QUALITY PROTEIN: e3 ubiquitin-protein ligase
TRIM33-like [Bombus terrestris]
Length = 1036
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 829 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 881
>gi|24667055|ref|NP_649154.2| Mi-2, isoform A [Drosophila melanogaster]
gi|281366478|ref|NP_001163476.1| Mi-2, isoform C [Drosophila melanogaster]
gi|13124018|sp|O97159.2|CHDM_DROME RecName: Full=Chromodomain-helicase-DNA-binding protein Mi-2
homolog; AltName: Full=ATP-dependent helicase Mi-2;
Short=dMi-2
gi|23093096|gb|AAF49099.2| Mi-2, isoform A [Drosophila melanogaster]
gi|272455249|gb|ACZ94747.1| Mi-2, isoform C [Drosophila melanogaster]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 375 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 434
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 435 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 455
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 456 PSAYHTFCLN----PPLDTIPDGDWRC 478
>gi|380020007|ref|XP_003693890.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase
TRIM33-like [Apis florea]
Length = 1046
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 839 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 891
>gi|350414692|ref|XP_003490389.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Bombus
impatiens]
Length = 1036
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 829 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 881
>gi|307205723|gb|EFN83968.1| E3 ubiquitin-protein ligase TRIM33 [Harpegnathos saltator]
Length = 1101
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 893 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLESFPDESETWQCMLCTNVL 945
>gi|195354288|ref|XP_002043630.1| GM15785 [Drosophila sechellia]
gi|194127798|gb|EDW49841.1| GM15785 [Drosophila sechellia]
Length = 1921
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 427 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 472
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 366 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 425
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 426 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 446
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 447 PSAYHTFCLN----PPLDTIPDGDWRC 469
>gi|403179544|ref|XP_003888549.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165103|gb|EHS62842.1| hypothetical protein PGTG_22694 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1912
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 111/292 (38%), Gaps = 69/292 (23%)
Query: 717 DDLCGICMDG-----GDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKEKFVEY 771
D C IC DG ++ CD C A H DC +P IP G W CR C + + E
Sbjct: 140 DSRCAICEDGDTENSNAIVFCDGCNLAVHQDCYGVPYIPEGQWLCRKCTVSPDRPVTCEL 199
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRD----FCKSRFG----- 822
N+ A + + +A +V C + P+T +G + D K R+
Sbjct: 200 CPNSFGAFKQTSENKWAHLV--C--AIHIPETGVGNAMYMEPVDGVRCIPKQRWKLKCYI 255
Query: 823 -RRTVILCDQCERE-----YHVGCLKDHGMEDLQELPKGKWL--------------CCAD 862
++TV C QC YH C ++ G+ ++ P G C AD
Sbjct: 256 CKKTVGACIQCANRSCCVAYHATCAQEVGLY-VKMKPAGSLYPTTGARGTDEGPDSCAAD 314
Query: 863 ---CKRINL----ALQKLVDRGEEKLPETSLDV--IKKKHEESGSDNAVDFDVRWRVLRG 913
I L ALQ P T+ D KK + GSD+ + +
Sbjct: 315 HVGADGIRLRHIEALQAAAVARLSNNPATTTDSSDAKKTFDPQGSDSHL-------AISS 367
Query: 914 KKVDASDGTRAL--------LSKAVSIFHDRFDPIIESASKLDLIPAMVYGR 957
KK+++ D RAL SK+ +H + +S L+P +++ R
Sbjct: 368 KKINSLDDLRALKLLATGSQTSKSARAYHKSY------SSGPPLVPEVIFQR 413
>gi|326671403|ref|XP_001919955.3| PREDICTED: PHD finger protein 12 [Danio rerio]
Length = 941
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D+C C +GGDLLCCD CP AFH+ C + P +PSG W C C
Sbjct: 55 NHDICDSCREGGDLLCCDHCPAAFHLQCCNPPLSREMLPSGDWMCHRC 102
>gi|145483001|ref|XP_001427523.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394605|emb|CAK60125.1| unnamed protein product [Paramecium tetraurelia]
Length = 883
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMNTFQKE 766
C +C GG +L CD+CPR FH C+ L IP G W C C++ F ++
Sbjct: 824 CKVCGQGGKVLLCDTCPRVFHPRCLKLKEIPKGKWSCMICLSYFSRQ 870
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 6/34 (17%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
V+LCD C R +H CLK L+E+PKGKW C
Sbjct: 833 VLLCDTCPRVFHPRCLK------LKEIPKGKWSC 860
>gi|17946168|gb|AAL49125.1| RE55932p [Drosophila melanogaster]
Length = 627
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
+++ C +C DGGDLLCCDSCP +H C+S P IP G W C C+
Sbjct: 34 DEEYCKVCSDGGDLLCCDSCPSVYHRTCLSPPLKSIPKGDWICPRCI 80
>gi|328716042|ref|XP_003245819.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Acyrthosiphon pisum]
Length = 2002
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 719 LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
C +C DGG+LLCCDSCP A+H C+S P +P G W C C
Sbjct: 434 FCRVCKDGGELLCCDSCPAAYHTFCLSPPITDVPDGDWKCPRC 476
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 57/151 (37%), Gaps = 47/151 (31%)
Query: 711 FSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKF 768
+ ++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + +E+
Sbjct: 368 YEQTDHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEDTPEGKWSCPHCESEGGQEQ- 426
Query: 769 VEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVIL 828
R+ + GG +LC
Sbjct: 427 ------------------EEDEHQEFCRVCKD-----GGELLC----------------- 446
Query: 829 CDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
CD C YH CL + ++P G W C
Sbjct: 447 CDSCPAAYHTFCLS----PPITDVPDGDWKC 473
>gi|195130615|ref|XP_002009747.1| GI15065 [Drosophila mojavensis]
gi|193908197|gb|EDW07064.1| GI15065 [Drosophila mojavensis]
Length = 886
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N DLC C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPSGQWLCHSC 99
>gi|255731658|ref|XP_002550753.1| predicted protein [Candida tropicalis MYA-3404]
gi|240131762|gb|EER31321.1| predicted protein [Candida tropicalis MYA-3404]
Length = 507
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 65/168 (38%), Gaps = 34/168 (20%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
NDD C C G L+CC+SCP++FH C P P W+C+ C + E
Sbjct: 201 NDDFCSSCELPGILICCESCPKSFHFTCCDPPLEEAPEDEWYCQECYSKIHPPILTE-GI 259
Query: 774 NARAAGRIEGVDP----------FAQMVSRCIRIVQTPDTELGG----------CVLCRG 813
+ +E +P + S+ ++ + +L C C G
Sbjct: 260 FGKLLFELEKTNPKEFKLPVKFQLPKNESQLQERIKDKNWDLEALYDENDKPYLCHCCGG 319
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKD--HGMEDLQELPKGKWLC 859
S RRT+I CD C YH+ CL G + L E KW C
Sbjct: 320 -----SGLNRRTLIHCDYCPLVYHIDCLNPPMFGPKTLGE----KWKC 358
>gi|410899062|ref|XP_003963016.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Takifugu
rubripes]
Length = 1148
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 48/86 (55%), Gaps = 8/86 (9%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEYN 772
N+D C +C++GGDLLCCD CP+ FH+ C +P I P G + C +C + E ++Y
Sbjct: 908 NEDWCAVCINGGDLLCCDRCPKVFHMKC-HVPTIKIFPKGEFLCTFCRSLETPE--IKYC 964
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIV 798
+R + + P Q RC R++
Sbjct: 965 EESRKTSTEQSLSPEDQ--RRCERLL 988
>gi|195496103|ref|XP_002095551.1| GE22457 [Drosophila yakuba]
gi|194181652|gb|EDW95263.1| GE22457 [Drosophila yakuba]
Length = 1982
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 481
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 375 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 434
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 435 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 455
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 456 PSAYHTFCLN----PPLDTIPDGDWRC 478
>gi|146165350|ref|XP_001014826.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila]
gi|146145551|gb|EAR94494.2| SNF2 family N-terminal domain containing protein [Tetrahymena
thermophila SB210]
Length = 1612
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 28/44 (63%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N+ C IC D G L+CC++C + FH+ CV + P+G W C YC
Sbjct: 295 NESWCFICRDQGKLICCENCSKTFHLTCVGIKKPPTGAWECPYC 338
>gi|383860809|ref|XP_003705881.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Megachile rotundata]
Length = 1061
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 854 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 906
>gi|307180087|gb|EFN68155.1| E3 ubiquitin-protein ligase TRIM33 [Camponotus floridanus]
Length = 1078
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C SL P S TW C C N
Sbjct: 870 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPSLKSFPDESETWQCMLCTNVL 922
>gi|195379440|ref|XP_002048487.1| GJ13998 [Drosophila virilis]
gi|194155645|gb|EDW70829.1| GJ13998 [Drosophila virilis]
Length = 2012
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 431 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 370 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 429
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 430 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 450
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 451 PSAYHTFCLN----PPLDTIPDGDWRC 473
>gi|390349281|ref|XP_783138.3| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 2 [Strongylocentrotus purpuratus]
Length = 1784
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
E++D+C C GG+L+CCD+CP+AFH++C L +P G W C C
Sbjct: 1420 EHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENC 1466
Score = 43.5 bits (101), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 696 VSLHELSIKLS-LERPFSSKEN--DDLCGICMDGGD---LLCCDSCPRAFHIDCVSLP-- 747
SL ++ + LS LER + ++ C +C GG+ +L CDSC R H+ C+ P
Sbjct: 1247 TSLSQIFLHLSTLERSVTWNKSILKARCRMCRRGGNPEAMLLCDSCNRGHHMFCLKPPLK 1306
Query: 748 GIPSGTWHCRYC 759
+P G W C+ C
Sbjct: 1307 KVPKGEWFCKDC 1318
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
CR C+ ++LCD C R +H+ CLK L+++PKG+W C DC
Sbjct: 1274 CR---MCRRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDC 1318
>gi|313247391|emb|CBY15642.1| unnamed protein product [Oikopleura dioica]
Length = 1498
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 31/47 (65%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYC 759
E+++LC C GG+LL C+SCPR +H C++ P IP G W C YC
Sbjct: 146 EHNELCQFCKSGGELLACESCPRVYHPKCLNPPQTEIPDGDWFCPYC 192
>gi|195402015|ref|XP_002059606.1| GJ14859 [Drosophila virilis]
gi|194147313|gb|EDW63028.1| GJ14859 [Drosophila virilis]
Length = 884
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N DLC C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|307208076|gb|EFN85607.1| PHD finger protein 12 [Harpegnathos saltator]
Length = 633
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 69/179 (38%), Gaps = 41/179 (22%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC----VSLPGIPSGTWHCRYCMNTFQKEKFVE 770
N D C C DGG+L+CCD CP ++H+ C V IP+G W C C ++E +
Sbjct: 54 HNHDFCDACKDGGELICCDRCPASYHLQCHYPAVDPMDIPNGEWLCYTCRCAAKREMDNK 113
Query: 771 YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRT----- 825
N + +E + A +V+ R + P EL ++ G + GRR
Sbjct: 114 GNDKKKKMSALEILAFAASLVNP--REFELP-KELQLPIMFPGSNKMDYVSGRRVGKSHC 170
Query: 826 -------------------------VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+I CD C +H CL L P G+W+C
Sbjct: 171 LDSNLMVPLPARLCFECGRSCRKAPLIACDYCPLYFHQDCLD----PPLTTFPIGRWMC 225
>gi|195020242|ref|XP_001985154.1| GH16907 [Drosophila grimshawi]
gi|193898636|gb|EDV97502.1| GH16907 [Drosophila grimshawi]
Length = 2013
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 431 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 476
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 370 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 429
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 430 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 450
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 451 PSAYHTFCLN----PPLDTIPDGDWRC 473
>gi|322707103|gb|EFY98682.1| Essential subunit of the histone deacetylase Rpd3S complex
[Metarhizium anisopliae ARSEF 23]
Length = 1190
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 69/163 (42%), Gaps = 40/163 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL---PGIPSGTWHCRYC------------ 759
+ND+ C C + GD++CCD CPR+FH +CV + +P W+C C
Sbjct: 754 DNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDDLPD-EWYCNECLIRRYPSRVPIH 812
Query: 760 -------MNTFQKEKFVEYNANARAAGRIEGVDP-----FAQMVS-RCIRI---VQTPD- 802
+N +K ++ R R EGV + ++VS R R PD
Sbjct: 813 KGIFGSALNNLEKSIPRAFSLPKRVQNRFEGVKAGADGDYEEVVSNRTARKRNGTDEPDF 872
Query: 803 ---TELGGCVLCRGRDFCKSRFGR-RTVILCDQCEREYHVGCL 841
E G VLC C+ + R++I C C +H+ CL
Sbjct: 873 FKQREDGQAVLCHS---CQKPATQIRSIIPCSVCSFYWHIDCL 912
>gi|260791426|ref|XP_002590730.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
gi|229275926|gb|EEN46741.1| hypothetical protein BRAFLDRAFT_89536 [Branchiostoma floridae]
Length = 1073
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
N D C C +GGDLLCCD CP AFH+ C P +P G W C CM
Sbjct: 49 NHDCCDSCKEGGDLLCCDRCPAAFHLQCCDPPLCEEDLPEGEWLCHRCM 97
>gi|410074363|ref|XP_003954764.1| hypothetical protein KAFR_0A01910 [Kazachstania africana CBS 2517]
gi|372461346|emb|CCF55629.1| hypothetical protein KAFR_0A01910 [Kazachstania africana CBS 2517]
Length = 686
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
+NDD C C+ G LCCD+CPR+FH C+ P +P G W C C
Sbjct: 247 DNDDFCSACLQTGSFLCCDTCPRSFHFLCLDPPIDPNHLPEGDWSCPSC 295
>gi|357618139|gb|EHJ71233.1| hypothetical protein KGM_08685 [Danaus plexippus]
Length = 792
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 34/59 (57%), Gaps = 4/59 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFV 769
N D C C +GGDL+CCD CP +FH+ C P IP+G+W CR C +K+ V
Sbjct: 49 HNHDSCDACREGGDLICCDRCPASFHLGCYDPPLEENDIPAGSWLCRECKAGDEKQGVV 107
>gi|328869901|gb|EGG18276.1| hypothetical protein DFA_03770 [Dictyostelium fasciculatum]
Length = 1246
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
N+D C C DGGDLLCC+SC AFH+ C+ P +P G W C C
Sbjct: 88 NNDFCDACHDGGDLLCCESCECAFHMMCLDPPVSSLPEGDWFCHSC 133
>gi|297737996|emb|CBI27197.3| unnamed protein product [Vitis vinifera]
Length = 1638
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 33/49 (67%), Gaps = 2/49 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKE 766
C IC GG+LLCCDSCPR +H+ C++ P IP+G W C C + +K+
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKCFCSIEKK 126
>gi|156554524|ref|XP_001605295.1| PREDICTED: transcription intermediary factor 1-alpha-like [Nasonia
vitripennis]
Length = 1085
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 32/53 (60%), Gaps = 5/53 (9%)
Query: 716 NDDLCGICMDGGD-LLCCDSCPRAFHIDC--VSLPGIP--SGTWHCRYCMNTF 763
N+D C +CMDGGD +LCCD CP+ FH+ C +L P S TW C C N
Sbjct: 878 NEDWCAVCMDGGDAVLCCDKCPKVFHLYCHIPNLKSFPEESETWQCMLCTNVL 930
>gi|166158078|ref|NP_001107447.1| uncharacterized protein LOC100135295 [Xenopus (Silurana)
tropicalis]
gi|156230557|gb|AAI52125.1| Zgc:173656 protein [Danio rerio]
gi|163916551|gb|AAI57592.1| LOC100135295 protein [Xenopus (Silurana) tropicalis]
Length = 386
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 197 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 255
Query: 746 LPGIP---SGTWHCRYCM 760
LP +P SG W C C+
Sbjct: 256 LPAVPEDSSGKWSCTICV 273
>gi|390349283|ref|XP_003727183.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 1A
isoform 1 [Strongylocentrotus purpuratus]
Length = 1852
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
E++D+C C GG+L+CCD+CP+AFH++C L +P G W C C
Sbjct: 1491 EHEDVCSRCRHGGELICCDTCPKAFHMECCKPVLRKVPKGHWECENC 1537
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 43/162 (26%)
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM---NTF 763
R + ++++LC C G L+ C CP +H +C+ P + W C C+ T
Sbjct: 1416 RGLGALDHNELCQSCGHAGQLILCHDCPIVYHCECLDPPLSKLTQDHWFCPLCVMDRTTN 1475
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGR 823
+ E +N G IE D + SRC GG ++C
Sbjct: 1476 GADSEEEMGSN---DGEIEHED----VCSRCRH---------GGELIC------------ 1507
Query: 824 RTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 865
CD C + +H+ C K L+++PKG W C +CK+
Sbjct: 1508 -----CDTCPKAFHMECCKPV----LRKVPKGHWE-CENCKK 1539
Score = 43.1 bits (100), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 8/72 (11%)
Query: 696 VSLHELSIKLS-LERPFSSKEN--DDLCGICMDGGD---LLCCDSCPRAFHIDCVSLP-- 747
SL ++ + LS LER + ++ C +C GG+ +L CDSC R H+ C+ P
Sbjct: 1247 TSLSQIFLHLSTLERSVTWNKSILKARCRMCRRGGNPEAMLLCDSCNRGHHMFCLKPPLK 1306
Query: 748 GIPSGTWHCRYC 759
+P G W C+ C
Sbjct: 1307 KVPKGEWFCKDC 1318
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 8/53 (15%)
Query: 811 CRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
CR C+ ++LCD C R +H+ CLK L+++PKG+W C DC
Sbjct: 1274 CR---MCRRGGNPEAMLLCDSCNRGHHMFCLK----PPLKKVPKGEWF-CKDC 1318
>gi|348523329|ref|XP_003449176.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Oreochromis
niloticus]
Length = 1174
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 39/63 (61%), Gaps = 4/63 (6%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYNA 773
N+D C +C++GGDLLCCD CP+ FH+ C ++ +P G + C +C + E +EY
Sbjct: 934 NEDWCAVCINGGDLLCCDRCPKVFHMKCHVPTIKIVPKGDFLCTFCRSITNPE--IEYCD 991
Query: 774 NAR 776
+R
Sbjct: 992 ESR 994
>gi|194874449|ref|XP_001973400.1| GG13363 [Drosophila erecta]
gi|190655183|gb|EDV52426.1| GG13363 [Drosophila erecta]
Length = 711
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 5/67 (7%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMN---TFQKEKFVE 770
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C T + EK +
Sbjct: 436 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRCSCPPLTGKAEKIIT 495
Query: 771 YNANARA 777
+ R+
Sbjct: 496 WRWAQRS 502
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 375 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 434
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 435 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 455
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 456 PSAYHTFCLN----PPLDTIPDGDWRC 478
>gi|340726708|ref|XP_003401695.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Bombus terrestris]
Length = 1974
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 74/198 (37%), Gaps = 48/198 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 376 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 422
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
AG E D + C + GG +LC CD C
Sbjct: 423 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 456
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 890
YH CL L E+P G W C C+ C I + K++ ++ PET +
Sbjct: 457 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKECPETPSEEPS 511
Query: 891 KKHEESGSDNAVDFDVRW 908
+F V+W
Sbjct: 512 TSKAAPKQRRIREFFVKW 529
>gi|440571986|gb|AGC12539.1| GH21519p1 [Drosophila melanogaster]
Length = 1084
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C++ P IP G W C C
Sbjct: 437 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPPLDTIPDGDWRCPRC 482
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 376 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 435
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 436 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 456
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 457 PSAYHTFCLN----PPLDTIPDGDWRC 479
>gi|254578124|ref|XP_002495048.1| ZYRO0B02156p [Zygosaccharomyces rouxii]
gi|238937938|emb|CAR26115.1| ZYRO0B02156p [Zygosaccharomyces rouxii]
Length = 696
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 30/50 (60%), Gaps = 4/50 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCM 760
ENDD C C+ G LCCD+CP++FH C++ P +P G W C C+
Sbjct: 298 ENDDFCSACLQSGSFLCCDTCPKSFHFLCLNPPVDPDELPDGDWSCPQCV 347
>gi|326427315|gb|EGD72885.1| hypothetical protein PTSG_12193 [Salpingoeca sp. ATCC 50818]
Length = 2049
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 4/54 (7%)
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ P +S E D C +C G+LLCCD CPR +H+DCV+ L +P G W C C
Sbjct: 711 DSPTTSHE--DYCIVCQQSGELLCCDGCPRVYHLDCVTPRLAEVPEGDWFCPAC 762
>gi|312377051|gb|EFR23976.1| hypothetical protein AND_11766 [Anopheles darlingi]
Length = 2960
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
DD C C GDLLCC++CP FH++CV P IP+G W C C
Sbjct: 355 DDHCRACHRVGDLLCCETCPAVFHLECVEPPLVNIPNGDWQCNLC 399
>gi|160773236|gb|AAI55238.1| Zgc:173656 protein [Danio rerio]
Length = 408
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 746 LPGIP---SGTWHCRYCM 760
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCTICV 295
>gi|432864384|ref|XP_004070295.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like, partial
[Oryzias latipes]
Length = 1042
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 52/87 (59%), Gaps = 10/87 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGI---PSGTWHCRYCMNTFQKEKFVEY- 771
N+D C +C++GGDLLCCD+CP+ FH+ C +P I P G + C +C + E +EY
Sbjct: 803 NEDWCAVCINGGDLLCCDNCPKVFHMKC-HVPTIKIFPKGDFLCTFCRSLSVPE--IEYC 859
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIV 798
+ N R G + ++P Q RC R++
Sbjct: 860 DDNKRIKGE-QSLNPEDQ--RRCERVL 883
>gi|350424653|ref|XP_003493868.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Bombus impatiens]
Length = 1965
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 68/174 (39%), Gaps = 48/174 (27%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 413
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
AG E D + C + GG +LC CD C
Sbjct: 414 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 447
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPET 884
YH CL L E+P G W C C+ C I + K++ ++ PET
Sbjct: 448 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKECPET 496
>gi|351713762|gb|EHB16681.1| Chromodomain-helicase-DNA-binding protein 5 [Heterocephalus glaber]
Length = 2263
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 21/135 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVE-- 770
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C+ +F+ + V
Sbjct: 409 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEKAPEGKWSCPHCVPSFEAQGCVLSV 468
Query: 771 ------YNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDF-CKSRF-- 821
Y A R+ + Q VSR + Q GG + GR + C ++
Sbjct: 469 LGGALCYPATLRSPSILG-----PQCVSRLMETGQCTRVGPGGSLARGGRGWPCDNKLTA 523
Query: 822 ---GRRTVILCDQCE 833
G RT + C E
Sbjct: 524 KCEGPRTAMSCSGPE 538
Score = 40.0 bits (92), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 4/46 (8%)
Query: 814 RDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLC 859
+D+C+ +ILCD C R YH+ CL +L++ P+GKW C
Sbjct: 411 QDYCEVCQQGGEIILCDTCPRAYHLVCLD----PELEKAPEGKWSC 452
>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
Length = 869
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
++ C +C DGGDLLCCDSCP +H C++ P IP G W C C+
Sbjct: 8 EEYCRVCRDGGDLLCCDSCPSVYHRTCLTPPLKSIPKGDWICPRCI 53
>gi|198466497|ref|XP_002135204.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
gi|198150627|gb|EDY73831.1| GA23929 [Drosophila pseudoobscura pseudoobscura]
Length = 2036
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C+ +L IP G W C C
Sbjct: 433 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 478
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 372 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 431
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 432 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 452
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 453 PSAYHTFCLN----PALDTIPDGDWRC 475
>gi|195166785|ref|XP_002024215.1| GL22908 [Drosophila persimilis]
gi|194107570|gb|EDW29613.1| GL22908 [Drosophila persimilis]
Length = 1898
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 759
+ + C +C DGG+LLCCDSCP A+H C+ +L IP G W C C
Sbjct: 291 HQEFCRVCKDGGELLCCDSCPSAYHTFCLNPALDTIPDGDWRCPRC 336
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 55/147 (37%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
E+ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C + + +
Sbjct: 230 EHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELDEPPEGKWSCPHCEADGGAAEEEDDD 289
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
+ Q R + GG +LC CD C
Sbjct: 290 EH--------------QEFCRVCKD--------GGELLC-----------------CDSC 310
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L +P G W C
Sbjct: 311 PSAYHTFCLN----PALDTIPDGDWRC 333
>gi|380020464|ref|XP_003694103.1| PREDICTED: LOW QUALITY PROTEIN: chromodomain-helicase-DNA-binding
protein Mi-2 homolog [Apis florea]
Length = 1964
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 73/198 (36%), Gaps = 48/198 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 366 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 412
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
AG E D + C + GG +LC CD C
Sbjct: 413 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 446
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 890
YH CL L E+P G W C C+ C I + K++ + PET +
Sbjct: 447 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKDCPETPSEEPS 501
Query: 891 KKHEESGSDNAVDFDVRW 908
+F V+W
Sbjct: 502 TSKATPKQRRMREFFVKW 519
>gi|110777198|ref|XP_624414.2| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Apis mellifera]
Length = 1966
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 73/198 (36%), Gaps = 48/198 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 367 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 413
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
AG E D + C + GG +LC CD C
Sbjct: 414 EGEGIAGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 447
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLVDRGEEKLPETSLDVIK 890
YH CL L E+P G W C C+ C I + K++ + PET +
Sbjct: 448 TSAYHTHCLN----PPLSEIPDGDWKCPRCS-CPPIRGKVAKILTWRWKDCPETPSEEPS 502
Query: 891 KKHEESGSDNAVDFDVRW 908
+F V+W
Sbjct: 503 TSKATPKQRRMREFFVKW 520
>gi|225543329|ref|NP_001098590.2| uncharacterized protein LOC550611 isoform 2 [Danio rerio]
Length = 386
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 197 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 255
Query: 746 LPGIP---SGTWHCRYCM 760
LP +P SG W C C+
Sbjct: 256 LPAVPEDSSGKWSCIICV 273
>gi|157108665|ref|XP_001650336.1| fetal alzheimer antigen, falz [Aedes aegypti]
gi|108868530|gb|EAT32755.1| AAEL015015-PA [Aedes aegypti]
Length = 2722
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
DD C IC GDLLCC++CP FH++CV P +PS W C C
Sbjct: 357 DDHCRICHRLGDLLCCETCPAVFHLECVDPPLVDVPSEDWQCNLC 401
>gi|356510796|ref|XP_003524120.1| PREDICTED: uncharacterized protein LOC100793933 [Glycine max]
Length = 2325
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM 760
C IC GG+LLCCDSCPR +H+ C+ P IP+G W C C
Sbjct: 73 CVICDVGGNLLCCDSCPRTYHLQCLDPPLKRIPNGKWQCPSCF 115
>gi|255544538|ref|XP_002513330.1| conserved hypothetical protein [Ricinus communis]
gi|223547238|gb|EEF48733.1| conserved hypothetical protein [Ricinus communis]
Length = 602
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
C IC +GGDLLCCD+CP +H+ C++ P +PSG W C C
Sbjct: 64 CVICDNGGDLLCCDTCPGTYHLQCLTPPLELVPSGNWQCENC 105
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 719 LCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYCMN 761
C C GGDL+CC+SCP A+H C+ +P G W CR C+N
Sbjct: 997 WCFTCSLGGDLICCESCPAAYHAKCLGFDSVPDGNWFCRDCVN 1039
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 715 ENDDLCGICMDGGDLLCCD--SCPRAFHIDCVSLPGIPSGTWHCRY 758
+++D C C +GG+L CD +CP+A+H+DC+ L P G W C +
Sbjct: 1411 KHEDYCFRCAEGGELTMCDVKTCPKAYHLDCLGLTKQPYGKWQCPW 1456
Score = 43.1 bits (100), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 59/150 (39%), Gaps = 23/150 (15%)
Query: 708 ERPFS---SKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYCMNTF 763
ERP + + +++C +C G LL C+ C AFH+DC+ L PSG++ C C++
Sbjct: 819 ERPAGGGGAAKKENVCQVCERTGQLLLCEGGCCGAFHLDCIGLQVAPSGSFRCDECISGV 878
Query: 764 Q------------KEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLC 811
+ V G I P + SR L CV C
Sbjct: 879 HSCFVCKLSDQEVRRCHVPVCGKYYHEGCIRRF-PLTRFDSRGFTC------PLHACVAC 931
Query: 812 RGRDFCKSRFGRRTVILCDQCEREYHVGCL 841
+ ++ R ++ C +C YH G L
Sbjct: 932 FADNPKSTKASRGRLMRCVRCPTAYHQGDL 961
>gi|189236075|ref|XP_972162.2| PREDICTED: similar to Dual specificity
tyrosine-phosphorylation-regulated kinase [Tribolium
castaneum]
Length = 2981
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP---SGTWHCRYCMN 761
N+D C +CMDGG+L+CCD CP+ FH C +P + + TW C C N
Sbjct: 2783 NEDWCAVCMDGGELVCCDKCPKVFHQYC-HIPNLSVEENDTWQCLLCTN 2830
>gi|26337379|dbj|BAC32375.1| unnamed protein product [Mus musculus]
Length = 708
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 71/164 (43%), Gaps = 32/164 (19%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 389 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 443
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A + D + C +C+ ++ CD C
Sbjct: 444 AKEEEE---------EYEEEGEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 485
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC--CADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C C + +QK++
Sbjct: 486 ISSYHIHCLN----PPLPDIPNGEWLCPRCT-CPVLKGRVQKIL 524
>gi|195048447|ref|XP_001992528.1| GH24800 [Drosophila grimshawi]
gi|193893369|gb|EDV92235.1| GH24800 [Drosophila grimshawi]
Length = 887
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N DLC C +GG+LLCCD CP +FH+ C P IP+G W C C
Sbjct: 51 HNHDLCDACEEGGNLLCCDRCPSSFHLQCHDPPLNEEDIPTGQWLCHSC 99
>gi|159162702|pdb|1MM3|A Chain A, Solution Structure Of The 2nd Phd Domain From Mi2b With C-
Terminal Loop Replaced By Corresponding Loop From Wstf
Length = 61
Score = 60.5 bits (145), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/52 (44%), Positives = 33/52 (63%), Gaps = 2/52 (3%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV--SLPGIPSGTWHCRYC 759
P S + + C +C DGG+LLCCD+CP ++HI C+ +L +P G W C C
Sbjct: 2 PLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLRPALYEVPDGEWQCPRC 53
>gi|225543325|ref|NP_001017912.2| uncharacterized protein LOC550611 isoform 1 [Danio rerio]
Length = 408
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 746 LPGIP---SGTWHCRYCM 760
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCIICV 295
>gi|312373820|gb|EFR21503.1| hypothetical protein AND_16980 [Anopheles darlingi]
Length = 1342
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 37/77 (48%), Gaps = 12/77 (15%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCV-----SLPGIPSGTWHCRYCMNTFQKEKFVE 770
N+D C +CMDGG+L+CCD CP+ FH C SLP S TW C C N F +
Sbjct: 1006 NEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDE-SETWQCLLCYN------FAD 1058
Query: 771 YNANARAAGRIEGVDPF 787
R G+ P
Sbjct: 1059 LPPEPTGEKRNVGITPL 1075
>gi|62471470|gb|AAH93591.1| Zgc:113411 [Danio rerio]
gi|182889002|gb|AAI64509.1| Zgc:113411 protein [Danio rerio]
Length = 408
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 4/78 (5%)
Query: 686 PFQHIYTSNGVSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVS 745
P + + LH ++ + + NDD+C +C G+L+CCD CPRAFH C
Sbjct: 219 PLGRLIKKKVLELHTINCDCEICQELDQHLNDDVCFVCNSEGNLVCCDECPRAFHHHC-H 277
Query: 746 LPGIP---SGTWHCRYCM 760
LP +P SG W C C+
Sbjct: 278 LPAVPEDSSGKWSCIICV 295
>gi|321453629|gb|EFX64846.1| hypothetical protein DAPPUDRAFT_229741 [Daphnia pulex]
Length = 1431
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 44/72 (61%), Gaps = 6/72 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQ---KEKFVE 770
++++C +C +GG+++CCD+CP +H++C++ P +P G W C C Q + K E
Sbjct: 1149 DNEICPVCQEGGEVICCDTCPAVYHLECINPPLRKVPRGKWSCPQCKTPPQDRERGKLRE 1208
Query: 771 YNANARA-AGRI 781
N++ R + RI
Sbjct: 1209 KNSDGRTGSARI 1220
Score = 47.4 bits (111), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 77/207 (37%), Gaps = 50/207 (24%)
Query: 720 CGICMDGGD---LLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYNAN 774
C IC GD +L CD C + H+ C+ L +P G W C C K E
Sbjct: 1053 CRICRRKGDPEKMLLCDGCDKGHHMYCLKPLLTVVPEGDWFCAEC-------KPREKPRT 1105
Query: 775 ARAAGRI----------------EGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCK 818
R A R+ G + V++ D E+ C +C+
Sbjct: 1106 PRKARRVFSEESEDEAVEEEEADAGEEEAEVEAEEGESDVESVDNEI--CPVCQEGG--- 1160
Query: 819 SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKRINLALQKLVDRGE 878
VI CD C YH+ C+ L+++P+GKW C CK DR
Sbjct: 1161 ------EVICCDTCPAVYHLECIN----PPLRKVPRGKW-SCPQCK------TPPQDRER 1203
Query: 879 EKLPETSLDVIKKKHEESGSDNAVDFD 905
KL E + D S + +A+DFD
Sbjct: 1204 GKLREKNSDGRTGSARISRTRHAIDFD 1230
>gi|296420656|ref|XP_002839885.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636091|emb|CAZ84076.1| unnamed protein product [Tuber melanosporum]
Length = 990
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 77/187 (41%), Gaps = 47/187 (25%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEK----- 767
EN+D+C +C G LCC+ CPR+FH C++ P +P G W C C
Sbjct: 589 ENEDVCAVCNGPGRFLCCERCPRSFHFTCLNPPLEEVPEGMWFCNKCTTQHNPPPKPPRG 648
Query: 768 -FVEY--NANARAAGRIE----------GVD--PFAQMVSRCIRIVQT--------P--- 801
FV+ N N R GV+ P + V +R +T P
Sbjct: 649 LFVDLLDNINRRNPSSFRLPSEIRNYFVGVETGPLGEYVD--VRDNKTQRFGRSGFPEEY 706
Query: 802 DT-----ELGGCVLCRGRDFCK--SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPK 854
DT + G +LC C+ + G+R +I CD C +H+ C+ M + K
Sbjct: 707 DTHRLKDKSGNIILCHS---CRRSAADGQR-IISCDFCPLNWHLDCVSPMPMPNPPPAQK 762
Query: 855 GKWLCCA 861
KW+C A
Sbjct: 763 -KWMCPA 768
>gi|347965727|ref|XP_321817.5| AGAP001328-PA [Anopheles gambiae str. PEST]
gi|333470378|gb|EAA01183.5| AGAP001328-PA [Anopheles gambiae str. PEST]
Length = 1278
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 37/77 (48%), Gaps = 12/77 (15%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCV-----SLPGIPSGTWHCRYCMNTFQKEKFVE 770
N+D C +CMDGG+L+CCD CP+ FH C SLP S TW C C N F +
Sbjct: 1078 NEDWCAVCMDGGELMCCDKCPKVFHQTCHIPVIDSLPDE-SETWQCLLCYN------FAD 1130
Query: 771 YNANARAAGRIEGVDPF 787
R G+ P
Sbjct: 1131 LPPEPTGEKRNVGITPL 1147
>gi|292621498|ref|XP_002664670.1| PREDICTED: PHD finger protein 12 [Danio rerio]
Length = 939
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 54 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSREMLPPGDWMCHRC 101
>gi|359472760|ref|XP_003631193.1| PREDICTED: uncharacterized protein LOC100247555 [Vitis vinifera]
Length = 2355
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 29/42 (69%), Gaps = 2/42 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
C IC GG+LLCCDSCPR +H+ C++ P IP+G W C C
Sbjct: 78 CVICDLGGNLLCCDSCPRTYHLQCLNPPLKRIPNGKWQCPKC 119
Score = 39.7 bits (91), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 27/62 (43%), Gaps = 21/62 (33%)
Query: 804 ELGGCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADC 863
+LGG +LC CD C R YH+ CL L+ +P GKW C C
Sbjct: 82 DLGGNLLC-----------------CDSCPRTYHLQCLN----PPLKRIPNGKWQCPKCC 120
Query: 864 KR 865
++
Sbjct: 121 QK 122
>gi|325188434|emb|CCA22970.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 609
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 712 SSKEND---DLCGICMDGGDLLCCDSCPRAFHIDCVSL--PGIPSGTWHCRYC 759
SS E+D D C IC+ GD +CCD CPR+FH++C+ L +P G W C C
Sbjct: 431 SSDEHDTNLDYCSICLKDGDFVCCDICPRSFHLECLRLRQEDLPKGEWQCEGC 483
>gi|427786979|gb|JAA58941.1| Putative sin3-type complex [Rhipicephalus pulchellus]
Length = 533
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 67/188 (35%), Gaps = 51/188 (27%)
Query: 708 ERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC---- 759
RP N D C C +GGDL+CCD CP FH+ C P +PSG W C C
Sbjct: 37 HRPTGRAVNHDCCDSCKEGGDLICCDRCPATFHLQCHDPPLDEESLPSGEWICHRCTVMA 96
Query: 760 --------------------------MNTFQKEKFVEYNANARAAGRIEGVDPFAQMVSR 793
+N Q E E + + G + P +R
Sbjct: 97 AASSGPASNSSEASSALHTLVAAASLLNAQQFELPNELSCHIALPGSSKRPRP-----AR 151
Query: 794 CIRIVQTPDTELG--GCVLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQE 851
R P L C CR R+ +I CD C +H CL+ L
Sbjct: 152 GRRHDLEPPVPLPLRVCFTCRR----SCRWA--PLIQCDYCPLLFHADCLE----FPLAS 201
Query: 852 LPKGKWLC 859
LP G+W+C
Sbjct: 202 LPTGRWMC 209
>gi|195055097|ref|XP_001994457.1| GH17187 [Drosophila grimshawi]
gi|193892220|gb|EDV91086.1| GH17187 [Drosophila grimshawi]
Length = 1128
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 892 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 941
>gi|432109785|gb|ELK33837.1| Autoimmune regulator, partial [Myotis davidii]
Length = 374
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 701 LSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP 747
L + LS E P ++N+D C +C DGG+L+CCD CPRAFH+ C+S P
Sbjct: 240 LPLPLSSE-PQLHQKNEDECAVCRDGGELICCDGCPRAFHLACLSPP 285
>gi|18858131|ref|NP_572311.1| CG3815 [Drosophila melanogaster]
gi|17861444|gb|AAL39199.1| GH06635p [Drosophila melanogaster]
gi|22831814|gb|AAF46150.2| CG3815 [Drosophila melanogaster]
gi|220944994|gb|ACL85040.1| CG3815-PA [synthetic construct]
gi|220954818|gb|ACL89952.1| CG3815-PA [synthetic construct]
Length = 878
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|195391037|ref|XP_002054172.1| GJ22942 [Drosophila virilis]
gi|194152258|gb|EDW67692.1| GJ22942 [Drosophila virilis]
Length = 1119
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|442620095|ref|NP_001262765.1| bonus, isoform C [Drosophila melanogaster]
gi|440217666|gb|AGB96145.1| bonus, isoform C [Drosophila melanogaster]
Length = 1207
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|281352213|gb|EFB27797.1| hypothetical protein PANDA_013886 [Ailuropoda melanoleuca]
Length = 640
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 68/164 (41%), Gaps = 31/164 (18%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC-MNTFQKEKFVEY 771
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C Q E E
Sbjct: 377 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHCEKEGVQWEAKEEE 436
Query: 772 NANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQ 831
+ + C R+ + GG +LC CD
Sbjct: 437 EEYEEEGEDEGEKEEEDDHMEYC-RVCKD-----GGELLC-----------------CDA 473
Query: 832 CEREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 874
C YH+ CL L ++P G+WLC C + +QK++
Sbjct: 474 CISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 513
>gi|198467663|ref|XP_001354469.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
gi|198149350|gb|EAL31522.2| GA17705 [Drosophila pseudoobscura pseudoobscura]
Length = 951
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 50 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 98
>gi|195340386|ref|XP_002036794.1| GM12479 [Drosophila sechellia]
gi|194130910|gb|EDW52953.1| GM12479 [Drosophila sechellia]
Length = 876
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 702 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 759
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1426 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1484
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.8 bits (112), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 702 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 759
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1529 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1587
>gi|298707919|emb|CBJ30305.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1534
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCM---NTFQKEKFV 769
+++DLC +C GGDLLCCD+C FH C L +P+G W+C++C+ E
Sbjct: 108 DHNDLCEVCSHGGDLLCCDTCSLVFHTKCHRPELKEVPAGDWNCQFCVADSTHVPPEDRA 167
Query: 770 EYNANARAAGRIE 782
E N A R++
Sbjct: 168 EAQRNVAAMRRVQ 180
Score = 45.1 bits (105), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 11/61 (18%)
Query: 802 DTELGGCVLCRGRDFCKSRFGRRT---VILCDQCEREYHVGCLKDHGMEDLQELPKGKWL 858
D E+ C LC C+ +G++ +LC+ C+ EYH CL L E+PKGKW
Sbjct: 341 DDEVMMCPLC----SCRKCYGKQDPELALLCEHCDDEYHTYCLD----PPLTEVPKGKWY 392
Query: 859 C 859
C
Sbjct: 393 C 393
>gi|195113037|ref|XP_002001076.1| GI10584 [Drosophila mojavensis]
gi|193917670|gb|EDW16537.1| GI10584 [Drosophila mojavensis]
Length = 1121
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 886 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 935
>gi|195569415|ref|XP_002102705.1| GD19361 [Drosophila simulans]
gi|194198632|gb|EDX12208.1| GD19361 [Drosophila simulans]
Length = 1140
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|6631000|gb|AAF19646.1|AF210315_1 bonus [Drosophila melanogaster]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|24648457|ref|NP_524724.2| bonus, isoform A [Drosophila melanogaster]
gi|23176008|gb|AAF55786.2| bonus, isoform A [Drosophila melanogaster]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|260943552|ref|XP_002616074.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
gi|238849723|gb|EEQ39187.1| hypothetical protein CLUG_03315 [Clavispora lusitaniae ATCC 42720]
Length = 766
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 64/167 (38%), Gaps = 34/167 (20%)
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC---MNTF 763
+P + +NDD C C G+ +CCDSCP++FH+ C P +P W+C C
Sbjct: 227 QPEDTTQNDDYCATCGGTGEFICCDSCPKSFHLLCCGPPLREVPEDNWNCSECRAAQGMT 286
Query: 764 QKEKFVEYNANARAAGRIEGVDPFAQMVSRCIR------IVQTPDTELGGCVLCRGR--- 814
Q+ + + G +P + R +R + P + L R
Sbjct: 287 QRRSWNSVGVFGALLNALHGRNPREYCLPRRLRENTFIDVSAGPSGQYTDASLKPERTSR 346
Query: 815 ------DFCKSRFGR--------------RTVILCDQCEREYHVGCL 841
D R GR +T++ CD C ++H+ CL
Sbjct: 347 ADELDIDALYDRAGRPFLCHRCRMSGRANKTLVACDYCPLKWHLDCL 393
>gi|195498245|ref|XP_002096441.1| GE25673 [Drosophila yakuba]
gi|194182542|gb|EDW96153.1| GE25673 [Drosophila yakuba]
Length = 1133
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 896 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 945
>gi|113195649|ref|NP_001037823.1| zinc finger protein LOC723799 [Ciona intestinalis]
gi|92081516|dbj|BAE93305.1| zinc finger protein [Ciona intestinalis]
Length = 1298
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 4/51 (7%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNT 762
N + C C +GGDLLCCD CP AFH+ C + P +P G W C C+ T
Sbjct: 53 NHETCDACGEGGDLLCCDFCPAAFHLQCCNPPLDEDKVPHGEWACHRCVVT 103
>gi|194896219|ref|XP_001978436.1| GG19583 [Drosophila erecta]
gi|190650085|gb|EDV47363.1| GG19583 [Drosophila erecta]
Length = 876
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|195353945|ref|XP_002043462.1| GM23122 [Drosophila sechellia]
gi|194127603|gb|EDW49646.1| GM23122 [Drosophila sechellia]
Length = 1131
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 702 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 759
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1426 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1484
>gi|194744148|ref|XP_001954557.1| GF16683 [Drosophila ananassae]
gi|190627594|gb|EDV43118.1| GF16683 [Drosophila ananassae]
Length = 1183
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 895 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 944
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|281362149|ref|NP_001163665.1| bonus, isoform B [Drosophila melanogaster]
gi|159884207|gb|ABX00782.1| RE48191p [Drosophila melanogaster]
gi|272477073|gb|ACZ94961.1| bonus, isoform B [Drosophila melanogaster]
Length = 1125
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|2660669|gb|AAC79140.1| human Mi-2 autoantigen-like protein [Arabidopsis thaliana]
gi|9758384|dbj|BAB08833.1| helicase-like protein [Arabidopsis thaliana]
Length = 2228
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
C IC GGDLLCCDSCPR +H C++ P IP+G W C C
Sbjct: 64 CVICDLGGDLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKC 105
>gi|405957765|gb|EKC23951.1| PHD finger protein 12 [Crassostrea gigas]
Length = 913
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 27/48 (56%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C P +P G W C C
Sbjct: 49 NHDSCDSCKEGGDLLCCDWCPAAFHLQCHDPPLEEDDVPPGEWRCHRC 96
>gi|442620101|ref|NP_001262768.1| bonus, isoform F [Drosophila melanogaster]
gi|440217669|gb|AGB96148.1| bonus, isoform F [Drosophila melanogaster]
Length = 1122
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 886 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 935
>gi|194899728|ref|XP_001979410.1| GG24000 [Drosophila erecta]
gi|190651113|gb|EDV48368.1| GG24000 [Drosophila erecta]
Length = 1131
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 894 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 943
>gi|442620099|ref|NP_001262767.1| bonus, isoform E [Drosophila melanogaster]
gi|440217668|gb|AGB96147.1| bonus, isoform E [Drosophila melanogaster]
Length = 1125
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1519 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1561
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1927 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1974
>gi|348543602|ref|XP_003459272.1| PREDICTED: PHD finger protein 12-like [Oreochromis niloticus]
Length = 1018
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1708 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1750
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2116 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2163
>gi|442620097|ref|NP_001262766.1| bonus, isoform D [Drosophila melanogaster]
gi|440217667|gb|AGB96146.1| bonus, isoform D [Drosophila melanogaster]
Length = 1119
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 882 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 931
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1422 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1464
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1830 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1877
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1398 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1440
Score = 47.8 bits (112), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1806 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1853
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 702 SIKLSLERPFSSKENDDLCGICMDGGDLLCCDS-CPRAFHIDCVSLPGIPSGTWHCRYC 759
S KL +ER + +++C C G+LL C++ C AFH++C+ LP +P G + C C
Sbjct: 1220 SKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGAFHLECLGLPEMPRGKFICNEC 1278
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|334188199|ref|NP_199293.3| chromatin remodeling 4 protein [Arabidopsis thaliana]
gi|332007781|gb|AED95164.1| chromatin remodeling 4 protein [Arabidopsis thaliana]
Length = 2223
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/42 (57%), Positives = 28/42 (66%), Gaps = 2/42 (4%)
Query: 720 CGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
C IC GGDLLCCDSCPR +H C++ P IP+G W C C
Sbjct: 78 CVICDLGGDLLCCDSCPRTYHTACLNPPLKRIPNGKWICPKC 119
>gi|156379809|ref|XP_001631648.1| predicted protein [Nematostella vectensis]
gi|156218692|gb|EDO39585.1| predicted protein [Nematostella vectensis]
Length = 824
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 29/46 (63%), Gaps = 2/46 (4%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPG--IPSGTWHCRYC 759
+ D C +C DGG LLCCD CP A+H+ C+ P +P+G W C C
Sbjct: 10 HSDYCRVCKDGGQLLCCDKCPMAYHLKCLIPPMMRVPTGEWKCPRC 55
>gi|270005635|gb|EFA02083.1| hypothetical protein TcasGA2_TC007718 [Tribolium castaneum]
Length = 980
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP---SGTWHCRYCMN 761
N+D C +CMDGG+L+CCD CP+ FH C +P + + TW C C N
Sbjct: 782 NEDWCAVCMDGGELVCCDKCPKVFHQYC-HIPNLSVEENDTWQCLLCTN 829
>gi|194763533|ref|XP_001963887.1| GF21258 [Drosophila ananassae]
gi|190618812|gb|EDV34336.1| GF21258 [Drosophila ananassae]
Length = 876
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IPSG W C C
Sbjct: 51 HNHDYCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPSGQWLCHSC 99
>gi|223997928|ref|XP_002288637.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975745|gb|EED94073.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 646
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 36/66 (54%), Gaps = 10/66 (15%)
Query: 700 ELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCV------SLPGIPSGT 753
EL I L E+P S N + C IC DGG L+CCD+C R +H C+ +LP +
Sbjct: 50 ELGIALKSEKPGSRGGNQNECAICEDGGKLICCDNCDRVYHATCLRIVDVDTLPDV---- 105
Query: 754 WHCRYC 759
WHC C
Sbjct: 106 WHCPKC 111
>gi|1585696|prf||2201456A Mi-2 autoantigen
Length = 529
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 29/163 (17%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 307
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A + + D + C +C+ ++ CD C
Sbjct: 308 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 350
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C + +QK++
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|348577397|ref|XP_003474471.1| PREDICTED: nuclear body protein SP140-like [Cavia porcellus]
Length = 652
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 667 SEVSPSQFEAHAGWASRRKPFQHIYTSNGVSLHELSIKLSLERPFSSK---ENDDLCGIC 723
S +SP +FE G AS + ++ +G +L EL K L P S K N + C +C
Sbjct: 424 SLLSPREFEVAGGRASSKN-WKISVRCHGWTLRELMQKEYLPIPPSKKPKLRNSNECEVC 482
Query: 724 MDGGDLLCCDSCPRAFHIDC-VSLPGIPSGTWHCRYC 759
DGG L CCD+C RAFH C + L + W C +C
Sbjct: 483 RDGGTLFCCDTCSRAFHEYCHLPLVEVERNPWSCIFC 519
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1710 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2118 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2165
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|2135739|pir||I38558 Mi-2 autoantigen 240 kDa protein - human (fragment)
gi|761718|gb|AAC50228.1| Mi-2 autoantigen 240 kDa protein, partial [Homo sapiens]
Length = 530
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 29/163 (17%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C+ L P G W C +C +++ V++
Sbjct: 253 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELDRAPEGKWSCPHC-----EKEGVQWE 307
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
A + + D + C +C+ ++ CD C
Sbjct: 308 AKEEEE--------EYEEEGEEEGEKEEEDDHMEYCRVCKDGG---------ELLCCDAC 350
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLCC-ADCKRINLALQKLV 874
YH+ CL L ++P G+WLC C + +QK++
Sbjct: 351 ISSYHIHCLN----PPLPDIPNGEWLCPRCTCPVLKGRVQKIL 389
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1601 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1643
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C DGG L+ C CP+ +H DC++L P+G W C +
Sbjct: 2009 TKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2056
>gi|357621017|gb|EHJ73003.1| putative fetal alzheimer antigen, falz [Danaus plexippus]
Length = 2482
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 717 DDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
DD C +C GDLLCC++CP FH++CV P +PS W C C
Sbjct: 345 DDHCRVCHRLGDLLCCETCPAVFHLECVDPPLVNVPSEDWQCALC 389
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1705 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1747
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2113 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2160
>gi|195449701|ref|XP_002072186.1| GK22457 [Drosophila willistoni]
gi|194168271|gb|EDW83172.1| GK22457 [Drosophila willistoni]
Length = 1120
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 888 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 937
>gi|74218224|dbj|BAE43222.1| unnamed protein product [Mus musculus]
Length = 239
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1698 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1740
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2106 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2153
>gi|224014282|ref|XP_002296804.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968659|gb|EED87005.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 2544
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 30/53 (56%)
Query: 707 LERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
L F+ NDD C +C GDLLCCD CP +FH C+ + +P G W C C
Sbjct: 1043 LSHEFAEIVNDDECTVCGIEGDLLCCDGCPGSFHRQCIGVARLPEGKWLCPEC 1095
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1400 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1442
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1808 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1855
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1337 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1379
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1745 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1792
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1440 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1482
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1848 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1895
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1603 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 47.4 bits (111), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2011 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2058
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1707 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1749
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2115 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2162
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1603 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1645
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2011 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2058
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1604 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1646
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2012 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2059
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1350 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
Score = 47.8 bits (112), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1747 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1794
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1709 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1751
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2117 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2164
>gi|255713490|ref|XP_002553027.1| KLTH0D07040p [Lachancea thermotolerans]
gi|238934407|emb|CAR22589.1| KLTH0D07040p [Lachancea thermotolerans CBS 6340]
Length = 674
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 39/80 (48%), Gaps = 12/80 (15%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYCMNTFQKEKFVE 770
ENDD C C G LCCD+CP++FH C + P +P G W C C +F
Sbjct: 282 ENDDFCASCRQPGIFLCCDTCPKSFHFACCNPPLDPDNLPEGDWSCAEC-------QFRI 334
Query: 771 YNANARAAGRIEGVDPFAQM 790
N AA R+E D AQ+
Sbjct: 335 RCPNKAAAHRLEK-DYLAQL 353
>gi|340522049|gb|EGR52282.1| predicted protein [Trichoderma reesei QM6a]
Length = 1069
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 65/163 (39%), Gaps = 40/163 (24%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSL---PGIPSGTWHCRYCM----------- 760
+ND+ C C + GD++CCD CPR+FH +CV + +P W+C C+
Sbjct: 632 DNDEYCSACGNTGDVVCCDGCPRSFHFECVDMVQSDHLPD-EWYCNECLVRRFPSRVPVY 690
Query: 761 --------NTFQKEKFVEYNANARAAGRIEGVDPFAQMVSRCIRIVQT---------PD- 802
N +K ++ + R EGV A + V+ PD
Sbjct: 691 KGAFASALNALEKSIPRAFSLPKKIQTRFEGVKAGADGDYEEVTTVKAKKRSGYDELPDF 750
Query: 803 ---TELGGCVLCRGRDFC-KSRFGRRTVILCDQCEREYHVGCL 841
E G VLC C K+ R +I C C +H+ CL
Sbjct: 751 FKQREDGEAVLCHA---CQKAATEIRAIIPCSACPLHWHIDCL 790
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1396 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1438
Score = 49.7 bits (117), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C DGG L+ C CP+ +H DC++L P+G W C +
Sbjct: 1804 TKEREDECFSCGDGGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1851
>gi|383851360|ref|XP_003701201.1| PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog
[Megachile rotundata]
Length = 1967
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/53 (49%), Positives = 32/53 (60%), Gaps = 6/53 (11%)
Query: 713 SKENDD----LCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYC 759
+ E+DD C IC DGG+LLCCDSC A+H C++ P IP G W C C
Sbjct: 422 AAEDDDEHMEFCRICKDGGELLCCDSCTSAYHTHCLNPPLSEIPDGDWKCPRC 474
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 54/147 (36%), Gaps = 45/147 (30%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDC--VSLPGIPSGTWHCRYCMNTFQKEKFVEYN 772
++ D C +C GG+++ CD+CPRA+H+ C L P G W C +C
Sbjct: 368 DHQDYCEVCQQGGEIILCDTCPRAYHLVCLEPELEETPEGKWSCPHC------------- 414
Query: 773 ANARAAGRIEGVDPFAQMVSRCIRIVQTPDTELGGCVLCRGRDFCKSRFGRRTVILCDQC 832
G E D + C + GG +LC CD C
Sbjct: 415 EGEGITGAAEDDDEHMEFCRIC---------KDGGELLC-----------------CDSC 448
Query: 833 EREYHVGCLKDHGMEDLQELPKGKWLC 859
YH CL L E+P G W C
Sbjct: 449 TSAYHTHCLN----PPLSEIPDGDWKC 471
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1350 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1392
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1706 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1748
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2114 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2161
>gi|427788395|gb|JAA59649.1| Putative atp-dependent chromatin assembly factor large subunit
[Rhipicephalus pulchellus]
Length = 1568
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQK 765
N+D C C GG L+CCDSCP A+H++C P +P G W+C C Q+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCTVAKQR 1314
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 809 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 861
CR C+ R ++LCD C+R +H+ CLK L+E+PKG W C +
Sbjct: 1158 AYCR---ICRRRRDPERMLLCDGCDRGHHLYCLK----PPLEEIPKGDWYCIS 1203
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1436 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1478
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1844 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1891
>gi|427788397|gb|JAA59650.1| Putative bromodomain adjacent to zinc finger domain protein 1a
[Rhipicephalus pulchellus]
Length = 1568
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQK 765
N+D C C GG L+CCDSCP A+H++C P +P G W+C C Q+
Sbjct: 1263 NEDSCDACGKGGTLICCDSCPLAYHLECTRPPLRRLPRGNWNCHKCTVAKQR 1314
Score = 43.1 bits (100), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 809 VLCRGRDFCKSRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCA 861
CR C+ R ++LCD C+R +H+ CLK L+E+PKG W C +
Sbjct: 1158 AYCR---ICRRRRDPERMLLCDGCDRGHHLYCLK----PPLEEIPKGDWYCIS 1203
>gi|432894203|ref|XP_004075956.1| PREDICTED: PHD finger protein 12-like [Oryzias latipes]
Length = 865
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 51 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 98
Score = 43.1 bits (100), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 47/110 (42%), Gaps = 34/110 (30%)
Query: 668 EVSPSQFE---------AHAGWASRR-------KPF---QHIYTSNGVSLHELSIKLSLE 708
E +P+QF+ A G + RR KPF QH +NG L L +K+
Sbjct: 220 ERNPTQFQLPSELTCTTALPGSSKRRRKEELLGKPFRRPQHELDTNG--LVPLPVKVCF- 276
Query: 709 RPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHC 756
LC L+ CD CP FH+DC+ P G+P+G W C
Sbjct: 277 ----------LCNKSCRLAPLIQCDYCPLLFHMDCLDPPLTGLPAGRWMC 316
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1441 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1483
Score = 47.4 bits (111), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1849 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1896
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1437 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1845 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1892
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1710 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1752
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 2118 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 2165
>gi|198449950|ref|XP_001357792.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
gi|198130826|gb|EAL26927.2| GA18737 [Drosophila pseudoobscura pseudoobscura]
Length = 1128
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|195158907|ref|XP_002020325.1| GL13567 [Drosophila persimilis]
gi|194117094|gb|EDW39137.1| GL13567 [Drosophila persimilis]
Length = 1053
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 6/51 (11%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIP-----SGTWHCRYCMN 761
N+D C +C+DGG+L+CCD CP+ FH +C +P I S +W C C+N
Sbjct: 885 NEDWCAVCLDGGELMCCDKCPKVFHQNC-HIPAISSLPDESESWQCLLCVN 934
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1437 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1479
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1845 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1892
>gi|410930175|ref|XP_003978474.1| PREDICTED: PHD finger protein 12-like [Takifugu rubripes]
Length = 1011
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDTCDSCREGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
Score = 40.0 bits (92), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 53/123 (43%), Gaps = 35/123 (28%)
Query: 655 KKGFGILCTCCNSEVSPSQFE---------AHAGWASRR-------KPF---QHIYTSNG 695
K+ F +L E +P+QF+ A G + RR KPF QH SNG
Sbjct: 227 KRPFQLLIAAA-MERNPTQFQLPSELTCTTALPGSSKRRRKEELLGKPFRRPQHELDSNG 285
Query: 696 VSLHELSIKLSLERPFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGT 753
L L +K+ FS + L L+ CD CP FH+DC+ P +P+G
Sbjct: 286 --LVPLPVKIC----FSCNRSCRL-------APLIQCDYCPLLFHMDCLDPPLTALPAGK 332
Query: 754 WHC 756
W C
Sbjct: 333 WMC 335
>gi|348666072|gb|EGZ05900.1| hypothetical protein PHYSODRAFT_342104 [Phytophthora sojae]
Length = 1134
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 14/86 (16%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCMNTFQKEKFVEYNA 773
N D C +C GDL+CCD CPR++H+ C+ + +P G W C C + +F Y+
Sbjct: 933 NLDFCEVCQRAGDLVCCDKCPRSYHLKCLHMTENDLPEGDWQCAECK---KPSRFDAYSV 989
Query: 774 NARAAGRIEGVDPFAQMVSRCIRIVQ 799
V ++ RC++IV+
Sbjct: 990 ---------AVASEQSLLDRCLKIVE 1006
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 33/53 (62%), Gaps = 9/53 (16%)
Query: 815 DFCK--SRFGRRTVILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 865
DFC+ R G ++ CD+C R YH+ CL H E+ +LP+G W CA+CK+
Sbjct: 935 DFCEVCQRAG--DLVCCDKCPRSYHLKCL--HMTEN--DLPEGDWQ-CAECKK 980
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1438 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1846 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1893
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1401 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1443
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1809 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1856
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1438 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1480
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 713 SKENDDLCGICMDGGDLLCC--DSCPRAFHIDCVSLPGIPSGTWHCRY 758
+KE +D C C D G L+ C CP+ +H DC++L P+G W C +
Sbjct: 1846 TKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECPW 1893
>gi|403373758|gb|EJY86802.1| PHD domain containing protein [Oxytricha trifallax]
Length = 988
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/45 (51%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSL-PGIPSGTWHCRYC 759
N D+CGIC G L+CCD CP AFH DC+ P G W C +C
Sbjct: 576 NADVCGICDLPGKLICCDDCPSAFHADCLGYEKQCPRGKWKCYFC 620
>gi|363741325|ref|XP_415826.3| PREDICTED: PHD finger protein 12 [Gallus gallus]
Length = 1084
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 130 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 177
>gi|294658848|ref|XP_461185.2| DEHA2F19338p [Debaryomyces hansenii CBS767]
gi|202953432|emb|CAG89573.2| DEHA2F19338p [Debaryomyces hansenii CBS767]
Length = 795
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 71/196 (36%), Gaps = 54/196 (27%)
Query: 713 SKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP--GIPSGTWHCRYCM---NTFQKEK 767
+K NDD C C G +CCD+CP++FH C + P P WHC+ C+ N K+
Sbjct: 329 TKNNDDFCTNCGGPGVFICCDTCPKSFHFTCCNPPLEECPEDNWHCQECVIKQNPGFKKT 388
Query: 768 FVEYNANARAAGRIEGVDPFAQMVSRCIR------------------------------- 796
+ + + EG +P + R IR
Sbjct: 389 YNHIGIFGQLLNQSEGRNPKEFQLPRKIRDNSFIGVTTGENGVYQDATFKPEVSYTKMNE 448
Query: 797 -----IVQTPDTELGGCVLCRGRDFCKSRFGR-----RTVILCDQCEREYHVGCLKDHGM 846
+ D ++ G G + + G +T+ CD C +H+ CL
Sbjct: 449 YQIIGFNKNTDLDVDGLYDKNGNPYLCHKCGLSGLKGKTLTHCDYCPLVWHIDCL----- 503
Query: 847 EDLQELPK---GKWLC 859
ED +PK KW C
Sbjct: 504 EDPLCIPKTLGSKWRC 519
>gi|195449007|ref|XP_002071907.1| GK24911 [Drosophila willistoni]
gi|194167992|gb|EDW82893.1| GK24911 [Drosophila willistoni]
Length = 884
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 28/49 (57%), Gaps = 4/49 (8%)
Query: 715 ENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GG+LLCCD CP +FH+ C P IP+G W C C
Sbjct: 51 HNHDFCDACEEGGNLLCCDRCPSSFHLQCHDPPLSEEDIPTGQWLCHSC 99
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 30/44 (68%), Gaps = 1/44 (2%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLPGIPSGTWHCRYC 759
N C +C +GG LLCCDSCP AFH +C+++ IP G W+C C
Sbjct: 1155 NVSWCFVCSEGGSLLCCDSCPAAFHRECLNI-DIPEGNWYCNDC 1197
>gi|75677357|ref|NP_001028733.1| PHD finger protein 12 isoform 1 [Homo sapiens]
gi|71153050|sp|Q96QT6.2|PHF12_HUMAN RecName: Full=PHD finger protein 12; AltName: Full=PHD factor 1;
Short=Pf1
gi|119571550|gb|EAW51165.1| PHD finger protein 12, isoform CRA_b [Homo sapiens]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|410352467|gb|JAA42837.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|158256276|dbj|BAF84109.1| unnamed protein product [Homo sapiens]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|410220518|gb|JAA07478.1| PHD finger protein 12 [Pan troglodytes]
gi|410294710|gb|JAA25955.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|397483106|ref|XP_003812746.1| PREDICTED: PHD finger protein 12 [Pan paniscus]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|345790467|ref|XP_852450.2| PREDICTED: nuclear body protein SP140 [Canis lupus familiaris]
Length = 781
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 64/142 (45%), Gaps = 31/142 (21%)
Query: 646 GEVKFLVGYKKGF--GILCTCCNSE----VSPSQFEAHAGWASRRKPFQHIYTSNGVSLH 699
GEVK ++ YKK G+L C SE +P +FE G A R K ++ G+ L
Sbjct: 507 GEVKGML-YKKKLKQGVLVRCIQSEDGNWFTPREFEVKGGHA-RSKNWRMSVRCGGMPLR 564
Query: 700 ELSIKLSLERP----FSSKE----------------NDDLCGICMDGGDLLCCDSCPRAF 739
L K L P F ++ N D C C DGG+L CCD+C R+F
Sbjct: 565 WLIEKELLHNPPRKYFRKRKTIPKAHNNTLVDPYLGNSDECETCRDGGELFCCDTCSRSF 624
Query: 740 HIDCVSLPGIPS--GTWHCRYC 759
H DC +P + + W C +C
Sbjct: 625 HEDC-HIPPVETERSPWSCTFC 645
>gi|50312393|ref|XP_456230.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49645366|emb|CAG98938.1| KLLA0F25828p [Kluyveromyces lactis]
Length = 623
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 710 PFSSKENDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
P ++KENDD C C G LCCD+CP++FH C + P +P G W C C
Sbjct: 236 PDNNKENDDFCSNCHGPGVFLCCDTCPKSFHFACCNPPLDPSHLPEGDWSCDEC 289
>gi|410267244|gb|JAA21588.1| PHD finger protein 12 [Pan troglodytes]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|291405467|ref|XP_002718962.1| PREDICTED: PHD finger protein 12 [Oryctolagus cuniculus]
Length = 1004
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 28/48 (58%), Gaps = 4/48 (8%)
Query: 716 NDDLCGICMDGGDLLCCDSCPRAFHIDCVSLP----GIPSGTWHCRYC 759
N D C C +GGDLLCCD CP AFH+ C + P +P G W C C
Sbjct: 55 NHDSCDSCKEGGDLLCCDHCPAAFHLQCCNPPLSEEMLPPGEWMCHRC 102
>gi|301623129|ref|XP_002940874.1| PREDICTED: chromodomain-helicase-DNA-binding protein 3 [Xenopus
(Silurana) tropicalis]
Length = 1954
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 718 DLCGICMDGGDLLCCDSCPRAFHIDCVS--LPGIPSGTWHCRYC 759
+ C +C DGG+LLCCD+C ++HI C++ LP IP G W C C
Sbjct: 478 EFCRVCKDGGELLCCDACVSSYHIHCLNPPLPDIPHGEWLCPRC 521
Score = 40.8 bits (94), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 25/40 (62%), Gaps = 5/40 (12%)
Query: 826 VILCDQCEREYHVGCLKDHGMEDLQELPKGKWLCCADCKR 865
+ILCD C R YH+ CL+ +L+ P+GKW C C++
Sbjct: 413 IILCDTCPRAYHLVCLE----PELERAPQGKW-SCPHCEK 447
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.133 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,994,294,595
Number of Sequences: 23463169
Number of extensions: 749906031
Number of successful extensions: 2532114
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3174
Number of HSP's successfully gapped in prelim test: 18269
Number of HSP's that attempted gapping in prelim test: 2303251
Number of HSP's gapped (non-prelim): 128024
length of query: 1088
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 934
effective length of database: 8,745,867,341
effective search space: 8168640096494
effective search space used: 8168640096494
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 83 (36.6 bits)