BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 046120
         (381 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
 gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
          Length = 391

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 238/382 (62%), Positives = 281/382 (73%), Gaps = 20/382 (5%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS----FCHNCFRKTMHS 63
           V V EI+GRGRGLVSTQ L+ GQIVL DSPILLYSALP          +C  CF KT+ S
Sbjct: 11  VRVEEIQGRGRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCF-KTIQS 69

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           +S  CP+CS   FCSP C +AA +SSH+P+VCQ+L+RL +       +  ++RQ+QA+FL
Sbjct: 70  ASVSCPTCSHQRFCSPTCLSAALASSHTPWVCQSLSRLRDCQD-FLQHHSVERQIQAQFL 128

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL----SSLCPPPA----STTIELTAA 174
           +AAYNLA V+PS FQILL+ QG   D D +    L    SSLCPPP     S ++EL AA
Sbjct: 129 VAAYNLAFVSPSDFQILLSLQGRAEDEDPAIVQSLHSVISSLCPPPPIEGFSFSLELIAA 188

Query: 175 LLAKDRLNAFGLMEPYI--EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           L+AKDR NAFGLMEP    E   G+RSVRAYGIY KAS FNHDCLPNACRFDYVD     
Sbjct: 189 LVAKDRFNAFGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVDTNNSG 248

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-DND 291
           NTDI+VRMIHDVPQGREICLSYFPVN +YSTR+KRLL DY F CDCDRCKVEA WS D  
Sbjct: 249 NTDIVVRMIHDVPQGREICLSYFPVNSNYSTRRKRLLEDYGFTCDCDRCKVEATWSDDEG 308

Query: 292 NDDENNEEVMDEDQDEQMVASDDDAEVHGD--TNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
           + D+N+ EVM+ED DE M A  D  E+  D  T+FPHAYFF+RYMC+R+NCWGTLAP PP
Sbjct: 309 DGDDNDNEVMEEDVDEPMEAESDGEEIGNDNSTDFPHAYFFLRYMCNRNNCWGTLAPFPP 368

Query: 350 SDATPSTVMECNVCGNLKNDAI 371
           SDA PS ++ECN CG++KND +
Sbjct: 369 SDAKPSNLLECNACGDIKNDEV 390


>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
           max]
          Length = 419

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 240/420 (57%), Positives = 290/420 (69%), Gaps = 42/420 (10%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--------- 51
           M+A S  + V EI+GRGRG+V++Q LKAGQIVLRDSPILLYSALP +  S          
Sbjct: 3   MAAPSSLLKVEEIQGRGRGMVASQPLKAGQIVLRDSPILLYSALPLVRQSLSSSSSSAST 62

Query: 52  --FCHNCFR-------KTMHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLL 101
             FC +CFR           SS+ +CP+C    FC+  C + A +SSHS +VCQAL+ L 
Sbjct: 63  SCFCDHCFRILSPSLQGDSSSSTVLCPNCRHHCFCNSNCLSNALNSSHSSWVCQALSHL- 121

Query: 102 NLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL---- 157
             NS      PL+ Q+Q  FL+AAYNLA ++PS FQI+L+ QG+  D+  +AA +L    
Sbjct: 122 RANSL-LLEQPLEHQVQVNFLVAAYNLANISPSDFQIMLSLQGSPDDSTIAAAQFLHPLI 180

Query: 158 SSLCP-----PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFF 212
           SSLC      P    ++ELT+A+LAKD+LNAFG+M+P+ E  D +RSVRAYGIY  ASFF
Sbjct: 181 SSLCSLALIGPQNGFSLELTSAILAKDKLNAFGIMQPFSE-HDDQRSVRAYGIYPYASFF 239

Query: 213 NHDCLPNACRFDYVDAAAEN---NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
           NHDCLPNACRFDYVDA   +   NTD I+RMIHDVPQGREICLSYFPVN  YS+RQKRL+
Sbjct: 240 NHDCLPNACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEKYSSRQKRLI 299

Query: 270 YDYDFACDCDRCKVEANWSDNDNDDEN---NEEVMDEDQDEQMVASDDDAEVHGDTN-FP 325
            DY F C+CDRC VE+NWSDND+ ++N    EEVMDEDQ E M ASD D   H D N FP
Sbjct: 300 EDYGFTCNCDRCNVESNWSDNDSVEDNAEEEEEVMDEDQCETMAASDTDDHPHEDNNDFP 359

Query: 326 HAYFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDAI----GREDTVGMDD 381
           HAYFF++YMCDR NCWGTLAPLPP   TP  VMECNVCG LK+D        +D V M+D
Sbjct: 360 HAYFFLKYMCDRTNCWGTLAPLPPQGDTPCNVMECNVCGKLKSDNTFDIDEGQDEVSMED 419


>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
           vinifera]
          Length = 405

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 236/397 (59%), Positives = 282/397 (71%), Gaps = 22/397 (5%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
           S  V   EIEGRGR LV++QSL+ GQI+L DSPILLYSA P  +SS ++C NCFR     
Sbjct: 11  SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70

Query: 64  SSSI-CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           S+ + C SC   FCSP C T A SSSHSP+ C  L+ L    SP  +    +RQ+QARFL
Sbjct: 71  STLVSCSSCPCLFCSPDCLTHALSSSHSPWACLTLSLLRA--SPSLSLSHSERQVQARFL 128

Query: 123 IAAYNLAVVNPSQFQILLAFQGTV-----TDNDTSAAHYLSSLCPPP--ASTTIELTAAL 175
           +AAYNLA+V+PS F ILL+ QG       +D  T     LSSL PP   A  ++ELT AL
Sbjct: 129 VAAYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTAL 188

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           LAKD+LNAFGLMEP      G RSVRAYGIY KASFFNHDCLPNACRFDYVD A+ +NTD
Sbjct: 189 LAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTD 248

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
           I +R+IHDVP+G EICLSYFPVN  Y+ RQKRLL DY F C CDRC+VEANW D+D  +E
Sbjct: 249 ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308

Query: 296 NNE---EVMDEDQDEQMVASDDDAEV---HGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
             +   +VMDEDQDEQM+ S+++ E+    G+ +FPHAYFF+RYMC R+NCWGTLAPLPP
Sbjct: 309 EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGENDFPHAYFFLRYMCTRENCWGTLAPLPP 368

Query: 350 --SDATPSTVMECNVCGNLKN---DAIGREDTVGMDD 381
             SDA+PS +MECNVCGN K    D    ED + +DD
Sbjct: 369 SDSDASPSNLMECNVCGNSKKSDEDFNSDEDRLSVDD 405


>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
          Length = 405

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 236/397 (59%), Positives = 281/397 (70%), Gaps = 22/397 (5%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
           S  V   EIEGRGR LV++QSL+ GQI+L DSPILLYSA P  +SS ++C NCFR     
Sbjct: 11  SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70

Query: 64  SSSI-CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           S+ + C SC   FCSP C T A SSSHSP+ C  L+ L    SP  +    +RQ+QARFL
Sbjct: 71  STLVSCSSCPCLFCSPDCLTXALSSSHSPWACLTLSLLRA--SPSLSLSHSERQVQARFL 128

Query: 123 IAAYNLAVVNPSQFQILLAFQGTV-----TDNDTSAAHYLSSLCPPP--ASTTIELTAAL 175
           +AAYNLA+V+PS F ILL+ QG       +D  T     LSSL PP   A  ++ELT AL
Sbjct: 129 VAAYNLAIVSPSHFHILLSLQGMALPSSDSDAPTFLHSLLSSLSPPQGVAGFSVELTTAL 188

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           LAKD+LNAFGLMEP      G RSVRAYGIY KASFFNHDCLPNACRFDYVD A+ +NTD
Sbjct: 189 LAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACRFDYVDTASHHNTD 248

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
           I +R+IHDVP+G EICLSYFPVN  Y+ RQKRLL DY F C CDRC+VEANW D+D  +E
Sbjct: 249 ITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRCRVEANWKDDDEQEE 308

Query: 296 NNE---EVMDEDQDEQMVASDDDAEV---HGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
             +   +VMDEDQDEQM+ S+++ E+    G  +FPHAYFF+RYMC R+NCWGTLAPLPP
Sbjct: 309 EQDDEGQVMDEDQDEQMIGSENEIEIGDGGGXNDFPHAYFFLRYMCTRENCWGTLAPLPP 368

Query: 350 --SDATPSTVMECNVCGNLKN---DAIGREDTVGMDD 381
             SDA+PS +MECNVCGN K    D    ED + +DD
Sbjct: 369 SDSDASPSNLMECNVCGNSKKXDEDFNSDEDRLSVDD 405


>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
           Full=ASH1-related protein 2; AltName: Full=Protein SET
           DOMAIN GROUP 39
 gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 398

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 215/389 (55%), Positives = 287/389 (73%), Gaps = 30/389 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
           + VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS   +C +CFR    S+ 
Sbjct: 13  LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72

Query: 66  SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
             C SCS V+FCSP C      +SH+P++C++L RL   +S   ++ P DRQ+QARFL++
Sbjct: 73  QKCQSCSLVSFCSPNCF-----ASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLS 127

Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAHYL----SSLCPP-PASTTIEL 171
           AYNLA  +PS FQILL+ QG+ + N        D++AA +L    SS+CP  P S + +L
Sbjct: 128 AYNLAAASPSDFQILLSLQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDL 187

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
           TAALL+KD++NAFGLMEP     + +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++
Sbjct: 188 TAALLSKDKVNAFGLMEP-CSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASD 246

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
            NTDII+RMIHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE +WS+ +
Sbjct: 247 GNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVEFSWSEGE 306

Query: 292 NDDENNEEVMDEDQDEQMV---ASDDDAEVHG-----DTNFPHAYFFMRYMCDRDNCWGT 343
            D+    E M++  +++ +     +++ EV G     ++NFPHAYFF+RYMC+++NC+GT
Sbjct: 307 EDENEIMEEMEDQDEQEEMEDSVGENEEEVCGNGVDDESNFPHAYFFVRYMCEKENCFGT 366

Query: 344 LAPLPPSDATPSTVMECNVCGNLKNDAIG 372
           LAPLPP     S V+ECNVCG++K D +G
Sbjct: 367 LAPLPPKTHDASRVLECNVCGSVKEDEVG 395


>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
 gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
          Length = 379

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 226/377 (59%), Positives = 273/377 (72%), Gaps = 30/377 (7%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---------FCHNCFRKTMHSSSSI 67
           GRG+VS Q L+ G +V+RDSP+LLYSA P   SS          +C NC+RK +HS+   
Sbjct: 12  GRGVVSYQPLRGGDVVVRDSPLLLYSAFPLSRSSITSTPTSTSVYCDNCYRK-IHSAVIC 70

Query: 68  CPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           CP+CS   FCSP C +A   SS++P+VCQAL+RL + +S    + PL+RQ+QARFLIAAY
Sbjct: 71  CPTCSHHKFCSPNCVSA---SSNAPWVCQALSRLHDCSSL-VVHQPLERQVQARFLIAAY 126

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYL----SSLCPPPAST--TIELTAALLAKDR 180
           NL +V+PS FQILL+ QG     D   A +L    SSLCPP      ++ELT+ALLAKD+
Sbjct: 127 NLFLVSPSNFQILLSLQGQGGGGDDEDAQFLHSLISSLCPPTGGVPFSLELTSALLAKDK 186

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           LNAFGLMEP+ +  DG+RSVRAYGIY KA+ FNHDCLPNACRFDYVD     +TD+I+RM
Sbjct: 187 LNAFGLMEPF-DINDGKRSVRAYGIYPKAALFNHDCLPNACRFDYVDT---QDTDLIIRM 242

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN----DDEN 296
           IHDVPQGREICLSYFPVNYDYSTRQKRL  DY F CDCDRCKVEANWSD ++    DDEN
Sbjct: 243 IHDVPQGREICLSYFPVNYDYSTRQKRLREDYGFICDCDRCKVEANWSDQEHDDADDDEN 302

Query: 297 NEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPST 356
             E M+ED +E M    DDA    D++FPHAYFF+RYMCD  NCWGTLAPL  S+    T
Sbjct: 303 ENEAMEEDSEEAMADEKDDAPSDNDSDFPHAYFFLRYMCDGTNCWGTLAPLAQSNVN-IT 361

Query: 357 VMECNVCGNLKNDAIGR 373
           ++ECNVCG +K D +G 
Sbjct: 362 LLECNVCGKIKRDEVGH 378


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 206/373 (55%), Positives = 269/373 (72%), Gaps = 28/373 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
           + VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS   +C +CFR    S+ 
Sbjct: 12  LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 71

Query: 66  SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
             C SCS V+FCSP C      +SH+P++C++L RL + +S   ++ P DRQ+QARFL++
Sbjct: 72  QKCQSCSLVSFCSPNCF-----ASHTPWLCESLLRL-HQSSSAFSDQPSDRQVQARFLLS 125

Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAH-YLSSLCPP-PASTTIELTAA 174
           AYNLA  +PS FQILL+ QG+   N        D+   H  LS++CPP P   + ELTAA
Sbjct: 126 AYNLAAASPSDFQILLSLQGSGCSNGDPSSSATDSGFLHSLLSAVCPPLPVCISPELTAA 185

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           LLAKD++NAFGLMEP+    D +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++ NT
Sbjct: 186 LLAKDKVNAFGLMEPFSVSND-KRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASDGNT 244

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
           DII+R IHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE++WS+ + D+
Sbjct: 245 DIIIRTIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVESSWSEGEEDE 304

Query: 295 ENNEEVMDEDQDEQMVASDDDAEVH--------GDTNFPHAYFFMRYMCDRDNCWGTLAP 346
               E M  + DE+ +   +   V          ++NFPHAYFF+RYMC++++C+GTLAP
Sbjct: 305 NEVMEEMGNEDDEEEMEDSEGENVEEVCGNGVDDESNFPHAYFFVRYMCEKESCFGTLAP 364

Query: 347 LPPSDATPSTVME 359
           LPP     S V+E
Sbjct: 365 LPPKTHDASRVLE 377


>gi|357484667|ref|XP_003612621.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
 gi|355513956|gb|AES95579.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
          Length = 384

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 183/384 (47%), Positives = 245/384 (63%), Gaps = 45/384 (11%)

Query: 1   MSAVSET--VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNC 56
           M+A + T  + +  I  +GRGL++TQ LKAGQI+L +SP+LLYSA P  +   S +CHNC
Sbjct: 28  MAAQTSTSLLRLEHIPNKGRGLIATQDLKAGQIILTESPLLLYSATPLFSHTPSPYCHNC 87

Query: 57  FRKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
           FR    S +  CPSCS   FCS KC + A +SSHS + CQ L+ L N  SP +   P + 
Sbjct: 88  FRTLPPSQTFQCPSCSNYLFCSQKCLSIALNSSHSSWTCQTLSHLQNPTSPLSEK-PCEL 146

Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA--AHYLSSL----CPPPASTTI 169
           Q+QAR ++AAYNLA+  PS+ Q LL+  G   D D+    A +L SL    C P  + + 
Sbjct: 147 QVQARLIVAAYNLAIHTPSKLQTLLSLHGNPNDQDSIVDNAKFLHSLISPFCSPHMNFSA 206

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
           EL A ++AK+RLN+F LMEPY   Q G +RS++AYGIY KA+FFNH C+PNACRFDYV++
Sbjct: 207 ELAAKIIAKERLNSFCLMEPY--SQKGPQRSIKAYGIYQKATFFNHGCIPNACRFDYVES 264

Query: 229 AA---ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
                E+NTDI++R+I DV  G EIC+SYF +N DYSTR+K L+ DY F C+CDRCK+EA
Sbjct: 265 GEPGDEHNTDIVIRLIKDVGVGSEICISYFRINKDYSTRKKILMEDYGFTCECDRCKIEA 324

Query: 286 NWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLA 345
           +W+D +N                             ++ PH  F  +++CD++NC GT+A
Sbjct: 325 SWNDGENKY---------------------------SDLPHVIFLSKFVCDKENCAGTMA 357

Query: 346 PLPPSDATPSTVMECNVCGNLKND 369
           PLPP D   S V+ECN CGNLK D
Sbjct: 358 PLPPKDGEKSNVLECNFCGNLKVD 381


>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
 gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
           thaliana]
 gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
           thaliana]
 gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
          Length = 341

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 176/299 (58%), Positives = 226/299 (75%), Gaps = 22/299 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSS 65
           + VAEI GRGR LV+ QSL+AGQ++LR+SP+LLYSA PF++SS   +C +CFR    S+ 
Sbjct: 13  LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72

Query: 66  SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
             C SCS V+FCSP C      +SH+P++C++L RL   +S   ++ P DRQ+QARFL++
Sbjct: 73  QKCQSCSLVSFCSPNCF-----ASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLS 127

Query: 125 AYNLAVVNPSQFQILLAFQGTVTDN--------DTSAAHYL----SSLCPP-PASTTIEL 171
           AYNLA  +PS FQILL+ QG+ + N        D++AA +L    SS+CP  P S + +L
Sbjct: 128 AYNLAAASPSDFQILLSLQGSGSSNGDPSCSAGDSAAAGFLHSLLSSVCPSLPVSISPDL 187

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
           TAALL+KD++NAFGLMEP     + +RSVRAYGIY K SFFNHDCLPNACRFDYVD+A++
Sbjct: 188 TAALLSKDKVNAFGLMEP-CSVSNEKRSVRAYGIYPKTSFFNHDCLPNACRFDYVDSASD 246

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
            NTDII+RMIHDVP+GRE+CLSYFPVN +YS+RQKRLL DY F CDCDRCKVE +WS+ 
Sbjct: 247 GNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKRLLEDYGFKCDCDRCKVEFSWSEG 305


>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
           angustifolius]
          Length = 357

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 178/381 (46%), Positives = 247/381 (64%), Gaps = 47/381 (12%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCHNCFR 58
           S  + +  I G+G+GLV+ QSLKAGQ++LR+SP+++YS+ P I++SS      +C +CFR
Sbjct: 6   SSLLRLDHIAGKGKGLVAAQSLKAGQVILRESPLIIYSSSPLISTSSPSSLFPYCDHCFR 65

Query: 59  KTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
            T+ +++  CPSCS   FCS KC + A ++ HS  VC AL+ L +  S      P +RQ+
Sbjct: 66  -TLPTNTIPCPSCSYHHFCSHKCFSTAFNTFHSSLVCHALSHLKD--SESLQQQPYERQV 122

Query: 118 QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC----PPPASTTIELTA 173
           QARF++AAYNLA+++PS     L+  GT  D    AA +L SL     PP  + +++LTA
Sbjct: 123 QARFVVAAYNLAIISPSGIHAFLSLHGTPDDTIVEAAKFLHSLISPLFPPSVNISVDLTA 182

Query: 174 ALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA--- 229
            LLAKDR+N+F LM PY    DG +RS++AY IY KAS FNHDC+PNACRFDYVD+A   
Sbjct: 183 QLLAKDRINSFCLMNPY--SPDGPQRSIKAYAIYPKASMFNHDCIPNACRFDYVDSADLD 240

Query: 230 -AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
              NNTD+++RMI D+P+GRE+C+SYF ++ DY TR++ L+ DY F+C+CDRCK+EANW 
Sbjct: 241 DEHNNTDMVIRMIQDLPEGREVCISYFRISRDYCTRKRILMDDYGFSCECDRCKIEANWP 300

Query: 289 DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLP 348
              +D +N  E                      ++ PH  F  +Y+CDR NC GTLA   
Sbjct: 301 ---HDCQNYVEEY--------------------SDLPHVRFIAKYVCDRKNCNGTLA--- 334

Query: 349 PSDATPSTVMECNVCGNLKND 369
           P D   + V+ECN CGNLK+D
Sbjct: 335 PKDDAHTNVLECNFCGNLKSD 355


>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
           [Brachypodium distachyon]
 gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
           [Brachypodium distachyon]
          Length = 389

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 159/388 (40%), Positives = 225/388 (57%), Gaps = 39/388 (10%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
             + + VA++ GRGRGLV+ + +  G+++L + PILLY +    +  S+C  CFR    +
Sbjct: 2   AGDALRVADLPGRGRGLVAARDILEGEVLLSEPPILLYPS-SLASLPSYCSACFRCLPQA 60

Query: 64  S-SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
             ++ CPSC + AFCSP C    A++SH   +C AL+RL        A  P   Q Q  F
Sbjct: 61  PHAAPCPSCRAAAFCSPAC----AAASHPRLLCAALSRL--------AAAPESHQEQLLF 108

Query: 122 LIAAYNLAVVNPSQFQILLAF----QGTVTDNDTSAAHYLSSLCPP---PASTTIELTAA 174
           L++AY+L       F  LL+     QGT   +  S    +SSL PP   PA  + +LTAA
Sbjct: 109 LLSAYSL---QEPAFHALLSLSSAPQGTQQQDAASLHAMVSSLAPPHMLPAGFSPDLTAA 165

Query: 175 LLAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
           LL+KDR N+F +MEPY  +   G R  RAY +Y +AS  NHDCLPNAC FDY D     N
Sbjct: 166 LLSKDRTNSFSIMEPYRPDVAQGLRKARAYAVYHRASLLNHDCLPNACHFDYPDRPGPGN 225

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS----- 288
           TDI++R +H +  G E+ +SYF  N+ Y+ RQ+RLL DY F C+C+RC++E+ W      
Sbjct: 226 TDIVLRALHGITAGMEVRISYFAANWRYADRQRRLLEDYGFRCECERCQIESKWKFDDDE 285

Query: 289 -------DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCW 341
                  D+  ++E+++E +    DE M   +     + + +FPHA+FF+R+MCDR++C 
Sbjct: 286 NDGGDGDDDTMEEEHDKEDVGNGGDEGMEQEEGSDGDNDNDDFPHAFFFVRFMCDREDCH 345

Query: 342 GTLAPLPP-SDATPSTVMECNVCGNLKN 368
           G LAPLPP      S V ECNVC  LK 
Sbjct: 346 GMLAPLPPLPSGELSHVFECNVCEQLKK 373


>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
          Length = 404

 Score =  278 bits (710), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/400 (41%), Positives = 234/400 (58%), Gaps = 28/400 (7%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M+A  + + VA++ GRGRGLV+ ++++ G+++L + P+LLY +    +  S+C  CFR  
Sbjct: 10  MAAAGDVLRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPST-LSSLRSYCSACFRSL 68

Query: 61  MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQ 118
             +++  C SC + AFCSP C    A++SH   +C AL+         AA  P +  Q  
Sbjct: 69  PPAATIPCVSCRAAAFCSPAC----AAASHPHLLCAALSHGGGAGL--AAAAPTEAIQEP 122

Query: 119 ARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTA 173
             FL++AY+L      S   +  A        D +  H   +   P    P   + +LTA
Sbjct: 123 LLFLLSAYSLPESTMRSMLSLSSAPPPPPGAQDAAGLHAAVAALAPPHMLPQGLSPDLTA 182

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           ALLAKDR N+F +MEPY  G        RAY +Y +AS FNHDCLPNAC FDY D     
Sbjct: 183 ALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPG 242

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
           NTDI+VR +HD+P+GRE+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+D+
Sbjct: 243 NTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDDDD 302

Query: 293 DDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
           ++ ++ +   E++ E  V             D A   GD +FPHAYFF+RY+CD + CWG
Sbjct: 303 NNGDDGDDTMEEEGEDDVGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVRYLCDNEGCWG 362

Query: 343 TLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
            LAPLPPS +   S V ECN+CG L+   D I  E T GM
Sbjct: 363 MLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 402


>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
 gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
          Length = 404

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 165/402 (41%), Positives = 234/402 (58%), Gaps = 34/402 (8%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
           +A  + + VA++ GRGRGLV+ ++++ G+++L + P+LLY +    + +S+C  CFR   
Sbjct: 11  AAAGDALRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPST-LSSLASYCSACFRSLP 69

Query: 62  HSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQA 119
            +++  C SC + AFCSP C    A++SH   +C AL+         AA  P +  Q   
Sbjct: 70  AAATVPCASCRAAAFCSPAC----AAASHPRLLCAALSHGGGAGV--AAAAPTEAIQEPL 123

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIEL 171
            FL++AY+L     S    +L+          +        A   L+     P   + +L
Sbjct: 124 LFLLSAYSLPE---STIHSILSLSSAPPPPPGAQDAAGLHAAVAALAPPHMLPQGFSPDL 180

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
           TAALLAKDR N+F +MEPY  G        RAY +Y +AS FNHDCLPNAC FDY D   
Sbjct: 181 TAALLAKDRANSFAIMEPYRVGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPG 240

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
             NTDI+VR +HD+P+GRE+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+
Sbjct: 241 PGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDD 300

Query: 291 DND----------DENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNC 340
           D+D          +E  E+   +  D+ M   + D    G+ +FPHAYFF+RY+CD + C
Sbjct: 301 DDDNNGGDGDDTMEEEGEDDAGDRGDDGMEEEEGDGATDGNDDFPHAYFFVRYLCDNEGC 360

Query: 341 WGTLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
           WG LAPLPPS +   S V ECN+CG L+   DA+  E T GM
Sbjct: 361 WGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDAMPDEGTSGM 402


>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
          Length = 392

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 160/384 (41%), Positives = 227/384 (59%), Gaps = 26/384 (6%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
             + +  A++ GRGRGL++ +S++ G+++L + P+LLY A    +  SFC  CFR ++ +
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPAS-LASLPSFCSACFR-SLSA 59

Query: 64  SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           ++S CPSC  A FCSP C    A++SH   +C AL+      +  +A  P    L   FL
Sbjct: 60  AASPCPSCRAAGFCSPSC----AAASHPRLLCTALSGGGGNGNLASAAEPHQEPLL--FL 113

Query: 123 IAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPP---PASTTIELTAALLA 177
           ++AY+L          +  A     +D D  + H + ++L PP   P   + +LTAALL+
Sbjct: 114 LSAYSLPEPSLRVLLSLSSAATPPPSDQDPGSLHAMVAALVPPQMLPPGFSPDLTAALLS 173

Query: 178 KDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           KDR N+F +MEPY  E     R  RAY +Y +AS  NHDCLPNAC FDY D     NTDI
Sbjct: 174 KDRTNSFSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDI 233

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           +VR +HD+ +GRE+CLSYF  N+ Y  RQ+RLL DY F C+C+RC+VE+ W  +D+ D  
Sbjct: 234 VVRALHDITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGG 293

Query: 297 NEEVM----------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAP 346
           + +             E  D+ M   + D     D +FPH+YFF+RY+C+   C+G LAP
Sbjct: 294 DGDDTMEEEEEDGNGGEGGDDGMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAP 353

Query: 347 LPP-SDATPSTVMECNVCGNLKND 369
           LPP  +  PS V ECNVCGNLKN+
Sbjct: 354 LPPLPNGEPSHVFECNVCGNLKNE 377


>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
          Length = 392

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 160/384 (41%), Positives = 227/384 (59%), Gaps = 26/384 (6%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
             + +  A++ GRGRGL++ +S++ G+++L + P+LLY A    +  SFC  CFR ++ +
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPAS-LSSLPSFCSACFR-SLSA 59

Query: 64  SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           ++S CPSC  A FCSP C    A++SH   +C AL+      +  +A  P    L   FL
Sbjct: 60  AASPCPSCRAAGFCSPSC----AAASHPRLLCTALSGGGGNGNLASAAEPHQEPLL--FL 113

Query: 123 IAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPP---PASTTIELTAALLA 177
           ++AY+L          +  A      D D  + H + ++L PP   P   + +LTAALL+
Sbjct: 114 LSAYSLPEPSLRVLLSLSSAATPPPRDQDPGSLHAMVTALVPPQMLPPGFSPDLTAALLS 173

Query: 178 KDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           KDR N+F +MEPY  E     R  RAY +Y +AS  NHDCLPNAC FDY D     NTDI
Sbjct: 174 KDRTNSFSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDI 233

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           +VR +HD+ +GRE+CLSYF  N+ Y  RQ+RLL DY F C+C+RC+VE+ W  +D+ D  
Sbjct: 234 VVRALHDITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGG 293

Query: 297 NEEVM----------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAP 346
           + +             E  D++M   + D     D +FPH+YFF+RY+C+   C+G LAP
Sbjct: 294 DGDDTMEEEEEDGNGGEGGDDEMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAP 353

Query: 347 LPP-SDATPSTVMECNVCGNLKND 369
           LPP  +  PS V ECNVCGNLKN+
Sbjct: 354 LPPLPNGEPSHVFECNVCGNLKNE 377


>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
 gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
          Length = 404

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 166/400 (41%), Positives = 232/400 (58%), Gaps = 28/400 (7%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M+A  + + VA++ GRGRGLV+ + ++ G+++  + P+LLY +    +  S+C  CFR  
Sbjct: 10  MAAAGDVLRVADLPGRGRGLVAARIVREGEVLFSEPPVLLYPST-LSSLRSYCSACFRSL 68

Query: 61  MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQ 118
             +++  C SC + AFCSP C    A++SH   +C AL+         AA  P +  Q  
Sbjct: 69  TPAATIPCVSCRAAAFCSPAC----AAASHPHLLCAALSHGGGAGL--AAAAPTEAIQEP 122

Query: 119 ARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTA 173
             FL++AY+L      S   +  A        D +  H   +   P    P   + +LTA
Sbjct: 123 LLFLLSAYSLPESTMRSMLSLSSAPPPXPGAQDAAGLHAAVAALAPPHMLPQGLSPDLTA 182

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           ALLAKDR N+F +MEPY  G        RAY +Y +AS FNHDCLPNAC FDY D     
Sbjct: 183 ALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPG 242

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
           NTDI+VR +HD+P+GRE+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC+VE+ W D+D+
Sbjct: 243 NTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQVESQWKDDDD 302

Query: 293 DDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
           ++ ++ +   E++DE                D A   GD +FPHAYFF+RY+CD + CWG
Sbjct: 303 NNGDDGDDTMEEEDEDDAGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVRYLCDNEGCWG 362

Query: 343 TLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
            LAPLPPS +   S V ECN+CG L+   D I  E T GM
Sbjct: 363 MLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 402


>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 377

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 151/377 (40%), Positives = 214/377 (56%), Gaps = 26/377 (6%)

Query: 13  IEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSI-CPS 70
           I GRGRG+V+T+ L+AG ++LR+SP+LLY       S++FC NC R  ++     + C +
Sbjct: 6   IAGRGRGMVATRMLRAGAVLLRESPLLLYQEAESAESTAFCSNCMRVLSLPGREKVPCSA 65

Query: 71  CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
           C  A FC   C  +A   SH+  VC  +  L    S D     L+ +  ARFLIAAYNLA
Sbjct: 66  CDGATFCGKSCFESARVGSHNTRVCSGIAMLQK--SKD--KFSLEHRTLARFLIAAYNLA 121

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHY-----LSSLCPPPAST--TIELTAALLAKDRLN 182
           V +P  F  LL  +G   D +   + +     +       AS   ++ELT  LLA+D  N
Sbjct: 122 VEDPLAFAKLLELEGQECDEEVVKSLHDFVVEIFGKLDGGASLELSLELTGKLLARDACN 181

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
            FGLM P   G++  R VR Y ++ +AS FNHDCLPNACRF+YVD   + NTD+IVR +H
Sbjct: 182 TFGLMAPSCAGEE--RKVRGYAMFAQASMFNHDCLPNACRFEYVDIDGDGNTDVIVRALH 239

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-------E 295
           D+ +G E+CLSYFPV++ Y  RQ++L  +Y F C C RC VE+ W D D +         
Sbjct: 240 DMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARCNVESKWKDEDEEQDEEDEGYS 299

Query: 296 NNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPS 355
           N E  + + +         D +   + +F HA FF++Y+C  + C GT+APLPP +    
Sbjct: 300 NGELELGKGEATLEDVESIDDDDDDEDDFGHAMFFVKYLCPVEGCGGTMAPLPPGELDHQ 359

Query: 356 TV---MECNVCGNLKND 369
            +   MECN CG +++D
Sbjct: 360 GLEGSMECNFCGCVRSD 376


>gi|255644430|gb|ACU22720.1| unknown [Glycine max]
          Length = 381

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 145/295 (49%), Positives = 200/295 (67%), Gaps = 22/295 (7%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS----FCHNCFR---KTMH 62
           + +I G+GRGLV++Q LKAGQIVL +SP++LYSA P +  SS    +C +CFR    T +
Sbjct: 1   MEQIPGKGRGLVASQPLKAGQIVLTESPLILYSASPLLTPSSSPYTYCDHCFRILPLTHN 60

Query: 63  SSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL-NLNSPDAANLPLDRQLQAR 120
           S++  CPSCS  +FCS KC + A  SSHS +VC+AL  L  + NS      P +RQ+QAR
Sbjct: 61  STTVTCPSCSNHSFCSQKCFSLALKSSHSTWVCKALMSLQQHPNSTLLQQHPQERQVQAR 120

Query: 121 FLIAAYNLAVVN--PSQFQILLAFQGTVTDNDTSAAHYLSSLC----PPPASTTIELTAA 174
            ++A++ L + N  PS+    L+  GT  D    AA++L SL     PP A  +++L A 
Sbjct: 121 LIVASHKLFLHNHTPSELDTFLSLHGTPDDAILDAANFLHSLISPLFPPQAQLSVDLIAQ 180

Query: 175 LLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA---- 229
           LLAKDRLN+FGLM+PY    DG +RS++AY IY KA+FFNHDC+PNACRFDYVD+     
Sbjct: 181 LLAKDRLNSFGLMDPY--SPDGPQRSIKAYAIYPKATFFNHDCVPNACRFDYVDSTNDDY 238

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             N+TDI++R+I DV +G+E+C+SYF +  DY TR++ L+ DY F C CDRC ++
Sbjct: 239 ERNSTDIVIRLIEDVDEGKEVCISYFRIGRDYCTRKRILMEDYGFTCGCDRCTLK 293


>gi|326511411|dbj|BAJ87719.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 160/395 (40%), Positives = 228/395 (57%), Gaps = 35/395 (8%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
            +T+ VAE+ GRGR L++ + +  G+++L +SPILLY +    + SS+C  CFR      
Sbjct: 3   GDTLRVAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPP 61

Query: 65  SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
            + CPSC + AFCSP C    A++SH   +C AL+  L   +P+A   PL       FL+
Sbjct: 62  HTPCPSCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLL 110

Query: 124 AAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALL 176
           +AY+L   + S    L +  QG        AA     ++S+ PP   PA  +++LTAALL
Sbjct: 111 SAYSLQEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALL 170

Query: 177 AKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           +KDR N+F ++EPY        R  R   +Y +AS  NHDCLPNAC FDY D     NTD
Sbjct: 171 SKDRTNSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTD 230

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS------- 288
           ++VR +H + +G E+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC++E+ W        
Sbjct: 231 MVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDG 290

Query: 289 ----DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
               D   D+E+ +E  ++  DE M    ++     D +FPHA+FF+RY+CDRD+C+G L
Sbjct: 291 ADGDDTMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGML 348

Query: 345 APLPP-SDATPSTVMECNVCGNLKNDAIGREDTVG 378
           APLPP  +   S V ECN CG LK +    E   G
Sbjct: 349 APLPPLPNGELSHVFECNACGQLKKEEDEDEPDAG 383


>gi|326511387|dbj|BAJ87707.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 160/395 (40%), Positives = 228/395 (57%), Gaps = 35/395 (8%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
            +T+ VAE+ GRGR L++ + +  G+++L +SPILLY +    + SS+C  CFR      
Sbjct: 3   GDTLRVAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPP 61

Query: 65  SSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
            + CPSC + AFCSP C    A++SH   +C AL+  L   +P+A   PL       FL+
Sbjct: 62  HTPCPSCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLL 110

Query: 124 AAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALL 176
           +AY+L   + S    L +  QG        AA     ++S+ PP   PA  +++LTAALL
Sbjct: 111 SAYSLQEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALL 170

Query: 177 AKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           +KDR N+F ++EPY        R  R   +Y +AS  NHDCLPNAC FDY D     NTD
Sbjct: 171 SKDRTNSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTD 230

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS------- 288
           ++VR +H + +G E+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC++E+ W        
Sbjct: 231 MVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDG 290

Query: 289 ----DNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
               D   D+E+ +E  ++  DE M    ++     D +FPHA+FF+RY+CDRD+C+G L
Sbjct: 291 ADGDDTMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGML 348

Query: 345 APLPP-SDATPSTVMECNVCGNLKNDAIGREDTVG 378
           APLPP  +   S V ECN CG LK +    E   G
Sbjct: 349 APLPPLPNGELSHVFECNACGQLKKEEDEDEPDAG 383


>gi|326508652|dbj|BAJ95848.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 383

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 159/390 (40%), Positives = 225/390 (57%), Gaps = 35/390 (8%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           VAE+ GRGR L++ + +  G+++L +SPILLY +    + SS+C  CFR       + CP
Sbjct: 2   VAELPGRGRALLAARDILEGEVLLSESPILLYPS-SLASLSSYCSACFRSLPPPPHTPCP 60

Query: 70  SC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
           SC + AFCSP C    A++SH   +C AL+  L   +P+A   PL       FL++AY+L
Sbjct: 61  SCRAAAFCSPAC----AAASHPRLLCAALSSGLA-AAPEAHQEPL------LFLLSAYSL 109

Query: 129 AVVNPSQFQILLAF-QGTVTDNDTSAAHY---LSSLCPP---PASTTIELTAALLAKDRL 181
              + S    L +  QG        AA     ++S+ PP   PA  +++LTAALL+KDR 
Sbjct: 110 QEPSLSAILSLSSAPQGPSPSQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALLSKDRT 169

Query: 182 NAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           N+F ++EPY        R  R   +Y +AS  NHDCLPNAC FDY D     NTD++VR 
Sbjct: 170 NSFSILEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGNTDMVVRA 229

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-----------D 289
           +H + +G E+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC++E+ W            D
Sbjct: 230 LHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRCQIESKWKLDDDNDGADGDD 289

Query: 290 NDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP 349
              D+E+ +E  ++  DE M    ++     D +FPHA+FF+RY+CDRD+C+G LAPLPP
Sbjct: 290 TMEDEEHGKEDAEDGGDEGM--EQEEGSDDDDDDFPHAFFFVRYLCDRDDCYGMLAPLPP 347

Query: 350 -SDATPSTVMECNVCGNLKNDAIGREDTVG 378
             +   S V ECN CG LK +    E   G
Sbjct: 348 LPNGELSHVFECNACGQLKKEEDEDEPDAG 377


>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
          Length = 268

 Score =  224 bits (570), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 116/230 (50%), Positives = 150/230 (65%), Gaps = 14/230 (6%)

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR-SVRAYGIYLKASFFNHDCLPNACR 222
           P   + +LTAALLAKDR N+F +MEPY  G        RAY +Y +AS FNHDCLPNAC 
Sbjct: 37  PQGLSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARAYAVYPRASLFNHDCLPNACH 96

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           FDY D     NTDI+VR +HD+P+GRE+C+SYF  N+ Y+ RQ+RLL DY F C+CDRC+
Sbjct: 97  FDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRCQ 156

Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVA----------SDDDAEVHGDTNFPHAYFFMR 332
           VE+ W D+D+++ ++ +   E++ E  V             D A   GD +FPHAYFF+R
Sbjct: 157 VESQWKDDDDNNGDDGDDTMEEEGEDDVGDRGDDGIEEEEGDGATNGGDDDFPHAYFFVR 216

Query: 333 YMCDRDNCWGTLAPLPPS-DATPSTVMECNVCGNLKN--DAIGREDTVGM 379
           Y+CD + CWG LAPLPPS +   S V ECN+CG L+   D I  E T GM
Sbjct: 217 YLCDNEGCWGMLAPLPPSPNGDLSHVFECNLCGKLRKEEDVIPDEGTSGM 266


>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
 gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
          Length = 331

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 140/378 (37%), Positives = 195/378 (51%), Gaps = 75/378 (19%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
             + +  A++ GRGRGL++ +S++ G+++L + P+LL                +  ++ S
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPLLL----------------YPASLAS 45

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
             S C +C  +       +AAAS   S        R     SP  A     R       +
Sbjct: 46  LPSFCSACFRSL------SAAASPCPS-------CRAAGFCSPSCAAASHPR-------L 85

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
            A   A+V P   Q+L                        P   + +LTAALL+KDR N+
Sbjct: 86  HAMVAALVPP---QML------------------------PPGFSPDLTAALLSKDRTNS 118

Query: 184 FGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           F +MEPY  E     R  RAY +Y +AS  NHDCLPNAC FDY D     NTDI+VR +H
Sbjct: 119 FSIMEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALH 178

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM- 301
           D+ +GRE+CLSYF  N+ Y  RQ+RLL DY F C+C+RC+VE+ W  +D+ D  + +   
Sbjct: 179 DITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERCQVESKWKQDDDSDGGDGDDTM 238

Query: 302 ---------DEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP-SD 351
                     E  D+ M   + D     D +FPH+YFF+RY+C+   C+G LAPLPP  +
Sbjct: 239 EEEEEDGNGGEGGDDGMEQEEGDGGSDSDDDFPHSYFFVRYLCNHGECYGMLAPLPPLPN 298

Query: 352 ATPSTVMECNVCGNLKND 369
             PS V ECNVCGNLKN+
Sbjct: 299 GEPSHVFECNVCGNLKNE 316


>gi|302774507|ref|XP_002970670.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
 gi|300161381|gb|EFJ27996.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
          Length = 292

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 120/262 (45%), Positives = 168/262 (64%), Gaps = 16/262 (6%)

Query: 118 QARFLIAAYNLAVVNPSQFQ--ILLAFQGTVTDNDTSAAH-------YLSSLCPPPA-ST 167
           QARFL+AA +LA  N   FQ  +LL  Q +V  ND   A         L + C   A   
Sbjct: 20  QARFLLAAVSLAARNAEDFQRLLLLDGQASVFANDPRPAGVAELVLALLRTHCFELAKGL 79

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           ++E+   LLAKD++NAFGLM P       +R VRAY  Y +AS  NHDCLPNACRF+Y+D
Sbjct: 80  SLEMFLNLLAKDKINAFGLMAP--SSGTAQRKVRAYARYAQASMLNHDCLPNACRFEYLD 137

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
               +NTDI +R++HDVPQG EIC+SYFPVN++Y  R++RL+ DY F C+C+RC+VE  W
Sbjct: 138 KPGASNTDIYIRLLHDVPQGSEICISYFPVNWNYKERRERLVEDYGFECNCERCRVEQTW 197

Query: 288 SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPL 347
           SD D + +  E +  +  D      +++ +  GD +FPHA FF++Y+C  ++C GT+APL
Sbjct: 198 SDGDGEGDEEEAMEHDQTDGDGEDEEEEMDKDGDGDFPHAMFFVKYLCPEESCGGTMAPL 257

Query: 348 PPSDATPSTVMECNVCGNLKND 369
           P +    + +MECNVCG L+++
Sbjct: 258 PGN----ANLMECNVCGRLRSE 275


>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
          Length = 327

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 93/128 (72%), Positives = 103/128 (80%), Gaps = 3/128 (2%)

Query: 163 PP---ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
           PP   A  ++ELT ALLAKD+LNAFGLMEP      G RSVRAYGIY KASFFNHDCLPN
Sbjct: 104 PPQGVAGFSVELTTALLAKDKLNAFGLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPN 163

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
           ACRFDYVD A+ +NTDI +R+IHDVP+G EICLSYFPVN  Y+ RQKRLL DY F C CD
Sbjct: 164 ACRFDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCD 223

Query: 280 RCKVEANW 287
           RC+VEANW
Sbjct: 224 RCRVEANW 231



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/69 (56%), Positives = 48/69 (69%), Gaps = 2/69 (2%)

Query: 5  SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHS 63
          S  V   EIEGRGR LV++QSL+ GQI+L DSPILLYSA P  +SS ++C NCFR     
Sbjct: 11 SGLVKTVEIEGRGRALVASQSLRGGQIILTDSPILLYSAHPLSSSSNAYCSNCFRHLQTC 70

Query: 64 SSSI-CPSC 71
          S+ + C SC
Sbjct: 71 STLVSCSSC 79


>gi|302771838|ref|XP_002969337.1| hypothetical protein SELMODRAFT_410322 [Selaginella moellendorffii]
 gi|300162813|gb|EFJ29425.1| hypothetical protein SELMODRAFT_410322 [Selaginella moellendorffii]
          Length = 172

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 78/159 (49%), Positives = 114/159 (71%), Gaps = 4/159 (2%)

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
             NHDCLPNACRF+Y+D    +NTDI +R++HDVPQG EIC+SYFPVN++Y  R++RL+ 
Sbjct: 1   MLNHDCLPNACRFEYLDKPGASNTDIYIRLLHDVPQGSEICVSYFPVNWNYKERRERLVE 60

Query: 271 DYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFF 330
           DY F C+C+RC+VE  WSD D + +  E +  +  D   V  +++ +  GD +FPHA FF
Sbjct: 61  DYGFECNCERCRVEQTWSDGDGEGDEEEAMEHDQTDGDGVDEEEEMDKDGDGDFPHAMFF 120

Query: 331 MRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
           ++Y+C  ++C GT+APLP +    + +MECNVCG L+++
Sbjct: 121 VKYLCPEESCGGTMAPLPGN----ANLMECNVCGRLRSE 155


>gi|384249285|gb|EIE22767.1| hypothetical protein COCSUDRAFT_42390 [Coccomyxa subellipsoidea
           C-169]
          Length = 329

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 75/201 (37%), Positives = 114/201 (56%), Gaps = 14/201 (6%)

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR---FDYV 226
           E  A  LAK+ LN +G+M P   G +G R +R   +Y KAS  NH+CLPN  R   FD  
Sbjct: 113 EEVADALAKEELNGYGIMAP--SGPEGERRIRGTALYPKASLLNHECLPNVARVDNFDEA 170

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           D  A  NT +  + +H++P G E   SYFP++  Y  RQ+R    Y FAC+C RCK E+ 
Sbjct: 171 DVNAPENTAVHFKALHNLPAGEEFTQSYFPMHTSYHVRQQRCQDQYGFACNCPRCKEEST 230

Query: 287 W-SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPH--AYFFMRYMCDRDNCWGT 343
           W SD++ D+  +EE M+ +++   +   ++AE  GD         F ++Y+C + +C+GT
Sbjct: 231 WPSDDEADNLGDEEQMEVEREAGSMG--EEAEQDGDRADAGYIQVFLLKYVCPQTDCFGT 288

Query: 344 LAPLPPSDATPSTVMECNVCG 364
           + P+       S++  CN+CG
Sbjct: 289 MVPIQAG----SSLHVCNMCG 305


>gi|307105392|gb|EFN53641.1| hypothetical protein CHLNCDRAFT_136351 [Chlorella variabilis]
          Length = 430

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 112/223 (50%), Gaps = 31/223 (13%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-- 232
           LL K+  N +G+M P     DG R +R   +Y +AS  NH+CLPN  RFD  DA++    
Sbjct: 202 LLRKEATNGYGVMAP--SAPDGERRIRGTALYAQASLINHECLPNVARFDRFDASSAPAS 259

Query: 233 -------NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
                  NT +  R +HD+P G E+  SYFP+ +  + RQ+R   DY F C C RCK EA
Sbjct: 260 SAPHPGANTAVEFRALHDIPAGEELTQSYFPLPWPLADRQQRCQEDYGFQCTCPRCKEEA 319

Query: 286 NWSDNDNDDENNEEVM----------------DEDQDEQMVASDDDAEVHGDTNFPH--A 327
            WSD++ + + +  +                    Q +        AE  G  + P   +
Sbjct: 320 TWSDSEWEADGDAGMEDALAAEGAADGAACGHHSHQHDAPGGGPAAAEQEGGAD-PACIS 378

Query: 328 YFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDA 370
            F ++Y+C  + C+GTLAPLPP  A      ECN+CG  +++A
Sbjct: 379 LFLLKYVCPAEGCYGTLAPLPPG-AGAGDQYECNMCGRRRSEA 420


>gi|302835433|ref|XP_002949278.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
           nagariensis]
 gi|300265580|gb|EFJ49771.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
           nagariensis]
          Length = 467

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 140/357 (39%), Gaps = 97/357 (27%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
           +GR L +T+ +K G+ VL ++P+LL  A     + S C  C R         C  C  A 
Sbjct: 23  KGRSLYTTRLVKGGETVLDEAPLLLLVAPE--TAQSTCIACLRAVRPQECIECSVCRQAC 80

Query: 75  FCSPKCSTAAASSS--HSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
           FCS  C   A  +   HS  +C A  RL          LP ++Q Q RFL+ A  L    
Sbjct: 81  FCSATCQQTAQGTPWLHSQPLCSAYARLAMFG------LPPEQQSQLRFLLHALALRHAG 134

Query: 133 PSQ---------------------------FQILLAFQGTVTDNDTSAAHYLSSLCPP-- 163
            ++                           F  L    G  T+ + + A  L+ L     
Sbjct: 135 TAEAGELGAGPQPLGKTPIGRHNQEKGQRRFAALSCLVGEATEAERAVAARLAPLLVEAL 194

Query: 164 ----------PASTTIELTAALLAKDRLNAFGLME-PYI--------------------- 191
                     P    +E  A LL K++LN++G++  PY                      
Sbjct: 195 RDLQGVGGSLPIPWDVEELARLLRKEQLNSYGVLAAPYQYQHHHHHHHHGGTAAPGPGCS 254

Query: 192 -EGQDGR------------------------RSVRAYGIYLKASFFNHDCLPNACRFDYV 226
            EG +G+                        R +R   +Y +AS  NH+CLPN  RFD  
Sbjct: 255 GEGGNGQTAPKSSCGAGCSGGGSGGEAGDGERRLRGSALYAQASLINHECLPNVARFDDF 314

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
           D+   + T +  R +HD+P G E+  SY P+++D + RQ +    Y FAC C RC+ 
Sbjct: 315 DSDLPDRTHVTFRALHDLPPGTELAQSYVPLHWDLAERQAQCREVYGFACTCPRCQT 371


>gi|255083056|ref|XP_002504514.1| set domain protein [Micromonas sp. RCC299]
 gi|226519782|gb|ACO65772.1| set domain protein [Micromonas sp. RCC299]
          Length = 462

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 120/461 (26%), Positives = 177/461 (38%), Gaps = 113/461 (24%)

Query: 7   TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS 66
            V   EI GRG GLV+ + + AG+ V+ +  +L+  +  F      C +C     H +++
Sbjct: 5   AVAQREIPGRGVGLVAARDIDAGETVVAEHALLVGVSEEF--RRVCCASCL---AHGAAT 59

Query: 67  ICPSCSVAFCSPKCSTAAA----SSSHSPYVCQALTRLLN-----LNSPDAANLPL---- 113
            C  C        C   A+     + H   VC A  R+ +     L++PD   L      
Sbjct: 60  PCSGCGEVVLCDTCRDPASPWHHGAGHGAVVCAA-ERVASAWRPPLSAPDRERLRFLGAC 118

Query: 114 -----------DRQLQARFLIAAYNLAVVNPSQFQI---------------LLAFQGTV- 146
                      D + +AR          V P+   I                 A Q T+ 
Sbjct: 119 AELRRAHVVNGDARARARLDAVLRLCPDVGPAGEGIDPREASAAERVRPALEAAVQATIA 178

Query: 147 -TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI--------EGQDGR 197
            T+ +T+    L ++    A   IE  AALLAK+  NAFG+M            +G D  
Sbjct: 179 TTEANTAGGGLLRAI----AGDAIE-NAALLAKETKNAFGVMAATRGGDGSHPNDGSDDD 233

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE-----------------NNTDIIVRM 240
           R VR   +Y  AS  NH C PN  RFD  D                       T++ +  
Sbjct: 234 RRVRGGAVYELASRVNHACFPNVARFDNFDGVLSSPSHEFYVPHRDGPNPPGPTELRLVA 293

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
           I  +P G E+ +SY PV+   + R++RL   + F C+C+RC +E  W+  D     +E  
Sbjct: 294 IDKIPAGCEVLMSYLPVSESCARRRRRLRNTFGFGCECERCVIECAWATEDGAATGDEGA 353

Query: 301 MDEDQDEQMVASDDDAEVHGDT------------------------------NFP--HAY 328
             +       A D+ AE+  D                                 P  +A 
Sbjct: 354 YRDASRRAREARDEAAELEADAIADALDRECDEKEEVANQSLDAAELTRRADRIPPTYAL 413

Query: 329 FFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
           +F+R MC  D C GTLAP P + A     M CN CG  ++D
Sbjct: 414 WFVRNMCPVDGCGGTLAP-PHARADH---MTCNYCGERRSD 450


>gi|357438875|ref|XP_003589714.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
 gi|355478762|gb|AES59965.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
          Length = 130

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 64/129 (49%), Positives = 81/129 (62%), Gaps = 15/129 (11%)

Query: 211 FFNHDCLPNACRFDYVD-------AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           FF HDC PNACRFD+V+       AA+ NNTDII+RM HDV QG EICLSYF VN  YS+
Sbjct: 4   FFKHDCFPNACRFDHVNVNPPHHAAASPNNTDIIIRMTHDVRQGMEICLSYFVVNKTYSS 63

Query: 264 RQKR-LLYDYDFACDCD-RCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGD 321
           RQK+ LL DY F C+C+  C  +A   +N  +DE    VMD++++EQ         +H  
Sbjct: 64  RQKKILLEDYCFKCNCESNCSDDARVEENVEEDEQEVMVMDDEEEEQC------ENIHQT 117

Query: 322 TNFPHAYFF 330
              PH +FF
Sbjct: 118 LTIPHIFFF 126


>gi|303281140|ref|XP_003059862.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458517|gb|EEH55814.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 116/515 (22%), Positives = 175/515 (33%), Gaps = 173/515 (33%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS-SSS 66
           + +A+I GRG+G+V+T+ ++ G+ VL +   +  + +   + ++ C  C R+      + 
Sbjct: 17  IALADIPGRGKGVVATREIRPGECVLEERATM--TGVSESHRATTCAGCLRRAARCVRTR 74

Query: 67  ICPSCSVAFCSPKCSTAA-----ASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQ 118
            C  C        C  AA     A  +       A TR+        A+  L   DR+ +
Sbjct: 75  ACERCGEVLLCDDCHRAAELAFGADDARLAMAPNAHTRVKCRGEAVGASAALSTGDRE-R 133

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP----------PPAS-- 166
            RFL+A   L  +   + +       T    D      ++SLCP          P AS  
Sbjct: 134 FRFLVACAELRAMKTMRAE-------TQAREDDRGFDDVASLCPREDDDGGGRWPIASHV 186

Query: 167 -----------------------------------------TTIELTAALLAKDRLNAFG 185
                                                     TI   A LLAK+  NAFG
Sbjct: 187 DAATRLHPLLFATMHDADIERGVLKVNGASLSSDDFERMYVGTIRENAMLLAKESKNAFG 246

Query: 186 LMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---- 235
           +M      +         RSVR   +Y  AS  NH C PN  RFD  D   ++ +     
Sbjct: 247 VMAAAAADERDENDFGAARSVRGGALYRDASRVNHGCFPNVARFDNFDGDFQDPSKAFSS 306

Query: 236 ------------IIVRMIHDVPQGREICL------------------------------- 252
                       + +  I  +P G E+ +                               
Sbjct: 307 PSSLSALCEPSTLRLLAIDVIPAGAEVLMRTFSPGVSLRPGSLAFNPRPRRLSTPKSDAF 366

Query: 253 -------------SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEE 299
                        SY PVN     R+ RL   + FACDC RC +E  W+  D +   +EE
Sbjct: 367 QLHPDVASYGTTLSYLPVNEPVGARRARLRRGFGFACDCARCALEVRWALEDGERTGDEE 426

Query: 300 VMDEDQD-------------------------EQMVASDDDAEVHGDTNFPHAYFFMRYM 334
             +   D                         E+     D A V       +A +FM+ M
Sbjct: 427 PTETYDDATTAADASEAAEEARDEATAAALTEEEQTRRADRAPVE------YALWFMKNM 480

Query: 335 CDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKND 369
           C  ++C GTLAP  P+    +T   CN CG ++ D
Sbjct: 481 CAEEDCGGTLAPPNPT----ATHCACNYCGRVRTD 511


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 109/291 (37%), Gaps = 48/291 (16%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M+       V E E +GRGL + + LK+G  +L + P++    L  +     C  C  K 
Sbjct: 1   MAPKPSVFEVFETESKGRGLRAAKPLKSGDTILSEQPVVY--MLSNMLRGQRCDFCLEKL 58

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
             S    C  C  A +C   C  AA     S   C+ L R+     P    L + R  Q 
Sbjct: 59  --SDLQRCSRCKFARYCGASCQRAAWRIHKSE--CERLKRVFP-RVPTDLVLLMFRVWQL 113

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---------AHYLSSLCPPPASTTIE 170
           +     Y+  V N  +            D+D              YL S   PP      
Sbjct: 114 KSQNGWYDSLVSNVEKI-----------DSDAKEDFVSVLMVLNEYLGSEISPPEGLE-- 160

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
               L +K   N+F +        DG       GI+  A   NH C PN+        A 
Sbjct: 161 ----LFSKISCNSFAIC-------DGEMQAIGTGIFPNAVCLNHSCAPNS-------VAV 202

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            N T+I ++ + ++P G E+ +SY    +   TRQ+ L   + F C C RC
Sbjct: 203 FNGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQCHRC 253


>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
 gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
          Length = 503

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 112/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLLKSWGPNEPDVANSVIIRLL---CVARALNLS---KEQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     IE+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKIIEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +  +      +Y      NHDC+PNA  + + D       ++IVR   D+P+G E+  +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNA-YYTFED----KTNNMIVRAAVDIPEGFEVTTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 302

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/317 (22%), Positives = 125/317 (39%), Gaps = 55/317 (17%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRD--SPILLYSALPFINSSSFCHNCFRKT 60
           ++ E + +A+ + RGR +V+T  +  G+++L +  +  + Y  L     S +C  C +  
Sbjct: 2   SLHEGLKIAQSDDRGRHVVATTDIARGELLLSEECTVAMTYRDL----ESPYCSQCVKPM 57

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPY-VCQALTRLLN------LNSPDAANLP 112
           +  +   C  C +  +CS  C+ AA  S H+    C      ++      +   D +   
Sbjct: 58  VDETPVECEGCFLERYCSKSCAEAAEGSHHARTGECATFATAVDAEAATLIEGTDHSEFE 117

Query: 113 LDRQLQARFLIAAYNLA---------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP 163
            D   Q RF+I     A         +   +  + LL  Q    + D+    +LS +   
Sbjct: 118 FDDAPQ-RFIIRVLCQAGGWRERGGGITGEATLRRLLKLQSHEPEVDSDEREWLSGI--- 173

Query: 164 PASTTIELT-------AALLAKDRLNAFGLME-------------PYIEGQDGRRSVRAY 203
            A  T+ L          +  +     +G+ E                  ++        
Sbjct: 174 -ARNTLRLMEQDVDEGVKVFGEKDAPGYGVEELVRLMCCVRCNSHTLYASEEWPSEPVGT 232

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            +YLK S FNH CLP+A  ++         T + VR + D+  G E+ +SY PV      
Sbjct: 233 AVYLKGSAFNHSCLPSAEFYN-------EGTSLRVRSVRDISAGEEVTISYVPVTETLWD 285

Query: 264 RQKRLLYDYDFACDCDR 280
           R++ L   Y F C+CDR
Sbjct: 286 RRQALWRQYKFDCECDR 302


>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
 gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
          Length = 500

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 110/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     IE+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +  +      +Y      NHDC+PNA          E   ++IVR   D+P+G E+  +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNA-----YYTFEEKTNNMIVRAAVDIPEGFEVTTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
 gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
          Length = 497

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 74/268 (27%), Positives = 111/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     IE+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +  +      +Y      NHDC+PNA    Y      NN  +IVR   D+P+G E+  +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
          Length = 500

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 74/268 (27%), Positives = 111/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVVIRLL---CVARAINLS---KEQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     IE+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +  +      +Y      NHDC+PNA    Y      NN  +IVR   D+P+G E+  +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
 gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
          Length = 497

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 74/268 (27%), Positives = 110/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLMGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL      Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLT---KEQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     IE+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +  +      +Y      NHDC+PNA    Y      NN  +IVR   D+P+G E+  +
Sbjct: 198 TNDNQEFNYRALYPLFGVVNHDCIPNAY---YTFEEKTNN--MIVRAAVDIPEGFEVTTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|340513811|gb|EGR44092.1| predicted protein [Trichoderma reesei QM6a]
          Length = 549

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 74/311 (23%), Positives = 122/311 (39%), Gaps = 55/311 (17%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTM 61
           A S ++  + I   G GL + + ++ G+++    P++L + +   + +  FC    R+  
Sbjct: 49  AGSLSIKQSPILNAGSGLYTNRDVEVGELIFTSVPLVLCAEVGDGMEACDFCFQQRRRVF 108

Query: 62  HSSSS-------------ICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
           H S S             +C  C +  +CS  C   A  + H  Y C  L+         
Sbjct: 109 HPSESRFLGPGEMMPPLHVCNGCHMYQYCSKSCWQRAWDTGHL-YECGLLS--------- 158

Query: 108 AANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
            A   L+ +   R LI      V+ P Q Q L   +  + + +  A      +       
Sbjct: 159 GATTDLETRTLYRLLILMRK-KVLLPEQVQGLSRLESEMANFERRAKKTWPKV------- 210

Query: 168 TIELTAALLAKDRLNA-FGLMEPYI-----------EGQDGRRSVRAYGIYLKASFFNHD 215
              L  AL AK+R  +  G+ E  I             Q  R +     + L  +  NHD
Sbjct: 211 ---LELALEAKERTKSELGIYEILILYGIIRCNALPVDQTYRNAPLGIALDLGGALLNHD 267

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
           C PN        A   NNT + VR +  +  G E+   Y  + YD++ R  R+   Y F 
Sbjct: 268 CDPNV-------AIVFNNTQVQVRALRKLKAGEELLHCYRDIAYDFTFRNPRITSRYQFR 320

Query: 276 CDCDRCKVEAN 286
           C C+RC+ E++
Sbjct: 321 CHCERCRQESD 331


>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
 gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
          Length = 496

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 74/270 (27%), Positives = 112/270 (41%), Gaps = 28/270 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+V+T+SLK G+I+ RDSP+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       L +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKSDC--DLFKSWGPNEPDVANSVIIRLL---CVARAINLS---KEQ 140

Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
             ++   Q  + +N  +    AA    +   P     IE+    +A  R N F   +   
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLIEIMNRTVAVLRTNGF---DKTT 195

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           +  +  +      +Y      NHDC+PNA          E   ++IVR   D+P+G E+ 
Sbjct: 196 DRTNDNQEFNYRALYPLFGVVNHDCIPNA-----YYTFEEKTNNMIVRAAVDIPEGFEVT 250

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +Y  +      R   L     F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKSFTCKCSRC 280


>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
 gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
          Length = 311

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/322 (27%), Positives = 133/322 (41%), Gaps = 62/322 (19%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSS 64
           + VT   +EG+G  L +T+ L  G  +LR +P+    A+P+    +  C  CF+      
Sbjct: 11  DAVTTDRVEGKGEALRATRDLPKGARLLRVAPLF---AVPYAAELTRLCGGCFQP----R 63

Query: 65  SSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL------LNLNSPDAANLPLDRQLQ 118
            ++C SC  A    +C   AA + H    C AL RL      L L   D   L     ++
Sbjct: 64  GAVCASCGSARLCSRCGAGAAGTLHG-LECHALARLRDGEEGLTLAHSDLRLLLRALAVR 122

Query: 119 A--RFLIAAYNLAVVNPSQFQILL-----AFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
           +  R + A  + A +  ++   ++     A +G ++  D      L    P  A  TI  
Sbjct: 123 SMRRAVDAGGDPAAIAAAEDGDVIVDGYDALEGLMSGVDGGDDGEL----PHDAVATIAE 178

Query: 172 TAA-------------------LLAKDRLNAFGLM--EPYIEGQD-----GRRSVRAYGI 205
            A                     L + +LN F +   EP  EG D     G R V   G+
Sbjct: 179 VAKQARFLLAASCRCSMDECVRTLGRLQLNGFEMTASEPE-EGADEAEGGGHRPV-GVGV 236

Query: 206 YLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +  AS+ NH C PN A RFD        +  I+V    DV  G E+ + Y  V      R
Sbjct: 237 FPSASYTNHSCAPNCAQRFD-------GHGCIVVETARDVRGGEELTIPYVDVRLGRRER 289

Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
           ++RL  ++ F C C+RC  EA+
Sbjct: 290 RERLRKNFAFDCACERCAAEAD 311


>gi|440801098|gb|ELR22123.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 397

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/321 (23%), Positives = 118/321 (36%), Gaps = 55/321 (17%)

Query: 12  EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPS 70
           + E  GRG+ + +  KAG ++L +SP+ +      +     C  C RK   S++++ CP 
Sbjct: 71  QAETMGRGMFALRDFKAGDVILEESPLAIAPRDCVVAQMLHCSQCLRKLAASTAAVACPH 130

Query: 71  CSV-AFCSPKCSTAAASSSHSPYVCQA-----------------------------LTRL 100
           C +  +CS +C+ A A  SH   +C+A                             + RL
Sbjct: 131 CRLDPYCSDRCA-AEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARL 189

Query: 101 LNLNSPDAANLPLDRQLQARF--------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
           +   S D A L  D  L+            +  +  A + P  F    +    +      
Sbjct: 190 IAQLSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPEFSMLDYIFGQTNL 249

Query: 153 AAHYLSS-------LCPPPASTTIELTAALLAKDRLNAFGLM---EPYIEGQDGRRSVRA 202
           A+H LS+       L P            L      N FGL    E   EG    R    
Sbjct: 250 ASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYTDRREDG 309

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
             ++L AS+FNH C PN    D  +   +     I     D+ +G ++ + Y        
Sbjct: 310 VALFLHASYFNHSCTPNV---DRCNRHGDKRVAFIA--CADIKKGEQLFIEYVDTRAMVD 364

Query: 263 TRQKRLLYDYDFACDCDRCKV 283
            R++ L   Y F C C +C V
Sbjct: 365 ERRQELAQRYGFLCSCPKCSV 385


>gi|440793165|gb|ELR14357.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 387

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/321 (23%), Positives = 117/321 (36%), Gaps = 55/321 (17%)

Query: 12  EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPS 70
           + E  GRG+ + +  KAG ++L +SP+        +     C  C RK   S++++ CP 
Sbjct: 62  QAETMGRGMFALRDFKAGDVILEESPLAFAPRDCVVAQMLHCSQCLRKLAASTAAVTCPH 121

Query: 71  CSV-AFCSPKCSTAAASSSHSPYVCQA-----------------------------LTRL 100
           C +  +CS +C+ A A  SH   +C+A                             + RL
Sbjct: 122 CRLDPYCSDRCA-AEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARL 180

Query: 101 LNLNSPDAANLPLDRQLQARF--------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
           +   S D A L  D  L+            +  +  A + P  F    +    +      
Sbjct: 181 IAQLSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPEFSMLDYIFGQTNL 240

Query: 153 AAHYLSS-------LCPPPASTTIELTAALLAKDRLNAFGLM---EPYIEGQDGRRSVRA 202
           A+H LS+       L P            L      N FGL    E   EG    R    
Sbjct: 241 ASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYTDRREDG 300

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
             ++L AS+FNH C PN    D  +   +     I     D+ +G ++ + Y        
Sbjct: 301 VALFLHASYFNHSCTPNV---DRCNRHGDKRVAFIA--CADIKKGEQLFIEYVDTRAPVD 355

Query: 263 TRQKRLLYDYDFACDCDRCKV 283
            R++ L   Y F C C +C V
Sbjct: 356 ERRQELAQRYGFLCSCPKCSV 376


>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
 gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
          Length = 503

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 106/267 (39%), Gaps = 22/267 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GRG+V+T+ LK G+I+ RDSP+L+  A    +S + C  C      +       C +   
Sbjct: 34  GRGVVATRHLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCMILLPDTRFMCRQGCGL--- 90

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P CS  A    H       L R    N P+ AN  + R L    +  A NL   N  Q 
Sbjct: 91  -PVCSLCAKKKQHKTDC--DLFRSWGPNEPEVANSVIIRLL---CVARAINL---NKDQR 141

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
            ++   Q  + +N  +     +      P     IE+    +A  R N F   +   +  
Sbjct: 142 DLIYCLQANLDNNHRTEVRNAAKCFKKFPTDKKVIEIMNRTVAVIRTNGF---DKTTDRT 198

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           +  +      +Y      NHDC+PN+          E   ++IVR   D+P+G EI  +Y
Sbjct: 199 NDNQEFNYRALYPLFGVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDIPEGFEITTTY 253

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +      R   L    +F C C RC
Sbjct: 254 TKLFTGNIARHLFLKMKKNFTCKCPRC 280


>gi|224088154|ref|XP_002308346.1| SET domain protein [Populus trichocarpa]
 gi|222854322|gb|EEE91869.1| SET domain protein [Populus trichocarpa]
          Length = 283

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 125/291 (42%), Gaps = 32/291 (10%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + VA  E  GRG+ ST+ + AG ++    PIL + +L  IN+   C+ C +K    +S+ 
Sbjct: 4   IRVALTESAGRGVFSTRKICAGDLIHTAKPILAHPSLSTINT--VCYFCLKKL---TSTE 58

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIA 124
                VAFCS +C   A       Y  +     L  +          PL  +  A  +I+
Sbjct: 59  FHGKGVAFCSQECKENAKVF----YEVETKADWLAFDDYCRNKGLKYPLLVKRLACMVIS 114

Query: 125 ----AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAALLAKD 179
               A +L ++ PS     +  +         +   ++++     A  T E   ++LA+ 
Sbjct: 115 GAASAESLDILQPSNLSHEMVLEMEEGYGLLKSGFAMANISDEQMAFLTKEWYNSVLARI 174

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAA 230
           R+NAF +       +D   S  A           +Y+  S +NHDC PNA    ++D   
Sbjct: 175 RINAFRIELAVGSYEDLLSSAAACIEAEAAVGNAVYMLPSLYNHDCDPNA-HIVWID--- 230

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             N D  ++ + DV +  E+ + Y   + D+  RQ  LL  + F C+C RC
Sbjct: 231 --NADAQLKALCDVEEDEELRICYIDASMDHDARQSFLLQGFGFKCNCLRC 279


>gi|317038856|ref|XP_001402319.2| set and mynd domain containing protein [Aspergillus niger CBS
           513.88]
 gi|350631791|gb|EHA20161.1| hypothetical protein ASPNIDRAFT_195107 [Aspergillus niger ATCC
           1015]
          Length = 357

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 126/305 (41%), Gaps = 44/305 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSAL--PFINSSSFCHN--CFRKTMHSSSSI-CPS 70
           +GR L ++Q ++ G++++ D P  L   +  P  + S  C N  C R+T HSS  I CP+
Sbjct: 45  KGRQLRASQPIRKGELLMIDVPYALIPVVDDPASSDSLLCSNPTCSRQTQHSSGRISCPN 104

Query: 71  ---CSVAFCSPKCSTAAASSSHSPYVCQALTRLLN-----LNSPDAANLPL------DRQ 116
                V +CS  C  A        + C  L R  +         D   + L       RQ
Sbjct: 105 RCLADVVWCSSTCQEA--DQLRHEFECTWLQRYASPIRAKWGEYDFGMMWLIVRILASRQ 162

Query: 117 LQARFLIAAYNLAVVN-PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPAST 167
            + + L+ A N         ++ + +F G++     +           YL S    P   
Sbjct: 163 TERQQLLDANNETTQRFKGGWEAIQSFCGSIESWSHAQVRSWTALVKKYLRSSPILPHDL 222

Query: 168 TIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
           +++   AL+ ++  N+FGL      + P  E    R       +Y +A+  NH C PN  
Sbjct: 223 SVDEIVALICREEANSFGLYPRETGVYPVPEPPVDRGEQFGAAVYPRAAIANHSCSPN-- 280

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDFACDCD 279
               +    +++  ++     D+  G E C+SYF ++   D  +R+  L   + F C CD
Sbjct: 281 ----IMHKPDHHGRMVFTASKDIAAGEECCISYFDLSKRVDLKSRRDHLQGLFRFVCGCD 336

Query: 280 RCKVE 284
           RC  E
Sbjct: 337 RCTAE 341


>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 434

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 115/303 (37%), Gaps = 49/303 (16%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+G GL +T  ++ G++V    P  L   +    S + CH CF  T H +   C  C +A
Sbjct: 12  GKGNGLRATVRIEEGELVYVTEP--LAYCVSQKQSRNVCHQCF--TRHETLLRCSQCKMA 67

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVVN 132
            +CS  C   A S       C+ L  LL         +P D  +L AR + A  +    +
Sbjct: 68  RYCSATCQRRAWSDHKRE--CKCLQSLL-------PRIPTDSVRLAARLIFAMLSSCSSS 118

Query: 133 PSQFQILLAFQGTVTDNDTSAAHYLSSLCP----------------PPASTTIELTAALL 176
             +   L   +  +T         LS L                   P+ ++     +L+
Sbjct: 119 SEELYTLEEHESHLTSLSEQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSCRDALSLI 178

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           AK   N F +        DG       G+Y   S  NHDC PN C   +        T +
Sbjct: 179 AKVTCNCFTI-------SDGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------EGTKL 224

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           ++R +  +  G E+ +SY         RQ+RL   Y F C C  C    N  DND    +
Sbjct: 225 LLRAVRGLSPGEELTISYIETLSLNEDRQQRLEDQYCFTCHCQCC----NSPDNDKLMLS 280

Query: 297 NEE 299
            EE
Sbjct: 281 GEE 283


>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 471

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 120/285 (42%), Gaps = 39/285 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
           +GRGL +T+ +  G++V  +     YSA+ F + +S  CH+CFR    +    C  C  A
Sbjct: 17  KGRGLRATKEISTGEVVFAEPS---YSAVVFDSFASQVCHSCFRHQ--AQLHRCGQCKFA 71

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAY-NLAVV 131
            +C+  C TA          CQA+ +L          +P D+ +L AR +   + +  V 
Sbjct: 72  HYCNRTCQTACWEEHKQE--CQAIRKL--------GTVPGDKVRLAARVMWRIHKDTGVA 121

Query: 132 NPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
           + SQ   +   +  V D         DT    +L          +++  A +    + N 
Sbjct: 122 SDSQLLSVEELEDHVADLPEDHLKRIDTDVHVFLQYWSCGRTKHSLDEIAHIFGIIKCNG 181

Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNTDII 237
           F L +     Q G ++V   G++      NHDC PN             ++A  +   I 
Sbjct: 182 FTLSD-----QRGLKAV-GVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSALHSKRRIE 235

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +R +  + +G E+ +SY       + RQ++L   + F C C+ C+
Sbjct: 236 LRALRKICEGEELTVSYVDFLDTSAERQRKLKEHFYFECTCEHCR 280


>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
          Length = 431

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 12/112 (10%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA----AA 230
           LLA   LN+F L      G      +  YG YL+ +F NH C PNAC+  Y+D     A 
Sbjct: 151 LLAVIHLNSFALSSERFPG------MTTYGFYLRMAFCNHSCRPNACQ--YIDPNSTRAR 202

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            N+  I++R + D+ +G E+C+SY  +      R++ L   Y F C C RC+
Sbjct: 203 LNSPSIVLRAVSDIAEGEEVCISYIELMDTTPERREALQELYYFTCQCPRCE 254


>gi|358394896|gb|EHK44289.1| hypothetical protein TRIATDRAFT_79382 [Trichoderma atroviride IMI
           206040]
          Length = 742

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 60/113 (53%), Gaps = 16/113 (14%)

Query: 181 LNAFGLMEPYIEGQDGRR-SVR------AYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
           LN FG   P ++ +D ++   R      + GI+L+AS+ NH CLPNA R    D      
Sbjct: 474 LNGFGC--PRVKSRDKKQPKARNLGFNDSTGIWLQASYANHSCLPNATRAFIGDM----- 526

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             +IVR + ++P G EI + Y P +  +  RQ+ +   Y F CDCD C+ EA 
Sbjct: 527 --MIVRALREIPVGGEILMQYVPQDKPFDKRQEVVENHYGFKCDCDLCRAEAK 577


>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
           intestinalis]
          Length = 430

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 73/276 (26%), Positives = 107/276 (38%), Gaps = 28/276 (10%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYS-ALPFINSSSFCHNCFRKTMHSSSSICPSCS 72
           E  GRGL + ++L  G  VL   P   Y+  L       +C  CF+K        C  C 
Sbjct: 11  EKSGRGLKAKRNLNPGSTVLSSEP---YAYLLSKKQKGVYCDFCFKK--QDGLLQCSGCK 65

Query: 73  -VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
            + +C+  C   A +  H    C AL  ++    PD   L L R L      ++  L   
Sbjct: 66  YMKYCNRNCQKMAWNEHHKAE-CPALKNVMPKRPPDFVIL-LGRLLWNMQQYSSAKLPEK 123

Query: 132 N-----PSQFQILLAFQGTVTDNDTSAAHYLSSLCP-PPASTTIELTAALLAKDRLNAFG 185
           N      S +  L   Q     N     H   S  P PP  T  ++   L A+ + N+F 
Sbjct: 124 NSILDLESNYNKLSQNQKEALMNFLVILHTFWSPKPLPPQVTDNKMLLELCARIKNNSFA 183

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +    ++   G       G+YL  SF NH C PN         A  N   + +R + ++ 
Sbjct: 184 ICNEELQSDVGT------GVYLNCSFINHSCEPNC-------VAEFNMRTLKIRAVKNIT 230

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            G E+ +SY  +      RQ+ L+  Y F C C  C
Sbjct: 231 AGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSC 266


>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
          Length = 456

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/292 (27%), Positives = 118/292 (40%), Gaps = 25/292 (8%)

Query: 10  VAEIEGRGRGLVST-QSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
           V + + R R +V+  Q++K GQ++LR S  L  S LP  +    C  C R+T   +   C
Sbjct: 9   VPDSQARSRAVVAPGQTVKPGQVILRSSS-LATSLLP-KSRGQRCDECCRQT---TVKAC 63

Query: 69  PSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
             C  AF C  +C +AA  S H    C  L+       P  +  P  +Q     LI  Y 
Sbjct: 64  SRCKEAFYCDTRCQSAAWKSHHR-TTCALLSNGYRAKHPYISQ-PEGKQADLDLLITLYG 121

Query: 128 L-AVVNPSQFQ---ILLAFQGTVTDND-----TSAAHYLSSLCPPPASTTIELTAALLAK 178
           L A   PS F+     +  + T+   +     ++   +   + PP   TT   T A   K
Sbjct: 122 LVATSQPSMFEQSNWAVRREPTLLPEEPLECFSTLLPHSGQVGPPYIPTTFGKTQAEFLK 181

Query: 179 DRLNAF---GLMEPYIEGQDGRRSVRAYGIYLKASF-FNHDCLPNACRFDYVDAAAENNT 234
           +    F     +   I          A+GI+  AS  FNH C PNA     ++   +   
Sbjct: 182 EAWARFENNNFVLHNISSMVPNSGAYAHGIFPHASRGFNHSCSPNAWPAFVLE---QRQA 238

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            + +R +  + +  EI + Y         RQ RL   Y F C C RC +E  
Sbjct: 239 WLEIRALISIKESEEITIPYLDPALSLPERQARLKATYGFDCTCSRCDLEKK 290


>gi|302672815|ref|XP_003026095.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
 gi|300099775|gb|EFI91192.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
          Length = 528

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 10/121 (8%)

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
           P  +T  L  A+LA+D  NAFGL +     + G   +  + +Y+  S+FNHDC PN  + 
Sbjct: 398 PYLSTPHLLRAILARDHANAFGLFD---MQETGESEMLGWAVYVAGSYFNHDCAPNVRKL 454

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
                       +      DV  G E+C+SY       ++R  +    ++F C C RC+ 
Sbjct: 455 -------RRGRALQFVTTRDVAPGEELCISYVDTQDTKASRAAQFAQHWNFVCGCGRCRG 507

Query: 284 E 284
           E
Sbjct: 508 E 508


>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
          Length = 497

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/282 (25%), Positives = 114/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267


>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
 gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
          Length = 492

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 120/289 (41%), Gaps = 42/289 (14%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           VA+ E  GR +V+++++K GQ++L+++P+LL    P +     C NC  +        CP
Sbjct: 10  VAKNEIFGRFIVASKNIKQGQLILKENPVLL---CPQVGGPIICFNCCAQL--KKLFFCP 64

Query: 70  SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL-----NLNSPDAANLPLDRQLQARFLI 123
            C +A  C+P C     +  H+   C AL  L       L++P+    PL   L  +F  
Sbjct: 65  DCRIAVLCNPNCK----NQFHNLQECTALKSLPIKHDDVLSNPEIVT-PLRCLLWRQF-- 117

Query: 124 AAYNLAVVNPSQFQILLAFQGTVT---------DNDTSAAHYLSSL-CPPPASTTIELTA 173
                   N +Q +  L  +  +           N  +    L  L     +    E+  
Sbjct: 118 --------NKNQLENFLQLEAHLNFRRDSQIWRRNRVNIEDVLFKLKILDESDLKEEVVQ 169

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            +     +N F + +P        ++    G+YL+A+   HDC+ N         A +++
Sbjct: 170 KICGILDINTFDVRQPQRNRLGFNQAENLRGLYLRAALMAHDCVANT------HLAVDDD 223

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             + V    D+P+G  I  +Y  V      R++RLL    F C C RC+
Sbjct: 224 FVLYVHAAVDIPEGSPIYFNYTDVLQGNDERKRRLLNAKHFECQCSRCR 272


>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
          Length = 583

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/315 (22%), Positives = 121/315 (38%), Gaps = 39/315 (12%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSAL-PFINSSSFCHNCFRKTMHS 63
           S ++  + I   G G+ +T+ ++ G+++    P++  + + P + +  FC    R+  H 
Sbjct: 85  SLSIQESTIPNAGSGMFTTRDVEEGELIFTSVPLVTCAEVGPGMEACDFCFQQRRRVFHP 144

Query: 64  SS-------------SICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA 109
                           IC  C +  +CS  CS  A  + H  Y C     LL   S D  
Sbjct: 145 VEDRFLQPGEVLPPLHICNGCRLYQYCSQSCSQRAWDTGHL-YECG----LLAGASADVE 199

Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
              L R      L+      V+ P Q + L   +  V + +         +         
Sbjct: 200 TRTLYR------LLILMRKKVLLPQQVKALARLENEVANFEKRTKKSWPRVLNLAREAKE 253

Query: 170 ELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
              + L   + L  +G++        Q  R +     + +  +  NH C PN        
Sbjct: 254 RTKSELSIGEVLMLYGIVRCNSLPVDQTFRNAPLGIALDMGGALLNHCCDPNV------- 306

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
               N+T + VR +  +  G E+   Y  + YD++ R  R+   Y F C CDRCK+E   
Sbjct: 307 VIVFNSTQVQVRALRKIKDGEELLHCYRDIAYDFTFRNPRITARYQFNCQCDRCKME--- 363

Query: 288 SDND-NDDENNEEVM 301
           SDN     ++NE V+
Sbjct: 364 SDNHFKQAKSNENVI 378


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 120/290 (41%), Gaps = 43/290 (14%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           + +T+TV      GR L++ + LKAG ++L++ PI++    P   +   C  C+      
Sbjct: 46  IEQTLTV------GRHLIACRDLKAGDVILQEKPIVMG---PKHTAGQICLGCYSGV--D 94

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
             + C  C    C  +    A  S H+   C  +    +   P   +LP+ +  Q+  ++
Sbjct: 95  GRTRCSQCGWPMCG-RDDCHAHESDHAAE-CGVMA---SGGRPIVGSLPV-QAYQSVMVL 148

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTD---------NDTSAAHYLSSLC--PPPASTTIELT 172
               L   NP ++  L+  +  V +         +  +A  ++        P    ++L 
Sbjct: 149 RCLALRDQNPERWDELMQLEAHVQERRQKGMEDVDQATAVRFIRETLGLQIPEELILQLC 208

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
             L+    +N+F   +P ++G      V    +Y  AS   HDC+ NA +         N
Sbjct: 209 GILM----VNSFE--QPPMKGNSQHGLV---AVYSTASLLEHDCVANAIK------TFTN 253

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             DI++R    +P+G +I L Y    +    RQ+ L     F C C+RCK
Sbjct: 254 KGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLSQTKFFQCVCERCK 303


>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
          Length = 410

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/323 (22%), Positives = 130/323 (40%), Gaps = 43/323 (13%)

Query: 1   MSAVSETVTVAEIEG-RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNC 56
             AVS+ V    ++   G+GL +++ +K  +I+  + P +    +       S   C  C
Sbjct: 88  FKAVSDCVVARMVDPVTGKGLFASRDIKQDEILFTEEPYVYFPPWEGFSLARSGHVCGLC 147

Query: 57  FRKTMHSS--SSICPSCSVAFCSPKCSTAAASSSHSPYVCQ---ALTRLLNLNSPDAANL 111
            +  ++++  +  C  C + +CS  C   A    H     +   A+T  +N    +    
Sbjct: 148 CKPILYANRLTQHCKHCDMYYCSKSCRATAWDQFHQLECTRLNPAITPFMNFCEGERWQA 207

Query: 112 PLD-RQLQARFLIAAYN------------LAVVNPSQFQ-----ILLAFQGTVTDNDTSA 153
           P+   ++ AR ++A                A V+  + Q      +   Q T    + + 
Sbjct: 208 PMAVARIYARLILAHQREELNDVIGRLDAFATVSQEERQAKETEWIFMEQPTRELWNKAR 267

Query: 154 AHYLSSLCPPPASTTI--ELTAAL---LAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYL 207
                +   PPA   I  +L  AL   L +D       M  + I  Q+G       G+YL
Sbjct: 268 DLLRKAYKQPPAKCKITQDLPEALKEKLFEDEETFLNFMGKFNINNQNG-------GMYL 320

Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
             S  NH+C PN    DY +  ++    I VR + D+ QG ++  +Y    ++  TRQ  
Sbjct: 321 VHSHINHNCYPNVS-IDYPNPKSQYK--IAVRAVRDIKQGEQLFETYVNPRWNKETRQTY 377

Query: 268 LLYDYDFACDCDRCKVEANWSDN 290
           L   Y F C C+RC+ +   +D 
Sbjct: 378 LDKSYLFNCKCERCEKDLELTDE 400


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 108/285 (37%), Gaps = 43/285 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+G GL +   ++ G++V+   P  L   +    +   CH+CF  T H S   C  C +A
Sbjct: 13  GKGNGLRAAGRIRKGELVISAEP--LACCVSNRLAKEVCHHCF--TRHESLLRCSQCKMA 68

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVVN 132
            +C+  C   A S       C+ L  LL         LP D  +L A+ + A  + +  +
Sbjct: 69  RYCNTTCQKQAWSGHKRE--CKCLQSLL-------PRLPTDSVRLAAKLIFALLSPSKSS 119

Query: 133 PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG------L 186
             +   L   +  +T         LS L     ++ +EL     A D L   G       
Sbjct: 120 SEELYTLEEHESHLTSMPEQKKQGLSQL-----ASMLELYLQQEAPDLLQEMGSALPPSC 174

Query: 187 MEPY----------IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
            EP               DG       G+Y   S  NHDC PN C   +        T +
Sbjct: 175 REPLSLIAKVTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------EGTKL 227

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +R + D+    E+ +SY         RQ++L   Y F C C RC
Sbjct: 228 ELRAVRDIDPEDELTISYIETLSLTEDRQRQLEEQYHFTCHCQRC 272


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 123/318 (38%), Gaps = 65/318 (20%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--SSICPSC 71
           +G GR +++++ L  G +VLR SP    S L        C  CF+K   S   +  C +C
Sbjct: 219 DGFGRCVLASEDLPKGTMVLRVSPFA--SVLEDHKIEKNCGFCFKKINKSIRINQTCKNC 276

Query: 72  SVAFCSPKCSTAAASSSHSPYVCQAL-----------TR--------LLNLNSPDAANLP 112
                 P+CS    S ++    C  L           TR        LLN+   D  N  
Sbjct: 277 KNHLLCPQCSVDEYSLNYHKDECDILNFLKQYYPSSQTRDFRFMFRVLLNV-IKDKKNKS 335

Query: 113 LDRQLQAR--------FLIAAYNLAV--------VNPSQFQILLAFQGTVTD-----NDT 151
             ++ Q++        F+  +Y   +        V P Q +   AF+ +        N  
Sbjct: 336 FSKENQSKQWLNHQNPFIFDSYKYLINLSRTLDKVQPEQME---AFKRSAQSVIAIFNKL 392

Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
               +    C       IE+ + +L+    N   ++ P              GI+   S+
Sbjct: 393 RGPKFFDE-CGVTIDEIIEIYSIVLS----NGHEMLHPL------NCHTYGLGIFPTGSY 441

Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYD 271
            NH CLPNA  F Y D        ++ R +  + +G EI  SY  +  + S R+K LL  
Sbjct: 442 LNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQ 495

Query: 272 YDFACDCDRCKVEANWSD 289
           Y F C C +CK ++  +D
Sbjct: 496 YFFFCQCQQCKFQSKLTD 513


>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 420

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 127/316 (40%), Gaps = 47/316 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           MS + E   ++E    G+G+ S +  +  +I+  ++P++  ++  + N  + C  C +  
Sbjct: 114 MSTIFEVKEISET--LGKGVFSKKEHQVNEIIYTETPVI--ASAKYKNYKNCCLYCLKAL 169

Query: 61  MHSSSSICPS-------------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
             + S+   S             C   +CS KC            VC  L ++++  +  
Sbjct: 170 FPNESNTYTSVFNTNVVKGVKCKCGYVYCSKKCQDLDVGHQ---LVCGKLNKVVSFCTER 226

Query: 108 AANLPLD-RQLQARFLIAAYNLAVVNPS---QFQILLAFQGTV-------------TDND 150
             + PL   ++ A+ L A Y    + P      Q L+ F  TV             T  D
Sbjct: 227 CVSHPLTITKMFAKVLSATYREPTIYPFVVFHAQPLVKFDMTVEKIFVEIFGNKLETFQD 286

Query: 151 TSAAHY----LSSLCPPPASTTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGI 205
                Y    +SS+    AST + L A  +LA D        E  +  +  + SV   G+
Sbjct: 287 FGGWKYVYRIMSSVLKYNASTILPLNAIQMLATDPQKKVLSKEEALNWEVSKFSVEGEGL 346

Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQ 265
           Y   +  NH C PN      V A   ++  + +  + D+  G E+ +SY   + +  TR 
Sbjct: 347 YKYLNTLNHSCDPNC-----VLACTTDDFKLSLIALKDIKAGEELTISYIDNSMNKETRL 401

Query: 266 KRLLYDYDFACDCDRC 281
           K L+  Y+F C C +C
Sbjct: 402 KTLMDQYNFDCKCKKC 417


>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
          Length = 421

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 82/307 (26%), Positives = 122/307 (39%), Gaps = 38/307 (12%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           RGR + +  +L AG + LR +P L    LP +   S CH CF     +  S C  C+ AF
Sbjct: 17  RGRRVETETALAAGDLALR-APALATVLLPAL-WDSHCHKCF--ASGTRLSRCGRCNTAF 72

Query: 76  -CSPKCSTAAASSSHSPYVCQALTRLLNL---NSPDAANLPLDRQLQARFLIAAYNLAVV 131
            CS  C  A     H    C+ L +L  L   N   A  L L R L+        +   +
Sbjct: 73  YCSKACQQADWKPDHRKE-CKVLAQLAQLGLRNDQTADVLLLGRVLRRE------DAEGL 125

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
            P +   L+ ++  + D +      L+  L     S +++    +L++ R N F + +  
Sbjct: 126 QPKE---LVWYEEDMEDQELMLLAALAQKLELVDGSYSMDEMLRMLSRFRNNNFSICDEL 182

Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           +  Q         G +   +  NH C PN C   +V    E    +  R +  +  G EI
Sbjct: 183 LLEQGA-------GCFPLGAMINHSCDPN-CAITFVPKTLE----MEFRAMRPIKAGEEI 230

Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-------DNDNDDENNEEVMDE 303
             +Y  V      R +RL   Y F C C RC V    S       D D D    E+   E
Sbjct: 231 TQTYVDVALPRRERHERLQRKYHFNCACSRCSVPLQESGSLDAFLDADIDGVPKEQWSQE 290

Query: 304 DQDEQMV 310
            +DEQ +
Sbjct: 291 RKDEQCI 297


>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
 gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
          Length = 496

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 110/269 (40%), Gaps = 26/269 (9%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GRGL++T+S+K G+I+ +D P+L+  +    +S + C  C R+   +       C +   
Sbjct: 34  GRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRELPDTRF----MCRIGCG 89

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P CS  +    H       L +    N P+ AN  + R L    +  A NL   N  Q 
Sbjct: 90  LPVCSLCSKKKQHKTDC--DLLKSWGPNEPEVANSVIIRLL---CVARALNL---NKDQR 141

Query: 137 QILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
            ++   Q  + +N      +AA    +   P      E+    +A  R N F   +   +
Sbjct: 142 DLIYCLQANLDNNHRVEVRNAAKCFKNF--PTDKKIFEIMNRTVAVLRTNGF---DKTTD 196

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
             +  +      +Y   +  NHDC+PN+    Y     + N  +IVR   D+P+G EI  
Sbjct: 197 RTNDNQEFNYRALYPLFAVMNHDCIPNS----YYTFEEKTNI-MIVRAAVDIPEGFEITT 251

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +Y  +      R   L     F C C RC
Sbjct: 252 TYTKLFTGNIARHLFLKIKRGFTCKCSRC 280


>gi|157125398|ref|XP_001654321.1| hypothetical protein AaeL_AAEL001920 [Aedes aegypti]
 gi|108882685|gb|EAT46910.1| AAEL001920-PA [Aedes aegypti]
          Length = 527

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 70/280 (25%), Positives = 112/280 (40%), Gaps = 36/280 (12%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
           GR LV+ ++LKAG+ +++  P+ +    P+  S   C  C+++    S  I C  C    
Sbjct: 56  GRYLVAARNLKAGEKIIQVEPLAMG---PWAASDPVCLGCYKRFAKGSKIIRCSGCGWRI 112

Query: 76  CSPKCSTAAASSSHSPYVCQALTR-----LLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
           CS  CS   A+ +H    C  L       LL  +S +   L      +A F +    L  
Sbjct: 113 CSHTCSGLTANGAHKRLECNPLKEHNVVNLLETSSTEQIKL----MYEAIFALRCLLLKK 168

Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
            +P ++  LL  +             +SSL        +     +  + R + F   E +
Sbjct: 169 TDPERYATLLEMESL-----NELRQKISSLWKRNQEMIV---GRIRDEWRFDEFSEGEIH 220

Query: 191 -----IEG---QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII-VRMI 241
                IE    Q G+  V A  +Y +A +  HDC PN    D      +  T ++ +R+ 
Sbjct: 221 TVCGIIEVNSFQIGQDDVYARALYPEAFYIMHDCTPNTTHTD------DPKTHVLSIRLT 274

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +D+  G  I LSY         R++ L     F C C RC
Sbjct: 275 NDLKAGDPITLSYSYTLQGTLKRRQHLFESKFFWCQCKRC 314


>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
 gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
          Length = 477

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 114/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CLAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCACEHCQ 267


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
           magnipapillata]
          Length = 1037

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 74/291 (25%), Positives = 111/291 (38%), Gaps = 45/291 (15%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS------SFCHNC 56
            V +   V E E +GRG+ + Q +K G  +L++ P+    A    NS        +C++ 
Sbjct: 2   VVFQVYEVFETEEKGRGVRALQDIKRGVEILKEEPL----ACILTNSKYRGIRCDYCYSE 57

Query: 57  FRKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
             K +      C  C  +A+C   C   A+      Y C+ LT+      PD   L    
Sbjct: 58  PEKLLK-----CSKCKFIAYCGKVCQ--ASDWKMHKYECKCLTKSAPKQPPDFCRLV--S 108

Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH-YLSSLCPPPASTTIEL--- 171
           QL   F     N  + N      L A +G +++    A   + + L        I +   
Sbjct: 109 QLIFNFYYNKKNTLINN------LYANKGNISNARKEAFFTFAAVLVEYLQDVNININDI 162

Query: 172 -TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
               L+ K   N+F +        +   +    GI+  AS FNH C PN         A 
Sbjct: 163 DIYGLMCKASCNSFAIT-------NAELNSLGTGIFSSASLFNHSCDPNC-------VAT 208

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            N  DI +R I  + +G E+ LSY  +      RQ  L   Y F C C  C
Sbjct: 209 FNGRDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFTCKCTVC 259


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 75/271 (27%), Positives = 122/271 (45%), Gaps = 30/271 (11%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           +GR L++++S   G  +L  +P +L  +L   ++S+ C +CF+     S S C +  VA+
Sbjct: 14  KGRCLIASRSFTPGDTILTFTPTILIPSLS--HASTVCSHCFKPGEPRSCSRCHA--VAY 69

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
           C   C +AA  + HS   C+ L ++     P    LP   +   + L+ A   A +   +
Sbjct: 70  CDASCQSAAWKAIHSKE-CKVLQKVSAQGRP---GLPTPVRAVVQALVKAEVGAALEDLE 125

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIEG 193
              L +++G+    D       +S      +T   ++   +LL K + NAF   +  I G
Sbjct: 126 GNDL-SWRGSDKWADMEMMAMGASAFAGLGTTQEAVKKALSLLCKIQTNAFHRYDADI-G 183

Query: 194 QDGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           Q         GI+L  + +  NH C+PNA    +V   A      I+R    +    EI 
Sbjct: 184 Q--------VGIFLEPRLAMANHSCIPNAT-VQFVGRRA------ILRAERPIKADEEIE 228

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +SY   NY  S R K  L  Y F C+C RC+
Sbjct: 229 ISYTDYNYPLSKR-KEALAPYFFTCECTRCR 258


>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
          Length = 661

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 132/330 (40%), Gaps = 53/330 (16%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSP---ILLYSALPFINSSSFCHNCFRK 59
           A SETV +A  E RGR   + + ++AG+I+L + P   +LL        S S C NCF K
Sbjct: 240 AASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEF-----SKSHCQNCFNK 294

Query: 60  TMHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
            +      CP C +V FCS KC   A  S H  Y C  L  L         ++ L    Q
Sbjct: 295 CLIPLP--CPKCPNVIFCSEKCLDIALKSYHG-YECHILPLLWKSGCSVTCHIALRMITQ 351

Query: 119 AR---FLIAAYNLAVVNPSQFQI--------LLAFQGTVTDND----TSAAHYLSSLCPP 163
                F+    +L       ++         L++ +   T  D    T  A +L  L   
Sbjct: 352 NSKDYFMKIMQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDILHRTEMAIFLLKLLEI 411

Query: 164 P------ASTTIELTAALLAKDRLNAFGLMEPY-IEGQDGRRSVRAYG--------IYLK 208
                  A++   L    L   + NA  + E   ++ +DG R ++  G        +Y  
Sbjct: 412 SGYFNDDAASFGCLILKNLQVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPT 471

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQK 266
            + FNH C P   R+          + I+V  + ++ +G E+  +Y P+        RQ 
Sbjct: 472 LALFNHSCEPGIVRYF-------CGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQS 524

Query: 267 RLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           +L   Y F C C  C  E NW   ++  EN
Sbjct: 525 QLKEQYWFDCKCLPC--EQNWPKYEDMTEN 552


>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
 gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
          Length = 492

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 111/268 (41%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+++T++LK G+I+ +DSP+L+  A    ++ + C  CF K +  +  +C   C +  
Sbjct: 34  GRGVMATRNLKRGEIIFQDSPLLIGLAAHEEDTLNACSVCF-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       + +    N PD AN  + R L    +  A NL+     Q
Sbjct: 91  --PVCSLCAKKKQHKTDC--DMFKAWGSNEPDVANSVIIRLL---CVARAINLS---KDQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     +E+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKIVEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
               +      +Y   +  NHDC+PN+    Y      NN  +IVR   D+ +G EI  +
Sbjct: 198 TTDNQEFNYRALYPLFAVMNHDCIPNSY---YTFEEKTNN--MIVRAAVDIAEGEEITTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLFLKMKKGFTCKCPRC 280


>gi|194747113|ref|XP_001955997.1| GF24983 [Drosophila ananassae]
 gi|190623279|gb|EDV38803.1| GF24983 [Drosophila ananassae]
          Length = 663

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 131/314 (41%), Gaps = 44/314 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  S+ V + E + +GR +V+++ +K G +VL + P+       F  ++  CH+CF++ 
Sbjct: 252 LSGASKVVKLVETKDKGRFVVASEGVKTGDVVLCEEPVAACLIPSFFGTN--CHHCFKR- 308

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H+  S C  CS VAFCS +C   A SS H  + C+ +  L+         + L    QA
Sbjct: 309 LHTPVS-CLHCSGVAFCSAQCMGEACSSYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 366

Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
             L   +A  NL   +              Q  ++  F   +           +    P 
Sbjct: 367 NSLEDGLATANLLFEHLCSHEEVRQADDYLQRSLMAGFLMRILQKSLYFGRRKTEGVNP- 425

Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
             T +EL  A     LL   + NA  + +  +  +   DG  +V    G+Y   S+FNH+
Sbjct: 426 --TAVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSNTVYVGAGLYGTGSYFNHE 483

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYD 273
           C P+        A       +++           + ++Y P  +  +   RQ+ L   Y 
Sbjct: 484 CWPSV-------AGHFVGKKLVLTATKPHRPNEVVAVNYGPLFIKMNLKERQRTLRGRYS 536

Query: 274 FACDCDRCKVEANW 287
           F+C+C  C  + NW
Sbjct: 537 FSCNCMAC--QENW 548


>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
          Length = 397

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 69/267 (25%), Positives = 108/267 (40%), Gaps = 27/267 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
           GRGLV+TQ +KAG+++  D P L+Y     I     C  C  K + S     C  C++  
Sbjct: 39  GRGLVATQDIKAGEVLFVDYP-LVYGPRSGIIVQRGCTVC--KNIDSDIFFKCSKCALIL 95

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL-PLDRQLQARFLIAAYNLAVVNPS 134
           CS +C  +   S      C  ++     + P+   +  +D  L +R L A   L ++N  
Sbjct: 96  CSVQCQNSDFHSGD----CSIIS-----HWPNKVPIEEVDDTLLSRALTAIRAL-LLNED 145

Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
           Q  +L + Q        S    L      P     E    ++     NAF +  PY    
Sbjct: 146 QKYLLTSLQANKLPQYGSEIRDLKQYFDIPLHEE-EFMILVICILNTNAFQMATPY---- 200

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
            G++ +   G+Y  AS  NH+C+PN             +  + V+    +  G EI   Y
Sbjct: 201 -GKKEMSLRGLYPVASILNHNCVPNT------RNCFNGDLQMTVKATKTINAGSEIFTCY 253

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +    R+  L     F CDC+RC
Sbjct: 254 SGMLWGTPARRLYLYKSKHFLCDCERC 280


>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
 gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
          Length = 343

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 69/307 (22%), Positives = 128/307 (41%), Gaps = 37/307 (12%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLK---------AGQIVLRDSPILLYSALPFINSSSFCHN 55
           ++ V + E + +GRGL  TQ+ +            I+ ++ P + Y ++  ++    C++
Sbjct: 32  NKLVEIKESDKKGRGLFYTQNTQLPVINKETNKPNILFKEEPFISYPSIIKLDHDKICNH 91

Query: 56  CFR-------KTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQA----LTRLLNLN 104
           C +          +++ + C  C+V +CS +C   A+   HS  +C++       L   +
Sbjct: 92  CLKSLENNNSNNNNNNHTECEGCNVKYCSNECKEKASLQYHS-VLCKSNGSGFHYLEKHS 150

Query: 105 SPDAANLPL-DRQLQARFLIAAYNLAVVNPSQFQI-LLAFQGT---VTDNDTSAAHYLSS 159
             +    PL   ++ AR L+  +     N + F + +L F      +   D       S 
Sbjct: 151 QIEKRRFPLLAGKILARVLMGYHLEKSANKTWFPLQMLTFANKPAPLEWKDDYLIFSRSL 210

Query: 160 LCPPPASTTIELTAALLAKDR----LNAFGL-MEPYIEGQDGRRSVRAYGIYLKASFFNH 214
           L     S+  +       K      LN  G+ ++P            + G+Y   SF NH
Sbjct: 211 LKGMKESSLKKFNYDWFVKVMQILYLNTIGIDIDPNQTTTKMSTPESSIGLYFLTSFINH 270

Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
            C PNA    YV    ++  +I  R++  +  G EI +SY   + D   R+ +L  +Y F
Sbjct: 271 SCDPNA----YVQFPNDHTAEI--RLLKPINPGEEITISYADTSKDIIDRRSQLFENYGF 324

Query: 275 ACDCDRC 281
            C+C +C
Sbjct: 325 NCECPKC 331


>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
 gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
          Length = 614

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 122/323 (37%), Gaps = 75/323 (23%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
           VA ++ RGR  V+T+++KAG++V+ +  I       +    + CH C      +     C
Sbjct: 68  VATVKNRGRCAVATRAIKAGEVVVAEQAIAFVPRGQY--RKAICHVCGVNVAENGPKFEC 125

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
            +C+      +C       SH  Y   A+ R       + A++  D    A  L   +NL
Sbjct: 126 SACNYTIFCEECYKPP-QGSHENYC--AVYR-------EIASIARDENCDADLLRLVFNL 175

Query: 129 AVVNPS---------QFQIL---LAFQ---------------------GTVTDNDTSAAH 155
           A+             Q ++L   L FQ                     G  T    + + 
Sbjct: 176 ALKKAGTAQARSSLLQLELLIETLPFQNSKDETCTVDGVIHSCFKDALGLETHRSKAPSS 235

Query: 156 YLSSL-----------C------PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
           +L ++           C      P P S +IE    L      N+ G+       Q+ + 
Sbjct: 236 WLEAVRNGCTRLHKIACVGNGGKPAPVSCSIEELEMLATLINTNSHGM-----GAQNLQN 290

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
           +  A G++   S  NH C PN C        A   + + VR + D+P+G E+CLSY  + 
Sbjct: 291 THVALGLFPYVSILNHSCRPNCC-------FASEGSVMYVRAVQDIPKGAELCLSYINLY 343

Query: 259 YDYSTRQKRLLYDYDFACDCDRC 281
                R+  L+    F C CDRC
Sbjct: 344 ESRRVRKTLLVATKHFDCTCDRC 366


>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
 gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
          Length = 498

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 115/278 (41%), Gaps = 28/278 (10%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           V V++I GRG  L++T+S+K G+I+ +D P+L+  +    +S + C  C R+   +    
Sbjct: 27  VGVSKIAGRG--LMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRELPDTRFMC 84

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
              C +    P CS  +    H       L +    N P+ AN  + R L    +  A N
Sbjct: 85  RMGCGL----PVCSLCSKKKQHKTDC--DLLKSWGPNEPEVANSVIIRLL---CVARALN 135

Query: 128 LAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
           L   N  Q  ++   Q  + +N      +AA    +   P      E+    +A  R N 
Sbjct: 136 L---NKDQRDLIYCLQANLDNNHRVEVRNAAKCFKNF--PTDKKIFEIMNRTVAVLRTNG 190

Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           F   +   +  +  +      +Y   +  NHDC+PN+    Y     + N  +IVR   D
Sbjct: 191 F---DKTTDRTNDNQEFNYRALYPLFAVMNHDCIPNS----YYTFEEKTNI-MIVRAAVD 242

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +P+G EI  +Y  +      R   L     F C C RC
Sbjct: 243 IPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRC 280


>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
           cuniculus]
          Length = 477

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 117/284 (41%), Gaps = 30/284 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ R   +  P+  N+ L  ++  R   
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAARIMWRVER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSAA--HYLSSLCPPPASTTIELTAALLAK 178
               L    +V+    Q  +   G     +  A    +L    P     +++  + +   
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N F L +     Q G ++V   GI+   +  NHDC PN C   +      NN  I +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGIFPNLALVNHDCWPN-CTVIF------NNGKIEL 223

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 224 RALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQ 267


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 75/300 (25%), Positives = 121/300 (40%), Gaps = 48/300 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
           +TV+    +GR L+S +    G++++   P   Y ++P  N+S   S C  CFR    S+
Sbjct: 13  LTVSTTPEKGRCLLSIKDFSPGEVIISQEP---YVSVP--NNSAVHSRCEGCFRS---SN 64

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
              C +C V  +C   C  +          C AL+RL       L  +      L + R+
Sbjct: 65  LKKCSACHVVWYCGSTCQKSDWKLHRLE--CNALSRLEKERQKSLTPSIRLMVKLYMRRK 122

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-------TSAAHYLSSLCPPPASTTI 169
           LQ+  +     +       + ++ A    +TD D          A+ ++ +   P     
Sbjct: 123 LQSEKI-----MPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPDINVK 177

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E+ A   +K   NA  +        DG       G+Y   S  NH CLPN+    + +  
Sbjct: 178 EI-AENFSKLACNAHTIC-------DGELRPLGTGLYPVISIINHSCLPNSV-LVFEERL 228

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
           A      +VR +  +P+G E+ +SY        TRQK L   Y F C C RC+    + D
Sbjct: 229 A------VVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDD 282


>gi|194869827|ref|XP_001972529.1| GG13835 [Drosophila erecta]
 gi|190654312|gb|EDV51555.1| GG13835 [Drosophila erecta]
          Length = 665

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 133/312 (42%), Gaps = 40/312 (12%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +   S+ V + E + +GR +V+ + L+AG ++L + P+   + L      + CH+CF++ 
Sbjct: 251 LIGASKVVRLVETKDKGRFVVANEGLRAGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 308

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
             S+   C  CS VAFCS +C   A+SS H  + C+ +  ++         + L    QA
Sbjct: 309 --STPVSCLHCSGVAFCSAQCMGEASSSYHR-FECEYMDLMIGSGMSILCFIALRVFTQA 365

Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
             L   +A  NL   +         P  +       G +      + ++          T
Sbjct: 366 PSLEQGLATANLLFEHLCSHEEERQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 425

Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
            +EL  A     LL   + NA  + +  +  +   DG ++V  A G+Y   S+FNH+C P
Sbjct: 426 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 485

Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
           + AC F            +++           + ++Y P  +  +   RQ+ L   Y F+
Sbjct: 486 STACHF--------VGKKLVLTATKPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 537

Query: 276 CDCDRCKVEANW 287
           C+C  C  + NW
Sbjct: 538 CNCMAC--QENW 547


>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 75/300 (25%), Positives = 121/300 (40%), Gaps = 48/300 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
           +TV+    +GR L+S +    G++++   P   Y ++P  N+S   S C  CFR    S+
Sbjct: 13  LTVSTTPEKGRCLLSIKDFSPGEVIISQEP---YVSVP--NNSAVHSRCEGCFRS---SN 64

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
              C +C V  +C   C  +          C AL+RL       L  +      L + R+
Sbjct: 65  LKKCSACHVVWYCGSTCQKSDWKLHRLE--CNALSRLEKERQKSLTPSIRLMVKLYMRRK 122

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-------TSAAHYLSSLCPPPASTTI 169
           LQ+  +     +       + ++ A    +TD D          A+ ++ +   P     
Sbjct: 123 LQSEKI-----MPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLVNLILQWPDINVK 177

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E+ A   +K   NA  +        DG       G+Y   S  NH CLPN+    + +  
Sbjct: 178 EI-AENFSKLACNAHTIC-------DGELRPLGTGLYPVISIINHSCLPNSV-LVFEERL 228

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
           A      +VR +  +P+G E+ +SY        TRQK L   Y F C C RC+    + D
Sbjct: 229 A------VVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDD 282


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QER 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 267


>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
 gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
          Length = 499

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/270 (25%), Positives = 110/270 (40%), Gaps = 28/270 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GRG+++T+SLK G+I+ +DSP+L+  A    ++ + C  C ++   +       C +  C
Sbjct: 34  GRGVMATRSLKRGEIIFKDSPLLIGLAAHEEDALNACTVCLKELPDTKFMCRQGCGLPVC 93

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P C    A        C  + +    N P+ AN  + R L    +  A NL      Q 
Sbjct: 94  -PLC----AKKKQHKTDCD-MFKSWGPNEPEVANSVIIRLL---CIARAINLT---KDQR 141

Query: 137 QILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
            ++   Q  + +N  +    AA    +   P     IE+    +A  R N F   +   +
Sbjct: 142 DLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLIEIMNRTVAVMRTNGF---DKTTD 196

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
             +  +      +Y      NHDC+PN+   FD      E    ++VR   D+P+G EI 
Sbjct: 197 RTNDNQEFNYRALYPLFGVMNHDCIPNSYYTFD------EKTNHMVVRAAVDIPEGFEIT 250

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +Y  +      R   L     F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKGFTCKCPRC 280


>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
          Length = 449

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 111/282 (39%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWASDVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A         C A+ R   + N N   AA +    + +  
Sbjct: 62  LHHCGQCKFARYCDRTCQKDAWVDHKKE--CLAVKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +            +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVENFGEEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 267


>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 319

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/292 (26%), Positives = 114/292 (39%), Gaps = 56/292 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCFR-----KTMHSSSSI 67
           +GRGL +  +++AG  VL D P +       +N       C +C R      T    S  
Sbjct: 14  KGRGLRAVSTIQAGTCVLEDEPYVY-----ILNEEQLQLRCGHCLRALSLGNTSEVESKN 68

Query: 68  CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA- 125
           C SC  V +C+  C  A        Y C    +LL     D   L   R L    L +  
Sbjct: 69  CKSCKRVVYCNKACKVAGRKEHR--YEC----KLLQGKEKDDVTL---RLLMKMILKSKN 119

Query: 126 YNLAVVN------PSQFQILLAFQGTVTDNDTSAAHY----------LSSLCPPPASTTI 169
           Y L   N      PS ++ L +FQ +    +   + +          L S   P  S  +
Sbjct: 120 YELKKSNGTLCEFPSSYRDLCSFQLSTKAQEHLLSLFRCYEKGWRSKLRSHLSPFRSQEV 179

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
             T   L    +N+F + +      +  R++    +Y++AS FNH C PN C F +    
Sbjct: 180 LDTYQKLT---INSFSMYD------EMTRTIVGEALYIRASMFNHSCEPN-CTFVF---- 225

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
               + + VR I  +  G E C+SY         R+++L   Y F C C RC
Sbjct: 226 --EGSRLSVRAIKRIEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRC 275


>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
          Length = 458

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 121/295 (41%), Gaps = 63/295 (21%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           + E E  GR LV+++ +KAG+++L++ P+++    P +N+   C  C++    + +  C 
Sbjct: 43  IKESEAEGRYLVASRLIKAGEVILQELPLVVG---PKLNTLPLCLGCYKSI--TDTYRCS 97

Query: 70  SCSVAFCSPKCSTAA-------------------ASSSHSPYVCQALTRLLNLNSPDAAN 110
            C+   CS  C  +A                   +  + +  V Q +T L  L  PD+  
Sbjct: 98  RCNWPLCSAACEESALHKNGECRMIDPTLMTNHLSQGAINSQVFQCITPLRYLTLPDSDR 157

Query: 111 LPLDR---QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
             LD     L+ R  +  Y L   N S F   L ++  +T  D               S 
Sbjct: 158 ERLDELVSHLEQRRGMDIYRLVEQNISSF---LRYRLLLTQYD---------------SE 199

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYV 226
           +I+    +L     N F   E  I+   GR SVR  G+Y  AS  NHDC+ N    FD  
Sbjct: 200 SIQRVCGILET---NCF---EIRIQ---GRVSVR--GLYPTASLMNHDCVANTRHVFDPA 248

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           D        I +    D+P G +I  +Y    ++   R+  L     F C C RC
Sbjct: 249 DF------RIRILATKDIPAGDKISATYTRSLWNTLDRRLHLKSTKHFWCQCSRC 297


>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
           mulatta]
          Length = 399

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/282 (24%), Positives = 113/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P      ++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFGMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267


>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
          Length = 429

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 112/270 (41%), Gaps = 28/270 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+ +T+ LK G I+LRD PIL+  A    ++ + C  CF K +  +  +C   C++  
Sbjct: 34  GRGVFTTRKLKRGDIILRDIPILIGLAARQEDTLNSCSICF-KVLPDTKFMCRQGCAL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H      AL +      PD AN  + R      LI       ++  Q
Sbjct: 91  --PVCSVCAKKKQHKDDC--ALFKSWEPIEPDVANSVIIR------LICIARAINLSKEQ 140

Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
             ++   Q  + +N  +    AA   +    P     IE+    +A  R N F   +   
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFNKF--PTDKKLIEIMNRTVAVLRTNGF---DETA 195

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           +  +  +      ++   +  NHDC+PN+    Y     + N  +++R   D+P+G EI 
Sbjct: 196 DRTNDNQEFFYRALFPLFALVNHDCVPNS----YYTFEEKTNY-MVLRASVDLPEGTEIT 250

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +Y  +      R   L    +F C C RC
Sbjct: 251 TTYTKLFTGNIARHLFLKMKKNFTCKCSRC 280


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 2   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ +   + N N   AA +    + +  
Sbjct: 57  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 114

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 115 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 220

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 221 LGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 262


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 115/289 (39%), Gaps = 32/289 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF----RKTM 61
           +T+   +   +G  +++ + + AG  +L + P + +  L      S CH+C     R  +
Sbjct: 56  KTIMCEKTPAKGTVVIAKKDIPAGTTILAE-PAIAWQPLED-RIMSVCHHCMTEVPRWAV 113

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTR-LLNLNSPDAANLPLDRQLQAR 120
                      + +CSPKC  A+ +     +      R + N    D   + L     A 
Sbjct: 114 GCGEGAGGCSGLGYCSPKCREASEALHRVEHKVFLQARDIANRTKADITLIKL-----AT 168

Query: 121 FLIAAYNLAVVNPSQFQI-LLAFQG-------TVTDNDTSAAHYLSSLCPPPASTTIELT 172
            +IA  +L+  +  QF+  ++A  G          D+ T  A  +  L P PA  +    
Sbjct: 169 RIIALRSLSEGHRVQFERGVMAMMGHEDECPQQWVDSITETAKLVMPLLPKPARLSAREF 228

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
             + A+   N+      ++      + +   G+Y  AS  NH C PN C F         
Sbjct: 229 VKVCARINTNSHRQHHMFVP-----QRILGVGLYPLASLINHSCQPN-CGF------YNR 276

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              + +R + DV +G E+C SY  +    S R+  LL    F C C+RC
Sbjct: 277 GPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRC 325


>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
           harrisii]
          Length = 490

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 29/293 (9%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTM 61
           A  E V V   EG+GRGL +T+ L A  +V  +     YSA+ F + +   CH CF++  
Sbjct: 4   ASMEKVEVFTSEGKGRGLKTTKELWAADVVFAEPA---YSAVVFDSFTQVVCHTCFKR-- 58

Query: 62  HSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
                 C  C  A +C   C   A  +  +   C A+ +      P + N+ L  ++  R
Sbjct: 59  QEKLQRCGQCKFARYCGRPCQKDAWLNHKNE--CAAIKKH---GKPPSENIRLAARIMWR 113

Query: 121 FLIAAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAAL 175
                  L    +V+    Q  +   G     D       +L+   P     +++  + +
Sbjct: 114 IEREGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRMDVDSFLNFWPPQSQQFSMQYISHI 173

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAA 229
                 NAF L +     Q G ++V   GI+      NHDC PN          + V + 
Sbjct: 174 FGVINCNAFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSM 227

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                 I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 228 FHTQMRIELRALGKISEGEELTVSYIDFLNISQERKKQLKKQYYFDCTCEHCE 280


>gi|162458873|ref|NP_001105199.1| SET domain protein 123 [Zea mays]
 gi|27466907|gb|AAO12860.1| SET domain protein 123 [Zea mays]
 gi|194705040|gb|ACF86604.1| unknown [Zea mays]
 gi|195624178|gb|ACG33919.1| SET domain protein 123 [Zea mays]
 gi|414871456|tpg|DAA50013.1| TPA: SET domain protein 123 [Zea mays]
          Length = 303

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 116/289 (40%), Gaps = 27/289 (9%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  E  GRG+ +T+ + +G+++    P++ + +   ++    C+NC R+      + 
Sbjct: 23  IRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHE--VCYNCLRRKPGEGRA- 79

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
             S    FCS  C   A                 + +S D     + ++L    +  A N
Sbjct: 80  -SSGGDYFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP------PASTTIELTAALLAKDRL 181
              +N  Q  +L   QGT+ +         S+              TI+    +LA+ R+
Sbjct: 139 ADCLNILQPALL--HQGTLIEMKEEFELLESTFRKAGFQEEITTFLTIDWYINVLARIRI 196

Query: 182 NAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           NAF +       +D   S  A           +Y+  SF+NHDC PNA            
Sbjct: 197 NAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNA------HIVWLQ 250

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           N D  ++ + D+ +G E+C+ Y   + D   RQK L   + F C C RC
Sbjct: 251 NADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 299


>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/290 (26%), Positives = 121/290 (41%), Gaps = 44/290 (15%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSS 66
           + VA +  +GR LV+T+    G++++   P   Y  +P  +++ S C  CF     SSS+
Sbjct: 13  LRVANVTEKGRSLVTTKDFNPGEVIISQEP---YVCVPNNSATESRCDRCF-----SSSN 64

Query: 67  I--CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQ 116
           +  C +C VA +C   C             C+AL++L        +P       L + R+
Sbjct: 65  VKKCSACQVAWYCGSSCQKLEWKLHRIE--CEALSKLDKDRRKYVTPSIRLMVRLLIRRK 122

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELT 172
           LQ+  +I++   A  N    + L+A    + +         A+ ++ +   P    I+  
Sbjct: 123 LQSEKIISST--ATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWP-DVNIKEI 179

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAE 231
           A   +K   NA  +        D        G+Y   S  NH CLPNA   FD   A   
Sbjct: 180 AENFSKLACNAHTIC-------DSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLA--- 229

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                +V  +  +P+G E+ +SY        TRQK L   Y F C C RC
Sbjct: 230 -----VVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRC 274


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/282 (24%), Positives = 115/282 (40%), Gaps = 26/282 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ +   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R 
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRA 225

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 226 LGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 267


>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
 gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 490

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 114/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVITSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRMDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L E     Q G ++V   GI+      NH+C PN          + V +     T
Sbjct: 179 CNGFTLSE-----QRGLQAV-GVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQT 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNFSEERKKQLKKQYYFDCTCEHCQ 280


>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 740

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 115/305 (37%), Gaps = 68/305 (22%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS---- 72
           GR +++T+ +K G+++LR +P     AL        C +CFRK  +  S  C SC     
Sbjct: 272 GRCIMATRDIKQGELILRVAP--FGCALSDDQIDLNCGSCFRKIKYYKSEKCNSCKTNLL 329

Query: 73  ----------VAFCSPKCSTAAASSSHSPYV----------------CQALTRLLNLNSP 106
                     V+    +CS  +    H P                     L R+    +P
Sbjct: 330 CEICIKDPTIVSMHQEECSLISFLQKHYPNAQTRDFRFMIRVLLSGRANKLGRITKEKTP 389

Query: 107 DAANLPLDRQLQARFLIAAY--------NLAVVNPSQFQILLAFQGTVTD--NDTSAAHY 156
              N       Q  F+  +Y        +++ +   Q Q       ++T+         Y
Sbjct: 390 QLWN-------QQPFIYDSYEDLASLTTDVSKIESKQLQSFTVATNSITNFFAAVKGPKY 442

Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
           LS   P       +L   +L     NA   ++P    + GR      GIY  AS+ NH C
Sbjct: 443 LSD--PFTKQEIFDLYPKIL----FNAHEYIDPLTHYEIGR------GIYPTASYINHTC 490

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
           LPN   ++      +++  I+ R   D+ +G EI  SY  +      R+K  L  Y F C
Sbjct: 491 LPNTTWYN------DDHGLILYRSSRDILKGEEITTSYLDILKPKLQRRKD-LKQYSFVC 543

Query: 277 DCDRC 281
            C+RC
Sbjct: 544 QCERC 548


>gi|414871458|tpg|DAA50015.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
          Length = 299

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/287 (24%), Positives = 117/287 (40%), Gaps = 27/287 (9%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  E  GRG+ +T+ + +G+++    P++ + +   ++    C+NC R+      + 
Sbjct: 23  IRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHE--VCYNCLRRKPGEGRA- 79

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
             S    FCS  C   A                 + +S D     + ++L    +  A N
Sbjct: 80  -SSGGDYFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAKDRLNA 183
              +N  Q  +L   QGT+ +         S+     A    E+T      +LA+ R+NA
Sbjct: 139 ADCLNILQPALL--HQGTLIEMKEEFELLESTF--RKAGFQEEITTYWYINVLARIRINA 194

Query: 184 FGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           F +       +D   S  A           +Y+  SF+NHDC PNA            N 
Sbjct: 195 FRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNA------HIVWLQNA 248

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           D  ++ + D+ +G E+C+ Y   + D   RQK L   + F C C RC
Sbjct: 249 DAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 295


>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
 gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
          Length = 463

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/287 (25%), Positives = 109/287 (37%), Gaps = 60/287 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
           +GRGL +T+  K G +V    P     A    NS     C  CF +      S C  C  
Sbjct: 13  KGRGLCATKVFKPGNLVRAADPY----AYVLCNSERGKRCDFCFARK--DDMSRCSGCKF 66

Query: 74  A-FCSPKCSTAAASSSHSPYVCQALT---------------RLLNLNSPDAANLPLDR-- 115
           A +C  KC  AA +   S   C+++                R++N    D+  +P +   
Sbjct: 67  ARYCDGKCQKAAWTEHKSE--CKSIKTVKPETPTDSIRLIARIINKTKTDSPGVPGNSID 124

Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
           +LQ+       N+  +      +L  + G    +D              A    EL   +
Sbjct: 125 ELQSNLREMPENVKEMFAQLAVVLRMYVGKDVMDD--------------AREIFELFGRM 170

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                 N F + +P ++           GIY K S FNH C PN         A  N   
Sbjct: 171 TC----NTFSICDPEMQ-------YIGIGIYPKMSLFNHSCEPNC-------VAVFNGLR 212

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           + VR I ++  G E+ +SY  +    S R+++LL  Y F C C RC+
Sbjct: 213 MEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRCQ 259


>gi|225713426|gb|ACO12559.1| msta, isoform A [Lepeophtheirus salmonis]
          Length = 505

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 120/283 (42%), Gaps = 50/283 (17%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+T+ +KA +++L +SP+++    P+I+S++ C  CF KT+ +S   C SC+  FC
Sbjct: 59  GRHLVATRDIKALEVILEESPLVIG---PYISSNNQCLECF-KTLDTSPHSCISCNYPFC 114

Query: 77  SPKCS---------TAAASSSHSP-------YVCQALTRLLNL--NSPDAANLPLDRQLQ 118
           S +CS         +A     H         Y      RLL+L  + P+  N  +   L 
Sbjct: 115 SQECSQGPWHKIECSAFQKYGHKRDENDNDFYSIITTLRLLSLQESKPEFWNTIMKDMLD 174

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
               I  Y      P ++          T  + +    + SL    A T  +    L+  
Sbjct: 175 HNSDIQEY-----TPEKW----------TSYEETVVKKIQSLTKGNADT--DTLTRLIGV 217

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N   L  P+ +G+ G+ +    G+YL  S  NH C+ N    ++ D   E      +
Sbjct: 218 IDTNGADLNLPH-DGKHGKGT----GLYLIYSNLNHSCVCNTKTINFQDRRIE------M 266

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R   ++ +G +IC  Y   N     R+  +   + F C C RC
Sbjct: 267 RASVNISKGEQICNQYVRPNKCTLARRTTIKNKWYFDCCCKRC 309


>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
 gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
 gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
 gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
 gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
 gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
 gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
 gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
          Length = 490

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
          Length = 484

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 1   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 55

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 56  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 113

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 114 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 172

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 173 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 226

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 227 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 274


>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
          Length = 493

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
          Length = 510

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 503

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 70/328 (21%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT---- 60
           S  ++++    + R +++T+S+  G  VL +       A+ FI+  + C  C   +    
Sbjct: 29  SSGLSLSTSATKHRCILATKSIPVGSTVLSEP------AMAFISLGTLCEYCLSTSDIQG 82

Query: 61  MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYV-------CQALTRLLNLN-------- 104
            H+    C  C  V +CS  C  A   + H+          C A TR  N N        
Sbjct: 83  QHTQLQRCTGCRQVVYCSVVCQKADWIAGHNSACAVLKTIDCSATTRQSNRNDIAMLFKV 142

Query: 105 -----SPDAANLPLDRQLQARF----LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH 155
                +P   +  LD ++        +    N+  + P  F   L+ Q  V D +T  A 
Sbjct: 143 VRIISNPSFQSTTLDDRMDLDCPMDNVALVRNIRSIQPLVF---LSLQSHVLDFETCQAS 199

Query: 156 Y--------------LSSLCPPPASTTIELTAA----LLAKDRLNAFGLMEPYIEGQDGR 197
           Y              + S  P  A   + L  +     L + R N F ++       D  
Sbjct: 200 YPHATNALNGHSMKQIESQLPSAAVKLMNLPVSDLIHHLGRFRCNNFTII-------DSN 252

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
                 G Y  AS FNHDC PN         A  + + ++++ I D+ +G E+C+SY   
Sbjct: 253 LFPVGEGTYPLASLFNHDCWPNC-------IAIFDGSRVVIQTIRDIAKGDELCISYIDP 305

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
             D+ +R+  L   Y F C C  C  E+
Sbjct: 306 ILDHDSRRMSLETKYCFNCQCSVCMSES 333


>gi|195493827|ref|XP_002094580.1| GE20125 [Drosophila yakuba]
 gi|194180681|gb|EDW94292.1| GE20125 [Drosophila yakuba]
          Length = 667

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 130/312 (41%), Gaps = 40/312 (12%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +   S+ V + E + +GR +V+ + L+ G ++L + P+   + L      + CH+CF++ 
Sbjct: 253 LIGASKVVRLVETKEKGRFVVANEGLRTGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 310

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
               S  C  CS +AFCS +C   A SS H  + C+ +  ++         + L    QA
Sbjct: 311 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRVFTQA 367

Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
             L   +A  NL   +         P  +       G +      + ++          T
Sbjct: 368 SSLEQGLATANLLFEHLCSHEEERQPEDYLRRTLMSGFLLRILQKSLYFGRRKTEGVNPT 427

Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
            +EL  A     LL   + NA  + +  +  +   DG ++V  A G+Y   S+FNH+C P
Sbjct: 428 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 487

Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
           + AC F            +++           + ++Y P  +  +   RQ+ L   Y F+
Sbjct: 488 STACHF--------VGKKLVLTATKPHRANESVAVNYGPIFIKNNLKERQRALRGRYSFS 539

Query: 276 CDCDRCKVEANW 287
           C+C  C  + NW
Sbjct: 540 CNCMAC--QENW 549


>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Cavia porcellus]
          Length = 477

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/280 (23%), Positives = 111/280 (39%), Gaps = 22/280 (7%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V +   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEIFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCST-AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
              C  C  A +C   C   A  +  H     +   ++ N N   AA +    + +   L
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKHECSAIKRYGKVPNENVRLAARIMWRVEREGTGL 121

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
                L  V+  Q  +    +    +       +L    P     +++  + +      N
Sbjct: 122 TEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN 180

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
            F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +R + 
Sbjct: 181 GFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPN-CTVIF------NNGKIELRALG 227

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +  G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 228 KISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 267


>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
          Length = 490

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVPVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
           gorilla gorilla]
          Length = 459

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 1-like [Pongo abelii]
          Length = 490

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
           garnettii]
          Length = 490

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +I+  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQPFSIQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V A      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVTEERRKQLKKQYYFDCACEHCQ 280


>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
           troglodytes]
          Length = 452

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFITEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|21355139|ref|NP_648574.1| SET and MYND domain protein 4 [Drosophila melanogaster]
 gi|7294583|gb|AAF49923.1| SET and MYND domain protein 4 [Drosophila melanogaster]
 gi|17945748|gb|AAL48922.1| RE32936p [Drosophila melanogaster]
          Length = 663

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 74/312 (23%), Positives = 132/312 (42%), Gaps = 40/312 (12%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +   S+ V + E + +GR +V+ + L+ G ++L + P+   + L      + CH+CF++ 
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLFEEPVA--ACLEPSYFGTHCHHCFKR- 305

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H+  S C  CS +AFCS +C   A SS H  + C+ +  ++         + L    QA
Sbjct: 306 LHTPVS-CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363

Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
             L   +A  NL   +         P  +       G +      + ++          T
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 423

Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
            +EL  A     LL   + NA  + +  +  +   DG ++V  A G+Y   S+FNH+C P
Sbjct: 424 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVYLAAGLYGTGSYFNHECWP 483

Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
           + AC F            +++           + ++Y P  +  +   RQ+ L   Y F+
Sbjct: 484 STACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 535

Query: 276 CDCDRCKVEANW 287
           C C  C  + NW
Sbjct: 536 CSCMAC--QENW 545


>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
          Length = 529

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 24/273 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR +VS ++LK G ++L ++P+++    P       C  C+ KT+ SS     +C    C
Sbjct: 49  GRYMVSRKNLKPGDVILSEAPLVIG---PCTGCKVQCLGCY-KTLESSFCTKCTCGWPLC 104

Query: 77  SPKCSTAAASSSHSPYVCQAL-----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           SPKC        HS   C  L     ++ LN ++ +      +  +  R L+    L   
Sbjct: 105 SPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCLL----LKET 160

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLME-- 188
           +   +Q L+  +            + ++         I+      +++ +++  G++E  
Sbjct: 161 DARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVN 220

Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
            +  GQ G   V   G+Y  A   +HDC+PN    D      E+   + VR    +  G 
Sbjct: 221 SFEIGQQG---VNIRGLYPSAFLMSHDCVPNTNHID-----EESTFRLTVRASTRIEPGE 272

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            I LSY         R++ LL +  F C C RC
Sbjct: 273 MITLSYAYTLQSTLKRREHLLENKFFECQCRRC 305


>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
          Length = 338

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +Y + S  NH C PNA       A   N    ++R + D+P+G E+C+SY         R
Sbjct: 133 VYAELSRANHSCQPNA-------AVVYNGAAAVLRSMRDIPEGEEVCISYVDPTLARDVR 185

Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
           ++ L+  Y FACDC RC  EA+
Sbjct: 186 RRELVQSYGFACDCARCATEAS 207


>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
          Length = 528

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 24/273 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR +VS ++LK G ++L ++P+++    P       C  C+ KT+ SS     +C    C
Sbjct: 48  GRYMVSRKNLKPGDVILSEAPLVIG---PCTGCKVQCLGCY-KTLESSFCTKCTCGWPLC 103

Query: 77  SPKCSTAAASSSHSPYVCQAL-----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           SPKC        HS   C  L     ++ LN ++ +      +  +  R L+    L   
Sbjct: 104 SPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCLL----LKET 159

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLME-- 188
           +   +Q L+  +            + ++         I+      +++ +++  G++E  
Sbjct: 160 DARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQEEIHSVCGILEVN 219

Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
            +  GQ G   V   G+Y  A   +HDC+PN    D      E+   + VR    +  G 
Sbjct: 220 SFEIGQQG---VNIRGLYPSAFLMSHDCVPNTNHID-----EESTFRLTVRASTRIEPGE 271

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            I LSY         R++ LL +  F C C RC
Sbjct: 272 MITLSYAYTLQSTLKRREHLLENKFFECQCRRC 304


>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
          Length = 490

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFIAEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    D       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
           jacchus]
          Length = 442

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIRRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 114/288 (39%), Gaps = 44/288 (15%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
           S+ V + E   +GRG+   + L  G  V +++P     +   +  SS+CH+C    M S 
Sbjct: 8   SDNVEIFETYEKGRGIRCKKQLAIGTSVGKENPFCHVVSQDML--SSYCHSCL--LMQSE 63

Query: 65  SSICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL----------PL 113
              C  C +  +C   C           Y C+++TRL     PD+  L            
Sbjct: 64  LYKCSRCKIIMYCCKSCQ--KEDWQWHKYECKSITRLGPKVPPDSIRLLGRVAYTILQGQ 121

Query: 114 DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
           DR  Q +FL++  N  ++  S+       + T+ D       YLS+      +       
Sbjct: 122 DRADQFKFLLS--NRELLEGSR-------KNTIVDGINLLKEYLSN----KVAINENEII 168

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++++   N F +    ++           G+Y   S  NH C PN        +A    
Sbjct: 169 EIISRVTCNTFTICNSEMQ-------TVGIGVYPGLSLVNHSCSPNC-------SATFRG 214

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +R+I +   G E+ +SY       S+RQ +L   Y F C C+RC
Sbjct: 215 KQMQLRIIENTKIGDELLISYIDPMQVLSSRQNQLQSQYCFKCICERC 262


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 111/283 (39%), Gaps = 30/283 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
           E V V +  G+GRGL +T+   AG ++  + P   ++++ F   +SS CH+CFR+     
Sbjct: 2   EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
              C  C  A +C   C  A          C A+        P + N+ L  ++  R   
Sbjct: 57  LQRCGQCRFAQYCDKTCQRAGWEEHKLE--CAAIK---TYGKPPSENVRLAARILWRMDK 111

Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              +++   L  +   +  I    +  + D      ++L          T++  + +L  
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N F      +  Q G ++V   G++      NHDC PN            NN  I +
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNNGKIEL 218

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R +  +  G E+ ++Y       + RQ+ L   Y F C C  C
Sbjct: 219 RALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHC 261


>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
           boliviensis boliviensis]
          Length = 490

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280


>gi|255084131|ref|XP_002508640.1| predicted protein [Micromonas sp. RCC299]
 gi|226523917|gb|ACO69898.1| predicted protein [Micromonas sp. RCC299]
          Length = 701

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 75/314 (23%), Positives = 124/314 (39%), Gaps = 44/314 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS-CSVA 74
           +GR + + + +K G+ +LR  P +    +        C  CF KT+ SS   C   C V 
Sbjct: 119 KGRCVRARREIKPGETILRCDPFV--HVVHDRRREDHCAGCF-KTLQSSCVECGDFCGVK 175

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
           +C+  C  +  +         A   ++  N+   A+L       AR  +   +    +P+
Sbjct: 176 YCADDCRESDVAHV-------AECSMVRTNAGTGADL-----RGARMCLRLIHKRQTDPA 223

Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ 194
           +F  ++A       + + AA  L+S     A+        +L K R N+ G++       
Sbjct: 224 RFAEVMAALRCEKKSPSPAATKLASAVRHAAALDPSEVEDMLGKTRENSHGVV------- 276

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRF---------------DYVDAAAENNTDIIVR 239
           D +      GIY +AS FNH C PNA                  DY   A E      V 
Sbjct: 277 DWKLRQLGTGIYPEASMFNHSCAPNAAVSFGAGGTLNTCCIGVDDYEARAMEVGPKGYVP 336

Query: 240 MIHD--VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND---- 293
           +     + +G E+C++Y  +    ++R+  L     F C+C RC   +     D +    
Sbjct: 337 LGRPRAIRKGEEVCIAYTELYRPAASRRLTLEKSKGFVCECVRCANASVLRALDTELHAA 396

Query: 294 DENNEEVMDEDQDE 307
            + NEEV DE   E
Sbjct: 397 TDGNEEVADEKAKE 410


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 72/329 (21%), Positives = 133/329 (40%), Gaps = 35/329 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
           ETV + +  G+GRGL +T+   AG ++  ++ I  +++ +L    +   CH+CFR+    
Sbjct: 2   ETVAIFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSL----AERICHSCFRR--QE 55

Query: 64  SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQA 119
               C  C  A +C   C  A  A         +A  ++ N N    A +   LD++   
Sbjct: 56  KLQRCGQCKFAHYCDRTCQRAGWAEHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGST 115

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
              ++   L  ++  +  I    +  + +      ++L          TI+  + +    
Sbjct: 116 ---VSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVI 172

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
             N F      +  Q G ++V   G++      NHDC PN            N+  I +R
Sbjct: 173 NCNGFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELR 219

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDNDND 293
            +  + +G E+ ++Y         RQ+ L   Y F C C+ CK      ++    + D  
Sbjct: 220 ALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKNRIKDDIKIGGREEDGV 279

Query: 294 DENNEEVMD-EDQDEQMVASDDDAEVHGD 321
             + E+V +  D   QM+   + A ++GD
Sbjct: 280 KPSEEQVKEATDYCFQMLEKMEKARLNGD 308


>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 227

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 58/116 (50%), Gaps = 12/116 (10%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++++   N FG+ +P     +G+ +     ++  AS+FNH C PN         + +++ 
Sbjct: 75  IISRIESNCFGIWKP-----NGKEACMGRAVFPAASYFNHSCFPNC-------QSIKHDH 122

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
            +  R + DV +G  + +SY   N   S R+ RL+ DY F C C+RC  E+    N
Sbjct: 123 KMAFRTLKDVSKGEMLTISYIDTNMPVSARRARLMDDYFFECMCERCISESGMVSN 178


>gi|195327115|ref|XP_002030267.1| GM24661 [Drosophila sechellia]
 gi|194119210|gb|EDW41253.1| GM24661 [Drosophila sechellia]
          Length = 660

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 130/312 (41%), Gaps = 40/312 (12%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +   S+ V + E + +GR +V+ + L+ G ++L + P+   + L      + CH+CF++ 
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVA--ACLEPTYFGTHCHHCFKRL 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
               S  C  CS +AFCS +C   A SS H  + C+ +  ++         + L    QA
Sbjct: 307 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363

Query: 120 RFL---IAAYNLAVVN---------PSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST 167
             L   +A  NL   +         P  +       G +      + ++          T
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKSLYFGRRKTEGVNPT 423

Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHDCLP 218
            +EL  A     LL   + NA  + +  +  +   DG ++V  A G+Y   S+FNH+C P
Sbjct: 424 AVELQVATALLGLLQVLQYNAHQIYQTQVTEEHRFDGSKTVNLAAGLYGTGSYFNHECWP 483

Query: 219 N-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFA 275
           + AC F            +++           + ++Y P  +  +   RQ+ L   Y F+
Sbjct: 484 STACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIKNNLKERQRSLRGRYSFS 535

Query: 276 CDCDRCKVEANW 287
           C+C  C  + NW
Sbjct: 536 CNCMAC--QENW 545


>gi|115481972|ref|NP_001064579.1| Os10g0410700 [Oryza sativa Japonica Group]
 gi|78708616|gb|ABB47591.1| SET domain protein 123, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639188|dbj|BAF26493.1| Os10g0410700 [Oryza sativa Japonica Group]
 gi|215767244|dbj|BAG99472.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767272|dbj|BAG99500.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612812|gb|EEE50944.1| hypothetical protein OsJ_31491 [Oryza sativa Japonica Group]
          Length = 298

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 128/301 (42%), Gaps = 39/301 (12%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
           +A +  + VA  E  GRG+ +T+ + AG+++    P++ + + P I+    C++C R+  
Sbjct: 12  TAAAPPIRVALTESSGRGVFATRPISAGEVLHSAQPLVSHPSPPLIHE--VCYSCLRRKS 69

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQAR 120
            S      SC   FCS  C   A    H        +   +  S      P + ++L   
Sbjct: 70  GSGGGSSGSCY--FCSDACREHA-KGFHGVEKKADWSLFDDHCSSRGLKYPYMAKRLACM 126

Query: 121 FLIAAYN---LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-- 175
            +  A +   L ++ P++       QGT+T+ +   A   S+     A    E+T  L  
Sbjct: 127 VISGAVSADCLDILQPARLH-----QGTLTEMEEEFALLDSTF--RKAGFQEEITTFLTK 179

Query: 176 ------LAKDRLNAFGL------MEPYIEGQDGRRSVRAY---GIYLKASFFNHDCLPNA 220
                 LA+ R+NAF +       E  +       S  A     +Y+  SF+NHDC PN 
Sbjct: 180 EWYINVLARIRINAFRIELVASSYENLLSSAVASVSCDAAVGNAVYMLPSFYNHDCDPNT 239

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
               ++ +A     D  ++ + ++ +G E+ + Y   + D   RQ+ L   + F C C R
Sbjct: 240 -HIVWLASA-----DARLKALRNIEEGEELRICYIDASMDVDARQRILAEGFGFECRCQR 293

Query: 281 C 281
           C
Sbjct: 294 C 294


>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 554

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 69/286 (24%), Positives = 117/286 (40%), Gaps = 31/286 (10%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSSSSIC 68
           +A  E  GR L +++ L+AG+++LR+ P+ +       N   F C +           IC
Sbjct: 19  IAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCMKNPICFECLSLLPDIEQDFHYIC 78

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
             C+V   C   C   A    HS Y C+ +   + L+  +   L         F++  + 
Sbjct: 79  SGCNVVTLCGFSCEERAY--YHSMYECEIIKNNMELSIENTDTLA-----AVLFVLRLWL 131

Query: 128 LAVVNPSQFQILLAFQGTVT---------DNDTSAAHYLSSL--CPPPASTTIELTAALL 176
           L   +P  ++ +L+ +  +          D + +  + L SL   P    +  E+   L 
Sbjct: 132 LKQKDPELWKRVLSLESHLNKRRNTIVWEDREINIVNVLKSLNFVPENDPSVSEIIQQLC 191

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
               +N+F L  P   G DG   +   G+YL+AS   HDC  N      V   A++N  +
Sbjct: 192 GILDVNSFELRSP--GGMDG---LLLRGLYLEASLMAHDCRGN------VHVTADDNFHL 240

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            V     + +G  I  +Y       + R++ L     F C+CD CK
Sbjct: 241 TVYASIPIKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDLCK 286


>gi|330846003|ref|XP_003294847.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
 gi|325074603|gb|EGC28624.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
          Length = 428

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 134/318 (42%), Gaps = 71/318 (22%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF------RKTMHSSSSICPS 70
           GR LV+T+ ++   ++LRD P     A+        C +CF      ++ + +   +C  
Sbjct: 78  GRYLVATKDIEEQTVILRDLPYTW--AVDHAACDIVCQHCFLEVPLNQQILPTDFYMCEG 135

Query: 71  CS-VAFCSPKCS--------------TAAASSSHSPYVCQALTRLLN------LNSPDAA 109
           C+ V +CS  C                   +  +SP++   +  L+       L     +
Sbjct: 136 CNRVGYCSIHCKYIDYNQHRFECQIFKELDTEEYSPFLLSEIKLLVRTLSRKWLEESINS 195

Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDND-----TSAAHYLSSLCPPP 164
           +  +D    ++  I  YN    NPS    L+     +  ND     ++  +Y  SL    
Sbjct: 196 SAGVDESEISK--INTYN-QYKNPSS---LIPQDNGLRYNDYADLVSNVENYNESLKESL 249

Query: 165 A----STTIELTAAL------------LAKDRLNAFGLMEPYIEGQ--DGRRS-VRAYGI 205
           +       ++L A +            L ++R NAF     YI+G+  DG     R  G+
Sbjct: 250 SYWICKYIVKLAAKIDKEEDEDELLNILLRNRCNAF-----YIQGRPRDGTSGESRGCGV 304

Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           Y++ SFFNH C PN      V+    NNT ++   ++ +V +G E+ +SY   +   + R
Sbjct: 305 YVRNSFFNHSCDPN------VNYWVVNNTLEVECTLLKNVKEGEELTISYIDTSAPLNKR 358

Query: 265 QKRLLYDYDFACDCDRCK 282
           +++LL  Y F C C +CK
Sbjct: 359 REKLLEGYLFTCLCTKCK 376


>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
          Length = 462

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNLSEERRKQLKKQYYFDCTCEHCQ 280


>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
          Length = 464

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 75/297 (25%), Positives = 114/297 (38%), Gaps = 48/297 (16%)

Query: 19  GLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSCSVA-FC 76
           GL +T+  + G  +L +     YS + F  N  + CH CF+     S   C  C  A +C
Sbjct: 2   GLFATKPYEVGNSILVEEA---YSWVTFRDNGPNICHTCFK--FSKSLKKCSQCKFARYC 56

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN-----LAVV 131
              C   A   +   + C+A+       S   A +P   +L A FL  A N     + +V
Sbjct: 57  GLDCQKKAWKENSHRWECKAIA------SSVEAKIPSIVRLAALFLFRALNSNDKDIRLV 110

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSS---------------LCPPPASTTIEL----- 171
           +  +     ++    + +D +    LS                L   P     EL     
Sbjct: 111 DRCR-SFPYSYSNMDSRSDENDVDVLSRQRVDCLVEDLLRFIRLSHYPQHEVSELFEDNF 169

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN--ACRFDYVDAA 229
            A+L+    +NA  +        D   +    G + KASF NHDC PN  A       + 
Sbjct: 170 VASLIRMLEMNAHTIY-------DSELNTLGVGFFPKASFMNHDCRPNCVALFTGGFHSV 222

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           +     I +R I  +  G EI +SY  V   +  R + L   Y F C C RCK E++
Sbjct: 223 SGKPISIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLKEHYQFECCCSRCKEESS 279


>gi|260940791|ref|XP_002615235.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
 gi|238850525|gb|EEQ39989.1| hypothetical protein CLUG_04117 [Clavispora lusitaniae ATCC 42720]
          Length = 402

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           +I+    ++ ++  NAFG+  P +   D  R    +G+Y  ASFFNH C PN  +   + 
Sbjct: 291 SIQAVRDIIGRNLTNAFGIWSP-VTSSDEEREYFGFGVYPSASFFNHSCKPNVTK---IR 346

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             A  + + I +   D+P G E+C+SY     D   +++  L ++ F C C RC  E  
Sbjct: 347 RGA--SYEYIAK--EDIPVGAELCISYGIRESDGLKQRQDALREWFFECGCSRCTTEGK 401


>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
 gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
          Length = 487

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 67/280 (23%), Positives = 110/280 (39%), Gaps = 29/280 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA-F 75
           G  +V+++ LK G+I+L++ P++L    P INS+  C  CFR+ +      C  C +A  
Sbjct: 3   GTFMVASKDLKPGEIILKEQPLILG---PSINSNILCLGCFRRILFEKFKFCRECQIAPV 59

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA--------ANLPLDRQLQARFLIAAYN 127
           C   CS     S HS Y C+   +L   +  D           LPL       FL    +
Sbjct: 60  CGEDCSL---KSQHSKYECKYYKKLFENDKIDKNFIVSNFQIVLPLKCFSLKLFLFPDDD 116

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL------ 181
               +     I +        N+    +Y  ++     S  I      L   ++      
Sbjct: 117 DDDDHLWDNFINMESHVDERQNEPIWNYYQENIIKVLNSVNILNDEECLLMQKICGILDV 176

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N+F +  P  E       +R  GIY +A+F +HDC+ N         +  +  ++ +   
Sbjct: 177 NSFEMRCPDDEQISAGNQLR--GIYREAAFMSHDCIGNT------HVSLNDFFEMTIHAS 228

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +  G+ I  +Y       + RQ+RL     F C C RC
Sbjct: 229 VPIKSGQAIFFNYANPFECTAARQQRLKKGKYFNCFCKRC 268


>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 481

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 85/324 (26%), Positives = 126/324 (38%), Gaps = 63/324 (19%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS- 66
           V V    GRGR  V+ +++ AG +VLR+       A P       CH CFR   ++    
Sbjct: 106 VEVRYARGRGRYAVAARAVAAGAVVLRERAFAAAVAAP--QQGLACHACFRYAPNAGPGG 163

Query: 67  -----ICPSCS-VAFCSPKCSTAAASSS--HSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
                 C  C  V+FCS  C+T   S    H    C+AL RL    + +     +D   +
Sbjct: 164 AALRHTCARCGQVSFCSAACATPTPSGRFVHPKAECEALQRLHT--TCNLKGFSVDEVAE 221

Query: 119 ARFLIAAYNLAVVNPSQFQILL-AFQGTVTDN----DTSAAHYLSSLCPPPASTTIELTA 173
            R LI    L  +  +   +      G  +D     +TS AH++  +     S      +
Sbjct: 222 VRLLIRILVLRWIAANSANVEEDGNNGEGSDTSSAVETSFAHFMELM-----SNKHSFGS 276

Query: 174 ALLAKDRLNAFGLMEPY--------IEGQD--------GR-------------RSVRAYG 204
           A + + R  AF +ME          ++G D        GR             R    YG
Sbjct: 277 AAINRFRQRAFLIMEALSGRATSDVLQGLDVDEIVEYIGRLESNTFARYSTTKRKTVNYG 336

Query: 205 --IYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             +Y  A+ FNH C P+  R FD           + +R +  +  G E+ ++Y P+  D 
Sbjct: 337 SALYNCAAMFNHSCFPSVVRQFD--------GQHLTLRALRPLNPGDELTMTYIPLRDDT 388

Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
             RQ  L   Y FAC C RC+   
Sbjct: 389 PERQDELQQHYHFACRCQRCEARG 412


>gi|358391549|gb|EHK40953.1| hypothetical protein TRIATDRAFT_29913 [Trichoderma atroviride IMI
           206040]
          Length = 554

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 71/292 (24%), Positives = 114/292 (39%), Gaps = 46/292 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSS------- 66
           G GL +T+ L AG+++    P++L + +   +S   C  CF   R+ +H           
Sbjct: 66  GSGLFTTRDLAAGELIFTSVPLVLCAEVG--DSKEACDFCFQQRRRAIHPVEDRLADPGE 123

Query: 67  ------ICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
                  C  C++  +CS  C   A  + H  Y C  L           AN P + +++ 
Sbjct: 124 TLPDVYKCMGCNLYQYCSESCWQRAWDTGHL-YECGLL-----------ANAPYELEIRM 171

Query: 120 RF-LIAAYNLAVVNPSQFQIL--LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
            + ++      V+ P Q Q L  LA +    +  +S    +  +    A     + + L 
Sbjct: 172 LYRILILLRKKVLLPEQVQALARLAHEQDKYEQLSSDWQGVKDIA---AEAKRRMKSELD 228

Query: 177 AKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
             D L  + L+        Q  R S     I L A+  NH+C PN            N+T
Sbjct: 229 VADVLKLYCLIRCNAVPVDQTFRNSPLGSAIDLGAAMLNHNCEPNI-------VIVFNST 281

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            +  R +  +  G E+   Y  + YD + R  R+   Y F C CDRC  E N
Sbjct: 282 RVEARAVRSIKAGEELQHCYRDIAYDCTFRSPRIAARYQFKCQCDRCIRETN 333


>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
           cuniculus]
          Length = 490

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ R   +  P+  N+ L  ++  R   
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAARIMWRVER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSAA--HYLSSLCPPPASTTIELTAALLAK 178
               L    +V+    Q  +   G     +  A    +L    P     +++  + +   
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              N F L +     Q G ++V   GI+   +  NHDC PN          + V +    
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGIFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHT 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQ 280


>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
 gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
          Length = 614

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 121/323 (37%), Gaps = 75/323 (23%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
           VA ++ RGR  V+T+++KAG++V+ +  I       +    + CH C      +     C
Sbjct: 68  VATVKNRGRCAVATRAIKAGEVVVAEQAIAFVPRGQY--RKAICHVCGVNVAENGPKFEC 125

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
            +C+      +C       SH  Y   A+ R       + A++  D    A  L   +NL
Sbjct: 126 SACNYTIFCEECYKPP-QGSHENYC--AVYR-------EIASIARDENCDADLLRLVFNL 175

Query: 129 AVVNPS---------QFQIL---LAFQ---------------------GTVTDNDTSAAH 155
           A+             Q ++L   L FQ                     G  T    + + 
Sbjct: 176 ALKKAGTAQARSSFLQLELLIETLPFQNSKDETCTVDGVILSCFKDALGLETHRSKAPSS 235

Query: 156 YLSSL---CP--------------PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
           +L ++   C                P S +IE    L      N+ G+       Q+ + 
Sbjct: 236 WLEAVRNGCTRLHKIACVGNGGKLAPISCSIEELEMLATLINTNSHGM-----GAQNLQN 290

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
           +  A G++   S  NH C PN C        A   + + VR + D+P+G E+CLSY  + 
Sbjct: 291 THVALGLFPYVSILNHSCRPNCC-------FASEGSVMYVRAVQDIPKGAELCLSYINLY 343

Query: 259 YDYSTRQKRLLYDYDFACDCDRC 281
                R+  L+    F C CDRC
Sbjct: 344 ESRRVRKTLLVATKHFDCTCDRC 366


>gi|302896962|ref|XP_003047360.1| hypothetical protein NECHADRAFT_123247 [Nectria haematococca mpVI
           77-13-4]
 gi|256728290|gb|EEU41647.1| hypothetical protein NECHADRAFT_123247 [Nectria haematococca mpVI
           77-13-4]
          Length = 354

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 131/330 (39%), Gaps = 73/330 (22%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---FCHN--CFRKTMHSSSSICPS 70
           +GR L ++ +++ G ++L D+P   Y+ +P +NS      C N  C R+       +   
Sbjct: 41  KGRLLRASAAVREGTVLLIDTP---YAIVPSVNSDKEDLICSNLSCSRRVPRDGHGV--Q 95

Query: 71  CS------VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--- 121
           C       V +C+  C   AA      + C  L         +   L  DR+ Q  F   
Sbjct: 96  CHRNCFQDVVWCNTACR--AADHVRHGFECLWLK--------EKGELIRDREGQYNFVTI 145

Query: 122 -----LIAAYNLAVVN---------PSQFQILLAFQGT--------------VTDNDTSA 153
                L+A +N+ + +         P   +    ++                +      A
Sbjct: 146 WHVVRLLATWNVELKSGASISRHRYPRDDEFSRGWKAVEMCCSYLDSWPESQINHWKRLA 205

Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG---QDGRRSVR--AYGI--Y 206
             YLS     P   + +   +L+ K+  N FGL  P + G      R + R  +YG+  Y
Sbjct: 206 EEYLSDASVLPCVLSPDEMLSLICKEETNTFGLY-PKVTGPLCMIDRPTARGDSYGLSLY 264

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTR 264
            +A+ FNH CLPN      V    +    ++     D+ QG E  ++YF   V+ D ++R
Sbjct: 265 PRAAMFNHSCLPN------VTHKPDAQGRMVYTAARDIAQGEECMITYFDLTVHEDVASR 318

Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
           QK +   + F C C+RC  E    + +N D
Sbjct: 319 QKHVQEQFQFKCTCERCLSEEAEENVENMD 348


>gi|50550485|ref|XP_502715.1| YALI0D11792p [Yarrowia lipolytica]
 gi|74634629|sp|Q6C9E7.1|SET5_YARLI RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|49648583|emb|CAG80903.1| YALI0D11792p [Yarrowia lipolytica CLIB122]
          Length = 438

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 112/305 (36%), Gaps = 48/305 (15%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSFCHNCFRKTMH 62
           E VT+   + RG+GL + + +K  + +  +   LL   L  +        C  C R    
Sbjct: 107 EKVTLNIAKNRGKGLYAKKDIKKDEELWNEQAFLLVPPLEHVGIMRKGMGCAFCSRPFQS 166

Query: 63  SSSSICPSCSVAFCSPKCS------TAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
            +   CPSC+  +C  +C        A    S    V +   R          +  ++  
Sbjct: 167 HNGVECPSCAAKWCDNQCKKKDVTHVAMWHESRHGKVTRVEWRAFE-------DFCVENG 219

Query: 117 LQARFLIAAYNLAVV-NPSQFQI---LLAFQGTVTDNDTSAAHYLSSLCPPPASTTI--- 169
             A + +    L V+ +P + ++   ++AF     D    A    +SL     S  +   
Sbjct: 220 WMALYALGLLWLRVIRDPKKDEVQKQMMAFARVGQDERHKAVEQSNSLFASEQSEVLWKK 279

Query: 170 --ELTAALL-------AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
             E+   LL        KD L   G+    I   DG        +YL  S  NH C PN 
Sbjct: 280 GYEMLDKLLLDTEFSYEKDFLPGLGMFN--INNVDG-------NMYLTQSHLNHSCEPN- 329

Query: 221 CRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
                VD      T  I VR   D+  G E+  +Y    +    R+  L  ++ F C+C 
Sbjct: 330 -----VDVKNVGRTQGISVRAKRDIKTGEELFTTYVNPEHQLDDRRYNLRVNWGFNCNCT 384

Query: 280 RCKVE 284
           RCK E
Sbjct: 385 RCKRE 389


>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVAVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIS 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
 gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
 gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
          Length = 1187

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 112/302 (37%), Gaps = 52/302 (17%)

Query: 17   GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
            GR +++++ +  G+++LR +P    +AL      + C +CFR   +    +C  C     
Sbjct: 728  GRKIIASRDIAKGELLLRVAP--YGAALVDDTLLTHCTSCFRNISYYKHHLCQKCKQCIL 785

Query: 77   SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN---- 132
              +C++     +     C  L   L  N P A     D +   R ++ +  +A++N    
Sbjct: 786  CEECNSDVDLVNEHNEECDILV-FLKQNVPGADTR--DFRFMVRVMLKS--IAILNGKLS 840

Query: 133  -------------PSQFQILLAFQGTVTDND-------------TSAAHYLSSLCPPPAS 166
                         P  F          TD               T +   +  L   P S
Sbjct: 841  KEQSPKCWSKNGVPFIFDSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKS 900

Query: 167  T---TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
                T +    L  K   NA   ++P    +  R      GIY  A++ NH C PN    
Sbjct: 901  LEPLTNQQILDLYPKMLFNAHEYIDPLYHSEVAR------GIYPTAAYLNHSCEPNTVWH 954

Query: 224  DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
            +      +NN  I  R I D+  G EI  +Y  +    STRQ  LL  Y F C C RC+ 
Sbjct: 955  N------DNNGMIAYRSIRDIKAGEEITTTYIDITKYKSTRQLNLLSQYAFLCQCARCQD 1008

Query: 284  EA 285
             A
Sbjct: 1009 RA 1010


>gi|354543298|emb|CCE40016.1| hypothetical protein CPAR2_100550 [Candida parapsilosis]
          Length = 477

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 141/360 (39%), Gaps = 50/360 (13%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRKTMHSS 64
           V V     RG+ L + + +K G+++  ++P+     L     +   S C NC +    S 
Sbjct: 110 VQVVMTSKRGKCLYAKKPIKEGELIWSETPLFFVPPLANVHLVEHGSACANCGKLLQQSE 169

Query: 65  SSI------CPSCSVAFCSPKC----STAAASSSHSPYVCQALT-RLLNLNS-PDAANLP 112
                    C  C+  +CS KC    S    ++ H+ +   +   ++++ N+  +     
Sbjct: 170 MKTVLKGLDCDVCAEVWCSKKCKQIDSQLHGATKHNVFNPNSKKPKIIDSNAFLELQEYC 229

Query: 113 LDRQLQARFLIA------AYNLAVVNPSQFQIL-----------LAFQGTVTDNDTSAAH 155
           L     A + IA        + + V   QF  +           L   G   DN    A 
Sbjct: 230 LKESWNALYAIALIYANIVLDKSGVKQKQFNAMARVSQDVRYKALDSGGGTFDNMNGGAL 289

Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
           ++            E    +  K+ ++    +  Y+ G     ++ +  +YL  S  NH+
Sbjct: 290 FVQEQQETLWKEGYEKFLRVFPKNTISYKEFL--YMMGTYNINNLDS-SVYLTQSHLNHN 346

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
           C PN      VD A++  + + V    D+ +G E+  +Y   ++    RQ+ L  ++ F 
Sbjct: 347 CDPNTT----VDTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTLHQRQRELRVNWGFI 402

Query: 276 CDCDRCKVEANW-------SDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAY 328
           C CD+CK +AN        S +   D  N   M ++  EQ+    +D E+  D   PH +
Sbjct: 403 CACDKCKSDANATGRRKSSSASKKQDAKNIRNMLKETKEQV---GEDGEIELDIP-PHEF 458


>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 454

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 115/277 (41%), Gaps = 38/277 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
           +GRGL + + L  G+++  +     ++A+ F + ++  CH+CFR    ++   C  C  A
Sbjct: 17  KGRGLRAAKELNTGEVIFSEPS---FAAVVFDSLATQVCHSCFRH--QANLHRCAQCKFA 71

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY-NLAVVN 132
            +C   C TA  +       C A+ +L    S  + N+    +L AR L   + +  + +
Sbjct: 72  HYCDRTCQTACWNEHKQE--CGAIKKL---GSAPSENI----RLAARVLWRMHKDTGIAS 122

Query: 133 PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
            SQ   +   Q  V D               +L          +++  + +    + N F
Sbjct: 123 DSQLISVDQLQEHVADLAAEDFKQLRADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGF 182

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            L +     Q G ++V   G++      NHDC PN            N+  I +R +  +
Sbjct: 183 TLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELRALGKI 229

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           P+G E+ +SY       + RQK+L   + F C C+ C
Sbjct: 230 PEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHC 266


>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
          Length = 505

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/295 (24%), Positives = 111/295 (37%), Gaps = 51/295 (17%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
           +V+  +  + I   G GL + +++  G  + R +P++               NC R   H
Sbjct: 34  SVALEILPSRIHNNGTGLFAMEAIPEGSEIFRSTPLV---------------NCVRSAKH 78

Query: 63  SSS--------SICPSCSV-AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
           +          S+C  C V  +CS  C   A    H    C+AL R     SP A     
Sbjct: 79  AKGEVMVLPKISLCTQCGVCGYCSKDCQAKALEKYHKSE-CEALKR-----SPSA----- 127

Query: 114 DRQLQARFL---IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
              L  R L   +A +     +  ++Q L    G    N++    +        A T+I+
Sbjct: 128 --DLWTRMLYRVLAMHKQGTYSEQEWQALHHLCGNQRQNNSVFDSHKGIDISKGAKTSIQ 185

Query: 171 LTAALLAKDRLNAFGLMEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
                 A   L    L     I  Q+G   +    + L  SF NH C PNA  F      
Sbjct: 186 SKLGNEAIQDLFCKVLTNVMCINHQEGSPGIT---LDLVGSFVNHSCDPNAFVFF----- 237

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
               + + +R +  +  G EI L+Y  +      R+++L   Y F C C+RCK E
Sbjct: 238 --EGSQLRMRSLKPINAGDEITLTYTELREGVLMRKRKLHRGYSFFCRCNRCKKE 290


>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
 gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
          Length = 506

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 65/268 (24%), Positives = 107/268 (39%), Gaps = 24/268 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+++T+ LK G+I+ +DSP+L+  A    ++ + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVMATRQLKRGEIIFQDSPLLIGLAAQEEDTLNACSICL-KMLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS  A    H       + +    N PD AN  + R L    +  A NL      Q
Sbjct: 91  --PICSLCAKKKQHKSDC--DMFKSWGPNEPDVANSVIIRLL---CVARAINLT---KDQ 140

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
             ++   Q  + +N  +     +      P     +E+    +A  R N F   +   + 
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNFPTDKKLVEIMNRTVAVLRTNGF---DKTTDR 197

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
               +      +Y   +  NHDC+PN+          E   ++IVR   D+ +G EI  +
Sbjct: 198 TTDNQEFSYRALYPLFAVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDIAEGEEITTT 252

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  +      R   L     F C C RC
Sbjct: 253 YTKLFTGNIARHLYLKMKKGFTCKCPRC 280


>gi|342182557|emb|CCC92036.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 387

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 63/150 (42%), Gaps = 27/150 (18%)

Query: 163 PP----ASTTIELTAALLAKDRLNAF---------GL----MEPYIEGQDGRRS-----V 200
           PP    AS  +E+   LL ++ LN F         GL    ++P + G     S     +
Sbjct: 240 PPVSSDASWALEIVDGLLREETLNTFLGILALNSQGLNGFVIDPPVNGDTSSHSSIEWVL 299

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           +  GIY   S FNH C PN   F           DI +R + DV  G E+ +SY PV   
Sbjct: 300 KGGGIYSLQSNFNHSCQPNVAVFTEC-----GTHDITLRTLRDVQAGEELTISYIPVENT 354

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
               + ++L  Y F C C  C+ E N  D 
Sbjct: 355 NRAERHKMLEGYFFTCQCALCEYEKNLVDT 384


>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
          Length = 483

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 2   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 57  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 114

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 115 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDMFLQYWPPQSQQFSMQYISHIFGVIN 173

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 227

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 228 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 275


>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
 gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 114/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKR--QER 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 280


>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
 gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
          Length = 495

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/339 (23%), Positives = 131/339 (38%), Gaps = 72/339 (21%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------- 67
           G G GL +T ++K GQ V  D P    + L     +  C NCF     +++ +       
Sbjct: 15  GAGTGLFATSTVKTGQAVF-DLPATFSTVLSTERLADACSNCFANAPFAANVMADVGIKL 73

Query: 68  --CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNS---PD 107
             C  C V  +C   C T   ++ H               P   +A+ R+L L S   PD
Sbjct: 74  RFCTGCRVVKYCGKGCQTENWAAIHKHECATYKRLYPKVLPVNSRAVLRILKLRSSKQPD 133

Query: 108 AA-NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
              +L + R L++ F      +A  N  Q++ ++         +   +H         + 
Sbjct: 134 VQQDLGMFRSLRSHF----KEIAETNKEQYERIM----LCAKAEKEYSH---------SD 176

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
             IE  A  LAK  +N F    P+  G      ++ +     A F NH C PNA      
Sbjct: 177 LDIETIAEYLAKIEVNGFTFTTPF--GDPLGLCIQPF-----ACFVNHSCDPNA------ 223

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
                +   I V+ +  +    ++ +SY      +  RQK L   Y F C C +C ++ +
Sbjct: 224 -VVGFDEGRITVKALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKC-LQGD 281

Query: 287 WSDND---------NDDENNEEVMDEDQDEQMVASDDDA 316
            +  D         ND E  +E   E Q ++++AS  +A
Sbjct: 282 KAREDQFIPASPPSNDIETLKEA--EKQAQELLASAKNA 318


>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
 gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
          Length = 611

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/289 (23%), Positives = 115/289 (39%), Gaps = 30/289 (10%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           + A S +  VA+    GRG+ + + +  G+++ R+  +L+        +   C  C+ + 
Sbjct: 32  LRAESPSYRVADSPISGRGIFALRDIAKGELIFREHTLLVGPTAHRGRNLRTCTQCYGQP 91

Query: 61  MHSS--SSICPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
              S  S++CP+ C +    P C   A+S+ H+P  CQ   +     +  A   P     
Sbjct: 92  PGDSDASALCPAGCGL----PVCGNCASSTRHTPE-CQLFRKWQPKEATKAGIEP----- 141

Query: 118 QARFLIAAYNLAVVNPSQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTA 173
           QA  +++      +N  Q ++L A Q             AA        P     +E   
Sbjct: 142 QALRILSVVRCFFLNEPQRKLLYAMQANADRYYIREVERAAECFEQF--PRDPELLEYFY 199

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
             +     NAF     +I+GQ+    V A  ++  A   NH C PNA          E+ 
Sbjct: 200 RTVCAFNTNAFE-SRFHIDGQE----VVARALFPLAGLLNHQCTPNAAHH------FEDG 248

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             I+V     +P G EI +SY  + +    R+  L     F C C RC+
Sbjct: 249 ETIVVTATERIPMGAEITMSYAKLLWSTLARKMFLGMTKHFMCRCPRCQ 297



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 14/91 (15%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR    +P+G EI  +Y  + +   T
Sbjct: 391 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARAIPKGGEITTTYTKILWSNLT 444

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
           R   L     F C+CDRC        NDN +
Sbjct: 445 RGIFLKMTKYFVCNCDRC--------NDNSE 467


>gi|255081676|ref|XP_002508060.1| predicted protein [Micromonas sp. RCC299]
 gi|226523336|gb|ACO69318.1| predicted protein [Micromonas sp. RCC299]
          Length = 541

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 113/309 (36%), Gaps = 44/309 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           V V + + RGR LV+ +  K G +VL ++P      LP +     C  CF K       +
Sbjct: 24  VDVRDDKKRGRCLVAAKDFKRGDLVLANAPY----GLP-VPMGDRCAGCFMKEDDDRVKM 78

Query: 68  -CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRL-------------------LNLN-- 104
            C +C  V FCS +C   A    H+   C A+ R                    L+ N  
Sbjct: 79  RCKACDMVWFCSERCMDGAMDGGHA-VECAAMRRYFADDPDDAIRENGFVQFSALHRNCG 137

Query: 105 ---SPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQIL--LAFQGTVTDNDTSAAHYLSS 159
               P+  + PLD  L      AA     +            + G      T+ A     
Sbjct: 138 KAAHPEDTDEPLDDHLPYLVPTAADKADTIARWALHGFPWREYCGDFAAEATAGARRCGL 197

Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
           +   P+S T E     + + R+N F + +     + G   + A   Y   +  NH C P 
Sbjct: 198 I---PSSATDETIKKEVVRSRMNDFFIQQ----WRGGDIDLMAGATYPLGALLNHSCAPT 250

Query: 220 AC---RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
                R D  DA          R + DV +G E+  +Y   +   + R+  LL  Y F C
Sbjct: 251 CVCSYRLDISDAVQYPTWIQEFRCVADVGKGTELTHAYVDASDWSNHRRAELLDRYGFVC 310

Query: 277 DCDRCKVEA 285
           DC RC   A
Sbjct: 311 DCPRCPTNA 319


>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Equus caballus]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFNSEGKGRGLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQRELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   G++      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Canis lupus familiaris]
          Length = 490

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHHCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVAVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQGQQFSMQYISHIFGVIS 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNLSEDRKKQLKKQYYFDCTCEHCQ 280


>gi|412986494|emb|CCO14920.1| SET and MYND domain-containing protein 5 [Bathycoccus prasinos]
          Length = 336

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 132/337 (39%), Gaps = 79/337 (23%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRK------T 60
           ++  E +GR + + +  K+G IV  ++P++  +        ++  +CH CF K      +
Sbjct: 16  ISWCERKGRRVTTRKETKSGTIVSDEAPLVWLAGCEKDDDDDADEYCHRCFEKITKPNAS 75

Query: 61  MHSSSSI---CPSCSV-AFCSPKCSTAAASSSHSPY-VCQALTRLLNLNSPDAANLPLDR 115
           + SSSS    C  C+V  +CS  C   AAS  H     C A     +    D    P D 
Sbjct: 76  LKSSSSFVVECEYCNVEKYCSVDCRRLAASEYHRKSGECFAFETGKDAEKSDPTRYPFD- 134

Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN------------------DTSAAHYL 157
            +  RF I AY   +   +  +  L  +  V++N                  D +  ++L
Sbjct: 135 DVPNRFAIRAY-AKLGGWTGLEKTLEEKKKVSENLRKHLNECVAHVPNDYHHDDNKKNFL 193

Query: 158 SSLCPPPASTTIELTAALLAKD----------------RLNAFGLMEPYIEGQDGRRSVR 201
           S+LC          T AL+  D                R NA  L +   +  D   +  
Sbjct: 194 STLCSN--------TRALMGADEEIVKEEDFVRAAWAMRANAHTLYDYNYDDDDNNNNNN 245

Query: 202 ----------AYGIYLKASF-FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
                       G++ + S  +NH C PNA  F+ V         + VR+I DV  G EI
Sbjct: 246 NKKKRRRKKIGVGLFPEVSRSYNHSCEPNAF-FENV------RLRLRVRLIKDVASGEEI 298

Query: 251 CLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVE 284
            +SY   +     RQ R   L   Y+F C+C +C+ E
Sbjct: 299 TISYLNDDEKGEKRQHRRELLKKQYNFDCECAKCERE 335


>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 363

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/287 (21%), Positives = 112/287 (39%), Gaps = 28/287 (9%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK----------TMHSSSS 66
           GRGL +  + K G  + ++ P + Y +L  ++ S+ C++C ++                 
Sbjct: 76  GRGLFAITAFKKGSQIFQEEPFVSYPSLD-VDKSTICNHCLKQLNNNGNNNNDNKSKQQQ 134

Query: 67  ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS---PDAANLPL--DRQLQARF 121
            C  C   +CS KC  ++A   H    C   + + N+      +    PL   R L    
Sbjct: 135 QCTKCGERYCSEKCR-SSAEVQHHLASCPTSSNIDNITRYSLVEKRRFPLLAARILSRIL 193

Query: 122 LIAAYNLAVVNPSQFQIL-LAFQGTVTDNDTSAAHYLSSLCPPPASTT---IELTAALLA 177
           L   ++  + N    Q+L  A +    +       +  SL    ++           ++ 
Sbjct: 194 LENHFDKTMHNWENLQVLSFAKKDAPLEWKDDYDVFKKSLLTRESNQKRFDFNWFVRIMQ 253

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
              LN  G+ +             + G++  +SFFNH C PN      V  A  ++   +
Sbjct: 254 ILYLNTLGI-DVGSTKPSISSPTSSIGLFFLSSFFNHSCDPN------VYMAFPHDKTAV 306

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +  + D+ +G E+ +SY     D   RQ  L  +Y F CDC +C  E
Sbjct: 307 ITALRDIKKGEELFISYGDSEKDMFDRQTHLFDNYGFNCDCPKCTQE 353


>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 438

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 56/110 (50%), Gaps = 13/110 (11%)

Query: 175 LLAKDRLNAFGLMEPYIEGQ---DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
           +L ++R NAF     YI+G+    G    R  G+Y++ SFFNH C PN   +       E
Sbjct: 304 VLLRNRCNAF-----YIQGRPKNGGNGESRGCGVYVRNSFFNHSCNPNVNYW-----VVE 353

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           N  ++   ++  V +G E+C+SY         R+++L   Y F C C++C
Sbjct: 354 NTLEVECSLMRAVRKGEELCISYIDTAASLRDRREKLSEGYLFHCRCEKC 403


>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
           NIH/UT8656]
          Length = 550

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/306 (24%), Positives = 113/306 (36%), Gaps = 57/306 (18%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKA-GQIVLRDSPILLYSALPFINSSSFCHNCFRK----- 59
           E V       RG GL +   + A  Q++    P+L+  AL      + C+ CFR+     
Sbjct: 33  ERVYTKRTADRGNGLFAVDDIAAESQLLFVARPLLV--ALENAKLPTNCYYCFREPGGQG 90

Query: 60  ------TMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
                     S   C  C VA FC  KC T A S  H    C+  +RL     P      
Sbjct: 91  LWQMTERKEDSLKTCSRCKVAKFCDQKCQTEAWSQYHR-LECKLFSRLYPRVLPSTVRAV 149

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT---- 168
           +        L+  +   ++ P +++ LLA Q    D   +       L            
Sbjct: 150 IR-------LLKQHKAGILPPGEWEQLLALQSHQQDLANAGGQRWQDLLIMSQGIKGYSG 202

Query: 169 --------IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL--KASFFNHDCLP 218
                   + L+ A++    +N+F L     +         + G+ L  K +  NH C P
Sbjct: 203 TDEDDDLILRLSCAVI----VNSFTLSNATFD---------SIGVILHPKPALLNHSCDP 249

Query: 219 NA-CRFDYVDAAAENNT--DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
           NA  RFD     +E +T   I V  + D+ +  EI +SY         RQ++L   Y F 
Sbjct: 250 NAYVRFD----VSETDTLGSISVHALRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFT 305

Query: 276 CDCDRC 281
           C C  C
Sbjct: 306 CQCHLC 311


>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 117/283 (41%), Gaps = 37/283 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
           +GRGL + + L  G+++  +     ++A+ F + ++  CH+CFR    ++   C  C  A
Sbjct: 17  KGRGLRAAKELNTGEVIFSEPS---FAAVVFDSLATQVCHSCFRH--QANLHRCAQCKFA 71

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY-NLAVVN 132
            +C   C TA  +       C A+ +L    S  + N+    +L AR L   + +  + +
Sbjct: 72  HYCDRTCQTACWNEHKQE--CGAIKKL---GSAPSENI----RLAARVLWRMHKDTGIAS 122

Query: 133 PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
            SQ   +   Q  V D               +L          +++  + +    + N F
Sbjct: 123 DSQLISVDQLQEHVADLAAEDFKQLRADVHKFLQYWSYGTRQHSVDYISHIFGIIKCNGF 182

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDIIV 238
            L +     Q G ++V   G++      NHDC PN            V +A  +   I +
Sbjct: 183 TLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVSSALHSQRRIEL 236

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R +  +P+G E+ +SY       + RQK+L   + F C C+ C
Sbjct: 237 RALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHC 279


>gi|195435832|ref|XP_002065883.1| GK20590 [Drosophila willistoni]
 gi|194161968|gb|EDW76869.1| GK20590 [Drosophila willistoni]
          Length = 661

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/328 (22%), Positives = 135/328 (41%), Gaps = 72/328 (21%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  ++ V + E + +GR +V+   LK G +VL ++P+       F  ++  CH+CF++ 
Sbjct: 250 LSGAAQGVKLVETKEKGRFVVANGGLKTGDVVLCENPVAACLLPNFFGTN--CHHCFKRL 307

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNS 105
            +  +  C  CS VAFCS +C   A+ S H                 +C    R+    S
Sbjct: 308 RNPVA--CLHCSGVAFCSAQCMGEASESYHRFECEFMDLLIGSGMSILCFIALRVFTQAS 365

Query: 106 P-----DAANLPLD--------RQ----LQARFLIAAYNLAVVNPSQFQILLAFQGTVTD 148
                 + ANL  +        RQ    LQ R L+A + L ++  +Q+     F    T+
Sbjct: 366 SLEDGLNTANLLFEHLCSHEDVRQADDYLQ-RSLMAGFLLRILQKAQY-----FGRRKTE 419

Query: 149 NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI------EGQDGRRSVR- 201
                          P +  +++  +LL   ++  +   E Y          +G +++  
Sbjct: 420 GVN------------PTAVELQVATSLLGLLQVLQYNAHEIYQTLVTDDHRFEGSKTIHV 467

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NY 259
             G+Y   S+FNH+C P+   + +V         +++R          + ++Y P+    
Sbjct: 468 GAGLYGTGSYFNHECWPSVASY-FV------GKKLVLRTTKPHRPNEVVAVNYGPIFTTM 520

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANW 287
               RQ+ L   Y F+C+C  C  + NW
Sbjct: 521 HLKERQRSLRGRYAFSCNCMAC--QENW 546


>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
          Length = 389

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 22/193 (11%)

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
           +Q+    L   Y      P  F+ LLA  GT      +++          +     ++A 
Sbjct: 211 QQMMKEALNTEYTAHWFTPDGFRSLLALVGTNGQGIGTSSF---------SRWVKNVSAL 261

Query: 175 LLAKD-RLNAFGLMEPYIEGQDGR----RSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
            L KD R+    L+E   +  D       +    G+Y+  S  NH C+PNA     V+  
Sbjct: 262 ELPKDERIYIDKLIEKIYDAMDEAVGPFLNNEGSGLYVLQSSVNHSCVPNA----VVEFP 317

Query: 230 AENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             NNT ++++ I D+  G EIC SY     +     +RQ+ L   Y F C CD+C+ +AN
Sbjct: 318 YSNNT-LVLKAIRDIKVGEEICTSYLDECQLERSRHSRQQALSSLYLFVCHCDKCQAQAN 376

Query: 287 WSDNDNDDENNEE 299
             D  +D+E++E+
Sbjct: 377 DPDVTSDEESDED 389



 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 75/183 (40%), Gaps = 46/183 (25%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFR---------KTMH 62
           +G+GL S +  + G ++L + P++      F+ +S F    C +C R         + + 
Sbjct: 21  KGKGLFSNRVFREGDVILEERPVV---CCQFLWNSEFGYVACDHCLRPLETAEENVRRLT 77

Query: 63  SSSSI-----------------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS 105
            +SS                  CP C V +CS +C T A    HS    QA  R+ + + 
Sbjct: 78  GNSSFVVPHKECCEVNKSLITSCPLCGVKYCSVECQTEAFQRYHSTLCLQA--RVKDESH 135

Query: 106 PDAANLPLDRQLQ-----ARFLIAAYNLAVVNPS--QFQILLAF----QGTVTDNDTSAA 154
           P        +Q+      A  ++ A  +AVVN +  + + L  F      TV+++     
Sbjct: 136 PLNHLCETWKQMHYPPESASIMLLARMVAVVNQANDKEEALAMFSQFCHRTVSNDTHEIV 195

Query: 155 HYL 157
           HYL
Sbjct: 196 HYL 198


>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
          Length = 548

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 118/291 (40%), Gaps = 31/291 (10%)

Query: 3   AVSETVTVAEIEGR---GRGLVSTQSLKAGQIVL-RDSPILLYSALPFINSSSFCHNCFR 58
           AV++ + V   +G    GR + +T+  K G ++   + P ++    P   +  +C++C +
Sbjct: 2   AVNDGIEVRGTKGGDKGGRSIHATRRFKPGDVIACFEEPAVVLP--PGHRALEYCNHCLK 59

Query: 59  KTMHSSSSI--CPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPLD 114
           K       +  C  C +VA+C P C  A  S  H    C+A+ RL  +   D  A +P  
Sbjct: 60  KQQPGGPKLRACTGCKTVAYCGPACQRANWSLVHK-LECKAIQRLHEIKPADEPAWVPTP 118

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS-TTIELTA 173
            +  A+ ++  + LA     +  +    + + TD    +   +  L     +   +E   
Sbjct: 119 IRAAAQVMLRPHVLAQFGELEGHVEQWRKKSETDLQLQSHGVVRCLGLDMGTFEALEAAF 178

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAE 231
            +L K + NAF   E Y E           G++L  +    NH C+PNA     V     
Sbjct: 179 QVLCKLQTNAFSRSEEYYE---------TGGVFLDTTLAMINHSCVPNA----LVQFGGR 225

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             T      IH    G EI +SY       S RQ  L   Y F C C +C+
Sbjct: 226 TATLRSASFIHP---GSEIEISYIDQTQPKSRRQHELSL-YHFECHCSKCQ 272


>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
          Length = 485

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/288 (22%), Positives = 114/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 2   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKRQ--EK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ +   + N N   AA +    + +  
Sbjct: 57  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 114

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 115 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 174 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 227

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 228 RIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 275


>gi|392590224|gb|EIW79553.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 410

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 65/148 (43%), Gaps = 24/148 (16%)

Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLME-------------PYIEGQDGR----R 198
           +L +  PPP    + +   LLA++  NAFG+ E                E +D R    R
Sbjct: 270 FLGAALPPPLRAHVGVVRQLLARETGNAFGIWEGDQLPEGRISIGDSEKEAEDTRDMRER 329

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
            +  +G+++ AS FNH C PN  +         +   +    + DV +G E+ +SY   +
Sbjct: 330 EMFGWGVWVSASLFNHSCAPNVRK-------KRSGRSLEFFTLRDVKEGEELRISYASTD 382

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
                R++ L   + F C C RC  E++
Sbjct: 383 KPVGERREALRTSWFFDCMCSRCVRESS 410


>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
 gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
           Full=CD8b-opposite; AltName: Full=Zinc finger protein
           BOP; Short=m-BOP
 gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
           Methyltransferase Smyd1
 gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
 gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
 gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
          Length = 490

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/288 (22%), Positives = 114/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLINFVCHTCFKRQ--EK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ +   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CAAIKKYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEG-CLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQ 280


>gi|170056265|ref|XP_001863952.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876021|gb|EDS39404.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 526

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 115/281 (40%), Gaps = 37/281 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVAF 75
           GR L+++++LKAG+++++  P+   +  P+  S   C  C R     + ++ CP+C+   
Sbjct: 56  GRYLIASRNLKAGEVIIQVEPL---AVGPWAESDPVCLGCHRTFEVGAKTVRCPTCNWRI 112

Query: 76  CSPKCSTAAASSSHSPYVC-----QALTRLLNL-NSPDAANLPLDRQLQARFLIAAYNLA 129
           CSP C+    ++ HS   C       + +LL   +S     L  +  L  R L+    L 
Sbjct: 113 CSPTCT--GLTTHHSRLECIPLKEHGVAKLLETCSSATQIKLAYEAILALRCLL----LK 166

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
              P +++ LL       D        ++SL        ++    +  + +   F   E 
Sbjct: 167 RTAPDRYEKLLEM-----DAMNERRQQITSLWKRNQEAIVQ---RIRDEWKFEEFSEAEV 218

Query: 190 Y-----IEG---QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           +     IE    Q G   V A  +Y +A +  HDC PN    D        + ++ VR  
Sbjct: 219 HTVCGIIEVNAFQIGPTEVHARALYPEAFYIMHDCTPNTTHTD-----QPGSHELTVRPT 273

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            D+  G  + LSY         R++ L     F C C RC+
Sbjct: 274 RDLKAGEPLTLSYSYTLQGTLKRRQHLCDSKFFWCQCPRCE 314


>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 425

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/300 (26%), Positives = 114/300 (38%), Gaps = 66/300 (22%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSC-SVA 74
           +GR  V++ +L+AG  VLR      + A+    SSS C  CF   +  S   C  C  V 
Sbjct: 12  KGRCNVASVALRAGSCVLRTPA---FCAV----SSSSCGWCFAPQL--SLQRCTGCRQVQ 62

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-----------------LDRQL 117
           +CS +C     S       CQA   +     P  A LP                 +D++ 
Sbjct: 63  YCSRRCQKLDWSQHRRE--CQAWRSI-----PVDATLPTVLLVCRLAAKLFLSSKVDQED 115

Query: 118 QARFLIAAYNLA-VVNPSQFQ----------ILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
           + R L   ++L     P Q Q          +L  ++    +   S A    SL P    
Sbjct: 116 KNRILNLRHHLDDHTAPKQQQFCETTPLVHLLLSEYKVDKQEQTPSFAELQESLKPE--- 172

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
                   L  +   N F +M        G       G++L+ S FNHDC PN C   + 
Sbjct: 173 -----ILKLFGQVSCNGFSIMNGVTNEPVG------IGLFLQGSMFNHDCDPN-CVVSF- 219

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
                   ++ V +I DV +G+E+ +SY  V      RQK L   Y F C C RC  E  
Sbjct: 220 -----QGQEMNVHVIKDVKEGQELTISYVEVLQSTKKRQKILKDSYFFECQCSRCTTETT 274


>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
           [Oryzias latipes]
          Length = 476

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 114/283 (40%), Gaps = 28/283 (9%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
            E V + +   +GRGL +++    G++V  +     YSA+ F +S +F  CH CFR+   
Sbjct: 6   KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
           +    C  C  A +C+  C TA          C A+ +     S     AA L L R  +
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              L +   L  V+  Q  +    Q  +    T                ++E  + +   
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
            + N F L +     Q G ++V   G++   +  NHDC PN         A  N+  I V
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNC-------TAILNHGKIEV 223

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R +  + +G E+ +SY       + RQK+L   + F C C  C
Sbjct: 224 RTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 266


>gi|395331801|gb|EJF64181.1| hypothetical protein DICSQDRAFT_178674 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 488

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 58/119 (48%), Gaps = 13/119 (10%)

Query: 168 TIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
           T+      L  D  N+FG+ E P ++  +       + +Y  ASFFNH C PN      V
Sbjct: 374 TVAGVRTALGIDPGNSFGVWEVPLMDESE----CLGFAVYPIASFFNHHCSPN------V 423

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
               E  T   V     V +G E+C+SY  V   D++TRQ+ LL  + F+C C RCK E
Sbjct: 424 RKEREGRTLRFV-TTRTVEEGEELCISYGHVEGMDWATRQQELLEGWYFSCRCSRCKSE 481


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 72/335 (21%), Positives = 134/335 (40%), Gaps = 34/335 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
           ETV + +  G+GRGL +T+   AG ++  ++ I  +++ +L    +   CH+CFR+    
Sbjct: 2   ETVAIFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSL----AERICHSCFRR--QE 55

Query: 64  SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQA 119
               C  C  A +C   C  A  A         +A  ++ N N    A +   LD++   
Sbjct: 56  KLQRCGQCKFAHYCDRTCQRAGWAEHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGST 115

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
              ++   L  ++  +  I    +  + +      ++L          TI+  + +    
Sbjct: 116 ---VSDMQLTTLDELEDHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVI 172

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENN 233
             N F      +  Q G ++V   G++      NHDC PN            V+    + 
Sbjct: 173 NCNGFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQ 226

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANW 287
             I +R +  + +G E+ ++Y         RQ+ L   Y F C C+ CK      ++   
Sbjct: 227 RRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKNRIKDDIKIGG 286

Query: 288 SDNDNDDENNEEVMD-EDQDEQMVASDDDAEVHGD 321
            + D    + E+V +  D   QM+   + A ++GD
Sbjct: 287 REEDGVKPSEEQVKEATDYCFQMLEKMEKARLNGD 321


>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
 gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
          Length = 485

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/304 (21%), Positives = 116/304 (38%), Gaps = 50/304 (16%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI----- 67
           + RGL +T +LKAG ++  D P+++      L  + S   C  C     HS  +I     
Sbjct: 118 KQRGLFATNALKAGDMIFEDLPLVVVPPMDKLVLMQSGKACPLCGSSVSHSQHTIIMNGL 177

Query: 68  -CPSCSVAFCSPKCST---AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
            C  C   +CS  C        S  H+   C    R +NLN         +   +    +
Sbjct: 178 DCGECGAVWCSKNCKKNDLTHTSLRHTRSRC----RQINLNGWKK----YESFCKQYVFV 229

Query: 124 AAYNLAVVNPS-------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
           AAY++ ++  S             +FQ+L      +  + + + +   +L     + T +
Sbjct: 230 AAYSIGIIYASMILDKNGKNGVHPKFQLLADVSQRIRKDSSDSTNLGGTLDASSGAFTSD 289

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG----------IYLKASFFNHDCLPNA 220
               +  +  L+ F    P ++  D    +   G          I+   SF NHDC PN 
Sbjct: 290 DPEPMWTQ-ALDLFKEAFPELDDMDMEAYLNCIGKFNINQVSGQIFPVYSFINHDCEPNV 348

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
            R++  D        + +     + +G E+  +Y    +    R++ L  ++ F C CDR
Sbjct: 349 -RYEIDDKLR-----LKLYARKHIKKGEELLTTYVNPLHGVKLRRRELRVNWGFLCHCDR 402

Query: 281 CKVE 284
           C  E
Sbjct: 403 CNRE 406


>gi|344234777|gb|EGV66645.1| SET domain-containing protein [Candida tenuis ATCC 10573]
          Length = 355

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 135/355 (38%), Gaps = 87/355 (24%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFR-- 58
           +S +S    V+E +  G+G  S +S+  G  +L   P+    +  F      CH CF   
Sbjct: 18  LSQISPFFQVSETKYGGKGCFSYESIPRGTDILVSKPLGFTISKDF--KKEVCHKCFNYD 75

Query: 59  --KTMHSSSSI--CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNS--------- 105
             KTM    S+    + S+ FCS KC +A  S  +     ++L  L  L +         
Sbjct: 76  NGKTMKFRLSLPTFKNQSICFCSEKCLSAFKSMKYYEVYLRSLLNLDKLYTTFCTEEMAE 135

Query: 106 -PDAANLPLDRQLQARF-LIAAYNLAVVN--PSQFQILL-----------AFQGTVTDND 150
            P A        ++ ++  +  ++ +V+   P+++Q L+            +   V  N 
Sbjct: 136 VPVAGVSEFIHTIKGKWEQVEQWDASVLKTKPTKWQKLIPKITETEYLEIKYVLGVLHNQ 195

Query: 151 TSAAHYLSSL--------------------------------CPPPASTTIE-LTAALLA 177
           T A  + S L                                 PP     I  L   L+ 
Sbjct: 196 TDAFVFESRLFDLLQSNEISKVLKFKNLLVSYITIFKFIRLTSPPQFQKFINPLDIRLII 255

Query: 178 KDRL-NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
            + L NAFG+        +G +    +G+Y   SFFNH C PN  +          N ++
Sbjct: 256 GNNLTNAFGIWTL----NEGDKDFLGFGVYPSGSFFNHSCTPNIIK-------RRINNEL 304

Query: 237 IVRMIHDVPQGREICLSYF-----PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
               + D+ +  E+C+ Y      PVN      +++ L ++ F C C RC++EA 
Sbjct: 305 HFLTLADIQKNDELCIDYGNYLEEPVNI-----RRKELSEWFFICTCKRCEMEAQ 354


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 64/289 (22%), Positives = 112/289 (38%), Gaps = 29/289 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
           E V V +  G+GRGL +T+   AG ++  + P   ++++ F   +SS CH+CFR+     
Sbjct: 2   EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
              C  C  A +C   C  A          C A+        P + N+ L  ++  R   
Sbjct: 57  LQRCGQCRFAQYCDKTCQRAGWEEHKLE--CAAIK---TYGKPPSENVRLAARILWRMDK 111

Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              +++   L  +   +  I    +  + D      ++L          T++  + +L  
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
              N F      +  Q G ++V   G++      NHDC PN            +D    +
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFHS 225

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              I +R +  +  G E+ ++Y       + RQ+ L   Y F C C  C
Sbjct: 226 QKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHC 274


>gi|170096156|ref|XP_001879298.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164645666|gb|EDR09913.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 367

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 57/122 (46%), Gaps = 11/122 (9%)

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
           +T E   A+LA+D  N FG+ +   +  D    +  +G+Y+  S+FNHDC PN  +    
Sbjct: 246 STPETVRAILARDHGNVFGIWDTAPD--DEESEMLGWGLYVSGSYFNHDCSPNLKK---- 299

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFACDCDRCKVE 284
                +   I     HDV    E+C+SY  V        R+  L  D+ F C C RC  E
Sbjct: 300 ---RRHKRGINFYTTHDVNADEELCISYVDVCPGASVINRKTALENDWFFVCQCARCARE 356

Query: 285 AN 286
           +N
Sbjct: 357 SN 358


>gi|195160613|ref|XP_002021169.1| GL24960 [Drosophila persimilis]
 gi|194118282|gb|EDW40325.1| GL24960 [Drosophila persimilis]
          Length = 661

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 133/315 (42%), Gaps = 46/315 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  +  V + E + +GR +V+++ LK G +VL ++P+       F  +   CH+CF++ 
Sbjct: 250 LSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPSFFGTH--CHHCFKR- 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H+  + C  CS +AFCS +C   A  S H  + C+ +  L+         + L    QA
Sbjct: 307 LHTPVA-CLHCSGIAFCSAQCMGEACESYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 364

Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
             +   +A  NL   N              Q  ++  F   +           +    P 
Sbjct: 365 ASVDDGLATANLLFENLCSHEDVRQAEDYLQRSLMAGFLMRILQKAQYFGQRKTEGVNP- 423

Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
             T +EL  A     LL   + NA  + +  +  +   +G ++V    G+Y   S+FNH+
Sbjct: 424 --TAVELQVATALLGLLQVLQYNAHEIYQTQVTDEHRFEGSKTVHVGAGLYGTGSYFNHE 481

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRLLYDY 272
           C P+   + +V         +++           + ++Y P+ + Y+    RQ+ L   Y
Sbjct: 482 CWPSVASY-FV------GKKLVLAATKPHRPNEVVAVNYGPI-FTYTNLKERQRSLRGRY 533

Query: 273 DFACDCDRCKVEANW 287
            F+C C  C  + NW
Sbjct: 534 SFSCSCMAC--QENW 546


>gi|125978717|ref|XP_001353391.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
 gi|54642149|gb|EAL30898.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
          Length = 661

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 133/315 (42%), Gaps = 46/315 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  +  V + E + +GR +V+++ LK G +VL ++P+       F  +   CH+CF++ 
Sbjct: 250 LSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPSFFGTH--CHHCFKR- 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H+  + C  CS +AFCS +C   A  S H  + C+ +  L+         + L    QA
Sbjct: 307 LHTPVA-CLHCSGIAFCSAQCMGEACESYHR-FECEFMDLLIGSGMSILCFIALRIFTQA 364

Query: 120 RFL---IAAYNLAVVNPS------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
             +   +A  NL   N              Q  ++  F   +           +    P 
Sbjct: 365 ASVDDGLATANLLFENLCSHEDVRQAEDYLQRSLMAGFLMRILQKAQYFGQRKTEGVNP- 423

Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
             T +EL  A     LL   + NA  + +  +  +   +G ++V    G+Y   S+FNH+
Sbjct: 424 --TAVELQVATALLGLLQVLQYNAHEIYQTQVTDEHRFEGSKTVHVGAGLYGTGSYFNHE 481

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRLLYDY 272
           C P+   + +V         +++           + ++Y P+ + Y+    RQ+ L   Y
Sbjct: 482 CWPSVASY-FV------GKKLVLAATKPHRPNEVVAVNYGPI-FTYTNLKERQRSLRGRY 533

Query: 273 DFACDCDRCKVEANW 287
            F+C C  C  + NW
Sbjct: 534 SFSCSCMAC--QENW 546


>gi|241948289|ref|XP_002416867.1| conserved hypothetical protein [Candida dubliniensis CD36]
 gi|223640205|emb|CAX44454.1| conserved hypothetical protein [Candida dubliniensis CD36]
          Length = 378

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 64/131 (48%), Gaps = 10/131 (7%)

Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           ++   C P   T I+ T   +++ K+  NAFG+        + +  +  +G+Y  ASFFN
Sbjct: 253 FIKLTCTPELQTFIDSTTIRSIIGKNLSNAFGIWSEVTNPSEDKEFL-GFGVYPSASFFN 311

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
           H C PN  +         NNT+++     D+  G E+C++Y     +    +++ L ++ 
Sbjct: 312 HSCSPNIVK-------TRNNTEMVFTTSKDIEVGEELCINYGNYTDEPVELRQQQLKEWF 364

Query: 274 FACDCDRCKVE 284
           F C C +C+ E
Sbjct: 365 FDCACTKCQTE 375


>gi|156366270|ref|XP_001627062.1| predicted protein [Nematostella vectensis]
 gi|156213960|gb|EDO34962.1| predicted protein [Nematostella vectensis]
          Length = 391

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 88/388 (22%), Positives = 145/388 (37%), Gaps = 117/388 (30%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFR---------KTMH 62
           RGR L +++  K G  +  + P++    L++A     S + C +C R         + + 
Sbjct: 17  RGRALFASRDFKEGDTIFEEDPLVCSQFLWNA---AYSYTACDHCMRSLETAQDMARRLS 73

Query: 63  SSSSI--------------------CPSCSVAFCSPKCSTAAASSSH--------SPYVC 94
           S+ ++                    CP C+VA+CS  C   A    H        +P   
Sbjct: 74  SNPTLELPYSAECCAVTKAGEPISYCPQCNVAYCSENCRIKALDQYHRILCLGTSTPDPN 133

Query: 95  QALTRLL----NLN-SPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN 149
             L +L     N++  P+ AN+ L  ++ A  L A    + V    F     F   V + 
Sbjct: 134 HPLVKLQETWKNIHYPPETANIMLIARIMATILQATN--SDVKKGSFS---HFCSNVVNK 188

Query: 150 DTSAAHYLSSLCPPPASTTIE-LTAALLAKDRLNAFGLMEPY--------IEGQ------ 194
           +   AH L  L        I  L +  +  DRL  +   E +          GQ      
Sbjct: 189 EQQIAHKLLGLHFQEQLDMIRILLSEAMYDDRLEQWFTPEGFSSLFALVGTNGQGIGTSS 248

Query: 195 ----------------DGRRSVRAY---------------------GIYLKASFFNHDCL 217
                           D R+++  +                     G+Y   S  NH C 
Sbjct: 249 LSLYVHNIDSYPALSDDERQAIDIFLNQLYEEMERVSGQFLNCEGAGLYALQSSCNHSCA 308

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY---DYDF 274
           PNA          +NN+ ++++ +H +  G EIC+SY        +R  RL Y   +Y F
Sbjct: 309 PNA-----EVTFPKNNSTLVLKALHPIKNGEEICISYLEECQRERSRHSRLKYLRENYIF 363

Query: 275 ACDCDRCKVEANWSD---NDNDDENNEE 299
            C C +C+++AN  D   +D DDE+++E
Sbjct: 364 DCTCTKCELQANEPDESSSDGDDEHDDE 391


>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
           griseus]
          Length = 498

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 69/292 (23%), Positives = 114/292 (39%), Gaps = 33/292 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF---INSSSFCHNCFRKTMH 62
           E V V   EG+GRGL +T+   A  +V  +     YSA+ F   IN    CH CF++   
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVVFAERA---YSAVVFDSLINV--VCHTCFKR--Q 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A +C   C   A  +  +   C A+ R   +  P+  N+ L  ++  R 
Sbjct: 60  EKLHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAGRIMWRV 114

Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALL 176
                 L    +V+    Q  +   G     +       +L    P     +++  + + 
Sbjct: 115 EREGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
                N F L +     Q G ++V   GI+      NHDC PN          + V +  
Sbjct: 175 GVINCNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMF 228

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                I +R +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 229 HTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHCQ 280


>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Taeniopygia guttata]
          Length = 478

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 72/286 (25%), Positives = 119/286 (41%), Gaps = 34/286 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
           E V +   EG+GRGL + +   AG ++  +     Y+A+ F + +   CH CF++   +H
Sbjct: 7   ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 63

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A +C   C  AA  S  +   C A+ R  +  +P   N+ L  ++  + 
Sbjct: 64  R----CGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKM 114

Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALL 176
                 L+   +V   + Q  +   G     D  A    +L    P      ++L + + 
Sbjct: 115 EREGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIF 174

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
                N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I
Sbjct: 175 GVISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVVF------NNGKI 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +R ++ +  G E+ +SY         R+++L   Y F C C+ CK
Sbjct: 222 ELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 267


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 70/326 (21%), Positives = 130/326 (39%), Gaps = 29/326 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V + +  G+GRGL + +   AG ++  ++ +   +A+ F + +   CH+CFR+     
Sbjct: 2   ENVVIFDSPGKGRGLKAAKEFWAGDVIFSEASL---AAVVFDSLAERVCHSCFRR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
              C  C  A +C   C  A  A         +A  ++ N N    A + L R  +    
Sbjct: 57  LQRCGQCKFAHYCDRTCQRAGWAEHKQECSAIKAYGKVPNENIRLVARI-LWRLDKDGST 115

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
           ++   L  ++  +  I    +  + +      ++L          T++  + +      N
Sbjct: 116 VSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDISHIFGVINCN 175

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
            F      +  Q G ++V   G++      NHDC PN            N+  I +R + 
Sbjct: 176 GFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNC-------TVILNHGKIELRSLG 222

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDNDNDDEN 296
            + +G E+ ++Y         RQ+ L   Y F C C+ CK      ++    + D    +
Sbjct: 223 KIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREVDGVKPS 282

Query: 297 NEEVMD-EDQDEQMVASDDDAEVHGD 321
            E+V +  D   QM+   D A ++GD
Sbjct: 283 EEQVKEATDYCYQMLEKMDKARLNGD 308


>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
          Length = 467

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 69/292 (23%), Positives = 114/292 (39%), Gaps = 33/292 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF---INSSSFCHNCFRKTMH 62
           E V V   EG+GRGL +T+   A  +V  +     YSA+ F   IN    CH CF++   
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVVFAERA---YSAVVFDSLINV--VCHTCFKR--Q 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A +C   C   A  +  +   C A+ R   +  P+  N+ L  ++  R 
Sbjct: 60  EKLHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKV--PNE-NIRLAGRIMWRV 114

Query: 122 LIAAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALL 176
                 L    +V+    Q  +   G     +       +L    P     +++  + + 
Sbjct: 115 EREGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
                N F L +     Q G ++V   GI+      NHDC PN          + V +  
Sbjct: 175 GVINCNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMF 228

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                I +R +  + +G E+ +SY    +    R+++L   Y F C C+ C+
Sbjct: 229 HTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHCQ 280


>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Cavia porcellus]
          Length = 490

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 110/286 (38%), Gaps = 21/286 (7%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V +   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEIFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCST-AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
              C  C  A +C   C   A  +  H     +   ++ N N   AA +    + +   L
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKHECSAIKRYGKVPNENVRLAARIMWRVEREGTGL 121

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
                L  V+  Q  +    +    +       +L    P     +++  + +      N
Sbjct: 122 TEGC-LVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCN 180

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNTDI 236
            F L +     Q G ++V   GI+      NHDC PN          + V +       I
Sbjct: 181 GFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRI 234

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +R +  +  G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 235 ELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|402219261|gb|EJT99335.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 445

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 125/314 (39%), Gaps = 55/314 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI---CP 69
           +G+GLV+   +  G+ + R+ P +    +       S + C  C    + +  ++   CP
Sbjct: 110 KGKGLVAKLPIAEGEALWREDPFVHAADWDVFDLQQSGNACQYCALPFIPAPPTLAVSCP 169

Query: 70  S----CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
           +    C+ AFCS  C T A    HS      L R  N  S D  +     Q +A   +A 
Sbjct: 170 AISHGCNAAFCSRLCLTRAQQGGHS-----LLCRGRNPPSKDLLDFARKEQWRAVIALAV 224

Query: 126 YNLAVVNP-----SQFQILLAFQGTVT--DNDTSAA---------------------HYL 157
               ++       S+ Q +L   GT    D +  A                      H++
Sbjct: 225 AVAKIMTAFERSDSEGQDMLEAWGTFATLDQEDRAKTLPEWEFTQERHRALWNMDYDHFV 284

Query: 158 SSLCPP----PASTTI-ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA---YGIYLKA 209
           ++LCPP     A T + ++    L++D          ++ G  GR S+      G++L  
Sbjct: 285 TTLCPPETDKAAQTRLRKIVKKTLSEDVRTELFSYRGFLRGL-GRVSLNLDSHGGLFLLH 343

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMI--HDVPQGREICLSYFPVNYDYSTRQKR 267
           S  NH C+PN        A A+      + ++   D+  G E+ ++Y    + Y  R+K 
Sbjct: 344 SHMNHSCIPNMAVKHPPAAQAQKYPSSRIALVANRDIQPGEELTVTYVNPTWPYR-RRKE 402

Query: 268 LLYDYDFACDCDRC 281
            L ++   CDC+RC
Sbjct: 403 ELKEWGVECDCERC 416


>gi|302676884|ref|XP_003028125.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
 gi|300101813|gb|EFI93222.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
          Length = 427

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/316 (24%), Positives = 118/316 (37%), Gaps = 66/316 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI-CPS- 70
           +G+GLV+   +K G+++ ++ P ++   ++A       + C  C     H+   + C + 
Sbjct: 91  KGKGLVAVDPIKEGEVIWKEDPFVMAPEWNAFEMQIQGTVCWQCCTPMQHTGREVSCRAH 150

Query: 71  ------CSVAFCSPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLD--RQLQ 118
                 C   FC+  C     S    P +C A     ++LL   + D   + L    QL 
Sbjct: 151 TSSANYCPARFCNRLC--LVRSDKTHPLLCPAQNPASSQLLRY-AHDKQWMALHGYAQLV 207

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ-------------GTVTDNDTSAA---HYLSSLCP 162
           AR ++A         S+  I+ AF              G   D +T  A   H L++   
Sbjct: 208 ARIILAYQGDEETFRSELDIMRAFAEIDVETRHRNTNPGLEPDKETWKAGHKHLLNTFVD 267

Query: 163 PPAS-------------TTIELTAALLAKDR--LNAFGLMEPYIEGQDGRRSVRAYGIYL 207
           PP +              T EL   L   D   L   G M    E        R+ G+Y+
Sbjct: 268 PPTADGKKKLKKIIRKPLTAELEKELFDYDTAFLKGLGRMVLNFE--------RSGGLYV 319

Query: 208 KASFFNHDCLPNAC--RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQ 265
             S  NH C+PN     FD     + N   I +    D+  G E+ +SY      Y  RQ
Sbjct: 320 LHSHMNHSCVPNISVRHFD----KSTNWARITMVAKKDLAPGEELMISYVDPEAPYKARQ 375

Query: 266 KRLLYDYDFACDCDRC 281
              L  + F C C RC
Sbjct: 376 AE-LEQWGFKCVCPRC 390


>gi|170042501|ref|XP_001848962.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167866038|gb|EDS29421.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 667

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 118/317 (37%), Gaps = 70/317 (22%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS- 72
           +GR +V+ Q LK  + ++ +  +   LY+        S C+ CF K +  +   CP C+ 
Sbjct: 271 KGRYIVAAQDLKPAEPIVVEPAVGACLYTKF----FGSHCNACFAKLV--APVACPECAG 324

Query: 73  VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLPLDRQLQ 118
           VAFCSP+C   A S  H                 +C    RL+         + L  +L+
Sbjct: 325 VAFCSPECRDRACSGYHRFECHYLDLLIGSGMSVLCHLALRLVTQAGTPEKAIELGAELK 384

Query: 119 ----------------ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
                            R L+AA+ L  +  ++F              T AA       P
Sbjct: 385 QTLCAHEDRREPSDYFQRTLMAAFLLRCLQKAEF---------FGRRKTEAAE------P 429

Query: 163 PPASTTI-ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHDCL 217
            P    +  +  ALL   + NA  + E  I G+    S +      G+Y  ++ FNH+C 
Sbjct: 430 TPLEAQVGGVILALLQSLQFNAHEIYETKISGEHRIDSAKVQYVGVGVYRTSAMFNHECY 489

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY--FPVNYDYSTRQKRLLYDYDFA 275
           P   R           T + +      P G  I  +Y    + +  + RQ+ L   Y F 
Sbjct: 490 PGVSRTFL-------GTTMCLYTSRPFPAGATIPENYGMHFIRHPAAVRQRTLRSRYWFG 542

Query: 276 CDCDRCKVEANWSDNDN 292
           C+C  C  + NW   D 
Sbjct: 543 CECRAC--QENWPLMDK 557


>gi|297739785|emb|CBI29967.3| unnamed protein product [Vitis vinifera]
          Length = 660

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 65/294 (22%), Positives = 123/294 (41%), Gaps = 37/294 (12%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  E  GRG+ +T+ + +G ++    P++ + +L  I+S   C+ C RK    +SS 
Sbjct: 380 IRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHS--VCYFCLRKLKPVTSS- 436

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLDRQLQARF 121
              C+V FCS +C   +        V  A+ R  + ++ D          PL  +  A  
Sbjct: 437 -EDCNVRFCSQECEEQSK-------VFVAVERKADWSAYDDYCRTRGLKYPLLVKRLACM 488

Query: 122 LIAAYN----LAVVNPSQFQI-LLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
           +++       L ++ P+     +++  G       SA     +     A  T +    +L
Sbjct: 489 VVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQWYINVL 548

Query: 177 AKDRLNAFGL---------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           A+ R+N+F +         +           +     +Y+  SF+NHDC PN     ++D
Sbjct: 549 ARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNV-HIIWID 607

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                N +  ++ + ++  G E+ + Y   + D+  RQ  L   + F C C RC
Sbjct: 608 -----NVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRC 656


>gi|195012821|ref|XP_001983754.1| GH16068 [Drosophila grimshawi]
 gi|193897236|gb|EDV96102.1| GH16068 [Drosophila grimshawi]
          Length = 662

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 130/314 (41%), Gaps = 44/314 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  +  V + E + +GR +V+ + LK G +VL ++P+       F  ++  CH+CF++ 
Sbjct: 251 LSGAANVVKLVETKDKGRFVVANEGLKTGDVVLSENPVAACLLPNFFGTN--CHHCFKR- 307

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H+  + C  CS +AFCS +C   A  + H  + CQ +   +         + L    QA
Sbjct: 308 LHTPVA-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCYIALRIFTQA 365

Query: 120 RFL---IAAYNLA---------VVNPS---QFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
             L   +   NL          V  P    Q  ++  F   +        H  +    P 
Sbjct: 366 ASLEDGLQTSNLLFEHLVSHQDVRQPDDYLQRSLMAGFLMRILQKAQYFGHRKTEGVNP- 424

Query: 165 ASTTIELTAA-----LLAKDRLNAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
             T +EL  A     LL   + NA  +    +  +   +G + +    G+Y   S+FNH+
Sbjct: 425 --TAVELQVATSLLGLLQVLQYNAHEIYHTMVTDEHCFEGCKVIYLGAGLYGTGSYFNHE 482

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYD 273
           C P+   + YV         +++           + ++Y P+    +   RQ+ L   Y 
Sbjct: 483 CWPSTAGY-YV------GKRLVMAATKPHRPNEIVAVNYGPIFSKMNLKERQRSLRGRYA 535

Query: 274 FACDCDRCKVEANW 287
           F+C+C  C  + NW
Sbjct: 536 FSCNCMTC--QENW 547


>gi|238879311|gb|EEQ42949.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 379

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 16/134 (11%)

Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           ++   C P     I+ T   +++ K+  NAFG+     +  + +  +  +G+Y  ASFFN
Sbjct: 253 FIKLTCTPELQAFIDSTTIRSIIGKNLSNAFGIWSEVTDPSEDKEFL-GFGVYPSASFFN 311

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLY 270
           H C PN  +         NN++++     D+  G E+C+SY   NY       RQK+L  
Sbjct: 312 HSCSPNIVK-------TRNNSEMVFTTSKDIEIGEELCISY--GNYTDEPVELRQKQLK- 361

Query: 271 DYDFACDCDRCKVE 284
           ++ F C C +C+ E
Sbjct: 362 EWFFDCACTKCQTE 375


>gi|68484599|ref|XP_713759.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|68484668|ref|XP_713725.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46435236|gb|EAK94622.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46435271|gb|EAK94656.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
          Length = 379

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 16/134 (11%)

Query: 156 YLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           ++   C P     I+ T   +++ K+  NAFG+     +  + +  +  +G+Y  ASFFN
Sbjct: 253 FIKLTCTPELQAFIDSTTIRSIIGKNLSNAFGIWSEVTDPSEDKEFL-GFGVYPSASFFN 311

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLY 270
           H C PN  +         NN++++     D+  G E+C+SY   NY       RQK+L  
Sbjct: 312 HSCSPNIVK-------TRNNSEMVFTTSKDIEIGEELCISY--GNYTDEPVELRQKQLK- 361

Query: 271 DYDFACDCDRCKVE 284
           ++ F C C +C+ E
Sbjct: 362 EWFFDCACTKCQTE 375


>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
 gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
          Length = 478

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 129/312 (41%), Gaps = 40/312 (12%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V + E EG+GRGL + +   AG I+  +     YSA+ F N S S CH+CF++     
Sbjct: 2   ENVEIFESEGKGRGLRAIRESWAGDIIFAEPA---YSAVVFDNLSHSVCHSCFKR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   + ++  +   C A+ +     +P+  N+ L  ++  R   
Sbjct: 57  LLRCGQCKFAHYCDRTCQKESWANHKNE--CVAIKKA--GKAPN-ENIRLAARILWRIER 111

Query: 124 AAYNLA---VVNPSQFQILL-----AFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
               L    +V+    Q  +     A +G + ++      Y  S         I    ++
Sbjct: 112 EGSGLTEGCLVSIDDLQNHIDKFDEAEKGLLMEDVQKFLEYWPSQSQQFGMQYISHIFSV 171

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           ++    N F L +     Q G ++V   GI+      NHDC PN C   +      NN  
Sbjct: 172 IS---CNGFTLSD-----QRGLQAV-GVGIFPNLCLANHDCWPN-CTVIF------NNGK 215

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN----WSDND 291
           I +R +  + +G E+ +SY         R+ +L   Y F C C+ C  +       +  D
Sbjct: 216 IELRALGKINKGEELTVSYVDFLNLTEDRKAQLKKQYYFDCTCEHCTKKTKDALLLAVKD 275

Query: 292 NDDENNEEVMDE 303
            +D+  E V+ E
Sbjct: 276 GEDKPEERVVKE 287


>gi|298708048|emb|CBJ30401.1| set and mynd domain containing [Ectocarpus siliculosus]
          Length = 589

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 110/296 (37%), Gaps = 43/296 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSICPSCSVA 74
           RGRG+ + +++ AG  VL        +ALP      +FC  C      S    C  C   
Sbjct: 134 RGRGIFAVEAVPAGTEVLVAKQ---AAALPRDRYRPAFCRRCLTLLDKSLLIKCRRCEDR 190

Query: 75  FCSPKCSTAAASSSHSPYVCQALT-----RLLNLNSPD-AANLPLDRQLQARFLIAAYNL 128
           FC  +C  AAA        C  +      +L  L   D AA    D             L
Sbjct: 191 FCGKECVIAAAGEGTHEVTCGLVEGLGEEQLSELGRGDNAAATETD----------ILRL 240

Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
            V   S+    L+ +    + D                 T+E      A+ RLN  GL  
Sbjct: 241 TVECMSRRGAGLSDEEEWIEIDDLVLPEDEGGGEGSGGGTLEPDVIREAQARLNDGGLEV 300

Query: 189 PYIEGQDGRRSVR------------------AYGIYLKASFFNHDCLPNACRFDYVDAAA 230
              E Q   R +R                    G++  AS  NH C PNAC F    A  
Sbjct: 301 SAEEIQTVHRRLRRNFHTVQPEPLGRLLPPTVVGVFPAASLVNHSCEPNAC-FHSRRAGP 359

Query: 231 EN-NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYD-YDFACDCDRCK 282
           E    ++ +R   DV  G E+C+SY     D +T++ R  LL + + F+CDC RC+
Sbjct: 360 EGPPLELALRCTTDVSAGEEVCVSYLAHCADAATKEGRRELLQNVWGFSCDCPRCE 415


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 121/290 (41%), Gaps = 37/290 (12%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS 64
           S  ++V+    +GR L++ ++   G+++L+  P +   AL   + +  C  C+R++  S+
Sbjct: 16  SHNLSVSVSPAKGRCLIANRAFHRGEVILQQEPYV--CALDDSSQALRCDRCYRQS--SN 71

Query: 65  SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLN--LNSPD-----AANLPLDRQ 116
              C +C   F C   C  +        + C  L RLL     SP           + R+
Sbjct: 72  LKRCSACKTVFYCCANCQRSGWGLH--KFECAILVRLLKEKKRSPTPSLRLVMRFLIKRR 129

Query: 117 LQA-RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP---PPASTTIELT 172
           LQA R  + A   AV N    ++L       +++       ++S+      P    ++  
Sbjct: 130 LQAERVQLKA---AVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMAPDEVNVKEV 186

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAE 231
             ++ +   NA  + +  +            G+Y   S  NH C+PNA   FD   AA  
Sbjct: 187 TQMICRFACNAHTICDEEVRPL-------GTGLYPVISIVNHSCVPNAVLHFDGNRAA-- 237

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 +R + D  +G EI +SY  +    +TR+K L   Y F C+C RC
Sbjct: 238 ------LRALEDTQEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRC 281


>gi|440302623|gb|ELP94930.1| set and mynd domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 428

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 62/273 (22%), Positives = 102/273 (37%), Gaps = 39/273 (14%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           V E  G+GRG+     +     V+     L+   L ++    +C  CF K +  +S I  
Sbjct: 6   VKESTGKGRGVFLNTDITEKTEVVHSK--LIGGCLNYVEWKFYCEYCF-KELKENSKISC 62

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
            C   +CS +C   A    H    C  +  L  +  PD     L        L+   ++ 
Sbjct: 63  GCGFNYCSMECQKKAYDEYHKN-ECSIIKSLREI--PDGIGEIL--------LLYRCSIK 111

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
            V    +  L         N+  A  Y   L    +S    +   + A    N+F L   
Sbjct: 112 TVKWEDYCSLPNI------NNYKAVEYCVLLFKSNSSLLKTIWEVIQA----NSFCLT-- 159

Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
                +G   V   G++  ASF NH C PN           +N  ++ ++ +  +  G E
Sbjct: 160 -----NGEEQVIGIGLFDYASFINHSCCPNC-------VPLQNKREMSIKSLTSIKSGEE 207

Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           I +SY  +   +  R+K L   + F+C C  C+
Sbjct: 208 IFISYIDITESFERREKELR-KWHFSCGCSLCE 239


>gi|225441557|ref|XP_002276611.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Vitis
           vinifera]
          Length = 327

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/294 (22%), Positives = 123/294 (41%), Gaps = 37/294 (12%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  E  GRG+ +T+ + +G ++    P++ + +L  I+S   C+ C RK    +SS 
Sbjct: 47  IRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHS--VCYFCLRKLKPVTSS- 103

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLDRQLQARF 121
              C+V FCS +C   +        V  A+ R  + ++ D          PL  +  A  
Sbjct: 104 -EDCNVRFCSQECEEQSK-------VFVAVERKADWSAYDDYCRTRGLKYPLLVKRLACM 155

Query: 122 LIAAYN----LAVVNPSQFQI-LLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
           +++       L ++ P+     +++  G       SA     +     A  T +    +L
Sbjct: 156 VVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQWYINVL 215

Query: 177 AKDRLNAFGL---------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           A+ R+N+F +         +           +     +Y+  SF+NHDC PN     ++D
Sbjct: 216 ARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNV-HIIWID 274

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                N +  ++ + ++  G E+ + Y   + D+  RQ  L   + F C C RC
Sbjct: 275 -----NVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRC 323


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 115/290 (39%), Gaps = 33/290 (11%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M+A+++   V    G GR +++T+ L+ G   + + P +       +N +  C  CFR  
Sbjct: 1   MTAINDAAQVENRNGYGRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQT--CSGCFRLA 58

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
            H     C SC V  +CS  C  +  S  H P  C+    +     P    L L R +  
Sbjct: 59  AHMQQ--CSSCKVVQYCSQTCQRSDWSI-HKPE-CEGFKAVQPRIPPSPVRL-LGRMMFK 113

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA---STTIELTAALL 176
           R         VV   Q +     + +      +A   ++S   PPA   STT ++ A L 
Sbjct: 114 RAKDCNEFERVVG--QLESHRDKRASKDIEHIAAMLQMASGFIPPALLLSTTADMIA-LC 170

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
            K ++N                + R   IY + S  NH C+PNAC    +   A      
Sbjct: 171 CKIQVNTMT-------------TERGVAIYDRLSTVNHSCVPNACLTFGIGGIAR----- 212

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            +  +  +  G +I +SY  V     TRQ++L   Y F C C  C   AN
Sbjct: 213 -LSPMTAIASGDQINISYVDVFQSCETRQRQLKEQYYFDCTCRLCTANAN 261


>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
           domestica]
          Length = 490

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 113/290 (38%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F + +   CH CF++     
Sbjct: 7   EKVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSFTQVVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +    + C A+ +      P + N+ L  ++  R   
Sbjct: 62  LQRCGQCKFAHYCDRTCQKDAWLNH--KHECAAIKKH---GKPPSENIRLAARIMWRIER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDND--TSAAHYLSSLCPPPASTTIELTAALLAK 178
               L    +V+    Q  +   G     D       +L+         +++  + +   
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRVDVDSFLNFWPAQSQQFSMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              NAF L +     Q G ++V   GI+      NHDC PN          + V +    
Sbjct: 177 INCNAFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHT 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYVDFLNVSQERKKQLKKQYYFDCTCEHCE 280


>gi|195589810|ref|XP_002084642.1| GD12724 [Drosophila simulans]
 gi|194196651|gb|EDX10227.1| GD12724 [Drosophila simulans]
          Length = 663

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 75/329 (22%), Positives = 136/329 (41%), Gaps = 74/329 (22%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +   S+ V + E + +GR +V+ + L+ G ++L + P+   + L      + CH+CF++ 
Sbjct: 249 LVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVA--ACLEPSYFGTHCHHCFKRL 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLN--------------LNS 105
               S  C  CS +AFCS +C   A SS H  + C+ +  ++                 +
Sbjct: 307 RTPVS--CLHCSGIAFCSAQCMGEACSSYHR-FECEYMDLMIGSGMSILCFIALRIFTQA 363

Query: 106 PD------AANLPL--------DRQ---LQARFLIAAYNLAVVNPSQFQILLAFQGTVTD 148
           P        ANL          DRQ      R L++ + L ++  S     L F    T+
Sbjct: 364 PSLEQGLATANLLFEHLCSHEEDRQPDDYLRRALMSGFLLRILQKS-----LYFGRRKTE 418

Query: 149 NDTSAAHYLSSLCPPPASTTIELTAALLAKDRL---NAFGLMEPYIEGQD---GRRSVR- 201
             +            P +  +++  ALL   ++   NA  + +  +  +    G ++V  
Sbjct: 419 GVS------------PTAVEVQVATALLGLMQVLQYNAHQIYQTQVTEEHRFAGSKTVYL 466

Query: 202 AYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VN 258
           A G+Y   S+FNH+C P+ AC F            +++           + ++Y P  + 
Sbjct: 467 AAGLYGTGSYFNHECWPSTACHF--------VGKKLVLTATRPHRANELVAVNYGPIFIK 518

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANW 287
            +   RQ+ L   Y F+C+C  C  + NW
Sbjct: 519 NNLKERQRSLRGRYSFSCNCMAC--QENW 545


>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
          Length = 490

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +  CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   G++   +  NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNISEERKKQLKRQYYFDCTCEHCQ 280


>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
          Length = 484

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 71/286 (24%), Positives = 113/286 (39%), Gaps = 40/286 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
           G GRGL ++  +  G+ VL   P+   + L   +    C  CF +       I    C  
Sbjct: 13  GLGRGLFASTDIAVGEDVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRG 71

Query: 71  CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
           C V  +C   C        HS + C    +L          LP++ +   R ++ +    
Sbjct: 72  CGVVKYCDKTCQAKDWKLGHS-FECTIYQKL------KPRILPINARAVLRMVLRSERQK 124

Query: 130 VVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTTI---ELTAALLAKDRL 181
             +  +    L  +  + D  D SA+H+    LSS      S T    E+ +A+ AK  L
Sbjct: 125 YSD-EELDQFLQLETHIKDIRDESASHWERISLSSKAIKAYSGTEMSEEVISAMGAKLDL 183

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
           N+F L     +           G+YL   A+ FNH C  NA       A + +  ++ ++
Sbjct: 184 NSFNLTNAVYD---------RLGVYLHPYAAIFNHSCDHNA-------AVSFDGPNLHIK 227

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
            I  + +  +I ++Y  V   Y  RQ  L   Y F C C +C  EA
Sbjct: 228 AIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHCSKCTSEA 273


>gi|303280161|ref|XP_003059373.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226459209|gb|EEH56505.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 740

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 3/99 (3%)

Query: 198 RSVRAYG-IYLKASFFNHDCLPNACRFDYVD-AAAENNTDIIVRMIHDVPQGREICLSYF 255
           R  R  G +Y K S FNH C PN C + +    +A++   + V+ I  + +G E  + Y 
Sbjct: 328 REERVGGAVYAKLSRFNHSCAPN-CTWHFAPPGSADSPVSVTVKAIRPIKKGDEATVCYL 386

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
             +     R++RL  +Y FAC+CDRC+      + + DD
Sbjct: 387 DASGGRDARRRRLWAEYRFACECDRCQRLIRQGEENPDD 425


>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Ailuropoda melanoleuca]
          Length = 490

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 65/288 (22%), Positives = 112/288 (38%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  ++  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFTSEGKGRGLKATKEFWAADVIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + S     AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPSESIRLAARILWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEG-CLVAVDELQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHVFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NH+C PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQ 280


>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
          Length = 490

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +  CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDMFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   G++   +  NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQ 280


>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
          Length = 476

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 112/288 (38%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   +G+GR L +T+   A  I+  +     YSA+ F +  +F CH CF++     
Sbjct: 7   ENVEVFISDGKGRSLKATKEFWAADIIFAERA---YSAVVFDSLVNFVCHTCFKR--QEK 61

Query: 65  SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A+ C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHHCGQCKFAYYCDRTCQKDAWLNHKNE--CLAIKRYGKVPNENIRLAARIMWRVEREGT 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  ++   +            +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVMHFGEEEQKQLQVDVDTFLQYWPPKSQQFSMQYISNIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   GI+      NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQ 280


>gi|452987883|gb|EME87638.1| hypothetical protein MYCFIDRAFT_27819 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 351

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 64/149 (42%), Gaps = 18/149 (12%)

Query: 145 TVTDNDTSAAHYL--SSLCPPPASTTI--ELTAALLAKDRLNAFGLMEPYIEGQDGRRSV 200
           T  D D+S   +L  +S+ P      +  EL  AL+  D  NAFG+      G +     
Sbjct: 213 TKQDLDSSCRAFLQLTSILPETILPALGAELCLALVQADNHNAFGIRG----GGEDSEEY 268

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PV 257
             YG+Y  AS+FNH C PN        A            + D+  G E C++Y      
Sbjct: 269 MGYGLYPCASYFNHSCDPNI-------AKHRRGRSWEFHALRDISAGEECCITYLGGDER 321

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           + D   R +RL   +DF C C RCK EA 
Sbjct: 322 DMDVLERLRRLRDVWDFVCVCQRCKAEAG 350


>gi|403218380|emb|CCK72870.1| hypothetical protein KNAG_0M00170 [Kazachstania naganishii CBS
           8797]
          Length = 491

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 70/323 (21%), Positives = 127/323 (39%), Gaps = 52/323 (16%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRD-SPILL---YSALPFINSSSFCHNCF 57
           +A+   + + +    GRGL S +   AG ++++D  PI L      L  + +   C  C 
Sbjct: 107 NALGNQLEIKDTPHMGRGLFSLKDFNAGDLLIKDLEPICLIPPMEKLTLMQNGKACSLCG 166

Query: 58  RKTMHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVC-QALTRLLNLNSPDAA 109
           +   + +S         C  C   +CS KC        H+P    +   +L+N       
Sbjct: 167 QSIANINSHFIVINGLDCNDCQAVWCSKKCKLKDII--HAPLTHNKGKNKLVN-----GG 219

Query: 110 NL-PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
           N    ++       +AAY+L ++  S++ +  + +         +  ++S       S +
Sbjct: 220 NWRKFEKFCSDNVFVAAYSLGIMF-SKYLLTKSKEAQTIKKKFDSLAFVSQRTRAQYSDS 278

Query: 169 IELTAALLAKDRLNAFGLMEP-YIEGQD----------------------GRRSVRAYG- 204
             +   L A D +N     EP + EG                        GR ++ A   
Sbjct: 279 SNIGGTLDASDGVNTTADPEPKWKEGYQLFIDTFPSLKEEVDYERFLEYIGRFNLNAIAE 338

Query: 205 -IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            +Y   SF NH+C PN  RF+      ++   I      D+ +G +I ++Y    ++ + 
Sbjct: 339 QLYYIPSFINHNCEPNV-RFE-----KDSRLHINFYARKDIKKGEQIFMTYCNPLHEVNL 392

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R++ L  +Y F C CDRC+ E  
Sbjct: 393 RRRELRVNYGFLCFCDRCRKEVK 415


>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
 gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
          Length = 490

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 25/288 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V V   EG+GRGL +T+   A  I+  +     YSA+ F +  +  CH CF++     
Sbjct: 7   ENVEVFTAEGKGRGLKATKEFCAADIIFAERA---YSAVVFDSLVNVVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQAR 120
              C  C  A +C   C   A  +  +   C A+ R   + N N   AA +    + +  
Sbjct: 62  LHRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKRYGKVPNENIRLAARIMWRVEREGS 119

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            L     L  V+  Q  +    +    +       +L    P     +++  + +     
Sbjct: 120 GLTEGC-LVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAENNT 234
            N F L +     Q G ++V   G++   +  NHDC PN          + V +      
Sbjct: 179 CNGFTLSD-----QRGLQAV-GVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQM 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  + +G E+ +SY         R+K+L   Y F C C+ C+
Sbjct: 233 RIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQ 280


>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
          Length = 554

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 123/295 (41%), Gaps = 48/295 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFRKTMHS 63
           V +   + +GR  V+++ L  G      + + L +A  F+  S F    CH C      +
Sbjct: 61  VHLKNTKTKGRHAVASEELPEG------ATVSLETATAFVVRSEFIDQNCHFCLADL--T 112

Query: 64  SSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR-- 120
              IC +CS+++ CS  CS+    + H P VC A +++  +    A ++ +D        
Sbjct: 113 QKVICSNCSLSYYCSETCSSKD-HTLHQP-VCNAFSQVQAI--AHATDVDVDLLRLMLLL 168

Query: 121 ----FLIAAYNLAVVNPSQ-FQILL--------AFQGTVTDNDTSAAHYLSSLCPPPAST 167
                + +  N   V P    + LL        AF+  VT    +A+  L S  P     
Sbjct: 169 LARRHMDSQSNQEDVTPFWCVEDLLSHIESAPAAFKQVVT----AASERLLSELPESVQI 224

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY-LKASFFNHDCLPNACRFDYV 226
            +E   +L  +   NA GL + +      R +  A G++ L A FFNH C PN     +V
Sbjct: 225 PVEDMVSLACRINSNAHGLGDNH-----SRNTDVALGLFPLGALFFNHGCNPNTA---FV 276

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 N  +  R I  V +  E+ +SY  +  D   R++ LL    F C C RC
Sbjct: 277 GLP---NGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQELLTTKHFWCKCKRC 328


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 67/284 (23%), Positives = 116/284 (40%), Gaps = 30/284 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           ETV V + +G+GRGL + +   AG ++  +     Y+A+ F + +   CH CF++  H+ 
Sbjct: 2   ETVEVFDTDGKGRGLKAAKEFWAGDVIFAERA---YAAVVFDSLTHLVCHTCFKR--HAK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ +  +  +P+  N+ L  ++  R   
Sbjct: 57  LHRCGQCKFAHYCDRTCQKDAWVNHKNE--CAAIKK--HGKAPN-ENIRLAARIMWRIER 111

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
               L     V+    Q  +   G     +       +L           ++  + +L  
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLESFLEFWPHDGKQFAMQYISHILGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N F L +     Q G ++V   GI+      NHDC PN C   +      NN  I +
Sbjct: 172 INCNGFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPN-CTVIF------NNGKIEL 218

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           R +  +  G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 219 RALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 262


>gi|303286946|ref|XP_003062762.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226455398|gb|EEH52701.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 732

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 66/146 (45%), Gaps = 25/146 (17%)

Query: 154 AHYLSSLCPP-PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFF 212
           A+ ++S+ PP PA   ++  A +++K  +N+ G++       D        G+Y  A+ F
Sbjct: 355 ANAVNSMLPPGPARMDVDALAEVMSKVHVNSHGVV-------DAAGRALGTGVYPPAALF 407

Query: 213 NHDCLPNAC-RFDYV----------------DAAAENNTDIIVRMIHDVPQGREICLSYF 255
           NH C PNA   F                   + +  N   + VR +  V +G EIC++Y 
Sbjct: 408 NHSCAPNAVVSFGGADDDGDGSHSDRDGSCSNGSTNNGARLTVRCVSPVEEGEEICIAYA 467

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
            V     TR++ L     FAC+C RC
Sbjct: 468 EVYATRETRREALWEKKAFACECRRC 493


>gi|332022210|gb|EGI62525.1| SET and MYND domain-containing protein 5 [Acromyrmex echinatior]
          Length = 391

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 89/205 (43%), Gaps = 22/205 (10%)

Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
           L       + + +Q+    L   Y      P  F+ LLA  GT      +++        
Sbjct: 200 LGEKFVGQIDILQQMMKEALNTEYTAHWFTPDGFRSLLALVGTNGQGIGTSSF------- 252

Query: 163 PPASTTIELTAALLAKD-RLNAFGLMEPYIEGQDGR----RSVRAYGIYLKASFFNHDCL 217
             +     ++A  L KD R+    L+E   +  D       +    G+Y+  S  NH C+
Sbjct: 253 --SRWVKNVSALELPKDERIYIDKLIERIYDAMDEAVGPFLNNEGSGLYVLQSSVNHSCI 310

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDF 274
           PNA     V+    NNT ++++ I ++  G EIC SY     +     +RQ+ L   Y F
Sbjct: 311 PNA----VVEFPYSNNT-LVLKAIRNIKVGEEICTSYLDECQLERSRHSRQQALSSLYLF 365

Query: 275 ACDCDRCKVEANWSDNDNDDENNEE 299
            C CD+C+ +AN  D  +D+E ++E
Sbjct: 366 VCHCDKCQAQANDPDVTSDEEESDE 390


>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
 gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
          Length = 398

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 50/83 (60%), Gaps = 6/83 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY-DYS 262
           G+YL  S  NHDC PN    +++D    NN  + V+ I ++ +G+E+C+SY   +  ++ 
Sbjct: 300 GLYLLQSCCNHDCSPNV-EINFLD----NNATLTVKAIRNISEGQELCISYIDSDIKNWK 354

Query: 263 TRQKRLLYDYDFACDCDRCKVEA 285
            RQ  L+ +Y F C C+RC VE+
Sbjct: 355 KRQAILMENYLFECTCNRCMVES 377


>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 315

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 75/309 (24%), Positives = 114/309 (36%), Gaps = 57/309 (18%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHS 63
           E+V V +    GR LV T    AG+ +L + P     A    N     FC  C ++   S
Sbjct: 9   ESVRVVKSATCGRSLVFTSKFAAGKCLLEELPY----AYVLCNKKRGLFCDFCLKQC--S 62

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQA 119
           +   C SC  V +CS  C        H    C+ L R+     PD A L    + +Q Q+
Sbjct: 63  TLLRCSSCKYVRYCSRPCQKGDWKRCHKQD-CKTLKRIHPRVPPDLAQLLAQIIRKQRQS 121

Query: 120 ----------------RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPP 163
                           +       L+      F+ L        + D            P
Sbjct: 122 PPCTQDDEDCFPTTVDQLESHHEKLSDARKDHFESLWFVLQQCIEEDVL----------P 171

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
             S+ +++  A +     N+F + +  + G        A GIYL+AS  NH C PN    
Sbjct: 172 QPSSLVKMFGATIC----NSFSICDNDLNGI-------AVGIYLRASMLNHSCYPNC--- 217

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
                   +   + +R + DV +G    +SY  V      RQ  L   Y F+C+C +C  
Sbjct: 218 ----VVVFDERKLQLRTVRDVKEGDVCTISYVDVINPAKERQTELEERYHFSCNCVKCID 273

Query: 284 EANWSDNDN 292
           E N  + D+
Sbjct: 274 EINELEPDD 282


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 108/276 (39%), Gaps = 37/276 (13%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSS 66
           + V+ +  +GR L++T++   G+++LR  P   Y  +P  +S+ S C  CF      S  
Sbjct: 13  LAVSNLPEKGRCLLTTKNFNPGEVILRQEP---YVCVPNNSSTVSRCDGCF---ASESLK 66

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
            C +C V  +C   C  +          C AL+RL        A  P  R L  R  +  
Sbjct: 67  KCSACQVVWYCGSTCQKSEWKLHRLE--CNALSRLEK--EKRKAVTPSIR-LMVRLYLRR 121

Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
                ++  + Q++L  Q          A+ +  +   P     E+ A   +K   NA  
Sbjct: 122 KLQNEMDLDEKQLVLYAQ---------MANLVHFILQWPEINLKEI-AENFSKLACNAHT 171

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +        D        G+Y   S  NH C+PNA                +VR +  +P
Sbjct: 172 IC-------DCELRPLGTGLYPVVSIINHSCMPNA-------VLTFEGKSSVVRAVEHIP 217

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +G E+ ++Y        TRQK L   Y F C C RC
Sbjct: 218 EGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRC 253


>gi|365758076|gb|EHM99938.1| Set6p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 374

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 72/156 (46%), Gaps = 23/156 (14%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       TI L   +L  +  NAFGL +   
Sbjct: 232 SKFPVLLHFQKLVFQT-------LYILLPSHTRKMLTISLLRHILGTEYGNAFGLWQ--- 281

Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDVPQGRE 249
           EG+    R    Y ++ +AS+FNH C PN  ++   +    N NTDI         +  +
Sbjct: 282 EGEAAESREYFGYWVFPEASYFNHSCTPNVTKYRKGNIMLFNVNTDI--------KKNEQ 333

Query: 250 ICLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           IC+ Y  V    + ++++ L D + F C C+RCK E
Sbjct: 334 ICIDYSGVLELPTVKRRKFLADSWFFDCACERCKSE 369


>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 140

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 13/114 (11%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T +    LL +   N F +++    G        A G+Y  A+ FNH C PN      V 
Sbjct: 31  TFDAILGLLVRFPCNNFAIVDDLWSGI-------AAGVYPNAALFNHSCHPN------VI 77

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            A  + + +  R I D+  G EIC SY  +      R+  LL DY+F C+C+RC
Sbjct: 78  PAFGHGSTLSFRAIRDISPGEEICHSYVELTLPSWKRRDVLLRDYEFLCECERC 131


>gi|401841590|gb|EJT43957.1| SET6-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 374

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 72/156 (46%), Gaps = 23/156 (14%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       TI L   +L  +  NAFGL +   
Sbjct: 232 SKFPVLLHFQKLVFQT-------LYILLPSHTGKMLTIPLLRHILGTEYGNAFGLWQ--- 281

Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDVPQGRE 249
           EG+    R    Y ++ +AS+FNH C PN  ++   +    N NTDI         +  +
Sbjct: 282 EGEAAESREYFGYWVFPEASYFNHSCTPNVTKYRKGNIMLFNVNTDI--------KKNEQ 333

Query: 250 ICLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           IC+ Y  V    + ++++ L D + F C C+RCK E
Sbjct: 334 ICIDYSGVLELPTVKRRKFLADSWFFDCACERCKSE 369


>gi|167521373|ref|XP_001745025.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776639|gb|EDQ90258.1| predicted protein [Monosiga brevicollis MX1]
          Length = 399

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 21/135 (15%)

Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
           PP  ++ EL        RL A  +   ++   + +      G++ +A+  NH C PNA  
Sbjct: 98  PPVQSSGEL-------QRLVAICMYNCFLTDHERQPGFDVLGLWPQAAAINHSCRPNAT- 149

Query: 223 FDYVDAAA-----ENNTDI-------IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
             Y+DA+A     E+  D+       I+R + D+ +G  I +SY  +   +  RQ+ L  
Sbjct: 150 -HYLDASAPMKPRESGADLPPEGGTMIIRSVSDIARGEPITISYVELGDPWPVRQEALRT 208

Query: 271 DYDFACDCDRCKVEA 285
            Y FAC C RC  EA
Sbjct: 209 GYGFACTCIRCTEEA 223


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 99/268 (36%), Gaps = 50/268 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSCS-V 73
           RGR L +T+++K G+ +LRD     Y  +    N +  C  C R     +   C  CS V
Sbjct: 16  RGRLLRATRNVKMGESILRDCA---YGNVVLSGNRAELCAVCLRAA--DADICCDDCSKV 70

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            FCS +C         +   C+AL  +  + S  + +L  D +   R   A         
Sbjct: 71  FFCSEECREKLRDIHENE--CEALEEVDLIASKTSVDLHSDEEGNVRSSYA--------- 119

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
                          N     H L     P A              R  A  ++E   + 
Sbjct: 120 ---------------NVKDLVHVLDKEAGPWADHV-----------RAGAKRILEDLPDE 153

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
                 V A G++      NH C PN    +  D+  E      VR + D+ +G EI LS
Sbjct: 154 CHLPHLVAAVGLFSICGLINHSCQPNCTWSNAGDSVME------VRALRDIEEGEEITLS 207

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  ++ + S RQK L     F C C+RC
Sbjct: 208 YIDIDKERSERQKELRDTKHFDCQCERC 235


>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 487

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/291 (25%), Positives = 116/291 (39%), Gaps = 35/291 (12%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS-FCHNCFRKTMHSS 64
           E+V V +    GR LV T     GQ +L + P   Y+     N    FC  C +K   S+
Sbjct: 7   ESVRVVKSATCGRSLVFTSKFARGQCILEELP---YAYTLHDNKRGLFCDFCLKKC--ST 61

Query: 65  SSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C SC+ V++C+  C     +  H    C+ L ++      D A L L + ++ R   
Sbjct: 62  LKKCSSCNYVSYCNKSCQKKDWARCHKQD-CKTLKKIHPRVPSDLAQL-LSQIIRKRRKS 119

Query: 124 A--AYNLAVVNPSQFQILLAFQGTVT-DNDTSAAHYLSSL-----CPPPASTTIELTAAL 175
           A  + +     P+    L +    ++ ++  SA H L         P    + +++  A 
Sbjct: 120 APCSQDDEDCFPTTVDQLESHHEKLSIESSVSALHKLKHCIEEEDVPTDPRSLLKMYGAT 179

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                 N+FG+        D      + GIYL+AS  NH C       DY      +   
Sbjct: 180 ----NCNSFGIF-------DNDLIAISDGIYLRASMVNHSC-------DYNCIVVFDERK 221

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           + +R + DV +G E  + Y  V +    R+  L   Y F C C +C  E N
Sbjct: 222 LQLRTVKDVKEGEECTIGYVDVIHPAKERRAELEEKYHFTCKCVKCNEEIN 272



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 39/91 (42%), Gaps = 11/91 (12%)

Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
           G+ ++ +Y    KAS  NH C PN        A   +   I +  + DV +G E  ++Y 
Sbjct: 354 GQLNIESY----KASMLNHSCDPNC-------AWVSDGRKIQIMTVKDVKEGEECTITYV 402

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            V      RQ  L   Y F C C +C  E N
Sbjct: 403 DVMDPAKVRQADLKERYHFTCKCVKCIEEIN 433


>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
           FGSC 2508]
          Length = 668

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/298 (25%), Positives = 113/298 (37%), Gaps = 51/298 (17%)

Query: 16  RGRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSS 65
           RGR L ST+   AG+ I + D+P+L   ALP   N  + C+ C        F   + +  
Sbjct: 22  RGRSLFSTERFGAGETIAIIDNPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVEAGP 78

Query: 66  SICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
             C +C+      +C+ +C  A     H    C+   R+      D    P+    Q   
Sbjct: 79  RTCKACTGCKAAVYCNAECQRAHWKLVHKAE-CKMFKRIKERTGKDWLPTPVRAVAQVML 137

Query: 122 LIAAYNLAVV-----------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
           L+ A +   V           N   F+      G      T A  Y   L    +  T++
Sbjct: 138 LLKAGDEETVKAFGPGGTLESNVEGFKTDEGLWGDFELQATGAVVYAGLL---QSDETLK 194

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDA 228
               +L K + NAF       +   G+      GIYL  S    NH C+PNA    Y+  
Sbjct: 195 QAMEVLCKMQTNAFN----RFDADTGQ-----AGIYLHPSLSMVNHSCVPNA----YI-- 239

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            A       ++   D+  G EI +SY         RQ+ L   Y F C+C RCK + N
Sbjct: 240 -AFEKRKAFLKAERDIEPGDEILISYIDHTMPRRARQESLRL-YHFQCNCIRCKDDLN 295


>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
 gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
          Length = 597

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/300 (22%), Positives = 113/300 (37%), Gaps = 27/300 (9%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +  + E + +      GRGL + Q +K G +++   P +  S L   N +S+C NCF + 
Sbjct: 64  LHGIPEYLYIRSSPSEGRGLWAKQPIKRGTVIIAVKPHV--SVLTTQNLASYCSNCFEEA 121

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
             +    C  C V  +C+ +C     ++      C AL            ++P D     
Sbjct: 122 PEAGLKRCAHCRVVHYCNSECQNKDWATHK--RECTALQEWAKHAPAAEVSVPSDAVRCL 179

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
             L+       ++ +  Q + A Q +      S   + + L     S ++     L + D
Sbjct: 180 GRLLWKRQKKGLDSTWAQQIAAMQSSRKSLQPSGYEFHTHL-----SHSLVRYLGLSSPD 234

Query: 180 RLNAFGLMEPY-IEGQDGRRSVRAYGIYLK------------ASFFNHDCLPNACRFDYV 226
            L  FG+  P  +     R +  +Y +                S  NH C PNA    + 
Sbjct: 235 GLQDFGISSPGDLLDLTSRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATV-FP 293

Query: 227 DAAAENNTD---IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
            A+   +TD   I V  I D+    +I  SY       + R++ L   Y+F C C  CK 
Sbjct: 294 RASKTPSTDEPLISVVAIRDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLCKT 353


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 70/295 (23%), Positives = 118/295 (40%), Gaps = 33/295 (11%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           ++++ +  +GR L++T+    G++++   P +       ++    C  CF    ++  S 
Sbjct: 13  LSLSTLPEKGRSLLATRDFYPGEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVLSR 72

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYV--CQALTRL-------LNLNSPDAANLPLDRQLQ 118
           C  C +AF    C TA   S    +   C+ L+ L       L  +      L L R+LQ
Sbjct: 73  CSRCQLAF---YCGTACQRSEWKLHRLECEVLSSLHKYKRKSLTPSIRLMLRLYLRRKLQ 129

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPPPASTTIELTAA 174
              +I +   A+ N +  + L+A    +T+         A+ ++S+   P     E+ A 
Sbjct: 130 NDKIIPS--TAMDNYNLVEALVAHMSDITEEQLVLYAQMANLVNSILEWPGINIKEI-AE 186

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
             +K   NA  +        D        G+Y   S  NH CLPN+             +
Sbjct: 187 NFSKFACNAHTIC-------DSELRPVGTGLYPVISIINHSCLPNS-------VLVFEGS 232

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             +VR +  +P G E+ +SY        TRQK L   Y F C C RC     + D
Sbjct: 233 SALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDD 287


>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
          Length = 385

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 60/198 (30%), Positives = 91/198 (45%), Gaps = 27/198 (13%)

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
           R++  + L   Y      P  F+ LLA  G  +    ++          P S  ++  +A
Sbjct: 204 REMMKKALNIEYTAHWFTPDGFKSLLALVGMNSQGIGTS----------PFSRWVKNVSA 253

Query: 175 L-LAKD-RLNAFGLMEP-YIEGQDGRRSV---RAYGIYLKASFFNHDCLPNACRFDYVDA 228
           L + KD R+    L++  Y E ++   S       G+Y+  S  NH C+PNA     V+ 
Sbjct: 254 LDMHKDQRIFIDKLIDRIYDEMEEVVGSFLDNEGSGLYILQSAVNHSCMPNAS----VEF 309

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
              NNT +++R I D+    EIC SY     +     +RQK L   Y F C CD+C+ +A
Sbjct: 310 PYSNNT-LVLRAIRDIQPEEEICTSYLEECELERSRYSRQKALSSLYLFVCHCDKCQSQA 368

Query: 286 NWSDNDNDDENNEEVMDE 303
           N  D  +D+   EEV DE
Sbjct: 369 NDPDMTSDE---EEVEDE 383


>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
          Length = 530

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 115/282 (40%), Gaps = 44/282 (15%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           +GRG+ ST+S   G ++L  +P +L  +L  IN  + C +CF++    + S C   +V++
Sbjct: 14  KGRGIFSTKSFAPGDVILPFTPTILIPSLSHIN--TICSHCFKQAEVRACSRCH--AVSY 69

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
           C   C  A  ++ HS   C+ L ++     P    LP          I A   A+V P  
Sbjct: 70  CDAACQAANWTAVHSKE-CKVLRKVTEQGRP---GLPTP--------IRAVVQALVKPEI 117

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAST---------TIELTAALLAKDRLNAFGL 186
              +   +G       S       +    AS           ++    +L K + NAF  
Sbjct: 118 GNAIEDLEGNAASWRKSEKWADMEMMAMGASAFAGLGTGQEDVQKALTILCKIQTNAFHR 177

Query: 187 MEPYIEGQDGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            +  + GQ         GI+L  K +  NH C+PNA    ++   A      I+R    +
Sbjct: 178 YDTDL-GQ--------VGIFLGTKLAMANHSCIPNAM-VQFIGRRA------ILRAEKPI 221

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
               EI +SY    +  S R K  L  Y F C C RC+ + N
Sbjct: 222 KIDDEIEISYTDYTFPLSNR-KLALAPYFFDCMCLRCEKDLN 262


>gi|452982319|gb|EME82078.1| hypothetical protein MYCFIDRAFT_77623 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 709

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 11/125 (8%)

Query: 186 LMEPYIEGQD-GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
           + +PY  G+     +V + G+++K S+ NH C+PNA      D        ++V+   ++
Sbjct: 456 IAKPYPFGRSYAGDAVGSSGLWIKTSYINHSCIPNAEPSVIGDL-------MVVKATRNI 508

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
            QG EI +SYF  + DY TR +++   + F C C+ C  EA  S  D   E   ++ D  
Sbjct: 509 TQGEEITISYFN-DIDYQTRSRKIKSSWGFDCTCELCTAEAATSPADM--EERIQLRDLI 565

Query: 305 QDEQM 309
            ++QM
Sbjct: 566 TEQQM 570


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 114/311 (36%), Gaps = 56/311 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRALAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           VA +CS KC   A         C+ L        PD+  L L R            L   
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVVKLMEE 115

Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAALL--------- 176
            PS+ + L +F         +T+        L+          I+  + LL         
Sbjct: 116 TPSESEKLYSFYDLESNISKLTEEKKEGIRQLAMTFQHFMREEIQDVSQLLPAFDIFEAF 175

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           AK   NAF +    ++G          G+Y   S  NH C PN        +   N   +
Sbjct: 176 AKVFCNAFTICNAEMQGV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPHL 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-E 295
           ++R + D+  G E+ + Y  +      R+K L   Y F CDC RC+ +    D D D   
Sbjct: 222 LLRAVRDIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRCQTQ----DKDADMLT 277

Query: 296 NNEEVMDEDQD 306
            +E+V  E Q+
Sbjct: 278 GDEQVWKEVQE 288


>gi|297806681|ref|XP_002871224.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317061|gb|EFH47483.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 325

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/308 (23%), Positives = 121/308 (39%), Gaps = 62/308 (20%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V   E  GR + +T+ + AG ++    P++   +L  +   S C+ C +K M S+   
Sbjct: 44  IRVGLTESAGRAVFATRRIGAGDLIHTAKPVVACPSL--LKLDSVCYLCLKKLMGSAKF- 100

Query: 68  CPSCSVAFCSPKCSTAAAS-------SSHSPYVCQALTRLLNLNSPDAAN---------- 110
                V++CS +C   +         +  S +V     R  N   P              
Sbjct: 101 -EDRGVSYCSQECQENSKGFLDVERRADWSSFV--DYCRTHNFKYPLMVKRLCCMIISGV 157

Query: 111 LPLD-RQLQARFLIAAYNLAVVNPSQFQILLAF-QGTVTDNDTSAAHYLSSLCPPPASTT 168
           LP D   +     +++  ++ +      +  AF +  + D+D S               T
Sbjct: 158 LPADCLDILQPAALSSEMISKIEDGYGLLWNAFRKANLKDDDVSF-------------LT 204

Query: 169 IELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKASFFNH 214
            +   A+LA+ R+NAF               L    +EG+        + +Y+  SF+NH
Sbjct: 205 KQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPSFYNH 260

Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
           DC PNA      +A A  NT      + DV +G E+ + Y   +  Y  RQ  L   + F
Sbjct: 261 DCDPNAHIIWLQNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQGFGF 314

Query: 275 ACDCDRCK 282
            C+C RC+
Sbjct: 315 LCNCLRCQ 322


>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
          Length = 526

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 114/282 (40%), Gaps = 43/282 (15%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
           + R L +TQ  K  QI+    P++L+ +L  +  +S C +C R     S   C  C  A 
Sbjct: 12  KHRALHATQPFKPSQIIHTFHPLILHPSLSQL--ASVCTHCLRP---GSPRACSRCHAAY 66

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
           +C+  C  AA ++ HS   C+AL +   L S   A+LP          + A    ++N  
Sbjct: 67  YCNTSCQQAAWTAVHSKE-CKALQQ-RKLGSRTGADLPTP--------VRALLQTLLNKE 116

Query: 135 QFQILLAFQGTVTDNDTSAA--------HYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
               + A  G   +   + +            +       + I     LL K + NAF  
Sbjct: 117 TEDGIAALDGHTEERRKTKSWPDLEMMAMAACAFAGRQGESNIRRAVELLCKIQTNAFHR 176

Query: 187 MEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            +  + GQ         G++L+ +    NH C+PNA    +V   A      I+R    +
Sbjct: 177 WDVDL-GQ--------VGVFLEPTLAMANHSCVPNAV-VQFVGRKA------ILRAERPI 220

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             G EI +SY       STR +  L  Y F C C RCK + N
Sbjct: 221 HAGDEIEISYTDYTMPLSTR-REALEQYSFECTCARCKGDLN 261


>gi|440638927|gb|ELR08846.1| hypothetical protein GMDG_03520 [Geomyces destructans 20631-21]
          Length = 539

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 117/281 (41%), Gaps = 27/281 (9%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G   G  +T+S+     VL DS +   ++     S+  C  C         + C  CSV 
Sbjct: 269 GDVLGAFATKSIAPSDTVLVDSTVAGVTS-----SAERCPTCCGGITEDILNDC--CSVP 321

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP----LDRQLQARFLIAAYNLAV 130
           +CSP CS  A  + H+  VC      L  ++ ++A L     LD  L  R L  +     
Sbjct: 322 YCSPDCSKTALDTFHAS-VCGKDFGFLT-SAAESAELTTDLSLDSVLLLRVLALSMQEPT 379

Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
           ++P +  +L     T  ++D    ++   +  P      EL   + A    + + L    
Sbjct: 380 LHPLKTSLLNRLTPTYKNDDLIIFNFSDHIVTP-IRVLGELGIDVFANAAYDTWVLHTIR 438

Query: 191 IEGQDGRRSVRAYGIYLKA-----SFFNHDCLPNACRFDYVDAAAEN-NTDIIVRMIHDV 244
              Q+ +      G    A     S FNH C PN      VD   E+ N+ + +  + DV
Sbjct: 439 CRLQNNKHGQTLDGFPGTAVNPLYSMFNHSCAPN------VDWEHEDSNSTLRMFALRDV 492

Query: 245 PQGREICLSYF-PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            +G E+ +SY  P+  DY+ RQ+ LL      C+C+RCK E
Sbjct: 493 NEGEELFISYIRPLTMDYAERQESLLPWLGMECECERCKAE 533


>gi|409042906|gb|EKM52389.1| hypothetical protein PHACADRAFT_148986 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 436

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/310 (25%), Positives = 127/310 (40%), Gaps = 48/310 (15%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMH-------SSS 65
           +G+GLV+T+ +  G++V ++ P +L   +S      +S  C +C     +       S S
Sbjct: 105 KGKGLVATERIAEGEVVWKEDPFILAPEWSLYDLQTASVACTHCSTPLNNSPLAVPCSGS 164

Query: 66  SICPSCSVAFCSPKCSTAAASSSHSPYVC-----QALTRLLNLNSPDAANLPLDRQLQAR 120
           +  PSCS  FC+  C + AA +   P +C      AL  L      +   L    Q  AR
Sbjct: 165 AKVPSCSARFCNRLCLSRAART--HPLLCPAQNPAALPLLEFARRHEWMALHALAQCTAR 222

Query: 121 FLIAAYNLAVVNPSQFQIL--LAFQG------------TVTDNDTSAAHY---LSSLCPP 163
            L+A         + + +L  LA  G               D DT    Y   + +   P
Sbjct: 223 VLLAEQQDEAAFSADWAVLRGLAQLGMEERAKGGWLGSAEPDRDTWKKAYELFVRAFKDP 282

Query: 164 PASTTIELTAALLAKDRLNA-----FGLMEPYIEGQDGRRSVRAY---GIYLKASFFNHD 215
              +  +  A LL K          FG  + ++ G  GR S+      G+Y+  S  NH 
Sbjct: 283 GTESEKKRLARLLRKPTRKEVDDEIFG-YDAFLRGL-GRMSLNLEAHGGLYVLHSHVNHS 340

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
           C PN         +A +   ++VR   D+  G E+ +SY   +     R+++LL ++ F 
Sbjct: 341 CTPNLSARHLDQRSALSRITVVVR--RDIEVGEELFISYVNPDLPLEGRRRQLL-EWGFG 397

Query: 276 -CDCDRCKVE 284
            C C RC +E
Sbjct: 398 TCQCPRCVME 407


>gi|195126158|ref|XP_002007541.1| GI12333 [Drosophila mojavensis]
 gi|193919150|gb|EDW18017.1| GI12333 [Drosophila mojavensis]
          Length = 661

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 131/331 (39%), Gaps = 78/331 (23%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  ++ V + E + +GR +V+ Q LK G +VL + P+       F  ++  CH+CF++ 
Sbjct: 250 LSGAAKVVKLVETKDKGRFVVADQGLKTGDVVLAEDPVAACLLPNFFGTN--CHHCFKR- 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQ----------------ALTRLLNL 103
           +H+    C  CS +AFCS +C   A  + H  + CQ                AL      
Sbjct: 307 LHTPVP-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCFIALRIFTQA 364

Query: 104 NSPD----AANLPL------------DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVT 147
            S D     ANL              D  LQ R L+A + + V+  +Q+     F    T
Sbjct: 365 PSLDDGLATANLLFEHLCSHQDVREADDHLQ-RSLMAGFLMRVLQKAQY-----FGRRKT 418

Query: 148 DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY---------IEGQDGRR 198
           +               P +  +++  A+L   +L  +   E Y          EG   + 
Sbjct: 419 EGVN------------PTAVELQVATAMLGLLQLLQYNAHEIYHTMVTDEHCFEG--CKV 464

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV- 257
                G+Y   S+FNH+C P+   + +V         +++           + ++Y P+ 
Sbjct: 465 VYVGAGLYGTGSYFNHECWPSVAGY-FV------GKKLVMSATKPHRPNEIVAVNYGPIF 517

Query: 258 -NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
              +   RQ+ L   Y F C+C  C  + NW
Sbjct: 518 TKMNLKERQRSLRGRYAFNCNCMTC--QENW 546


>gi|408392117|gb|EKJ71478.1| hypothetical protein FPSE_08348 [Fusarium pseudograminearum CS3096]
          Length = 358

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 63/155 (40%), Gaps = 22/155 (14%)

Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKA 209
           YLS     P   T +    LL K+  N FGL       Q        R     + +Y +A
Sbjct: 212 YLSDASVLPCMLTADEMLLLLCKEETNTFGLYPKATGTQFVNDNATSRGESYGFALYPRA 271

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKR 267
           + FNH CLPN      V    +    ++     D+ +G E  ++YF      D S+RQK 
Sbjct: 272 AQFNHSCLPN------VSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQKY 325

Query: 268 LLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMD 302
               + F C C+RC  E        + E N ++MD
Sbjct: 326 AQTQFQFKCTCNRCLEE--------EAEENMDIMD 352


>gi|118781832|ref|XP_311885.3| AGAP002999-PA [Anopheles gambiae str. PEST]
 gi|116129279|gb|EAA07926.3| AGAP002999-PA [Anopheles gambiae str. PEST]
          Length = 664

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 118/319 (36%), Gaps = 74/319 (23%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSA-LPFINSSSFCHNCFRKTMHSSSSICPSC-SV 73
           RGR +V+   L  G+++L +     Y+A L      + C  CF + +  +   CP C  V
Sbjct: 267 RGRYVVAAADLGPGEVILTEPA---YAACLHAKYYGTHCSACFSRLI--APVACPDCCGV 321

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-------------- 119
           AFCS  C   A ++ H  + CQ L  ++        ++ L    QA              
Sbjct: 322 AFCSVACRDKACATYHR-FECQYLDLMIGSGMSILCHVALRMVTQAGTPEKVLEEGKMLR 380

Query: 120 -----------------RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
                            R L+ A+ L  +  ++F     F    T+             P
Sbjct: 381 DTFCAHTEHRDPEDHFKRTLMTAFLLRCLQKAEF-----FGRRTTE------------AP 423

Query: 163 PPASTTIELTAALLA---KDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHD 215
            P    +E+ A LL+     + NA  + E  I G+    + +      GIY  AS FNH+
Sbjct: 424 EPTEQELEVGAVLLSALQSLQFNAHEVYETRITGEHRFDTAKVQYIGVGIYRGASMFNHE 483

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYD 273
           C P   R           T +I+     +P G  +  +Y P  +    + RQ+ L   Y 
Sbjct: 484 CYPGVTRTFL-------GTAMILHTSRPIPAGAVVPENYGPHFMRQPKAIRQRNLRSRYW 536

Query: 274 FACDCDRCKVEANWSDNDN 292
           F CDC  C    +W   D 
Sbjct: 537 FKCDCRACA--EDWPQMDK 553


>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 489

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 115/289 (39%), Gaps = 27/289 (9%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
            E V + +   +GRGL +++    G++V  +     YSA+ F +S +F  CH CFR+   
Sbjct: 6   KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
           +    C  C  A +C+  C TA          C A+ +     S     AA L L R  +
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              L +   L  V+  Q  +    Q  +    T                ++E  + +   
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
            + N F L +     Q G ++V   G++   +  NHDC PN            V +A  +
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNCTAILNHGNHSAVSSALHS 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              I VR +  + +G E+ +SY       + RQK+L   + F C C  C
Sbjct: 231 PRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 279


>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 662

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/308 (26%), Positives = 117/308 (37%), Gaps = 90/308 (29%)

Query: 9   TVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
           TV    G GRGL + +  KAG++++                                   
Sbjct: 381 TVKSSPGAGRGLFAARDFKAGELIM----------------------------------- 405

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
             C  AFC     T ++    SP V  ALT           N+  D  +  R   A  + 
Sbjct: 406 --CEKAFC-----TVSSKDKASPAVT-ALT----------VNIGQDYSI--RVFPAGLHR 445

Query: 129 AVV-----NPSQFQILLA-----FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
           AVV     NPSQ   +L      +QG     +T A     S    P   T ++   +   
Sbjct: 446 AVVQKLLNNPSQASKVLGLDSGDYQGV---GETGA-----STAEGPVVDTFQVHNIV--- 494

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
            + NAFGL     +  D   S    G++ +AS+ NH C+PN+ +    D A +    I+V
Sbjct: 495 -QRNAFGLGP---QSADEDVSNATTGLWARASYLNHSCMPNSVK----DFAGDL---IVV 543

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNE 298
           R +  +  G EI  +Y   N DY  RQ  L   + F C C  C  EA  +D D       
Sbjct: 544 RALQHIRTGDEITHAYQD-NGDYDARQALLQTTWGFTCRCKLCVAEA--ADGDEVRVKRR 600

Query: 299 EVMDEDQD 306
           E+M E ++
Sbjct: 601 ELMKEAEE 608


>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
          Length = 490

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 67/279 (24%), Positives = 112/279 (40%), Gaps = 41/279 (14%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP------S 70
           GRGL +T+ ++AG+++  D P+L+     +      C  C++         CP       
Sbjct: 36  GRGLFATRDIQAGELIFTDVPLLM-GPRCYNKHLPMCVVCYKNN-------CPLFPCDYG 87

Query: 71  CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
           C +  CS +C  +A    H    CQ L       +P   +      L    LI    L+ 
Sbjct: 88  CGLPICSAECENSAM---HVQGECQFLREW----APTCGSTWSKDLLLVVVLIRGLALS- 139

Query: 131 VNPSQFQILLAFQGTVTDNDTSAAHYL--------SSLCPPPASTTIELTAALLAKDRLN 182
               Q  +L AF+     N     H           ++   P+   ++L   +      N
Sbjct: 140 --EEQRTLLYAFECHANLNRNYEVHQFILKIDLLKRNVANLPSEEQMKLMQRVCDVFNTN 197

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           +F ++ P     D   S+R  G+Y   +  NH C+PN+ R  + D        + V    
Sbjct: 198 SFEVVVPS-RNNDHTMSLR--GLYPMGALQNHCCVPNS-RHHFDDQQR-----LHVSAAL 248

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +  G EI +SY  + +D STR++ L     F+C+C+RC
Sbjct: 249 PIAAGEEITMSYTDLLWDTSTRRQFLRITKRFSCNCNRC 287


>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 478

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 115/289 (39%), Gaps = 27/289 (9%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMH 62
            E V + +   +GRGL +++    G++V  +     YSA+ F +S +F  CH CFR+   
Sbjct: 6   KENVRLFDAGAKGRGLKASKEFSTGEVVFAEPS---YSAVVF-DSLAFQVCHCCFRR--Q 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD---AANLPLDRQLQ 118
           +    C  C  A +C+  C TA          C A+ +     S     AA L L R  +
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACWEEHKQE--CAAIRKSGKAPSESVRLAARL-LWRLHK 116

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              L +   L  V+  Q  +    Q  +    T                ++E  + +   
Sbjct: 117 DNGLASDSQLVSVDQLQEHVDQLSQEQLQQLQTDVRVLQDYWSYGSKQHSVEYISHIFGI 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAEN 232
            + N F L +     Q G ++V   G++   +  NHDC PN            V +A  +
Sbjct: 177 IKCNGFTLTD-----QRGLQAV-GVGLFPNLALVNHDCWPNCTAILNHGNHSAVSSALHS 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              I VR +  + +G E+ +SY       + RQK+L   + F C C  C
Sbjct: 231 PRRIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 279


>gi|356533089|ref|XP_003535101.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Glycine
           max]
          Length = 313

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 69/304 (22%), Positives = 119/304 (39%), Gaps = 60/304 (19%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + VA  E  GR + +T+ + +G ++   +P + + +    ++ + C++C     HS S  
Sbjct: 36  IRVALTESVGRAVFATRPIASGDLIHTATPTVCHPSSS--SARAACYSCLAALPHSQSQG 93

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
            P     FCS +C       S   Y  +     +  N     +    R L+  FL+    
Sbjct: 94  IP-----FCSQRCH----QRSKGYYDVEMKANWVAFN-----DYCWTRGLKYPFLVKRLV 139

Query: 127 -----------NLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
                       L ++ P+           +  L  +   T    +  H         A 
Sbjct: 140 CMVISGDARSDTLDILQPANLTPEMVLKMEEEFLLLRNAFTKALIADEHI--------AF 191

Query: 167 TTIELTAALLAKDRLNAF------GLMEPYIEGQDGRRSVRAY---GIYLKASFFNHDCL 217
            T +    +LA+ R+NAF      GL E  +          A     +YL  SF+NHDC 
Sbjct: 192 LTKQWYINILARIRINAFRIELAGGLYEDLLASAVASVEAEAAVGNAVYLLPSFYNHDCD 251

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
           PNA    ++D     N D  ++ + D+ +G E+ + Y   + D + RQ+ L   + F C+
Sbjct: 252 PNA-HIIWID-----NADAKLKALRDIVEGEELRICYIDASLDRNARQELLSRGFGFQCN 305

Query: 278 CDRC 281
           C RC
Sbjct: 306 CSRC 309


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/288 (20%), Positives = 111/288 (38%), Gaps = 31/288 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           + V V +  G+GRGL +T+ L +G ++  + P   ++A+ F + +   CH+CFR+     
Sbjct: 2   DNVEVFDSPGKGRGLRATKELLSGDVLFSEPP---FAAVVFDSLADKICHSCFRR--QDK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDAANLPLDRQLQA 119
              C  C  A +C   C +A          C A+        N       N+ L  ++  
Sbjct: 57  LQRCGQCKFAQYCDKTCQSAGWKEH--KLECGAIKAFGAIGDNFGKAPNENIRLAARIMW 114

Query: 120 RF-----LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
           R      +++   +  ++  +  I    +  + +      ++L           ++  + 
Sbjct: 115 RLDKEGGVMSDMQMTSLDDLENHISDMPEDDLKELKVDIHNFLDYWPRNSKQHRVDDISH 174

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           +      N F      +  Q G  +V   G++      NHDC PN            N+ 
Sbjct: 175 IFGVINCNGF-----TVSDQRGLNAV-GVGLFPNLCLVNHDCWPNC-------TVILNHG 221

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I +R +  +  G E+ +SY         RQ++L   Y F C C+ CK
Sbjct: 222 KIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHCK 269


>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 442

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 46/96 (47%), Gaps = 4/96 (4%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           D  RS R  GIY     FNH C PN      V  ++ N+  + V  + DV  G E+ +SY
Sbjct: 350 DLLRSCRGQGIYEVGCLFNHSCDPNLS----VQYSSLNDETLTVVALRDVKAGEELTISY 405

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
              +  ++ RQ++LL  Y F C C RC  E     N
Sbjct: 406 IDSSLPFAVRQQQLLDHYLFECRCPRCVAEGTTDAN 441


>gi|254583928|ref|XP_002497532.1| ZYRO0F07722p [Zygosaccharomyces rouxii]
 gi|238940425|emb|CAR28599.1| ZYRO0F07722p [Zygosaccharomyces rouxii]
          Length = 386

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 72/158 (45%), Gaps = 27/158 (17%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYI 191
           S+F ILL+FQ  V          L  L PP   + + +     +L  +  NAFG+ +  +
Sbjct: 244 SRFPILLSFQIAVYQT-------LYILLPPTLHSQLSIPKFRHILGAEYGNAFGIWQ--M 294

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
               G R    Y ++ +ASFFNH C PN      VD   +      + +  DV  G ++C
Sbjct: 295 GESVGDREYLGYWLFPRASFFNHSCDPN------VDKVRKGRKMCFI-LNRDVTSGTQLC 347

Query: 252 LSY-----FPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           ++Y      PV    S RQ+ +  ++ F C CDRC  E
Sbjct: 348 IAYNCDLTLPV----SGRQQTMKDNWFFECMCDRCSSE 381


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 120/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +  +L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTALRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGTPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
 gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
          Length = 499

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/271 (25%), Positives = 111/271 (40%), Gaps = 30/271 (11%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC-PSCSVAF 75
           GRG+++T++LK G+I+ +D P+L+  A    +S + C  C  K +  +  +C   C +  
Sbjct: 34  GRGVMATRNLKRGEIIFQDMPLLIGLAAQEEDSLNACSICL-KLLPDTRFMCRQGCGL-- 90

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
             P CS       H       + +    N PD AN  + R L    +  A NL      Q
Sbjct: 91  --PICSLCGKKRQHKTDC--DMFKSWGPNEPDVANSVIIRLL---CVARAINLT---KDQ 140

Query: 136 FQILLAFQGTVTDNDTS----AAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLMEPY 190
             ++   Q  + +N  +    AA    +   P     +E+    +A  R N F    +  
Sbjct: 141 RDLIYCLQANLDNNHRTEVRNAAKCFKNF--PTDKKLVEIMNRTVAVLRTNGFDKTTDRC 198

Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
            + Q+   S RA  +Y      NHDC+PN+          E   ++IVR   D+ +G EI
Sbjct: 199 TDNQEF--SYRA--LYPLFGVMNHDCIPNS-----YYTFEEKTNNMIVRAAVDILEGEEI 249

Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +Y  +      R   L     F C C RC
Sbjct: 250 TTTYTKLFTGNIARHLFLKMKKGFTCKCPRC 280


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 70/332 (21%), Positives = 131/332 (39%), Gaps = 28/332 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V + +  G+GRGL + +   AG ++  ++ +   +A+ F + +   CH+CFR+     
Sbjct: 2   ENVVIFDSPGKGRGLKAAKEFWAGDVIFSEASL---AAVVFDSLAERVCHSCFRR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
              C  C  A +C   C  A  A         +A  ++ N N    A + L R  +    
Sbjct: 57  LQRCGQCKFAHYCDRTCQRAGWAEHKQECSAIKAYGKVPNENIRLVARI-LWRLDKDGST 115

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
           ++   L  ++  +  I    +  + +      ++L          T++  + +      N
Sbjct: 116 VSDMQLTTLDELEDHITDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDISHIFGVINCN 175

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDI 236
            F      +  Q G ++V   G++      NHDC PN            V+    +   I
Sbjct: 176 GFS-----VSDQRGLQAV-GVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRI 229

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK------VEANWSDN 290
            +R +  + +G E+ ++Y         RQ+ L   Y F C C+ CK      ++    + 
Sbjct: 230 ELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREV 289

Query: 291 DNDDENNEEVMD-EDQDEQMVASDDDAEVHGD 321
           D    + E+V +  D   QM+   D A ++GD
Sbjct: 290 DGVKPSEEQVKEATDYCYQMLEKMDKARLNGD 321


>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
 gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
          Length = 532

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 126/297 (42%), Gaps = 52/297 (17%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS--- 66
           VA  E  GR L +T+ +K G+ +LR++P+LL    P + S+  C  C R  +        
Sbjct: 50  VATNETLGRHLRATRDVKIGEEILREAPLLLG---PKVASAPICLGCHRNLLAPQKQRGN 106

Query: 67  --ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--- 121
              C SCS   C  +C      SSH    CQ ++   N  S        D + ++ +   
Sbjct: 107 YYKCSSCSWPLCGRECE----ESSHHRAECQLMSG-SNFQSKINYTPGEDERKESAYCVI 161

Query: 122 -LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH---------YLSSLCPPPASTTIEL 171
            L+    L   +P  F  L A +  + +   +  +         ++ ++      + +++
Sbjct: 162 MLLRCMQLKASDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRDWSEVDI 221

Query: 172 --TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFDYVDA 228
              AA+L     N F L +P       RR VRA  ++  A+  +HDC PN   RFD    
Sbjct: 222 LRIAAILDT---NTFELRQPR-----ERRKVRA--LFPGAAMISHDCAPNMRHRFD---- 267

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYD--FACDCDRCK 282
              ++ +II      + +G  + +SY  P+    ST Q+RL       F C CDRC+
Sbjct: 268 ---DDMNIIFLAKRPIAKGEILTISYTQPLR---STIQRRLHLRQAKCFDCACDRCQ 318


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/320 (22%), Positives = 114/320 (35%), Gaps = 50/320 (15%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
           RG GL ++ +L+ G ++ R  P L Y+     +  + C  C           C  C VA 
Sbjct: 14  RGNGLRASAALRPGDLLFRSDP-LAYTVCKG-SRGAVCDRCL--LGKEKLLRCSQCRVAK 69

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------------PLDRQLQARFL 122
           +CS KC   A         C+ L        PD+  L            P + +    F 
Sbjct: 70  YCSAKCQKKAWPDHKRE--CKCLKNCRPRYPPDSVRLLGRVVCKLMEGTPSESEKLYSFY 127

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
               N+  +   + + L     T  D         S L  PP+    E  A ++     N
Sbjct: 128 DLESNINKLTEDKKEGLRQLAMTFQDFMREEIQDASQL--PPSFDIFEAFAKVIC----N 181

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           +F +        D        G+Y   S  NH C PN        +   N   +++R + 
Sbjct: 182 SFTIC-------DAEMQEVGVGLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRAVR 227

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-----------EANWSDND 291
           D+  G E+ + Y  +      R+K+L   Y F CDC RC+            E  W +  
Sbjct: 228 DIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCRTQDKDADMLTGNEQAWKEVQ 287

Query: 292 NDDENNEEVMDEDQDEQMVA 311
              +  EE+    + EQ++A
Sbjct: 288 ESLKKIEELKAHRKWEQVLA 307


>gi|449524736|ref|XP_004169377.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
           sativus]
          Length = 346

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/292 (20%), Positives = 117/292 (40%), Gaps = 22/292 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  +  GRG+ +T+ + AG+++    P++ + +L  I+    C+ C +K    ++  
Sbjct: 60  IRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHH--VCNFCLQKLQRYANVD 117

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
             +   +FCS +C   +    H   +    +   N         PL  +  A  +I+   
Sbjct: 118 SDARRASFCSEECEQHS-KVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGAM 176

Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA-STTIELTAALLAKDRLN 182
              +L ++ PS+    +  +     +    A   +++        T E    +LA+ R+N
Sbjct: 177 SSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRIN 236

Query: 183 AFGL--------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           AF +        +           +     +Y+  SF+NHDC PN            NN 
Sbjct: 237 AFRIELAGGYEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPN------THIIWINNA 290

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           +  ++ + DV    E+ + Y   + DY  RQ  L   + F C+C RC    N
Sbjct: 291 NAKLKALRDVDPDEELRICYIDTSMDYDARQTLLHRGFGFICNCARCSYGIN 342


>gi|348688194|gb|EGZ28008.1| putative SET-domain transcriptional regulator protein, Zn-binding
           site [Phytophthora sojae]
          Length = 457

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 2/80 (2%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYS 262
           G++L  + FNHDC PN  +F   DA AE      VR+   + +G ++ +SY +P      
Sbjct: 205 GVFLYNAMFNHDCNPNCVKFTPEDAGAEGGVS-EVRVARPIAKGEQLTISYLYPREQSRE 263

Query: 263 TRQKRLLYDYDFACDCDRCK 282
            RQK L   + F C C+ CK
Sbjct: 264 NRQKNLWEQFGFECSCELCK 283


>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 367

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 62/301 (20%), Positives = 117/301 (38%), Gaps = 53/301 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK---------------T 60
           RGRGL +   +++GQ++  + P + Y +L  I++ + C++C +                 
Sbjct: 68  RGRGLFTKLDVQSGQVIYTEKPFVSYQSLE-IDNKTICNHCLKSLDKDKIEKSKQKKDDD 126

Query: 61  MHSSSSICPSCSVAFCSPKCSTAAASSSH--SPYVCQALTRLLNLNSPDAANLP-LDRQL 117
                S C  CS  +CS +C + +    H  S       + +L   + +    P L  ++
Sbjct: 127 FDDLFSKCVKCSTRYCSEQCKSESDIQYHLASCPSTSGFSEILKYTAVEKRRFPILAAKI 186

Query: 118 QARFLIAAY---------NLAVVNPSQFQILLA-------FQGTVTDNDTSAAHYLSSLC 161
            AR L+  +         NL V++ ++    L        FQ  +   +++   +     
Sbjct: 187 LARILLGYHFQKNMEHWENLQVLSFAKRDPPLEWKDDYNLFQRALLTKESNKKRF----- 241

Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
                   +    ++    +N  G+     E +    S    G++   SF NH C PN  
Sbjct: 242 ------NYDWFVRVMQILYINTLGIEVGATEPKAASTS-SGIGLFYLTSFINHSCDPNCY 294

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                D  A       +  +  +  G E+ ++Y   N DY  RQ  L  +Y F+C+C +C
Sbjct: 295 LAFPTDHTAH------LTALKPLKAGDELLIAYGDPNKDYIDRQSHLFDNYGFSCNCSKC 348

Query: 282 K 282
           +
Sbjct: 349 Q 349


>gi|302853821|ref|XP_002958423.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
 gi|300256228|gb|EFJ40499.1| hypothetical protein VOLCADRAFT_99704 [Volvox carteri f.
           nagariensis]
          Length = 729

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 18/110 (16%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++ + +  NH C PNA      D        ++VR    +P+GRE+ L++       + 
Sbjct: 616 GVWPEVALINHSCGPNAVAVMLYD-------RLVVRATRRIPRGREVLLNWLGSQGALAP 668

Query: 264 ---RQKRLLYDYDFACDCDRCKVEAN--------WSDNDNDDENNEEVMD 302
              R+ +L   YDFAC C RC++E +        +SD +   E   E M+
Sbjct: 669 ARERRAQLSQMYDFACRCARCRLEGSEAVSLPFLYSDAERGSEEALEAME 718


>gi|72392603|ref|XP_847102.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175616|gb|AAX69749.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70803132|gb|AAZ13036.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 399

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/354 (22%), Positives = 125/354 (35%), Gaps = 88/354 (24%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRK--------- 59
           VA    +G+ +++T  + A   V  ++P++ + +  ++     FC  C R+         
Sbjct: 54  VANTVEKGKHILATADMPAQYSVFEEAPVVSWPSQGYLTLGVPFCLFCLRQRNNADIADE 113

Query: 60  TMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL------ 113
             H+ +S C  C   FCS  C  A+A   H   +C  L  L      D    P+      
Sbjct: 114 GEHNWNS-CAGCGSLFCSESC-LASAEMPHR-VLCGVLQELRKRECTDGCCRPITKESLA 170

Query: 114 ---------------DRQLQARFLIAAY--NLAVVNPSQFQILLAFQGTVTDNDTSA--- 153
                           RQ   + L   +  N   ++   F ++ A      D   +A   
Sbjct: 171 NCVAWVVGRIAGAIKQRQFSGKLLEENHRENANSISRQLFHVVTAPFNRFLDAPKNAEFA 230

Query: 154 --------------------AHYLSSLCPPP---ASTTIELTAALLAKDRLNAF-GLMEP 189
                               A  L S   PP   A   +E+   LL  D L  F G++  
Sbjct: 231 DVDANSWYEEIDKLLREPCRAVLLQSAAAPPSIGADWALEIVDGLLRHDTLERFLGILTL 290

Query: 190 YIEGQDGRRSV------------------RAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
             +G +G  +V                  +  GIY   S FNH C+PN      V   AE
Sbjct: 291 NSQGLNGFVAVPGAAGDMDSPSLPITWVLKGGGIYSLQSAFNHSCVPN------VAVLAE 344

Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             T DI +R +  +  G E+ ++Y PV       ++  L  Y F C C  C+ E
Sbjct: 345 GGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQMKLEGYFFTCRCPLCEEE 398


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRALARLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVIFKLMEETPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q++L FQ  + +    A+        PP+    E  A 
Sbjct: 124 YSFYDLESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQL------PPSFDIFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 115/300 (38%), Gaps = 51/300 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-CPSCSVA 74
           RG GL +   L+ G+++ R  P L Y+    ++  S    C R  +     + C  C VA
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYT----VSKGSRGVVCDRCLLGKEKLMRCSQCRVA 68

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLIA 124
            +CS KC   A         C+ L        PD+  L L R +         ++  L +
Sbjct: 69  KYCSSKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEGSPSESEKLYS 125

Query: 125 AYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
            Y+L                Q+++ FQ  + +    A+        PPA    E  A ++
Sbjct: 126 FYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDIFEAFAKVI 179

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
                N+F +    ++           G+Y   S  NH C PN        +   N   +
Sbjct: 180 C----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SVVFNGPHL 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           ++R + DV  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 222 LLRAVRDVEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGDER 281


>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 359

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 48/94 (51%), Gaps = 4/94 (4%)

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           E Q  R+S R  G+Y     FNH C PN  +  Y ++  E  + + +R   D+  G E+C
Sbjct: 247 ELQHLRQSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---DIEPGEELC 302

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           +SY   +  Y  RQ+ L   Y F C C +C+ EA
Sbjct: 303 ISYIDESLSYPERQQELYEHYLFFCQCPKCEREA 336


>gi|350632678|gb|EHA21045.1| hypothetical protein ASPNIDRAFT_45712 [Aspergillus niger ATCC 1015]
          Length = 367

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/320 (22%), Positives = 121/320 (37%), Gaps = 70/320 (21%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI----NSSSFCHN--CFRK-TMHSSSSIC 68
           +GR L +   ++AG++++ D P   Y+ LP +       S C N  C R+ + H+++S+ 
Sbjct: 62  KGRTLSTNGPIRAGELLMADLP---YAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118

Query: 69  --PSCS--VAFCSPKCS---------TAAASSSHSPYVCQAL----------------TR 99
               CS  + +C+  C            A    +SP + Q L                 R
Sbjct: 119 CETGCSLDIWWCNESCKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAAR 178

Query: 100 LLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN-------PSQFQILLAFQGTVTDNDTS 152
            L L      N+      Q RF+     + ++         SQ Q   A           
Sbjct: 179 YLELQGTTPTNVHQQLTFQDRFVSGWKGVQLLRSNRDSWPTSQLQHWSAL---------- 228

Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIY 206
           A  YL      P +  ++    L+  +  N F L      + P+      R +     ++
Sbjct: 229 AEKYLLQSSLLPETLDLDTLVDLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVF 288

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTR 264
           L+ +  NH C PN      V   A++   ++V  + D+  G E C SYF ++   D   R
Sbjct: 289 LRITLANHSCAPN------VTHQADDRGRMMVTALRDIAPGEECCTSYFDLSEYVDLQAR 342

Query: 265 QKRLLYDYDFACDCDRCKVE 284
           +K+    + F C C RC  E
Sbjct: 343 RKKTQELFTFTCTCPRCLQE 362


>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
 gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 521

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 15/111 (13%)

Query: 175 LLAKDRLNAFGLMEPYIEGQ--DGRRS-VRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
           +L ++R NAF     YI+G+  DG     R  G+Y++ SFFNH C PN      V+    
Sbjct: 333 ILLRNRCNAF-----YIQGRPRDGSSGESRGCGVYVRNSFFNHSCDPN------VNYWVV 381

Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           NNT ++   ++ +V +G E+ +SY       + R+++LL  Y F C C +C
Sbjct: 382 NNTLEVECTLLKNVKEGDELTISYIDTTSPLNKRREKLLEGYLFNCLCTKC 432


>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
          Length = 333

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 6/86 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+   S+ NH CLPNA  F Y D        ++ R +  + +G EI  SY  +  + S 
Sbjct: 71  GIFPTGSYLNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSE 124

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
           R+K LL  Y F C C +CK +   +D
Sbjct: 125 RRKHLLKQYFFFCQCQQCKFQKKLAD 150


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/300 (24%), Positives = 109/300 (36%), Gaps = 49/300 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           V + +   +GRGL +T+  + GQ VL+  P         I+    CH C        + +
Sbjct: 6   VELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDV--VCHYCLCAPGQPGAPV 63

Query: 68  -----CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A +C+ +C   A         C A+ R+      D   L + R L  R 
Sbjct: 64  EDLHRCTGCKFAQYCTKECQKKAWPEHKQE--CAAIKRITPGKPVDQTRL-VGRILWRRK 120

Query: 122 LIAAYN-------------LAVVNPSQFQILLAFQGT-----VTDNDTSAAHYLSSLCPP 163
                N             + +V   + +  L+ +       + +   S   Y +    P
Sbjct: 121 REENLNGEKKDGKENDEKKVELVKIEELEDHLSKRNAEEKEAIDEKVYSFGDYFTYDEMP 180

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR--AYGIYLKASFFNHDCLPNAC 221
            +    E  A L A    NA GL        +  R V+    GIY   S  NHDC PN  
Sbjct: 181 DSD---EEMAHLFAIIDCNAIGL--------NDHRGVQTIGVGIYPGISMLNHDCSPNC- 228

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                  A  N   + VR +  +  G E+C+SY         R+++L   Y F C+CD C
Sbjct: 229 ------VAMNNGPRLEVRALRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTC 282


>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 513

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 38/273 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCH-NCFRKTMHSSSSICP------ 69
           GRG+ +T+ ++AG+++  D P+L+            C+  CF   +    + CP      
Sbjct: 48  GRGMFATRDIQAGELIFTDVPLLI---------GPRCYRKCFPMCVVCYKNNCPLFPCDH 98

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
            C +  CS +C  +   S           RLL    P   +   +  L A   I    L+
Sbjct: 99  DCGLPICSTECENSTMHSQEC--------RLLREWMPTCGSSWSNELLLAVIPIRGLTLS 150

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLME 188
                Q ++L AF+  +    T     L  ++   P    +EL   +      N+F ++ 
Sbjct: 151 ---KEQRKLLYAFECHLELTRTHEIDLLKRNVKKFPNKEQLELMKRICGVFNTNSFEVL- 206

Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
                +D   S+R  G+Y   +  NH C+PN  R  + D    + T  +      +  G 
Sbjct: 207 -VASNKDCTTSLR--GLYPLGALQNHCCVPNT-RHHFDDQQQLHVTATL-----PIAAGE 257

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           EI +SY  + +D S+R++ L     F+C+C RC
Sbjct: 258 EITMSYTDLLWDTSSRRQFLKATKHFSCNCSRC 290


>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 456

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 108/288 (37%), Gaps = 33/288 (11%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--RKTMHSSSSI 67
           V  + GRGR + +T++L  GQ VL + P L +        +  C +CF   K  H  ++ 
Sbjct: 4   VVGVPGRGRAVRATKALARGQTVLLNPP-LAFVLRHEERVARRCEDCFVSEKPEHRLANC 62

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSP----DAANLPLDR--QLQARF 121
               + A+CS  C T     +H  +VC+ L  L     P    DAA + +     L+ R 
Sbjct: 63  SLCHTAAYCSKPCQTRNWKRAHK-HVCKLLQTLPENPQPPHIIDAAAMTVATLVALERRA 121

Query: 122 LIAAYNLAVVNP-----------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE 170
            +        +P           + F  +     T+   +      L ++   P ST  +
Sbjct: 122 KLEDKESEQASPDPGSAVRQPRCADFWAMAQHTPTLNSEELDDVLQLVAVTQCPGSTDKQ 181

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
               +L +   N F + +  +         R  G+Y   +  NH C PN C   Y     
Sbjct: 182 RVMDVLQRADCNNFSIWDELL-------LPRGAGVYPWGAILNHSCEPN-CVMTYRGPLH 233

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
                  V+ + D+  G E+C SY  +      R   L   Y F CDC
Sbjct: 234 AQ----AVKALRDIAVGEELCHSYIDLYAPTGQRHSHLGDQYGFECDC 277


>gi|392580063|gb|EIW73190.1| hypothetical protein TREMEDRAFT_73008 [Tremella mesenterica DSM
           1558]
          Length = 593

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 84/383 (21%), Positives = 140/383 (36%), Gaps = 76/383 (19%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSS-----S 65
           +GRG+GLV+ Q +  G+++ ++ P ++ +     PF+ S   C  CF   M S      S
Sbjct: 179 DGRGKGLVAKQRILQGEVLWQEEPWVVTADANLYPFLLSQRMCTQCF--AMFSRPNPPLS 236

Query: 66  SICPSCSVA-FCSPKC-STAAASSSHSPYVC----------------------QALTRLL 101
             C SC +A FC+  C S A AS +H   +C                      +A++R++
Sbjct: 237 VPCGSCDMAHFCNRLCLSKARASGAHHDLLCPGQNPAAMELLTLIHKQGSRPLEAVSRII 296

Query: 102 N--LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL-- 157
                  +   L    +++ R       ++ +          F G +   +   AH L  
Sbjct: 297 AKWRGEREWGGLEKGEEVRKRVWGGMARVSELEKESQMREWEFIGELRLKEWHHAHELLI 356

Query: 158 SSLCPPP-------------ASTTIELTAALLAKDRLNAFGLMEPYIE--GQDGRRSVRA 202
             L PP              +    +L A  L+ +    +   + ++E  G  G     +
Sbjct: 357 KVLNPPSDDPGYKAFHKLLTSRRPKKLQAVPLSAEEAAQWFSFDSFLELLGLVGLNQEDS 416

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI-------------------HD 243
            GIY   + FNH C PN    +   +     +  + R I                    +
Sbjct: 417 GGIYALHAHFNHSCDPNIQARNLPKSFTPPTSSELPREIPPPNAQGVRGTNKLTMLARRN 476

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
           +  G E+ LSY  +      R++ L   Y F C C RC  E       N   N  +  DE
Sbjct: 477 INPGEELTLSYVNIALPKWHRRRLLREQYGFWCSCPRCGREKE-EPPQNKMMNGGKSRDE 535

Query: 304 DQDEQMVASDDD---AEVHGDTN 323
            + E +V    +   AE  G T+
Sbjct: 536 VEAEDIVRRIQEMSMAESQGGTD 558


>gi|261330291|emb|CBH13275.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 399

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 80/354 (22%), Positives = 125/354 (35%), Gaps = 88/354 (24%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRK--------- 59
           VA    +G+ +++T  + A   V  ++P++ + +  ++     FC  C R+         
Sbjct: 54  VANTVEKGKHILATADMPAQYSVFEEAPVVSWPSQGYLTLGVPFCLFCLRQRNNADIADE 113

Query: 60  TMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL------ 113
             H+ +S C  C   FCS  C  A+A   H   +C  L  L      D    P+      
Sbjct: 114 GEHNWNS-CAGCGSLFCSESC-LASAEMPHR-VLCGVLQELRKRECTDGCCRPITKESLA 170

Query: 114 ---------------DRQLQARFLIAAY--NLAVVNPSQFQILLAFQGTVTDNDTSA--- 153
                           RQ   + L   +  N   ++   F ++ A      D   +A   
Sbjct: 171 NCVAWVVGRIAGAIKQRQFSGKLLEENHRENANSISRQLFHVVTAPFNRFLDAPKNAEFA 230

Query: 154 --------------------AHYLSSLCPPP---ASTTIELTAALLAKDRLNAF-GLMEP 189
                               A  L S   PP   A   +E+   LL  D L  F G++  
Sbjct: 231 DVDANSWYEEINKLLREPCRAVLLQSAAAPPSIGADWALEIVDGLLRHDTLERFLGILTL 290

Query: 190 YIEGQDGRRSV------------------RAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
             +G +G  +V                  +  GIY   S FNH C+PN      V   AE
Sbjct: 291 NSQGLNGFVAVPGAAGDMDSPSLPITWVLKGGGIYSLQSAFNHSCVPN------VAVLAE 344

Query: 232 NNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             T DI +R +  +  G E+ ++Y PV       ++  L  Y F C C  C+ E
Sbjct: 345 GGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQMKLEGYFFTCRCPLCEEE 398


>gi|383866011|ref|XP_003708465.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 636

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 127/335 (37%), Gaps = 73/335 (21%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKT 60
           A S+ V++A  E  GR LV+T+  K G IV  + P     A      S F  CH+C  ++
Sbjct: 240 AASDGVSIAFSEKYGRHLVATKEFKPGDIVTIEDPF----AFTIYMQSYFTHCHHCLARS 295

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
            +     C  C VA +CS  C   A   +H    C  L  + NL + D   + +  ++  
Sbjct: 296 FNLIP--CLKCPVAQYCSETCRKLAWEMAHQ-IECPILALVGNLLNVDKNKIRMLTKI-V 351

Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
           RFLI A               +A  NP         +G +    + +A  L++      L
Sbjct: 352 RFLIVATAKGKKIQELLEDMKVAESNPDNRTAGFTDEGILDSTSSRSALSLATNMTTRPL 411

Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
               A   I   AA+L   + N F       + +D          G    RA        
Sbjct: 412 IGISAFACISALAAILLATQTNFFCKKYEVDQLRDINDLPEIKFCGSIMFRACVIMCSNC 471

Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
                        G+Y+  S +NH C PN  R F+           +I R +  +  G +
Sbjct: 472 FLVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHFE--------GMTMITRALEPIFPGDQ 523

Query: 250 ICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRC 281
           I  +Y   +Y Y TR +R   ++ DY F CDC  C
Sbjct: 524 IFTNYC-ASYAYMTRSERREKIMQDYFFECDCIAC 557


>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
          Length = 515

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 70/306 (22%), Positives = 119/306 (38%), Gaps = 53/306 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDS-PILL---YSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +GRGL +T+ ++ G+++  +  PI +      L  I S   C  C     +SS       
Sbjct: 126 KGRGLYATRDIQQGELLFHEKVPIAMVPPMDKLKLIRSGKSCSMCGVSLSNSSHFTMLHG 185

Query: 68  --CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
             C  C+  +CS  C     +    PY+    ++  N+  P   N+  +R  Q    +AA
Sbjct: 186 LDCNGCNSIWCSTNCKQKDITH---PYLKHFGSKNKNIR-PMDWNM-FERHCQENIFVAA 240

Query: 126 YNLAVVNPS-------------QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
           Y++ V++ +             QF+ L      V    + + +   +      S   E  
Sbjct: 241 YSIGVIHAASLIDKAQSDEINQQFEALAKISQRVRYESSDSNNIGGTFDAEIGSLKEENP 300

Query: 173 AALLAKDRLNAFGLM------------EPYIEGQDGRRSVRAYG--IYLKASFFNHDCLP 218
             L  K    AF L             E ++E   GR  +      +Y   SF NH+C P
Sbjct: 301 EPLWKK----AFDLFIRTFPDTAEMGYEVFLEYL-GRFHINQLSGQLYFLYSFLNHNCEP 355

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           N  R+D       N  ++ V     + +  E+  +Y    +  S R++ L  ++ F C+C
Sbjct: 356 NV-RYD-----INNKLELKVYARKFIKKDEELVTTYVNPLHGVSLRRRELRVNWGFICNC 409

Query: 279 DRCKVE 284
           DRC  E
Sbjct: 410 DRCAKE 415


>gi|389643366|ref|XP_003719315.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|351639084|gb|EHA46948.1| TPR domain-containing protein [Magnaporthe oryzae 70-15]
 gi|440463125|gb|ELQ32752.1| TPR domain-containing protein [Magnaporthe oryzae Y34]
 gi|440483490|gb|ELQ63875.1| TPR domain-containing protein [Magnaporthe oryzae P131]
          Length = 746

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 16/109 (14%)

Query: 181 LNAFGLMEPYIEGQDGRRS-------VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
           LN FG   P +  +D  R          + G++L A++ NH C+PNA R    D      
Sbjct: 476 LNGFGC--PRLRSRDKERMNVQEGGPESSTGMWLHAAYANHTCIPNATRAFIGDM----- 528

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             +IVR   D+P G EI + Y  +   + +R+ +    Y F CDC+ C+
Sbjct: 529 --MIVRAARDIPAGAEIFMGYASLAEPFESRRSKFKTSYGFECDCEMCR 575


>gi|328875104|gb|EGG23469.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 448

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 14/122 (11%)

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
           I +   +  K R N FG+       +D    V    I   AS+FNH C PN      VD 
Sbjct: 263 ISMIRPITCKIRCNQFGIWSK----KDKCLGVSVTPI---ASYFNHSCCPNI-----VDV 310

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                T +  + +H +P+G ++C+SY  ++    +RQ  L+Y Y F C C RC  + +  
Sbjct: 311 RG--TTLLEFKALHFIPKGSQLCISYLDLDQTTDSRQDYLIYSYYFKCGCKRCNDKGDSI 368

Query: 289 DN 290
           DN
Sbjct: 369 DN 370


>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
           CCMP2712]
          Length = 98

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 16/113 (14%)

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDA 228
           E+ A++++K + N  G+++  ++GQ+       +G+Y  ASF NH C PNA   FD    
Sbjct: 1   EVIASMISKAQRNMHGIVD--LKGQN-----FGHGLYPLASFINHSCEPNAIISFD---- 49

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                  ++VR + ++P+G EI ++Y  +      R+  LL    F C C RC
Sbjct: 50  ----GNKLVVRALENIPRGTEITIAYVELYAPLDVRRDALLSRKGFLCRCSRC 98


>gi|414871461|tpg|DAA50018.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
          Length = 173

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 15/123 (12%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLP 218
           TI+    +LA+ R+NAF +       +D   S  A           +Y+  SF+NHDC P
Sbjct: 53  TIDWYINVLARIRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDP 112

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           NA            N D  ++ + D+ +G E+C+ Y   + D   RQK L   + F C C
Sbjct: 113 NA------HIVWLQNADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRC 166

Query: 279 DRC 281
            RC
Sbjct: 167 LRC 169


>gi|134074941|emb|CAK47970.1| unnamed protein product [Aspergillus niger]
          Length = 380

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 44/297 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSAL--PFINSSSFCHN--CFRKTMHSSSSI-CPS 70
           +GR L ++Q ++ G++++ D P  L   +  P  + S  C N  C R+T HSS  I CP+
Sbjct: 45  KGRQLRASQPIRKGELLMIDVPYALIPVVDDPASSDSLLCSNPTCSRQTQHSSGRISCPN 104

Query: 71  ---CSVAFCSPKCSTAAASSSHSPYVCQALTRLLN-----LNSPDAANLPL------DRQ 116
                V +CS  C  A        + C  L R  +         D   + L       RQ
Sbjct: 105 RCLADVVWCSSTCQEA--DQLRHEFECTWLQRYASPIRAKWGEYDFGMMWLIVRILASRQ 162

Query: 117 LQARFLIAAYNLAVVN-PSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPAST 167
            + + L+ A N         ++ + +F G++     +           YL S    P   
Sbjct: 163 TERQQLLDANNETTQRFKGGWEAIQSFCGSIESWSHAQVRSWTALVKKYLRSSPILPHDL 222

Query: 168 TIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
           +++   AL+ ++  N+FGL      + P  E    R       +Y +A+  NH C PN  
Sbjct: 223 SVDEIVALICREEANSFGLYPRETGVYPVPEPPVDRGEQFGAAVYPRAAIANHSCSPN-- 280

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDFAC 276
               +    +++  ++     D+  G E C+SYF ++   D  +R+  L   + F C
Sbjct: 281 ----IMHKPDHHGRMVFTASKDIAAGEECCISYFDLSKRVDLKSRRDHLQGLFRFVC 333


>gi|134108020|ref|XP_777392.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338819406|sp|P0CR43.1|SET5_CRYNB RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|50260082|gb|EAL22745.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 449

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 54/104 (51%), Gaps = 14/104 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
           G+G+GLV+ + LK G+++ ++ P ++ S       +  S  C  CF      S  I   C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPISVPC 162

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           P C+ A FC+  C T + SSSH P +C  L       +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199


>gi|190347709|gb|EDK40035.2| hypothetical protein PGUG_04133 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 338

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)

Query: 151 TSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK 208
           T    YL   C P      TI+    ++     NAFG+    + G+D  +      +Y  
Sbjct: 211 TRMYKYLKLTCGPQLQPFITIDNIRRIIGTSLTNAFGIWSSTVPGED--KEYFGCALYPS 268

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKR 267
           ASFFNH C  N  R         +   I     H V QG E+C+ Y      DY TRQK 
Sbjct: 269 ASFFNHSCSANVSR-------TRHGRSISFVTSHPVLQGEELCIQYGNHTTEDYHTRQKD 321

Query: 268 LLYDYDFACDCDRCKV 283
           L  ++ F C C +C++
Sbjct: 322 LK-EWFFECGCKKCEM 336


>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
          Length = 520

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 114/274 (41%), Gaps = 38/274 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           RGRG+ ST+S   G ++L  +P +L  +L  IN  + C +CF++    + S C   +V++
Sbjct: 14  RGRGIFSTKSFVPGDVILPFTPRILIPSLSHIN--NICSHCFKQAEVRACSRCH--AVSY 69

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
           C   C  A  ++ HS   C+ L ++     P    LP          I A   A+V P  
Sbjct: 70  CDAACQAANWTAVHSKE-CKVLRKVTEQGRP---GLPTP--------IRAVVQALVKPEI 117

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD-RLNAFGLMEPYIEGQ 194
              +   +G  T    S       +    AS    L      +D + NAF   +  + GQ
Sbjct: 118 GNAIEDLEGNGTSWRKSEKWADMEMMAMGASAFAGLGTG--QEDIQTNAFHRYDTDL-GQ 174

Query: 195 DGRRSVRAYGIYL--KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
                    GI+L  K +  NH C+PNA    ++   A      I+R    +    EI +
Sbjct: 175 --------VGIFLEPKLAMANHSCIPNAM-VQFIGRKA------ILRAEKPIKIDDEIEI 219

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           SY    +  S R K  L  Y F C C RC+ + N
Sbjct: 220 SYTDYTFPLSKR-KLALAPYFFDCMCLRCEKDLN 252


>gi|392566265|gb|EIW59441.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 452

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 5/102 (4%)

Query: 202 AYGIY-LKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           A+G++ L +  FNH C+PNA C+  Y+  A+E     +V  + D+ +G EI + Y     
Sbjct: 198 AHGVFPLASRLFNHSCIPNAACK--YIIRASEPVAMQVV-ALRDIAEGEEITIPYLDPAL 254

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
            Y TRQ+ L  +Y F CDC  C+ ++     +   E   + +
Sbjct: 255 PYQTRQEALEVNYSFNCDCRLCRFQSGIHPVNAPPERGSDAL 296


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 15  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 67

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 68  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 124

Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L + +N           Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 125 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 178

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 179 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 220

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 221 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 276

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 277 LTGDEQVWKEVQE 289


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|46115820|ref|XP_383928.1| hypothetical protein FG03752.1 [Gibberella zeae PH-1]
          Length = 358

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 72/327 (22%), Positives = 122/327 (37%), Gaps = 59/327 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS---FCHN--CFRKTMHSSSSI-CP 69
           +GR L +T  ++ G I+L D+    Y+ +P + S +    C N  C R+   +  ++ C 
Sbjct: 45  KGRLLRATGPIREGTILLVDTA---YAIVPSVTSDAEPLICSNLSCSRRVPQNGRAVRCE 101

Query: 70  SC---SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           +     V +C+  C   A+      Y C  L +  +      +            L+A +
Sbjct: 102 NACFKDVVWCNIACR--ASDKLRHDYECAWLKKNADSMRRKESEYNFATLWHVVRLLATW 159

Query: 127 NLAVVN---------PSQFQILLAFQGT--------------VTDNDTSAAHYLSSLCPP 163
           N  + +         P +   L  ++                +T        YLS     
Sbjct: 160 NAELHSGNAVAQHRYPWEAHFLRGWKAVDMCCAYLDSWPEEQLTHWKRLVNEYLSDTSVL 219

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCL 217
           P   T +    LL K+  N FGL       Q        R     + +Y +A+ FNH CL
Sbjct: 220 PCMLTADDMLLLLCKEETNTFGLYPKATGTQFVNDNATSRGESYGFALYPRAAQFNHSCL 279

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFA 275
           PN      V    +    ++     D+ +G E  ++YF +    D S+RQK     + F 
Sbjct: 280 PN------VSHKPDGQARMVYTAARDISKGEECMITYFDLATRKDVSSRQKYAQTQFQFK 333

Query: 276 CDCDRCKVEANWSDNDNDDENNEEVMD 302
           C C+RC  E        + + N ++MD
Sbjct: 334 CTCNRCLEE--------EADENMDIMD 352


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123

Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L + +N           Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 120/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 16  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 68

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 69  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 125

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 126 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 179

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN C   +      N  
Sbjct: 180 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPN-CSIVF------NGP 221

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 222 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 277

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 278 LTGDEQVWKEVQE 290


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 123/313 (39%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 50  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 102

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 103 VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 159

Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L + +N           Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 160 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 213

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN C   +      N  
Sbjct: 214 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPN-CSIVF------NGP 255

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 256 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 311

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 312 LTGDEQVWKEVQE 324


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 119/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
           [Taeniopygia guttata]
          Length = 490

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V +   EG+GRGL + +   AG ++  +     Y+A+ F + +   CH CF++     
Sbjct: 7   ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKR--QER 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C  AA  S  +   C A+ R  +  +P   N+ L  ++  +   
Sbjct: 62  LHRCGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKMER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALLAK 178
               L+   +V   + Q  +   G     D  A    +L    P      ++L + +   
Sbjct: 117 EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              N F L +     Q G ++V   GI+      NHDC PN          + V +    
Sbjct: 177 ISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHT 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R ++ +  G E+ +SY         R+++L   Y F C C+ CK
Sbjct: 231 QMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 280


>gi|403165497|ref|XP_003325490.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375165747|gb|EFP81071.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 691

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 6/104 (5%)

Query: 203 YGIYLKASFFNHDCLPNACRFDYV---DAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           +G+YL  S FNH C PNA    +    D      + I V    D+ +  EI LSY   + 
Sbjct: 539 HGLYLLHSRFNHSCDPNAVVLKHPGSEDIHKSAPSTIYVMARKDIAKDEEITLSYINPDL 598

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
               R+++L  DY F+C C RC+ E    + + + ++ +E+MD+
Sbjct: 599 SLDARRRKLFEDYLFSCFCSRCRKEL---EQEEERKHYQELMDK 639


>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
          Length = 428

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 72/303 (23%), Positives = 116/303 (38%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K MH     C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMH-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C  L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|170067842|ref|XP_001868639.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863902|gb|EDS27285.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 504

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 114/286 (39%), Gaps = 37/286 (12%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSI 67
           +A  EG GRGLV+T+ +   +++  D PIL+    P +N+    FC +C R  +    ++
Sbjct: 45  IAVFEGYGRGLVATRDIAVNELIFLDRPILVG---PRVNNYDVIFCASCCR--IQKRLTL 99

Query: 68  CP-SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C   C +  C+  C  +AA  +     C  +T          +  P D++   + ++ A 
Sbjct: 100 CSGGCRLPICA-DCDRSAAPETPHTAECTVIT----------SWQPKDQKRYCKTILYAL 148

Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAA-------HYLSSLCPPPASTTIELTAALL 176
                 ++N  + +I+   +G     D              ++L  P  S  +     ++
Sbjct: 149 TSIRAMLLNDLESRIVFNMEGHAPRKDMITEIDRLLKDQIFANL--PEGSANLTFLRRIV 206

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
                NAF       + ++    +   G+Y+  +  NH CL N      V    + N  +
Sbjct: 207 NVLNTNAFETFRIVQDEENNDHEIILRGLYILGALMNHHCLAN------VRYVFDENQVM 260

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                  + +G +I  +Y  V +    R   L +   F C+C+RC+
Sbjct: 261 WCHASRPIRKGEQIFNNYSKVLWGTQHRIIHLWFSKHFLCECERCR 306


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 69/283 (24%), Positives = 112/283 (39%), Gaps = 37/283 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF--CHNCFRKTMHSSSSICPSCSV 73
           +GRGL  T+ L AG++V  ++      A   ++S S   CH+CFR+ ++     C  C  
Sbjct: 17  KGRGLRGTRDLSAGEVVFAEASF----AAVVLDSLSLQVCHSCFRRQVNPHR--CAQCKF 70

Query: 74  A-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
           A +C   C  AA         C A+    N+      N+ L  ++  R      +  +V+
Sbjct: 71  AHYCDRTCQRAAWDEHRKE--CSAIR---NIGKAPNENVRLVARILWRI---QKHTGLVS 122

Query: 133 PSQFQILLAFQ---GTVTDNDTSA----AHYLSSLCPPPASTTIE-LTAALLAKDRLNAF 184
            SQ   L   +     +T  D            +  P  +    E   + L      N F
Sbjct: 123 DSQLTTLDMLEDHLSRMTPEDLKELKADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGF 182

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTDIIV 238
            L +     Q G +SV   G++      NHDC PN            +DA+  ++  I +
Sbjct: 183 TLSD-----QRGLQSV-GIGLFPNLCLVNHDCWPNCTVILNHGDQSALDASFHSSRRIEL 236

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R +  +  G+E+ +SY       + RQ+ L   Y F C C+ C
Sbjct: 237 RALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHC 279


>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Ornithorhynchus anatinus]
          Length = 477

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 114/284 (40%), Gaps = 30/284 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   E +GRGL +T+   A  ++  +     YSA+ F + +   CH CF++     
Sbjct: 7   EKVEVFTSESKGRGLKATKEFWAADLIFSERA---YSAVVFDSLTHLVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ +  +  +P+  N+ L  ++  R   
Sbjct: 62  LQRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKK--HGKAPNE-NIRLAARIMWRIER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
               L    +V+    Q  +   G     +       +L    P     +++  + +   
Sbjct: 117 EGTGLTEGCLVSIDDLQNHVDSFGEEEQKELRLDVESFLHFWPPQSQQFSMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N F L +     Q G ++V   G++      NHDC PN C   +      NN  I +
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGVFPNLCLVNHDCWPN-CTVIF------NNGKIEL 223

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           R +  + +G E+ +SY       + R+K L   Y F C C+ C+
Sbjct: 224 RALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCE 267


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 122/313 (38%), Gaps = 60/313 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPSESEKL 123

Query: 123 IAAYNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L + +N           Q+++ FQ  + +    A+        PPA    E  A 
Sbjct: 124 YSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D 
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADM 275

Query: 295 -ENNEEVMDEDQD 306
              +E+V  E Q+
Sbjct: 276 LTGDEQVWKEVQE 288


>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Taeniopygia guttata]
          Length = 491

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 116/290 (40%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V +   EG+GRGL + +   AG ++  +     Y+A+ F + +   CH CF++     
Sbjct: 7   ECVELFTAEGKGRGLKAQKEFLAGDVIFAEPA---YAAVVFDSLTHVICHTCFKR--QER 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C  AA  S  +   C A+ R  +  +P   N+ L  ++  +   
Sbjct: 62  LHRCGQCKFAHYCDRTCQRAAWLSHKNE--CSAIKR--HGKAP-TENIRLAARILWKMER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTSA--AHYLSSLCPPPASTTIELTAALLAK 178
               L+   +V   + Q  +   G     D  A    +L    P      ++L + +   
Sbjct: 117 EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQLISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              N F L +     Q G ++V   GI+      NHDC PN          + V +    
Sbjct: 177 ISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVVFNNGNHEAVRSMFHT 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R ++ +  G E+ +SY         R+++L   Y F C C+ CK
Sbjct: 231 QMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 280


>gi|380491929|emb|CCF34973.1| hypothetical protein CH063_00009 [Colletotrichum higginsianum]
          Length = 412

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 59/123 (47%), Gaps = 14/123 (11%)

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
           TT E    L ++D  NAFG+     EG +    +  +G +  AS+FNH C PN  +   V
Sbjct: 297 TTPESLFLLSSRDSHNAFGIRSLEDEGSE----LFGFGCWPAASYFNHSCGPNVEK-KRV 351

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRCKV 283
             A E       R  ++V +G E+C++Y        +R+ R+L    ++ F C C RCK 
Sbjct: 352 GRAWE------FRAANNVEKGGELCITYLGGEERKLSREARMLTLRRNWGFECQCKRCKE 405

Query: 284 EAN 286
           E  
Sbjct: 406 EGG 408


>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
          Length = 403

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 6/86 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+   S+ NH CLPNA  F Y D        ++ R +  + +G EI  SY  +  + S 
Sbjct: 141 GIFPTGSYLNHSCLPNA--FWYNDDQGM----MVFRTLRPIKKGEEILTSYTDITTECSE 194

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
           R+K LL  Y F C C +CK +   +D
Sbjct: 195 RRKHLLKQYFFFCQCQQCKFQKKLAD 220


>gi|195376269|ref|XP_002046919.1| GJ12224 [Drosophila virilis]
 gi|194154077|gb|EDW69261.1| GJ12224 [Drosophila virilis]
          Length = 661

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 72/314 (22%), Positives = 131/314 (41%), Gaps = 44/314 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +S  +  V + E + +GR +V+ + LK G +VL ++P+       F  ++  CH+CF++ 
Sbjct: 250 LSGAANVVKLVETKEKGRFVVANEGLKTGDVVLCENPVAACLLPNFFGTN--CHHCFKR- 306

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +H     C  CS +AFCS +C   A  + H  + CQ +   +         + L    QA
Sbjct: 307 LHIPVP-CLHCSGIAFCSAQCMGEACETYHR-FECQFMDLFIGSGMSILCFVALRVFTQA 364

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDND----TSAAHYLSSLCPP------------ 163
             L      A +    F+ L + Q     +D    +  A +L  +               
Sbjct: 365 ASLEDGLTTANL---LFEHLCSHQDVREADDYLQRSLMAGFLMRILQKALYFGRRKTEGV 421

Query: 164 -PASTTIELTAALLAKDRL---NAFGLMEPYIEGQ---DGRRSVR-AYGIYLKASFFNHD 215
            P +  +++  ALL   ++   NA  +    +  +   DG + V    G+Y   S+FNH+
Sbjct: 422 NPTAVELQVATALLGLLQVLQYNAHEIYHTMVTDEHCFDGCKVVYVGAGLYGTGSYFNHE 481

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYD 273
           C P+   + YV         +++           + ++Y P+    +   RQ+ L   Y 
Sbjct: 482 CWPSVAGY-YV------GKKLVMSATKPHRPNEIVAVNYGPIFTKMNLKERQRTLRGRYA 534

Query: 274 FACDCDRCKVEANW 287
           F+C+C  C  + NW
Sbjct: 535 FSCNCLTC--QENW 546


>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
           niloticus]
          Length = 440

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 66/281 (23%), Positives = 102/281 (36%), Gaps = 33/281 (11%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL  T+  K G+++   S       L       +C  CF +T H +   C  C  A
Sbjct: 23  GKGRGLRVTRPFKVGELLF--SCPAFSHVLSVKERGCYCEFCFTRTQHLAR--CGKCKKA 78

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-----AANLPLDRQLQARFLIAAYNL 128
           F C+ KC       +     C A+        P       A +   +++Q    I    L
Sbjct: 79  FYCNVKCQKG--DWAMHKLECSAMVAFGENWCPSELSRLVARILAKKKMQKERCICEKML 136

Query: 129 AVVN-PSQFQILLAFQGTVTDNDTSAAH--YLSSLCPPPASTTIELTAALLAKDRLNAFG 185
            +    S  +     +  +T+ D +  H  Y   L  P     +     L ++   N F 
Sbjct: 137 LIGEMQSHTEDEDNEKREMTEADIAGLHRFYSKHLDVPDHKELL----TLFSQVACNGFT 192

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +       +D   S     +Y   +  NH CLP+              T   VR + D+ 
Sbjct: 193 I-------EDDELSHLGTAVYPDMALINHSCLPSV-------IVTFKGTSAEVRAVQDMK 238

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            G E+ +SY  + Y    R  RL   Y F CDC  CK ++ 
Sbjct: 239 PGDEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKSQSK 279


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 115/297 (38%), Gaps = 58/297 (19%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS---SFCHNCFRKTMHSS 64
           +TV  +  +GR L +T++ + G+ ++   P   Y  +P  N+S   S C  CF      +
Sbjct: 13  LTVTILPEKGRSLFTTRNFRPGEEIISQEP---YVCVP--NNSPVESRCDRCFASI---N 64

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLPLDRQ 116
              C +C VA +CS  C  +          C AL RL       +  +      L + R+
Sbjct: 65  LKKCSACKVAWYCSSVCQKSEWKLHRLE--CDALARLDKDRHKSVTPSIRLMIKLFIRRK 122

Query: 117 LQARFLI---AAYNLAVVNP---------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
           LQ+  +I   A  N  +V            +  +L A  GT+ +           +   P
Sbjct: 123 LQSEKVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLVN----------FILQWP 172

Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
                E+ A   +K   NA  +        D        G+Y   S  NH CLPNA    
Sbjct: 173 GMNVKEI-AVNFSKLACNAHTIC-------DSELRPLGTGLYPVISIINHSCLPNAV--- 221

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 E  T + VR +  +P G E+ +SY        TRQK L  +Y F C C RC
Sbjct: 222 ---LVFEGRTAV-VRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRC 274


>gi|449437548|ref|XP_004136554.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
           sativus]
          Length = 341

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 60/287 (20%), Positives = 115/287 (40%), Gaps = 22/287 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V+  +  GRG+ +T+ + AG+++    P++ + +L  I+    C+ C +K    ++  
Sbjct: 60  IRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHH--VCNFCLQKLQRYANVD 117

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY- 126
             +   +FCS +C   +    H   +    +   N         PL  +  A  +I+   
Sbjct: 118 SDARRASFCSEECEQHS-KVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGAM 176

Query: 127 ---NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA-STTIELTAALLAKDRLN 182
              +L ++ PS+    +  +     +    A   +++        T E    +LA+ R+N
Sbjct: 177 SSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRIN 236

Query: 183 AFGL--------MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           AF +        +           +     +Y+  SF+NHDC PN            NN 
Sbjct: 237 AFRIELAGGYEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPN------THIIWINNA 290

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +  ++ + DV    E+ + Y   + DY  RQ  L   + F C C RC
Sbjct: 291 NAKLKALRDVDPDEELRICYIDASMDYDARQTLLHRGFGFICKCARC 337


>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
          Length = 429

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 108/292 (36%), Gaps = 64/292 (21%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCHNCFRKTMHSSSSIC 68
           G+G GL + Q + AG+        LLY+A P     S       CH+CF +  H +   C
Sbjct: 12  GKGNGLRANQRVLAGE--------LLYTAEPLACCVSSQCRNVVCHSCFSR--HETLLRC 61

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAY 126
             C VA +C   C   A S       C     LL+L+      +P D  +L AR +    
Sbjct: 62  SQCKVARYCDVTCQKRAWSIHKRECKC-----LLSLH----PRIPTDSVRLAARIIFRLL 112

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL--------------- 171
             + + P Q       +  + D        LS L     S+ ++L               
Sbjct: 113 CPSQMTP-QLYSFEEHESHLCDMGEEKREGLSQL-----SSMLQLYLKQEQPDIIQKVPS 166

Query: 172 --TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
               +LLAK   N F +        D        G+Y   S  NHDC P +C   +    
Sbjct: 167 FDPISLLAKVTCNCFTI-------SDAELQEIGVGLYPSMSLLNHDCRP-SCVMLF---- 214

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 + +R I D+    E+ +SY  V      RQ +L+  Y F+C C  C
Sbjct: 215 --QGKTLQLRAIRDIQPTEEVTISYIGVLLPTRERQTQLMEQYHFSCQCGLC 264


>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 669

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 80/337 (23%), Positives = 132/337 (39%), Gaps = 86/337 (25%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMH 62
           +S  VT+     +GR  V+T+ +K G +++ ++P   YS++     + S CH+C+ + + 
Sbjct: 240 MSSAVTIESSPDKGRYAVATRDVKVGDVLIVENP---YSSVGLQPCNVSHCHHCYIRVL- 295

Query: 63  SSSSICPSCS-VAFCSPKCSTAAASSSHS----------------------PYVCQALTR 99
            +S  C  C+ + +CS +C  A+    H+                        +   L  
Sbjct: 296 -ASIPCLQCAGIVYCSKECRNASWEMYHNLECHHLDLIQELGLGMGHLALRTIIRTGLAF 354

Query: 100 LL-------NLNSPDAA--NLPLDRQLQARFLIAAYNL------------------AVVN 132
           LL       N+N PD +     +D + ++ +  + YNL                  AV  
Sbjct: 355 LLKFREQSANVNIPDESFHGCTVDGEYESNYY-SVYNLVGHSEDRKPGDLFKRVVKAVCL 413

Query: 133 PSQFQILLAFQGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGLM 187
               Q    FQ    DN+   A     H L+ L   P +   E++   L +  +     +
Sbjct: 414 LRCLQQTNFFQSVGADNEEDVAIFIGGHMLTHLQTIPCNAH-EISEYELWRSDITKCHFV 472

Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
           E               G+Y   S  NH C P   R  Y +         +VR I ++ +G
Sbjct: 473 EV------------GSGLYPTMSLVNHSCDPVVTRNCYGETC-------VVRAIRNIYKG 513

Query: 248 REICLSY---FPVNYDYSTRQKRLLYDYDFACDCDRC 281
            EI  +Y   +PV +D S RQ RL + Y F C CD C
Sbjct: 514 EEITDNYGYLYPV-HDKSERQTRLKWQYFFECKCDAC 549


>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 674

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 138/362 (38%), Gaps = 96/362 (26%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--R 58
           + + S+ VT+   E RGR LV+T+++KAG I++ +SP    +     N+ +F  NC    
Sbjct: 218 LESCSDAVTLQFDEKRGRHLVATRNIKAGYILIVESPFAFST-----NNEAFDRNCLNCH 272

Query: 59  KTMHSSSSICPSC----SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD 114
            T+  + S+   C    +V+FCS KC   A    H  Y C     + ++   + +  P  
Sbjct: 273 VTLKLNDSVKIPCYFCRTVSFCSEKCRNEAWKLYHR-YECS----IFDIFCGNDSEQP-- 325

Query: 115 RQLQARFLIAAYNLAV-------------VNPSQFQIL----LAFQGTVTDNDTSAAHYL 157
            Q Q+ +L+ AY + +             ++ ++  +L    L + GT T+ + S     
Sbjct: 326 -QRQSSYLLLAYRMTIAGCLLSNTDKVNNMDKTEIPVLNDNFLRYHGTNTNQECSDLGIN 384

Query: 158 SSLCPPPASTTIELT-----------------AALLAK------DRLNAFGLMEPYIEGQ 194
               P    T ++L                  A  L K       +++   L E +I   
Sbjct: 385 EVYSPRDYRTILKLKTHCEKIEPNINLIRAIEAIFLTKCFTFVLSKMDVVCLKETFISLA 444

Query: 195 ----------------------DGRRSV---RAYG--IYLKASFFNHDCLPNACRFDYVD 227
                                 D +  V   R  G  IY   S  NH C PN  R  Y  
Sbjct: 445 VAMLHHLQAINCNAYEIVENLYDKKAHVWEPRYVGGAIYPSVSLVNHSCYPNVVRHSYPS 504

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
                   ++VR +  + +G EI   Y P  ++     R++ L   Y F C C+ C    
Sbjct: 505 GV------VVVRSLRFIGKGTEIVDCYGPHWLSEGRLPRREYLWKKYRFLCACEAC--TQ 556

Query: 286 NW 287
           NW
Sbjct: 557 NW 558


>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
          Length = 382

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 90/195 (46%), Gaps = 18/195 (9%)

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
           R++ A  +   Y    +NP  F  L+A  GT      ++   L        ++  +LT +
Sbjct: 200 REMTANVIRGEYIDEFLNPEGFSSLMALVGTNGQGIGTSPLALW------VNSVSQLTMS 253

Query: 175 LLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAE 231
              + +L+ F   +  Y+E + G+  +    G+Y   S  NH C PNA   F Y      
Sbjct: 254 DDERQQLDVFIDKIYQYVEEESGQFLNTEGSGLYQLQSACNHSCAPNAESSFPY------ 307

Query: 232 NNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
            N  I ++ I ++  G EI +SY     +     +RQ+ L  +Y F C C+RC  ++N  
Sbjct: 308 GNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHSRQRELSENYLFVCCCERCMSQSNEP 367

Query: 289 DNDNDDENNEEVMDE 303
           D  +D+E +++ +D+
Sbjct: 368 DCTSDEEMSDDDLDD 382


>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
          Length = 513

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 119/314 (37%), Gaps = 45/314 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           + R L +TQ  +  Q +    P++L+ +L  +  +S C +C R     S   C  C  A+
Sbjct: 12  KHRALHATQPFRPSQTIHTFHPLILHPSLSHL--ASVCTHCLRP---GSPRACSRCHAAY 66

Query: 76  -CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD-RQLQARFLIAAYNLAVVNP 133
            C+  C  AA ++ HS   C+AL +   L S   A+LP   R L    LI      V   
Sbjct: 67  YCNTSCQQAAWTAVHSKE-CKALQQR-KLGSRTGADLPTPVRALLQTLLIKEIEDGVA-- 122

Query: 134 SQFQILLAFQGTVTDNDTSA--------AHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
                  A  G   +   +         A    +       + +     LL K + NAF 
Sbjct: 123 -------ALDGHTEERRKTKSWPDLEMMALAACAFAGRQGESNVRKAVELLCKIQTNAFH 175

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFF--NHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
             +  + GQ         G++L+ +    NH C+PNA    +V   A      I+R    
Sbjct: 176 RWDVDL-GQ--------VGVFLEPTLAMANHSCVPNAV-VQFVGRKA------ILRAERP 219

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
           +  G EI +SY       S R +  L  Y F C C RCK + N         N   V D 
Sbjct: 220 IQAGDEIEISYTDYTMPLSAR-REALEQYSFECTCARCKGDLNVYQVRTACPNTYSVADV 278

Query: 304 DQDEQMVASDDDAE 317
             D     +D  A+
Sbjct: 279 SGDGHPALTDKAAQ 292


>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
          Length = 374

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 69/308 (22%), Positives = 116/308 (37%), Gaps = 50/308 (16%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHN--CFRKTMHSSSSI-CP- 69
           +GR L +   ++ G ++L D+P   Y+ +P + S+    C N  C R+   +  ++ C  
Sbjct: 43  KGRLLRAAGPIREGTVLLVDTP---YAIVPSVTSNEPLICSNLSCSRRVPQNGRAVRCEH 99

Query: 70  SCS--VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
           +CS  VA+C+  C   A   +   + C  L +                      L+A +N
Sbjct: 100 ACSKDVAWCNVACR--ATDKARHDFECSWLKKHAEPLRQKEGEYNFATVWHVVRLLATWN 157

Query: 128 L------AVV---NPSQFQILLAFQGT--------------VTDNDTSAAHYLSSLCPPP 164
                  A+V   +P     L  ++                +         YLS     P
Sbjct: 158 AELHSSNALVQQRHPWGAHFLRGWKAVDMCCAYLDSWPEVQIIHWKRLVHEYLSDATVLP 217

Query: 165 ASTTIELTAALLAKDRLNAFGLM------EPYIEGQDGRRSVRAYGIYLKASFFNHDCLP 218
              + E    LL K+  N FGL       +P  +    R       +Y +A+ FNH CLP
Sbjct: 218 PLLSAEQMLLLLCKEETNTFGLYPRATGSQPVNDNAAPRGESYGMALYPRAAQFNHSCLP 277

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKRLLYDYDFAC 276
           N      V    +    ++     D+ +G E  ++YF    + D ++RQ      + F C
Sbjct: 278 N------VTHQPDGQARMVYTAARDISKGEECMITYFDLTTHKDLTSRQNHTQEQFQFKC 331

Query: 277 DCDRCKVE 284
            C+RC  E
Sbjct: 332 TCERCLKE 339


>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ornithorhynchus anatinus]
          Length = 415

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 55/129 (42%), Gaps = 18/129 (13%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           +G       G+Y   S  NH C PN              T + +R + ++ +G E+ + Y
Sbjct: 161 NGEMQEVGVGLYPSMSLLNHSCDPNC-------VIVFEGTSLFLRAVREIQKGEELTICY 213

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDE 303
             V      RQK+L   Y FACDC RCK            E  W +  +  +  EE+  +
Sbjct: 214 LDVLLPSQERQKQLKEQYCFACDCIRCKTQDKDVDMLAGEEPFWKEVKDAVDTVEELQSQ 273

Query: 304 DQDEQMVAS 312
            + EQ++A+
Sbjct: 274 KKWEQVLAT 282


>gi|392565527|gb|EIW58704.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 402

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 66/156 (42%), Gaps = 22/156 (14%)

Query: 136 FQILLAFQGTVTDNDTSAAHYLSSLCPPPAS---TTIELTAALLAKDRLNAFGLME-PYI 191
           +Q+L    G+       AA   +  C  P      TI     +L+ D  NAFG+ E P +
Sbjct: 261 YQLLKGLLGS------RAAERSNGACEEPHLADLVTIANVRTILSVDPGNAFGIWEVPLM 314

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           E  +       + +Y   SFFNH C  N  +     A     T  +V        G E+C
Sbjct: 315 EESE----CLGFALYPVPSFFNHHCAANVRKERDGRALRFVTTRAVV-------AGEELC 363

Query: 252 LSYFPVN-YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           +SY  V   D++TRQK LL  + F C C RC  E  
Sbjct: 364 ISYGHVEAMDWATRQKELLEGWYFECRCGRCTAEGE 399


>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 522

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 58/142 (40%), Gaps = 25/142 (17%)

Query: 170 ELTAALLAKDRL--------NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
           EL A    KD L        N FGL        DG+    A  ++  ASFFNH C P+  
Sbjct: 314 ELMAGFPLKDVLKLVSQIESNGFGLW-------DGKGECYARALFPSASFFNHSCDPSCD 366

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R+       ++   + +     +  G E+ +SY  VN    TRQ  LL  Y F C C RC
Sbjct: 367 RY-------QDKFLLSIATRRPIAAGEELSISYIDVNAPCRTRQHELLDSYHFQCSCTRC 419

Query: 282 KVEA---NWSDNDNDDENNEEV 300
             E    +      D E  E++
Sbjct: 420 VRELAAHSAGSKSRDKEEREKI 441


>gi|219130732|ref|XP_002185512.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403043|gb|EEC42999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 493

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 40/84 (47%), Gaps = 7/84 (8%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R  G++  A+  NH C+ NA R    D        ++V    D+P G E+  SY P    
Sbjct: 218 RLLGLFPLAAMINHSCVGNAVRVLVDDV-------MVVHATTDLPAGTELVWSYGPPTTP 270

Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
           ++ R  RL   Y F CDC RC+ E
Sbjct: 271 FAQRNTRLRAHYGFVCDCPRCQRE 294


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 109/300 (36%), Gaps = 51/300 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL ++  L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRASAQLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           VA +CS KC   A         C+ L        PD+  L L R          + L   
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVFKLMEE 115

Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
            PS+ + L +F         +T++       L           I+  + L          
Sbjct: 116 TPSESEKLYSFYDLESNINKLTEDKKDGLRQLVMTFQHFMREEIQDASQLPSSFDIFEAF 175

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           AK   NAF +    ++           G+Y   S  NH C PN        +   N   +
Sbjct: 176 AKVICNAFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPHL 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           ++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRCETQDKDADMLTGDEQ 281


>gi|58264108|ref|XP_569210.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|338819407|sp|P0CR42.1|SET5_CRYNJ RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|57223860|gb|AAW41903.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 449

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 14/104 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
           G+G+GLV+ + LK G+++ ++ P ++ S       +  S  C  CF      S  +   C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPLSVPC 162

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           P C+ A FC+  C T + SSSH P +C  L       +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 111/300 (37%), Gaps = 51/300 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +  +L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVVALRPGELLFRSDP-LAYTVCKG-SRGIVCDRCLLGKEKLMR-----CSQCQ 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           VA +CS KC   A         C+ L        PD+  L L R          + L   
Sbjct: 67  VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VIFKLMEE 115

Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
            PS+ + L +F         +T++       L+          I+  + L          
Sbjct: 116 TPSESEKLYSFYDLESNIKKLTEDKKEGLRQLAMTFQHFMREEIQDVSQLPPSFDVFEAF 175

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           AK   N+F +    ++           G+Y   S  NH C PN C   +      N   +
Sbjct: 176 AKVICNSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGPHL 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           ++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQTQDKDADMLTGDEQ 281


>gi|406608054|emb|CCH40488.1| SET and MYND domain-containing protein 5 [Wickerhamomyces ciferrii]
          Length = 370

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 56/118 (47%), Gaps = 8/118 (6%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           TIE    +  ++  N+FG  +   +     +    Y ++ +ASFFNH C PN  ++  V+
Sbjct: 244 TIETIRLIFGREYGNSFGTWQLVDDKSSENKEFLGYSLFPEASFFNHSCSPNLKKYRKVN 303

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
                 T        D+ +G ++C+ YF  ++     RQ+ L  ++ F C CDRC  E
Sbjct: 304 RMHFQTT-------ADIAKGEQMCIDYFHMLDESLMVRQEVLSKNWFFECACDRCDEE 354


>gi|198467928|ref|XP_001354556.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
 gi|198146172|gb|EAL31609.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
          Length = 533

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/305 (21%), Positives = 112/305 (36%), Gaps = 55/305 (18%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHS 63
           S+   +A+ E  GR LV+++ L+AG++++ + P+ +    P ++  + C  C++   +  
Sbjct: 43  SDAFHIAQSEIYGRYLVASRQLEAGELLITEEPLAIG---PCVSGDAVCLGCYQPVGLGR 99

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--- 120
               CP C    C   C        H+   C     L     P AA L  DR    R   
Sbjct: 100 EQYRCPGCGWPLCGASCPGLGRRIGHTTEECA----LYGERRPAAAELLNDRTSPPRVRD 155

Query: 121 -----FLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSS 159
                 ++    L    P Q+ ++   +    +   N+   +HY             L  
Sbjct: 156 LYELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDD 215

Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
           L     +  +     +L    +N F +         G++  +A  +Y  A    HDC PN
Sbjct: 216 LAAEEEAQAVHEICGILD---VNCFEI---------GQKGAKARTLYPSAFLLAHDCSPN 263

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FAC 276
               D   + A     I++R    V +   + LSY    Y      KR  + Y+   F C
Sbjct: 264 TAHTDDPRSYA-----ILLRTSRRVREQEALTLSY---AYTLQGTLKRRSFMYEGKLFWC 315

Query: 277 DCDRC 281
            C RC
Sbjct: 316 RCQRC 320


>gi|321248842|ref|XP_003191260.1| hypothetical protein CGB_A2330W [Cryptococcus gattii WM276]
 gi|317457727|gb|ADV19473.1| conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 449

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 14/104 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
           G+G+GLV+ + LK G+++ ++ P ++ S       +  S  C  CF      S  +   C
Sbjct: 103 GKGKGLVAKEGLKQGEMLWQEEPWIVTSDPGHYSLLTQSMMCSQCFSLFARPSPPLSVPC 162

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           P C+ A FC+  C T + SSSH P +C  L       +PDA++L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHPPLLCPGL-------NPDASSL 199


>gi|392561695|gb|EIW54876.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 342

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 48/103 (46%), Gaps = 5/103 (4%)

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           L A G+ E         RS R  G++L A+  NH C PN  R     +  E +  +  R 
Sbjct: 123 LPALGIFETNALPCGKARSGRKAGVFLTAARLNHSCRPNLAR-----SWDEASQQMTFRA 177

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
           + DV +G E+CL+Y  V    + R + L   Y F C C+ C +
Sbjct: 178 LRDVAEGEELCLNYVDVIGTRAQRTEELQSAYGFECVCEACAL 220


>gi|195169991|ref|XP_002025797.1| GL18252 [Drosophila persimilis]
 gi|194110650|gb|EDW32693.1| GL18252 [Drosophila persimilis]
          Length = 533

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/305 (21%), Positives = 112/305 (36%), Gaps = 55/305 (18%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHS 63
           S+   +A+ E  GR LV+++ L+AG++++ + P+ +    P ++  + C  C++   +  
Sbjct: 43  SDAFHIAQSEIYGRYLVASRQLEAGELLITEEPLAIG---PCVSGDAVCLGCYQPVGLGR 99

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--- 120
               CP C    C   C        H+   C     L     P AA L  DR    R   
Sbjct: 100 EQYRCPGCGWPLCGASCPGLGRRIGHTTEECA----LYGERRPAAAELLNDRTSPPRVRD 155

Query: 121 -----FLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSS 159
                 ++    L    P Q+ ++   +    +   N+   +HY             L  
Sbjct: 156 LYELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDD 215

Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
           L     +  +     +L    +N F +         G++  +A  +Y  A    HDC PN
Sbjct: 216 LAAEEEAQAVHEICGILD---VNCFEI---------GQKGAKARTLYPSAFLLAHDCSPN 263

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FAC 276
               D   + A     I++R    V +   + LSY    Y      KR  + Y+   F C
Sbjct: 264 TAHTDDPRSYA-----ILLRTSRRVREQEALTLSY---AYTLQGTLKRRSFMYEGKLFWC 315

Query: 277 DCDRC 281
            C RC
Sbjct: 316 RCQRC 320


>gi|146414868|ref|XP_001483404.1| hypothetical protein PGUG_04133 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 338

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)

Query: 151 TSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK 208
           T    YL   C P      TI+    ++     NAFG+    + G+D  +      +Y  
Sbjct: 211 TRMYKYLKLTCGPQLQPFITIDNIRRIIGTSLTNAFGIWSSTVPGED--KEYFGCALYPS 268

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKR 267
           ASFFNH C  N  R         +   I     H V QG E+C+ Y      DY TRQK 
Sbjct: 269 ASFFNHSCSANVSR-------TRHGRLISFVTSHPVLQGEELCIQYGNHTTEDYHTRQKD 321

Query: 268 LLYDYDFACDCDRCKV 283
           L  ++ F C C +C++
Sbjct: 322 LK-EWFFECGCKKCEM 336


>gi|398404870|ref|XP_003853901.1| hypothetical protein MYCGRDRAFT_39064 [Zymoseptoria tritici IPO323]
 gi|339473784|gb|EGP88877.1| hypothetical protein MYCGRDRAFT_39064 [Zymoseptoria tritici IPO323]
          Length = 757

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 72/174 (41%), Gaps = 23/174 (13%)

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
           LI+  N  + NP+Q  +             S    +  + P     T ++ AAL     +
Sbjct: 436 LISVVNKLLHNPNQAALFSELNDGGGYTPQSKIKLVDGVVP---VDTFQVQAAL----EI 488

Query: 182 NAFGLMEPYIEGQDGRR-------SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           N FG   P +    G R       S  A G+++ ASF NHDC+ NA R    D       
Sbjct: 489 NGFGC--PEMATAIGTRPEEKEEASDTATGVWITASFANHDCIGNAHRSFIGDF------ 540

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
            +I+R   D+ +  EI  +Y   N D+   QK L   +DF C C  C  E+  S
Sbjct: 541 -MIIRATKDIAKDEEILHTYVEANDDHEQFQKTLQRSWDFKCRCRLCVAESKTS 593


>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
          Length = 515

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/288 (21%), Positives = 120/288 (41%), Gaps = 37/288 (12%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           V + E  GR LV+ + +KAG+ +L D P +L    P  ++S  C NC+   ++    +C 
Sbjct: 7   VEKSEKLGRYLVAAKDIKAGERILSDQPFVLG---PSSDTSLVCFNCYLPLINKFL-VCK 62

Query: 70  SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
           +C+VA  C         +  H+   C    R L L +   + + + + + +  ++ A   
Sbjct: 63  NCAVAPLCPGDGCPDEFTKYHNRQECDVF-RNLKL-TKGISPMTMVQNVGSLSVLRALLK 120

Query: 129 AVVNPSQFQILLAFQGTVTDNDTS--------AAHYLSSLCPPPASTTIELTAALLAKDR 180
              N  ++++ +  +  +     S           ++ SL         +L   + A   
Sbjct: 121 KETNLLEWKLFMELETHLERRRESNVWQYYDNTVKFIQSLGLLENGQNQDLVQKICAAID 180

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           +N+F +  P I        +R  G+YL+A+  +HDC+ N         + ++N  ++   
Sbjct: 181 VNSFEVRGPPIPAIGCAEILR--GVYLQAALLSHDCIANT------HMSIDDNNMLVCHA 232

Query: 241 IHDVPQGREICLSYFPVNYDYS-------TRQKRLLYDYDFACDCDRC 281
             D+ +G  I       NY+Y+        RQ+ L+    F C C RC
Sbjct: 233 SVDIKKGESI-------NYNYTDPLKGTIPRQQHLIVGKYFKCTCTRC 273


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/322 (24%), Positives = 120/322 (37%), Gaps = 59/322 (18%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           +TV+ I  +GR L +T+    G +++   P   Y  +P   +   C  CF  T  S  S 
Sbjct: 13  LTVSTISEKGRSLFTTRDFHPGDVIISQEP---YVCVP---TQKRCDGCFSTTNLSKCSR 66

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA--------NLPLDRQLQA 119
           C    V +C   C  +          CQAL+R L+ N   +          L L R+LQ 
Sbjct: 67  CQV--VWYCGTPCQKSEWKLHRLE--CQALSR-LDSNKRKSVTPSIRLMLKLYLRRKLQD 121

Query: 120 RFLIAA-----YNL--AVVNPSQFQIL------LAFQGTVTDNDTSAAHYLSSLCPPPAS 166
           + +I +     Y L  A+V    F I+      +  + ++  N +      ++      S
Sbjct: 122 QKIIPSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMS 181

Query: 167 TTIELTAALLAK-DRLNAFGLMEPYIEGQ-----------------DGRRSVRAYGIYLK 208
              E    L A+   L    L  P I  +                 D        G+Y  
Sbjct: 182 DIKEEQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPV 241

Query: 209 ASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
            S  NH CLPN+   FD  +A+        VR +  +P+G E+ +SY        TRQK 
Sbjct: 242 VSIINHSCLPNSVLVFDGREAS--------VRALQHIPKGTEVLISYIETAGSTVTRQKA 293

Query: 268 LLYDYDFACDCDRCKVEANWSD 289
           L   Y F C C  C     + D
Sbjct: 294 LREQYLFQCVCPLCSKVGQYED 315


>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
          Length = 648

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 127/330 (38%), Gaps = 66/330 (20%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           + + S +V +     RGR LV+T+ +K G+ V+ +      S L +    S CH+C   T
Sbjct: 242 LPSASASVELINTPDRGRCLVATEDIKIGETVIVEKAHA--SILQYEFKESHCHHCLHWT 299

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSH------SPYVCQALTRLLNLNSPDAANLPL 113
                  C  CS V FCS +C   A  S H      + ++C+  TR  ++N+     L L
Sbjct: 300 --PGPVPCHKCSQVGFCSTQCRDEAWDSYHQFECGLTDFLCRT-TR--DVNTGQHGLLAL 354

Query: 114 DRQLQA-RFLIAAYNLAVVNPSQFQILL---------------AFQGTVTDNDTSA--AH 155
              L+A R LI   N    +P  F   +               + Q + TD    A  A 
Sbjct: 355 RTVLKADRRLIIIANEQEKSPESFASQVFDSANYDTVHRLVDNSSQRSTTDIFRRAVMAV 414

Query: 156 YLSSLCP------PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG----RRSVRAYGI 205
           YL+SL         P          LL     NA  +    I    G     +S++   I
Sbjct: 415 YLTSLIQIRDGKDRPDEVLATAVLRLLHSYPCNAHEISHMAIPVPSGFCAQSKSLQLQQI 474

Query: 206 ---------YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
                    +   S  NH C PN     Y D        ++V++IH + +G EI  +Y  
Sbjct: 475 QSCEIGSAAFPVVSLMNHSCNPNVVHLCYGDV-------MVVKVIHRIARGEEILDNY-- 525

Query: 257 VNYDYSTRQKR-----LLYDYDFACDCDRC 281
             Y Y+T +KR     L   Y F C C  C
Sbjct: 526 -GYHYATHEKRERQLKLCQQYYFRCRCQSC 554


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
           harrisii]
          Length = 428

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/323 (20%), Positives = 110/323 (34%), Gaps = 54/323 (16%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           GRG GL +   L+ G+++ R  P L Y+     +    C  C  +        C  C VA
Sbjct: 13  GRGNGLRAVAPLRPGELLFRSEP-LAYTVCKG-SRGVVCERCLCRK--EKLLRCSQCKVA 68

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            +C  +C   A         C        L S +    P   +L  + +      +    
Sbjct: 69  RYCGSECQKQAWQDHKRECKC--------LKSSEPKFPPDSVRLLGKVVFKLLRESSCAS 120

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN-------AFGL 186
            +       +  + +        L  L          +T  L  K+ +        AF +
Sbjct: 121 EKLYSFFDLESNIKNLSEEKKEGLRHL---------AMTLQLYLKEEIQNASQLPPAFDI 171

Query: 187 MEPYIEG-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
            E + +         +G       G+Y   S  NH C PN                + +R
Sbjct: 172 FESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNC-------VIVFEGPSLFLR 224

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN-----------WS 288
            I ++P G E+ + Y  V    + RQK+L   Y F CDC  CK ++            W 
Sbjct: 225 AIRNIPLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDADMLAGEEQAWK 284

Query: 289 DNDNDDENNEEVMDEDQDEQMVA 311
           +        E++  +++ EQ++A
Sbjct: 285 EIQGSLIKIEDLQSQEKWEQVLA 307


>gi|79507571|ref|NP_196280.2| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
 gi|229488103|sp|Q9FG08.2|ATXR4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR4; AltName:
           Full=Protein SET DOMAIN GROUP 38; AltName:
           Full=Trithorax-related protein 4; Short=TRX-related
           protein 4; Flags: Precursor
 gi|332003660|gb|AED91043.1| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
          Length = 325

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 24/132 (18%)

Query: 165 ASTTIELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKAS 210
           A  T +   A+LA+ R+NAF               L    +EG+        + +Y+  S
Sbjct: 201 AFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPS 256

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
           F+NHDC PNA      +A A  NT      + DV +G E+ + Y   +  Y  RQ  L  
Sbjct: 257 FYNHDCDPNAHIIWLHNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQ 310

Query: 271 DYDFACDCDRCK 282
            + F C+C RC+
Sbjct: 311 GFGFLCNCLRCQ 322


>gi|449435328|ref|XP_004135447.1| PREDICTED: uncharacterized protein LOC101202892 [Cucumis sativus]
 gi|449522881|ref|XP_004168454.1| PREDICTED: uncharacterized protein LOC101228219 [Cucumis sativus]
          Length = 540

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 45/98 (45%), Gaps = 8/98 (8%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           R     G+++  SF NH C+PNA R    D        I+V    DV  G EI  +YF  
Sbjct: 316 RDYYGVGLWVLPSFINHSCIPNARRLHIGD-------HILVHASRDVKAGEEITFAYFDP 368

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
              +  R KR+   + F C+C RC+ E   S+ +   E
Sbjct: 369 LSSWKDR-KRMSETWGFNCNCKRCRFEEEISNKEEMKE 405


>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
           rubripes]
          Length = 432

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 106/296 (35%), Gaps = 43/296 (14%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYS-ALPFINSSSFCHNCFRK 59
           M+ + E V   +  G+GRGL  T++ + G+++L       YS  L        C +CF  
Sbjct: 1   MTGIMEGVERFDSPGKGRGLRVTRAFQVGELLLSSQA---YSYVLSVKERGEHCESCF-- 55

Query: 60  TMHSSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
           T   S + C  C  AF C  KC     +       C A+        P         +L 
Sbjct: 56  TRKKSLARCGKCKKAFYCDVKCQKGDWAMHRLE--CSAMNAFGEKWCPSEIT-----RLV 108

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDN-----------DTSAAH-YLSSLCPPPAS 166
           AR L           S+  +LL    + T++           D +  H + S     P  
Sbjct: 109 ARILTKKKMQKDRCASEKLLLLGEMQSHTEDMDNERRETMEADVAGLHQFFSKHLEIPGH 168

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
             +     L ++   N F +       +D   S     +Y   +  NH C P+       
Sbjct: 169 KDL---LTLFSQVACNGFTI-------EDEELSHLGTAVYPDVALINHSCRPSV------ 212

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                + T   VR + D+  G E+ +SY  V Y    R  RL   Y F C C+ C+
Sbjct: 213 -IVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEECE 267


>gi|159485954|ref|XP_001701009.1| hypothetical protein CHLREDRAFT_142739 [Chlamydomonas reinhardtii]
 gi|158281508|gb|EDP07263.1| predicted protein, partial [Chlamydomonas reinhardtii]
          Length = 504

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  AS+FNH C+ NA R+   D        + VR   D+P G E+ ++Y      +
Sbjct: 415 ATGLWGLASYFNHACVANAHRYFLGDF-------LFVRASRDLPAGAEVTITYLNPTLTW 467

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
             R  +LL    FAC C+ C  E  W     D+
Sbjct: 468 EQRSAKLLR-RGFACGCELCMDETEWRRGHPDE 499


>gi|10178118|dbj|BAB11411.1| unnamed protein product [Arabidopsis thaliana]
          Length = 258

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 24/129 (18%)

Query: 168 TIELTAALLAKDRLNAF--------------GLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           T +   A+LA+ R+NAF               L    +EG+        + +Y+  SF+N
Sbjct: 137 TKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGE----GAVGHAVYMLPSFYN 192

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
           HDC PNA      +A A  NT      + DV +G E+ + Y   +  Y  RQ  L   + 
Sbjct: 193 HDCDPNAHIIWLHNADARLNT------LRDVEEGEELRICYIDASMGYEARQTILSQGFG 246

Query: 274 FACDCDRCK 282
           F C+C RC+
Sbjct: 247 FLCNCLRCQ 255


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 114/311 (36%), Gaps = 56/311 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRALAQLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           VA +CS KC   A         C+ L        PD+  L L R          + L   
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGR--------VVFKLMEE 115

Query: 132 NPSQFQILLAFQ------GTVTDNDTSAAHYLSSLCPPPASTTIELTAAL---------L 176
            PS+ + L +F         +T+        L           I+  + L          
Sbjct: 116 TPSESEKLYSFYDLESNINKLTEEKKEGLRQLVMTFQHFMREEIQDASQLPLPFDIFEAF 175

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           AK   NAF +    ++           G+Y   S  NH C PN C   +      N   +
Sbjct: 176 AKVICNAFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGPHL 221

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-E 295
           ++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D   
Sbjct: 222 LLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----DKDADMLT 277

Query: 296 NNEEVMDEDQD 306
            +E+V  E Q+
Sbjct: 278 GDEQVWKEVQE 288


>gi|401623295|gb|EJS41399.1| set6p [Saccharomyces arboricola H-6]
          Length = 373

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       TI L   +L  +  NAFG+ +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLRKMLTIPLLRHILGTEYGNAFGIWQ--- 280

Query: 192 EGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  +             ++ ++  D+ +G +I
Sbjct: 281 EGEAAESREYFGYWVFPEASYFNHSCNPNVTK-------HRKGNSMLFKVNTDIKKGEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    +  ++  L D + F C C+RC +E
Sbjct: 334 CIDYSGVLSLPTLERRNFLADSWFFDCACERCVLE 368


>gi|66363372|ref|XP_628652.1| SET domain protein with MYND insert (Skm/BOP family)
           [Cryptosporidium parvum Iowa II]
 gi|46229836|gb|EAK90654.1| SET domain protein with MYND insert (Skm/BOP family)
           [Cryptosporidium parvum Iowa II]
          Length = 587

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/346 (21%), Positives = 130/346 (37%), Gaps = 74/346 (21%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCF-- 57
           M  + E + + E E +GR +++ + ++ G+ +L + P   Y  + F  N    C  CF  
Sbjct: 14  MWNLPEKIEIKEDERKGRSIIAKEEIQIGESILMEEP---YCRILFSDNIEEICDTCFNY 70

Query: 58  -RKTMHSSSSI--CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
            R     S  I  C  C  + FCS KC   + +  H  + C  L   +     +   +  
Sbjct: 71  LRSEGSYSECILECQECKKIKFCSKKCMEESKTIHH--FECGILKLDILQMISNKVGVSF 128

Query: 114 DR-QLQARFLIAAY---------------NLAVVNPSQFQILL----AFQGTVTDNDTSA 153
           DR +L  RF+I                  +  + N +Q   L+     F  ++ D     
Sbjct: 129 DRSRLLTRFVIKLILELNDKNKSQREQNNSYLISNLNQINSLVDNQEKFLQSIKDVYQEL 188

Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRL----------NAFGL----MEPYIEGQDGRR- 198
           A  L  L  P     I+   + +  D+L          N+FG+    ++  + G      
Sbjct: 189 A--LEILKIPKLKAEIDKLESDIITDKLLVKISCIIDSNSFGIPKFPLKCNVNGDSVSEL 246

Query: 199 ---------------------SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
                                S+  +G++  +S FNH C PN C F  V+    N + + 
Sbjct: 247 VNPRQKAPNSSLELSNSLLNPSILGWGLFSYSSLFNHSCDPN-CDFIGVNPIP-NQSSVT 304

Query: 238 VRMI--HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           + +I    + +  EI ++Y  +      R K LL    F C C+RC
Sbjct: 305 INLIANRKIQKDEEITINYVEIYDTRRNRIKNLLKTKHFICHCERC 350


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 69/302 (22%), Positives = 112/302 (37%), Gaps = 55/302 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P  L   +   +    C +C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDHCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           VA +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  VAKYCSAKCQKKAWQDHKQE--CKCLKSCKPRYPPDSVRL-LGRVVIKLMEKTPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+ + FQ  + +    A+        PP+    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDIFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    D
Sbjct: 220 HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGD 279

Query: 295 EN 296
           E 
Sbjct: 280 EQ 281


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 110/274 (40%), Gaps = 44/274 (16%)

Query: 26  LKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAAA 85
           +K G  ++   P      L   +  + C  CF+       S C    V +C   C   + 
Sbjct: 3   IKKGDCLISSKPFAY--VLSSKHKDNHCDYCFKSGKLLKCSGCQY--VYYCDRSCQKESW 58

Query: 86  SSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN--------LAVVNPSQFQ 137
           S   S   C  L R+     PDAA      +L AR ++             A     +F+
Sbjct: 59  SVHKSE--CINLKRIAPRTIPDAA------RLMARIIVKLQKGGGDEKDYYAKNAYRKFK 110

Query: 138 ILLAFQGTVTDNDTSAAHYLSSLCPP----PASTTIELTAALLA---KDRLNAFGLMEPY 190
            L++    + ++     H++S LC         TT+  +A +L    +  +N++ +++P 
Sbjct: 111 DLMSHYTDIKNDPKRIEHFVS-LCQVLEDFMEGTTLPNSAEILGLYGRICVNSYNILDPD 169

Query: 191 IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--GR 248
           +            GIYL  S  +H C PNA        A    T I++R + D+P+    
Sbjct: 170 MNSI-------GVGIYLGPSVIDHSCKPNA-------VAVFEGTTILIRALEDIPRLDWS 215

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +I +SY  V    STR   L   Y F C+C+RCK
Sbjct: 216 QIHISYIDVLNTTSTRCTELQNTYYFLCECERCK 249


>gi|340055687|emb|CCC50008.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 374

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 73/296 (24%), Positives = 112/296 (37%), Gaps = 65/296 (21%)

Query: 17  GRGLVSTQSLKAGQIVLRDS-PILLYSALPFINSS--SFCHNCFRKTMHSSSSICPSCSV 73
           GRGLV+ +++  G++V+R + P L Y     +N    ++C     +   + ++  P C  
Sbjct: 113 GRGLVAKENIMQGKVVVRRADPWLRYP----VNDRMCAYCAGALPERFFTCAN--PKCHE 166

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            +CS  C T A S  HS        R  +     A  L +  Q++A   I   N A    
Sbjct: 167 EYCSRDCRTMALSLYHS--------RTCSCEGLQALELDMYSQMKASESITEKNAAAAQL 218

Query: 134 SQFQIL-LAFQGTVTDNDTSAAHYLSS----------------------LCPPPASTTIE 170
              ++L ++ Q  +     S    LS                        C    S T E
Sbjct: 219 LMLRVLSVSLQQQIVPTALSEMRILSGRLLFHPKTLAGCFLEIYERFTRACGTATSITYE 278

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
               +LA+   N F   E  +E            + L  S  NH C  N          A
Sbjct: 279 EMVGILARVTANCFH-RENCVE------------LCLPRSMLNHSCEANV---------A 316

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           E+     ++   DVP+GRE+ ++Y+P   + DY  R +  L    F C C RCK E
Sbjct: 317 EDVQTGDLKTTQDVPRGRELTINYYPQLKHLDYLERSQE-LKRRGFTCLCHRCKRE 371


>gi|320166460|gb|EFW43359.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 443

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 133/373 (35%), Gaps = 73/373 (19%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRKTMHSSSSICPSC-SVA 74
           G GLV+T+ ++ G++V+R+  I L S  P   N S  C    R         C  C  V 
Sbjct: 103 GYGLVATRPIRRGEVVVREC-IFLESQPPLEGNDSDPCALTVR---------CEKCVQVY 152

Query: 75  FCSPKCS-TAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
           FCS  C   + A   HS   C AL  L  +   P+     L RQ      I A  L ++ 
Sbjct: 153 FCSTACRDESKARDLHSEIECNALAFLGTDAIHPNDVISDLLRQAIRILSIRAKRLQLLP 212

Query: 133 -PSQFQIL---------LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAK 178
            P + Q+          LA+  T+ D++     ++           + +       +  +
Sbjct: 213 LPGEPQLRNSYENIAKNLAYSPTLADDNDHTLKFVVKFANELVPAAVRIPGPEFLNIFHR 272

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA-SFFNHDCLPNACRFDYV--DAAAENNTD 235
            + NAF ++ P      GR     YG Y       NH C P   R  +       +    
Sbjct: 273 HQCNAFSMLGP------GRMDT-MYGAYTACWHLMNHSCAPVLMREHHAFHRPLVDGLPH 325

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
              R + DV +G EI   Y  +    + R++ L   Y F C+C RC              
Sbjct: 326 FEARAVIDVAEGTEITWCYSNIRNPKAERREHLRQFYGFLCECPRCT------------- 372

Query: 296 NNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPS 355
                     DE+ V             FP      R +C R NC G + P  PS     
Sbjct: 373 --------GPDERRVLE----------GFPE---HRRMLCRRHNC-GFVMPGDPSAEDDG 410

Query: 356 TVMECNVCGNLKN 368
            +  CN CG ++ 
Sbjct: 411 ILGYCNQCGLVRR 423


>gi|242019692|ref|XP_002430293.1| protein msta, isoform A, putative [Pediculus humanus corporis]
 gi|212515408|gb|EEB17555.1| protein msta, isoform A, putative [Pediculus humanus corporis]
          Length = 501

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/289 (21%), Positives = 109/289 (37%), Gaps = 21/289 (7%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M  +++   + + +  GR  V+ Q  + G +++++SP   ++  P  +S   C  C RK 
Sbjct: 1   MLNLNKNFDILKNDKFGRYAVALQDYEPGDLIIQESP---FTYGPKSDSPPVCLGCHRKV 57

Query: 61  MHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQAL-TRLLNLNSPDAANLPLDRQLQA 119
                 +C +C    C  KC      S +   +  ++  R   + +P       +     
Sbjct: 58  --DGKILCSTCGWPVCDEKCEKEPIHSENECIIFSSVKVRFTKVKNPQGNCPQYECITPL 115

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELTA 173
           R L+       +  SQ   L +        D       +   +L  +C      + +L  
Sbjct: 116 RVLLGKEKFPEIWNSQISKLESHNEERQKTDIWKVESINTVGFLHKVCKLQERFSSDLIH 175

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
                   NAF    P   G D R       +Y K+S F H C+PN   + + ++     
Sbjct: 176 FACGILETNAFS--SPSSYGYDMRY------LYPKSSLFLHSCIPNVGHYIFKNSNDGER 227

Query: 234 TDII-VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +++I VR    + +G+E  LSY    +    R+  L     F C C RC
Sbjct: 228 SNVIHVRASLKIKKGQEFTLSYTQTLWPTLLRRSHLKEGKFFDCCCQRC 276


>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 404

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 10/108 (9%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           LL   + NAFGL        +G   +    +Y   S FNH C+PN      VD    +  
Sbjct: 240 LLCIQQCNAFGL-------SNGEGEMTGVALYPALSLFNHSCMPNCA---AVDDGTGSKR 289

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              ++ +  VP G E+ +SY  ++     RQ +L   Y F C C RC+
Sbjct: 290 VCAIKTLVAVPPGEELTISYIDLDLTRELRQDKLEESYAFRCTCARCR 337


>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 635

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 130/335 (38%), Gaps = 73/335 (21%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKT 60
           A S+ ++++  E  GR LV+T+  K G +V  ++P   ++Y+   +    + CH+C  ++
Sbjct: 239 AASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYY----THCHHCLSRS 294

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
            +     C  C VA +CS KC   A   +H    C  +  + NL + D   + +  ++  
Sbjct: 295 YNLIP--CSHCPVAQYCSEKCRKLAWEMAHQ-IECPIMALVGNLLNVDKDKIRMLTKI-I 350

Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
           RFLI                 LA  NP         +G +      +A  L++      L
Sbjct: 351 RFLIVITSKGKKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPL 410

Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
               A   I   AA+L   + N F       + +D          G    RA        
Sbjct: 411 IGISAFACISALAAILLATQTNFFCNKYEVNQLKDISNYPKIIFSGSLMFRACVIMSSNC 470

Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
                        G+Y+  S +NH C PN  R F+           +I R +  +  G +
Sbjct: 471 FSVQQEPGIKTGSGLYITHSLYNHSCAPNTFRHFE--------ELTMITRALRPIYPGDQ 522

Query: 250 ICLSYFPVNYDYST---RQKRLLYDYDFACDCDRC 281
           I  +Y    Y Y T   R+K+++ DY F CDC  C
Sbjct: 523 IFTNY-GAAYAYMTKSERRKKIIQDYFFECDCIAC 556


>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
          Length = 425

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/283 (23%), Positives = 104/283 (36%), Gaps = 58/283 (20%)

Query: 26  LKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAFCSPKCSTAA 84
           +K G  +L   P +    L     ++ C NCF+         C SC  V +C+  C   +
Sbjct: 1   VKKGTTILTGRPFVF--VLRSKYRTTRCDNCFKSEQLVKLLKCSSCQYVYYCNRNCQKQS 58

Query: 85  ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI-----AAYNLAVVNPSQFQIL 139
                   VC  L ++L +  PD+       +L AR +I      A  +       ++  
Sbjct: 59  WRIHKRECVC--LKKILPVILPDSV------RLVARIIIKLKQGGADEMDYYTKENYRKF 110

Query: 140 LAFQGTVTDNDTSAAH-------------YLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
                  +D    A H             +L     P  +  I +   + + +RLN    
Sbjct: 111 TDLMSHYSDMRADAKHMEPFIKLYRFLAEFLDETLMPSIAELISIYGKM-STNRLNI--- 166

Query: 187 MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
                  QD   +V  +GIYL AS   H C PNA       A  E  T II++       
Sbjct: 167 -------QDEWLNVIGFGIYLGASVVGHSCKPNAV------ATFEGTTIIIIK------- 206

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
                +SY  +      R++ L   Y F CDC+RC+ E + ++
Sbjct: 207 -----ISYVDLIKSKKDRREELYSLYHFWCDCERCEKEESLTE 244


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 111/302 (36%), Gaps = 55/302 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P  L   +   +    C  C     K M      C  C 
Sbjct: 14  RGNGLRALAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMQETPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+ L FQ  + +    A+        PP+    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLALTFQHFMREEIQDASQL------PPSFDIFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN        +   N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
            +++R + DV  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    D
Sbjct: 220 HLLLRTVRDVEAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRCQTQDKDADMLTGD 279

Query: 295 EN 296
           E 
Sbjct: 280 EQ 281


>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
           morsitans]
          Length = 395

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH CLPNA   F Y      +N  ++++ +  +  G EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCLPNAQVTFPY------SNDIVVLKALQPIQVGEEICISYLDEGQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +RQK L  +Y F C+C +C+ EAN
Sbjct: 342 SRHSRQKILKENYIFVCECFKCQREAN 368


>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
          Length = 553

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 127/312 (40%), Gaps = 26/312 (8%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSC-SVA 74
           RGR + ST ++  G +++   P+   +    IN   +C+ CFRK        C  C SV 
Sbjct: 20  RGRYIKSTATIPKGTVIITSQPLGTVALPQTINE--YCNYCFRKQTRPPLQRCSRCKSVY 77

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ---ARFLIAAYNLAVV 131
           FC   C + A  S H  +VC    R+ + ++ +  +L +  ++    +R+   A      
Sbjct: 78  FCDMGCFSNAWLSYHQ-FVCDP-ARVRHEDAENELDLEMLEKVALNVSRYRKRAKTETEA 135

Query: 132 NPSQFQILL-AFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLAKDRL----NAF 184
                ++ + AF   +  +   A   L+S       A     +    + +D L    N F
Sbjct: 136 EGETVEVTMEAFFSLMGHDALQAGRVLASHRRLAREALKRAHVQQTSIDEDELVHYLNVF 195

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                 ++ Q+        G Y  AS FNH C PNA    +  A AE      +R I  +
Sbjct: 196 KSNNFTLDDQE--MFAVGEGTYPVASLFNHTCRPNAV-IVFDGALAE------IRAIDTI 246

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN--DDENNEEVMD 302
               EI +SY    +  + R++ L   Y F C C RC  +       +  DD   EE  D
Sbjct: 247 EPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCTRQEGPKAYLSLVDDLLGEEEDD 306

Query: 303 EDQDEQMVASDD 314
            D+ +Q++ + D
Sbjct: 307 WDRAQQLLGNQD 318


>gi|326510051|dbj|BAJ87242.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 293

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 69/295 (23%), Positives = 117/295 (39%), Gaps = 39/295 (13%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           V VA  +  GRG+ +T+ +  G+++    P++ + +    +    C+ C ++        
Sbjct: 13  VRVALTDSAGRGVFATRPIATGEVLHSAQPLVTHPSRSLFHE--VCYRCLKRKAGKGDDS 70

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQARFL---I 123
              C   FCS  C    A   H        + L +  S      P + ++L    +   +
Sbjct: 71  RGDCY--FCSDAC-WEHAEGFHGIEKKADWSLLDDHCSSRGLKYPYMVKRLACMVISGDV 127

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-------- 175
            A  L ++ P+Q       Q T+ + +       S+     A    ELT  L        
Sbjct: 128 GADCLDILQPAQLH-----QDTIIEMEEEFELLKSTF--RKAGFQEELTTFLTKQWYINV 180

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDYV 226
           LA+ R+NAF +       +D   S  A           +Y+  SF+NHDC PN       
Sbjct: 181 LARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWLE 240

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +A A+ NT      + D+ +G E+ + Y   + + + RQK L   + F C C RC
Sbjct: 241 NADAKLNT------LRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRC 289


>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 422

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 47/107 (43%), Gaps = 14/107 (13%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           LL     N FGL + Y E            IYL+AS FNH C PNA R          + 
Sbjct: 232 LLTMSTRNCFGLWKNYEE-------CFGLAIYLEASLFNHSCYPNAAR-----VQRGRSI 279

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           DII   I D+    EIC+SY  +      R+  L  +Y F C C RC
Sbjct: 280 DIIA--IRDIEPNEEICISYLNITNGSHERKDHLKNNYLFDCVCIRC 324


>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Anolis carolinensis]
          Length = 485

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 115/290 (39%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           ETV V + +G+GRGL + +   AG ++  +     Y+A+ F + +   CH CF++  H+ 
Sbjct: 2   ETVEVFDTDGKGRGLKAAKEFWAGDVIFAERA---YAAVVFDSLTHLVCHTCFKR--HAK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ +  +  +P+  N+ L  ++  R   
Sbjct: 57  LHRCGQCKFAHYCDRTCQKDAWVNHKNE--CAAIKK--HGKAPN-ENIRLAARIMWRIER 111

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
               L     V+    Q  +   G     +       +L           ++  + +L  
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLESFLEFWPHDGKQFAMQYISHILGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              N F L +     Q G ++V   GI+      NHDC PN          + V +    
Sbjct: 172 INCNGFTLSD-----QRGLQAV-GVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHT 225

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R +  +  G E+ +SY         R+++L   Y F C C+ C+
Sbjct: 226 QMRIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 275


>gi|270011320|gb|EFA07768.1| hypothetical protein TcasGA2_TC005322 [Tribolium castaneum]
          Length = 1086

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/352 (20%), Positives = 132/352 (37%), Gaps = 61/352 (17%)

Query: 1    MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
            + + S  + +   E   R + + + ++ G+I+  + P +   A   +     CH C++  
Sbjct: 719  IPSASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFTLAAADLYH---CHECYQ-- 773

Query: 61   MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
            +  +   C  CS   +C  +C   A    H  Y C  L  L N+     A L     ++ 
Sbjct: 774  LCYNPIPCEICSQTLYCGEECRDKAREKYHQ-YECPILISLKNIVGKHKAFL---LAIKM 829

Query: 120  RFLIA----------------------AYNLAVVNPSQFQILLAFQGTVTDNDTSA---- 153
             F+I+                       +  A++    + ++  + G   ++D  A    
Sbjct: 830  SFMISDENDVPEVYALVENLSRDNNDEVFTTALITALMYHLVKTYTGKFPEDDLEAENKF 889

Query: 154  AHYLSS---LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
             H+L +   +C   A+   EL    +++ +       EP  E    +       +Y   +
Sbjct: 890  KHFLMTHLRICLTHAAGIDELYPNQVSEGQ-------EPGQELLSFKSETVGCALYPFYA 942

Query: 211  FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST--RQKRL 268
             F H C PN         A  + T  ++R +  + +G+E  +SY P   ++S   R+ RL
Sbjct: 943  LFRHACCPNV-------FAHHHGTQRVLRAVRTIHEGQECFVSYGPYYVEHSKQERKSRL 995

Query: 269  LYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
            L  Y F C C  C  E +W   D     N   +   QD   VA   +A +  
Sbjct: 996  LSQYHFTCKCRAC--EEDWPQLD----LNPYYLTSKQDLDFVAKIRNANIEA 1041



 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 57/300 (19%), Positives = 123/300 (41%), Gaps = 31/300 (10%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHS 63
           + +V +   E   + +V+T+ ++ GQ++  ++P +  +AL  IN   F CH+C+   +  
Sbjct: 202 TNSVEIKVDEKLRKRVVATKDIQIGQVIAVETPCV--AAL--INVVLFHCHDCY--ILCY 255

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPY--VCQALTRLLNLNSPDAANLPLDRQLQAR 120
           +   C +C+ V +CS  C   A +  H     +  ++ +L+ +++     L +   +Q +
Sbjct: 256 NPIPCKTCTEVVYCSEACRENAFAKYHQKECPIYLSMRKLVGIDTHFQWALKMTLLVQTQ 315

Query: 121 FLIAAYNLAVVNPSQFQILLA--FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
                  +   +    + L+A  F   + +  T     L+ +          +  + +  
Sbjct: 316 ADKCCDRIDTDDDRTRKTLMATLFYHLIKNCTTV----LAGVDEAGVKNFKRVLMSYMHI 371

Query: 179 DRLNAFGLMEPYIEGQ----DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
              +   + E ++ G     + ++   A  +Y  +    H C PN            +  
Sbjct: 372 CDYHVSDIDEIFVHGGSRDLELKQETFAKAMYPFSDKLRHSCCPNV-------MGWYHGV 424

Query: 235 DIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
             ++R I  + +G E   SY P+  N +   RQ  + + Y+F C C  CK   NW   ++
Sbjct: 425 TRVLRAIRTIKKGEECFFSYGPLYTNIEKEERQNYIFFIYNFKCACRACK--QNWPQTES 482


>gi|451845079|gb|EMD58393.1| hypothetical protein COCSADRAFT_350740 [Cochliobolus sativus
           ND90Pr]
          Length = 739

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 107/287 (37%), Gaps = 80/287 (27%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           VTV + +G+GRGL +T S+KAG +VL                                  
Sbjct: 355 VTVQKAKGKGRGLFTTASMKAGDLVL---------------------------------- 380

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
              C  AFC      +  S++        +T L+N+ + D A +      QA  +     
Sbjct: 381 ---CEKAFCHAHVDDSGESNA-------KMTLLMNVET-DTAFMGG----QAHLIQLITQ 425

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFG- 185
               NPS    + A    +   D      +S    P   T  +E T AL      N FG 
Sbjct: 426 KLYKNPS----MAAAFTELHHGDYKTVDKMSVDGQPVVDTFLVERTMAL------NVFGC 475

Query: 186 ----------LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                     +M   +  +DG  +  + GI++KAS+ NH CL N  R    D        
Sbjct: 476 PVTSLKSHNDVMSNKVSKRDG--TFHSCGIWIKASYINHSCLGNVRRSFIGDM------- 526

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +IVR   D+  G E+   Y     DY+ + ++   ++ F C C  C+
Sbjct: 527 MIVRACKDLEAGTELAFPYVAPEGDYTPKVEQKFKNWGFICRCALCE 573


>gi|320591240|gb|EFX03679.1| oxysterol-binding protein [Grosmannia clavigera kw1407]
          Length = 850

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 44/92 (47%), Gaps = 21/92 (22%)

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI----HDVPQGRE 249
           QDGR       IY KA+  NHDC PN             NTD+  R +     D+  G E
Sbjct: 203 QDGR-----VAIYPKAALVNHDCRPNVF-----------NTDVAGRRVIMATRDIAVGEE 246

Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +  +Y P+  D +TRQ+RL+  Y F C C  C
Sbjct: 247 LLTTYVPLLADTTTRQRRLI-QYGFHCSCVAC 277


>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
 gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
 gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
          Length = 338

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 9/86 (10%)

Query: 203 YGIYLKASFFNHDCLPNACR--FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           YGI+ K S FNH C PN+ R     +D A       +V ++ DV +G EI +SY      
Sbjct: 118 YGIFPKVSRFNHGCRPNSMRSYHPVLDQA-------VVHVVKDVSEGEEITVSYVEPGLA 170

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
           +  RQ++L   + F C C+ C +  N
Sbjct: 171 FYLRQEQLKEKFGFICGCNLCLMPEN 196


>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
 gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
          Length = 391

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + QG EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALGPIQQGEEICISYLDECMLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368


>gi|157169545|ref|XP_001657892.1| hypothetical protein AaeL_AAEL001050 [Aedes aegypti]
 gi|108883672|gb|EAT47897.1| AAEL001050-PA [Aedes aegypti]
          Length = 662

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 82/316 (25%), Positives = 119/316 (37%), Gaps = 66/316 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSC-SV 73
           RGR +V+ + LK+G+ VL +     Y A  +     + C+ CF + +  +S  C  C  V
Sbjct: 272 RGRYIVAAEDLKSGETVLVEPA---YGACLYPKYFGTHCNQCFTRLV--ASVPCEQCCGV 326

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLN----------------LNSPDAA-----NLP 112
           AFCSP+C   A +  H  + CQ L  ++                   SP+ A     NL 
Sbjct: 327 AFCSPECRDRACAGYHR-FECQYLDLMIGSGMSILCHLAFRLVTQAGSPEKAAEITKNLA 385

Query: 113 LDRQLQAR--------FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP 164
           L    + R         L+AA+ L  +  S+F              T AA       P P
Sbjct: 386 LCSHAEKRSHEDYFRRTLMAAFLLRCLQKSEF---------FGRRKTEAAE------PTP 430

Query: 165 ASTTIE-LTAALLAKDRLNAFGLMEPYIEGQDGRRSVR----AYGIYLKASFFNHDCLPN 219
               +  +   LL   + NA  + E  I G     S +      G+Y   +  NHDC   
Sbjct: 431 IEAKVGGVILELLQALQFNAHEIYEVKIAGDHRVDSAKVQYIGVGVYKTGAMLNHDCHSG 490

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQ--GREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
             R  +V +    +T+  +     VP+  G        PV      RQK L   Y F CD
Sbjct: 491 VSR-TFVKSTMILHTNRPLTKGSLVPENYGMHFLRQPLPV------RQKVLRSRYWFKCD 543

Query: 278 CDRCKVEANWSDNDND 293
           C  C  +    D  ND
Sbjct: 544 CKACFEDWPVFDKMND 559


>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
 gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
 gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
          Length = 393

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + QG EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQQGEEICISYLDECMLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/287 (20%), Positives = 111/287 (38%), Gaps = 23/287 (8%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
           E V + +  G+GRGL +T+   AG ++  +  +  +++ +L    +   CH+CFR+    
Sbjct: 2   ENVAIFDSPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSL----AERICHSCFRR--QE 55

Query: 64  SSSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
               C  C  A +C   C  A  A         +A  +  N N    A L + R  +   
Sbjct: 56  KLQKCSQCKFAHYCDRTCQRAGWAEHKQECGAIKAYGKAPNENIRLVARL-MWRLDKEGS 114

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
           +++   L  V   +  +    +  + +      ++L          TI+  + +      
Sbjct: 115 VVSDMQLITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISHIFGVINC 174

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------DYVDAAAENNTD 235
           N F      +  Q G ++V   G++      NH+C PN            V+    +   
Sbjct: 175 NGFT-----VSDQRGLQAV-GVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRR 228

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           I +R +  + +G E+ ++Y         R++ L   Y F C CD CK
Sbjct: 229 IELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCK 275


>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
 gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
          Length = 392

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + QG EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALGPIQQGEEICISYLDECMLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368


>gi|255588343|ref|XP_002534573.1| protein with unknown function [Ricinus communis]
 gi|223524997|gb|EEF27811.1| protein with unknown function [Ricinus communis]
          Length = 319

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 68/298 (22%), Positives = 118/298 (39%), Gaps = 48/298 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + V   E  GRG+ ST+ +  G+++    PI+ Y +    ++++ C+ C +K   + +  
Sbjct: 42  IRVGVTESAGRGVFSTRRISGGELIHNAKPIVSYPSRS--STNTVCYFCLKKLASTENR- 98

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIA 124
               SVAF    CS     ++   Y  +        +          PL  +  A  +I+
Sbjct: 99  ----SVAF----CSQECKQNAKVFYDVETKADWSGFDDYCRTQGLKYPLMVKRLACMVIS 150

Query: 125 AYN----LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL----- 175
                  L ++ P+     +  +           + L   C   A+   +  A L     
Sbjct: 151 GAATVECLDILQPANLSPEMILE-------MEEGYDLLRSCFTKANIADDRLAFLTRQWY 203

Query: 176 ---LAKDRLNAF------GLMEPYIEGQDGRRSVRAY---GIYLKASFFNHDCLPNACRF 223
              LA+ R+NAF      GL E  +          A     +Y+  SFFNHDC PNA   
Sbjct: 204 INQLARIRINAFRIELAVGLYEDLLSSAAACIEAEAAVGNSVYMLPSFFNHDCDPNA-HI 262

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +++     N D  ++ + D+    E+ + Y   + D+  RQ  LL  + F C+C RC
Sbjct: 263 IWIE-----NADARLKALRDIDPDEELRICYIDASMDHGARQTILLQGFGFKCNCLRC 315


>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
          Length = 359

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 4/88 (4%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           +S R  G+Y     FNH C PN  +  Y ++  E  + + +R   D+  G E+C+SY   
Sbjct: 253 QSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---DIEPGEELCISYIDE 308

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           +  Y  RQ+ L   Y F C C +C+ EA
Sbjct: 309 SLSYPERQQELYEHYLFVCHCPKCEREA 336


>gi|357120626|ref|XP_003562026.1| PREDICTED: uncharacterized protein LOC100843629 [Brachypodium
           distachyon]
          Length = 542

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  SF NH C PNACR    D A       IV    ++  G EI   YF V    
Sbjct: 318 GVGLWVLPSFINHSCHPNACRTHVGDHA-------IVHASREIKAGEEITFPYFDVLVPV 370

Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
             RQ+     + F C CDRC+ EA
Sbjct: 371 GKRQE-AARAWGFECRCDRCRFEA 393


>gi|390601664|gb|EIN11058.1| hypothetical protein PUNSTDRAFT_101047 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 241

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 13/115 (11%)

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
           A+ ++D  N+FG+   +    D    +  +GI+  ASFFNH C PN    D +  A E  
Sbjct: 130 AIASRDVHNSFGI---WSHPGDSTAEMLGFGIWPDASFFNHSCAPNMGTKDRLGRAWE-- 184

Query: 234 TDIIVRMIHDVPQGREICLSYFPV----NYDYSTRQKRLLYDYDFACDCDRCKVE 284
                R    + +  E+ +SY  +      D   R++RL   + F C C RC++E
Sbjct: 185 ----FRSSKQIQKDEELFISYLSLEELEEMDAFARRERLQNVWGFECSCHRCRIE 235


>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
          Length = 554

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 118/287 (41%), Gaps = 41/287 (14%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM-HSSSSICPSCSVA 74
           + R + + Q+  AG  +    P   ++ +   +  + C +C + T        C SC +A
Sbjct: 26  KKRSVHARQAFAAGDTLFETRPP--FAVVGNSDILTRCFHCLKPTSPERELKRCSSCQLA 83

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPL-DRQLQARFLIAAYNLAVV 131
            FC P C  A  +     Y C A  R    ++P + A LPL + +L AR + A  +    
Sbjct: 84  RFCGPACVKAGWTPEGHKYECAAFKRCGFKDTPASLAKLPLTEARLLARMIWAQKH---- 139

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT----------AALL--AKD 179
           +PSQ +  L       DN   A   L +     AS  +  +          AALL  AK 
Sbjct: 140 DPSQHRYDLHGDVDCFDNKEKA---LWARRIQEASVLVGQSLFNEVLGNAKAALLQMAKV 196

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPN--ACRFDYVDAAAENNTD 235
           ++NA  L  P+ E         A G+ L +SF   NH C PN  A    + D   E    
Sbjct: 197 QMNAHTLCTPFGE---------ACGVSLSSSFALINHSCAPNTFAMSSHWPD---EKPKY 244

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           + V     +  G EI ++Y  V  +   R++ +   Y F CDC  C+
Sbjct: 245 LRVAACRPIKAGDEITIAYVDVEEENLQRRQTIKATYGFDCDCRLCE 291


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 138/371 (37%), Gaps = 84/371 (22%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + +A I G+GRGLV+T+    G+++L   P   Y++ P  I   S C +CF     S+  
Sbjct: 17  LAIASIPGKGRGLVTTRRFFPGEVILCGEP---YASTPNKILVGSNCDHCF---TSSNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A++       ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSVCQKEEWKLHQ--LECRAISALTDDRKKMLTPTIRLMVRLILRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTD-NDTSAAHY-----LSSLCPPPASTTIELT 172
              +I +          + ++ A +  +++ +D     Y     L  L  P     ++ T
Sbjct: 129 NEKVIPS-----TGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLILPAIELDLKET 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           A + +K   NA  + +P +      R V   G++   S  NH C+PNA          E 
Sbjct: 184 AHIFSKFSCNAHTICDPEL------RPV-GTGLFPAISTINHSCVPNAVLL------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
            T   VR +  +    E+ +SY         R   L + Y F C C RC           
Sbjct: 231 RTA-YVRALQPLSNNTEVSISYIETAATTLKRHNDLKH-YFFTCTCPRC----------- 277

Query: 293 DDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDA 352
                           +  S++D  + G            Y C   NC G L  LP S  
Sbjct: 278 ----------------IKGSEEDPLLEG------------YRCKDQNCDGFL--LPDSGK 307

Query: 353 TPSTVMECNVC 363
              T  +C++C
Sbjct: 308 KAYTCQKCSMC 318


>gi|91089699|ref|XP_974834.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
 gi|270011321|gb|EFA07769.1| hypothetical protein TcasGA2_TC005323 [Tribolium castaneum]
          Length = 543

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 120/323 (37%), Gaps = 69/323 (21%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           + + S  + V E E  GR +V+T+ +K G+++  + P++    L     S+ CH C   +
Sbjct: 180 IQSASACIAVREEENWGRFVVATRDIKVGEVLAVEKPLV---TLVVNELSNHCHECV--S 234

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +  +   C +C+ A +CS  C   A    H  Y C  L  L  L         L   LQ 
Sbjct: 235 LCYNLIPCKTCTQAMYCSESCRDYAFDMYHK-YECSILATLRFLQFDKLKLFALKISLQY 293

Query: 120 RFLIAAYNLA------------VVNPSQFQILLAFQ-----GTVTDNDTSAAHYLSSL-- 160
             L    +L             V N ++  +   F+       + D   +  ++ S+   
Sbjct: 294 NDLGETSDLLYRSDRYREIHNLVTNTTKRSVPDLFERATAAALIYDLVKTHTNFFSAFNQ 353

Query: 161 -------------CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL 207
                         P      +EL          N+ G+ EP            A G + 
Sbjct: 354 NNFKEILLLHMQTGPSNFHEIVELVP--------NSRGIYEP---------EEIASGAFA 396

Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY---DYSTR 264
             S  NH C PN  RF Y        + +++R I ++ +G E C   +  ++   D S R
Sbjct: 397 FLSLLNHSCCPNVARFSY-------GSTLVLRAIQNIQEG-EQCFDNYGYHFALMDKSER 448

Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
           +K L   Y F C C  C  E NW
Sbjct: 449 KKHLQSQYYFNCVCQAC--EKNW 469


>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
 gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
          Length = 430

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/311 (24%), Positives = 127/311 (40%), Gaps = 75/311 (24%)

Query: 15  GRGRGLVSTQSLKA-GQIVLRDSPILLYSALPFI----NSSSFCHNCFRKTMHSSSSICP 69
           GR   +V++ S  + G+++L + P        FI      +  C+ CF+  +  S   C 
Sbjct: 23  GRYYEVVASSSTTSFGKLILHEKPFC------FIIDDRERTHCCNYCFKANIKLSR--CK 74

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
           +C V   S  CS     S H    C+AL        P   N         R L+  + L+
Sbjct: 75  ACKV---SHYCSMNCYHSDHHLKECKALA-----THPKVTN-------SIRLLLKCF-LS 118

Query: 130 VVNPSQFQIL---------------------LAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
             +  QF+++                     + F   V D D S++ +L +         
Sbjct: 119 CSDEKQFEMIQNLAPSSSSDEKVEDNLIKLAILFADYVKDVDLSSS-FLDN-----RKED 172

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVD 227
           ++    LL K + N F +    +            GIYLKAS FNH C+PN A  FD   
Sbjct: 173 LDFIYLLLLKLQRNTFSICNEEMNAIGS-------GIYLKASMFNHSCVPNCAILFD--- 222

Query: 228 AAAENNTDIIVRMIHD---VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
               ++ ++ VR+++    + +G  + ++Y  +    + RQK+L   Y F C C RC + 
Sbjct: 223 ----SDKNLYVRILNPSSLLEEGTPLTINYVDLMDLTANRQKKLKEQYHFTCTCPRC-LN 277

Query: 285 ANWSDNDNDDE 295
           +N   N+N D+
Sbjct: 278 SNEEVNENVDK 288


>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
 gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
          Length = 600

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 64/270 (23%), Positives = 101/270 (37%), Gaps = 25/270 (9%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GRG+ + + +  G+++ R+  +L+        +   C  C+R+     S     C+    
Sbjct: 37  GRGIFAVRDIAQGELLFRERTLLVGPTAHRGRNLRTCTQCYRQPAGEDSDAAALCTAGCG 96

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P C    AS  H+   CQ L R      P   +    R L+   +++      +N +Q 
Sbjct: 97  LPVCGGCVASPRHA-VECQ-LFRKWQPKDPSRID---PRALR---ILSVVRCFFLNETQR 148

Query: 137 QILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIE 192
           ++L A Q             AA        P     +E     +     NAF   E   +
Sbjct: 149 KLLYAMQANADRYYMREVERAAECFEHF--PREQDMLEYFYRTVCAFNTNAF---ESRCQ 203

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
             DGR  V A  ++  A   NH C PNA          E+   I+V     +P G EI +
Sbjct: 204 -VDGREVV-ARALFPLAGMLNHQCTPNAAHH------FEDGETIVVTATERIPLGAEITM 255

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           SY  + +    R+  L     F C C RC+
Sbjct: 256 SYAKLLWSTLARKMFLGMTKHFMCQCPRCQ 285



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 380 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWSNLT 433

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     F C+CDRC
Sbjct: 434 RGIFLKMTKYFVCNCDRC 451


>gi|448122372|ref|XP_004204432.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
 gi|358349971|emb|CCE73250.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
          Length = 485

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 62/326 (19%), Positives = 118/326 (36%), Gaps = 60/326 (18%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI------NSSSFCHNCFRK 59
           + + V +   +G+GL + +++K G+ +  + P+    AL  +       + S+C     +
Sbjct: 108 DKIKVLQTSKKGKGLYAKKNVKKGEFLWEEKPLFYVPALANMTLIKNGKACSYCGKLLTQ 167

Query: 60  TMHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
               SS +       C  CS  +CS +C    +          AL +  N+ +P  +N  
Sbjct: 168 GTRGSSGVSYLRGLDCDVCSEIWCSKECKNTDSK-------LHALLKH-NIYNPSKSNTR 219

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT---- 168
            +    A   +  Y L     + + I   +   V+D         S++      T     
Sbjct: 220 KNINANAYLALQDYCLKEQWNALYAITRIYADIVSDKSGVKQKQFSAMARISQETRYKAL 279

Query: 169 ----------------IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG-------- 204
                           ++     L K+  + F  + P I   DG+ +   +         
Sbjct: 280 DSSAGSFDNFQGGALFVQEQQESLWKEGFSLFSNVFP-ISVDDGKINYEEFMYMLGTYNI 338

Query: 205 ------IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
                 I+L  S  NH+C P+      V  ++     + V    D+  G E+  SY   +
Sbjct: 339 NNLDSIIFLLQSHLNHNCEPSTT----VKLSSNRTEGLKVYAARDIKAGEELTTSYVNPS 394

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVE 284
           +    RQ+ L  ++ F C C++CK E
Sbjct: 395 HTVQQRQRELRVNWGFRCACEKCKNE 420


>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
 gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
          Length = 532

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/326 (22%), Positives = 125/326 (38%), Gaps = 49/326 (15%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------ 67
           E  G G+ +++   AG+ V      L+ S L        C NC+  T  SS+        
Sbjct: 29  ESNGNGVFASRRFGAGEEVASFKRPLVGS-LETERLLDTCANCYVWTEGSSTGTRLYVPE 87

Query: 68  ------CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
                 C +C+   +CS  C  AA +  H  + C+ L  +     P A        L   
Sbjct: 88  GVKVDKCAACARFRYCSKACQKAAWNRGHK-HECKVLKPMAGRGLPKAF-------LACI 139

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELT 172
            L+      +++   ++++      V D        N    A        PP     +  
Sbjct: 140 ELLTRRKHGLISDQDWEMVCRLPSHVDDFKRNGTYGNIEMMAMGAPQFALPPNMFDRDFI 199

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAA 230
           AA+ A+   NA  ++ P ++           GI L  +    NH C PNA  F  +D  +
Sbjct: 200 AAMYARVMSNALTIITPTLD---------PLGIILDPTLCSLNHSCDPNA--FIMMDGPS 248

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS-D 289
                + +R +  + + +EI +SY    Y Y  RQ+ L   + F C C +C+ +A    D
Sbjct: 249 -----VSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKCQEKATLQED 303

Query: 290 NDNDDENNEEVMDEDQDEQMVASDDD 315
           N     +++ V+D +  + M  + + 
Sbjct: 304 NWLVPADSKFVLDPEAKQAMAQTQEQ 329


>gi|6325092|ref|NP_015160.1| Set6p [Saccharomyces cerevisiae S288c]
 gi|74583857|sp|Q12529.1|SET6_YEAST RecName: Full=Potential protein lysine methyltransferase SET6;
           AltName: Full=SET domain-containing protein 6
 gi|1370351|emb|CAA97872.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|1403544|emb|CAA65556.1| P2545 protein [Saccharomyces cerevisiae]
 gi|285815377|tpg|DAA11269.1| TPA: Set6p [Saccharomyces cerevisiae S288c]
          Length = 373

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  M  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTMNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
 gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
          Length = 545

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 10/107 (9%)

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
           L+   L+E  + G+   ++   YG+ L   ASF NH C+PNA R    D        +IV
Sbjct: 307 LDVNSLVEDAVSGKVMGKNKEYYGVGLWTLASFINHSCIPNARRLHVGDY-------VIV 359

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
               D+  G EI  +YF V      R K +   + F C C RCK E+
Sbjct: 360 HASRDIKTGEEISFAYFDVLSPLEKR-KEMAESWGFCCGCSRCKFES 405


>gi|299739226|ref|XP_001835144.2| hypothetical protein CC1G_07286 [Coprinopsis cinerea okayama7#130]
 gi|298403677|gb|EAU86628.2| hypothetical protein CC1G_07286 [Coprinopsis cinerea okayama7#130]
          Length = 620

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 9/113 (7%)

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
           A+ A+D  N FG+ E   E +D    +  +G+Y+  ++FNHDC PN  +         + 
Sbjct: 509 AITARDHSNVFGIWEQAEEEEDADCEMLGWGMYISGAYFNHDCSPNLKK-------RRSG 561

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRCKVE 284
             +    I DV  G E+C +Y  +  + S   R + L  ++ F C C RC  E
Sbjct: 562 RSMQFFTIRDVQAGEELCTNYIDIGEECSVEMRNEELEREWFFRCACGRCVRE 614


>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
 gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
          Length = 393

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  I+++ +  + QG EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIIVLKALAPIQQGDEICISYLDECMLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCPKCRAQAS 368


>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
          Length = 969

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 10/107 (9%)

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
           L+   L+E  + G+   ++   YG+ L   ASF NH C+PNA R    D        +IV
Sbjct: 731 LDVNSLVEDAVSGKVMGKNKEYYGVGLWTLASFINHSCIPNARRLHVGDY-------VIV 783

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
               D+  G EI  +YF V      R K +   + F C C RCK E+
Sbjct: 784 HASRDIKTGEEISFAYFDVLSPLEKR-KEMAESWGFCCGCSRCKFES 829


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/276 (24%), Positives = 111/276 (40%), Gaps = 32/276 (11%)

Query: 17  GRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--CPSC-S 72
           GR + +T+  K G ++ R D+P ++    P   +  +C++C +K   +   +  C  C +
Sbjct: 19  GRSIYATRRFKPGDVIARFDNPAVVLP--PGHRALEYCNHCVKKQRPAGVKLRACTGCKT 76

Query: 73  VAFCSPKCSTAAASSSHSPYVCQALTRLLN---LNSPDAANLPLDRQLQARFLIAAYNLA 129
           VA+C P C  A  S  H    C+A+ RL      + PD    P+  +  A+ ++    LA
Sbjct: 77  VAYCGPACQRANWSLVHK-LECKAIQRLHEAKPAHQPDWVPTPI--RAAAQVMLRPQVLA 133

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS-TTIELTAALLAKDRLNAFGLME 188
                +  +    +   TD    +   +  L     +   +E +  +L K + NAF   E
Sbjct: 134 RFEELEGHVEQWRKKDGTDLQLQSHGVVKCLGLDTVTFERLETSFQVLCKLQTNAFSRTE 193

Query: 189 PYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
            Y E           G++L  +    NH C+PNA     V       T   +R    +  
Sbjct: 194 EYYE---------TGGVFLDTTLAMINHSCVPNAL----VQFGGRTAT---LRATSFLDP 237

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           G EI +SY         R   L   Y F C C +C+
Sbjct: 238 GDEIEISYIDQTQPRGKRHGELDL-YHFECSCYKCQ 272


>gi|440471791|gb|ELQ40736.1| ankyrin repeat protein [Magnaporthe oryzae Y34]
 gi|440479502|gb|ELQ60265.1| ankyrin repeat protein [Magnaporthe oryzae P131]
          Length = 2018

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 197  RRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
            RR V  +  +++ S  NH+C PN A  FD +  +        +  I D+  G E+ +SY 
Sbjct: 1675 RRGVDFHSTFVEVSRMNHECSPNCAAYFDPMTMSQR------IYAIRDIMPGEELTVSYI 1728

Query: 256  PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                    RQ RL  D+ F C C RC  EA++ 
Sbjct: 1729 DPVQTREARQNRLRKDWGFGCSCQRCTSEAHFG 1761


>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
           latipes]
          Length = 495

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           DG       G+Y   S  NHDC PN C   +      N T++ +R + D+  G E+ +SY
Sbjct: 250 DGELQEIGVGLYPSLSLLNHDCRPN-CVMVF------NGTELQLRAVQDINPGEELRISY 302

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
                    RQK+L   Y F C C RC
Sbjct: 303 IETLSLTEDRQKQLEEQYHFTCHCQRC 329


>gi|194889892|ref|XP_001977181.1| GG18383 [Drosophila erecta]
 gi|190648830|gb|EDV46108.1| GG18383 [Drosophila erecta]
          Length = 402

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 111/282 (39%), Gaps = 37/282 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+++ L+AG+ ++R+ P+ +    P ++    C  C++  ++ S    CP C+   
Sbjct: 57  GRYLVASRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKSDQYRCPGCAWPL 113

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
           C   C+       H+   CQ    L       A  L  +R   A          ++    
Sbjct: 114 CGSACAGLKQRHGHTETECQ----LYGERRAVAGELLTERAGPAEVRDLYELIMIVRILL 169

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
           L   +P QF ++   +    +   +A    HY   +       T +L   L A+      
Sbjct: 170 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-HRLRVTWQLE-DLEAEQVHEVC 227

Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           G+++   +  GQ+G    +A  +Y  A    HDC PN    D        + +I++R   
Sbjct: 228 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 279

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
            V +   + LSY    Y      KR  + ++   F C C RC
Sbjct: 280 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 318


>gi|91089701|ref|XP_974846.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 561

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/352 (20%), Positives = 131/352 (37%), Gaps = 61/352 (17%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           + + S  + +   E   R + + + ++ G+I+  + P +   A   +     CH C++  
Sbjct: 194 IPSASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFTLAAADLYH---CHECYQ-- 248

Query: 61  MHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           +  +   C  CS   +C  +C   A    H  Y C  L  L N+     A L     ++ 
Sbjct: 249 LCYNPIPCEICSQTLYCGEECRDKAREKYHQ-YECPILISLKNIVGKHKAFL---LAIKM 304

Query: 120 RFLIA----------------------AYNLAVVNPSQFQILLAFQGTVTDNDTSA---- 153
            F+I+                       +  A++    + ++  + G   ++D  A    
Sbjct: 305 SFMISDENDVPEVYALVENLSRDNNDEVFTTALITALMYHLVKTYTGKFPEDDLEAENKF 364

Query: 154 AHYLSS---LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
            H+L +   +C   A+   EL    +++ +       EP  E    +       +Y   +
Sbjct: 365 KHFLMTHLRICLTHAAGIDELYPNQVSEGQ-------EPGQELLSFKSETVGCALYPFYA 417

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRL 268
            F H C PN         A  + T  ++R +  + +G+E  +SY P  V +    R+ RL
Sbjct: 418 LFRHACCPNV-------FAHHHGTQRVLRAVRTIHEGQECFVSYGPYYVEHSKQERKSRL 470

Query: 269 LYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
           L  Y F C C  C  E +W   D     N   +   QD   VA   +A +  
Sbjct: 471 LSQYHFTCKCRAC--EEDWPQLD----LNPYYLTSKQDLDFVAKIRNANIEA 516


>gi|303282313|ref|XP_003060448.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457919|gb|EEH55217.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 516

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           R+ ++ G+Y   SF NH C PNAC+             +  R   D+  G E+C+ YF V
Sbjct: 195 RNEKSVGVYPLMSFANHSCAPNACKLLI-------GHTMFTRAARDLVAGEEVCVKYFDV 247

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEA 285
               S R   +   + F C C RC +EA
Sbjct: 248 TAPKSERNA-VAKRWGFECACARCGMEA 274


>gi|366998786|ref|XP_003684129.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
 gi|357522425|emb|CCE61695.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
          Length = 523

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 63/310 (20%), Positives = 123/310 (39%), Gaps = 48/310 (15%)

Query: 11  AEIEGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI 67
            ++  +GRGL + ++ K  +++ ++ PI++      L  I +   C  C R     S   
Sbjct: 136 KDVLSKGRGLFAKKNFKKDELIFKEEPIVIVPPIERLNLIEAGKACALCGRLMNEISEQF 195

Query: 68  -------CPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
                  C SC +  +CS +C     +  +  +  ++ T+L  +NS    N   ++  + 
Sbjct: 196 IIKNGLDCESCINTVWCSKQCLKRDFTHKYLKHPFKSSTKL-KVNSKKWKNF--EKCCKE 252

Query: 120 RFLIAAYNLAVV-------NPSQFQILLAFQ--GTVTDNDTSAAHYLSSLCPPPASTTIE 170
              I  Y +  +       + ++F I   F     ++  D    + L++       T   
Sbjct: 253 SVFIEGYIIGHIYAASMIDHKNEFDIKKKFDYLAYISQKDRYQGNNLTNFEKSLEDTNDS 312

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQD----------GR-----RSVRAYGIYLKASFFNHD 215
           ++    ++    +F   +      D          GR     +S + Y IY   S  NH+
Sbjct: 313 ISDEHPSEIWEKSFNYFKEAFPESDIDYEEYLNYIGRYNLNHKSAQLYSIY---SLVNHN 369

Query: 216 CLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
           C PN     DY      +  ++ +     + +  E+  +Y    +D   R+K LL +Y F
Sbjct: 370 CEPNVRVEVDY------HTKELKLYARKSISKNSELLTTYINPLHDVELRRKILLINYGF 423

Query: 275 ACDCDRCKVE 284
           AC+C+RC+ E
Sbjct: 424 ACNCERCENE 433


>gi|302759442|ref|XP_002963144.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
 gi|300170005|gb|EFJ36607.1| hypothetical protein SELMODRAFT_404768 [Selaginella moellendorffii]
          Length = 664

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 8/82 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYS 262
           G++      NH C+PNA +    D        + VR   D+  G E+ LSY  P    Y+
Sbjct: 447 GLWASVGLINHSCIPNASKMSIGDV-------VFVRAAKDLKAGDEVLLSYLEPPLAPYA 499

Query: 263 TRQKRLLYDYDFACDCDRCKVE 284
             +++++  Y+F C C+RCK+E
Sbjct: 500 GYREKMIQQYNFECSCERCKLE 521


>gi|255713868|ref|XP_002553216.1| KLTH0D11638p [Lachancea thermotolerans]
 gi|238934596|emb|CAR22778.1| KLTH0D11638p [Lachancea thermotolerans CBS 6340]
          Length = 353

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 19/154 (12%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYI 191
           S+F +LL  Q  V          L  L PP   +  ++     +L  +  N+FG+ +   
Sbjct: 212 SRFPVLLDLQKRVFQT-------LYILLPPFLKSGFDIPTFRHILGSEYGNSFGIWQNE- 263

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           E  D R  +  Y +  +ASFFNH C PN  +  +          +I  +  DV + +E+C
Sbjct: 264 EASDSREYL-GYWVLPEASFFNHSCAPNLIKHRF-------GNKMIFTLNSDVTKDQELC 315

Query: 252 LSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
           + Y  + +     R+  L  ++ F+C+C RCK+E
Sbjct: 316 IDYKDILDLKVDERRHILKENWFFSCECPRCKIE 349


>gi|67524199|ref|XP_660161.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
 gi|40745506|gb|EAA64662.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
 gi|259487990|tpe|CBF87094.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 638

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 60/122 (49%), Gaps = 13/122 (10%)

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQD---GRRSVRAYGI--YLKASFFNHDCLPNACRFD 224
           E   +L+ K+  N+FGL        D   G++    YG+  Y +A+  NH C+PN     
Sbjct: 519 EELLSLICKEESNSFGLYHGATGPPDSLQGQQRGPQYGLACYPRATLCNHSCVPN----- 573

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYF--PVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +    +  + +++    D+  G E C+SYF   V+ D + R+KR    + F+C C+RC 
Sbjct: 574 -LKHGPDEQSRMVLTATRDIAAGEECCISYFDLTVHVDLNARRKRTRELFTFSCTCERCL 632

Query: 283 VE 284
            E
Sbjct: 633 RE 634


>gi|367032576|ref|XP_003665571.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
           42464]
 gi|347012842|gb|AEO60326.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
           42464]
          Length = 607

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 8/80 (10%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           ++++A++FNH C+PN  R    D        I+VR + D+  G E+  SY  V  DY  R
Sbjct: 460 LWIRAAYFNHSCVPNTEREFIGDL-------IVVRALRDIAAGEELVQSY-DVTGDYEGR 511

Query: 265 QKRLLYDYDFACDCDRCKVE 284
           ++ L+  + F C+C  C+ E
Sbjct: 512 REALMTTWGFECNCALCEAE 531


>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 976

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 39/82 (47%), Gaps = 8/82 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++  ASF NH C+PNA R    D        +IV    D+  G EI  +YF V      
Sbjct: 763 GLWTLASFINHSCIPNARRLHVGDY-------VIVHASRDIKAGEEITFAYFDVLSPLEK 815

Query: 264 RQKRLLYDYDFACDCDRCKVEA 285
           R K +   + F C C RCK E+
Sbjct: 816 R-KEMAESWGFRCGCSRCKFES 836


>gi|151942634|gb|EDN60980.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 373

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 71/156 (45%), Gaps = 23/156 (14%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKVE 284
           C+ Y  V  D  T ++R  L  ++ F C C+RCK E
Sbjct: 334 CIDYSGV-LDLPTVKRRAFLADNWFFVCACERCKSE 368


>gi|320168918|gb|EFW45817.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 644

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G +  A+FFNH C PN   F  +    +  T   V ++ DV  G E+ LSY P    Y  
Sbjct: 333 GFFPLAAFFNHGCRPNCSGFVEI---TDQGTFFSVVLLEDVYLGDELVLSYLPPELLYKP 389

Query: 264 RQKR---LLYDYDFACDCDRC 281
           R++R   LL ++ F C C RC
Sbjct: 390 REQRRAHLLQNFHFECGCSRC 410


>gi|302796834|ref|XP_002980178.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
 gi|300151794|gb|EFJ18438.1| hypothetical protein SELMODRAFT_419787 [Selaginella moellendorffii]
          Length = 662

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 8/82 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYS 262
           G++      NH C+PNA +    D        + VR   D+  G E+ LSY  P    Y+
Sbjct: 445 GLWASVGLINHSCIPNASKMSIGDV-------VFVRAAKDLKAGDEVLLSYLEPPLAPYA 497

Query: 263 TRQKRLLYDYDFACDCDRCKVE 284
             +++++  Y+F C C+RCK+E
Sbjct: 498 GYREKMIQQYNFECSCERCKLE 519


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P  L   +   +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVVKLMDGKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
           B]
          Length = 410

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 55/116 (47%), Gaps = 13/116 (11%)

Query: 168 TIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
           TIE     L  D  N+FG+ E P +E  +       + +Y + SFFNHDC PNA +    
Sbjct: 291 TIENVRTALGVDPGNSFGIWEVPIMEESE----CLGFAVYPRLSFFNHDCAPNARK---- 342

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRC 281
                +   +      +V +G E+C+SY  V+   +  R+K L   + F C C RC
Sbjct: 343 ---ERDGRALRFVTTREVAEGDELCISYGHVDGMAWRERRKELSDGWFFDCACGRC 395


>gi|350424241|ref|XP_003493732.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 498

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/282 (25%), Positives = 111/282 (39%), Gaps = 55/282 (19%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSICP---S 70
           GRGL +T+ +K  +++  DSP+L+   Y    F      C +C++    +  ++ P    
Sbjct: 42  GRGLFATRDIKQNELIFIDSPLLVGPKYIGKYF----EMCVSCYK----NECALFPCDRG 93

Query: 71  CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-AANLPLDRQLQARFLIAAYNLA 129
           C +  CS +C     S  H  Y C+ L  L+     D + NL L         +      
Sbjct: 94  CGLPVCSTQCEN---SPKHVNYECEYLKSLVPTCGTDWSPNLLLA--------VVPIRAL 142

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP----PPASTTIELTAALLAKDRLNAFG 185
            +   Q + L   Q   +D + +    +  L       P+   +EL   +      N+F 
Sbjct: 143 FLTEQQRKCLATLQ---SDKNLTCYPEIEQLKKNVTNSPSEEDMELMKHMCRILNTNSF- 198

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA-CRFD-----YVDAAAENNTDIIVR 239
                I   D   SV   G+Y  ASF NH C+PN    FD     YV AA          
Sbjct: 199 ---ETIMVHDKEHSVSLRGLYSIASFQNHCCVPNTRHHFDGEFRMYVSAAL--------- 246

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
               +  G EI  +Y  + +D + R+K L     F+C C RC
Sbjct: 247 ---PIAAGEEITSTYTSLFWDTTLRRKFLSITKHFSCMCKRC 285


>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
 gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
          Length = 400

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 111/282 (39%), Gaps = 42/282 (14%)

Query: 15  GRGRGL----VSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS 70
           G+GRG+      +  ++ G +++++ P   Y+        + CH C ++  +S S  C +
Sbjct: 13  GQGRGIRCNKKGSSGIEPGTLIVKEEP-YSYTLTDGELLRTRCHYCLKRLENSVS--CDA 69

Query: 71  CSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY--- 126
           C  A +C+ +C  AA    H+P  C+  +RL+NL        P   ++  R L   +   
Sbjct: 70  CRTAKYCNEECKKAA-KFHHTPE-CRGYSRLMNL--------PEHLRVMGRILYKMHARK 119

Query: 127 -NLAVVNP-----SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR 180
            ++  + P     S  + L   +  +T  D S    LS      A         +  K  
Sbjct: 120 TDMGALGPLSSLVSNVETLKNCEEGITSLD-SKMECLSQHMEKDALPDRAFMEEIYGKIA 178

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N+F ++       D        G+Y +AS  NH C  N     Y          I +R 
Sbjct: 179 SNSFAIL-------DENMCSIGIGVYPQASMINHSCKSNCIGMFY-------GPQIQIRA 224

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              +  G +I   Y P     + RQ++LL  Y F C C  C+
Sbjct: 225 NEFIRPGEQIFHGYIPPLLPTAKRQEKLLKTYHFLCQCADCR 266


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/291 (23%), Positives = 117/291 (40%), Gaps = 46/291 (15%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCFRKTMHSS 64
           ++V+ +  +GR L +T+ ++ G+++L   P   Y  +P  N++S    C  CF+    ++
Sbjct: 13  LSVSNLPHKGRSLFTTRDIRPGEVILSQKP---YICVP--NNTSLESRCDGCFKT---NN 64

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQ 116
              C  C V  +C   C  +        + C+ALTRL        +P       L + R 
Sbjct: 65  LKKCSGCQVVWYCGSFCQKSEWKLHR--HECKALTRLEKEKRKFVTPTIRLMVKLYIKRN 122

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY------LSSLCPPPASTTIE 170
           LQ   +     L + +   + ++ A    +++ D            L +L     S  + 
Sbjct: 123 LQNEKV-----LPITSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLR 177

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
             A   +K   NA  + +  +  Q         G++   S  NH C PNA    + +  A
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQ-------GIGLFPLVSIINHSCSPNAV-LVFEEQMA 229

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 +VR + ++ +  EI +SY        TRQK L   Y F C C RC
Sbjct: 230 ------VVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARC 274


>gi|429859827|gb|ELA34590.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 648

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 41/85 (48%), Gaps = 5/85 (5%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
             R  GI+L+AS  NH C  NA      D        + V  + D+  G EI +SY   +
Sbjct: 424 GAREGGIFLEASRINHSCRHNAQNTWNAD-----RGQLTVHALRDIDDGEEITISYLGAS 478

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKV 283
            +YS+RQ RL   + F C C+ C +
Sbjct: 479 ENYSSRQSRLKESFGFTCACELCTL 503


>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
 gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
          Length = 188

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 83  GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGEEICISYLDECQLER 136

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 137 SRHSRHKVLRENYIFVCQCLKCQAQAS 163


>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 278

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 7/109 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            +Y K S FNHDC PN     +   +   +T + VR   D+P G E+ +SY  V    + 
Sbjct: 69  AVYPKVSRFNHDCRPNV----HYRLSGLIHTTVAVR---DIPAGTELTISYIYVRKSRAE 121

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVAS 312
           RQK L   ++F C C++C   A      +      + ++ED + +++ S
Sbjct: 122 RQKELSESWNFTCTCEQCSKSAEEIAASDARMRRIKALEEDIERKVIES 170


>gi|189240077|ref|XP_971167.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 637

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 124/320 (38%), Gaps = 65/320 (20%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF---CHNCF 57
            S  S  +T+      GR +VS   +  G+I++ +      +A+  +N   F   C +CF
Sbjct: 225 FSHASSKITLKTSPDVGRYVVSNVDIATGEILVAEP-----AAVACLNPEKFGTHCQHCF 279

Query: 58  RKTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDAANLP 112
            + +      CP CS VAFCSPKC   A ++ H  Y C+    LL    ++ S  A  + 
Sbjct: 280 ARLLAPVG--CPHCSSVAFCSPKCRDDAITTYHK-YECKFFDLLLGSGMSVLSLMALRII 336

Query: 113 LDRQLQARFLI-------AAYNLAVVNPSQFQ--------ILLAF-------QGTVTDND 150
             + L     I       A YNL   N S+ Q        ++ AF        G   +N 
Sbjct: 337 TQQSLTQTLEIYDKKNTNALYNLC-TNESKRQNSDFLQRSLMAAFLLRCLQKSGYFGENG 395

Query: 151 TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA-------Y 203
           T            P  T  ++   LL   ++  F   E Y        S+++        
Sbjct: 396 TVV----------PTQTEHKVGEMLLHYLQILQFNAHEIYETLYSEDHSLKSAKMINIGV 445

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDY 261
            +Y   + FNH+C P+  R+ +V         I++  I  +     I  +Y P+      
Sbjct: 446 AVYPTVALFNHECYPSVTRY-FVGKT------IVIASIRPLTPNTPISENYGPIFTRIKL 498

Query: 262 STRQKRLLYDYDFACDCDRC 281
           + RQ+ LL  Y F C C  C
Sbjct: 499 AERQRTLLGRYWFNCQCQAC 518


>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 643

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 12/103 (11%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFG  + ++ G     S  + G++++A++ NH C+PNA R    D        +++R +
Sbjct: 479 NAFGADDSHLGGV----SKPSTGLWVRAAYINHSCVPNAKREFVGDL-------MVIRSL 527

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             + +G EI  SY   + DY  RQ  L+  + F C C  C  E
Sbjct: 528 RKIKKGEEIFHSY-DESGDYEARQAALMTTWGFECGCALCAAE 569


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/301 (22%), Positives = 110/301 (36%), Gaps = 53/301 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVVPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------------PLDRQLQA 119
           VA +CS KC   A         C+ L        PD+  L            P + +   
Sbjct: 67  VAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRLLGRVVFKLMEEIPSESEKLY 124

Query: 120 RFLIAAYNLAVVNPSQ----FQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
            F     N+  +   +     Q+ + FQ  + +    A+        PP+    E  A +
Sbjct: 125 TFYDLESNINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDIFEAFAKV 178

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           +     N+F +    ++           G+Y   S  NH C PN        +   N   
Sbjct: 179 IC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPH 220

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
           +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    DE
Sbjct: 221 LLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCETQDKDADMLTGDE 280

Query: 296 N 296
            
Sbjct: 281 Q 281


>gi|310798972|gb|EFQ33865.1| tetratricopeptide [Glomerella graminicola M1.001]
          Length = 657

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 11/103 (10%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFG    Y  G++G R+  + G+++ A++ NH C+ NA + +YV         +++R +
Sbjct: 489 NAFGPGGQY--GEEGARNA-STGLWVWAAYINHSCVANAKK-EYVGDL------MVLRAL 538

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             + +G EI  SY   + DY TRQ+ L+  + F C C  C  E
Sbjct: 539 RPIKEGEEIFHSY-DESADYETRQRALMTTWGFECSCALCTAE 580


>gi|168035501|ref|XP_001770248.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678465|gb|EDQ64923.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 691

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 162 PPPASTTIEL--TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY--GIYLKASFFNHDCL 217
           P PA+T       A L    + NAFG  +  +  ++ R   R++  G+++  SF NH C 
Sbjct: 379 PVPATTAFPQNEIARLCGIVQCNAFGHTQ--VTRKNARDITRSWSCGLWMLPSFMNHSCT 436

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACD 277
           P+      V      N  IIV    D+  G E+ ++YF +      R+  +L+ ++F C 
Sbjct: 437 PS------VATVVIGNAMIIV-AARDLKCGDELTVAYFDIFRPLQERRASMLHSWNFMCS 489

Query: 278 CDRCKVEANWSD 289
           C RC +EA   D
Sbjct: 490 CPRCALEARMED 501


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C  L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
          Length = 386

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 14/105 (13%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y+  S  NH C PNA   F Y      +N  ++++ I D+  G EIC SY     +  
Sbjct: 289 GLYILQSSVNHSCTPNAIAEFPY------SNNTLVLKAIRDIQVGEEICTSYLDECELER 342

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
              +RQK L   Y F C CD+C+ +     ND +  ++EE+ D+D
Sbjct: 343 SRYSRQKALSSLYLFVCHCDKCQTQI----NDPNVTSDEELDDDD 383


>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 676

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 84/210 (40%), Gaps = 64/210 (30%)

Query: 89  HSPYVCQALTRLLNLN--------SPDAAN-LPLDRQLQARFLIAAYNLAVVNPSQFQIL 139
           H  Y+   L RLL  N        +PD+A  +P D +L    LI  +NL ++   QF   
Sbjct: 402 HRTYIATWLLRLLKRNPYFPEWVKTPDSAEAIPSDGELYIGSLIL-HNLMLI---QFN-- 455

Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
                         AH +S L  P  S        +LAK +         +I G      
Sbjct: 456 --------------AHEISELAVPKGSN-------ILAKAK-------SKFIGG------ 481

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-- 257
               G+Y   S FNH C P   R+          T ++VR I  +P G EI  +Y P+  
Sbjct: 482 ----GVYSTVSLFNHSCNPGIIRYFI-------GTTMVVRAIRTIPAGEEISENYGPIFT 530

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
               + R+++L   Y F C+C+ C   A+W
Sbjct: 531 TTPEAERKRKLRVQYWFDCNCEACT--AHW 558


>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 333

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 118/295 (40%), Gaps = 60/295 (20%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI-----NSSSFCHNCF 57
           + S ++ V    G+GR +V+ +++K G+       + +  A  F+       ++ CHNC 
Sbjct: 49  SASASIEVKHAPGKGRYVVANRNIKKGE------TLFVEDAFTFVLVNNNKDNTHCHNCC 102

Query: 58  RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL 117
           +  +             FC+ KC   A  S H         R L L    A+ + L RQ+
Sbjct: 103 KSYLDVPVPCTECVDTLFCNMKCWDEACLSYH---------RWLCL----ASQMGLLRQI 149

Query: 118 QARFLIAAYNLAVV-----NPSQF---QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
               L  A  L  V     N  +F   Q L+     +T  D  A  Y   +C   A T I
Sbjct: 150 GITHL--ALKLLCVCATTTNNEKFNDVQQLVTNFNKLTPED--AFSYGILVCNGHAITKI 205

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
            +TA     D  N   L+E  I          A  IY  AS  NH C PN     ++D  
Sbjct: 206 NVTA----DDHENKL-LIEEQIR--------IATAIYPSASMMNHSCDPNIIN-SFLDQT 251

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLYDYDFACDCDRC 281
                 +IV+ I D+ +G EI L  + V++     + RQ+ L   Y F C+C  C
Sbjct: 252 ------LIVKAIQDIKEGEEI-LHCYGVDFRRMPKAERQESLESQYYFTCNCKAC 299


>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 481

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 50/110 (45%), Gaps = 14/110 (12%)

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           +NAF L  P+ +            I   A+  NH C PNA       A   +   I VR 
Sbjct: 177 INAFTLTTPFYDHVGA-------AIEPLAALCNHSCSPNA-------ATDFDKGKIWVRA 222

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
           + D+ +G ++ +SY      Y+ RQ  LL  Y F C C++C++E N  D 
Sbjct: 223 LRDIGKGEQVFVSYIETTDPYAHRQSELLKRYYFNCKCNKCEIEKNAPDT 272


>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
           Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
           Methionine
          Length = 429

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 118/310 (38%), Gaps = 54/310 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA- 74
           RG GL +   L+ G+++ R  P L Y+     +    C  C           C  C VA 
Sbjct: 15  RGNGLRAVTPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCL--LGKEKLXRCSQCRVAK 70

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLIAA 125
           +CS KC   A         C+ L        PD+  L L R +         ++  L + 
Sbjct: 71  YCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLXDGAPSESEKLYSF 127

Query: 126 YNL-AVVNP-------SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
           Y+L + +N           Q++  FQ    +    A+        PPA    E  A ++ 
Sbjct: 128 YDLESNINKLTEDRKEGLRQLVXTFQHFXREEIQDASQL------PPAFDLFEAFAKVIC 181

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
               N+F +     E Q+        G+Y   S  NH C PN        +   N   ++
Sbjct: 182 ----NSFTICNA--EXQEV-----GVGLYPSISLLNHSCDPNC-------SIVFNGPHLL 223

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-EN 296
           +R + D+  G E+ + Y         R+K+L   Y F CDC RC+ +    D D D    
Sbjct: 224 LRAVRDIEVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRCQTQ----DKDADXLTG 279

Query: 297 NEEVMDEDQD 306
           +E+V  E Q+
Sbjct: 280 DEQVWKEVQE 289


>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
 gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
          Length = 497

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/293 (23%), Positives = 111/293 (37%), Gaps = 35/293 (11%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           +SE   V +    GR  VS + LKAG ++L + P   ++  P +NSS  C  C      S
Sbjct: 1   MSEPYKVLDNPELGRYAVSAKDLKAGDVILEEIP---FAIGPKVNSSPLCLECCCPVDGS 57

Query: 64  SSS-ICPSCSVAFCSPKCSTAAASSSHSPYVC----QALTRLLNLNSPDAANLPLDRQLQ 118
           ++   C  C    C  +C+    +  +    C    +   +  N+    A+ + LD    
Sbjct: 58  ANGPKCSDCGWPMCE-ECNEDKDNVVYHKKECALFVKGKAKFQNVEDSTASCMQLDCITP 116

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ---------GTVTDNDT-SAAHYLSSLCPPPASTT 168
            R L+A  +    +P ++   L+           GT    D  +   YL   C      +
Sbjct: 117 LRMLLAKED----HPERWDAELSVMEYHNDARKDGTTWHQDQQNIVGYLRGPCGLKDRFS 172

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
            E+   ++    +NAF +           +   A G+Y K     H C+PN         
Sbjct: 173 EEVIQQVIGILDINAFEVRT--------SKGYSARGLYPKLGVMAHSCVPNVVH----SI 220

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              +   +I R+  D+P+G ++   Y       STRQ  L     F C C RC
Sbjct: 221 HPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAALKSSKYFTCRCPRC 273


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C  L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/303 (23%), Positives = 115/303 (37%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C  L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHRRE--CSCLKSCKPRYPPDSVRL-LGRVIVKLMDEKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+ +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
 gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
          Length = 383

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 82/366 (22%), Positives = 131/366 (35%), Gaps = 100/366 (27%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF----CHNCFR------------- 58
           +G+GL S +  K+G I+  + P++   +  F  ++++    C +C R             
Sbjct: 12  KGKGLFSKKCFKSGAIIFEEEPLV---SCQFSWNAAYQYKACDHCLRPLETAEENARRLT 68

Query: 59  -------------KTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTR-----L 100
                         T  S  + C  C   +CS +C T A +  H     Q   R     L
Sbjct: 69  GKPELELPFPECCTTDKSKFTSCTLCGTDYCSVECQTTAYNQYHRILCLQTTERNNFHPL 128

Query: 101 LNLNS--------PDAANLPLDRQLQARFLIAA-YNLAVVNPSQF---------QILLAF 142
             LN         P+   + L  +L AR + +   +LA+    QF         ++    
Sbjct: 129 EQLNEAWKHVHYPPETNTIMLIVRLLARIIQSPNRDLAIEQTLQFCHRTVNEDAELAHKL 188

Query: 143 QGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----LLAKDRLNAFGLMEPYIEG----- 193
            G    N  S  H L   C P       LT A    LLA    N  G+    I       
Sbjct: 189 LGEKFANQLSLLHSLLLQCVPHEGIEQFLTPAGFQGLLALIGTNGQGVGTSTISQWVTRT 248

Query: 194 -----QDGRRSV----------------------RAYGIYLKASFFNHDCLPNACRFDYV 226
                 D  R+V                          ++   S  NH C+PNA      
Sbjct: 249 SDLAISDDERAVLDKFIDKLYEDMDSHSGNFLNNEGVALFTLQSACNHSCVPNA-----E 303

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKV 283
                NN  + +  + D+ +G EIC+SY     ++    +R+K L+ +Y FAC+C +C+ 
Sbjct: 304 PTYLHNNNKLSLVAVRDIQEGEEICISYLDECNLHRSRHSRRKELMENYLFACNCPKCEE 363

Query: 284 EANWSD 289
           +A+  D
Sbjct: 364 QADEPD 369


>gi|115402861|ref|XP_001217507.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114189353|gb|EAU31053.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 349

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/311 (22%), Positives = 122/311 (39%), Gaps = 51/311 (16%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSAL-PFINSSSFCHNCFRKTMHSSSSICPSC- 71
           + RGR    + S++AG +V  D+P  L   + P    S  C N   +        C +C 
Sbjct: 32  QHRGRYSRVSASVRAGTVVFADAPYALIPTVDPTSKGSLICSNLMCRRQVKWDLECVTCP 91

Query: 72  -----SVAFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAA 125
                 V +C+  C       +   + C  L +  + L   +         L  R L+AA
Sbjct: 92  NDCIRDVVWCNSACRIQ--DQARHDFECSWLKKHGVTLRQREGEYDFCMLWLVVR-LVAA 148

Query: 126 YNLAV-VNPSQ-------------FQILLAFQ--------GTVTDNDTSAAHYLSSLCPP 163
             L +  +P+              +Q +  F          T+   +     YL +    
Sbjct: 149 RRLEMEYDPTSHERYGWEDRFKRGWQAIEEFSTNRHLWPDATIQHWEYLIQTYLRNFPGF 208

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEG------QDGRRSVRAYGIYLKASFFNHDCL 217
           P  T +     L+ ++ +N+FGL  P I G      Q  R      G Y +A+  NH C+
Sbjct: 209 PGPTEL---LGLICREEINSFGLY-PGITGTLPPGQQRKRGQQYGLGCYPRATMLNHSCV 264

Query: 218 PNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRLLYDYDFA 275
           PN      ++ A+++   +++    D+   +E  +SYF +  + D   RQ+     + F+
Sbjct: 265 PN------LNRASDDRGRMVITANQDIAADKECTISYFDLVEHADLEDRQRLTHEMFLFS 318

Query: 276 CDCDRCKVEAN 286
           C C RC VEA+
Sbjct: 319 CTCQRCLVEAD 329


>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
          Length = 359

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 47/94 (50%), Gaps = 4/94 (4%)

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           E Q   +S R  G+Y     FNH C PN  +  Y ++  E  + + +R   ++  G E+C
Sbjct: 247 ELQHLLQSSRGQGVYTVGCLFNHSCEPNL-QVVYSESGDETLSVVALR---NIELGEELC 302

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           +SY   +  Y  RQ+ L   Y F C C +C+ EA
Sbjct: 303 ISYIDESLSYPERQQELYEHYLFVCQCPKCEREA 336


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 69/289 (23%), Positives = 110/289 (38%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA I G+GRGL++T++   G ++L   P   Y++ P  I   S C +CF        S
Sbjct: 17  LAVASIPGKGRGLIATRTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73

Query: 67  ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
           +C      +CS  C             C+A+        ++L         L L R+LQ 
Sbjct: 74  MCQV--TWYCSTNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLVLKRKLQN 129

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELTA 173
             +I + ++       + ++ A +  +   D       +    L SL  P     ++  A
Sbjct: 130 EKVIPSSSI-----DNYNLVDALESHIWKVDENHLVLYAQMANLVSLILPLIELDLKEIA 184

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAEN 232
              +K   NA  + +P +            G+Y   S  NH C+PNA   FD   A    
Sbjct: 185 HTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA---- 233

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                VR +  + +  E+ +SY         R    L  Y F C C RC
Sbjct: 234 ----YVRALQPIGKNEEVSISYIETAAVTKKRHND-LKQYFFTCSCPRC 277


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 110/281 (39%), Gaps = 26/281 (9%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E V + + EG+GRGL +T+   AG ++  +     YSA+ F N S   CH+CF++     
Sbjct: 2   ENVEIFDSEGKGRGLRATRESWAGDVIFAEPA---YSAVVFDNLSHCVCHSCFKR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAA-ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
              C  C  A +C   C   +  +  H     +   +  N N   AA +    + +   L
Sbjct: 57  LLRCGQCKFAHYCDRTCQKESWVNHKHECLAIKKAGKAPNENIRLAARILWRIEREGGGL 116

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
                L  ++  Q  +     G  +        ++           ++  + + +    N
Sbjct: 117 TEGC-LVSIDDLQNHLDKFGDGEKSSLMEDVQKFMDFWPSQSQQFGMQYISHIFSVISCN 175

Query: 183 AFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            F L +        +R ++A G  I+      NHDC PN C   +      NN  I +R 
Sbjct: 176 GFTLSD--------QRGLQAVGVAIFPNLCLTNHDCWPN-CTVIF------NNGKIELRA 220

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +  + +G E+ +SY         R ++L   Y F C C+ C
Sbjct: 221 LGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHC 261


>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
           2509]
          Length = 547

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 107/287 (37%), Gaps = 49/287 (17%)

Query: 16  RGRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSS 65
           RGR L ST+S  AG+ I + D+P+L   ALP   N  + C+ C        F   + +  
Sbjct: 22  RGRSLFSTESFGAGETIAIIDNPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVEAGP 78

Query: 66  SICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
             C +C+      +C+ +C  A     H    C+   R+      D    P         
Sbjct: 79  RTCKACTGCKAAVYCNAECQRAHWKLVHKAE-CKMFKRIKERTGKDWLPTPA-------- 129

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
                     N   F+      G      T A  Y   L    +  T++    +L K + 
Sbjct: 130 -FGPGGTLESNVEGFKTDEGLWGDFELQATGAVVYAGLL---QSDETLKQAMEVLCKIQT 185

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVR 239
           NAF       +   G+      GIYL  S    NH C+PNA    Y+          +++
Sbjct: 186 NAFN----RFDADTGQA-----GIYLHPSLSMVNHSCVPNA----YITF---EKRKAVLK 229

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
              D+  G EI +SY         RQ+ L   Y F C+C RCK + N
Sbjct: 230 AERDLEPGDEILISYIDHTMPRRARQESLRL-YHFQCNCIRCKDDLN 275


>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
           70294]
 gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 499

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 70/320 (21%), Positives = 131/320 (40%), Gaps = 56/320 (17%)

Query: 16  RGRGLVSTQSLKAGQIVLRDS-PILLYSA---LPFINSSSFCHNCFRKTMHSSSSI---- 67
           +GRGL + +      ++ ++S PI++  +   L  +     C  C     H S       
Sbjct: 124 KGRGLYALKDFVKDDLIFKESYPIVVVPSMERLTLMKMGKACSLCGGLLSHLSKHFIVVH 183

Query: 68  ---CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDA-ANLPLDRQLQARFLI 123
              C +C   +CS +C       +   ++ ++      +N  D+ A    ++       +
Sbjct: 184 SLDCDNCGGVWCSNECKNIDIGHNTLKHLHKS-----KINGIDSVAWGKFEKYCNDNVFV 238

Query: 124 AAYNLAVVNP-------------SQFQILLAFQGTVTDNDTSAAHY----------LSSL 160
           AAY++ V+                +F +L +    +  N++ + +           +S+ 
Sbjct: 239 AAYSIGVIMALALIDKKNTDKIRKKFNLLSSVSQGIRINESDSTNIGGTFDASSGAMSNK 298

Query: 161 CPPPA-STTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDC 216
            P P   T+ EL   A+   D L+    ME Y+    GR ++      ++   S  NH+C
Sbjct: 299 DPEPIWKTSYELLKNAIPTTDELD----METYL-SYIGRYNINQISDQMFFLPSLINHNC 353

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
            PN  RF+ V     +N +I V    ++  G+E+  +Y    +    R++ L  +Y F C
Sbjct: 354 EPNV-RFEVV-----SNKEIRVYARKNISAGQELLTNYINPLHGVKLRRRELRVNYGFLC 407

Query: 277 DCDRCKVEANWSDNDNDDEN 296
            CDRC  E    +ND ++EN
Sbjct: 408 HCDRCIKEIK-RNNDVENEN 426


>gi|448518203|ref|XP_003867935.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis Co 90-125]
 gi|380352274|emb|CCG22499.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis]
          Length = 372

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 8/117 (6%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T +   +L+ K+  NAFG+       +D  +    + +Y  ASFFNH C PN  +     
Sbjct: 260 TTDTVRSLIGKNLSNAFGIWSN-TTNKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRM-- 316

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
                  D+    + D+ +G E+C++Y     +   +++  L ++ F C C +C +E
Sbjct: 317 -----RNDMCFETLRDIAEGEELCINYGNFQNEDVNKRQLELQEWFFDCGCTKCHLE 368


>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
 gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
          Length = 625

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 8/93 (8%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           R+  + G+Y   SF NH C PNAC+             + +R   D+  G E+ + YF V
Sbjct: 304 RNANSVGVYPFPSFLNHSCAPNACKLMV-------GHTMFIRAARDLVAGEEVFMKYFDV 356

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
                 R   +   + F C C RCK+EA   D 
Sbjct: 357 TMPKPERSA-VAKRWGFECACPRCKLEAVGEDE 388


>gi|317144906|ref|XP_003189640.1| set and mynd domain containing protein [Aspergillus oryzae RIB40]
          Length = 372

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/319 (22%), Positives = 120/319 (37%), Gaps = 69/319 (21%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN----------SSSFCHNCFRKTMHSSS 65
           +GR L  +Q +K G+++L D P   Y+ +P ++          S+  CH    +T+   S
Sbjct: 44  KGRQLRVSQPVKKGELLLVDLP---YALIPVVDHPEQSDDVRCSNPACHRRVARTVERVS 100

Query: 66  SICP---SCSVAFCSPKCSTA--------------AASSSHSPY------VCQALTRLLN 102
             CP   S  V +C+  C  A               A+S  S +      +   + R+L 
Sbjct: 101 --CPNRCSAEVVWCNSSCRDADNLRHEFECTWLKKYATSIRSKWGEYDFGMLWLIVRILA 158

Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQI----LLAFQGTVTDNDTSAAHYLS 158
               +  N P   +        +   +   PS F+     + +F G+      S   + +
Sbjct: 159 SRHVEFRNTPSSDKTSG-----SKTGSDSKPSSFETGWNAIRSFCGSQDSWTHSQVRHWT 213

Query: 159 SLCPP--------PASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYG 204
            L           P   T      L+ ++  N+FGL        P  E    R    A G
Sbjct: 214 MLVKKYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAG 273

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +Y  A+  NH C PN      +    ++ + ++     D+  G E C+SYF ++     +
Sbjct: 274 VYPTAALANHSCSPN------IIHKPDDQSRMVFVASKDIATGEECCISYFDLSKKVELK 327

Query: 265 QKR--LLYDYDFACDCDRC 281
            +R  L   + F C CDRC
Sbjct: 328 DRRDHLQGSFRFVCKCDRC 346


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 51/293 (17%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCF--------- 57
           +TVA I G+GRGL++T +   G ++L   P   Y++ P  I   S C +CF         
Sbjct: 17  LTVASIPGKGRGLIATFTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73

Query: 58  --RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR 115
             R T + SS+ C          +C   AA + +   +     RL+         L L R
Sbjct: 74  MCRVTWYCSSN-CQKEEWKLHQLECRAMAALTENRKKMLTPTIRLM-------VRLALKR 125

Query: 116 QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTI 169
           +LQ   +I + ++       + ++ A +  ++  D       +    L SL  P     +
Sbjct: 126 KLQNEKVIPSSSI-----DNYNLVDALESHISKVDENQLVLYAQMANLVSLILPFIELDL 180

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDA 228
           +  A   +K   NA  + +P +            G+Y   S  NH C+PNA   FD   A
Sbjct: 181 KEIAQTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA 233

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                    VR +  + +  E+ +SY       + ++   L  Y F C C RC
Sbjct: 234 --------YVRALQPIDKDEEVSISYIETA-AVTKKRNNDLKQYFFTCTCPRC 277


>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 551

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 52/304 (17%)

Query: 1   MSAVSETVTVAEIEGR---GRGLVSTQSLKAGQIV-LRDSPILLYSALPFINSSSFCHNC 56
           M AV++ + V   +G    GR + +T+  K G ++ L D P ++    P   +  +C++C
Sbjct: 1   MPAVNDGIEVRGSKGGAKGGRSIHATRRFKPGDVIALFDEPAVVLP--PGHRALEYCNHC 58

Query: 57  FRKTMHSSSSICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNL---NSPDAA 109
            ++  H+      +C+    VA+C   C  A  +  H    C+A+  L  +   + PD  
Sbjct: 59  LKRQGHAGVGKLRACTGCKTVAYCGAACQKANWTLIHK-VECKAIQFLHEVKPAHQPDWV 117

Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTD---NDTS----AAHYLSSLCP 162
             P          I A    ++ P   +   A +G V +   ND       AH +   C 
Sbjct: 118 PTP----------IRAGAQVMLRPKLKESFDALEGHVEEWRKNDEMDLQLQAHGVLRCCG 167

Query: 163 PPASTT--IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLP 218
             A T   +E +  +L K + NAF   E Y E           G++L  +    NH C+P
Sbjct: 168 LDAITLKGLEESFQVLCKLQTNAFSRTEEYYE---------TGGVFLDTTLAMINHSCVP 218

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           NA     V       T   +R    +  G EI +SY         RQ  L   Y F C C
Sbjct: 219 NA----MVQFGGRTAT---LRSATFINPGDEIEISYADHTQPKGKRQGELDL-YHFECSC 270

Query: 279 DRCK 282
            +C+
Sbjct: 271 YKCE 274


>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
 gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +YL+ S FNH C+P+A   ++ +      T + VR +  V  G EI +SY P   +   R
Sbjct: 192 VYLQGSAFNHSCVPSA---EFCNVG----TSLTVRSLRRVQAGEEITVSYVPTTMNLKER 244

Query: 265 QKRLLYDYDFACDCDRCKVE 284
           ++ L   Y F C C RC  E
Sbjct: 245 RRCLEGQYKFVCACARCVAE 264


>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
 gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
          Length = 392

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGEEICISYLDECQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYIFVCQCLKCQAQAS 368


>gi|224101385|ref|XP_002312257.1| SET domain protein [Populus trichocarpa]
 gi|222852077|gb|EEE89624.1| SET domain protein [Populus trichocarpa]
          Length = 542

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 56/131 (42%), Gaps = 13/131 (9%)

Query: 159 SLCPPPASTTIELTAALLAKDRLNAF---GLMEPYIEGQDGRRSVRAYGI--YLKASFFN 213
           SL  P A    EL   L     LN      L+E  +  +   R+   YG+  ++ ASF N
Sbjct: 279 SLFRPEAEEIGELNEKLDKVKILNVLDVNSLVEDSVSAKVLGRNSDYYGVGLWVLASFIN 338

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD 273
           H C PNA R    D        ++V    DV  G EI  +YF V    S R + +   + 
Sbjct: 339 HSCNPNARRLHVGD-------HVLVHASRDVKAGEEITFAYFDVLSPLSKRNE-MSKTWG 390

Query: 274 FACDCDRCKVE 284
           F C C RCK E
Sbjct: 391 FHCSCKRCKFE 401


>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
 gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
          Length = 311

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 205 IYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           +Y  AS FNH C PNA   ++  DAA      I +R +  V  G E  ++Y   N   ++
Sbjct: 235 VYFTASLFNHSCAPNAHVSWENGDAA------ITIRTLRPVRAGEEFNITYVDANERSAS 288

Query: 264 RQKRLLYDYDFACDCDRC 281
           R+ RL   Y F C C+RC
Sbjct: 289 RRARLKEWYGFDCACERC 306


>gi|301089299|ref|XP_002894963.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104481|gb|EEY62533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 457

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 2/80 (2%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYS 262
           G++L  + FNHDC PN  +F   DA  E      VR+   + +G ++ +SY +P      
Sbjct: 205 GVFLYNAMFNHDCNPNCVKFTPEDAGPEGGVS-EVRVARPIAKGEQLTISYLYPREQSRG 263

Query: 263 TRQKRLLYDYDFACDCDRCK 282
            RQK L   + F C C+ CK
Sbjct: 264 NRQKNLSEQFGFECCCELCK 283


>gi|330798181|ref|XP_003287133.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
 gi|325082849|gb|EGC36318.1| hypothetical protein DICPUDRAFT_77998 [Dictyostelium purpureum]
          Length = 510

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 40/89 (44%), Gaps = 11/89 (12%)

Query: 204 GIYLKASFFNHDCLPNACRF-----------DYVDAAAENNTDIIVRMIHDVPQGREICL 252
            +YLK S FNH C PN                Y    ++N   I +  + DVPQ  E+ +
Sbjct: 363 AVYLKLSLFNHGCFPNCTTLLEYNINHNDYSYYQGFESDNRLSISIITLRDVPQNSELLI 422

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +Y P+N     R   L  ++ F CDC RC
Sbjct: 423 TYIPLNQKGHERYHNLKSNWMFPCDCLRC 451


>gi|451995963|gb|EMD88430.1| hypothetical protein COCHEDRAFT_1205676 [Cochliobolus
           heterostrophus C5]
          Length = 388

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 84/212 (39%), Gaps = 37/212 (17%)

Query: 84  AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQ 143
           A S  H   V +AL   +   +PD  +          F ++       +P Q++ L A  
Sbjct: 199 AISKQHKKAVSKALQGRI---TPDVMS----------FCVSGILWRHHHPEQWERLEALA 245

Query: 144 GTVT------DNDTSAAHYLSSLCPPPAS----TTIELTAALLAKDRLNAFGLMEPYIEG 193
              T      D D     YL  L   P +     T E    L  +D  N+FG+     EG
Sbjct: 246 NDNTPYHSSDDLDAFTRAYLHLLVILPVALLPLVTAETLLTLSKRDSHNSFGIRSLEDEG 305

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            +       YG +  AS+FNH C PN      ++   E       R   ++ +G+E+C++
Sbjct: 306 SE----FFGYGCWPAASYFNHSCGPN------IEKKREGRV-WYFRTGREIEKGQELCIT 354

Query: 254 YFPVNYDYSTRQKRLL---YDYDFACDCDRCK 282
           Y        +R +R+L     + F C C+RC+
Sbjct: 355 YLSGEERKLSRGQRMLRLKKTWGFDCGCERCE 386


>gi|366991437|ref|XP_003675484.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
 gi|342301349|emb|CCC69117.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
          Length = 374

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 82/196 (41%), Gaps = 28/196 (14%)

Query: 92  YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
           +VC+ L RL  ++  ++ N      LQ+          V    +F ILL FQ  V     
Sbjct: 199 FVCETLFRLKYMDQ-NSINFKSFYSLQSN--------EVSKMERFPILLHFQILVFKT-- 247

Query: 152 SAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA 209
                L  L P        +T+   +L  +  NAFG+ E   E  D R     Y I+ +A
Sbjct: 248 -----LYILLPSFLRDRFSVTSFRHILGSEYGNAFGIWEVG-ESSDSREYF-GYWIFPRA 300

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
           S+FNH C PN  +             +   +  D+  G E+ + Y  V       +++ L
Sbjct: 301 SYFNHSCDPNLTK-------TRKGRTMFFTLNRDIDVGSELDIDYSGVLSLPVKERRKFL 353

Query: 270 YD-YDFACDCDRCKVE 284
           +D + F C CDRCK+E
Sbjct: 354 HDSWFFDCQCDRCKLE 369


>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
 gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
          Length = 522

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 113/284 (39%), Gaps = 33/284 (11%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSI 67
           +A  EG GRGLV+T+ +   ++V  D P+L+    P +N+    FC +C   ++     +
Sbjct: 64  IATFEGYGRGLVATRDIAVNELVFVDRPVLVG---PRVNNYEVIFCASCC--SILRRLQL 118

Query: 68  CPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C   C +  CS    +  ASS H+     A   L+    P  A       L A   I  +
Sbjct: 119 CTGGCRLPICSRCDYSVGASSPHA-----AECALIQSWQPKDAGRYSKNILYALTSIRGF 173

Query: 127 NLAVVNPSQFQILLAFQG--------TVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
            L+  + S   I+L  +G        T  +      ++ +     PA   +     +L  
Sbjct: 174 LLSERDRS---IVLQMEGHPPRKEMTTEIERLLKDGYFWNLDGEGPAVRYLRQVVNILNT 230

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              NAF       + ++ +  +   G+Y+  +  NH C PN      V    +    + V
Sbjct: 231 ---NAFETSRIVADEENNQHEIILRGLYILGALMNHCCRPN------VRYVFDGELRMRV 281

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                + +G +I  +Y  + +    R   L +  +F C CDRCK
Sbjct: 282 HASRPIKKGEQIMNNYSKILWGSQHRIIHLCFSKNFLCCCDRCK 325


>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 635

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 130/335 (38%), Gaps = 73/335 (21%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKT 60
           A S+ ++++  E  GR LV+T+  K G +V  ++P   ++Y+   +    + CH+C  ++
Sbjct: 239 AASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYY----THCHHCLSRS 294

Query: 61  MHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
            +     C  C VA +CS KC   A   +H    C  +  + NL + D   + +  ++  
Sbjct: 295 YNLIP--CSYCPVAQYCSEKCRKLAWEMAHQ-IECPIMALVGNLLNVDKDKIRMLTKI-I 350

Query: 120 RFLIAA-------------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS------L 160
           RFLI                 LA  NP         +G +      +A  L++      L
Sbjct: 351 RFLIVVTSKGKKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPL 410

Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD----------GRRSVRA-------- 202
               A   I   AA+L   + N F       + +D          G    RA        
Sbjct: 411 IGISAFACISALAAILLATQTNFFCNKYEVDQLKDISNYPKIIFSGSLMFRACVIMSSNC 470

Query: 203 ------------YGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGRE 249
                        G+Y+  S +NH C PN  R F+           +I R +  +  G +
Sbjct: 471 FSVQQEPGIKTGSGLYVTHSLYNHSCAPNTFRHFE--------ELTMITRALRPIYPGDQ 522

Query: 250 ICLSYFPVNYDYST---RQKRLLYDYDFACDCDRC 281
           I  +Y    Y Y T   R+++++ DY F CDC  C
Sbjct: 523 IFTNY-GAAYAYMTKSERREKIIQDYFFECDCIAC 556


>gi|118776986|ref|XP_307207.3| AGAP011523-PA [Anopheles gambiae str. PEST]
 gi|116133074|gb|EAA03232.3| AGAP011523-PA [Anopheles gambiae str. PEST]
          Length = 473

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 61/277 (22%), Positives = 103/277 (37%), Gaps = 32/277 (11%)

Query: 18  RGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCS 77
           R +V+ + LKAG +++   P ++            CH+ F+    S    C  C+   CS
Sbjct: 1   RYIVAARDLKAGDVIIETPPFVVGPCAETEPVCLGCHSAFQPG--SVVYRCEICNWRICS 58

Query: 78  PKCSTAAASSSHSPYVCQALTRLL---NLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
           P C     + SH    C  L       +L +  AA + L    +A   +    L  V+  
Sbjct: 59  PSCQGLVPAGSHRQLECIPLRDKAVHNHLQTSPAAQVKL--MYEAILTLRCMLLKTVDRP 116

Query: 135 QFQILLAFQ----------GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
           Q+  LL                  N       + +       + +EL   +     +NAF
Sbjct: 117 QYDRLLGMDPLNDVRQQIPKLWNRNQKEIVERIRNQWGFAEYSELEL-HTICGIIEVNAF 175

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                    + G+   +A  ++ +A    HDC PN    D     A     + VR++ DV
Sbjct: 176 ---------EVGQEPTKARALFPEAYLLMHDCTPNTGHTD-----APQTHHLTVRVLRDV 221

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             G  + L+Y  +      R++ L  +  F+C+C RC
Sbjct: 222 KAGEPLTLTYAHILQGTLKRRQHLREEKFFSCNCRRC 258


>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 401

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G++   +  NH C PN C+      A    + + +R    V  G E+CLSY P++   
Sbjct: 50  GLGLFPSGAMINHSCSPN-CQ------AWWRGSQLEIRCTKPVATGEELCLSYIPIDQPS 102

Query: 262 STRQKRLLYDYDFACDCDRCKVEANW 287
           + R+ +L + + FAC C RC V   W
Sbjct: 103 TVRRAQLRHSWFFACRCRRC-VSRQW 127


>gi|409077404|gb|EKM77770.1| hypothetical protein AGABI1DRAFT_76762 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 373

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 54/113 (47%), Gaps = 10/113 (8%)

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
           T ALL ++  N FG+ +  +E  +    +  +G+Y   S+FNHDC P   R ++   A E
Sbjct: 269 TRALLGREHGNVFGIWDMALEEMNS--EMLGWGLYSFGSYFNHDCSPT-LRKEHKGRAIE 325

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
                    I D+  G E+C+SY   +     ++K  L ++ F C C  C  E
Sbjct: 326 YYA------IKDIDTGEELCISYVETD-QVVEKRKEALKEWFFDCGCGTCVAE 371


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 81/303 (26%), Positives = 112/303 (36%), Gaps = 61/303 (20%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS------FCH 54
           M+A  E    AE   +G GL +T  + AG+        LLY+A P     S       CH
Sbjct: 1   MAAKFERFASAE---KGNGLRATHEIHAGE--------LLYTAEPLAYCVSNTCAKDLCH 49

Query: 55  NCF--RKTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           +CF  RKT+      C  C VA +C   C   A S       C+ L  L          +
Sbjct: 50  SCFSRRKTLLR----CSQCKVARYCDVTCQKQAWSDHKRE--CKCLRSL-------HPRI 96

Query: 112 PLDR-QLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC--------P 162
           P D  +L AR +    + +  +P     L   +  + D        LS L          
Sbjct: 97  PTDSVRLAARIIFRLLSPSQTSPG-LCSLEEHESHLCDMGEERREGLSQLSSMLQLYIQQ 155

Query: 163 PPASTTIELTA----ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLP 218
              + T EL A    +LLAK   N F +        DG       G+Y   S  NHDC P
Sbjct: 156 EVPNITQELPAVDPLSLLAKLTCNCFTI-------SDGELREIGVGLYPSMSLLNHDCRP 208

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           +           E  T + +R + D+    E+ +SY         R+ +L   Y F C C
Sbjct: 209 SCVML------FEGET-LHLRAVRDMQPAEELTISYIGTLAPTRDRRTQLEEQYHFTCQC 261

Query: 279 DRC 281
            RC
Sbjct: 262 QRC 264


>gi|157869732|ref|XP_001683417.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126482|emb|CAJ04456.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 513

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 7/99 (7%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           ++  G+Y   S FNH C+PNA     VD   E    I+++    +  G  + ++Y P+  
Sbjct: 418 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLAA 472

Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             ++R  ++R L +Y F C C RC  EA  S     D N
Sbjct: 473 GTASRAERQRQLRNYFFTCHCPRCDTEAAASAAMTGDGN 511


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 57/280 (20%), Positives = 105/280 (37%), Gaps = 36/280 (12%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHS 63
           E V + +  G+GRGL +T+   AG ++  +  +  +++ +L    +   CH+CFR+    
Sbjct: 2   ENVAIFDSPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSL----AERICHSCFRR--QE 55

Query: 64  SSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
               C  C  A +C   C  A  +       C A+            N+ +   +Q    
Sbjct: 56  KLQKCSQCKFAHYCDRTCQRAGWAEHKQE--CGAIK---AYGKAPNENIRVVSHMQ---- 106

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN 182
                L  V   +  +    +  + +      ++L          TI+  + +      N
Sbjct: 107 -----LITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISHIFGVINCN 161

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
            F      +  Q G ++V   G++      NH+C PN            N+  I +R + 
Sbjct: 162 GFT-----VSDQRGLQAV-GVGLFPNLCMVNHNCWPNC-------TVILNHGKIELRSLG 208

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            + +G E+ ++Y         R++ L   Y F C CD CK
Sbjct: 209 KIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCK 248


>gi|259149991|emb|CAY86794.1| Set6p [Saccharomyces cerevisiae EC1118]
 gi|323346145|gb|EGA80435.1| Set6p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365762745|gb|EHN04278.1| Set6p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 373

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 30/197 (15%)

Query: 92  YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
           +VC++L  L  +N P          LQ+  L           S+F +LL FQ  V     
Sbjct: 198 FVCESLFNLKYMN-PQCITYRAFNMLQSNELSKI--------SKFPVLLHFQKLVFQT-- 246

Query: 152 SAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDG-RRSVRAYGIYLK 208
                L  L P       +I L   +L  +  NAFGL +   EG+    R    Y ++ +
Sbjct: 247 -----LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ---EGEASDSREYFGYWVFPE 298

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
           AS+FNH C PN  ++            ++  +  D+ +  +IC+ Y  V    + +++  
Sbjct: 299 ASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQICIDYSGVLDLPTVKRRAF 351

Query: 269 LYD-YDFACDCDRCKVE 284
           L D + F C C+RCK E
Sbjct: 352 LADSWFFDCACERCKSE 368


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 114/303 (37%), Gaps = 57/303 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P  L   +   +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVAPLRPGELLFRSDP--LAYTVSKGSRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVVKLMDGKPSESEKL 123

Query: 123 IAAYNLAVVNPSQF---------QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
            + Y+L   N S+          Q+ + FQ  + +    A+        PP+    E  A
Sbjct: 124 YSFYDLES-NISKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------PPSFDLFEAFA 176

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++     N+F +    ++           G+Y   S  NH C PN C   +      N 
Sbjct: 177 KVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NG 218

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
             +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC  +   +D    
Sbjct: 219 PHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCPTQDKDADMLTG 278

Query: 294 DEN 296
           DE 
Sbjct: 279 DEQ 281


>gi|323331171|gb|EGA72589.1| Set6p [Saccharomyces cerevisiae AWRI796]
          Length = 373

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCSPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
          Length = 497

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 49/286 (17%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSFCHNCFRKTMHSSSSIC 68
           + + E  G G+ +T++++ G+++ +D P++L   A P    S  C +C+ +       +C
Sbjct: 30  IKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLS--CVSCYSR---RDLRLC 84

Query: 69  -PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAA 125
              C +  CS KC  +         V Q  ++ ++    +  ++ L R L      L+  
Sbjct: 85  GKKCGLLVCSEKCEKSLGHQKECKVVRQWQSKPIS----EDLSVQLVRVLSPIRSLLLGE 140

Query: 126 YNLAVV--------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
            + +VV        +   F++ +       +       ++  +C    S           
Sbjct: 141 EDKSVVKCLKAHKSDQHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDS----------- 189

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI 236
               NAF ++  +   +D + SV+  G+Y   S  NH C PN C  FD       +   +
Sbjct: 190 ----NAFEVLVGF---EDNQASVK--GLYPLGSLANHSCCPNTCHVFD-------DKQHM 233

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +VR    +PQG EI  SY  + +  S R+  L     F C C RC+
Sbjct: 234 VVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCE 279


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 7/89 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y + + FNHDC PN C   +         ++ VR++ D+    E+ +SY  +      
Sbjct: 200 GLYPEGALFNHDCDPN-CVVSF------KGREMQVRVVRDIEVDEELTVSYVELLQSTKA 252

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDN 292
           R++ L   Y F C+C RCK   N   N++
Sbjct: 253 RRRELKESYFFDCECKRCKAATNGQSNED 281


>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
          Length = 481

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 111/289 (38%), Gaps = 46/289 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
           G GRGL ++  +  G+ VL   P+   + L   +    C  CF +       I    C  
Sbjct: 13  GLGRGLFASTDIAVGEEVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGIALKGCRG 71

Query: 71  CS-VAFCSPKCSTAAASSSHSPY--VCQALT-RLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C  V +C   C        HS    + Q L  R+L +N+     + L  + Q        
Sbjct: 72  CGIVKYCDKTCQAKNWKLGHSLECSIYQNLKPRVLPINARAVLRMVLRSERQK------- 124

Query: 127 NLAVVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTT---IELTAALLAK 178
                +  +    L     + D  D SA+ +    LSS      S T    E+ +A+ AK
Sbjct: 125 ----YSSEELDQFLQLGTHIKDIRDQSASQWERISLSSKAIKAYSGTDMSEEVISAMGAK 180

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDI 236
             LN+F L     +           G+YL   A+ FNH C  NA       A + +  ++
Sbjct: 181 LDLNSFNLTNAVYD---------RLGVYLHPYAAIFNHSCDHNA-------AVSFDGPNL 224

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
            ++ +  + +  +I ++Y  V   Y  RQ  L   Y F C C +C  EA
Sbjct: 225 HIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCSREA 273


>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Ornithorhynchus anatinus]
          Length = 490

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 64/290 (22%), Positives = 113/290 (38%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSS 64
           E V V   E +GRGL +T+   A  ++  +     YSA+ F + +   CH CF++     
Sbjct: 7   EKVEVFTSESKGRGLKATKEFWAADLIFSERA---YSAVVFDSLTHLVCHTCFKR--QEK 61

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A +C   C   A  +  +   C A+ +  +  +P+  N+ L  ++  R   
Sbjct: 62  LQRCGQCKFAHYCDRTCQKDAWLNHKNE--CSAIKK--HGKAPNE-NIRLAARIMWRIER 116

Query: 124 AAYNLA---VVNPSQFQILLAFQGTVTDNDTS--AAHYLSSLCPPPASTTIELTAALLAK 178
               L    +V+    Q  +   G     +       +L    P     +++  + +   
Sbjct: 117 EGTGLTEGCLVSIDDLQNHVDSFGEEEQKELRLDVESFLHFWPPQSQQFSMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              N F L +     Q G ++V   G++      NHDC PN          + V +    
Sbjct: 177 INCNGFTLSD-----QRGLQAV-GVGVFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHT 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R +  + +G E+ +SY       + R+K L   Y F C C+ C+
Sbjct: 231 QMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCE 280


>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Meleagris gallopavo]
          Length = 478

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 67/284 (23%), Positives = 112/284 (39%), Gaps = 30/284 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E+V V   EG+GRGL + +    G ++  +     Y+A+ F + +   CH CF++     
Sbjct: 7   ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVVCHTCFKR--QEK 61

Query: 65  SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
              C  C  A+ C   C   A  +  +   C A+ +  +  +P   N+ L  ++  R   
Sbjct: 62  LHRCGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRIER 116

Query: 124 AAYNLA-----VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
               L+      ++  Q  +    +    D       +L           ++  + +   
Sbjct: 117 EGSGLSENCLVSIDDLQNHVENFDEEEKKDLRIDVESFLEFWPAQSQQFGMQYISHIFGV 176

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              NAF L +     Q G ++V   GI+      NHDC PN C   +      NN  I +
Sbjct: 177 INCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVIF------NNGKIEL 223

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           R +  +  G E+ +SY         RQK+L   Y F C C+ CK
Sbjct: 224 RALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCK 267


>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
 gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
          Length = 392

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALAPIQEGDEICISYLDECMLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYVFICQCSKCRAQAS 368


>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
          Length = 596

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 123/318 (38%), Gaps = 54/318 (16%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
           ++S   +VA  E  GR + + +++  G+IVL + P    S L   N S+ CH+C + T+ 
Sbjct: 175 SLSSACSVASTENEGRFITAVRNIAPGEIVLIEKPFA--SVLLRANYSNHCHHCLKHTLE 232

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPDAANLPLDRQL 117
                C +C  A FCS  C   A  + H  Y C  L  L    +N     A      +  
Sbjct: 233 GIP--CRTCPDARFCSEACRDTAMQTYHQ-YECSVLNTLHHSQINKFGCLAFRAITKQSY 289

Query: 118 QARFLIAAYNL--------AVVNPSQFQILLAFQGTVTD-------NDTSAAHYLSSLCP 162
           Q+   I A +L         +  P  +  ++       D       + T  A YL  L  
Sbjct: 290 QSLKDIRAQDLPLNGCHSDGLYRPQDYNTIIQLVTHAKDRPVQDLFHRTVMAVYLLKLLQ 349

Query: 163 PPA----STTIELTAAL-------LAKDRLNAFGLMEPYIEGQDGRRSV---RAYGIYLK 208
             +       +E+ A +       L     NA  + E Y++      S+      GIY  
Sbjct: 350 QTSYFNGEEDVEMQAYIAGLFLSHLQSFPCNAHEVPELYLDPNAIDLSMPNELGAGIYST 409

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST----- 263
            S FNH C P   R  Y D         +VR I  + +G ++  +Y  +   Y+T     
Sbjct: 410 LSLFNHSCDPGVNRNFYGDTC-------VVRAIKTIRKGHQVSDNYGAL---YATNTLKE 459

Query: 264 RQKRLLYDYDFACDCDRC 281
           R  +L   Y F+C C+ C
Sbjct: 460 RHDKLQPQYFFSCRCEPC 477


>gi|302144054|emb|CBI23159.3| unnamed protein product [Vitis vinifera]
          Length = 503

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  +F NH C PNA R    D       ++IV    DV  G EI  +YF V   +  
Sbjct: 293 GLWILPAFINHSCNPNARRLHVGD-------NVIVHTSRDVKAGEEITFAYFDVLSPWRK 345

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           R K +   + F C+C RCK E
Sbjct: 346 R-KDMAKTWGFQCNCKRCKFE 365


>gi|225444416|ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253788 [Vitis vinifera]
          Length = 550

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  +F NH C PNA R    D       ++IV    DV  G EI  +YF V   +  
Sbjct: 340 GLWILPAFINHSCNPNARRLHVGD-------NVIVHTSRDVKAGEEITFAYFDVLSPWRK 392

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           R K +   + F C+C RCK E
Sbjct: 393 R-KDMAKTWGFQCNCKRCKFE 412


>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
 gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
          Length = 650

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 116/302 (38%), Gaps = 45/302 (14%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
           GR   +   LK   I+L + P +  S L    S   C +CF++   S    CP C+ V F
Sbjct: 240 GRFARTNTDLKPNTILLLERPHV--SVLLEDYSLDHCTHCFKRV--SVPIACPLCADVVF 295

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQARFLIAAYNLAVVN 132
           CS +C T A ++ H  Y C  L  L    +    ++ L    ++ +  FL     LA + 
Sbjct: 296 CSDECETKANATYHR-YECGFLPILWGSGASITCHMALRMITQKSEEYFLKLKPELAGLT 354

Query: 133 PSQFQIL-----------LAFQGTVTDNDTSAAHYLSSL------------CPPPASTTI 169
             Q   L           +  + T +  D      +++L            CP   +   
Sbjct: 355 NEQIDKLPVDDYRKVYKLVTHESTRSPEDFFQRTLMATLLNACLTLGGYGACPQEQNFIG 414

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQD--GRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
            L    L   + NA  + E   E  +  G+ +    G+Y   + FNH C P   R+    
Sbjct: 415 GLLVHNLQLLQFNAHEVSEMIRETAEDIGKSTFIGGGLYPTLALFNHSCDPGVTRY---- 470

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKVEA 285
                   + VR + ++P    +  +Y P+       ++R  LL+ Y F C C  C VE 
Sbjct: 471 ---YRGNQVCVRTVKNIPADSMVAENYGPLFTQVRRDERRDTLLHQYRFTCQCVPC-VE- 525

Query: 286 NW 287
           NW
Sbjct: 526 NW 527


>gi|238485368|ref|XP_002373922.1| set and mynd domain containing protein, putative [Aspergillus
           flavus NRRL3357]
 gi|220698801|gb|EED55140.1| set and mynd domain containing protein, putative [Aspergillus
           flavus NRRL3357]
          Length = 372

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 69/315 (21%), Positives = 118/315 (37%), Gaps = 61/315 (19%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN----------SSSFCHNCFRKTMHSSS 65
           +GR L   Q +K G+++L D P   Y+ +P ++          S+  CH    +T+   S
Sbjct: 44  KGRQLRVCQPVKKGELLLVDLP---YALIPVVDHPEQSDDVRCSNPACHRRVARTVERVS 100

Query: 66  SICP---SCSVAFCSPKCSTA--------------AASSSHSPY------VCQALTRLLN 102
             CP   S  V +C+  C  A               A+S  S +      +   + R+L 
Sbjct: 101 --CPNRCSAEVVWCNSSCRDADNLRHEFECTWLKKYATSIRSKWGEYDFGMLWLIVRILA 158

Query: 103 LNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
               +  N P   +       +   L+    + +  + +F G+      S   + + L  
Sbjct: 159 SRHVEFRNTPSSDKTSGSKTGSDSKLSSFE-TGWNAIRSFCGSQDSWTHSQVRHWTMLVK 217

Query: 163 P--------PASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLK 208
                    P   T      L+ ++  N+FGL        P  E    R    A G+Y  
Sbjct: 218 KYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAGVYPT 277

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR- 267
           A+  NH C PN      +    ++ + ++     D+  G E C+SYF ++     + +R 
Sbjct: 278 AALANHSCSPN------IIHKPDDQSRMVFVASKDIATGEECCISYFDLSKKVELKDRRD 331

Query: 268 -LLYDYDFACDCDRC 281
            L   + F C CDRC
Sbjct: 332 HLQGSFRFVCKCDRC 346


>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
 gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 115/311 (36%), Gaps = 80/311 (25%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSS--- 66
           VA  E  GR L +T+ +K G+ +LR++P+LL    P + S+  C  C R  +        
Sbjct: 50  VATNETLGRHLRATRDVKIGEEILREAPLLLG---PKVASAPICLGCHRNLLAPQKQRGN 106

Query: 67  --ICPSCSVAFCSPKCSTAAA------------------------SSSHSPYVCQALTRL 100
              C SCS   C  +C  +                              S Y    L R 
Sbjct: 107 YYKCSSCSWPLCGRECEESPHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRC 166

Query: 101 LNLNSPDAANL----PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTD-NDTSAAH 155
           + L S D         L+  L+ R     Y +   N      L+ F  T+    D S   
Sbjct: 167 MQLKSSDPEGFARLSALEDHLEERLATPLYQVLRAN------LITFIKTILGMRDWSEVD 220

Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
            L               AA+L     N F L +P       RR VRA  ++  A+  +HD
Sbjct: 221 ILR-------------IAAILDT---NTFELRQPR-----ERRKVRA--LFPGAAMISHD 257

Query: 216 CLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYD 273
           C PN   RFD       ++ +II      + +G  + +SY  P+    ST Q+RL     
Sbjct: 258 CAPNMRHRFD-------DDMNIIFLAKRPIAKGEILSISYTQPLR---STIQRRLHLRQA 307

Query: 274 --FACDCDRCK 282
             F C C RC+
Sbjct: 308 KCFDCACARCQ 318


>gi|342888668|gb|EGU87905.1| hypothetical protein FOXB_01591 [Fusarium oxysporum Fo5176]
          Length = 655

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 71/154 (46%), Gaps = 19/154 (12%)

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
           NPSQ + +L   G    +       L  +   P   T ++   +    + NAFGL +   
Sbjct: 447 NPSQVEKVLRLHG----DYHGLGRKLQEVDDSPVIDTFQVHDII----QRNAFGLGQ--- 495

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           + +D   S  + G++++AS+ NH C+PNA + D+V         II R +  +  G EI 
Sbjct: 496 QTEDEDVSNASTGLWIRASYINHSCIPNAKK-DFVGDL------IIFRAMRRIVTGEEIT 548

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
            SY   + DY  R+  +   ++F C C  C VE 
Sbjct: 549 HSY-DESSDYEARKTNIRRTWNFECCCQLCLVEG 581


>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 467

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/287 (20%), Positives = 110/287 (38%), Gaps = 33/287 (11%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
           S V ++  +AE    GR  V+T+++K G+I+LRD   L+   +   NS   C  C+    
Sbjct: 39  SKVCKSFKLAENPSLGRHYVATRNIKVGEIILRDDQPLITGLM--YNSVPVCLQCYTMLN 96

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
              +  C  C    C   C+        S     +   +     P  +        Q   
Sbjct: 97  QEIAIPCEKCGWPLCQ-NCNEHGLECKFSCSRRDSKISITEFGYPHPS-------YQCIN 148

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA-----AHYLSSLCPPPASTTIELTAALL 176
           +I A +L   NP  ++ L++ +    + + S      AH++         +  E+   + 
Sbjct: 149 IIRALSLKDTNPESYKKLISLESHCNEINNSKEPLNIAHFIKRFFKADDISEEEIVTII- 207

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                   G+++  + G +   +   Y  +Y  AS   H+C  N C   + D        
Sbjct: 208 --------GILQ--VNGHEVPLTDSPYVAVYEMASLIEHNCRAN-CSKSFTDMGG----- 251

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +I+R    + +G  I + Y    +  + R+  LL    F C C+RC+
Sbjct: 252 LIIRAALPITKGDHISICYTDPLWGTANRRHHLLKTKFFECICNRCQ 298


>gi|207340657|gb|EDZ68942.1| YPL165Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 373

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|242206607|ref|XP_002469159.1| predicted protein [Postia placenta Mad-698-R]
 gi|220731830|gb|EED85671.1| predicted protein [Postia placenta Mad-698-R]
          Length = 447

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 55/122 (45%), Gaps = 13/122 (10%)

Query: 165 ASTTIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
           ++ TI+   A+ + D  N+FG+ E P  E  +       + +Y + SFFNH C PN  + 
Sbjct: 329 SAITIDNVRAIFSVDPGNSFGIWEVPLTEESE----CLGFAVYPRPSFFNHHCSPNVNKE 384

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            Y    A   T         V  G E+C+SY       +  RQK L   + F C C +C 
Sbjct: 385 RYGRGLAFVTT-------RSVRAGEELCISYGHAEGMGWRQRQKELREGWFFECSCGKCI 437

Query: 283 VE 284
           V+
Sbjct: 438 VD 439


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 68/287 (23%), Positives = 111/287 (38%), Gaps = 42/287 (14%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFRKTMHSSSSIC 68
           V+ +  +GR L + +  + G+++L   P   Y  +P   SS S C  CF+    ++   C
Sbjct: 15  VSNLPQKGRSLFTARDFRPGEVILSQKP---YICVPNNTSSESRCDGCFKT---NNLKKC 68

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPD---AANLPLDRQLQAR 120
            +C V  +C   C  +          C+ALTRL        +P       L + R LQ  
Sbjct: 69  SACQVVWYCGSSCQKSEWKLHRDE--CKALTRLEKEKRKFVTPTIRLMVRLYIKRNLQNE 126

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY------LSSLCPPPASTTIELTAA 174
            +     L +     + ++ A    +++ D            L +L     S  +   A 
Sbjct: 127 KV-----LPITTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAE 181

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
             +K   NA  + +  +  Q         G++   S  NH C PNA    + +  A    
Sbjct: 182 NFSKFSCNAHSICDSELRPQ-------GIGLFPLVSIINHSCSPNAV-LVFEEQMA---- 229

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +VR + ++ +  EI +SY        TRQK L   Y F C C RC
Sbjct: 230 --VVRAMDNISKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARC 274


>gi|323335307|gb|EGA76596.1| Set6p [Saccharomyces cerevisiae Vin13]
          Length = 373

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
 gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
           [Schizosaccharomyces pombe]
          Length = 483

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 69/288 (23%), Positives = 113/288 (39%), Gaps = 69/288 (23%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFR-----------KTMHSSS 65
           G+G V+T ++  G+I++R    +L  +L   N +  C  C             K +H  S
Sbjct: 15  GKGTVATDNIPIGKIIIRKRVDIL--SLDSANLTRTCSTCTEEKVKTQRCAACKIIHYCS 72

Query: 66  SICPSCSVAFCSPKCSTAAASSSHS--PYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
             C      F   +C    AS  +   P VC+ L RL                    +L+
Sbjct: 73  KGCQKADWPFHKLECKALQASKQNGILPSVCRLLIRL--------------------YLL 112

Query: 124 AAYNLAVVNP-----SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              N A++ P     ++FQ + +         ++A+HY             EL   L  +
Sbjct: 113 WQKNPAIIEPMEGHQNEFQAVSSSWSDAELIASAASHY-------TQIYQAELFQKLFCR 165

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDI 236
             +NA  L+           S  + G+ L       NH C PN C+  + D A       
Sbjct: 166 LAVNAMNLVTS---------SFDSLGMCLDTILCRLNHSCDPN-CQIIF-DGA------- 207

Query: 237 IVRMI--HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           IV+++   D+ +  ++ +SY  +    S RQK+LL  Y F+C C RC+
Sbjct: 208 IVQLVSKRDIKKDEQLFISYIDIRLPKSIRQKQLLKKYFFSCYCPRCE 255


>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
          Length = 516

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 113/286 (39%), Gaps = 46/286 (16%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM-------HSSSSICP 69
           GR + +T+ +K G+++L++ P ++    P +     C +C R+         +     C 
Sbjct: 34  GRFMTATRDIKQGEVILKEKPAVIG---PRMYCKVQCLSCGRQLQPIPNDDGNMDFIRCS 90

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLPLDRQLQARFLIAAYNL 128
           SC+   C   C  A         + Q+  +  +    PD A         A  +IA   +
Sbjct: 91  SCNWPVCGIDCEKAEVHREECKIMVQSKYKCDIKYECPDKAE-------AAYCVIAPLRV 143

Query: 129 AVV---NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL---- 181
            ++   NP  F+ ++  +  V   D      +  +  P     I     +   D++    
Sbjct: 144 LLMKDSNPRMFESIMNLESHV---DQRINTPIYQVLRPNLIMFIRQVLGMNFDDKIILEI 200

Query: 182 ------NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                 N+F      +   D  + +RA  IYL AS  NH C PN  R  Y+    E+ T 
Sbjct: 201 SSIFDNNSFD-----VRSADKTKRLRA--IYLLASMMNHSCRPNT-RHIYL---GEDKTL 249

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            ++  +H + +G EI  +Y    +    R+K L  +  F C C+RC
Sbjct: 250 ALIATVH-IAKGEEITATYTQPLWGTLDRRKFLKTNKYFDCKCERC 294


>gi|349581653|dbj|GAA26810.1| K7_Set6p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 373

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLVFQT-------LYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
          Length = 387

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 117/286 (40%), Gaps = 49/286 (17%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSFCHNCFRKTMHSSSSIC 68
           + + E  G G+ +T++++ G+++ +D P++L   A P    S  C +C+ +       +C
Sbjct: 30  IKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLS--CVSCYSR---RDLRLC 84

Query: 69  -PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAA 125
              C +  CS KC  +         V Q  ++ ++    +  ++ L R L      L+  
Sbjct: 85  GKKCGLLVCSEKCEKSLGHQKECKVVRQWQSKPIS----EDLSVQLVRVLSPIRSLLLGE 140

Query: 126 YNLAVV--------NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
            + +VV        +   F++ +       +       ++  +C    S           
Sbjct: 141 EDKSVVKCLKAHKSDQHGFEVDVMKDALALNVKEDEEKFMRFVCSVMDS----------- 189

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI 236
               NAF ++  +   +D + SV+  G+Y   S  NH C PN C  FD       +   +
Sbjct: 190 ----NAFEVLVGF---EDNQASVK--GLYPLGSLANHSCCPNTCHVFD-------DKQHM 233

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +VR    +PQG EI  SY  + +  S R+  L     F C C RC+
Sbjct: 234 VVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCE 279


>gi|195479534|ref|XP_002100923.1| GE15901 [Drosophila yakuba]
 gi|194188447|gb|EDX02031.1| GE15901 [Drosophila yakuba]
          Length = 532

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 111/282 (39%), Gaps = 37/282 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+++ L+AG+ ++R+ P+ +    P ++    C  C++  ++      CP C+   
Sbjct: 58  GRYLVASRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKPDQYRCPGCAWPL 114

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
           C   C+       H+   CQ    L       A  L  +R   A          ++    
Sbjct: 115 CGITCTGLKHRHGHTETECQ----LFGERRAVAGELLTERAGPAEVRDLYELVMIVRILL 170

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
           L   +P QF ++   +    +   +A    HY   +       T +L   L A+   +  
Sbjct: 171 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQL-EDLEAEQVHDVC 228

Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           G+++   +  GQ+G    +A  +Y  A    HDC PN    D        + +I++R   
Sbjct: 229 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 280

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
            V +   + LSY    Y      KR  + ++   F C C RC
Sbjct: 281 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 112/294 (38%), Gaps = 53/294 (18%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           +TVA I G+GRGL++T +   G ++L   P   Y++ P  I   S C +CF        S
Sbjct: 17  LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73

Query: 67  ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
           +C      +CS  C             C+A+        ++L         L L R+LQ 
Sbjct: 74  MCRV--TWYCSSNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLALKRKLQN 129

Query: 120 RFLIAA-----YNLA------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
             +I +     YNL       +    + Q++L  Q             L SL  P     
Sbjct: 130 EKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQ----------MANLVSLILPFIELD 179

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVD 227
           ++  A   +K   NA  + +P +            G+Y   S  NH C+PNA   FD   
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRT 232

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           A         VR +  + +  E+ +SY       + ++   L  Y F C C RC
Sbjct: 233 A--------YVRALQPINKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 277


>gi|169613520|ref|XP_001800177.1| hypothetical protein SNOG_09891 [Phaeosphaeria nodorum SN15]
 gi|111062036|gb|EAT83156.1| hypothetical protein SNOG_09891 [Phaeosphaeria nodorum SN15]
          Length = 379

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 71/163 (43%), Gaps = 24/163 (14%)

Query: 133 PSQFQILLAFQGTVT---DNDTSAAH-----YLSSLCPPP--ASTTIELTAALLAKDRLN 182
           P Q+  LLA     T     D  AA      +L S+ PP     TT  +   L ++D  N
Sbjct: 224 PEQYPSLLALADDETPYHSTDDLAAFTRTYLHLVSVLPPELLPITTAGVLFTLSSRDSHN 283

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           +FG+     +G +       YG +  AS+FNH C PN  +   V  A E       R   
Sbjct: 284 SFGIRSLEDDGSE----FFGYGCWPAASYFNHSCAPNVEK-RRVGRAWE------FRSGK 332

Query: 243 DVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
           DV  G E+C++Y        +R KR   L  ++ F C C+RC+
Sbjct: 333 DVKCGEELCITYLSGGERKLSRDKRMETLKKNWSFQCGCERCE 375


>gi|448825163|ref|YP_007418094.1| putative set domain-containing protein [Megavirus lba]
 gi|444236348|gb|AGD92118.1| putative set domain-containing protein [Megavirus lba]
          Length = 107

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            I      FNH CLPN   +         N ++    + D+ +G E+  SY  + +D  T
Sbjct: 12  AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 64

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ RL   Y F CDC RC
Sbjct: 65  RQSRLWNQYRFHCDCQRC 82


>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
          Length = 468

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 57/276 (20%), Positives = 112/276 (40%), Gaps = 20/276 (7%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
           V   E  GR +V+T+ +++G+++    P+++    P + S   C  C +K   + S   C
Sbjct: 13  VKRDEVLGRCVVATRDIRSGEVIAESLPLIVG---PKMASPPLCLGCHKKLALAESRYDC 69

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAY 126
             C    C   C ++    +      +A  +  + N NS      P+      R L+   
Sbjct: 70  SKCFWPLCGQACESSPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPI---APLRALLLKK 126

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
           N    +P +F +LL FQ  + D++ +  + +      P  T + L       + L    +
Sbjct: 127 N----SPERFNLLLDFQSHLQDHEKTQVYQVLKKSLVPFFTQL-LKLDTNESEILTICSI 181

Query: 187 MEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
            +    E +D +R V   G+Y   SF +H     +C+ +       +N  +++     + 
Sbjct: 182 FDTNCFEVRDTQRLVNIRGLYPTISFLSH-----SCKHNTKHCFNGDNFRLVLTATTPIK 236

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +G  +  +Y    +   +R+  L     F C C+RC
Sbjct: 237 KGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERC 272


>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 438

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 4/85 (4%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R  G+Y   S FNH C PN      +++A  + T ++V  + D   G E+ +SY   +  
Sbjct: 349 RGQGVYAVGSLFNHSCEPN---LQVLNSATGDET-LVVEALRDCEPGEELYISYIDESLP 404

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
           Y+ RQ++L   Y F C C +C  E+
Sbjct: 405 YAIRQQQLREHYLFECRCSKCVRES 429


>gi|330915865|ref|XP_003297203.1| hypothetical protein PTT_07522 [Pyrenophora teres f. teres 0-1]
 gi|311330265|gb|EFQ94703.1| hypothetical protein PTT_07522 [Pyrenophora teres f. teres 0-1]
          Length = 388

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 80/190 (42%), Gaps = 31/190 (16%)

Query: 113 LDRQLQAR-------FLIAAYNLAVVNPSQFQILLAFQGTVT---DNDTSAAH-----YL 157
           L + LQ R       F ++      ++P Q++ + A     T    +D  AA      +L
Sbjct: 208 LQKALQERITPDVMSFCVSGIVWRYLHPEQWERVQALAPDKTPYHSSDDLAAFTRTYLHL 267

Query: 158 SSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHD 215
            ++ P P     T E    L ++D  N+FG+     EG +       YG +  AS+FNH 
Sbjct: 268 LAILPNPLVPLITAETMMMLSSRDSHNSFGIRSLDDEGSE----FFGYGCWPAASYFNHS 323

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK---RLLYDY 272
           C PN      ++   E       R   D+  G+E+C++Y        +R K   RL   +
Sbjct: 324 CGPN------IEKNREGRA-WYFRAGQDIKHGQELCITYLSGEERKLSRGKRKLRLKKTW 376

Query: 273 DFACDCDRCK 282
            F C C+RC+
Sbjct: 377 GFDCGCERCE 386


>gi|401422463|ref|XP_003875719.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491958|emb|CBZ27231.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 510

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 7/88 (7%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           ++  G+Y   S FNH C+PN        +A +   +I+++    +  G  + ++Y P+  
Sbjct: 416 LKGAGLYSLLSCFNHSCVPNV-----AVSAVDGTHEIVLKTTRSIRAGEPLAITYIPLTA 470

Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEA 285
             ++R  ++R L +Y F C C RC +EA
Sbjct: 471 GATSRAERQRQLKNYFFTCHCPRCDMEA 498


>gi|258576329|ref|XP_002542346.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902612|gb|EEP77013.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 273

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 62/133 (46%), Gaps = 14/133 (10%)

Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKAS 210
           S  H LSSL          L  A   K++L AF G+        D     + +G++L+A+
Sbjct: 75  SIGHQLSSLAKEQQREFFALANA--HKEKLGAFLGIAVTNAIIMD--YETKEHGVFLQAA 130

Query: 211 FFNHDCLPNACRFDY--VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
            FNH C PNA R  +  +D        +++ +  DV +G EI +SY    + +S R + L
Sbjct: 131 RFNHACRPNAMRTFHPILD-------QVVIHVAKDVSEGEEITVSYIEPAHVFSLRVQLL 183

Query: 269 LYDYDFACDCDRC 281
              + F C C+ C
Sbjct: 184 KAKFGFTCVCELC 196


>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
 gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
          Length = 394

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PNAC  F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNACSTFPY------SNDIVVLKALSPIQEGDEICISYLDECQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKILCGNYIFICQCPKCRAQAS 368


>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
           rotundata]
          Length = 388

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 89/195 (45%), Gaps = 28/195 (14%)

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPPASTTIELT 172
           RQ+  + L   +      P  F+ LLA  GT       +S + ++ ++      + +EL 
Sbjct: 204 RQMMQKSLNTEFTEHWFTPDGFRSLLALVGTNGQGIGTSSFSRWVKNV------SALELP 257

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAY------GIYLKASFFNHDCLPNACRFDYV 226
                ++R+  F L++   +  D   +V ++      G+Y+  S  NH C+PNA     V
Sbjct: 258 E----EERIYVFKLIDRIYD--DMEEAVGSFLNNEGSGLYILQSAVNHSCIPNA----IV 307

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYDYSTRQKRLLYDYDFACDCDRCKV 283
           +    NN  ++++ I D+    EIC+SY     +     +RQK L   Y F C C +C  
Sbjct: 308 EFPYSNNV-LVLKAIRDIHPEEEICISYLDECDLERSRHSRQKALSSLYLFICCCSKCLS 366

Query: 284 EANWSDNDNDDENNE 298
           + N  D  ++DE +E
Sbjct: 367 QTNDPDLTSEDEIDE 381


>gi|170029518|ref|XP_001842639.1| msta [Culex quinquefasciatus]
 gi|167863223|gb|EDS26606.1| msta [Culex quinquefasciatus]
          Length = 495

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/288 (21%), Positives = 106/288 (36%), Gaps = 30/288 (10%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH-S 63
           +E   V E    GR  V+++ L+AG+I   D+P   ++  P + S+  C  C       +
Sbjct: 3   AELYKVLECPEMGRYGVASRDLRAGEIAYEDTP---FAVGPSVGSAPLCLECACPVDGCA 59

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLDRQLQA 119
             + CP C    C  KC     +S +    C+      ++  N+     A + LD     
Sbjct: 60  GGARCPRCGWPLCE-KCGAEIEASVYHKAECELFAKHGSKFQNVEDSSEACVQLDCITPL 118

Query: 120 RFLIAA------YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
           R L+A       +N  +V     +           + ++   +L   C      + EL  
Sbjct: 119 RVLLAKEANPDWWNAEIVMMEDHRAERDGNAFWKADQSNVVAFLRDSCGLKDRCSEELIQ 178

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            ++    +NAF         +    S+R  G+Y K     H C+ N     +        
Sbjct: 179 KVIGILDVNAF---------EAHTCSLR--GLYPKMGIMAHSCVTNLAHTVH----PSKG 223

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +I R   D+ +G  +C +Y  +     TRQ  L     F C C RC
Sbjct: 224 YRLIARAAVDIEEGAMLCTTYTHLLAGTRTRQAELQRTKYFTCQCKRC 271


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 112/289 (38%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA + G+GRGL + +S   G++V+   P   Y++ P  I+  S C NCF      +  
Sbjct: 17  LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A+        ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
               I +          + ++ A +  +++ D       +    L  L  P     ++  
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
               +K   NA  + +P +      RS+   G+Y   S  NH C+PNA          E 
Sbjct: 184 THTFSKFACNAHTICDPEL------RSL-GTGLYPVLSIINHSCVPNAVLI------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            T   VR +  + +  E+ +SY         RQ  L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277


>gi|354543914|emb|CCE40636.1| hypothetical protein CPAR2_106710 [Candida parapsilosis]
          Length = 370

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 8/117 (6%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T ++  +L+ K+  NAFG+       +D  +    + +Y  ASFFNH C PN  +     
Sbjct: 258 TTDIVRSLIGKNLSNAFGIWSN-TTAKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRV-- 314

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
                  D+    + D+  G E+C++Y     +   +++  L ++ F C C +C+ E
Sbjct: 315 -----RNDMRFETLRDIAIGEELCINYGNFQNEDVEKRQLELQEWFFDCGCTKCQSE 366


>gi|367009358|ref|XP_003679180.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
 gi|359746837|emb|CCE89969.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
          Length = 369

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 70/154 (45%), Gaps = 19/154 (12%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTIELTAALLAKDRLNAFGLMEPYI 191
           ++F ILL FQ  +          L  + P     + +++L   +L  +  NAFG+ E   
Sbjct: 227 ARFPILLDFQILIFKT-------LYIILPGYTRQNLSVKLFRHILGSEYGNAFGIWEQ-- 277

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           E     R    Y +Y +AS+FNH C PN  +       +     ++     DVP+G  +C
Sbjct: 278 EELSENREFLGYAVYPRASYFNHSCDPNLTK-------SRIKGTMVFTANKDVPKGEPLC 330

Query: 252 LSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           + Y  +      ++++LL + + F C C RC+ E
Sbjct: 331 IDYSGLLSHTMLKRRQLLKENWFFECCCTRCESE 364


>gi|189209411|ref|XP_001941038.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187977131|gb|EDU43757.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 388

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 60/132 (45%), Gaps = 16/132 (12%)

Query: 156 YLSSLCPPP--ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           +L ++ P P     T E    L ++D  N+FG+     EG +       YG +  AS+FN
Sbjct: 266 HLLAILPQPLLPLVTAETLIMLSSRDSHNSFGIRSLDDEGSE----FFGYGCWPAASYFN 321

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---Y 270
           H C PN      ++   E       R   D+  G+E+C++Y        +R KR+L    
Sbjct: 322 HSCGPN------IEKNREGRA-WYFRAGQDIKLGQELCITYLSGEERKLSRGKRMLRLQK 374

Query: 271 DYDFACDCDRCK 282
            + F C C+RC+
Sbjct: 375 TWGFDCGCERCE 386


>gi|398398565|ref|XP_003852740.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
           IPO323]
 gi|339472621|gb|EGP87716.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
           IPO323]
          Length = 383

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 14/120 (11%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T +L   ++  D  NAFG+      G +       + +Y  AS+FNH C PN  +   V 
Sbjct: 272 TPKLCLTMVRADNHNAFGIRG----GGEDSEEYMGWAVYPSASYFNHSCSPNLAK-RRVG 326

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
              E  T        DV  G E C++Y        D + RQ+RL   + F C C+RC+ E
Sbjct: 327 REWEFTT------ARDVDAGEECCITYLGGDEKGMDRAERQRRLKEVWGFDCGCERCRDE 380


>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
 gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
          Length = 392

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+YL  S  NH C+PN+C  F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYLLQSKINHSCVPNSCSTFPY------SNDIVVLKALTPIQEGDEICISYLDECQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKVLRENYIFVCQCPKCRAQAS 368


>gi|46115982|ref|XP_384009.1| hypothetical protein FG03833.1 [Gibberella zeae PH-1]
          Length = 690

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 70/159 (44%), Gaps = 19/159 (11%)

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
           NPSQ + +L  QG    +       L  L   P   T ++   +    + NAFGL +   
Sbjct: 447 NPSQVERILGLQG----DYKGIGQKLIELDEVPVIDTFQIHDIV----QCNAFGLGQ--- 495

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           + +D   S  + G++++AS+ NH C+PNA +    D        I+ R    +  G EI 
Sbjct: 496 QTEDEDISNASTGLWVRASYINHSCIPNAKKDLIGDL-------ILFRATRRIASGEEIT 548

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
            +Y   +  Y  RQ      ++F C C  C V+ + SD 
Sbjct: 549 HAY-DESTSYEARQAAFRRTWNFECRCPLCLVQMDESDT 586


>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
          Length = 400

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 82/367 (22%), Positives = 132/367 (35%), Gaps = 109/367 (29%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +    
Sbjct: 16  KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 75

Query: 68  -------------------CPSCSVAFCSPKCSTAAASS---------SHSPY------- 92
                              CP C V +CS +C  AAA           SH P        
Sbjct: 76  SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQILCPGPSHDPRHPLNKLQ 135

Query: 93  ------------------------VCQA-----LTRLLNLNSPDAANLP---LDRQLQAR 120
                                   V QA       RL N      AN     + + L+ +
Sbjct: 136 EAWRSVHYPPETASIMLMARMVATVKQAKDKDHWVRLFNHFCSRTANQEQAIVHKLLKGK 195

Query: 121 F------LIAAYNLAV--------VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPP 164
           F      L+  +  A+          P  F+ L A  GT       +S + ++       
Sbjct: 196 FKDQLELLLGLFKEALYEEALSLWFTPEGFRSLFALVGTNGQGIGTSSLSQWVH------ 249

Query: 165 ASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACR 222
           A   +ELT     +++L+ F   +   IE   G   +    G+++  S  NH C+PNA  
Sbjct: 250 ACDALELTPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-- 305

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCD 279
                +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C 
Sbjct: 306 ---ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFNCSCP 362

Query: 280 RCKVEAN 286
           +C  EA+
Sbjct: 363 KCLAEAD 369


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 112/294 (38%), Gaps = 53/294 (18%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           +TVA I G+GRGL++T +   G ++L   P   Y++ P  I   S C +CF        S
Sbjct: 17  LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCFTSGNLRKCS 73

Query: 67  ICPSCSVAFCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQA 119
           +C      +CS  C             C+A+        ++L         L L R+LQ 
Sbjct: 74  MCRV--TWYCSSNCQKEEWKLHQLE--CRAMAALTEDRKKMLTPTIRLMVRLALKRKLQN 129

Query: 120 RFLIAA-----YNLA------VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
             +I +     YNL       +    + Q++L  Q             L SL  P     
Sbjct: 130 EKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQ----------MANLVSLILPFIELD 179

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVD 227
           ++  A   +K   NA  + +P +            G+Y   S  NH C+PNA   FD   
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRP-------LGTGLYPVISIINHSCVPNAVLIFDGRT 232

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           A         VR +  + +  E+ +SY       + ++   L  Y F C C RC
Sbjct: 233 A--------YVRALQPINKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 277


>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
          Length = 498

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/276 (20%), Positives = 112/276 (40%), Gaps = 20/276 (7%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-C 68
           V   E  GR +V+T+ +++G+++    P+++    P + S   C  C +K   + S   C
Sbjct: 13  VKRDEVLGRCVVATRDIRSGEVIAESLPLIVG---PKMASPPLCLGCHKKLALAESRYDC 69

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAY 126
             C    C   C ++    +      +A  +  + N NS      P+      R L+   
Sbjct: 70  SKCFWPLCGQACESSPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIA---PLRALLLKK 126

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
           N    +P +F +LL FQ  + D++ +  + +      P  T + L       + L    +
Sbjct: 127 N----SPERFNLLLDFQSHLQDHEKTQVYQVLKKSLVPFFTQL-LKLDTNESEILTICSI 181

Query: 187 MEPY-IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
            +    E +D +R V   G+Y   SF +H     +C+ +       +N  +++     + 
Sbjct: 182 FDTNCFEVRDTQRLVNIRGLYPTISFLSH-----SCKHNTKHCFNGDNFRLVLTATTPIK 236

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +G  +  +Y    +   +R+  L     F C C+RC
Sbjct: 237 KGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERC 272


>gi|125542564|gb|EAY88703.1| hypothetical protein OsI_10178 [Oryza sativa Indica Group]
          Length = 536

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 8/85 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  +F NH C PNA R    D A       IV    D+  G EI  +YF V    
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 363

Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
           S R++     + F C CDRC+ EA+
Sbjct: 364 SKRRE-AARAWGFECQCDRCRFEAS 387


>gi|194762638|ref|XP_001963441.1| GF20287 [Drosophila ananassae]
 gi|190629100|gb|EDV44517.1| GF20287 [Drosophila ananassae]
          Length = 528

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 69/293 (23%), Positives = 109/293 (37%), Gaps = 59/293 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+++ L+AG++++ + P+ +    P ++    C  C++  ++ S+   CP C    
Sbjct: 54  GRYLVASRQLEAGELLITEEPLAIG---PCVSGDPVCLGCYQPVSLDSTQYRCPGCQWPL 110

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
           C   C        H+   CQ    L       AA L  DR   A          ++    
Sbjct: 111 CRDSCPGIHQPHGHTSEECQ----LYGERRAVAAGLLTDRTGPAEVRDLYELVMIVRILL 166

Query: 128 LAVVNPSQFQILLAFQGTVTD---NDTSAAHY-------------LSSLCPPPASTTIEL 171
           L   +P QF  +   +    +   N     HY             L  L P      I  
Sbjct: 167 LRQRDPEQFAAIERMESHTEERRKNQVLWQHYEQKVVQRLREVWQLEDLQP----EQIHQ 222

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
              +L    +N F +      GQ+G    +A  +Y  A    HDC PN    D+ ++ A 
Sbjct: 223 VCGILD---VNCFEI------GQNG---AKARTLYPSAFLLAHDCSPNTAHTDHPESYA- 269

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
               II+R    V +   + LSY    Y      KR  + ++   F C C RC
Sbjct: 270 ----IILRTSRRVREREALTLSY---AYTLQGTLKRRSFMHEGKLFWCVCQRC 315


>gi|426193275|gb|EKV43209.1| hypothetical protein AGABI2DRAFT_211045 [Agaricus bisporus var.
           bisporus H97]
          Length = 371

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 59/130 (45%), Gaps = 11/130 (8%)

Query: 155 HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNH 214
            +L  L      T +E T  LL ++  N FG+ +  +E  +    +  +G+Y   S+FNH
Sbjct: 251 RHLVYLLKKSVDTPLE-TRVLLGREHGNVFGIWDMALEEMNS--EMLGWGLYSFGSYFNH 307

Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
           DC P   R ++   A E         I D+  G E+C+SY   +     ++K  L ++ F
Sbjct: 308 DCSPT-LRKEHKGRAIEYYA------IKDIDTGEELCISYVETD-QVVEKRKEALKEWFF 359

Query: 275 ACDCDRCKVE 284
            C C  C  E
Sbjct: 360 DCGCGTCVAE 369


>gi|256274267|gb|EEU09175.1| Set6p [Saccharomyces cerevisiae JAY291]
          Length = 373

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAKDRLNAFGLMEPYI 191
           S+F +LL FQ  V          L  L P       +I L   +L  +  NAFGL +   
Sbjct: 231 SKFPVLLHFQKLV-------FQILYILLPSHLHRMLSIPLLRHILGTEYGNAFGLWQ--- 280

Query: 192 EGQDG-RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           EG+    R    Y ++ +AS+FNH C PN  ++            ++  +  D+ +  +I
Sbjct: 281 EGEASDSREYFGYWVFPEASYFNHSCNPNITKY-------RKGNSMLFTVNRDIKKDEQI 333

Query: 251 CLSYFPVNYDYSTRQKRLLYD-YDFACDCDRCKVE 284
           C+ Y  V    + +++  L D + F C C+RCK E
Sbjct: 334 CIDYSGVLDLPTVKRRAFLADSWFFDCACERCKSE 368


>gi|242216833|ref|XP_002474221.1| predicted protein [Postia placenta Mad-698-R]
 gi|220726639|gb|EED80582.1| predicted protein [Postia placenta Mad-698-R]
          Length = 152

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 55/122 (45%), Gaps = 13/122 (10%)

Query: 165 ASTTIELTAALLAKDRLNAFGLME-PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
           ++ TI+   A+ + D  N+FG+ E P  E  +       + +Y + SFFNH C PN  + 
Sbjct: 41  SAITIDNVRAIFSVDPGNSFGIWEVPLTEESE----CLGFAVYPRPSFFNHHCSPNVNKE 96

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            Y    A   T         V  G E+C+SY       +  RQK L   + F C C +C 
Sbjct: 97  RYGRGLAFVTT-------RSVRAGEELCISYGHAEGMGWRQRQKELREGWFFECSCGKCI 149

Query: 283 VE 284
           V+
Sbjct: 150 VD 151


>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
          Length = 697

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 14/108 (12%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           L A  + N+FGL        DG  +     +Y +AS+FNH C PN CR  Y    A    
Sbjct: 548 LCAALQCNSFGLFNA-----DG--NCIGVALYPEASYFNHSCCPNICRVTYRGLFA---- 596

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY-DYDFACDCDRC 281
                 + ++P+G  + + Y  V    +  ++R L+  Y F C+C RC
Sbjct: 597 --AFHALREIPKGEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642


>gi|157124069|ref|XP_001660316.1| hypothetical protein AaeL_AAEL009751 [Aedes aegypti]
 gi|108874117|gb|EAT38342.1| AAEL009751-PA [Aedes aegypti]
          Length = 500

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/296 (22%), Positives = 113/296 (38%), Gaps = 41/296 (13%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           M+   E   + E +  GR LV+ + L  GQ +L + PI++    P+ +S   C NCFR  
Sbjct: 1   MARFEENCEILESDELGRYLVAKRDLARGQQILVEQPIVVG---PYWDSGIRCLNCFR-- 55

Query: 61  MHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
             +S +IC  C  A     C+T      H+   C+       LNS    N   D      
Sbjct: 56  --NSCTICRKCKRAPLCFDCTT------HNETECE-----FYLNSGLNINFLFD-HFNVV 101

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVT---DNDTSAAHYLSSLCP-----------PPAS 166
             +    LA  N ++F  ++  +       D D   AH    + P               
Sbjct: 102 TPVRCLLLASSNRAKFDEMMTMEAHTEERRDTDIWNAHEKKVVRPLLENKAFDGKMENLK 161

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
            T EL   +     +N+F +     + Q     +R  G+Y KA+   H+C+ N       
Sbjct: 162 VTSELIQRICGFFDVNSFEIRGNMDDAQQEESLIR--GLYPKAALMAHNCVSNTL----- 214

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             + + + ++ + +   V +G  +  +Y    +    R+  L     F C C RC+
Sbjct: 215 -ISVDGDANLRLYVTTPVKKGEMLFYNYTRSLFGTFERRTHLKLGKYFVCTCSRCE 269


>gi|451897716|emb|CCT61066.1| hypothetical protein [Leptosphaeria maculans JN3]
          Length = 726

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 9/105 (8%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
           S+   G+++ +S+ NH CLPN+ R    D        + +R   D+ Q  EI   Y    
Sbjct: 485 SIVRNGVWIHSSYMNHSCLPNSVRSFIGDM-------LFLRAARDIEQDEEITTQYLAPE 537

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
            +Y TRQ++    + F C+C  C  +   S     ++   E+ +E
Sbjct: 538 LEYETRQQKFETTWGFRCNCTLCTFDQ--SVGQAKEKQRMEIFEE 580


>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
 gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 416

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 82/367 (22%), Positives = 132/367 (35%), Gaps = 109/367 (29%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +    
Sbjct: 32  KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 91

Query: 68  -------------------CPSCSVAFCSPKCSTAAASS---------SHSPY------- 92
                              CP C V +CS +C  AAA           SH P        
Sbjct: 92  SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQILCPGPSHDPRHPLNKLQ 151

Query: 93  ------------------------VCQA-----LTRLLNLNSPDAANLP---LDRQLQAR 120
                                   V QA       RL N      AN     + + L+ +
Sbjct: 152 EAWRSVHYPPETASIMLMARMVATVKQAKDKDHWVRLFNHFCSRTANQEQAIVHKLLKGK 211

Query: 121 F------LIAAYNLAV--------VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPP 164
           F      L+  +  A+          P  F+ L A  GT       +S + ++       
Sbjct: 212 FKDQLELLLGLFKEALYEEALSLWFTPEGFRSLFALVGTNGQGIGTSSLSQWVH------ 265

Query: 165 ASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACR 222
           A   +ELT     +++L+ F   +   IE   G   +    G+++  S  NH C+PNA  
Sbjct: 266 ACDALELTPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-- 321

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCD 279
                +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C 
Sbjct: 322 ---ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFNCSCP 378

Query: 280 RCKVEAN 286
           +C  EA+
Sbjct: 379 KCLAEAD 385


>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
          Length = 473

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/286 (23%), Positives = 115/286 (40%), Gaps = 34/286 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
           E+V V   EG+GRGL + +    G ++  +     Y+A+ F + +   CH CF++   +H
Sbjct: 2   ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 58

Query: 63  SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A+ C   C   A  +  +   C A+ +  +  +P   N+ L  ++  R 
Sbjct: 59  R----CGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRI 109

Query: 122 L-----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
                 ++   L  ++  Q  +    +    D       +L           ++  + + 
Sbjct: 110 EREGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPAQSQQFGMQYISHIF 169

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
                NAF L +     Q G ++V   GI+      NHDC PN C   +      NN  I
Sbjct: 170 GVINCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPN-CTVIF------NNGKI 216

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +R +  +  G E+ +SY         R+K+L   Y F C C+ CK
Sbjct: 217 ELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 262


>gi|380489700|emb|CCF36529.1| TPR domain-containing protein [Colletotrichum higginsianum]
          Length = 257

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 11/103 (10%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFG    +  G++G R+  + G+++ A++ NH C+ NA + +YV         +++R +
Sbjct: 90  NAFGPGGQF--GEEGARNA-STGLWIWAAYINHSCIANAKK-EYV------GDLMVLRAL 139

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             + +G EI  SY   + DY  RQK LL  + F C C  C  E
Sbjct: 140 GAIKKGEEIFHSY-DESADYEARQKALLTTWGFECGCALCAAE 181


>gi|358391269|gb|EHK40673.1| hypothetical protein TRIATDRAFT_286297 [Trichoderma atroviride IMI
           206040]
          Length = 347

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 5/78 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L A+  NH C PNA +     +  EN   + V  + D+ +G+EI +SY      Y  
Sbjct: 115 GLFLDAARINHSCRPNAHK-----SWNENLQRLTVHAVRDIERGQEITISYLGETLSYIE 169

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ  L   + F C CD C
Sbjct: 170 RQAILKQRFRFDCGCDLC 187


>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 402

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 7/85 (8%)

Query: 202 AYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
             G++   S  NH C PNA   + ++  AA+      V  + D+ +  EI LSY   +  
Sbjct: 264 GVGLFRAVSMTNHSCWPNAEVDYPFLTNAAQ------VTALRDIAEKEEILLSYIDESLP 317

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
            + RQ+ L   Y F C C RC+VEA
Sbjct: 318 LAERQRLLKSHYKFTCGCQRCQVEA 342


>gi|255947380|ref|XP_002564457.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591474|emb|CAP97706.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 351

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 60/137 (43%), Gaps = 14/137 (10%)

Query: 156 YLSSLCPPPASTTIELTAALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKA 209
           YL S    P   + +    L+ ++  N+FGL        P  +    R    A  +Y  A
Sbjct: 205 YLRSSPALPHGMSADRVLHLICQEEANSFGLYPRETGHFPLPKDPVDRGEQFAAAVYPTA 264

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DYSTRQKR 267
           +  NH CLPN      V   A++   ++     D+  G E C+SYF +    D ++R++ 
Sbjct: 265 AIANHSCLPN------VIHKADDKGRMVFTASRDIFPGEECCISYFDLTQYTDLASRREH 318

Query: 268 LLYDYDFACDCDRCKVE 284
           L   + F C C+RC  E
Sbjct: 319 LRKSFRFVCQCERCVSE 335


>gi|402083521|gb|EJT78539.1| hypothetical protein GGTG_03639 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 779

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+  A+  NH C+ N+ R    D         I R   D+P G EI   Y PV  D+  
Sbjct: 483 GIWPHAAIMNHACVANSTRAFCGDL-------FITRASRDIPAGDEITQQYVPVRADWGE 535

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ +L + + F C C  C  E
Sbjct: 536 RQAQLRHWWGFECACALCAAE 556


>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Monodelphis domestica]
          Length = 441

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/337 (21%), Positives = 121/337 (35%), Gaps = 71/337 (21%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPI---LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS 72
           RG GL +  SL+ G+++ R  P+   +   +L  +     C     K +      C  C 
Sbjct: 14  RGNGLRALASLRPGELLFRSEPLAYTVCKESLGVVCERCLCRK--EKLLR-----CSQCK 66

Query: 73  VA-FCSPKCSTAAASSS---------------HSPY-----------VCQALTRLLNLNS 105
           VA +C   C   ++  S               H P            VC  L   + +  
Sbjct: 67  VARYCGSACQADSSEVSFVDLFGRQSPTSTFPHFPSEPPKHLANSGSVCINLIVYVYIPR 126

Query: 106 PDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPA 165
              A LPL   +   F I + ++  ++  + + L     T+           S L  PPA
Sbjct: 127 KYNAMLPL---MIGPFPIFSPDIKNLSEEKKKGLGHLAVTLQLYLKEEIQDASQL--PPA 181

Query: 166 STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
               E  A ++     N F +        +G       G+Y   S  NH C PN      
Sbjct: 182 FDIFESFAKVIC----NGFAI-------SNGEMQEVGVGLYPSMSLLNHSCDPNC----- 225

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-- 283
                     + +R + D+ QG E+ + Y  V    + RQK+L   Y F CDC  C+   
Sbjct: 226 --VIVFEGPSLFLRAVRDIQQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCETRS 283

Query: 284 ---------EANWSDNDNDDENNEEVMDEDQDEQMVA 311
                    E  W +  +  +  E++  +++ EQ++A
Sbjct: 284 KDDDMLSGEEQAWKEVQDSLKKVEDLRAQEKWEQILA 320


>gi|242036801|ref|XP_002465795.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
 gi|241919649|gb|EER92793.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
          Length = 543

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 14/105 (13%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  SF NH C PNA R    D A       IV    D+  G EI   YF V    
Sbjct: 320 GVGLWVLPSFINHSCHPNARRTHIGDHA-------IVHASRDIKAGEEITFPYFDVLVPV 372

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQD 306
           S R++     + F C CDRC+ EA       D    +E++  ++D
Sbjct: 373 SKRRE-ASRAWGFECKCDRCRFEA------EDSILRQEILKSEKD 410


>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
 gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
          Length = 348

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 58/118 (49%), Gaps = 9/118 (7%)

Query: 195 DGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
           DG R   A+ G++ + S FNHDC PN     +       +T I VR   D+P G E+ +S
Sbjct: 163 DGDRKHSAHLGVFPEVSKFNHDCRPNV----HYRLNGLKHTTIAVR---DIPAGDELTIS 215

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVA 311
           Y      ++TRQ + L ++ F C C +C + A  +   ++     E ++ + +  M A
Sbjct: 216 YIYGRASHATRQSQ-LREWGFTCTCPQCTLNATETGASDNRLRQIEALETEIERIMGA 272


>gi|307211936|gb|EFN87850.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 331

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 109/274 (39%), Gaps = 39/274 (14%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--SFCHNCFRKTMHSSSSICP----- 69
           GRG+ + + ++ G+++  D P+LL    P  +S     C  C++         CP     
Sbjct: 36  GRGMFAMRDIQPGELIFMDVPLLLG---PRCHSKYLPMCVVCYKND-------CPLFPCD 85

Query: 70  -SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
             C +  CS KC  +   +       +A  R+L    P   +      L    LI A  L
Sbjct: 86  HGCGLPICSTKCENSLMHA-------RAECRILRELEPTCGSTWSKDLLLTVVLIRALAL 138

Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLS-SLCPPPASTTIELTAALLAKDRLNAFGLM 187
           +     Q ++L AF+  +          L  ++   P    + L   +      N+F + 
Sbjct: 139 S---SEQRRLLFAFECHLNLAPNYEIDLLKRNITHLPDEEQLALMKRICGIFNTNSFEV- 194

Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
              +  +D  R V   GIY   +  NH C+PN  R  + D        + +     +  G
Sbjct: 195 ---VAVRDKDRVVSLRGIYPIGALQNHCCVPNT-RHHFDD-----QQRLYMSAALPIAAG 245

Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +EI +SY  + +D  +R+  L     F+CDC+RC
Sbjct: 246 QEITMSYTDLLWDTRSRRCFLRVTKHFSCDCNRC 279


>gi|449539909|gb|EMD30911.1| hypothetical protein CERSUDRAFT_163648 [Ceriporiopsis subvermispora
           B]
          Length = 433

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/344 (21%), Positives = 127/344 (36%), Gaps = 70/344 (20%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHSSSSI-CP 69
           +G+G+GLV+ + +  G+IV ++ P +L   +       SS  C  C     +S+  I CP
Sbjct: 100 KGKGKGLVAKEYVAEGEIVWKEDPFILAPEWDIYDLQTSSLSCTYCSTPLQNSTLVIPCP 159

Query: 70  S------CSVAFCSPKCSTAAASSS--------------------------HSPYVCQAL 97
           +      C+  FCS  C + +A +                           H+   C A 
Sbjct: 160 ASTSHTPCTARFCSRLCLSRSARTHPLLCPAQNPASAPLLAFARKAQWMALHALAQCTAR 219

Query: 98  TRLLNLNSPDA-----------ANLPLDRQLQARFLIAAYNLAVVNPSQFQILL-AFQGT 145
             L N   P A           A L ++ + +  ++  A          +Q+ + AF+  
Sbjct: 220 ILLTNQQDPSALQADLEVVRALAQLGMEERAKGEWMKGAEPDRATWKRAYQLFVQAFR-- 277

Query: 146 VTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDR-LNAFGLMEPYIEGQDGRRSVRAYG 204
              + T  AH            + ++   L   D  L   G M   +E   G        
Sbjct: 278 ---DPTDDAHKKKLAKLLKKPLSKDIADELFDYDAFLRGLGRMSLNLEAHGG-------- 326

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +Y+  S  NH C PNA         A +   +I +   D+  G E+ ++Y         R
Sbjct: 327 LYVLHSHLNHSCAPNASIRHLDQRTALSRITVIAK--RDIAPGEELLITYVNPALPLPQR 384

Query: 265 QKRLLYDYDFA-CDCDRCKVEANWSDNDNDDENNEEVMDEDQDE 307
           ++ ++ ++ F  C+C+RC  E    + D+  E      D D+ E
Sbjct: 385 RREVM-EWGFGKCNCERCLEE----EKDHAKEGETGAADVDELE 423


>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
 gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
          Length = 515

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/291 (20%), Positives = 108/291 (37%), Gaps = 61/291 (20%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
           E  GR LV+T+++K  +IVL+++P++     P   S+  C  C           C  C  
Sbjct: 49  EQLGRHLVATRTIKPYEIVLKEAPLV---RGPAQISAPVCLGCLNGIEAEDHIECEQCGW 105

Query: 74  AFCSPKCSTAAASSS--------------------HSPYVCQALTR--LLNLNSPDAANL 111
             C P+C +     +                    H  Y C +  R  L+   SP+ A  
Sbjct: 106 PLCGPECKSLDEHKAECRLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKA-- 163

Query: 112 PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
                                 S+FQ L + + T   ++   A  +S     P       
Sbjct: 164 ----------------------SKFQELESLESTRRGSNQWKADLVSIGQFIPKFFK--- 198

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAA 230
           T     ++ + A G ++  I G +   +  ++  ++  ASF  + CLPN  +      + 
Sbjct: 199 TQKFTEEEIMKAVGALQ--INGHEVPTTDPSHVAVFYTASFTENSCLPNLAK------SF 250

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             N   I+   H++ +   + + Y    +  + RQ+ L+    F C CDRC
Sbjct: 251 NKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRC 301


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/281 (23%), Positives = 102/281 (36%), Gaps = 45/281 (16%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           +GRGL + +    G++V    P   Y  L      + C +CF  T       C  C  A+
Sbjct: 17  KGRGLRALRDFAVGELVFA-CPAYSY-VLTVNERGAHCEHCF--TRREDLFKCGKCKQAY 72

Query: 76  -CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
            C+  C             A  +H    C + TR+    +P +  L L ++ +       
Sbjct: 73  YCNVDCQRGDWPMHKLECVAMCTHGENWCPSETRVTTERTP-SERLLLLKEFE------- 124

Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
                   S    + + +  +   D +A HY  S          ELT  L A+   N F 
Sbjct: 125 --------SHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDDQELTE-LFAQVNCNGFT 175

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +       +D   S     ++   +  NH C PN     Y    AE      VR + ++ 
Sbjct: 176 I-------EDEELSHLGSAVFPDVALMNHSCSPNVI-VTYKGTVAE------VRAVQEIN 221

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            G EI  SY  + Y    R++RLL  Y F C C  C  ++ 
Sbjct: 222 PGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECTTKSK 262


>gi|50305365|ref|XP_452642.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49641775|emb|CAH01493.1| KLLA0C09944p [Kluyveromyces lactis]
          Length = 367

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 18/149 (12%)

Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFGLMEPYIEG 193
           +F +LL FQ  V         ++ +L P      T  +   +L  +  N+FG+ E     
Sbjct: 231 RFPVLLEFQIKVFK-------WVYTLAPQYRYIMTTSVFRHILGSEYGNSFGIWET--SE 281

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
            D  R    Y ++ +AS+FNH C+PN  +        E     I++   DV +G E+C+ 
Sbjct: 282 SDDDREYLGYMVHPEASYFNHSCVPNVEK-----KRVERVFQYILQT--DVQKGEELCID 334

Query: 254 YFPVNYDYSTRQKRLLYD-YDFACDCDRC 281
           Y  + +    R++ +L D + F C C RC
Sbjct: 335 YKGILHLDVNRRRAILKDNWFFECKCSRC 363


>gi|24641164|ref|NP_727478.1| CG11160, isoform B [Drosophila melanogaster]
 gi|22832071|gb|AAN09278.1| CG11160, isoform B [Drosophila melanogaster]
          Length = 403

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/278 (22%), Positives = 112/278 (40%), Gaps = 29/278 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+ + L+AG+ ++R+ P+ +    P ++    C  C+   ++ +    CP C+   
Sbjct: 58  GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114

Query: 76  CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           C   C+       H+   CQ  A  R +   L +  A    +    +   ++    L   
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174

Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
           +P QF ++   +    +   +A    HY   +       T +L   L A+      G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232

Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
              +  GQ+G    +A  +Y  A    HDC PN    D       ++ +I++R    V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
              + LSY    Y      KR  + ++   F C C RC
Sbjct: 285 REALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319


>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
          Length = 358

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/310 (22%), Positives = 122/310 (39%), Gaps = 64/310 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFI----NSSSFCHN--CFRK-TMHSSSSIC 68
           +GR L +   ++AG++++ D P   Y+ LP +       S C N  C R+ + H+++S+ 
Sbjct: 62  KGRTLSTNGPIRAGELLMADLP---YAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118

Query: 69  --PSCS--VAFCSPKCS---------TAAASSSHSP-----------YVCQALTRLLNLN 104
               CS  + +C+  C            A    +SP           Y+   + RLL   
Sbjct: 119 CETGCSLDIWWCNESCKDEDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLLAAR 178

Query: 105 SPDA--ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP 162
           +  +  A+L   + +Q    +   N      SQ Q             T A  YL     
Sbjct: 179 NSHSKIASLAAGKGVQ----LLRSNRDSWPTSQLQ----------HWSTLAEKYLLQSSL 224

Query: 163 PPASTTIELTAALLAKDRLNAFGL------MEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
            P +  ++    L+  +  N F L      + P+      R +     ++L+ +  NH C
Sbjct: 225 LPETLDLDTLVDLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSC 284

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKRLLYDYDF 274
            PN      V   A++   ++V  + D+  G E C SYF ++   D   R+K+    + F
Sbjct: 285 APN------VTHQADDRGRMMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTF 338

Query: 275 ACDCDRCKVE 284
            C C RC  E
Sbjct: 339 TCTCPRCLQE 348


>gi|50294041|ref|XP_449432.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528746|emb|CAG62408.1| unnamed protein product [Candida glabrata]
          Length = 374

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 83/197 (42%), Gaps = 36/197 (18%)

Query: 92  YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
           +V + L R+ +L+ P +  L     LQ+          +    +F +LL FQ  V     
Sbjct: 199 FVSETLFRIKHLD-PGSDTLEAFHNLQSN--------ELSKIQKFPVLLNFQVLVFKT-- 247

Query: 152 SAAHYLSSLCPP--PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA 209
                L  L P     S T++    +L  +  NAFG+ +   E  D R     Y ++ KA
Sbjct: 248 -----LFVLLPDNLKNSLTVQAFRHILGSEYGNAFGIWQD-TEAVDSREYF-GYWVFPKA 300

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY-----FPVNYDYSTR 264
           S+FNH C PN  +          +  ++  +  DV  G E+ + Y      PV+     R
Sbjct: 301 SYFNHSCDPNITK-------TRIDRKMVFTLNRDVACGEELNIDYSGVLDLPVD----RR 349

Query: 265 QKRLLYDYDFACDCDRC 281
           QK L  ++ F C C+RC
Sbjct: 350 QKFLFENWFFVCGCNRC 366


>gi|169613454|ref|XP_001800144.1| hypothetical protein SNOG_09858 [Phaeosphaeria nodorum SN15]
 gi|160702729|gb|EAT83123.2| hypothetical protein SNOG_09858 [Phaeosphaeria nodorum SN15]
          Length = 645

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 55/105 (52%), Gaps = 11/105 (10%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFG    +  G++  R+  + G+++ A++ NH C+PNA + +Y+         +I+R  
Sbjct: 478 NAFGPGNQF--GEENARNA-STGLWVYAAYINHSCIPNATK-EYI------GDMMILRAT 527

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             +  G EI  +Y  V+ DY  RQ  L+  + F C C  C+ E+ 
Sbjct: 528 RAIAAGEEIFHAY-DVSSDYDARQASLMTTWGFKCACKLCEAESK 571


>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
          Length = 382

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 13/107 (12%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
           G+Y + S  NH C PNA    +V     NN  + +  + D+  G EIC+SY         
Sbjct: 286 GLYARQSCANHSCDPNA-EISFV----HNNYRLSLIALKDIQPGEEICISYLGDCDNERS 340

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDE 307
             +R+K L+ +Y FAC+C +C  E      D+ D  +EE M  ++D+
Sbjct: 341 RHSRRKTLMENYLFACECSKCLAEI-----DDPDVTSEEEMSAEEDD 382


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 11/138 (7%)

Query: 144 GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY 203
           G   D+  + A  +    P      +E    + A+   N++ +     +  D +  V A 
Sbjct: 155 GAWADHVRAGAKKILEDLPDECHLPVEEILVIAAQINENSYSM-----DALDEKHLVAAV 209

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++      NH C PN    +  D  AE      VR + D+ +G EI LSY  ++ + + 
Sbjct: 210 GLFPICGLINHSCQPNCTWSNAGDGIAE------VRALRDIKEGDEITLSYIDIDKERAE 263

Query: 264 RQKRLLYDYDFACDCDRC 281
           R+K L     F C C RC
Sbjct: 264 RRKELRETKHFDCQCGRC 281


>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 606

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G+Y   S  NH CLPN C F      A   +D+ VR I  V  G ++ ++Y  +    
Sbjct: 321 ALGLYPALSMLNHSCLPN-CVF------ASCGSDMHVRAIRPVAAGEQLTVTYINIMEPR 373

Query: 262 STRQKRLLYDYDFACDCDRC 281
             R + L+    FAC C+RC
Sbjct: 374 RIRARELMDTKHFACACERC 393


>gi|389636919|ref|XP_003716103.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
 gi|351641922|gb|EHA49784.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
          Length = 449

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 197 RRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
           RR V  +  +++ S  NH+C PN A  FD +  +        +  I D+  G E+ +SY 
Sbjct: 241 RRGVDFHSTFVEVSRMNHECSPNCAAYFDPMTMSQR------IYAIRDIMPGEELTVSYI 294

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                   RQ RL  D+ F C C RC  EA++ 
Sbjct: 295 DPVQTREARQNRLRKDWGFGCSCQRCTSEAHFG 327


>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
 gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
          Length = 460

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 63/277 (22%), Positives = 107/277 (38%), Gaps = 30/277 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP-SCSVAF 75
           GRG+ +T+ + AG+++ R+  +++       +  + C  C R  + +   +CP  C++  
Sbjct: 53  GRGVFATRDIAAGELIFRERALVVGPTARKGSQLNTCVCCHR-LLGAKQFLCPHRCTL-- 109

Query: 76  CSPKCSTAAASSSH-------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
             P C   A S +H         +  + L+      SP    + +   L  R L A   +
Sbjct: 110 --PVCGECADSEAHHNECAHFQRWQPKDLSEECKELSPTGEPVDVVNPLSLRILTAV-RV 166

Query: 129 AVVNPSQFQILLAFQGTVT----DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
             ++  Q  ++ A Q             AA         P +    +         LN  
Sbjct: 167 FYLSKEQRALVDAMQANAERGYRQEIIKAAQCFRKF---PTTDKPFMDQLFRVVGVLNTN 223

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
               P     DG  S+   G++   +  NH+C PNA  +       EN    +VR   D+
Sbjct: 224 AFEAPC--RVDGHESL-LRGLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDI 274

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           P+G EI  +Y  + +   TR   L     F C+CDRC
Sbjct: 275 PKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRC 311


>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 654

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 37/77 (48%), Gaps = 5/77 (6%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           IY  A  FNH C PNA R       ++    I++R   D+ +G EI + YF      + R
Sbjct: 443 IYEIAGRFNHSCTPNAAR-----GFSDVGELIVLRAFVDIQEGEEITIEYFSDGGTTTER 497

Query: 265 QKRLLYDYDFACDCDRC 281
           ++ L   Y F CDC  C
Sbjct: 498 RQHLYKQYRFNCDCRAC 514


>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 459

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 9/109 (8%)

Query: 199 SVRAYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           S   +GI+ L +  FNH CLPNA       +       ++V+ +  + QG EI + YF  
Sbjct: 208 SAVGHGIFPLASRLFNHSCLPNAI---VTYSFTSEGIQMVVKALTPIKQGEEITIPYFDP 264

Query: 258 NYDYSTRQKRLLYDYDFACDCD-----RCKVEANWSDNDNDDENNEEVM 301
              Y  RQ    Y Y F C C      RC V+A    ++  +    E M
Sbjct: 265 ALPYHQRQAICRYSYGFECTCSVCMFPRCTVDAREPPSEPGERERIENM 313


>gi|328769434|gb|EGF79478.1| hypothetical protein BATDEDRAFT_35395 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 545

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 72/199 (36%), Gaps = 38/199 (19%)

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF------- 223
           L   L+ K+  N+FG+      G    R     G+Y  A +FNH C P+           
Sbjct: 315 LMMLLICKEECNSFGVYTFNYTGSKTERQSYGLGVYPSAVYFNHSCKPSVGHVARSLPDT 374

Query: 224 DYVDAAAENN-------------TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
           D   + +  N             + I+     ++ Q  E  ++Y P+      RQK +  
Sbjct: 375 DVTRSGSATNLAKLTDTGMSMFGSTILFFATRNLEQNEEAMIAYLPLEGTLEHRQKAIKD 434

Query: 271 DYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFP-HAYF 329
            + F C+C+RC                EE +++++    V +D  A    D   P H   
Sbjct: 435 IFYFECNCERCL---------------EETLEKNKSPTAVQTDITASAMKDQKSPLHQTI 479

Query: 330 --FMRYMCDRDNCWGTLAP 346
             F   +C    C G L P
Sbjct: 480 ETFGLCLCREIGCHGWLVP 498


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 107/278 (38%), Gaps = 54/278 (19%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           +TVA I G+GRGL++T +   G ++L   P   Y++ P  I   S C +CF      +  
Sbjct: 17  LTVASIPGKGRGLIATCTFFPGDVILNQEP---YASTPNKILVGSSCDHCF---TSGNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
            C  C V  +CS  C                  +L  L     A L  DR+         
Sbjct: 71  KCSMCRVTWYCSSNCQKEEW-------------KLHQLECRAMAALTEDRK--------- 108

Query: 126 YNLAVVNPS-QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
               ++ P+ +  + LA +  + +   +    L SL  P     ++  A   +K   NA 
Sbjct: 109 ---KMLTPTIRLMVRLALKRKLQNEKMAN---LVSLILPFIELDLKEIAQTFSKFACNAH 162

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDIIVRMIHD 243
            + +P +            G+Y   S  NH C+PNA   FD   A         VR +  
Sbjct: 163 TICDPELRPL-------GTGLYPVISIINHSCVPNAVLIFDGRTA--------YVRALQP 207

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           + +  E+ +SY       + ++   L  Y F C C RC
Sbjct: 208 INKDEEVSISYIETA-TVTKKRNNDLKQYFFTCTCPRC 244


>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
           griseus]
          Length = 428

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/328 (22%), Positives = 120/328 (36%), Gaps = 66/328 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           RG GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  RGNGLRAVVPLRPGELLFRCDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFL 122
           +A +CS KC   A         C+ L        PD+  L L R +         ++  L
Sbjct: 67  IAKYCSAKCQKKAWPDHKQE--CKCLKSCKPRYPPDSVRL-LARVIVKLMDEKPSESEKL 123

Query: 123 IAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
            + Y+L                Q+ L FQ    +    A+        PP+    E  A 
Sbjct: 124 YSFYDLESNINKLTEDKKEGLRQLALTFQHFTREEIQDASQL------PPSFDLFEAFAK 177

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           ++     N+F +    ++           G+Y   S  NH C PN C   +      N  
Sbjct: 178 VIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPN-CSIVF------NGP 219

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV----------- 283
            +++R + ++  G E+ + Y  +      R+K+L   Y F CDC RC+            
Sbjct: 220 HLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTHDKDADMLTGD 279

Query: 284 EANWSDNDNDDENNEEVMDEDQDEQMVA 311
           E  W +     +  EE+    + EQ++A
Sbjct: 280 EQIWKEVQESLKKIEELKAHWKWEQVLA 307


>gi|195566137|ref|XP_002106647.1| GD16997 [Drosophila simulans]
 gi|194204029|gb|EDX17605.1| GD16997 [Drosophila simulans]
          Length = 532

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 109/282 (38%), Gaps = 37/282 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+ + L+AG+ ++R+ P+ +    P ++    C  C++  ++      CP C+   
Sbjct: 58  GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYQPVSLKPDQYRCPGCAWPL 114

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR--------FLIAAYN 127
           C   C+       H+   CQ    L       A  L  +R   A          ++    
Sbjct: 115 CGSTCAGLKHRHGHTETECQ----LYGERRAVAGKLLTERAGPAEVRDLYELVMIVRILL 170

Query: 128 LAVVNPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAF 184
           L   +P QF ++   +    +   +A    HY   +       T +L   L A+      
Sbjct: 171 LRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVC 228

Query: 185 GLMEP--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           G+++   +  GQ+G    +A  +Y  A    HDC PN    D        + +I++R   
Sbjct: 229 GILDVNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPRSFEILLRTSR 280

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
            V +   + LSY    Y      KR  + ++   F C C RC
Sbjct: 281 RVREREALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319


>gi|449298403|gb|EMC94418.1| hypothetical protein BAUCODRAFT_149577 [Baudoinia compniacensis
           UAMH 10762]
          Length = 354

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 14/125 (11%)

Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
           +S T E+   L +    NAFG+     +G++       Y IY  AS+FNH C PN  +  
Sbjct: 229 SSCTTEVCHTLASIGSHNAFGIRAGSEDGEE----YMGYAIYPDASYFNHSCSPNLLKRR 284

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRC 281
                A   T        ++ +G + C++Y        D + R+ RL   + F C C+RC
Sbjct: 285 MGRCWAFWTT-------REIKKGEQCCITYLGGDEKELDVADRRARLKRVWAFDCMCERC 337

Query: 282 KVEAN 286
           K+E  
Sbjct: 338 KLEVR 342


>gi|321473703|gb|EFX84670.1| hypothetical protein DAPPUDRAFT_314752 [Daphnia pulex]
          Length = 497

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/257 (23%), Positives = 104/257 (40%), Gaps = 30/257 (11%)

Query: 9   TVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC 68
           T+ +    G+ LV+T++++A +I++     L  +  P   +   C NC+R+     S +C
Sbjct: 55  TIGKSSDIGKFLVATRNIRADEIIIDAEEAL--AVGPKQATHPVCLNCYRRV--DGSYLC 110

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA---RFLIAA 125
           P C    CS  C+    + +H  Y CQ L         +    P   +L+    R     
Sbjct: 111 PLCQWPLCSEVCAQ-MENGAHQKYECQLLA--------NREAYPTKEELEQEDCRLYDCI 161

Query: 126 YNLAVV-----NPSQFQILLAFQ-GTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD 179
             L +V     NP + ++ L  +   +        H        P     +L      ++
Sbjct: 162 APLRIVLDNHRNPERSRLWLGLEDHRLYRQKIGIWHVDRKTVVEPVLNKWQLGENTNEEE 221

Query: 180 RLNAFGLMEP-YIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDII 237
              A G++E    E  D  +++ A  +Y  A    H+C+PN AC    V+ A +  T   
Sbjct: 222 LQRACGILEVNAFEVCDDDKTISARAVYSHACLMAHNCVPNTAC---CVNPATQRMT--- 275

Query: 238 VRMIHDVPQGREICLSY 254
           VR   D+P G+ I  SY
Sbjct: 276 VRAAVDIPAGQMITTSY 292


>gi|307111681|gb|EFN59915.1| hypothetical protein CHLNCDRAFT_132958 [Chlorella variabilis]
          Length = 853

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/313 (23%), Positives = 115/313 (36%), Gaps = 58/313 (18%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           ++A    + V + E  GRGL + + L AG+ VL + P     AL  +   S C  C    
Sbjct: 324 VAAAGPALVVQQTEDAGRGLSAAEELPAGRDVLAEQPFAF--ALTKLGRRSVCSTCLAPL 381

Query: 61  MHSSSSI-CPSCSV-AFCSPKCSTAAASSSH---SPYVCQALTRLLNLNSP--------- 106
             + +   C  C + A+C+P C   AA   H    P   +  T LL   +          
Sbjct: 382 ADAPAPYYCRRCPMPAYCTPACR--AADPFHQPGGPECGRPWTVLLPAEAVAALRLARRL 439

Query: 107 --DAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLA--------FQGTVTDNDTSAAHY 156
               A  P  R + +       + A ++P++   L A        ++  V +     A  
Sbjct: 440 RFGGAGSPAARHVAS----LGTHFAELDPTEVVQLAALAAVTHATWRQAVAEARGGGAPA 495

Query: 157 LSSLCPPPA------------STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG 204
            +    P A              T E     L + ++N   ++ P   G D R    A  
Sbjct: 496 EAGPGSPAAVEGQAADEEARGGVTAEGVLEALCRLQINGLAVVPPQRRGSDDRL---ALA 552

Query: 205 IYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           +Y   S  NH CLPN + RF+           ++VR +  VP G  +   Y P   + + 
Sbjct: 553 LYPVGSLMNHSCLPNVSTRFE--------GGKLVVRTVEAVPAGEPLLHCYGPQAGEMTA 604

Query: 264 RQKRLLY--DYDF 274
            Q+ LL    Y F
Sbjct: 605 AQRGLLLLQQYHF 617


>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 556

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 83/196 (42%), Gaps = 29/196 (14%)

Query: 87  SSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTV 146
           SSH+P V +A+  L++ N   +     +  L A F +A   LA    S+       +G+ 
Sbjct: 117 SSHAPSVREAVDELVHHNDDRSPEQKEEYMLMANF-VARLCLAGCGDSK-------KGSA 168

Query: 147 TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY 206
               ++    L  L        ++   A+L K   N F + E  + G+ G       G+Y
Sbjct: 169 LLWPSAQGRGLPGL--------VDAAYAVLGKLSCNVFSIAESALNGEVG------CGLY 214

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
           L+A+  NH C PNA       A + +   + +R    + +G EI +    +      R++
Sbjct: 215 LEAAAANHSCNPNA-------AQSFSGKTLSLRCTRPIRKGEEITIGITQIQKPGPARRE 267

Query: 267 RLLYDYDFACDCDRCK 282
            L   Y F C C+RC+
Sbjct: 268 SLRKTYFFECRCERCE 283


>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 764

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 16/115 (13%)

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR--AYGIYLKASFFNHDCLPNACRFDYVD 227
           E  A L A  + NA  L +P         S+R    G+Y  A+  NH CLPNA      +
Sbjct: 334 ETVARLFAVAQANALTLTDPS--------SLRPIGQGLYASAALLNHSCLPNA------N 379

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            + +    + VR +  +  G E+ ++Y      Y  RQ+ L   + FAC C +C+
Sbjct: 380 WSVDGEGRLCVRAVRPIEAGEEVTVAYVDPTLPYHARQQALQDHFFFACRCLQCR 434


>gi|24641162|ref|NP_572675.1| CG11160, isoform A [Drosophila melanogaster]
 gi|22832070|gb|AAF47987.2| CG11160, isoform A [Drosophila melanogaster]
          Length = 532

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/278 (22%), Positives = 112/278 (40%), Gaps = 29/278 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+ + L+AG+ ++R+ P+ +    P ++    C  C+   ++ +    CP C+   
Sbjct: 58  GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114

Query: 76  CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           C   C+       H+   CQ  A  R +   L +  A    +    +   ++    L   
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174

Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
           +P QF ++   +    +   +A    HY   +       T +L   L A+      G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232

Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
              +  GQ+G    +A  +Y  A    HDC PN    D       ++ +I++R    V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
              + LSY    Y      KR  + ++   F C C RC
Sbjct: 285 REALTLSY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319


>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 676

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 83/210 (39%), Gaps = 64/210 (30%)

Query: 89  HSPYVCQALTRLLN--------LNSPDAAN-LPLDRQLQARFLIAAYNLAVVNPSQFQIL 139
           H  Y+   L RLL         + +PD+A   P D +L    LI  +NL ++   QF   
Sbjct: 402 HRTYIAIWLLRLLKRSPYFPEWVKTPDSAEATPSDGELYIGSLIL-HNLMLI---QFN-- 455

Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
                         AH +S L  P  S        +LAK +         +I G      
Sbjct: 456 --------------AHEISELAVPKGSN-------ILAKAK-------SKFIGG------ 481

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-- 257
               G+Y   S FNH C P   R+          T ++VR I  +P G EI  +Y P+  
Sbjct: 482 ----GVYSTVSLFNHSCNPGIIRYFI-------GTTMVVRAIRTIPAGEEISENYGPIFT 530

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
               + R+++L   Y F C+C+ C   A+W
Sbjct: 531 TTPEAERKRKLRVQYWFDCNCEACT--AHW 558


>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
          Length = 352

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 5   GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEAGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
             R+K+L   Y F CDC RC+ +    D D D    +E+V  E Q+
Sbjct: 58  EERRKQLRDQYCFECDCSRCQTQ----DKDADMLTGDEQVWKEVQE 99


>gi|409048092|gb|EKM57570.1| hypothetical protein PHACADRAFT_192717 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 309

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 5/83 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+L  + FNH C+PN    ++ DAA      ++ R + D+  G E+CL Y  +      
Sbjct: 105 GIFLAGARFNHSCVPNVN--NHWDAA---RGQLVFRALRDIEAGEELCLGYGRLLAKRGE 159

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R+  L   + F C C+ C +E  
Sbjct: 160 RRAELSAKFGFDCACEACSLEGK 182


>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 419

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +Y + S+FNH C PN C     +  A     +     HD+P G E+C+ Y  V+   ++R
Sbjct: 305 LYAEGSYFNHSCAPN-CGTRTGEGQA-----VQFVATHDIPAGDEVCIRYIDVDKPTTSR 358

Query: 265 QKRLLYDYDFACDCDRC 281
           +  LL  Y F C C  C
Sbjct: 359 RSELLSHYHFTCMCPLC 375


>gi|365991467|ref|XP_003672562.1| hypothetical protein NDAI_0K01280 [Naumovozyma dairenensis CBS 421]
 gi|343771338|emb|CCD27319.1| hypothetical protein NDAI_0K01280 [Naumovozyma dairenensis CBS 421]
          Length = 383

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 90/200 (45%), Gaps = 32/200 (16%)

Query: 90  SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN 149
           + +VC++L RL  ++S D+        LQ+  L   +        +F +LL FQ  V   
Sbjct: 210 AKFVCESLFRLKYIDS-DSITKKCFNGLQSNELSKIH--------EFPVLLHFQELVFKT 260

Query: 150 DTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIY 206
                  L  L P    T  TIE    ++  +  NAFG+ E    G+    R    + + 
Sbjct: 261 -------LYILLPDYLRTQLTIENFRHIVGSEYGNAFGIWET---GETPESREWFGFEVL 310

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
            +AS+FNH C PN      +    E  T +   +  +V +G EIC+ Y  V  D  T+++
Sbjct: 311 PEASYFNHSCKPN------ITKTREGRT-MEFTLNSNVAKGTEICIDYSGV-LDLPTKKR 362

Query: 267 R-LLYD-YDFACDCDRCKVE 284
           R  L+D + F C C RCK E
Sbjct: 363 RKFLHDTWFFDCLCVRCKSE 382


>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
 gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
           0517]
          Length = 498

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 111/293 (37%), Gaps = 60/293 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
           GRGL +T ++K G+ V    P    + L        C NCF         M S      +
Sbjct: 20  GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNVMASVDMKLWA 78

Query: 71  CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           CS    V +C  KC +A     H               P   +A+ R++ L S +  ++ 
Sbjct: 79  CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRSSNEDHVK 138

Query: 113 LDRQLQARFLIAAYNLAVV---NPSQFQ-ILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
            D  +   FL    +L  +   N  Q++ I+L  +    + + S +              
Sbjct: 139 SDLNM---FLTLRSHLDEITSSNQEQYERIMLCAKA---EKEYSGSEL-----------D 181

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
           +E  A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA        
Sbjct: 182 VETIAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------V 227

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              +   IIV+ + ++    ++ +SY    Y    RQK+L   Y F C C +C
Sbjct: 228 VGFDGGLIIVKALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKC 280


>gi|395328144|gb|EJF60538.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 384

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 12/106 (11%)

Query: 193 GQDGRRSVRA---YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
           G+ GR S       GI+L+A+  NH C PN  R   V A      ++  R + DV  G E
Sbjct: 155 GKGGRPSTGGGSREGIFLQAARLNHSCRPNVARTWDVGA-----QEMAFRALRDVAPGEE 209

Query: 250 ICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVEANWSDNDN 292
           +C+SY  V+    TR++R   +   + F C C+ C ++    +  +
Sbjct: 210 LCMSYVDVDI-LGTREERGAEIEGAFGFVCACEACMLDGREGEESD 254


>gi|392579385|gb|EIW72512.1| hypothetical protein TREMEDRAFT_58684 [Tremella mesenterica DSM
           1558]
          Length = 885

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 50/111 (45%), Gaps = 16/111 (14%)

Query: 178 KDRLNAFGLMEPYIEG-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
           K  +N+FGL     +G       +DG    +   I+   S  NH CLPN     + D A 
Sbjct: 522 KIAINSFGLSPITAKGFHSAPMRKDGEVD-KDTAIWPTVSMINHSCLPNTVHIAWGDVA- 579

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                 +VR ++D+  G E+ +SYF     Y TR  +    Y F CDC+ C
Sbjct: 580 ------VVRALNDMSAGTELTISYFSREQPYETRAAK-AKQYLFTCDCNLC 623


>gi|328868687|gb|EGG17065.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 333

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/295 (24%), Positives = 114/295 (38%), Gaps = 50/295 (16%)

Query: 7   TVTVAEIEGRGRGLVSTQSLKAGQIVL---RDSPILLYSALPFINSSSFCHNCFRK-TMH 62
           ++++ + E  GR ++S ++++ G  V+   R S   +Y+ L        CH C +  T+ 
Sbjct: 18  SISIQQGESTGRFILSRETIQRGSKVISLPRTSCHGIYNHLV----RQVCHTCLKHSTLG 73

Query: 63  SSSSICPSCS-VAFCSPKCSTAAASSS-HSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
           +    C +C  V +CS +C  AA     H+   C  L  L          +   R +   
Sbjct: 74  NLKLSCQTCKQVYYCSRECQDAAERGLFHNHLECLVLRWLRVPAGFTMHQVAEIRMVVNH 133

Query: 121 FLIAAYNL-------AVVNPSQF-----QILLAFQGTVTDNDTSAAHYLSSLCPPPASTT 168
           F   AY          V   ++F     ++++     V         +   LC    S  
Sbjct: 134 FSKVAYGTDDKAIKQTVGEFNRFPKEDSKLIIKMSAVVKRA------FAGYLCDLDDSHI 187

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGI--YLKASFFNHDCLPNACRFDYV 226
            E+    LA+   N+FGL +          S   YG+  Y  ASF NH C PN  R    
Sbjct: 188 CEV----LAQSNRNSFGLWKS---------SDEQYGLAMYATASFLNHSCFPNCAR---- 230

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
               + N  I +  I D+ +  EI + Y     + + R+  L   Y F C C RC
Sbjct: 231 ---VQRNAGIDIVAIRDIEENDEITICYINARDNDTARRMILKGCYYFDCQCIRC 282


>gi|212276262|ref|NP_001130593.1| uncharacterized protein LOC100191692 [Zea mays]
 gi|194689580|gb|ACF78874.1| unknown [Zea mays]
 gi|414865032|tpg|DAA43589.1| TPA: hypothetical protein ZEAMMB73_306282 [Zea mays]
          Length = 544

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 39/85 (45%), Gaps = 8/85 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  SF NH C PNA R    D A       IV    D+  G EI   YF V    
Sbjct: 321 GVGLWVLPSFINHSCHPNARRTHIGDHA-------IVHASRDIKAGEEITFPYFDVLVPV 373

Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
           S R++     + F C CDRC+ EA 
Sbjct: 374 SKRRE-ASRAWGFECKCDRCRFEAE 397


>gi|336372877|gb|EGO01216.1| hypothetical protein SERLA73DRAFT_179334 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336385718|gb|EGO26865.1| hypothetical protein SERLADRAFT_464417 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 167

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 63/143 (44%), Gaps = 36/143 (25%)

Query: 168 TIELTAALLAKDRLNAFGLME------PYIEGQDGR-------------RSVRAYGIYLK 208
           T E+  A+LA+D  NAFG+ E       Y++G + R             + +  +GI+ +
Sbjct: 22  TSEMVRAVLARDPGNAFGIWELGQSVRSYLDGVEARDERKEYEGSTMDSKEMLGWGIWPE 81

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST----- 263
           AS+FNH C PN      V      N   I   +  V +G E+C++Y   +   +T     
Sbjct: 82  ASYFNHSCNPNV-----VKKRIARNLVFITTSV--VQEGDELCIAYVDCDEIQTTEIDEF 134

Query: 264 -----RQKRLLYDYDFACDCDRC 281
                R++ L   + F C C RC
Sbjct: 135 AAGEKRRELLRKWWFFGCACSRC 157


>gi|367053083|ref|XP_003656920.1| hypothetical protein THITE_2057091 [Thielavia terrestris NRRL 8126]
 gi|347004185|gb|AEO70584.1| hypothetical protein THITE_2057091 [Thielavia terrestris NRRL 8126]
          Length = 429

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 56/109 (51%), Gaps = 11/109 (10%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           G++   S FNHDC PN   R + +      +T I VR   D+P+G E+ +SY       S
Sbjct: 235 GMFPDVSKFNHDCRPNVHYRINNL-----THTAIAVR---DIPRGEELTISYIYPLAPLS 286

Query: 263 TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVA 311
           TRQ + L D+DF C C +C + A  +   +D    +    ED+ E ++A
Sbjct: 287 TRQTQ-LRDWDFTCTCAQCTLPAT-ASAQSDARIRQIAALEDEIEAIMA 333



 Score = 42.4 bits (98), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 14/38 (36%), Positives = 28/38 (73%)

Query: 11  AEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN 48
           AE+ G+GRGL++ ++++ G+I+++  P+LL    P +N
Sbjct: 145 AEVPGKGRGLIANRTIRRGEIIMQRLPVLLIQMTPHVN 182


>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
 gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
          Length = 633

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 9/105 (8%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFG     I G+       + G++L A++ NH CLPNA +    D        ++VR  
Sbjct: 465 NAFG-PGAVIAGEAENIRKASTGLWLMAAYINHSCLPNAEKSFLGDL-------MVVRAT 516

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            ++  G EI  SY   + DY  RQ+ L+  + F C C+ CK +  
Sbjct: 517 RNITAGSEITHSY-DSSSDYDARQEALMKTWGFRCRCELCKADGG 560


>gi|225717680|gb|ACO14686.1| msta, isoform A [Caligus clemensi]
          Length = 521

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 61/284 (21%), Positives = 113/284 (39%), Gaps = 40/284 (14%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+++ +  G+I+ ++ P ++    P + SS  C  C+++    SS  CP C    C
Sbjct: 65  GRFLVASRKILRGEIIFQELPGIV-GPNPVV-SSPICVACYKQVGPRSS--CPKCGFPLC 120

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV----- 131
              C           ++C+ L R          +L  +    +  L A   L ++     
Sbjct: 121 EESCERDHG------FICEHLHRF-------GIDLTPEEMCDSSILHAILPLKLLLSKES 167

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL-NAFGLMEPY 190
           NP  +  L        D   +   YL  +   P   ++E + +  + D L  A G++E  
Sbjct: 168 NPHVYNCLSILMDHREDR-LNDPEYLEGI-QTPILKSLERSLSTYSTDELLKALGIVEVN 225

Query: 191 ------IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                 + G  G R     G++   S  +HDC+P   R   +    E    + +    ++
Sbjct: 226 SYEVYNLTGHSGFR-----GLFCLTSLLSHDCVPK--RRPII--GHETPYGLTMISTRNI 276

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
            +G  + ++Y         R++ L  ++ F C C RC+ EA + 
Sbjct: 277 NEGEILSINYVHTQKPNRIRRRTLKDNWYFECSCKRCEDEAEFG 320


>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
 gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
          Length = 418

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 58/136 (42%), Gaps = 13/136 (9%)

Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
            A   L SL        +E   +  AK   NA  + +   E  DG  +    G+Y + ++
Sbjct: 108 QALMLLKSLEEEVIPEKMEFLISAQAKFATNALTIQDSLFET-DGIGA----GLYEEVNY 162

Query: 212 FNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
            NH C PN  C F+ +         + V  I D+ QG EI  SY     D   R++ L  
Sbjct: 163 MNHSCTPNVICVFNKL-------PQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRRRFLKQ 215

Query: 271 DYDFACDCDRCKVEAN 286
           +Y F C+C RC  E N
Sbjct: 216 NYFFLCECKRCIKEQN 231


>gi|328785466|ref|XP_001121590.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 526

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 66/278 (23%), Positives = 107/278 (38%), Gaps = 44/278 (15%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFRKTMHSSSSIC 68
           VA  E  GR L +++ L+AG+++LR+ P+ +       +   F C +         + +C
Sbjct: 19  VAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCVKDPICFECLSILPNIEEDVNYVC 78

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAAN--LPLDRQLQARFLIAA 125
             C+V   C   C            +  +   LL    P+     L L+  L  R     
Sbjct: 79  SGCNVVTLCGVTCEERG--------IYHSALWLLKQKDPELWKRVLSLESHLNKR----- 125

Query: 126 YNLAVVNPSQFQILLAFQG-TVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF 184
            N  V    +  I+   +     +ND S +  +  LC             +L    +N+F
Sbjct: 126 RNTIVWEDREINIVNVLKSLNFLENDPSVSEMIQQLC------------GILD---VNSF 170

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            L  P   G DG   +   G+YL+AS   HDC  N      V   A++N  + V     +
Sbjct: 171 ELRSP--GGMDG---LLLRGLYLEASMMAHDCRGN------VHVTADDNFHLTVYASIPI 219

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +G  I  +Y       + R++ L     F C+CD CK
Sbjct: 220 KEGDTIFFNYTSSLLGTTGRREYLRTGKYFECECDLCK 257


>gi|150864533|ref|XP_001383385.2| hypothetical protein PICST_19605 [Scheffersomyces stipitis CBS
           6054]
 gi|149385789|gb|ABN65356.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
          Length = 375

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 10/129 (7%)

Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
           L++L P     T +L  + + ++  NAFG+       ++ R     +G+Y  AS+FNH C
Sbjct: 254 LTALPPLQPFITSDLVRSSIGRNISNAFGIWSDSSSSEEEREYF-GFGVYPSASYFNHSC 312

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFA 275
            PN      V    EN   +    + D+  G E+C+ Y   +N     RQK  L ++ F 
Sbjct: 313 GPNV-----VKKRIENK--LTFTTLRDISAGEELCIDYGNYINEPVEVRQKE-LSEWFFN 364

Query: 276 CDCDRCKVE 284
           C CD+C  E
Sbjct: 365 CGCDKCVAE 373


>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
 gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
          Length = 660

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 121/324 (37%), Gaps = 49/324 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   ++  +K G+ +L + P +  S L    + + C NCF +T+  
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
            +  CP C+ V +CS +C   A+   H  Y C  +  +     ++N+  A    A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351

Query: 115 ----------RQLQARFLIA------------AYNLAVVNPSQFQILLAFQGTVTDNDTS 152
                      QL    LI+              +     PS F   +     +T    +
Sbjct: 352 YFLKLKPTIDEQLTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTHCLRA 411

Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
             ++ S   P   S    L    L   + N   + E +     GR      G  IY   +
Sbjct: 412 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
            FNH C P   R+          T I +  +  +  G  I  +Y P+    + S RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 524

Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
              Y F C CD C    NW   D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546


>gi|391347157|ref|XP_003747831.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Metaseiulus occidentalis]
          Length = 684

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/189 (24%), Positives = 83/189 (43%), Gaps = 19/189 (10%)

Query: 154 AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFN 213
           A  + S+  P   T +E    +L + R+N   +    ++  D     R   +YL AS  N
Sbjct: 380 AETILSIVSPFVFTQLEEVMDILQRCRINCHSI----VDHNDFHFFSRGRAVYLAASKTN 435

Query: 214 HDCLPNACRFDYVDAAAENNTDIIVRMIHD--VPQGREICLSYFPVNYDYSTRQKRLLYD 271
           H C+ +    +YV     N   I++R I D  +    ++ + Y P    Y +R +R L +
Sbjct: 436 HSCVTSN---EYVQIF--NGRKILLRAIQDFRIDDPLQMTIHYMPPTLPYESRLRRCLNN 490

Query: 272 YDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQD-EQMVASDDDAEVHGD-----TNFP 325
           Y F C C +C  +A   + + D    E V + +Q  E+  +SD+  E+         N P
Sbjct: 491 YYFICSCRKCVWQAR--NPEPDQAAGETVAEIEQAFEEYHSSDEWYEIGTKLLEKVKNLP 548

Query: 326 HAYFFMRYM 334
              F++ ++
Sbjct: 549 EKNFYVYWL 557


>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
           127.97]
          Length = 499

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 106/290 (36%), Gaps = 54/290 (18%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
           GRGL +T ++  G+ V    P    + L     S  C NCF         M S      +
Sbjct: 21  GRGLFATSTINMGKEVF-SMPATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLRA 79

Query: 71  CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           CS    V +C  KC +A     H               P   +A+ R++ L + +  ++ 
Sbjct: 80  CSGCKVVKYCDEKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEEHVK 139

Query: 113 LDRQLQARFLIAAYNL-AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
            D  +   FL    +L  + N +Q Q          + + S +              IE 
Sbjct: 140 SDLTM---FLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGS-----------DLDIET 185

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
            A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA           
Sbjct: 186 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 231

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +   I V+ +  +    ++ +SY    Y +  RQK+L   Y F C C +C
Sbjct: 232 DEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|402077091|gb|EJT72440.1| hypothetical protein GGTG_09306 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 455

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+L+AS  NH CL NA      +   E    + +  I D+ QG EI + Y     + + 
Sbjct: 105 GIFLEASRINHACLQNAQ-----NTWNEGLQQLTIHAIRDINQGEEITIMYIEDRANRAA 159

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ+ L  D+ F C C  C
Sbjct: 160 RQRTLQRDFRFTCSCQLC 177


>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
          Length = 499

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 106/290 (36%), Gaps = 54/290 (18%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
           GRGL +T ++  G+ V    P    + L     S  C NCF         M S      +
Sbjct: 21  GRGLFATSTINMGKEVF-SMPATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLRA 79

Query: 71  CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           CS    V +C  KC +A     H               P   +A+ R++ L + +  ++ 
Sbjct: 80  CSGCKVVKYCDEKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEEHVK 139

Query: 113 LDRQLQARFLIAAYNL-AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
            D  +   FL    +L  + N +Q Q          + + S +              IE 
Sbjct: 140 SDLTM---FLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGS-----------DLDIET 185

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
            A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA           
Sbjct: 186 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 231

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +   I V+ +  +    ++ +SY    Y +  RQK+L   Y F C C +C
Sbjct: 232 DEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 281


>gi|310795375|gb|EFQ30836.1| hypothetical protein GLRG_05980 [Glomerella graminicola M1.001]
          Length = 411

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 14/119 (11%)

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYV 226
           TT +    L ++D  NAFG+     EG +    +  +G +  AS+FNH C PN      V
Sbjct: 295 TTPQSLLTLSSRDSHNAFGIRSLEDEGSE----LFGHGCWPAASYFNHSCGPN------V 344

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
           D   E       R   DV +G ++C++Y         R+ R   L  ++ F C C RC+
Sbjct: 345 DKRREGRA-WEFRASRDVGKGEQLCITYLGGEEKAMPRETRMSTLRRNWGFECGCKRCE 402


>gi|156544373|ref|XP_001607398.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 507

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 62/279 (22%), Positives = 104/279 (37%), Gaps = 31/279 (11%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+++ L+ G+ +L + P ++            C++ +  T   S  +C  CS   C
Sbjct: 21  GRFLVASKDLEPGEQILTELPFVVGPKAATYPVCLSCYSVWPATEDDSKPLCSRCSWPVC 80

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAA-------NLP-LDRQLQARFLIAAYNL 128
            P+C     +  H  Y C          S D A        +P L+     R L+AA   
Sbjct: 81  GPECEN---NPQHKDYECPIFEAAKEKFSIDVALSEEHQNGVPQLECITPLRLLLAAEKD 137

Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN------ 182
                S+ + + A        +     +++          +E     L  DR +      
Sbjct: 138 PERWKSEIKDMEAHNKKRAQKNQWHIDHVN---------IVEYIRKRLKLDRFSEEDIQT 188

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           A G+++         +   A G+Y K +  NH+C+ N        +   N+  I +R   
Sbjct: 189 ACGILDVNSHEIRTAKGFLARGLYPKVAIMNHNCVSNTAH-----SIDPNDYRIFLRSAV 243

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            VP   E+  SY         R++ LL    FAC C+RC
Sbjct: 244 KVPAAGELFGSYTHALLPTLLRREHLLESKHFACACNRC 282


>gi|440801197|gb|ELR22218.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 343

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 9/84 (10%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY---FPVNYD 260
           G+Y+ +SF+NH C+PN  R  Y     ENNT   V     +  G+EI +SY      N  
Sbjct: 255 GLYILSSFYNHSCVPNT-RSSY----PENNT-WHVYASKPIEAGQEIFISYVDHMKSNKT 308

Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
             TR++ L   Y F C+C RC++E
Sbjct: 309 VETRRQHLYNHYGFWCECPRCRLE 332


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA + G+GRGL + +S   G++V+   P   Y++ P  I+  S C NCF      +  
Sbjct: 17  LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A+        ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSACQREEWKLHQ--LECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
               I +          + ++ A +  +++ D       +    L  L  P     ++  
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
               +K   NA  + +P +            G+Y   S  NH C+PNA          E 
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            T   VR +  + +  E+ +SY         RQ  L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277


>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 386

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 35/246 (14%)

Query: 53  CHNCFRKTMHSSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           CH+CFR+   +    C  C  A +C   C TA          CQA+ RL          +
Sbjct: 2   CHSCFRQQ--AQLHRCAQCRFAHYCDRTCQTACWEEHRQE--CQAIRRL--------GEV 49

Query: 112 PLDR-QLQARFLIAAY-NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--------LC 161
           P DR +L AR +   + +  V + SQ   +   Q  V D        L +          
Sbjct: 50  PADRIRLAARVMWRMHKDTGVASDSQLLPVEELQDHVADLPQEQLQRLHADVRTFQEFWS 109

Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-A 220
              +  + E  A +L   + N F L +     Q G ++V   G++      NHDC PN +
Sbjct: 110 CGRSRPSREDIAHVLGIIKTNGFTLSD-----QRGLQAV-GVGLFPNLCLVNHDCWPNCS 163

Query: 221 CRFDYVDAAAEN-----NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
              ++ + +A N        + +R +  V QG E+ +SY       + RQ++L   + F 
Sbjct: 164 VVLNHGNQSATNPALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFH 223

Query: 276 CDCDRC 281
           C C RC
Sbjct: 224 CTCQRC 229


>gi|302845794|ref|XP_002954435.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
 gi|300260365|gb|EFJ44585.1| hypothetical protein VOLCADRAFT_118721 [Volvox carteri f.
           nagariensis]
          Length = 405

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 40/87 (45%), Gaps = 17/87 (19%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF------PV 257
           G++   SF NH CLPNA  F            ++V     + QG E+ ++Y       PV
Sbjct: 122 GLFPSFSFLNHSCLPNAVNFVV-------GGIMVVVAARKIRQGSEVLINYLGRASLRPV 174

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
                 RQ +L   Y F+CDC RC+ E
Sbjct: 175 G----ERQGQLAEGYHFSCDCPRCRTE 197


>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 396

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 45/90 (50%), Gaps = 10/90 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNY 259
           G+Y   S  NH CLPNA   F Y      NN  + V     +P G+EIC+SY     ++ 
Sbjct: 298 GLYSLQSTCNHSCLPNAEITFPY------NNNVMAVVAKEKIPSGQEICISYLSECDLSR 351

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
              +RQ  L  +Y F CDC +C  EA+  D
Sbjct: 352 SRHSRQNILKENYLFTCDCPKCLSEADEPD 381


>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 528

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 39/79 (49%), Gaps = 7/79 (8%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           G+Y KA+  NH C PN   RFD     A      ++  I  +  G+E+ +SY  +    +
Sbjct: 228 GLYPKAAMLNHSCTPNCIVRFDLTRKQA------VIHTIETIHSGQELTISYVDIFDPTT 281

Query: 263 TRQKRLLYDYDFACDCDRC 281
            R+KRL   Y F C C RC
Sbjct: 282 IRKKRLRSSYFFDCTCQRC 300


>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
 gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
          Length = 462

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 60/272 (22%), Positives = 105/272 (38%), Gaps = 31/272 (11%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP-SCSVAF 75
           GRG+ +T+ + AG+++ ++  ++          SS    C  +T+  +  +C   C++  
Sbjct: 66  GRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCI--CCHETLPQTGFLCRHRCTL-- 121

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
             P C T A S  H    C+   R    ++++      PL  ++     + A  +  +  
Sbjct: 122 --PVCETCADSEEHQAE-CEHFRRWQPKDVDAEQEQVNPLSLRI-----LTAVRVFHLGK 173

Query: 134 SQFQILLAFQGTVTD----NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
            Q  ++ A Q             AA    +    P +  + +         LN      P
Sbjct: 174 EQRHLVDAMQANAERAYRREIIQAAQCFRNF---PTTDRVFMDQLFRIVGVLNTNAFEAP 230

Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE 249
               + G       G++   +  NH+C PNA  +       EN    +VR   D+P+G E
Sbjct: 231 C---RSGGHETLLRGLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGE 281

Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           I  +Y  + +   TR   L     FACDC RC
Sbjct: 282 ITTTYTKILWSNLTRNIFLKMTKHFACDCARC 313


>gi|397601249|gb|EJK57855.1| hypothetical protein THAOC_22059 [Thalassiosira oceanica]
          Length = 572

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 45/101 (44%), Gaps = 12/101 (11%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII--------VRMIH---DVPQGRE 249
           R  G+Y  A+  NH C PNA R       +E  T +I        V ++H    + +G E
Sbjct: 306 RILGVYPLAAMINHSCCPNAIRVFGTVPKSEIGTTVIDEECTGKDVMIVHASTKISRGEE 365

Query: 250 ICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
           I  SY P       R++ +L  Y F C+C RC  E   +D 
Sbjct: 366 IVWSYIPPCGPVQ-RRRHMLKKYGFVCECTRCAKELEVADK 405


>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Taeniopygia guttata]
          Length = 404

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 80/195 (41%), Gaps = 26/195 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPP 163
             L L R L    L   Y      P  F+ L A      QG  T + +   H   +L  P
Sbjct: 205 GQLELLRLLFTEALYDEYLSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP 264

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
                      +L ++ L+AF   +   IE + G   +    G+Y+  S  NH C+PNA 
Sbjct: 265 -----------MLQREELDAFIDQLYKDIEKESGEFLNCEGSGLYVLQSCCNHSCIPNA- 312

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDC 278
                 +  ENN  + +  + D+  G EIC+SY        +R  R   L  +Y F C C
Sbjct: 313 ----ETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSC 368

Query: 279 DRCKVEANWSDNDND 293
            +C  +A+ +D  +D
Sbjct: 369 PKCLAQADDADVTSD 383


>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 524

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 5/82 (6%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           V   G+Y+ A+  NH C PNA   +        N    +R I D+P G +I +SY   + 
Sbjct: 442 VHGSGLYVYANSCNHSCSPNAAITN-----ESTNFSATIRSITDIPNGNQIEISYIEEDQ 496

Query: 260 DYSTRQKRLLYDYDFACDCDRC 281
              TRQ  L+  Y F C C +C
Sbjct: 497 PSQTRQSELIDKYKFKCHCQKC 518


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA + G+GRGL + +S   G++V+   P   Y++ P  I+  S C NCF      +  
Sbjct: 17  LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A+        ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
               I +          + ++ A +  +++ D       +    L  L  P     ++  
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
               +K   NA  + +P +            G+Y   S  NH C+PNA          E 
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            T   VR +  + +  E+ +SY         RQ  L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277


>gi|342182930|emb|CCC92410.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 372

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 78/288 (27%), Positives = 112/288 (38%), Gaps = 55/288 (19%)

Query: 17  GRGLVSTQSLKAGQIVLRDS-PILLYSALPFINSSSFCHNCFRKTMHSSSSIC--PSCSV 73
           GRG+V+ ++LK G +V+R S P L Y       +   C  C  K +      C  P C  
Sbjct: 113 GRGIVARENLKQGTVVVRRSEPWLRYP-----TADGLCALCA-KPLPERFFTCTNPQCHE 166

Query: 74  AFCSPKCSTAAASSSHSPYVCQ-------ALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
            +CS  C T A S  HS   C+        L     + S +AA+   DR   A  L+   
Sbjct: 167 EYCSRDCRTMALSLYHS-RTCRIGGLQAIELDLYTQMKSTEAAS---DRNSAAAQLLMLR 222

Query: 127 NLAV-----VNPSQFQILLAFQGTVTDNDTSAAHYLSSL-------CPPPASTTIELTAA 174
            LAV     + PS    +    G +  + T+ A +L ++       C    S   E    
Sbjct: 223 VLAVSLQQQIVPSALSEVRILSGRLLFDPTALAGHLLTIYERFTRACCTSTSIPYEEMIG 282

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           +LA+   N F   + Y+E            + L  S  NH C  N          AE+  
Sbjct: 283 VLARVTANCFH-RDTYVE------------LNLPRSMLNHSCDAN---------VAEDGE 320

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYS-TRQKRLLYDYDFACDCDRC 281
              +R   DV +  E+ ++YFP     S T + R L    F C C RC
Sbjct: 321 TGEMRTTQDVGREAELTINYFPHLKGLSYTERSRELDRRGFVCHCLRC 368


>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 334

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/307 (21%), Positives = 100/307 (32%), Gaps = 47/307 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           GRG GL  ++++KAG ++ R+ P     A+      + C  C R+        C  C VA
Sbjct: 43  GRGNGLRLSKAVKAGALLYREEPFAY--AVTKKRLGAACERCLRRKERLLR--CSQCKVA 98

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            +C  +C   A         C        + S +    P   +L  R +      +    
Sbjct: 99  RYCDARCQKEAWEDHKRECKC--------IKSVEPNFPPDSVRLVGRIIFKLLRQSTCPS 150

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCP----------------PPASTTIELTAALLA 177
            +       Q  V          L  L                  PPA    E      A
Sbjct: 151 EELYSFSDLQSNVEKLSEEMKEGLGHLAKTLQLYLKVEIQDVSQLPPALDVFET----FA 206

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
           K   N F +        +G       G+Y   S  N+ C PN        A      ++ 
Sbjct: 207 KVACNCFSI-------SNGEMQDVGVGLYPSMSLLNNSCDPNC-------AIVFEGPELH 252

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENN 297
           +R I ++ +G E+ + Y         RQ+RL   Y F CDC  C  ++   D        
Sbjct: 253 LRSIREIQEGEELTICYVETMMPTPERQERLKRQYCFECDCLMCCTKSKAYDIMKVTHGT 312

Query: 298 EEVMDED 304
           E  + +D
Sbjct: 313 EHSLVQD 319


>gi|403169846|ref|XP_003329264.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168447|gb|EFP84845.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 416

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 201 RAYG-IYLKASFFNHDCLPNACRFDYVDAAAENNTDII-VRMIHDVPQGREICLSYFPVN 258
           R +G + L+ S FNH C PN   F       ++ T ++ +R    +  G E+ +SY P+ 
Sbjct: 245 RHFGALVLQGSRFNHACRPNVVYF------IDHETQLMNLRAFESISSGEELTISYRPLE 298

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            D  +R+K L   Y F C C  C++ A 
Sbjct: 299 MDRESRRKELQETYGFRCTCPHCQMSAE 326


>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
 gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
          Length = 626

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 5/86 (5%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNT----DIIVRMIHDVPQGREICLSYFPVNY 259
           G++   S F+H C PN C F   D     N+     I+V+ +  V +G E+C+SY  +  
Sbjct: 295 GLFPMVSMFDHSCSPN-CSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCISYIDIMN 353

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEA 285
               R++ L Y   F C C RC  E 
Sbjct: 354 PTCIRRRELWYSKYFVCRCSRCMSET 379


>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
 gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
          Length = 660

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 126/324 (38%), Gaps = 49/324 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   +   +KAG+ +L + P +      F  + + C NCF ++   
Sbjct: 237 VSPLVRIDRNRQEGRFARAAADVKAGEELLVERPFVAVLLEKF--AKTHCENCFVRS--G 292

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
               C  C+ V +CS +C   AA+  H  Y C  +  L     ++N+  A    A+ PLD
Sbjct: 293 VPVACSRCADVIYCSEQCREEAANKFHK-YECGIVPILWRSGASINNHIALRIIASKPLD 351

Query: 115 RQLQARFLI----AAYNLAVVNPSQFQILLAFQGTVTDNDTS-------AAHYLSS---- 159
             L+ +  I    +   L  +    F+ +   +    +   S        A +L+     
Sbjct: 352 YFLKLKPTIDEELSPEKLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTRCLKA 411

Query: 160 ---LCPPPASTTIELTAALLAKD----RLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
                P P    I++ A+L+ +     + N   + E +     GR      G  IY   +
Sbjct: 412 GGYFGPEPKQDQIQVIASLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
            FNH C P   R+         NT I +  +  +  G  I  +Y P+    + S RQ RL
Sbjct: 472 LFNHSCDPGVVRY------FRGNT-IHINSVRPIEAGLPINENYGPIYTQDERSERQARL 524

Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
              Y F C CD C    NW   D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546


>gi|440792951|gb|ELR14156.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 627

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 45/84 (53%), Gaps = 5/84 (5%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G+++ A+  NH C PN      +  +  N+  I V  I+++ +G E+C SY      +
Sbjct: 511 ASGLFVFANSMNHSCAPNV-----IVVSCFNSFLIRVIAINEIKKGDELCFSYIDEEAPF 565

Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
             RQ++L   Y F C C++C +EA
Sbjct: 566 EQRQRQLEKLYLFECRCEKCAIEA 589


>gi|147810952|emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]
          Length = 588

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  +F NH C PNA R    D        +IV    DV  G EI  +YF V   +  
Sbjct: 378 GLWILPAFINHSCNPNARRLHVGD-------HVIVHASRDVKAGEEITFAYFDVLSPWRK 430

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           R   +   + F C+C RCK E
Sbjct: 431 RTD-MAKTWGFQCNCKRCKFE 450


>gi|167342138|gb|ABZ69017.1| SET domain-containing protein C [Pinus sylvestris]
          Length = 141

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + I +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAIFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CKNWGFECKCRRCIVE 84


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA + G+GRGL + +S   G++V+   P   Y++ P  I+  S C NCF      +  
Sbjct: 17  LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQAL-------TRLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A+        ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
               I +          + ++ A +  +++ D       +    L  L  P     ++  
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
               +K   NA  + +P +            G+Y   S  NH C+PNA          E 
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            T   VR +  + +  E+ +SY         RQ  L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277


>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 29/143 (20%)

Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
           L   G++  N T  A + + LC           AAL    + N+FGL        DG  +
Sbjct: 528 LGEGGSIEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
                +Y +AS+FNH C PN CR  Y    A          + ++ +G  + + Y  V  
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619

Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
             +  ++R L+  Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642


>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 322

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 40/84 (47%), Gaps = 8/84 (9%)

Query: 204 GIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           G+Y  A+ FNHDC PNA  RFD           + V     V +G E+ + Y  V     
Sbjct: 246 GVYPSAAMFNHDCAPNAAQRFDAFGC-------VRVETTRRVRKGEELTIPYVDVMLGRE 298

Query: 263 TRQKRLLYDYDFACDCDRCKVEAN 286
            R+ +L  ++ F C C RC+ EA 
Sbjct: 299 ERRGKLRKNFAFECACARCEREAG 322



 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 6/85 (7%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSCSVAF 75
           G  + +++ L  G +++R +P   ++A+P+ +     CH CFR     +   C +C VA 
Sbjct: 21  GEAMRASRDLPPGTLLVRVAP---FAAVPYPDEMRRSCHGCFRAC--GAERECGACGVAR 75

Query: 76  CSPKCSTAAASSSHSPYVCQALTRL 100
               C+    + ++  Y C AL RL
Sbjct: 76  LCASCAGCETTKAYHAYECHALARL 100


>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
          Length = 991

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 771 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 824

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     FACDC RC
Sbjct: 825 RNIFLKMTKHFACDCVRC 842



 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 42/278 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS--SSSICPS-CSV 73
           GRG+ +T+ + AG+ + R+  +L+        +   C  C+R    S  S+++CP+ C +
Sbjct: 53  GRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSAALCPAGCGL 112

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
               P CS    S+ H    C+   +   L S          + +A  +++      ++ 
Sbjct: 113 ----PVCSECRDSTRHD-LECKLFRKWKPLESQRI-------EPRALRILSVVRCFFLDE 160

Query: 134 SQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
           +  ++L A Q  +          AA        P     ++     +     NAF     
Sbjct: 161 ASRKLLYAMQANMDRYYMQEVQRAADCFEHF--PREQDMLDYFYRTICAFNTNAF----- 213

Query: 190 YIEGQDGRRSVRAYGIYLKASF-----FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                + R +V  + + ++A F      NH C PNA          EN   I+V     +
Sbjct: 214 -----ESRSNVDGHEVLVRALFPLAGLLNHQCTPNAAHH------FENGETIVVCATERI 262

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           P G EI +SY  + +    R+  L     F C C RC+
Sbjct: 263 PAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQ 300


>gi|20152089|gb|AAM11404.1| RE25548p [Drosophila melanogaster]
          Length = 403

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 111/278 (39%), Gaps = 29/278 (10%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK-TMHSSSSICPSCSVAF 75
           GR LV+ + L+AG+ ++R+ P+ +    P ++    C  C+   ++ +    CP C+   
Sbjct: 58  GRYLVANRQLEAGETLIREEPLAIG---PCVSGDPVCLGCYHPVSLKADQYRCPGCAWPL 114

Query: 76  CSPKCSTAAASSSHSPYVCQ--ALTRLL--NLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
           C   C+       H+   CQ  A  R +   L +  A    +    +   ++    L   
Sbjct: 115 CGSTCAGLKHRHGHTETECQLYAERRAVAGELLTERAGPAEVRDLYELVMIVRILLLRQH 174

Query: 132 NPSQFQILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
           +P QF ++   +    +   +A    HY   +       T +L   L A+      G+++
Sbjct: 175 DPEQFALIARMESHTEERRQNAVLWRHYEEKVV-QRLRVTWQLE-DLEAEQVHEVCGILD 232

Query: 189 P--YIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
              +  GQ+G    +A  +Y  A    HDC PN    D       ++ +I++R    V +
Sbjct: 233 VNCFEIGQNG---AKARTLYPSAFLLAHDCTPNTAHTD-----DPSSFEILLRTSRRVRE 284

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---FACDCDRC 281
              + L Y    Y      KR  + ++   F C C RC
Sbjct: 285 REALTLGY---AYTLQGTLKRRAFMHEGKLFWCCCRRC 319


>gi|363540350|ref|YP_004894267.1| mg216 gene product [Megavirus chiliensis]
 gi|350611515|gb|AEQ32959.1| putative set domain-containing protein [Megavirus chiliensis]
          Length = 227

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            I      FNH CLPN   +         N ++    + D+ +G E+  SY  + +D  T
Sbjct: 132 AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 184

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ RL   Y F CDC RC
Sbjct: 185 RQSRLWNQYRFHCDCQRC 202


>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
 gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
          Length = 519

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 99/284 (34%), Gaps = 39/284 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------C 68
           GR LV+T+ +K G+I+ RD P   Y+  P I +   C  C R  M    +         C
Sbjct: 55  GRHLVATRHIKQGEIIYRDEP---YAVGPKIANVPLCLGCNRNLMAGWDATRGLDRFHEC 111

Query: 69  PSCSVAFCSPKCSTAA---------ASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
             C    C P C   A         A S + P +    +      S     +PL   L  
Sbjct: 112 SRCGWPLCGPGCEEVAQHRPECSVLAGSGYRPNIRPNPSNPEQRESAYCVIVPLRVLLLE 171

Query: 120 RFLIAAYNLAVVNPSQFQILLA--FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
           R     Y       S     LA    G +  N       +  L      T ++L+A L  
Sbjct: 172 RIAPERYATVQGFESHLDERLASPLYGVLRSNLVPFLRQVLRLQQYSEQTVLKLSAIL-- 229

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
               N + +  P        + V+  G+Y   +  +HDC PN     Y D    +   ++
Sbjct: 230 --DTNCYEIRLP-------EQHVKVRGLYPLGAMLSHDCRPNT--KHYFD----DRLHMV 274

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +    D+P G  I  SY         R+  L     F C C+RC
Sbjct: 275 LVATVDIPAGGVIHASYTQPLLGTVQRRLALRQAKCFDCCCERC 318


>gi|167342086|gb|ABZ68991.1| SET domain-containing protein C [Pinus taeda]
 gi|167342098|gb|ABZ68997.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ+R   ++ F C C RC VE
Sbjct: 65  RQRR-CENWGFECKCRRCIVE 84


>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
           1015]
          Length = 386

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 111/290 (38%), Gaps = 46/290 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPS 70
           G GRGL ++  +  G+ VL   P+   + L   +    C  CF +       I    C  
Sbjct: 13  GLGRGLFASTDIAVGEEVLH-IPVPFVAVLDTEHLGEVCSGCFGQRQLEEEGITLKGCRG 71

Query: 71  CS-VAFCSPKCSTAAASSSHSPY--VCQALT-RLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C  V +C   C        HS    + Q L  R+L +N+     + L  + Q        
Sbjct: 72  CGIVKYCDKTCQAKDWKLGHSLECSIYQNLKPRVLPINARAVLRMVLRSERQK------- 124

Query: 127 NLAVVNPSQFQILLAFQGTVTD-NDTSAAHY----LSSLCPPPASTT---IELTAALLAK 178
                +  +    L     + D  D SA+ +    LSS      S T    E+ +A+ AK
Sbjct: 125 ----YSSEELDQFLQLGTHIKDIRDQSASQWERISLSSKAIRAYSGTDMSEEVISAMGAK 180

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDI 236
             LN+F L     +           G+YL   A+ FNH C       D+  A + +  ++
Sbjct: 181 LDLNSFNLTNAVYD---------RLGVYLHPYAAIFNHSC-------DHNAAVSFDGPNL 224

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            ++ +  + +  +I ++Y  V   Y  RQ  L   Y F C C +C  EA 
Sbjct: 225 HIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCSKCTREAG 274


>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
          Length = 495

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 275 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 328

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     FACDC RC
Sbjct: 329 RNIFLKMTKHFACDCVRC 346


>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
          Length = 565

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 102/286 (35%), Gaps = 39/286 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
           GRGL +    K G  +L   P +   +  F+       NC   T   +   C  C  VA+
Sbjct: 57  GRGLFTQTDAKPGTRLLSVRPHIHAVSARFLED-----NCTLCTSEENVRRCTRCKKVAY 111

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQ 135
           CS +C TA          CQ+L R    +  D+   P D       L+   N+   +   
Sbjct: 112 CSTECQTADWGIHKQE--CQSLRRWAEASGSDST--PADSIRAISRLLWMRNIKGADSIW 167

Query: 136 FQILLAFQGT----VTDNDTSAAHYLSSLC------PPPASTTIELTAA-----LLAKDR 180
           ++ + A Q       +    S  H   SL        P       + +      L++K  
Sbjct: 168 WRQIAAMQSNREHLSSSTQESYTHLAQSLVLYMKIESPEGLREYGIESGKDLVDLMSKFT 227

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACR-FDYVDAAAENNTDII 237
            N+F L   ++          A G+      +  NH C PNA   F      A    D+I
Sbjct: 228 TNSFTLTSTFLN---------AIGVATAPIPALINHSCQPNAVVVFPATRKGAPPTLDVI 278

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
              I  + +G E+  +Y  +      RQK L   Y F C C  CK+
Sbjct: 279 A--IQPIRRGEEVLAAYVDITLPREIRQKSLKETYAFECSCTLCKL 322


>gi|168069006|ref|XP_001786289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162661736|gb|EDQ48899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1117

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           R V  +G++   SF NH CLP++       A       + V    D+  G E+  +YF +
Sbjct: 878 RKVDHWGLWWLPSFMNHSCLPSSSPIRVGKA-------LFVFASRDLRAGDEVTRAYFDI 930

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
                 R++  +  +DFAC C RCK+E
Sbjct: 931 FLPLDQRKELSMKGWDFACHCPRCKLE 957


>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
 gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
          Length = 527

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 74/308 (24%), Positives = 117/308 (37%), Gaps = 76/308 (24%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI-- 67
           +A  E  GR L +T+ +K G++++R++P++L    P + S   C  C R  +        
Sbjct: 49  LASNELLGRHLRATRDIKMGELIMREAPLVLG---PKVASMPICLGCHRNLLPPQKPALN 105

Query: 68  ---CPSCSVAFCSPKCSTAA---------ASS---------------SHSPYVCQALTRL 100
              C +CS   C P+C  +A         A+S                 S Y    L R 
Sbjct: 106 YYKCSACSWPLCGPECERSAFHVDECRLMAASKFQSKINYNPGQPVGKESAYCVIMLLRC 165

Query: 101 LNLNSPDAANLP----LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY 156
           + L     A       L+  L+ R     Y +   N      L++F  TV          
Sbjct: 166 MQLKESKPAAFARLCELEDHLKERLETPLYQVLRAN------LISFIKTVL--------- 210

Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
              L   P    + + A L      NAF +       Q G R ++  G++   + F+HDC
Sbjct: 211 --GLREWPDLDILRIAAIL----DTNAFEVR------QAGDR-IKVRGLFPGGAMFSHDC 257

Query: 217 LPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD-- 273
           +PN   RFD       ++ +I+      + +G  + +SY       ST Q+RL       
Sbjct: 258 VPNMRHRFD-------DDMNIMFLAKRPIAKGEILSISY--TQQLRSTIQRRLHLKQVKC 308

Query: 274 FACDCDRC 281
           F C C RC
Sbjct: 309 FECACARC 316


>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
          Length = 491

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 67/292 (22%), Positives = 116/292 (39%), Gaps = 33/292 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
           E+V V   EG+GRGL + +    G ++  +     Y+A+ F + +   CH CF++   +H
Sbjct: 7   ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 63

Query: 63  SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A+ C   C  AA  +  +   C A+ R  +  +P   N+ L  ++  + 
Sbjct: 64  R----CGQCKFAYYCDRTCQRAAWLNHKNE--CSAIKR--HGKAP-TENIRLAARIMWKI 114

Query: 122 LIAAYNLA---VVNPSQFQILL-AFQGTVTDNDTSAAHYLSSLCPPPAST-TIELTAALL 176
                 L+   +V+    Q  + +F         +         PP +    ++  + + 
Sbjct: 115 EREGSGLSEGCLVSIDDLQNHVDSFDEEEKKELRADVESFLEFWPPHSQQFGMQYISHIF 174

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
                N F L +     Q G ++V   GI+      NHDC PN          + V +  
Sbjct: 175 GVISCNGFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMF 228

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                I +R +  +  G E+ +SY         R+K+L   Y F C C+ CK
Sbjct: 229 HTQMRIELRALTKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 280


>gi|408392130|gb|EKJ71490.1| hypothetical protein FPSE_08303 [Fusarium pseudograminearum CS3096]
          Length = 543

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 62/142 (43%), Gaps = 26/142 (18%)

Query: 205 IYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           I L  S  NH C PNA  R++      +    I V  + D+ QG+EI ++Y      Y  
Sbjct: 102 ILLATSRINHSCKPNAQTRWN------QGLGKITVHAVEDIEQGQEITITYLGNPEVYEE 155

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV--MDEDQDEQ------------- 308
           RQK+L   + F C C  C +    ++ D DD+  +E+  + ED + +             
Sbjct: 156 RQKKLTNAFGFDCHCRLCSLSP--AERDLDDKLIKEIDYLQEDLENEDSILEFPTRCLHR 213

Query: 309 --MVASDDDAEVHGDTNFPHAY 328
              V S  +AE  G +N P  Y
Sbjct: 214 IYKVVSQLEAEGVGTSNVPKLY 235


>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 712

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 62/172 (36%), Gaps = 48/172 (27%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N FGL          + S     I+ +AS+FNH CLPN CR  Y    A          +
Sbjct: 560 NNFGLF-------SSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIA------AFYAL 606

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKR-LLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
             + +G  + + Y  V    +  ++R LL  Y F C+C RC     +SD D         
Sbjct: 607 QSIRKGEPLTICYVDVQEASTAERRRTLLTSYRFFCECRRCH---GYSDEDG-------- 655

Query: 301 MDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPPSDA 352
                              GDT      F     CD  +  G L PL P DA
Sbjct: 656 ------------------KGDTVMSEIRF-----CDTCDARGYLRPLAPEDA 684


>gi|170115757|ref|XP_001889072.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164636020|gb|EDR00320.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 353

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 45/94 (47%), Gaps = 12/94 (12%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS- 262
           GI L AS FNH C PNA RF +       N  + +  + D+P G EIC++Y      Y  
Sbjct: 167 GIGLTASRFNHSCSPNA-RFSF----NPTNGQLRIFALSDIPVGEEICVAYLSSRRLYGQ 221

Query: 263 TRQKR---LLYDYDFACDCDRC---KVEANWSDN 290
           TRQ R   L   Y F C C  C   K E   SD+
Sbjct: 222 TRQHRQGILRSRYHFTCSCSVCSLPKAEMILSDS 255


>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
 gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
           112371]
          Length = 498

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 62/289 (21%), Positives = 105/289 (36%), Gaps = 52/289 (17%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK------TMHSSSSICPS 70
           GRGL +T ++K G+ V    P    + L        C NCF         M S      +
Sbjct: 20  GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNVMASVDMKLWA 78

Query: 71  CS----VAFCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           CS    V +C  KC +A     H               P   +A+ R++ L   +  ++ 
Sbjct: 79  CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKVLPVNSRAVLRIVKLRDSNEDHVK 138

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
            D  L     + ++ + + N +Q Q          + + S +              +E  
Sbjct: 139 SD--LNMFLTLRSHLVEITNSNQEQYERIMLCAKAEKEYSGSEL-----------DVETI 185

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA           +
Sbjct: 186 AEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGFD 231

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              I V+ + ++    ++ +SY    Y    RQK+L   Y F C C +C
Sbjct: 232 GGLITVKALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKC 280


>gi|407394360|gb|EKF26913.1| hypothetical protein MOQ_009381 [Trypanosoma cruzi marinkellei]
          Length = 393

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 72/353 (20%), Positives = 129/353 (36%), Gaps = 93/353 (26%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSS-------- 66
           +GR +++T  + A   +  ++PI+ + A  +I     FC +C R+  +S ++        
Sbjct: 30  KGRHVIATADIPAQSGLFEEAPIVSWPAQGYIALDIPFCSHCLRQKANSDAANTQNDSLW 89

Query: 67  -ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL--------LNLNSPDAANLPLDRQL 117
             C  C   FCS  C  ++  + H   +C AL  L           N+   ++L + ++ 
Sbjct: 90  RRCDGCGSYFCSDNCEFSSKIAHH--ILCGALRELREEEDAGSFYTNTGKCSDLTITKES 147

Query: 118 QARFL--IAAYNLAVVNPSQF-----------------------------QILLAFQGTV 146
            AR +  + A     +   QF                             +++ A  GT 
Sbjct: 148 LARCVAWVVARIATSIKQQQFSGELLERHHTENEESISRQLFHIACAPFNRLIDAPNGTE 207

Query: 147 TDNDTSAAHY---------------LSSLCPPPA---STTIELTAALLAKDRLNAF-GLM 187
             +  + + Y               L +  PP +   +  +E+  ALL  D L  F G M
Sbjct: 208 FGDVDALSWYRTVDRLLREPCRIALLEATLPPRSENDAWALEIVDALLRHDTLETFLGQM 267

Query: 188 EP--------YIEGQDGRRS----------VRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
                      I+   G ++          ++   +Y   S FNH C PN        + 
Sbjct: 268 SLNSQAINGFIIQCPSGEKAFSAPPLVEWVLKGGAVYTLQSVFNHSCDPNV-----TVSN 322

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            +   DI +R +  V  G E+ ++Y P+      ++   L  Y F C C RC+
Sbjct: 323 VDGTHDITLRTLRPVKCGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRCQ 375


>gi|367019314|ref|XP_003658942.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
           42464]
 gi|347006209|gb|AEO53697.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
           42464]
          Length = 414

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 19/114 (16%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   S FNHDC PN     +   +   +T + VR   D+P G E+ +SY    Y    
Sbjct: 181 GLFPDVSKFNHDCRPNV----HYRISDLTHTTVAVR---DIPAGEELTISYI---YGLKP 230

Query: 264 RQKRL--LYDYDFACDCDRCKV---EANWSDNDNDDENNEEVMDEDQDEQMVAS 312
           R +RL  L ++ F C C +C +   EA  SDN       +  M ED+ E++VA 
Sbjct: 231 RAERLEQLSEWGFMCTCPQCALSEPEAGASDN----RIRQIKMLEDEIEELVAK 280


>gi|363749425|ref|XP_003644930.1| hypothetical protein Ecym_2380 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356888563|gb|AET38113.1| Hypothetical protein Ecym_2380 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 366

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 67/153 (43%), Gaps = 19/153 (12%)

Query: 135 QFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYIE 192
           +F +LL FQ       T    +L    P    +  T EL   +L  +  N+FG+ +   E
Sbjct: 225 EFPVLLKFQ-------TEVYMFLYVTLPENLKSLMTRELFRKILGSEYGNSFGIWQEN-E 276

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
             D R  +  Y +  + SFFNH C PN  +         N  + ++    DV  G E+C+
Sbjct: 277 AFDNREYL-GYWMLPEGSFFNHSCAPNLSK-----KRIGNVMNFVLN--KDVKMGDELCI 328

Query: 253 SYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
            Y  + +     R+K L  ++ F C C RC +E
Sbjct: 329 DYKGILDLPIHQRRKILKTNWFFECQCSRCSLE 361


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 69/279 (24%), Positives = 101/279 (36%), Gaps = 39/279 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL +T+S   G+++    P   Y  L      + C  CF  T     S C  C  A
Sbjct: 14  GKGRGLKATRSFALGELLF-SCPAYTY-VLTDNERGNHCDFCF--TRKEGLSKCGKCKQA 69

Query: 75  F-CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL-I 123
           F C+  C            ++  S     C + T  L            +R    RF+ +
Sbjct: 70  FYCNVDCQKGDWPMHKLECSSMCSSGQNWCPSETVRLTARILAKQKTQTERTASERFMSV 129

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAH-YLSSLCPPPASTTIELTAALLAKDRLN 182
             +       S    L   +  + +ND SA H + S       +  +E    L A+   N
Sbjct: 130 KEFE------SHLSKLDNEKKELIENDISALHRFYSKNVHNCDNAALEF---LFAQVNCN 180

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
            F +       +D   S     I+   +  NH C PN     Y    AE      VR + 
Sbjct: 181 GFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTVAE------VRAVQ 226

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           ++  G E+  SY  + Y    R  RL   Y F+CDC  C
Sbjct: 227 EIHAGEEVFTSYIDLLYPTEDRNDRLKDSYFFSCDCREC 265


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 118/312 (37%), Gaps = 58/312 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--RKTMHSSSSICPSCSV 73
           RG GL +   L+ G+++ R  P L Y+     +    C  C   RK +      C  C +
Sbjct: 14  RGNGLRALVQLRPGELLFRSDP-LAYTVCKG-SRGVVCVRCIIGRKYL----LYCSRCRL 67

Query: 74  -AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL---------QARFLI 123
            A C  KC   A         C+ L        PD+  L L R +         ++  L 
Sbjct: 68  LAQCGAKCRKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEETPSESEKLY 124

Query: 124 AAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
           + Y+L                Q+++ FQ  + +    A+        PP+    E  A +
Sbjct: 125 SFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPSFDIFEAFAKV 178

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           +     N+F +    ++           G+Y   S  NH C PN        +   N   
Sbjct: 179 IC----NSFTVCNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SIVFNGPH 220

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD- 294
           +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D D D  
Sbjct: 221 LLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQ----DKDADML 276

Query: 295 ENNEEVMDEDQD 306
             +E+V  E Q+
Sbjct: 277 TGDEQVWKEVQE 288


>gi|389604014|ref|XP_003723152.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322504894|emb|CBZ14681.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 623

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 98/231 (42%), Gaps = 40/231 (17%)

Query: 68  CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA--RFLIA 124
           C +C+ VA+CSP C+ A     H  + C  +TRL ++     +     R+L A  +F   
Sbjct: 262 CTTCNMVAYCSPACAEAH-REQHEKFECSRMTRLSDMMESIES-----RRLDAPEQFFEL 315

Query: 125 AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP-----------PPASTTIELTA 173
           AY+  +   +  +        V    +     + SL P              +TT+ +  
Sbjct: 316 AYH-CITTLAGMKAKRQGHENVLKLTSHVDEVIQSLHPVGPLMCDLFKGEEETTTLYIII 374

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY-LKASFFNHDCLPNACRFDYVDAAAEN 232
            +L   R NA  + +P      G    +A  +    ASFFNH C+PN C  D V  A   
Sbjct: 375 GVL---RCNALEVTDP-----SGLGVAQALHVGNTIASFFNHSCMPN-CAIDTVRHAI-- 423

Query: 233 NTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
              +  R+I+    G E+ ++Y P + +    R++RL   Y F C C RC+
Sbjct: 424 ---VTTRIIY---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCHCPRCQ 468


>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
          Length = 1064

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 51/119 (42%), Gaps = 14/119 (11%)

Query: 172  TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
            T +L+ K R N FG+          R           AS+FNH C PN        A   
Sbjct: 904  TRSLICKIRCNQFGIWSK-------RDKCIGVSCTPAASYFNHSCCPNL-------ADVR 949

Query: 232  NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
              T ++ + +H + +G  I +SY  ++     RQ  L   Y F C C RCK + + SDN
Sbjct: 950  GTTVVVFKALHFISKGTPISISYLDLDQPTHERQSYLKTFYYFTCQCLRCKDQTDESDN 1008



 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 15/120 (12%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
           GR LV+++ L+ G+IVL     L  S    +     CHNC       +S I         
Sbjct: 646 GRYLVASKDLEIGEIVLVARSFL--SIESDLLRKQTCHNCLEYQNTKASDIDSALKSSVM 703

Query: 68  --CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
             C  C+ V +CS  C  +  S+ H  Y CQ   +L + N+ ++  +  D   + + ++ 
Sbjct: 704 QRCVGCNEVYWCSESC-YSIGSAIHQRYECQYFKKLRSYNNSNSKAITGDNMTEVKMIVG 762


>gi|449302416|gb|EMC98425.1| hypothetical protein BAUCODRAFT_574506 [Baudoinia compniacensis
           UAMH 10762]
          Length = 657

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 51/110 (46%), Gaps = 12/110 (10%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++ AS+ NHDC+ NA R    D        +IVR    +  G EI LSY P    Y T
Sbjct: 430 GMWITASYMNHDCIGNAKRSFIGDM-------LIVRATKAIAAGEEITLSYVPTEDAYET 482

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASD 313
           R +  L  ++  C C  C  E    + D      +     +Q E+++A++
Sbjct: 483 RTE-TLQAWNIRCRCRLCVAE----EQDTIVARKQRAGLIEQAERLMAAN 527


>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
          Length = 338

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 8/89 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+YL  S  NH C+PNA           NN+ + V+ + ++  G EIC+SY     +   
Sbjct: 236 GLYLLQSTCNHSCMPNA-----EITFPHNNSTLAVKALSNIKTGEEICISYLDECALERS 290

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             +R K L  +Y F C+C +C+ + + +D
Sbjct: 291 RHSRHKILRENYLFNCNCSKCESQLDEAD 319


>gi|270011901|gb|EFA08349.1| hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
          Length = 803

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 60/305 (19%), Positives = 113/305 (37%), Gaps = 46/305 (15%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
            +V +   + E    GR LV+T++LK G++V++++P++     P  NS   C  C +   
Sbjct: 40  KSVCQPFKICEDAVLGRHLVATRALKPGEVVIQEAPLIWG---PSYNSIPVCLGCGKAVD 96

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQ----ALTRLLNLNSPDAANLPLDRQL 117
            + S  C  C    CS  C  + +      Y  Q       ++  +  P    + + R L
Sbjct: 97  ETCSRPCQKCGWPVCSDLCEKSPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCL 156

Query: 118 -QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAAL 175
            Q +FL          P  ++ L   Q  + ++ ++  +    +          +L+A  
Sbjct: 157 YQKQFL----------PKNWKKLETLQNHLEEHKSTTNYESDRVTIAEFIRRFFKLSATF 206

Query: 176 LAKDRLNAFG----------LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
             +D +   G          L EP+              IY  AS   H C PN  +   
Sbjct: 207 SEEDIMKVHGSTLVNGHEVPLTEPF-----------HVAIYSSASMLEHSCGPNCTK--- 252

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
              +      I++     + +G  + + Y    +   +R+  L     F C C+RC+  +
Sbjct: 253 ---SFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPS 309

Query: 286 NWSDN 290
            +  N
Sbjct: 310 EFGTN 314


>gi|194743442|ref|XP_001954209.1| GF16861 [Drosophila ananassae]
 gi|190627246|gb|EDV42770.1| GF16861 [Drosophila ananassae]
          Length = 550

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 72/297 (24%), Positives = 110/297 (37%), Gaps = 50/297 (16%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILL-----YSALPFINSSSFCHNCFRKTMHSS 64
           VAE E  GR LV+TQ++ A QI   + P+++     +S    I     CH   R   H  
Sbjct: 48  VAEDEQLGRFLVATQNIAAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCRLGKHQ- 106

Query: 65  SSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIA 124
              C SC    CS  C+  A   S           +L+L     A    +R L   F   
Sbjct: 107 ---CRSCHWPVCSAACADEALECS-----------VLSLGPTPGARAD-NRTLNDYFRGD 151

Query: 125 AYNLAVV------NPSQFQILLAFQGTVTDNDTSAAH--------------YLSSLCPPP 164
           A  +         NP+++  LL  Q    +   +  H              +L  L    
Sbjct: 152 ALLVLRCLLLQRQNPAKWTALLEMQSHEEERQGTELHVEAEMRVVDFLQQRFLKRLKQTK 211

Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
                +  + LL +      G++E      +    V   G++ +A    H C PN C F 
Sbjct: 212 PDLLADCGSELLHR----ICGIIETNYMVIELPSGVELSGLFRQACMMEHACQPN-CYFQ 266

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           + DA  +    I VR   D+ +G  + ++Y  + +    RQ  L     F+C C RC
Sbjct: 267 F-DAFTQQ---ITVRAGCDLQKGDHLRIAYTNILWGTQLRQHHLRLTKHFSCHCARC 319


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 43/289 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP-FINSSSFCHNCFRKTMHSSSS 66
           + VA + G+GRGL + +S   G++V+   P   Y++ P  I+  S C NCF      +  
Sbjct: 17  LAVASVPGKGRGLFAARSFFPGEVVISQEP---YASTPNKISVGSNCDNCF---ASRNLR 70

Query: 67  ICPSCSVA-FCSPKCSTAAASSSHSPYVCQALT-------RLLNLNSPDAANLPLDRQLQ 118
            C  C VA +C   C             C+A+        ++L         L L R+LQ
Sbjct: 71  KCSVCRVAWYCGSACQREEWKLHQLE--CRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQ 128

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT------SAAHYLSSLCPPPASTTIELT 172
               I +          + ++ A +  +++ D       +    L  L  P     ++  
Sbjct: 129 DDKAIPS-----SGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSFELDLKEI 183

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
               +K   NA  + +P +            G+Y   S  NH C+PNA          E 
Sbjct: 184 THTFSKFACNAHTICDPELRPL-------GTGLYPVLSIINHSCVPNAVLI------FEG 230

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            T   VR +  + +  E+ +SY         RQ  L + Y F C C RC
Sbjct: 231 RT-AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277


>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 494

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 67/280 (23%), Positives = 107/280 (38%), Gaps = 51/280 (18%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP------S 70
           GRG+ +T+ + A +++  D+P L+            C  C++       S CP       
Sbjct: 42  GRGMFATRDIAANELIFIDAP-LIVGPRCVGKHLEMCVCCYK-------SECPLFPCDRG 93

Query: 71  CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD-AANLPLD----RQL----QARF 121
           C +  CS +C     S  H  Y C  L  L+     D + NL L     R L    Q R 
Sbjct: 94  CGLPICSLECEN---SPKHVNYECNYLRSLVPTCGTDWSLNLLLAVIPIRALFMTEQQRK 150

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRL 181
            +AA        SQ++I L  +        +    +  +C    S + E T ++L KD  
Sbjct: 151 CLAALQCDQTLTSQYEINLLQRNVEKSPSEADVELMKRVCGAFNSNSFE-TVSVLDKDH- 208

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
                            S    G+Y   +  NH+C PN  R  +     + N  + V   
Sbjct: 209 -----------------STSLRGLYPLGALQNHNCAPNT-RHHF-----DENYRLYVSAT 245

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +  G E+ +SY  + +D + R++ L     F+C C RC
Sbjct: 246 VPISAGEELTMSYTSLFWDTTLRRQFLSITKHFSCTCKRC 285


>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
 gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
          Length = 400

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 70/341 (20%), Positives = 121/341 (35%), Gaps = 89/341 (26%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFR--------KTMHSS 64
           E  GR +++T+ +  G+ +L+      Y ++P  I     C NC +        K     
Sbjct: 14  ETEGRYIIATRDIDVGENLLKCKA---YFSMPCEIYKKITCGNCVKIIKRPQQEKKKKVK 70

Query: 65  SSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD--------- 114
              C  C+ + +CS  C    +   H+ Y C+   ++    +P  +   +D         
Sbjct: 71  EYKCNVCNEIWYCSEFCK-KESQPIHAHYECKFFKKI---KAPKLSEWEIDPDTFTEVRM 126

Query: 115 ------RQLQARFLIAAYNLA----------VVNPSQFQ-------ILLAFQGTVTDNDT 151
                 R  Q R L   +NL+          ++N  +         I    + T+ D   
Sbjct: 127 MVGVISRFYQERVLNKKFNLSNYLKEQQEKRLLNQEELSSEDTLDDIFDLVENTIDDGSN 186

Query: 152 SAAH--------YLSSL------------CPPP------ASTTIELTAALLAKDRLNAFG 185
            AA         Y+++L             P P          I +   L+ K R N FG
Sbjct: 187 KAAKELIDIITDYIANLFNLVITSSDNHDLPTPELKKEAVEKVISVIRELIHKVRCNQFG 246

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +               A  I   +SFFNH C+PN            +   +  + ++ V 
Sbjct: 247 IW-------TKNDKCIAVAISPSSSFFNHSCIPNCINI-------RDGNKMTFKALYPVK 292

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           +G  + +SY  ++    +R++ L Y Y F C C RC  + N
Sbjct: 293 KGEPLAISYLDLDLPVESRKEYLKYGYYFDCGCPRCDEKTN 333


>gi|451850767|gb|EMD64068.1| hypothetical protein COCSADRAFT_320770 [Cochliobolus sativus
           ND90Pr]
          Length = 388

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/216 (25%), Positives = 84/216 (38%), Gaps = 38/216 (17%)

Query: 80  CSTAAA-SSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQI 138
           C  A A S  H   V +AL   +   +PD  +          F ++       +P Q++ 
Sbjct: 194 CQAAVAISKQHKKAVSKALQGRI---TPDVMS----------FCVSGILWRHHHPEQWER 240

Query: 139 LLAFQGTVT------DNDTSAAHYLSSLCPPPAS----TTIELTAALLAKDRLNAFGLME 188
           L A     T      D D     YL  +   P +     T E    L  +D  N+FG+  
Sbjct: 241 LEALANDNTPYHSSDDLDAFTRAYLHLVAILPVALLPLVTAETLLTLSKRDSHNSFGIRS 300

Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
              EG +       YG +  AS+FNH C  N      ++   E       R   ++ +GR
Sbjct: 301 LEDEGSE----FFGYGCWPAASYFNHSCGQN------IEKKREGRI-WYFRAGRNIEKGR 349

Query: 249 EICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRC 281
           E+C++Y        +R +R+L     + F C C+RC
Sbjct: 350 ELCITYLSGEERMLSRGQRMLRLKKTWGFDCGCERC 385


>gi|125585065|gb|EAZ25729.1| hypothetical protein OsJ_09564 [Oryza sativa Japonica Group]
          Length = 509

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 8/85 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  +F NH C PNA R    D A       IV    D+  G EI  +YF V    
Sbjct: 284 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 336

Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
           S R++     +   C CDRC+ EA+
Sbjct: 337 SKRRE-AARAWGLECQCDRCRFEAS 360


>gi|115450985|ref|NP_001049093.1| Os03g0168700 [Oryza sativa Japonica Group]
 gi|19071649|gb|AAL84316.1|AC073556_33 hypothetical protein [Oryza sativa Japonica Group]
 gi|108706390|gb|ABF94185.1| TPR Domain containing protein [Oryza sativa Japonica Group]
 gi|113547564|dbj|BAF11007.1| Os03g0168700 [Oryza sativa Japonica Group]
          Length = 536

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 40/85 (47%), Gaps = 8/85 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+++  +F NH C PNA R    D A       IV    D+  G EI  +YF V    
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGDHA-------IVHASRDIKAGEEITFAYFDVLTPA 363

Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
           S R++     +   C CDRC+ EA+
Sbjct: 364 SKRRE-AARAWGLECQCDRCRFEAS 387


>gi|66805955|ref|XP_636699.1| hypothetical protein DDB_G0288495 [Dictyostelium discoideum AX4]
 gi|74852593|sp|Q54IV4.1|Y8495_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0288495
 gi|60465095|gb|EAL63196.1| hypothetical protein DDB_G0288495 [Dictyostelium discoideum AX4]
          Length = 532

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 18/100 (18%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAE------------------NNTDIIVRMIHDVP 245
           G+YLK S FNH C PN       +   +                  N   I +  + D+ 
Sbjct: 379 GVYLKLSLFNHSCFPNCTTLIEYNINKKNSNNNNGNNGNNNSYGDTNQLTISIITLRDIE 438

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           + +E+ ++Y P+N   + R K L  ++ F CDC RC  E 
Sbjct: 439 ENQELLITYIPLNQKINDRVKSLKSNWLFQCDCKRCHFEK 478


>gi|452984368|gb|EME84125.1| hypothetical protein MYCFIDRAFT_173174 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 816

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   + FNH CLPN C + ++          IVR    +  G E+ L+Y P + D   
Sbjct: 489 GLWYTGAMFNHSCLPN-CNWSWIGDM------FIVRANRVITMGEELTLAYIPSSNDSEK 541

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R+  L   Y F C C  C+ +A 
Sbjct: 542 RRNTLRSQYGFECGCRICQADAT 564


>gi|344301790|gb|EGW32095.1| hypothetical protein SPAPADRAFT_61174 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 415

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 64/313 (20%), Positives = 113/313 (36%), Gaps = 51/313 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSFCHNCFRKTMHSS 64
           V +     RG+GL +  ++  G+++  + P+     L  IN   S + C +C R    S 
Sbjct: 113 VQIVMTSKRGKGLYAKHNIAKGELIWEEEPLFFIPPLANINLIKSGTACSHCGRLLQQSE 172

Query: 65  SSI------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
            +       C  C+  +C  KC            V   L +  N+ +P+A    L    +
Sbjct: 173 GNTVLRGLDCQVCAEVWCGTKCKKLDG-------VLHGLLKH-NVYNPNAKRSKL-FDPE 223

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLC-------------PPPA 165
           A   +  Y L     + + I L +   + D   + A    ++                 A
Sbjct: 224 AFLHMQDYCLEDQWNALYAITLIYADILNDKSGTKAKQFKAMARVSQDIRYKALNSSAGA 283

Query: 166 STTIELTAALLAKDRLNAFGL-MEPY--------IEGQDGRRSVRAYGI-------YLKA 209
             T+   A  + + + N + L  E +        I+ Q+    +  Y I       +L  
Sbjct: 284 FDTLSGGALFVQEQQENLWKLGYEKFLRIFPSNPIDYQEFMYMMGTYNINNLDSNVFLIQ 343

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
           S  NH+C PN      V+        + V    D+  G E+  +Y         RQ+ L 
Sbjct: 344 SHLNHNCNPNTS----VETELLRTDGLKVYAARDIRAGEELTTTYVNPKNTVQQRQRELR 399

Query: 270 YDYDFACDCDRCK 282
            ++ F C C++CK
Sbjct: 400 VNWGFLCGCEKCK 412


>gi|393212472|gb|EJC97972.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 444

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 124/335 (37%), Gaps = 95/335 (28%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI------CP 69
           +G+GL +TQ +K G+ V ++ P ++  +    +     H CF  T   +SS       CP
Sbjct: 89  KGKGLEATQEIKEGEDVWKEDPFVISPSWDIYDLQIASHACFHCTTPFNSSTLPHSLPCP 148

Query: 70  S-----------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
           S           C   FC+  C T A S  H P +C A          + A++PL    +
Sbjct: 149 SSTNPALSQNHPCPARFCNRLCITRAMSRQH-PLLCPAT---------NPASVPLLNFAR 198

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQ--------------------------------GTV 146
           +R  +AA+ +A       ++L+AF+                                G  
Sbjct: 199 SRGWLAAHAVAQCI---ARVLMAFEQGRKQDLEEDLRFIRSLAQMSMDERWKIIETSGME 255

Query: 147 TDNDT-SAAHYLS--SLCPPPASTTIELTAALLAKDR--------------LNAFGLMEP 189
            D+ T   AH L+  +   P      +L + ++ K                L+A   M  
Sbjct: 256 PDHQTWKTAHTLTLQAFHEPSNPRDKKLLSKIIRKPLPEDLVKTLFDYDAFLHALSRMGL 315

Query: 190 YIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC--RFDYVDAAAENNTDIIVRMIHDVPQG 247
            +E   G        +Y+  S  NH C PN     FD   A +     I +R   D+  G
Sbjct: 316 NLEAHGG--------LYILHSHMNHSCEPNISVRHFDQRTALSR----ITMRARRDIQPG 363

Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFA-CDCDRC 281
            E+ ++Y       S R +R L  + F  C CDRC
Sbjct: 364 EELTVAYVDPALLLSAR-RRALIPWAFGTCMCDRC 397


>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 485

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 62/278 (22%), Positives = 111/278 (39%), Gaps = 48/278 (17%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP--SCSVA 74
           GRG+ +T+ +K  +++  D+P+++         +  C  C++         CP   C   
Sbjct: 41  GRGMFATRDIKQNELIFIDAPLIVGPKCSN-KHTKMCVCCYKNE-------CPLFPCDKG 92

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD------AANLPLD----RQLQARFLIA 124
              P CST   +S +    C+ L  L+ +   D       A +P+      ++Q + L  
Sbjct: 93  CGLPVCSTQCENSPNHVNECEYLRSLVPICGTDWCLNLLLATIPIRGLFMTKMQRKCLAT 152

Query: 125 -AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNA 183
             YN  + +  + ++L               + ++S    P+   IE+   +      N+
Sbjct: 153 LQYNKNLFSKYEIELL-------------KKNVMNS----PSDEDIEMMERVCRAFNTNS 195

Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           F      I   D   S    G+Y   S  NH C+PN     Y D   +    + VR    
Sbjct: 196 F----ETIRVHDKDHSTNLRGLYPLGSLQNHCCIPNT--RHYFDEEFQ----LYVRAALP 245

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +  G EI +SY  + +D + R++ L    +F+C C RC
Sbjct: 246 ISTGEEITMSYTSLFWDTTLRRQFLNVTKNFSCMCKRC 283


>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
 gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
          Length = 445

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 6/88 (6%)

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
           + G   V   G++   +  NH+C PNA  +       +N T  +VR   D+P+G EI  +
Sbjct: 215 RSGGHEVLLRGLFPLTAIMNHECTPNASHY------FDNGTLAVVRAARDIPKGGEITTT 268

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y  + +   TR   L    +F CDC RC
Sbjct: 269 YTKILWGNLTRGIFLKMTKNFNCDCVRC 296


>gi|336364311|gb|EGN92671.1| hypothetical protein SERLA73DRAFT_116951 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336378310|gb|EGO19468.1| hypothetical protein SERLADRAFT_478912 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 302

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREI 250
           +G  G R+  A G++L  S FN  C+PN   R+D      EN   +  R + D+  G E+
Sbjct: 93  DGLSGHRASTA-GVFLCGSRFNSSCVPNVNNRWD------ENKQAVEFRALKDIVSGEEL 145

Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
           C+SY  +      R+K L   + F C C+ C +
Sbjct: 146 CISYGSLLETGEMRRKELAAKFKFHCRCETCSL 178


>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 440

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 56/287 (19%), Positives = 109/287 (37%), Gaps = 33/287 (11%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
           S + ++  +AE    GR  ++T+++K G+I+LRD   L+   +   N+   C  C+    
Sbjct: 39  SKICKSFKLAENPSLGRHYIATRNIKVGEIILRDDQPLITGLM--YNTVPVCLQCYTVLN 96

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
              +  C  C    C   C+        S     +   +     P  +        Q   
Sbjct: 97  QEIAIPCEKCGWPLCQ-NCNEHGLECKFSSSRRDSKISITEFGYPHPS-------YQCIN 148

Query: 122 LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA-----AHYLSSLCPPPASTTIELTAALL 176
           +I A +L   NP  ++ L++ +    + + S      AH++         +  E+   + 
Sbjct: 149 VIRALSLKDTNPESYKKLISLESHCNEMNNSKEPLNIAHFIKRFFKADDISEEEIATII- 207

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
                   G+++  + G +   +   Y  +Y  AS   H+C  N C   + D        
Sbjct: 208 --------GILQ--VNGHEVPLTDSPYVAVYEMASLIEHNCRAN-CSKSFTDMGG----- 251

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +I+R    + +G  I + Y    +  + R+  L     F C C+RC+
Sbjct: 252 LIIRAALPITKGDHISICYTDPLWGTANRRHHLFKTKFFECICNRCQ 298


>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
          Length = 370

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 69/279 (24%), Positives = 105/279 (37%), Gaps = 43/279 (15%)

Query: 20  LVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPK 79
           L    ++K G ++L + P      L      S C  C  K      S C    V +C+  
Sbjct: 4   LKHINAVKTGDLILSEEPFAY--VLSSKEKGSRCDFCLEKGKVLKCSGCQF--VHYCNRS 59

Query: 80  CSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ--ARFLIAAYNLAVVNPSQFQ 137
           C   A       + C  L R+     PDAA L L R L    R    AY  A   P+ F+
Sbjct: 60  CQKDAWEDH--KWECANLKRIAPKTIPDAARL-LARILNRLQRGNGGAYK-AFYTPTSFR 115

Query: 138 I---LLAFQGTVTDNDTSAAH----------YLSSLCPPPASTTIELTAALLAKDRLNAF 184
           +   L++    +  +     H          YL  +  P  +  + L   ++    +N+F
Sbjct: 116 VWKDLMSHYSDLKSDKKRMDHFSTLSMVLFEYLKDISLPNTADLMGLYGRMV----INSF 171

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            ++       D   +    GIYL +S  +H C PNA        A  +   I +R + D+
Sbjct: 172 TIL-------DIEMNSIGTGIYLASSVIDHSCNPNA-------VAVFDGKTINIRALKDM 217

Query: 245 P--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                ++I +SY  +      RQ  L   Y F C CDRC
Sbjct: 218 NCLDWKKIRISYIDLMKTPYERQMELRQSYYFLCQCDRC 256


>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
          Length = 462

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 242 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 295

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     FACDC RC
Sbjct: 296 RNIFLKMTKHFACDCVRC 313


>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
 gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
 gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
          Length = 462

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 242 GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 295

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     FACDC RC
Sbjct: 296 RNIFLKMTKHFACDCVRC 313


>gi|448532414|ref|XP_003870424.1| Set5 protein [Candida orthopsilosis Co 90-125]
 gi|380354779|emb|CCG24294.1| Set5 protein [Candida orthopsilosis]
          Length = 499

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           ++L  S  NH+C PN      VD A++  + + V    D+ +G E+  +Y   ++  + R
Sbjct: 360 VFLTQSHLNHNCDPNTT----VDTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTLNQR 415

Query: 265 QKRLLYDYDFACDCDRCKVEA 285
           Q+ L  ++ F C C++CK +A
Sbjct: 416 QRELRVNWGFICACEKCKSDA 436


>gi|159485956|ref|XP_001701010.1| hypothetical protein CHLREDRAFT_142740 [Chlamydomonas reinhardtii]
 gi|158281509|gb|EDP07264.1| predicted protein, partial [Chlamydomonas reinhardtii]
          Length = 486

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++  A++FNH CL N  R+   D        + VR   D+P G E+ ++Y      +  
Sbjct: 394 GLWTLAAYFNHACLDNTERYFLGD-------FLFVRASRDLPAGAEVTITYLNPLMSFRE 446

Query: 264 RQKRLLYDYDFACDCDRCKVEANW 287
           R  R L    FAC C+ C  E  W
Sbjct: 447 R-ARALEKRGFACGCELCSGEIEW 469


>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
 gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
          Length = 660

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 121/324 (37%), Gaps = 49/324 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   ++  +K G+ +L + P +  S L    + + C NCF +T+  
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
            +  CP C+ V +CS +C   A+   H  Y C  +  +     ++N+  A    A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351

Query: 115 RQLQARFLI----------------------AAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
             L+ +  I                         +     PS F   +     +T+   +
Sbjct: 352 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTNCLRA 411

Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
             ++ S   P   S    L    L   + N   + E +     GR      G  IY   +
Sbjct: 412 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
            FNH C P   R+          T I +  +  +  G  I  +Y P+    + S RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 524

Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
              Y F C CD C    NW   D+
Sbjct: 525 KDLYWFECSCDACI--DNWPKFDD 546


>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
 gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
          Length = 73

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 7/78 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+  A   NH C PN+        A  N T+I ++ + ++P G E+ +SY    +   T
Sbjct: 3   GIFPNAVCLNHSCAPNS-------VAVFNGTNIYIKALEEIPVGEELTISYIQQLHPRET 55

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ+ L   + F C C RC
Sbjct: 56  RQEELQTQFCFYCQCHRC 73


>gi|167342096|gb|ABZ68996.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
           +RS    G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF 
Sbjct: 5   QRSQDFSGLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFD 57

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +      RQ R   ++ F C C RC VE
Sbjct: 58  ILLPLPQRQGR-CENWGFECKCRRCIVE 84


>gi|440790098|gb|ELR11386.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 307

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 210 SFFNHDCLPNACRFDYVDAAAENNT-DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
           S  NH C PN        A A  ++  + +    DV    E+C+SY P+++  +TRQ  L
Sbjct: 222 SSLNHSCQPNVKSTLRSQAQARGDSCRVSLIAKQDVEPNEELCISYVPLSWTRATRQTTL 281

Query: 269 LYDYDFACDCDRCKVEA 285
           L +Y F C C+RC  EA
Sbjct: 282 LKNYFFHCQCERCVSEA 298


>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
 gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
          Length = 392

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y+  S  NH C+PNA   F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYILQSKVNHSCVPNAQSTFPY------SNDIVVLKAVAPIQKGEEICISYLDECQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKILRENYIFVCQCSKCRAQAS 368


>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
           ND90Pr]
          Length = 529

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 68/325 (20%), Positives = 123/325 (37%), Gaps = 48/325 (14%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS-------- 65
           E  G GL + Q   AG+ + R    L+ S L        C NC+  T  +S+        
Sbjct: 29  ESVGNGLFANQRFGAGEEIARFKRPLVGS-LETERLLDTCANCYLWTEGASAGTRLYVPE 87

Query: 66  ----SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
               S C +C    +CS  C   A +  H  + C+ L  +     P A        L + 
Sbjct: 88  GAEVSKCAACQRFRYCSKACQKEAWNRGHR-HECKILKPMAGRGLPKAF-------LASM 139

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTD--------NDTSAAHYLSSLCPPPASTTIELT 172
            L+      ++   ++++L      + D        N    A   +    P      +  
Sbjct: 140 ELLTRRKHGLIPDKEWELLCQLPSHIDDFKRNGTYQNIEMMAMGAAQFALPQNIFDKDFV 199

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAA 230
           AA+  +   NA  L+ P ++           GI +  +    NH C PNA  +  +D   
Sbjct: 200 AAMYGRVMSNALTLITPTLD---------PLGIIVDPTLCQMNHSCDPNA--YIMMDGPL 248

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
                I +R +  + + +EI +SY    + Y  RQ  L   + F C C +C+ +A   ++
Sbjct: 249 -----ISIRTLRPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKCQDKATHQED 303

Query: 291 DNDDENNEEVMDEDQDEQMVASDDD 315
           +     N + +   ++ + +AS+ +
Sbjct: 304 NWLVPANSDFVASKEELEALASNQE 328


>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
 gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
          Length = 787

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           DGR ++   G++   +  NH+C PNA    Y D    N    IVR   D+P+G EI  +Y
Sbjct: 559 DGRETL-LRGLFPLTAIMNHECTPNAS--HYFD----NGRLAIVRAARDIPKGGEITTTY 611

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +   TR   L     F CDC RC
Sbjct: 612 TKILWGNLTRGIFLKMTKHFMCDCTRC 638


>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
          Length = 544

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 60/283 (21%), Positives = 117/283 (41%), Gaps = 36/283 (12%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           + E+ G GR L +T+++KAG ++++++P++     P   +   C  C+       ++ C 
Sbjct: 48  IKELPGIGRYLEATRTIKAGDVIMKEAPLI---TGPAQITPPVCLGCYVLLEEGKTTRCD 104

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRL----LNLNSPDAANLPLDRQLQARFLIAA 125
            C   FCS KC+ ++       Y  Q   ++      +  P+   + + R L  R     
Sbjct: 105 KCGWPFCSEKCTHSSVHEPECFYTQQRGEKVSISTYGIPHPNYQCVTVLRCLYQR----D 160

Query: 126 YNLAVVNPSQFQILLAF----QGT-VTDNDTS-AAHYLSSLCPPPASTTIELTAALLAKD 179
           +N+ + N  + Q L       +GT   +ND    A ++ S          +L      ++
Sbjct: 161 HNMKLWN--KLQALETHTEDRKGTDKWENDRKMVAQFIWSF--------FKLERLFNEEE 210

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
            +   G+++  I G +       Y  ++ K S   H+C  N       + +  +N +II+
Sbjct: 211 IMKCCGIIQ--INGHEVPLLEPEYVAVFDKISMVEHNCRANC------NKSFTSNGEIIL 262

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                +P+G  I + Y    +    R+  L     F C C+RC
Sbjct: 263 SAGVAIPRGSHISVCYTDPLWGTEARRHHLSDSKFFECSCERC 305


>gi|149244448|ref|XP_001526767.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449161|gb|EDK43417.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 381

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 59/131 (45%), Gaps = 18/131 (13%)

Query: 157 LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR--RSVRAYGIYLKASFFNH 214
           L     PP   +I      + ++  NAFGL   + E QD    +    + +Y  ASFFNH
Sbjct: 261 LQPFITPPTVRSI------IGRNLSNAFGL---WSETQDSNEDKEFLGFAVYPSASFFNH 311

Query: 215 DCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDF 274
            C PN  +        +N    +   + ++  G E+C++Y     +    +K+ L ++ F
Sbjct: 312 SCEPNIKKI-----RVKNELKFVT--LREIAPGEELCINYGNFQNENVKERKKQLSEWFF 364

Query: 275 ACDCDRCKVEA 285
            C C +C+ ++
Sbjct: 365 DCRCTKCEADS 375


>gi|371943475|gb|AEX61304.1| putative SET domain-containing protein [Megavirus courdo7]
          Length = 157

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            I      FNH CLPN   +         N ++    + D+ +G E+  SY  + +D  T
Sbjct: 62  AILFNGRIFNHSCLPNIVFY-------RCNDEMCFITVRDIYKGEELLDSYVNITHDKKT 114

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ RL   Y F CDC RC
Sbjct: 115 RQSRLWNQYRFHCDCQRC 132


>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
 gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
          Length = 844

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           DGR ++   G++   +  NH+C PNA  +       +N    IVR   D+P+G EI  +Y
Sbjct: 616 DGRETL-LRGLFPLTAIMNHECTPNASHY------FDNGRLAIVRAARDIPKGGEITTTY 668

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +   TR   L     F CDC RC
Sbjct: 669 TKILWGNLTRGIFLKMTKHFMCDCTRC 695


>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
 gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
 gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
          Length = 567

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 121/324 (37%), Gaps = 49/324 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   ++  +K G+ +L + P +  S L    + + C NCF +T+  
Sbjct: 144 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 201

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
            +  CP C+ V +CS +C   A+   H  Y C  +  +     ++N+  A    A+ PLD
Sbjct: 202 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 258

Query: 115 RQLQARFLI----------------------AAYNLAVVNPSQFQILLAFQGTVTDNDTS 152
             L+ +  I                         +     PS F   +     +T+   +
Sbjct: 259 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTNCLRA 318

Query: 153 AAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
             ++ S   P   S    L    L   + N   + E +     GR      G  IY   +
Sbjct: 319 GGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 378

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DYSTRQKRL 268
            FNH C P   R+          T I +  +  +  G  I  +Y P+    + S RQ RL
Sbjct: 379 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQARL 431

Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
              Y F C CD C    NW   D+
Sbjct: 432 KDLYWFECSCDACI--DNWPKFDD 453


>gi|357603991|gb|EHJ63998.1| hypothetical protein KGM_04190 [Danaus plexippus]
          Length = 564

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 65/276 (23%), Positives = 101/276 (36%), Gaps = 33/276 (11%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR +V+T+ ++ G++++  +P L+    P       C  C+R+   S  + C  C    C
Sbjct: 90  GRYIVATRDIEPGEVII-TAPALVVG--PCSGCGLICLGCYRELEESHLTKCSGCKWPLC 146

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP-LDRQLQARFLIAAYNLAVVNPSQ 135
           S  C      + HS Y C+ L ++     PD  N+  L     A   +    L   +PS+
Sbjct: 147 SNSCFGLGKYTGHSKYECETLKKI----PPDYKNMEDLRDSYHAITPLRCLLLKKEDPSK 202

Query: 136 FQILL------AFQGTVTD----NDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
           +  L       A + +  D    ND +    L           +     +L    +NAF 
Sbjct: 203 WCALSEMESHNAIRRSRGDIYPMNDRNVVQRLKKWGLGYDDDEVHTVCGIL---EVNAF- 258

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
                   + G     A  +Y +A    HDC P+    D+        T  I   I  VP
Sbjct: 259 --------EVGASGANARALYSEAYLLAHDCTPSTSHTDHERRPGRPLT--IRAAIRHVP 308

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            G  I L Y         R++ L     F C C RC
Sbjct: 309 -GDVISLCYAYTLQGTLKRREHLAVSKFFMCSCVRC 343


>gi|448124691|ref|XP_004204990.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
 gi|358249623|emb|CCE72689.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
          Length = 484

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 64/332 (19%), Positives = 116/332 (34%), Gaps = 72/332 (21%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS--------SFCHNCF 57
           + + V +   +G+GL + +++K G  +  + P  L+   P  N +        S+C    
Sbjct: 108 DKIKVLQTSKKGKGLYAKKNVKKGDFLWEEKP--LFYVPPLANMTLIKNGKACSYCGKLL 165

Query: 58  RKTMHSSSSI-------CPSCSVAFCSPKC----STAAASSSHSPYVCQALTRLLNLNSP 106
            +    +S +       C  CS  +CS +C    S   AS  H            N+  P
Sbjct: 166 TQGTRGNSGVSYLRGLDCDVCSEIWCSKECKNTDSKIHASLKH------------NVYHP 213

Query: 107 DAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
             +N   +    A   +  Y L     + + I   +   V+D         S++      
Sbjct: 214 SKSNTRKNINANAYLALQDYCLKEQWNALYSITRIYADIVSDKSGVKERQFSAMARISQE 273

Query: 167 TT--------------------IELTAALLAKDRLNAFGLMEP--------------YIE 192
           T                     ++     L K+  + F  + P              Y+ 
Sbjct: 274 TRYKALNSSAGSFDNFQGGALFVQEQQESLWKEGFSLFSNVFPNSVDEGKINYDEFMYML 333

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
           G     ++ +  I+L  S  NH+C PN      V  ++     + V    D+  G E+  
Sbjct: 334 GTFNINNLDSV-IFLIQSHLNHNCEPNTT----VKLSSNRTEGLKVYAARDIKAGEELTT 388

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           SY   ++    RQ+ L  ++ F C C++CK E
Sbjct: 389 SYVNPSHTVQQRQRELRVNWGFRCACEKCKSE 420


>gi|302686770|ref|XP_003033065.1| hypothetical protein SCHCODRAFT_107558 [Schizophyllum commune H4-8]
 gi|300106759|gb|EFI98162.1| hypothetical protein SCHCODRAFT_107558, partial [Schizophyllum
           commune H4-8]
          Length = 422

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 49/107 (45%), Gaps = 10/107 (9%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           L+  + L   G   P  EG DGR       +Y   S  NH C PNA  F +     + + 
Sbjct: 225 LIRANGLTVNGYTLPETEGADGR----YVAVYETLSRANHSCRPNA-HFAF----HKPSF 275

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            + +R + D+  G EI +SY P    Y+ RQ+ L + Y  +C C  C
Sbjct: 276 SVRLRALRDIKAGEEILISYVPPEAPYAQRQEELAH-YGLSCACGVC 321


>gi|384247293|gb|EIE20780.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 453

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 50/111 (45%), Gaps = 11/111 (9%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           L+ +  L++ G  E Y  G +G         Y   S  NH C PNA  F       + + 
Sbjct: 346 LITRPLLDSLG--ESYGAGCEGS------AFYALQSCCNHSCAPNAHAFKRAHIDTDGSA 397

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
            I+ R    +  G E+CLSY   +  Y  R+  L  DY F C+CD+C  EA
Sbjct: 398 TILARK--PIAPGEEVCLSYIDEDAPYHDRRAALA-DYGFTCECDKCVSEA 445


>gi|261330740|emb|CBH13725.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 372

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 107/285 (37%), Gaps = 43/285 (15%)

Query: 17  GRGLVSTQSLKAGQIVL-RDSPILLYSALPFINSSSFCHNCFRKTMHSSSSIC--PSCSV 73
           GRG+V+ ++LK G +V+ R  P + Y       +   C  C  K +      C  P C  
Sbjct: 113 GRGIVARENLKKGTVVIQRAEPWMRYPV-----ADGLCALCT-KPLPERFFTCTNPRCHE 166

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            +CS  C T A S  HS        R   +    A  L L  Q+++       N A    
Sbjct: 167 EYCSRDCRTIALSRYHS--------RTCRIEGLQAIELDLFSQMKSSEAATDKNSAAAQL 218

Query: 134 SQFQILLA-FQGTVTDNDTSAAHYLSS-LCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
              ++L A  Q  +  +  S    LS  L   PA+   +         R  +     PY 
Sbjct: 219 LMLRVLAASLQQQIVPSALSEVRILSGRLLFSPAALAGKFLEIYERFTRACSTSTSIPYE 278

Query: 192 E--GQDGRRSVRAYG------IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           E  G   R +   +       + L  S  NH C  N          AE+     +R   D
Sbjct: 279 EMIGVLARVTTNCFHRENCVELSLPRSLLNHSCDAN---------VAEDAETGEMRTTQD 329

Query: 244 VPQGREICLSYFP----VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           V +G E+ ++Y+P    +NY   +R+   L    F C C RC+ E
Sbjct: 330 VARGTELTINYYPQLKELNYKSRSRE---LERRGFVCCCSRCRKE 371


>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
           grubii H99]
          Length = 447

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 23/119 (19%)

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAENNTDII 237
           ++NAF L  P         S+   G+ +    +  NH C PNA     +    +   DI 
Sbjct: 122 QVNAFTLTSP---------SLAPIGMSISPLLALANHSCEPNA-----IAVFPKGGRDIF 167

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV-------EANWSD 289
           +  ++D+P G EI  SY   +  Y  RQ  LL  Y F C C  C+        E +W D
Sbjct: 168 LVALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSESTANGEQSWVD 226


>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
          Length = 763

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  S+ NH C  N+ R    D        +IVR   D+ +G EI ++Y         
Sbjct: 517 GLWIMPSYINHSCWQNSTRSFLGDL-------LIVRAARDLSEGEEITINYMENESGVQK 569

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQK  L ++ F C C  C++E
Sbjct: 570 RQKAFLSEWGFECKCTMCEIE 590


>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
           mellifera]
          Length = 387

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 10/96 (10%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNY 259
           G+Y+  S  NH C+PNA   F Y      +N  ++++ I D+    EIC+SY     +  
Sbjct: 289 GLYILQSSVNHSCVPNAIVEFPY------SNNVLVLKAIRDIHPEEEICISYLDECCLER 342

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
              +RQK L   Y F C C++C  +AN  D  ++DE
Sbjct: 343 SRHSRQKALNSLYLFQCYCNKCLSQANDLDLTSEDE 378


>gi|339897286|ref|XP_003392344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399152|emb|CBZ08492.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 622

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 97/235 (41%), Gaps = 31/235 (13%)

Query: 58  RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ 116
           +K  HS+ + C    V +CSP C+ A     H  + C  +TRL ++    +A  L +  Q
Sbjct: 254 KKEKHSTCTTCDM--VVYCSPACADAH-REQHERFECARMTRLSDMMEGIEARKLDVPEQ 310

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP--PPASTTI---EL 171
               F   AY+  +   +  +   A    V          + SL P  P  S      E 
Sbjct: 311 ----FFELAYH-CITTLAGIKAKRAGHERVLTLTAHVDEVIQSLHPIGPLMSDLFKGEEE 365

Query: 172 TAALLAKDRL---NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDA 228
           TA L     +   NA  + +P   G    +++ A      ASFFNH C PN C  D V  
Sbjct: 366 TATLYTIIGVLCCNALEVTDP--SGLGVAQALHAGNSI--ASFFNHSCTPN-CAIDTVRH 420

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
           A      I  R IH    G E+ ++Y P + +    R++RL   Y F C C RC+
Sbjct: 421 AI-----ITTRTIH---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQ 467


>gi|302850072|ref|XP_002956564.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
           nagariensis]
 gi|300258091|gb|EFJ42331.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
           nagariensis]
          Length = 1124

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 7/73 (9%)

Query: 205 IYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           +YL AS FNH C P     F  +DAAA            D+  G E+C+SY  V   Y+ 
Sbjct: 497 VYLVASLFNHSCEPTLELSFPDLDAAA------CFVAARDIAPGEELCVSYLDVGLPYNV 550

Query: 264 RQKRLLYDYDFAC 276
           RQ+ L + Y F C
Sbjct: 551 RQRHLKWSYGFVC 563



 Score = 37.7 bits (86), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 35/63 (55%), Gaps = 9/63 (14%)

Query: 1   MSAVSETVTVA-----EIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCH 54
            +   ET T+      ++ GRGRGLV+  S+  GQ+V R+ P++   A P + ++   C+
Sbjct: 186 FTVAQETATLGIQTNPQLPGRGRGLVAVGSIPRGQVVHREVPLV---AFPELGATHGVCY 242

Query: 55  NCF 57
           +C 
Sbjct: 243 HCL 245


>gi|189242482|ref|XP_001810915.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
          Length = 538

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 69/301 (22%), Positives = 117/301 (38%), Gaps = 44/301 (14%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + + + E  GR +V+T++++ G+I+  + P   Y+++   + S +CH C +   +     
Sbjct: 88  IEIMQNERYGRHVVATRNIEMGEILAIEKP---YASIVTDSVSVYCHECLKLCYNMIP-- 142

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C  C+ A +CS  C   A  S H  Y C     L  L    +  L L   L +R   A  
Sbjct: 143 CDKCTKALYCSDNCKDKAYESYHK-YECPIHLSLDPLLIDSSKRLALRIALISRNEWAG- 200

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK-------- 178
           +L   +P +      F+     +      +   L      TTI      L K        
Sbjct: 201 SLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDL---FGRTTIACGVFYLIKKYTTFLQE 257

Query: 179 ---DRLNAFGLMEPYIEGQDGRR------SVRAYGI-------YLKASFFNHDCLPNACR 222
              DR     L    I   +  R      ++  Y +       Y   S FNH C PN CR
Sbjct: 258 YDEDRFKEILLSLLLICSTNTVRVNEVSSTLGEYDVCGFACSHYPFFSMFNHSCWPNVCR 317

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDR 280
                  + + + +++R I  + +G +  ++Y P  ++ +   RQ  L + Y F C C  
Sbjct: 318 -------SYHGSQMVLRAIRTIKKGEQCFVTYGPSYLSDNIVGRQAFLFFHYFFNCGCKA 370

Query: 281 C 281
           C
Sbjct: 371 C 371


>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
 gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
          Length = 393

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y+  S  NH C+PNA   F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYILQSKINHSCVPNAQSTFPY------SNDIVVLKAVAPIEKGDEICISYLDECQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKTLRENYIFVCQCPKCRAQAS 368


>gi|255562926|ref|XP_002522468.1| protein with unknown function [Ricinus communis]
 gi|223538353|gb|EEF39960.1| protein with unknown function [Ricinus communis]
          Length = 538

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 41/96 (42%), Gaps = 12/96 (12%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
            +G+++ ASF NH C PNA R    D        ++V    D+  G EI   YF V    
Sbjct: 326 GFGLWVLASFINHSCNPNARRLHVGD-------HVLVHASRDIKAGEEITFPYFDV---L 375

Query: 262 STRQKRLLYD--YDFACDCDRCKVEANWSDNDNDDE 295
           S   KR+     + F C C RCK E      D   E
Sbjct: 376 SPLYKRMAMSKAWGFRCQCKRCKFEEQVCSKDEVKE 411


>gi|408398201|gb|EKJ77334.1| hypothetical protein FPSE_02412 [Fusarium pseudograminearum CS3096]
          Length = 543

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 54/102 (52%), Gaps = 6/102 (5%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L+AS  NH C  NA + DY +    +     V  + D+ +G EI ++Y  +  +  T
Sbjct: 104 GVFLQASRINHACDNNAQK-DYNEGIKRHT----VHALRDIEEGEEITITYLGILKNRRT 158

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQ 305
           RQ+ L   + F C C+ C +  + SD ++D   +E +  ED+
Sbjct: 159 RQQALRTKFMFTCTCNLCSLPEDLSD-ESDRRLDEILALEDR 199


>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
          Length = 816

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 79/332 (23%), Positives = 127/332 (38%), Gaps = 87/332 (26%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV-A 74
           +GRG+ S   +    ++  + P+ +    P     + CH CF +   +    CPSC++  
Sbjct: 250 KGRGMTSPNDISPASLIHAEDPLAVIIMKPC--RDTHCHYCFSEA-PADVVFCPSCTIPI 306

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARFLI 123
           +CS +C   A         C   TRL  N N  D A L         P  +Q+ + R   
Sbjct: 307 YCSNRCQEKAIGQ----MSCNQNTRLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRHEC 362

Query: 124 AAYNLAVVNPSQF-------------QILLAFQGTVT----------DNDTSAAHYLSS- 159
              + A V P+               Q+L   + T++          D D+SA+ + S  
Sbjct: 363 GGVHWAAVLPADIVLAGRIMAQYIEKQLLAGKRSTISGPNLDLVHHYDQDSSASKFESHI 422

Query: 160 ------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR------- 198
                 LC           A  ++     L+ + ++N+  ++  +++  DG +       
Sbjct: 423 YATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGGKVLTKGFS 480

Query: 199 -----------SVR-AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
                       VR A  IY+  SFFNH C PN      + A   + T +I+R    +  
Sbjct: 481 GFSGDVMCSVEQVRVAQAIYMSGSFFNHSCQPN------IHAYFHSRT-LILRSTEYIKA 533

Query: 247 GREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
           G  I LSY P     D   RQK L  +Y F+C
Sbjct: 534 GSPIELSYGPQVGEMDLPERQKSLRENYYFSC 565


>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
           florea]
          Length = 387

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 8/95 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
           G+Y+  S  NH C+PNA     V+    NN  ++++ I D+    EIC+SY     +   
Sbjct: 289 GLYILQSSVNHSCVPNA----IVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
             +RQK L   Y F C C++C  +AN  D  ++DE
Sbjct: 344 RHSRQKALNSLYLFQCYCNKCLSQANDLDLTSEDE 378


>gi|223634690|sp|A5DQN2.2|SET5_PICGU RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|190348931|gb|EDK41485.2| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 483

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           I+L  S  NH C PN    D   + A     + V    D+  G E+  SY   ++    R
Sbjct: 342 IFLTQSHLNHSCHPNT---DVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQR 398

Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
           Q+ L  ++ F C C RCK EA 
Sbjct: 399 QRELRVNWGFICSCQRCKDEAK 420


>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
           vitripennis]
          Length = 386

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y+  S  NH C PNA   F + D+       ++VR + D+    EIC++Y     +  
Sbjct: 290 GLYVLQSAINHSCAPNASVEFPHSDSR------LVVRALRDIKPDEEICIAYLDECHLER 343

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +RQK L   Y F C CD+C+ +A+
Sbjct: 344 SRHSRQKALSSLYLFVCKCDKCQQQAS 370


>gi|307102606|gb|EFN50876.1| hypothetical protein CHLNCDRAFT_141673 [Chlorella variabilis]
          Length = 510

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 5/112 (4%)

Query: 174 ALLAKDRLNAFGLMEPYIEG-QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           AL A ++  AF    P+++   D          Y   S FNH C PNA  F   +   + 
Sbjct: 394 ALPAGEQAAAFQAAGPFVDALPDELPGCEGNAFYALHSCFNHSCDPNAEAFKRDE---DE 450

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +   ++  + D+  G E+ LSY         R+++L  DY F C CD+C+ E
Sbjct: 451 DGGAVILALRDIQAGEEVTLSYIDEEAPLEERRQQL-ADYGFRCACDKCQAE 501


>gi|146413184|ref|XP_001482563.1| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 483

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           I+L  S  NH C PN    D   + A     + V    D+  G E+  SY   ++    R
Sbjct: 342 IFLTQSHLNHSCHPNT---DVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQR 398

Query: 265 QKRLLYDYDFACDCDRCKVEAN 286
           Q+ L  ++ F C C RCK EA 
Sbjct: 399 QRELRVNWGFICSCQRCKDEAK 420


>gi|157117568|ref|XP_001658830.1| hypothetical protein AaeL_AAEL008032 [Aedes aegypti]
 gi|108876001|gb|EAT40226.1| AAEL008032-PA [Aedes aegypti]
          Length = 527

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 64/280 (22%), Positives = 111/280 (39%), Gaps = 35/280 (12%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFRKTMHSSSSICP-S 70
           EG GRG+V+T+ +   +++  D PIL+    P +N+    FC +C R  +    ++C   
Sbjct: 73  EGSGRGMVATRDIAENELIFMDRPILVG---PRVNNYDVIFCASCCR--LQKKLTLCSGG 127

Query: 71  CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
           C +  CS    +AA  +SH+     A  R +N   P   N      L A   I  + L  
Sbjct: 128 CRLPVCSECDISAAPETSHT-----AECRRINSWEPKDQNRYCKTILYALTSIRGFLLDD 182

Query: 131 VNPSQFQILLAFQGTVTDNDTSAA-------HYLSSLCPPPASTTIELTAALLAKDRLNA 183
           ++ S   I+L  Q      D +            + L  P   + ++    ++     NA
Sbjct: 183 LSRS---IVLNMQAHPPRKDMTTEVDRILKDQVFNKL--PEDGSDVQYLRKVIDVLNTNA 237

Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           F       +  +    +   G+Y+  +  NH C PN  R+ +       +  +++     
Sbjct: 238 FETFRMVKDEDNNDHEIVLRGLYILGALMNHCCRPNI-RYVF-------DEKLVMHCYAS 289

Query: 244 VP--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            P  +G +I  +Y    +    R   L +   F C CDRC
Sbjct: 290 KPIRKGEQIFNNYSKSLWGTQHRIIHLCFSKHFLCGCDRC 329


>gi|167342074|gb|ABZ68985.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFNILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
 gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
          Length = 302

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       EN    +VR   D+P+G EI  +Y  + +   T
Sbjct: 85  GLFPLTAIMNHECTPNASHY------FENGRLAVVRAARDIPKGGEITTTYTKILWGNLT 138

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L     FACDC RC
Sbjct: 139 RNIFLKMTKHFACDCVRC 156


>gi|66828109|ref|XP_647409.1| hypothetical protein DDB_G0268386 [Dictyostelium discoideum AX4]
 gi|60475711|gb|EAL73646.1| hypothetical protein DDB_G0268386 [Dictyostelium discoideum AX4]
          Length = 676

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 68/305 (22%), Positives = 125/305 (40%), Gaps = 40/305 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VA 74
           +GR L+S + +K G+ +LR +P        ++  S FC  C++K    +   C  C   A
Sbjct: 175 KGRYLISKRLIKRGEEILRIAPFSFSLNGKYV--SKFCGVCYQKKSIDNLLKCKDCDDFA 232

Query: 75  FCSP-KCSTAAASSSHSPYVCQALTRLLN----LNSPDAAN-------LPLDRQLQARFL 122
            C   K      S  H   +C  L +L N    +N+ + AN       L  +R+L    +
Sbjct: 233 ICKECKAPNEPLSKYHPNDLC-VLYKLNNEYYKMNNFEIANEDNKERGLTDERRLYINTI 291

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTD---------NDTSAAHYLSSLCPPPASTTIELT- 172
              +     N +  ++  A   +V D         N T+ +    S      +  +EL  
Sbjct: 292 YNIFKNKTNNKAVVEVDNAISSSVIDVLNLSKKLLNHTNESQRAYSDSTKMKTDILELIF 351

Query: 173 -----AALLAKDRL-NAFGLMEPYIEGQDG--RRSVRAYGIYLKASFFNHDCLPNACRFD 224
                A  L KD + N   +++    G      +  +A G+Y  + + NH C+PN+    
Sbjct: 352 KSFYGANTLKKDEISNIIRIVDCNSHGYSDVISQKKKASGLYPFSCYTNHSCIPNS---- 407

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
             D   + +  +++    D+ +   I +SY         R+K LL  ++F C CD+C  +
Sbjct: 408 --DYFIDKHGVMVLYASSDIQESEAITISYIYFLNRVEKRRKDLLNGHNFFCTCDQCSFQ 465

Query: 285 ANWSD 289
           ++ ++
Sbjct: 466 SSLTE 470


>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 576

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 5/84 (5%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G++   S  NH C PNA     VD  +   T  +   + D+ Q  ++ LSY       
Sbjct: 420 GIGLFRAVSMTNHSCWPNA----EVDYPSLTKTAQVT-ALRDIAQDEQVLLSYIDETLPL 474

Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
           ++RQ+ L   Y F C C RC+VEA
Sbjct: 475 ASRQRLLKRHYKFTCACVRCQVEA 498


>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 691

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 8/104 (7%)

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
           +NA G++   +   D + ++ A GIY  AS  NH C PN  +  Y+D        +IVR 
Sbjct: 387 VNAAGIIHSNVIEDDVQINIVATGIYPSASMMNHSCNPNIIKI-YMDQY------LIVRA 439

Query: 241 IHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKV 283
           + D+    EI  SY     Y  +  +++LL  Y F+C C+ C V
Sbjct: 440 VEDIFPTEEIFNSYVATYRYKKTKARQKLLELYYFSCKCEACTV 483


>gi|242016107|ref|XP_002428677.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212513348|gb|EEB15939.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 697

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 126/319 (39%), Gaps = 64/319 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
           GR   +   ++ G  ++ + P      LP +NSS  CH+CF +    +   C +CS VAF
Sbjct: 274 GRFAAAANKIEIGDTIVVEEP-QTSVLLPKMNSSH-CHHCFTRLF--APIPCKTCSGVAF 329

Query: 76  CSPKCSTAAASSSHSPYVCQ----------ALTRLLNLNSPDAANLPLDRQLQARF---- 121
           CS +C  +A+++ H  Y CQ          +L   L+L       L   +QL ++     
Sbjct: 330 CSVQCRDSASNTYHR-YECQLLGFFMGSGMSLISYLSLRMVTQEGLIFFKQLHSKLKQMG 388

Query: 122 ---------------LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS 166
                           +  YNL   N         F  TV          L++    P +
Sbjct: 389 KCPEDSCDLTKREKNYLNVYNLT-TNCKLRTSKDFFTRTVMAIFLVKCLKLTNYFESPEN 447

Query: 167 TTIE-------LTAALLAKD----RLNAFGLMEPYIE----GQDGRRSVRAYGIYLKASF 211
            T +       LTA+++       + NA  + E  +E     +D +    A G+Y   + 
Sbjct: 448 KTTDDFNEDEILTASIILHHLELLQFNAHEIYETRVEELYKTEDSKTVFIAIGVYPTVAL 507

Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST---RQKRL 268
           FNH+C P+  R+        +  +II++ +  +     +  +Y P +Y   T   R + L
Sbjct: 508 FNHECSPSVTRYF-------SGKNIIIKAVRPLATNDILSDNYGP-HYGKKTLLERTREL 559

Query: 269 LYDYDFACDCDRCKVEANW 287
              Y F C C  C +E NW
Sbjct: 560 TSRYWFRCRCQAC-IE-NW 576


>gi|383142259|gb|AFG52489.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
 gi|383142264|gb|AFG52494.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
          Length = 155

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 14  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 67  RQGR-CENWGFECKCRRCIVE 86


>gi|189239848|ref|XP_973542.2| PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
          Length = 829

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 58/297 (19%), Positives = 115/297 (38%), Gaps = 27/297 (9%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTM 61
            +V +   + E    GR LV+T++LK G++V++++P++     P  NS   C  C +   
Sbjct: 40  KSVCQPFKICEDAVLGRHLVATRALKPGEVVIQEAPLIWG---PSYNSIPVCLGCGKAVD 96

Query: 62  HSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQ----ALTRLLNLNSPDAANLPLDRQL 117
            + S  C  C    CS  C  + +      Y  Q       ++  +  P    + + R L
Sbjct: 97  ETCSRPCQKCGWPVCSDLCEKSPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCL 156

Query: 118 -QARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPP-ASTTIELTAAL 175
            Q +FL          P  ++ L   Q  + ++ ++  +    +          +L+A  
Sbjct: 157 YQKQFL----------PKNWKKLETLQNHLEEHKSTTNYESDRVTIAEFIRRFFKLSATF 206

Query: 176 LAKDRLNAFGLMEPY-IEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENN 233
             +D +   G    + + G +   +   +  IY  AS   H C PN  +      +    
Sbjct: 207 SEEDIMKVHGSTLIFQVNGHEVPLTEPFHVAIYSSASMLEHSCGPNCTK------SFTKQ 260

Query: 234 TDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
             I++     + +G  + + Y    +   +R+  L     F C C+RC+  + +  N
Sbjct: 261 GHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPSEFGTN 317


>gi|167342090|gb|ABZ68993.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|167342070|gb|ABZ68983.1| SET domain-containing protein C [Pinus taeda]
 gi|167342072|gb|ABZ68984.1| SET domain-containing protein C [Pinus taeda]
 gi|167342076|gb|ABZ68986.1| SET domain-containing protein C [Pinus taeda]
 gi|167342078|gb|ABZ68987.1| SET domain-containing protein C [Pinus taeda]
 gi|167342080|gb|ABZ68988.1| SET domain-containing protein C [Pinus taeda]
 gi|167342082|gb|ABZ68989.1| SET domain-containing protein C [Pinus taeda]
 gi|167342084|gb|ABZ68990.1| SET domain-containing protein C [Pinus taeda]
 gi|167342100|gb|ABZ68998.1| SET domain-containing protein C [Pinus taeda]
 gi|167342106|gb|ABZ69001.1| SET domain-containing protein C [Pinus taeda]
 gi|167342108|gb|ABZ69002.1| SET domain-containing protein C [Pinus taeda]
 gi|167342110|gb|ABZ69003.1| SET domain-containing protein C [Pinus taeda]
 gi|167342112|gb|ABZ69004.1| SET domain-containing protein C [Pinus taeda]
 gi|167342114|gb|ABZ69005.1| SET domain-containing protein C [Pinus taeda]
 gi|167342118|gb|ABZ69007.1| SET domain-containing protein C [Pinus taeda]
 gi|167342120|gb|ABZ69008.1| SET domain-containing protein C [Pinus taeda]
 gi|167342122|gb|ABZ69009.1| SET domain-containing protein C [Pinus taeda]
 gi|167342126|gb|ABZ69011.1| SET domain-containing protein C [Pinus taeda]
 gi|167342130|gb|ABZ69013.1| SET domain-containing protein C [Pinus taeda]
 gi|167342132|gb|ABZ69014.1| SET domain-containing protein C [Pinus taeda]
 gi|167342144|gb|ABZ69020.1| SET domain-containing protein C [Pinus elliottii]
          Length = 141

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|396460396|ref|XP_003834810.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
 gi|312211360|emb|CBX91445.1| hypothetical protein LEMA_P069530.1 [Leptosphaeria maculans JN3]
          Length = 388

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 14/119 (11%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T E+   L ++D  N+FG+     +G +       YG +  AS+FNH C PN  +     
Sbjct: 280 TSEMIFTLSSRDSHNSFGVRSLEDDGSE----FFGYGCWPAASYFNHSCHPNVEK----- 330

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL---YDYDFACDCDRCKV 283
               +      R    + +G E+C++Y        +R KR+L     + F C C+RC+ 
Sbjct: 331 --QRDGRAWTFRARRAIAKGDELCITYLSGEERKLSRAKRMLRLKKTWGFDCSCERCEA 387


>gi|344231392|gb|EGV63274.1| hypothetical protein CANTEDRAFT_105840 [Candida tenuis ATCC 10573]
          Length = 478

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 71/326 (21%), Positives = 108/326 (33%), Gaps = 69/326 (21%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFR--KT 60
           E V V     RG+GL +  ++  G ++  + P       + L  I S   C  C +   +
Sbjct: 107 EKVKVIMTTKRGKGLYAKSNIAKGTLIWEEKPFFFIPPLANLKLIKSGKACTYCGKLLTS 166

Query: 61  MHSSSSI-------CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL 113
             SSS +       C  C   +CS  C     +         A  + L+     A N   
Sbjct: 167 ARSSSGVSMLRGLDCTGCPELWCSMSCKKMNGT-------LHASLKHLSKGRKSALNPEA 219

Query: 114 DRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--- 170
              L+   +   +N      + F I L F   V D       Y+ S+     +T  +   
Sbjct: 220 FLDLEEYCIKEQWN------ALFAITLMFADIVEDKSGEKLKYVKSMARVSQATRYKALE 273

Query: 171 --------LTAALL---------AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA---- 209
                   LT   L          K+  + F  + P     DG      Y   L +    
Sbjct: 274 SSAGSFDSLTGGALFVKEQQENLWKEGFSKFSKVFPKATS-DGTVDYNEYLTLLGSYNIN 332

Query: 210 ----------SFFNHDCLPNACRFDYVDAAAENNT---DIIVRMIHDVPQGREICLSYFP 256
                     S  NH C PN      VD     N+    I V    D+  G E+  +Y  
Sbjct: 333 NLDSCVFRIQSHLNHTCNPN------VDVETSPNSRYEGIKVFAAKDIKAGEELSTTYVN 386

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCK 282
            N+    RQ+ L  ++ F C C++CK
Sbjct: 387 PNHTVLQRQRELRANWGFTCSCNKCK 412


>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
          Length = 697

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 29/143 (20%)

Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
           L   G+   N T  A + + LC           AAL    + N+FGL        DG  +
Sbjct: 528 LGEGGSTEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
                +Y +AS+FNH C PN CR  Y    A          + ++ +G  + + Y  V  
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619

Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
             +  ++R L+  Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642


>gi|384490543|gb|EIE81765.1| hypothetical protein RO3G_06470 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 119/284 (41%), Gaps = 37/284 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS----CS 72
           G  +V+   LK G+ +L + P +    L   N ++ C  CF +       +C +      
Sbjct: 43  GERVVANAKLKQGRTILEEDPFI--RQLNDANRTTHCTYCFTE-FKKIKKLCRNKDCKWE 99

Query: 73  VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL---- 128
           V +CS  C      + H  + C    R  NL + D   L   +   A    + Y L    
Sbjct: 100 VLYCSLNCQQQHWLTEHK-WFC----RFPNLANTDKNVLFALQGYIASRSKSEYTLPGLV 154

Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
           + ++  + + L  ++  + +++ S   +L+          IE    ++A+ R N F + +
Sbjct: 155 SNIDSYKPEDLEEYRKKIGNSELSKVFHLTE-------GVIESMVVIMAQIRCNTFAVKQ 207

Query: 189 PYI----EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN--TDIIVRMIH 242
             I    E ++         IYL AS FNH C PNA     +     NN  + + V  I 
Sbjct: 208 FVISKTLEIEERETITLGRAIYLTASKFNHSCNPNA-----LVLFGTNNVGSHLKVHFIG 262

Query: 243 D-VPQGREICLSYFPVNYDYSTRQKRLLY--DYDFACDCDRCKV 283
           D   +  EI +SY PV+  +S +Q++     +Y F C+C  C++
Sbjct: 263 DEAKKNAEINISYGPVSTRHSKKQRKKKLKDEYFFDCNCSSCQL 306


>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 677

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 73/323 (22%), Positives = 117/323 (36%), Gaps = 67/323 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-V 73
           GR  ++T+ ++ G+I+  + P    L +    IN    C  CF K      ++C +CS V
Sbjct: 252 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLIN----CFYCFTKIFVPIPAVCQTCSCV 307

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--LIAAYNLAVV 131
           A+CS  C    A    +   C  L  L    +     L L   +Q  F  L    ++   
Sbjct: 308 AYCSISCRDKDAKIHENE--CSILPTLWASKTSINCFLALRIIVQQSFEKLYKLKDVKEN 365

Query: 132 NPSQFQI----------------LLAFQGTVTDNDTSAAHYLSSLC----------PPPA 165
           +  +F++                L+  + T T  D     Y++S            P   
Sbjct: 366 SKDKFEVSASEPYRSNDFKIMFRLVTHEDTRTVEDLFHRTYIASWLLRLLKKGPYFPKHV 425

Query: 166 STTIELTAAL--------------LAKDRLNAFGLMEPYIEGQD-----GRRSVRAYGIY 206
            T   + A L              L   + NA  + E  I   D      +      G+Y
Sbjct: 426 KTPDTIEAKLSDGELYIGGLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLY 485

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTR 264
              S FNH C P   R+          T ++VR I  +  G EI  +Y  +      S R
Sbjct: 486 PTISLFNHSCNPGIIRYFI-------GTTMVVRAIRSISSGEEISENYGQIFTTTPESER 538

Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
           +++L   Y F C+C+ C+   +W
Sbjct: 539 KRKLRLQYFFDCNCEACR--EHW 559


>gi|167342142|gb|ABZ69019.1| SET domain-containing protein C [Pinus elliottii]
          Length = 141

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 697

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 60/143 (41%), Gaps = 29/143 (20%)

Query: 140 LAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRS 199
           L   G+   N T  A + + LC           AAL    + N+FGL        DG  +
Sbjct: 528 LGEGGSTEANLTVTATFFNRLC-----------AAL----QCNSFGLF-----NADG--N 565

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
                +Y +AS+FNH C PN CR  Y    A          + ++ +G  + + Y  V  
Sbjct: 566 CIGVALYPEASYFNHSCCPNICRVTYRGILA------AFHALREIRKGEPLTICYVDVQE 619

Query: 260 DYSTRQKRLLY-DYDFACDCDRC 281
             +  ++R L+  Y F C+C RC
Sbjct: 620 TSTAERRRTLFSSYRFFCECARC 642


>gi|425701087|gb|AFX92249.1| putative set domain-containing protein [Megavirus courdo11]
          Length = 157

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 36/78 (46%), Gaps = 7/78 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            I      FNH CLPN   +   D        I VR   D+ +G E+  SY  + +D  T
Sbjct: 62  AILFNGRIFNHSCLPNIVFYRCNDVMCF----ITVR---DIYKGEELLDSYVNITHDKKT 114

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ RL   Y F CDC RC
Sbjct: 115 RQSRLWNQYRFHCDCQRC 132


>gi|294656447|ref|XP_458717.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
 gi|218511783|sp|Q6BSV3.2|SET5_DEBHA RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|199431481|emb|CAG86859.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
          Length = 493

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 69/318 (21%), Positives = 114/318 (35%), Gaps = 47/318 (14%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNCFRKTMH 62
           E V +     RG+GL +   +  G ++  +S +   + L     I     C  C +    
Sbjct: 116 EKVHIVMTAKRGKGLYAKNKISKGDLIWEESQLFYIAPLANVNLIKGGKACSYCGKLLTQ 175

Query: 63  SSSSI---------CPSCSVAFCSPKCSTAAAS-------SSHSPYVCQALTRLLNLNS- 105
           SSS+          C  CS  +CS  C    +        + ++P       +L+N N+ 
Sbjct: 176 SSSNAGISVLKGLDCNVCSEIWCSQTCKKIDSQLHGMLKHNLYNPNSSGGSKKLINANAY 235

Query: 106 PDAANLPLDRQLQARFLIAAYNLAVV------NPSQFQILLAFQGTVTDND-TSAAHYLS 158
            +  +  L  Q  A + I      ++         QF+ +      V      S+A    
Sbjct: 236 LELQDYCLKEQWNALYAITIIYANIITDKSGIKGKQFRAMARVSQDVRYKALNSSAGAFD 295

Query: 159 SLCPPPASTTIELTAALLAKDRLNAFGLMEPY------IEGQDGRRSVRAYGI------- 205
           +L     +  +      L KD  N F  + P       I  Q+    +  Y I       
Sbjct: 296 NL--QGGALFVGEQQEKLWKDGYNRFKNVFPKSVDNGDISYQEFMYMMGTYNINNLDSCI 353

Query: 206 YLKASFFNHDCLPNACRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           YL  S  NH+C PN      V+ +    T+ + V    D+  G E+  +Y    Y    R
Sbjct: 354 YLTQSHLNHNCDPNT----NVETSTTVRTNGLKVFAARDIRAGEELTTTYVNPAYTVQQR 409

Query: 265 QKRLLYDYDFACDCDRCK 282
           Q+ L  ++ F C C +CK
Sbjct: 410 QRELRVNWGFMCGCQKCK 427


>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
 gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
          Length = 387

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
            +Y   S  NH C PNA C+F +      +N  + +  + D+  G EIC+SY     +  
Sbjct: 287 ALYSLQSKVNHSCAPNAECQFPH------SNNVLALTTVQDIAAGEEICISYLDECALER 340

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +RQK L  +Y F C C++C+ + N
Sbjct: 341 SRHSRQKMLRENYLFQCQCEKCESQVN 367


>gi|383142258|gb|AFG52488.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
 gi|383142260|gb|AFG52490.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
 gi|383142261|gb|AFG52491.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
 gi|383142262|gb|AFG52492.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
 gi|383142263|gb|AFG52493.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
          Length = 155

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 14  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 67  RQGR-CENWGFECKCRRCIVE 86


>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
 gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 537

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 74/297 (24%), Positives = 112/297 (37%), Gaps = 51/297 (17%)

Query: 17  GRGLVSTQSLKAGQ-IVLRDSPILLYSALPF-INSSSFCHNC--------FRKTMHSSSS 66
           GR + ST+   AG+ I + ++P+L   ALP   N  + C+ C        F   + +   
Sbjct: 23  GRTVFSTERFGAGETIAILENPLL---ALPDDANMRTTCNYCLYVSGTIEFEGDVKAGPR 79

Query: 67  ICPSCS----VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
            C +C+      +C+ +C        H    C+   R+      D    P+    Q   L
Sbjct: 80  TCKACTGCKAAVYCNAECQRQHWKLVHKAE-CKMFKRIRERTGKDWVPTPVRAVAQVMLL 138

Query: 123 IAAYNLAVVNPSQFQILLA--FQGTVTDND---------TSAAHYLSSLCPPPASTTIEL 171
           + A +  V         L    +G  TD +         T A  Y   L    +  T++ 
Sbjct: 139 LKAGDEEVTKAFGPGGTLESNVEGFKTDEELWKDFELQATGAVVYAGLL---QSDETLKQ 195

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAA 229
              +L K + NAF   +    GQ G        IYL  S    NH C+PNA    Y+   
Sbjct: 196 AMEVLCKIQTNAFNRFDADT-GQAG--------IYLHPSLSMVNHSCVPNA----YITF- 241

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
                   ++   D+  G EI +SY         RQ+ L   Y F C+C RCK + N
Sbjct: 242 --EKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESLRL-YHFQCNCVRCKDDLN 295


>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
          Length = 530

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 71/299 (23%), Positives = 118/299 (39%), Gaps = 59/299 (19%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-CHNCFR---KTMHSSS 65
           VA  E  GR L + ++L+AG+++LR+ PI   +  P  +S+ + C  C R   K    + 
Sbjct: 17  VACSEKFGRYLQAAKNLQAGEVILREKPI---AVGPITSSNDYLCFACLRLLPKLKKGTQ 73

Query: 66  SICPSCSVAFCSPKCSTAAAS--SSHSPYVCQALTRLLNLNSPDAAN-----LPL----- 113
            +C  C++A   P C TA       H+ Y C+      NL++ +        LPL     
Sbjct: 74  YVCSKCNIA---PLCGTACQEQPKHHTSYECEIFKTNENLSADNIETLIGILLPLRLWLV 130

Query: 114 ---DRQLQARF------LIAAYNLAVVNPSQFQILLAFQG--TVTDNDTSAAHYLSSLCP 162
              D +L  R       +    N +V    +  ++   +    + ++DTS   YL  LC 
Sbjct: 131 KQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVMKALHLIPEDDTSIPEYLQQLC- 189

Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
                       +L    +N F L  P      G   +   G+Y++ASF  HDC  N   
Sbjct: 190 -----------GILD---VNTFELRSP-----GGLDGLLLRGLYVEASFMAHDCRGNT-- 228

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                   +++  + V     + +G  I  +Y         R++ L     F C+C  C
Sbjct: 229 ----HLTVDDHFQLTVYASIPIKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLC 283


>gi|383142266|gb|AFG52496.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
          Length = 155

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 14  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 66

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 67  RQGR-CENWGFECKCRRCIVE 86


>gi|401422457|ref|XP_003875716.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491955|emb|CBZ27228.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 872

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/181 (27%), Positives = 74/181 (40%), Gaps = 35/181 (19%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           L A  + N+FG+        D +     +GIY +AS+FNH C+PN CR   +        
Sbjct: 672 LCAAAQCNSFGVY-------DTQEHCIGFGIYPEASYFNHSCVPNLCRV--MHHGGRIAA 722

Query: 235 DIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
              +R I   PQ   + + Y  V   + + R++ LL  Y F C C+RC  +A        
Sbjct: 723 FYALRAI--APQ-EPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKA-------- 771

Query: 294 DENNEEVMDEDQDEQMVASDDDAEVHGDTNF-----PHAYFFMRYMCDRDNCWGTLAPLP 348
                    E     + A+ + A V G  +      P A+     +C   +  G L PLP
Sbjct: 772 ---------EGPQMAIAAAGNRASVSGPRSGCVEVSPAAFENPLLLCAECSIHGYLRPLP 822

Query: 349 P 349
           P
Sbjct: 823 P 823


>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Meleagris gallopavo]
          Length = 486

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 112/290 (38%), Gaps = 29/290 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSS 64
           E+V V   EG+GRGL + +    G ++  +     Y+A+ F + +   CH CF++     
Sbjct: 2   ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVVCHTCFKR--QEK 56

Query: 65  SSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL- 122
              C  C  A+ C   C   A  +  +   C A+ +  +  +P   N+ L  ++  R   
Sbjct: 57  LHRCGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRIER 111

Query: 123 ----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
               ++   L  ++  Q  +    +    D       +L           ++  + +   
Sbjct: 112 EGSGLSENCLVSIDDLQNHVENFDEEEKKDLRIDVESFLEFWPAQSQQFGMQYISHIFGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAAEN 232
              NAF L +     Q G ++V   GI+      NHDC PN          + V +    
Sbjct: 172 INCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHT 225

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              I +R +  +  G E+ +SY         RQK+L   Y F C C+ CK
Sbjct: 226 QMRIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCK 275


>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
          Length = 800

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 3/81 (3%)

Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
            +GIY L +  FNH C+PNA    Y+ +  E+   + V  + D+ +G EI + Y      
Sbjct: 551 GHGIYPLASRLFNHSCVPNAVA-KYIISPFES-VQMEVVALRDIAEGEEITIPYLDPALP 608

Query: 261 YSTRQKRLLYDYDFACDCDRC 281
             TRQ+ L  +Y F C C  C
Sbjct: 609 LHTRQEALRLNYGFTCTCPLC 629


>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
          Length = 369

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
           C  C VA +CS KC   A         C+ L        PD+  L L R +         
Sbjct: 3   CSQCRVAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVIFKLMEETPS 59

Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           ++  L + Y+L                Q++L FQ  + +    A+        PP+    
Sbjct: 60  ESEKLYSFYDLESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQL------PPSFDIF 113

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E  A ++     N+F +    ++           G+Y   S  NH C PN        + 
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-------SI 155

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             N   +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D
Sbjct: 156 VFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 211

Query: 290 NDNDD-ENNEEVMDEDQD 306
            D D    +E+V  E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229


>gi|378731988|gb|EHY58447.1| hypothetical protein HMPREF1120_06457 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 545

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD-YSTRQKR 267
           AS  NH C+PN  +F Y     E   ++  R I D+ +G+EIC +Y  V ++  S RQ++
Sbjct: 239 ASRLNHSCVPN-VQFSY----DETTNEMQFRAIRDIARGKEICSNYDKVVFETRSKRQRK 293

Query: 268 LLYDYDFACDCDRCKVEAN-WSDND 291
               Y F C C+ C+     W+ +D
Sbjct: 294 QQIYYGFVCTCEACEPHTEFWARSD 318


>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
 gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
          Length = 378

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 54/143 (37%), Gaps = 39/143 (27%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GIY  +S FNH C PN C+   +++   N     +  I D+ +  EI  +Y  +N +   
Sbjct: 254 GIYFNSSLFNHSCQPNICKV--IESCKNNFGCHSMVAIRDIKENEEISFNYIQLNQNKQE 311

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTN 323
           R ++L   Y F C+C  C      ++N N                               
Sbjct: 312 RIEKLQSSYYFTCNCPSC-----CTNNPN------------------------------- 335

Query: 324 FPHAYFFMRYMCDRDNCWGTLAP 346
            PH  F  +Y C + NC G   P
Sbjct: 336 -PHKEFLNKYKCKKLNCSGLEYP 357


>gi|255730767|ref|XP_002550308.1| hypothetical protein CTRG_04606 [Candida tropicalis MYA-3404]
 gi|240132265|gb|EER31823.1| hypothetical protein CTRG_04606 [Candida tropicalis MYA-3404]
          Length = 409

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
           +  A++ K+  NAFG+     +  + +  +  YG+Y  AS+FNH C PN  +        
Sbjct: 300 IVRAIIGKNLSNAFGIWSEVSDPSEDKEFL-GYGVYPSASYFNHSCEPNIVK-------T 351

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
                +    + D+  G E+C++Y     +    ++  L ++ F C C +C  E N
Sbjct: 352 RKGATLEFTTLRDINPGEELCINYGNYTNEPVEIRRAQLSEWFFDCGCTKCIRELN 407


>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
 gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
          Length = 485

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/302 (21%), Positives = 108/302 (35%), Gaps = 70/302 (23%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI----CPSCS 72
           GR L +T+ ++ G+++  DSP+LL   +  I     CH      M+S   +    C  CS
Sbjct: 57  GRHLTATRPIRRGELIFNDSPLLLGPKIASIPVCLGCHRNLPPLMNSHLQVDFYHCRHCS 116

Query: 73  VAFCSPKCSTAAASSSHSPYVCQAL--------------TR--------------LLNLN 104
              C P C TA +     P + ++               TR              LL  +
Sbjct: 117 WPLCGPSCETATSHKDECPLLARSSKSIGSRLQFCFSDPTRKESVYCAVVPLRALLLKRS 176

Query: 105 SPDAAN---LPLDRQLQARFLIAAYNLAVVNPSQF-QILLAFQGTVTDNDTSAAHYLSSL 160
           +P+      L L+  L+ R     Y     N   F + +L   G VT+ +          
Sbjct: 177 NPERFRDGFLTLESHLETRIKTPLYAALRSNLVPFVRDILGMGGEVTERE---------- 226

Query: 161 CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
                   +E+ A L      N + +  P ++       V+   +Y   +  +H C PN 
Sbjct: 227 -------LLEIAAIL----DTNCYDVRLPELD-------VKVRALYELGAMMSHHCRPNT 268

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
               Y D        ++V    D+P+   I +SY         R+  +     F C CDR
Sbjct: 269 --KHYFD----ERLRLVVVATVDIPKDAVISISYTQPLLSTIQRRYAIQQSKCFECCCDR 322

Query: 281 CK 282
           C+
Sbjct: 323 CR 324


>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
 gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
          Length = 515

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/266 (20%), Positives = 107/266 (40%), Gaps = 17/266 (6%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+T+++K  +I+L+++P++     P   +   C  C           C  C    C
Sbjct: 52  GRHLVATRTIKPYEIILKEAPLV---RGPAQITPPVCMGCLNSIEPEDHIDCDLCGWPLC 108

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P+C +     +    + Q   + +N++  + ++ PL   +     +     +    S+F
Sbjct: 109 GPECKSLGEHRAEC-QLTQERGQKVNVHEFNGSH-PLYTCVSTVRCLLIGETSPEKASKF 166

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
           Q L + + T   ++   A   S     P     E       ++ + A G ++  I G + 
Sbjct: 167 QELESLESTRRGSNQWKADLASIGQFIPKFFKTE---KFSEEEVMRAVGALQ--INGHEV 221

Query: 197 RRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
             S   +  ++  ASF  H C+PN  +      +   N   I+    ++ +   + + Y 
Sbjct: 222 PTSDPPHVAVFYTASFTEHSCMPNLAK------SFNKNGHCILWAPREIKKNSHLSICYS 275

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
              +  + RQ+ L+    F C CDRC
Sbjct: 276 DAMWGTADRQRHLVQTKLFKCACDRC 301


>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
 gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
          Length = 649

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 73/313 (23%), Positives = 111/313 (35%), Gaps = 49/313 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
           GR   +   LK   I+L + P +  S L    S S C  CF++   +    CP CS V F
Sbjct: 239 GRFAKTNMDLKPNTIILLEKPHV--SVLLEEYSLSHCSTCFKRV--TIPVCCPKCSDVIF 294

Query: 76  CSPKCSTAAASS--------------SHSPYVCQALTRLLNLNSP----------DAANL 111
           CS  C   A SS              S +   C    R++   +           D    
Sbjct: 295 CSEACEKKANSSYHRYECGFLPIFWKSGASITCHMALRMITQKTEEYFVGLRHELDGLTS 354

Query: 112 PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
            +  +L        Y L V +  +      FQ T+     +A   L        S +   
Sbjct: 355 EVTDKLNNDDYRKVYKL-VTHEDKRSPEDYFQRTLMATLLNACLSLGGYYKSKESES--F 411

Query: 172 TAALLAKD----RLNAFGLMEPYIEGQD--GRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
              LL  +    + NA  + E   E +   G+ +    G+Y   + FNH C P   R+  
Sbjct: 412 IGGLLLHNLQFLQYNAHEISELQRENERDIGKSTFIGGGLYPTLALFNHSCEPGVTRY-- 469

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR--LLYDYDFACDCDRCKV 283
                     + VR +  +P G  +  +Y P+    S  ++R  LL  Y F C+C  C  
Sbjct: 470 -----YRGNSVCVRTVKGIPAGSMVAENYGPLFTQVSREERRSTLLNQYKFTCNCRACA- 523

Query: 284 EANWSDNDNDDEN 296
             +W    + D N
Sbjct: 524 -GDWPKFADMDSN 535


>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
 gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
 gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
          Length = 302

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 99  GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+AF   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 151 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 263 PKCLAEAD 270


>gi|195060014|ref|XP_001995739.1| GH17917 [Drosophila grimshawi]
 gi|193896525|gb|EDV95391.1| GH17917 [Drosophila grimshawi]
          Length = 545

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 108/286 (37%), Gaps = 44/286 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT-MHSSSSICPSCSVAF 75
           GR LV+ + L+AG++++ + P+ +    P ++    C  C++   + S    CP C+   
Sbjct: 70  GRYLVANRRLEAGELLISEEPLAIG---PCVSCEPVCLGCYQPVQLDSQQYRCPDCNWPL 126

Query: 76  CSPKCSTAAASSSHSPYVCQ--ALTRLLNLNSPDA-ANLPLDRQLQARFLIAAYNLAVVN 132
           C P C      + HS   C   A  R L  +   A A L   R L    LI    L   +
Sbjct: 127 CGPTCRGIRQRTGHSAEECALYAERRALLADVLTANATLEQRRDLYELVLIVRIVLLKQH 186

Query: 133 PSQ----FQILLAFQGTVTDNDTSAAHY-------------LSSLCPPPASTTIELTAAL 175
            ++     + + +      +N T   HY             L+++C       +EL    
Sbjct: 187 SAEQYANIRRMESHTAERRENATLWQHYEQKVVQRLRNDWQLANVC-----DELELHEIC 241

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
              D +N F +         G+R  +A  +Y  A    HDC PN    D  D A      
Sbjct: 242 GILD-VNCFEI---------GQRGAKARTLYPSAFLLAHDCTPNTAHTD--DPA---TFA 286

Query: 236 IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           I++R    V +   + LSY         R+  +     F C C RC
Sbjct: 287 ILLRTSRRVLEQEPLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRC 332


>gi|429849921|gb|ELA25247.1| hypothetical protein CGGC5_13535 [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 380

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 4/74 (5%)

Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
           + +  NHDC PNA    Y+    +  +  +V     + QG EI +SY PV      R++ 
Sbjct: 210 RIARINHDCRPNA----YMRLPTDGLSGTVVAGEQGIAQGEEITISYLPVELARERRRRN 265

Query: 268 LLYDYDFACDCDRC 281
           L  D+ F C C  C
Sbjct: 266 LQKDWGFECGCKLC 279


>gi|403217768|emb|CCK72261.1| hypothetical protein KNAG_0J01800 [Kazachstania naganishii CBS
           8797]
          Length = 380

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 69/154 (44%), Gaps = 19/154 (12%)

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAST--TIELTAALLAKDRLNAFGLMEPYI 191
            ++ +LL  Q       TS    L  L P    T  +IEL   +   +  NAFGL +   
Sbjct: 238 KKYPVLLHLQ-------TSVFKILYILLPDILKTHFSIELFRHIAGSEYANAFGLWQE-D 289

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           EG D R  +  Y ++  AS+ NH C+PN  +       + +  ++I     D+ +G EIC
Sbjct: 290 EGVDSREFL-GYSVFPCASYANHSCVPNITK-------SRHGREMIFTANKDISEGAEIC 341

Query: 252 LSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +    V +     R+K L   + F C C RC +E
Sbjct: 342 IDCSGVLSLAVEKRRKFLKETWFFDCCCGRCLLE 375


>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
 gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
          Length = 486

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/318 (20%), Positives = 106/318 (33%), Gaps = 77/318 (24%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL----YSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +GRGL + + ++ G+++L +S  L        L  I +   C  C     HS   +    
Sbjct: 120 KGRGLFAKKEIRQGELILGESGPLAAIPPIDKLSLIQAGKVCSLCGNSLGHSDHFVMMNG 179

Query: 68  --CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
             C  C   +C+  C     + +   +V +    L N     +     ++       +AA
Sbjct: 180 LDCNGCGAVWCNRNCKKTDVTHASLKHV-KGKGSLTNA----SGWAKFEKSCMENVFVAA 234

Query: 126 YNLAVV-------NPSQFQILLAFQ---------------------------GTVTDND- 150
           Y++ V+          +  +L  F+                           G VT  D 
Sbjct: 235 YSVGVIYATSAVSKADENHVLQHFRSLAQVSQRARKQVSDSTNVGGTFDASSGAVTSEDP 294

Query: 151 ----TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIY 206
                 A  Y     P       E     + K  LN        I GQ        Y IY
Sbjct: 295 EPMWQQAYEYFKEAFPKSEDIDFESFLTYVGKFNLNQ-------ISGQ-------IYPIY 340

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
              SF NHDC PN  R++  D        + V     +  G E+  +Y    +    R++
Sbjct: 341 ---SFINHDCEPNV-RYEIDD-----KLRLKVFARKPIKAGEELLTTYVNPLHGVKLRRR 391

Query: 267 RLLYDYDFACDCDRCKVE 284
            L  ++ F C C+RC  E
Sbjct: 392 ELRVNWGFLCQCERCTKE 409


>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
 gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
          Length = 498

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/290 (20%), Positives = 107/290 (36%), Gaps = 54/290 (18%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
           GRGL +T ++K G+ V    P    + L        C NCF       + +         
Sbjct: 20  GRGLFATSTIKMGKDVF-SMPATFSTVLSTERLKDACSNCFANLPFGVNIMANVDMKLWA 78

Query: 68  CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           C  C V  +C  KC +A     H               P   +A+ R++ L + +  ++ 
Sbjct: 79  CSGCKVVKYCDRKCQSANWKLIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSNEDHVK 138

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQ-ILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL 171
            D  +          +   N  Q++ I+L+ +    + + S +              +E 
Sbjct: 139 SDLIMFLSLRSHLDEITKSNQEQYERIMLSAKA---EKEYSGSEL-----------DVET 184

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
            A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA           
Sbjct: 185 IAEYLARIEVNSFTFTTAF--GDPLGLCIQPFACYM-----NHSCEPNA-------VVGF 230

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +   I V+ + ++    ++ +SY    Y +  RQK+L   Y F C C +C
Sbjct: 231 DGGLITVKALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280


>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
 gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
          Length = 515

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/266 (20%), Positives = 107/266 (40%), Gaps = 17/266 (6%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+T+++K  +I+L+++P++     P   +   C  C           C  C    C
Sbjct: 52  GRHLVATRTIKPYEIILKEAPLV---RGPAQITPPVCMGCLNSIEPEDHIDCDLCGWPLC 108

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P+C +     +    + Q   + +N++  + ++ PL   +     +     +    S+F
Sbjct: 109 GPECKSLGEHRAEC-QLTQERGQKVNVHEFNGSH-PLYTCVSTVRCLLIGETSPEKASKF 166

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
           Q L + + T   ++   A   S     P     E       ++ + A G ++  I G + 
Sbjct: 167 QELESLESTRRGSNQWKADLASIGQFIPKFFKTE---KFSEEEVMRAVGALQ--INGHEV 221

Query: 197 RRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
             S   +  ++  ASF  H C+PN  +      +   N   I+    ++ +   + + Y 
Sbjct: 222 PTSDPPHVAVFYTASFTEHSCMPNLAK------SFNKNGHCILWAPREIKKNSHLSICYS 275

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRC 281
              +  + RQ+ L+    F C CDRC
Sbjct: 276 DAMWGTADRQRHLVQTKLFKCACDRC 301


>gi|452847305|gb|EME49237.1| hypothetical protein DOTSEDRAFT_143030 [Dothistroma septosporum
           NZE10]
          Length = 352

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 56/124 (45%), Gaps = 14/124 (11%)

Query: 166 STTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
           S T EL   +   D  NAFG+      G+D    +  YG+Y  +S+FNH C PN  +   
Sbjct: 239 SLTPELCRTMTMADNHNAFGIRAG---GEDCEEYM-GYGVYPSSSYFNHSCSPNVNK-RR 293

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
           V  + E +         D+  G E C++Y      + D   RQ RL   + F C C+ C 
Sbjct: 294 VGRSWEFSA------ARDIAAGEECCITYLGGDEKDLDRRARQSRLHEVWGFHCYCNLCT 347

Query: 283 VEAN 286
            +A+
Sbjct: 348 AKAS 351


>gi|346320526|gb|EGX90126.1| TPR domain protein [Cordyceps militaris CM01]
          Length = 656

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 75/290 (25%), Positives = 109/290 (37%), Gaps = 89/290 (30%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
            TV +  G GRGL +T+  KAG++++                                  
Sbjct: 373 TTVRDSPGAGRGLFATRDFKAGELIM---------------------------------- 398

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
              C  +FC+        SS   P  C A+T L          + LD     R   A  +
Sbjct: 399 ---CEKSFCT-------VSSLDKP--CTAVTAL---------TVDLDDDCAIRVFPAGLH 437

Query: 128 LAVV-----NPSQFQILLA-----FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLA 177
            AVV     NPSQ   +L      ++G     DT A     S    P     ++   +  
Sbjct: 438 RAVVQKLLDNPSQAPAVLGLDSGPYKGL---GDTGAV----STAEGPVVDAFQVHHIV-- 488

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
             + NAFGL     +  D   S    G++++AS+ NH C+ NA + D+         D++
Sbjct: 489 --QRNAFGLGP---QSPDEDVSNATTGLWVRASYVNHACVANAVK-DFAG-------DLM 535

Query: 238 V-RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           V R    +  G EI  +Y   N DY  RQ  L   + FAC C  C  EA+
Sbjct: 536 VFRAARHIRAGEEIFHAY-DENADYDARQAALRKTWGFACTCALCAAEAH 584


>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
 gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
           18224]
          Length = 527

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 121/307 (39%), Gaps = 49/307 (15%)

Query: 17  GRGLVSTQSLKAG-QIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--------SSI 67
           G G+ +T ++K G  +++ D+P++  + +      + C  CF  +  +S           
Sbjct: 27  GLGVFATANIKKGADVLVIDTPMV--ALVEEKQLKNICSGCFDTSKAASIDSRRPGLVKA 84

Query: 68  CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C  C V + C   C      + HS   C   ++L     P    LP+   L+   +I + 
Sbjct: 85  CVRCKVVYYCDKNCQRKDWKAGHS-VECAIYSKLY----PRILPLPVRAILR---VILSQ 136

Query: 127 NLAVVNPSQFQ--ILLAFQGTVTDNDTSAA---HYLSSLCPPPASTTIEL----TAALLA 177
           +   V+P   +  + L++   + D+D +     H+L +           L       L  
Sbjct: 137 DKDKVSPYIHEEFLDLSYPKEIYDDDENEEANDHFLMAEAVLKYGDIEYLDEDYVEQLFG 196

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNAC-RFDYVDAAAENNT 234
           K   N+F L   +     G+R     G+YL   A+ FNH C PNA   FD          
Sbjct: 197 KLSANSFTLTSAF-----GKRR----GVYLHPAAARFNHSCNPNASYSFD--------KG 239

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
              +R    + +G +I + Y    Y   TR+  L   Y F C C RC  E   +D +   
Sbjct: 240 KCYIRATKPIAKGEQIFIPYIDTTYSVGTRRHELKERYKFDCQCPRCLNEIATTDPEEVK 299

Query: 295 ENNEEVM 301
           + +E+ +
Sbjct: 300 KRSEQEL 306


>gi|453088880|gb|EMF16920.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 364

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 49/120 (40%), Gaps = 14/120 (11%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T +L   L+  D  NAFG+      G +       YG+Y  AS+FNH C  N  +     
Sbjct: 253 TADLCLNLVRADNHNAFGIRG----GGEDSEEYMGYGVYPSASYFNHSCDANIHK----- 303

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
                          D+  G E+C++Y      + +   R+ RL   + F C C RCK E
Sbjct: 304 --KRTGRSWTFHAAQDIQPGDELCITYLGGDEKDLNLPARRGRLEEVWGFVCHCARCKKE 361


>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
 gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
          Length = 646

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 125/325 (38%), Gaps = 51/325 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   ++  +K G+ +L + P +  S L    + S C NCF +T+  
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGEELLVERPFV--SVLLEKFAKSHCENCFMRTVVP 294

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPL- 113
            +  CP C+ V +CS +C   A+   H  Y C  +  +     ++N+  A    A+ PL 
Sbjct: 295 VA--CPRCADVLYCSEQCREEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351

Query: 114 ---------DRQLQARFLIAA-----YNLAVVN--------PSQFQILLAFQGTVTDNDT 151
                    D QL    LI+        +A +         P+ FQ  L  +  +T    
Sbjct: 352 YFLKLKPTIDEQLTPEQLISLPKDDFRRVAQLERHQGERQPPNFFQHALMAR-FLTHCLR 410

Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKA 209
           +  ++ S   P   S    L    L   + N   + E +     GR      G  IY   
Sbjct: 411 AGGYFGSEPKPDEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTL 470

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKR 267
           + FNH C P   R+          T I +  +  +  G  I  +Y P+    + S RQ R
Sbjct: 471 ALFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERSERQAR 523

Query: 268 LLYDYDFACDCDRCKVEANWSDNDN 292
           L   Y F C CD C    NW   D+
Sbjct: 524 LKELYWFECSCDACI--DNWPKFDD 546


>gi|242013801|ref|XP_002427589.1| protein msta, isoform B, putative [Pediculus humanus corporis]
 gi|212512004|gb|EEB14851.1| protein msta, isoform B, putative [Pediculus humanus corporis]
          Length = 472

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/279 (21%), Positives = 105/279 (37%), Gaps = 41/279 (14%)

Query: 20  LVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI---CPSCSVAFC 76
           LV+++S+K G+I+L D PI++    P   S   C  C+     + S+    C +C+   C
Sbjct: 2   LVASKSIKQGEIILEDEPIVVG---PCQESKPLCIGCYTDIDLNDSNKYFRCVNCTWPLC 58

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-RFLIAAYNLAVVNPSQ 135
           S +C        HS   C+  ++    N      +     L   R L+    L  +NP +
Sbjct: 59  SNQCKGFFNKFGHSKEECETFSKSFVKNKEIKTFVNFYYALTPFRCLL----LKKINPEK 114

Query: 136 FQILLAFQGTVTDNDTSAAHY-------------LSSLCPPPASTTIELTAALLAKDRLN 182
           ++I+   +    +       +             +  L        I     +L    +N
Sbjct: 115 WKIIKNMESHTNERKKDGKLWETNEKMAVDIIKNVWGLSDDDDDNDIHFVCGIL---EVN 171

Query: 183 AFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIH 242
           +F +           +S+RA  +Y  A  F+H+C+PN         +  N   +I R   
Sbjct: 172 SFEV------NFQNVQSLRA--LYSNAFLFSHNCIPNT------SHSNNNEFKLIFRTTT 217

Query: 243 DVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            + +   + +SY         R+K L     F C+C RC
Sbjct: 218 FIQKNSAVTISYAYTLQPTLLRRKHLEKTKFFQCNCQRC 256


>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
 gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 393

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 71/340 (20%), Positives = 129/340 (37%), Gaps = 82/340 (24%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKTMHSS 64
           + + + E    G+ + S   +K G+++++  SP+        +N   FC NCF  +   +
Sbjct: 17  KKIEINETLESGKFIESKIDMKLGELIIKVQSPMCFSFHKHLVNQ--FCFNCFSNSHEIN 74

Query: 65  SSI---------------CPSCS-VAFCSPKC----------STAAASSSHSPYVCQALT 98
           ++                C +C  + FCS +C            + +++ H+P  C  L+
Sbjct: 75  NAKFNKFKVDINKNYIIRCNNCKLIYFCSDECFEKVMSIESFQDSTSTNIHTPLECLILS 134

Query: 99  RLLNLNSPDAANLPLDRQLQARFLI-----AAYNLAVVNPSQFQILLAFQGTV------- 146
              +       N  L  Q + R +I      AY  +  N ++F++LL     +       
Sbjct: 135 NYHDQTISPKIN-TLHDQTENRMIINYLSKIAY--STNNNNKFKLLLIEMNQLIGDFNND 191

Query: 147 TDNDTSAAHYLSSLCPPPASTTIE-------------LTAALLAKDRLNAFGLMEPYIEG 193
            +N T + + + ++     S  +              +   L AK + N+FGL +     
Sbjct: 192 NNNQTLSLNEIKNI--KNKSFNLRKLFNNFFFNIDKVIIEELYAKSQRNSFGLWK----N 245

Query: 194 QDGRRSVRAYG---IY---------LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
            D    +  YG   IY         +  S+FNH C PN  R        + N  I +  +
Sbjct: 246 SDECFGLSMYGNQTIYNNNNDKDDNISISYFNHSCFPNCVR-------VQENQSISIYSL 298

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +G E+ +SY  +    + R   L   Y F C C RC
Sbjct: 299 IPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFECKCKRC 338


>gi|294463315|gb|ADE77193.1| unknown [Picea sitchensis]
          Length = 288

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 15/126 (11%)

Query: 168 TIELTAALLAKDRLNAF------GLMEPYIEGQDGRRSVR---AYGIYLKASFFNHDCLP 218
           T E  + +LA+ R N F      G     +       SV       +Y+  S +NHDC P
Sbjct: 168 TKEWYSGILARIRANVFRIELIGGYHTDLLAAASASISVETTVGSAVYMLPSMYNHDCDP 227

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           NA      +A A       ++ + ++  G E+ + Y   + DY  RQ  L   + F C C
Sbjct: 228 NAHILWIENAKAR------LKALRNIEPGEELRICYIDASMDYEARQSLLYQGFGFRCQC 281

Query: 279 DRCKVE 284
            RCK +
Sbjct: 282 LRCKYK 287


>gi|167342128|gb|ABZ69012.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R    + F C C RC VE
Sbjct: 65  RQGR-CESWGFECKCRRCIVE 84


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 99/281 (35%), Gaps = 34/281 (12%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           +GRGL + +    G++V    P   Y  L      + C +CF  T       C  C  A+
Sbjct: 17  KGRGLRALRDFAVGELVFA-CPAYSY-VLTVNERGAHCEHCF--TRREDLFKCGKCKQAY 72

Query: 76  -CSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
            C+  C             A  +H    C + T  L         +  +R    R L   
Sbjct: 73  YCNVDCQRGDWPMHKLECVAMCTHGENWCPSETVRLVSRIIMKQRVTTERTPSERLL--- 129

Query: 126 YNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFG 185
             L     S    + + +  +   D +A HY  S          ELT  L A+   N F 
Sbjct: 130 --LLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDDQELTE-LFAQVNCNGFT 186

Query: 186 LMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           +       +D   S     ++   +  NH C PN     Y    AE      VR + ++ 
Sbjct: 187 I-------EDEELSHLGSAVFPDVALMNHSCSPNVI-VTYKGTVAE------VRAVQEIN 232

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            G EI  SY  + Y    R++RLL  Y F C C  C  ++ 
Sbjct: 233 PGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECTTKSK 273


>gi|407843612|gb|EKG01507.1| hypothetical protein TCSYLVIO_007495 [Trypanosoma cruzi]
          Length = 382

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 89/243 (36%), Gaps = 40/243 (16%)

Query: 69  PSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLIAA 125
           P+C   +CS  C T A S  HS   C    + L L+      A+    +R   A  L+  
Sbjct: 163 PNCHEEYCSRDCRTMALSLYHSRSCCNEGLQALELDMYTQMKASGTSTERNAAAAQLLML 222

Query: 126 YNLAV-----VNPSQFQILLAFQGTVTDNDTSAAH-------YLSSLCPPPASTTIELTA 173
             LAV     V PS    L    G +  +    AH         +  C    S + E   
Sbjct: 223 RVLAVSVQQQVIPSALSELRILSGRLLFSPRVLAHSFLEIYERFTRFCHTATSISYEEMI 282

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
            +LA+   N F   +  +E            + L  S  NH C+ N         AAE  
Sbjct: 283 GVLARVTANCFH-RDTCVE------------LNLSRSMLNHSCVAN---------AAEEL 320

Query: 234 TDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
               ++   DV +G E+ ++Y+P   N  +  R K  L    F C C +C+ E      D
Sbjct: 321 HSGEIKTTRDVARGEELTINYYPQLKNLSFEERSKE-LERRGFECHCSQCRREKGLERRD 379

Query: 292 NDD 294
            ++
Sbjct: 380 LNE 382


>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
          Length = 563

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 106/298 (35%), Gaps = 38/298 (12%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI--NSSSFCHNCFRKTMHS 63
           E  T  E+   G+ LV+T+ L  G +++ + P L+Y   P +       C  C R  +  
Sbjct: 46  EITTTKEV---GKCLVATRDLSPGDVIISELP-LVYGPRPHMVEEGPVPCPGCCRLIICE 101

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ-ARFL 122
           +S  CP C    C P+C        H  + C     +L+L    A N   D   Q     
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHG-HEC----LILSLREIRAINGLHDFYRQDTLLA 156

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY---------------LSSLCPPPAST 167
           +    L   NP +F  L+  +  +         Y                + L    A T
Sbjct: 157 LRCLLLQKKNPKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQART 216

Query: 168 TIELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FD 224
             E+   +  +      G+++       QD   SV    +Y  A    HDCL N    FD
Sbjct: 217 GKEVLQDISKETVHKICGIIDVNALEINQDAEISV----LYPTAYLMEHDCLCNTVHSFD 272

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                 EN   I VR    + +G  I   Y    +    R++ L     F+C C RCK
Sbjct: 273 ----TEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCK 326


>gi|225559380|gb|EEH07663.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 517

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPV 257
           RA G  L +++ NH CLPNA       A+  N  +   + V     + +G EI + Y  +
Sbjct: 140 RALG--LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESL 197

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
             D ++RQ+ LL +Y F C C  C+ E
Sbjct: 198 YMDIASRQQFLLQEYGFECSCRLCEKE 224


>gi|353236057|emb|CCA68059.1| hypothetical protein PIIN_01926 [Piriformospora indica DSM 11827]
          Length = 786

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 9/81 (11%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           +  G+YL AS+FNH C+PNA    + D        +++R    + Q  EI +SY   +  
Sbjct: 521 KGTGLYLLASYFNHSCIPNAGHQFFSDL-------MVIRATQSIKQDEEITISYCS-HAS 572

Query: 261 YSTRQKRLLYDYDFACDCDRC 281
           Y++R+K L   +D  CDC  C
Sbjct: 573 YASREKNLKPWFD-QCDCQLC 592


>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 621

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 59/142 (41%), Gaps = 19/142 (13%)

Query: 190 YIEGQDGR----RSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           Y  G D R    +     G+Y+    S  NH C+PN  R         NN  +IV  +  
Sbjct: 430 YTTGLDARMCENKQCCTTGLYIAPITSLLNHSCIPNVKR------CFSNNYSVIVYAVQP 483

Query: 244 VPQGREI--CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK-----VEANWSDNDNDDEN 296
           + +G ++  C       Y+ S RQK L   Y+F CDC  CK     VE       N  +N
Sbjct: 484 IKKGSQLFDCYQQEFYEYNISPRQKHLKKTYNFNCDCKACKEKWDIVEYEVVSKKNLKKN 543

Query: 297 NEEVMDEDQDEQMVASDDDAEV 318
             E    D+  Q++ +  D ++
Sbjct: 544 APEWRLADEYLQLIQAIHDKKI 565


>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
 gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
           commune H4-8]
          Length = 360

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDII 237
           + N  GL +  +   D   S  AY G++ +AS  NH C PNA   FD        +  ++
Sbjct: 152 KTNGIGLGDDLVYPDDSGDS--AYVGVFERASRINHSCSPNAVYHFDL------QSFCLV 203

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
           VR I  + +G EI +SY       +TR++  L DY F C C  C +
Sbjct: 204 VRAIRHISKGEEIFISYSETLLQAATRRQNSLQDYGFRCACPACAM 249


>gi|156047862|ref|XP_001589898.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980]
 gi|154693059|gb|EDN92797.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 545

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 108/277 (38%), Gaps = 25/277 (9%)

Query: 19  GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
           G+V+ + + A + VL D P L       ++S+  C  C    +    + C  C  ++CSP
Sbjct: 272 GVVAIKDILAQEPVLYD-PTLA----AVVDSAGRCPACCGPVLDGVENSC--CRTSYCSP 324

Query: 79  KCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV----VNPS 134
            CS  A  S H+  VC      L    P +++   +  + ++ L+    L++     NP 
Sbjct: 325 LCSQRALDSYHT-VVCGKDFDFLYETKPVSSSSSTESFIGSKLLLRVLALSIRENTANPL 383

Query: 135 QFQILLAFQGTVTDNDTSAAHY-LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
           Q  ++         N+     +  +     P     EL   +      + + L   +   
Sbjct: 384 QSSLISRLTPAYNPNNPQLIAFNFTDHITTPIQILGELGVDVFTNGMYDTWVLHTIHCRL 443

Query: 194 QDGRR-----SVRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQG 247
           Q+ +      + R   I    S FNH C PN   R D+       N+ + +    ++ +G
Sbjct: 444 QNNKHGQTFDNFRGTAINPLYSMFNHSCDPNVDWRHDH------ENSTVTMFAERNIKKG 497

Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            E+ +SY         RQ++L+  +   C C +C  E
Sbjct: 498 EEMFISYIGKGDSLKDRQRKLMPWFGMECACHKCDEE 534


>gi|393227660|gb|EJD35329.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 352

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 74/179 (41%), Gaps = 24/179 (13%)

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---AHYLS-------SLCPPPASTTIE 170
           FL++    A  NPS F +L   +G  T   ++A   AH  S          P  +  T +
Sbjct: 183 FLLSGILAACTNPSAFALLRQLEGVSTPYRSTAQLHAHLASYQTLLALLPVPLLSHVTPD 242

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAA 230
           +   L + D  N+FG+         GR  +  YGI+  ASFFNH C P   +        
Sbjct: 243 ILRTLSSVDAHNSFGIWSD----APGRGEMLGYGIWPDASFFNHSCHPVIAQ-----RVR 293

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVN--YDYSTRQKR--LLYDYDFACDCDRCKVEA 285
           E  T   V    D     E+ +SY   +   + +  Q+R  L  ++ F C C RC  E+
Sbjct: 294 EGRTWSFV-ASRDAAPDEEVTISYLSADELRELNVEQRRQCLQRNWGFVCMCARCVEES 351


>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 675

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/341 (23%), Positives = 127/341 (37%), Gaps = 64/341 (18%)

Query: 5   SETVTVAEIEGR-GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           S+ VT+    G  GR  V+T++++ G+I++ + P   +    +    + CH C  K    
Sbjct: 237 SKAVTIRNDGGDIGRHAVATRNIEPGEILVVEKPHCSFILAEY--RLTHCHLCMNKIFVP 294

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF- 121
             S C  CS +A+CS  C  A A    S   C  L  L    +     L L    Q  F 
Sbjct: 295 IPSKCQVCSYLAYCSTSCRDADAKIHESE--CAVLPSLWMSKTSVTCFLALRAITQRSFE 352

Query: 122 -LIAAYNLAVVNPSQFQI----------LLAFQGTVTDND----------TSAAHYLSSL 160
             I   +       +F+I            A+ G VT  D          T  A +L  L
Sbjct: 353 EFIKLKDKLKSTKGRFEITQQRPYRSDDFEAYYGLVTHEDERTDEDMLHRTYIAVWLLRL 412

Query: 161 CP-----PPASTTIELTAALLAKDRL---------------NAFGLMEPYI-EGQD---- 195
                  P    T +   A  ++D L               NA  + E  I  G+     
Sbjct: 413 LKLSRYFPENVKTPDTAEAKPSEDELFIGSLILHGLMLLQFNAHEISELTIPRGEKTLAK 472

Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
            + +    G++   + FNH C P   R+          T ++VR I  +  G EI  +Y 
Sbjct: 473 AKSTFIGGGLFPTIALFNHSCNPGVIRYFI-------GTTMVVRAIRSIAAGEEISENYG 525

Query: 256 PV--NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDD 294
           P+      + R++ L + Y F C+C+ C    +W   D+ D
Sbjct: 526 PIFTMSPEAERKRTLRWQYWFDCNCEACV--GHWPLLDDID 564


>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 517

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 71/317 (22%), Positives = 115/317 (36%), Gaps = 75/317 (23%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL------YSALPFINSSS------- 51
           S+   V   E +G+ +V+   ++AG  ++   P +       Y A   I++ +       
Sbjct: 6   SQMYKVDYKEDKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDTENKKTDYKT 65

Query: 52  --FCHNCFRKTMHSSSSICPSCS-VAFCSPKC----------STAAASSSHSPYVCQALT 98
              C  CF K   S S  CP+C  V +CS  C                S+     C ++ 
Sbjct: 66  VNLCFYCFEK--FSKSFYCPNCKYVVYCSEVCLERGWKLHRDECEVFKSNIFDKFCPSII 123

Query: 99  RLLNLNS-----------PDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVT 147
             + +NS              A+LP ++     F   AY +AV   S+ + + A      
Sbjct: 124 MRMVINSYLSHFNFYEYCGSVADLPKEKY--EYFKYPAYIVAVALMSKKKKIFA---NFE 178

Query: 148 DNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL 207
           DN++   + +        +T        +  + L   GL                 G Y 
Sbjct: 179 DNESILRNIIEKFIKISKNT------LQIIDNELEPAGL-----------------GFYK 215

Query: 208 K-ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQK 266
           K   +FNH CL N             N  + +R + DV  G E+ +SY  + +D +TR  
Sbjct: 216 KPVPYFNHSCLSNC-------VTIFKNQKLFIRTLMDVYPGEELTISYVDIAFDKNTRLA 268

Query: 267 RLLYDYDFACDCDRCKV 283
             +  Y F C C  CKV
Sbjct: 269 ICMDQYFFTCTCKLCKV 285


>gi|146087265|ref|XP_001465774.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134069874|emb|CAM68201.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 455

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           ++  G+Y   S FNH C+PNA     VD   E    I+++    +  G  + ++Y P+  
Sbjct: 360 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLTA 414

Query: 260 DYSTR--QKRLLYDYDFACDCDRC 281
             ++R  ++R L +Y F C C RC
Sbjct: 415 GAASRAERQRRLRNYFFTCHCPRC 438


>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
          Length = 241

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 18/121 (14%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 20  GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIQAGEELTICYLDMLMTS 72

Query: 262 STRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDEDQDEQMV 310
           + R+++L   Y F CDC RC             E  W +     +  EE+  E + EQ++
Sbjct: 73  AERREQLRNQYCFDCDCARCLTQDKDADMLTGDEKVWKEVQESLKKIEELKAEWKWEQVL 132

Query: 311 A 311
           A
Sbjct: 133 A 133


>gi|328853238|gb|EGG02378.1| hypothetical protein MELLADRAFT_91461 [Melampsora larici-populina
           98AG31]
          Length = 479

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 8/141 (5%)

Query: 160 LCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
           L  P  S  ++L AA   +       +     E   G   V  Y +  + S FNH+C PN
Sbjct: 238 LPSPTGSLLMKLDAAGHTRVERVHTAIARNAFETPHGSEEVLHYAVVPEPSVFNHECRPN 297

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
           +  F       +    + +  + D+  G EI ++Y  +    + RQ  + + Y F C C 
Sbjct: 298 SAFF-----FDKKTLRVYINAVRDIAMGEEITIAYRDMKASRAERQASIAH-YGFQCKCS 351

Query: 280 RCKVEANWSDNDNDDENNEEV 300
            C + A   ++   DE  +E+
Sbjct: 352 HCSMSAE--ESKASDERIQEI 370


>gi|328850107|gb|EGF99276.1| hypothetical protein MELLADRAFT_94773 [Melampsora larici-populina
           98AG31]
          Length = 415

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 51/107 (47%), Gaps = 15/107 (14%)

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYG-IYLKASFFNHDCLPN-ACRFDYVDAAAENNT 234
           A +R N +  +E  ++GQ+     R +G +  + +  NHDC PN A  FD       N  
Sbjct: 199 AFERNNFYHRVE--VKGQE-----RPFGAVVFEPTRLNHDCRPNSAYHFD------TNTL 245

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            + V  + D+  G E+ +SY  V+    TRQ  L  DY F C C  C
Sbjct: 246 KLNVHALRDISPGEELTVSYIQVDVSKETRQGLLSRDYGFECGCSLC 292


>gi|336275943|ref|XP_003352725.1| hypothetical protein SMAC_01560 [Sordaria macrospora k-hell]
 gi|380094615|emb|CCC07995.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 726

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  +S  NH C+PN  R    D        +I R   DV +G E+   Y PV    
Sbjct: 465 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVKEGEELFQQYVPVKTLV 517

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
             R K     + F C C  C+ E   S+ 
Sbjct: 518 DVRNKEFAEGWGFGCRCGLCEGERKSSEE 546


>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
          Length = 420

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 75/184 (40%), Gaps = 29/184 (15%)

Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
           QL+  F  A Y   V     P  F+ L A      QG  T + +   H   +L  PP   
Sbjct: 218 QLRRLFTDALYEERVSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRER 277

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
             ++     L KD           IE   G   +    G+YL  S  NH C+PNA     
Sbjct: 278 EQLDSLIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 321

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCK 282
             +  +NN  + +  + D+  G EIC+SY      +    +RQK L  +Y F C C +C 
Sbjct: 322 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKCL 381

Query: 283 VEAN 286
            +A+
Sbjct: 382 AQAD 385


>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
 gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
           Full=Protein NN8-4AG; AltName: Full=Retinoic
           acid-induced protein 15
 gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
 gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
          Length = 418

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+AF   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 267 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 379 PKCLAEAD 386


>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
 gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
          Length = 393

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y+  S  NH C+PNA   F Y      +N  ++++ +  + +G EIC+SY     +  
Sbjct: 288 GLYILQSKINHSCVPNAQSTFPY------SNDIVVLKALAPIQKGDEICISYLDDCQLER 341

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +R K L  +Y F C C +C+ +A+
Sbjct: 342 SRHSRHKTLRENYIFVCQCPKCRAQAS 368


>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
 gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
          Length = 421

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 75/184 (40%), Gaps = 29/184 (15%)

Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
           QL+  F  A Y   V     P  F+ L A      QG  T + +   H   +L  PP   
Sbjct: 220 QLRRLFTDALYEERVSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRER 279

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
             ++     L KD           IE   G   +    G+YL  S  NH C+PNA     
Sbjct: 280 EQLDSLIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 323

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCK 282
             +  +NN  + +  + D+  G EIC+SY      +    +RQK L  +Y F C C +C 
Sbjct: 324 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKCL 383

Query: 283 VEAN 286
            +A+
Sbjct: 384 AQAD 387


>gi|398015620|ref|XP_003860999.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322499223|emb|CBZ34294.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 455

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           ++  G+Y   S FNH C+PNA     VD   E    I+++    +  G  + ++Y P+  
Sbjct: 360 LKGAGLYSLLSCFNHSCVPNAA-VSTVDGTHE----IVLKTTRPIRAGEPLTITYIPLTA 414

Query: 260 DYSTR--QKRLLYDYDFACDCDRC 281
             ++R  ++R L +Y F C C RC
Sbjct: 415 GAASRAERQRRLRNYFFTCHCPRC 438


>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
          Length = 418

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 81/188 (43%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+AF   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 267 -ACDTLELKPQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 379 PKCLAEAD 386


>gi|443897425|dbj|GAC74766.1| hypothetical protein PANT_12d00127 [Pseudozyma antarctica T-34]
          Length = 500

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 74/335 (22%), Positives = 124/335 (37%), Gaps = 70/335 (20%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLY---SALPFINSSSFCHNCFRKTMHSSSSI----C 68
           +G+GLV+++S   G++V  +   +      A   ++S   C  CF     +   +    C
Sbjct: 120 KGKGLVASRSFAEGELVFTEEAYIATPPPEAFDQVSSGELCAQCFLPVSSAPVQLAIKNC 179

Query: 69  PSCSVAFCSPKCSTAAASSSHSPY-------------------------VCQALTRLLNL 103
             C V FC+  C+  A ++ H+                           V ++L RLL+ 
Sbjct: 180 GKCKVRFCTSACARTAMATHHTLLCTELNPAAKPLVQLVQSQKWQSLHCVARSLARLLST 239

Query: 104 NSPDAANLPLDRQLQAR-FLIAAY-----------NLAVV--------NPSQFQILLAFQ 143
            +P   N  + +Q + +  LIAAY           + A V        NP       +F+
Sbjct: 240 LTPH--NNGMGKQSREQDDLIAAYGDFETVHARLSSFATVSELERRSRNPGWATEKASFE 297

Query: 144 GTVTDNDTSAAHYL--------SSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQD 195
           G + +  T+  + L        S+  P P    ++L  A+  K R      +  Y     
Sbjct: 298 GILAEAYTALCNALDPYAEPRTSAPHPDPHHVQLDLIDAV--KTRKGQIKDLFTYASFLK 355

Query: 196 --GRRSV---RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI 250
             GR +V   +  G+Y   SF NH C PN       +     +  +    +  + +  E+
Sbjct: 356 LLGRANVNMEKFGGLYSVHSFLNHSCAPNVQIRHVPERGILASMKVAALALRPIAKDEEL 415

Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFA-CDCDRCKVE 284
            +SY         RQ  L  DY F  C C +C  E
Sbjct: 416 VISYIDPTTRLGRRQLVLYRDYCFGPCTCPKCVQE 450


>gi|405118616|gb|AFR93390.1| hypothetical protein CNAG_03890 [Cryptococcus neoformans var.
           grubii H99]
          Length = 449

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 14/104 (13%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYS---ALPFINSSSFCHNCFRKTMHSSSSI---C 68
           G+G+GLV+ + LK G+++ ++ P ++ S     P +  S  C  CF    H S  +   C
Sbjct: 103 GKGKGLVAKEELKQGEMLWQEEPWIVTSDPGHYPLLIQSMMCSQCFSLFAHPSPPLSVPC 162

Query: 69  PSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL 111
           P C+ A FC+  C T + SSSHSP +C  L       +PDA +L
Sbjct: 163 PHCTTAHFCNRLCYTKSLSSSHSPLLCPGL-------NPDAGSL 199


>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
          Length = 441

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/277 (22%), Positives = 112/277 (40%), Gaps = 44/277 (15%)

Query: 25  SLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS-SICPSCSVA-FCSPKCST 82
            LK G+  +++ P +    L      S C  CFR+  +      C SC V  FC+ KC +
Sbjct: 2   KLKRGETWVKEDPFV--HVLSQTERDSRCDFCFRRPENDIQIPRCSSCKVVRFCNRKCQS 59

Query: 83  AAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIA----AYNLAVVNPSQ-F 136
           +A S       C++L R+       A  +P D  +L  R LI       ++ V    + +
Sbjct: 60  SAWSVHKKE--CRSLKRV-------APKIPTDSVRLIFRLLIKLESDIEDVEVFGKKRNW 110

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPAST-------TIELTAALLAKDRLNAFGLMEP 189
             L++    V ++      ++  +    + +       ++E    +  +  +N+F + +P
Sbjct: 111 ADLISHVDEVQEDQIRLQQFMVLMTTLKSFSENVMSMPSVEELFVIFGRVCVNSFSICDP 170

Query: 190 YIEGQDGRRSVRAYGIYLKA----SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
            +            G+Y++     S  +H C PNA        A    T + +R +  V 
Sbjct: 171 EMNPI-------GVGVYIRQIKPCSVLDHSCRPNA-------VAVFEGTTLRIRCVEPVD 216

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             +++ +SY     D +TR++ L   Y F C C  CK
Sbjct: 217 SEQDLRISYIDTLDDTTTRRRNLQQQYYFNCLCGECK 253


>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 650

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 74/305 (24%), Positives = 114/305 (37%), Gaps = 51/305 (16%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-VAF 75
           GR   +   LK   IVL + P +  S L    S + C  CF++   S    CP CS V F
Sbjct: 240 GRFAKTNVDLKPNTIVLVEKPHV--SVLLEEYSKTHCSTCFKRV--SVPVCCPKCSDVVF 295

Query: 76  CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPL---DRQLQARFLIAAYNLAVVN 132
           CS  C ++A S  H  Y C  L       +    ++ L    +Q +  FL     L  + 
Sbjct: 296 CSEDCESSANSGYHK-YECGFLPIFWKSGASITCHMALRIITQQSEEYFLQLRPELDGLT 354

Query: 133 PSQFQIL-----LAFQGTVTDNDTSAAH--YLSSLCPPPASTTIELTAALLAKD------ 179
             Q   L           VT  +T +A   +  +L     +  + L      K+      
Sbjct: 355 SEQTDKLKHDDYRKIYKLVTHEETRSAEDFFQRTLMATLLNACLTLGGFYKTKEAESFIG 414

Query: 180 ----------RLNAFGLMEPYIEGQD----GRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
                     + NA  + E  ++ +D    G+      G+Y   + FNH C P   R+  
Sbjct: 415 GLLLHNLQLLQFNAHEISE--LQREDDRDVGKSVFIGGGLYPTLALFNHSCEPGVTRY-- 470

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCK 282
                     + VR +  +  G  I  +Y P+ +  + R +R   LL  Y F+C+C  C 
Sbjct: 471 -----YKGNSVCVRSVRSIAAGSMIGENYGPL-FTQTPRDERRATLLNQYRFSCNCRAC- 523

Query: 283 VEANW 287
              NW
Sbjct: 524 -SENW 527


>gi|342184438|emb|CCC93920.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 471

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 36/85 (42%), Gaps = 4/85 (4%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R  G+Y     FNH C PN      V      +  + +  + D+  G E+ +SY      
Sbjct: 371 RGQGVYTIGCLFNHSCEPNL----QVLYTVSGDETLSIEALRDIEPGEELNISYIDDTLP 426

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA 285
           Y  RQ  LL  Y F C C RC  EA
Sbjct: 427 YPRRQLSLLEHYLFQCACPRCTREA 451


>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
           [Equus caballus]
          Length = 358

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C PN        +   N   +++R + D+  G E+ + Y  +      
Sbjct: 126 GLYPSMSLLNHSCEPNC-------SIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEE 178

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           R+K+L   Y F CDC RC+ +   +D    DE+
Sbjct: 179 RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEH 211


>gi|260788416|ref|XP_002589246.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
 gi|229274421|gb|EEN45257.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
          Length = 778

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 3/83 (3%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C  N    D +DA  +    +++R    + +G E+  +Y   +     
Sbjct: 646 GLYALQSSLNHSCDKNV---DVMDAVVDGKPGVVIRAKQPIKKGGELYTTYIDTSMQRPQ 702

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R+  L   Y F C+C RCK E +
Sbjct: 703 RRAWLYRAYHFWCECQRCKYEGD 725


>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
          Length = 575

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 106/298 (35%), Gaps = 38/298 (12%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFI--NSSSFCHNCFRKTMHS 63
           E  T  E+   G+ LV+T+ L  G +++ + P L+Y   P +       C  C R  +  
Sbjct: 46  EITTTKEV---GKCLVATRDLSPGDVIISELP-LVYGPRPHMVEEGPVPCPGCCRLIICE 101

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ-ARFL 122
           +S  CP C    C P+C        H  + C     +L+L    A N   D   Q     
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHG-HEC----LILSLREIRAINGLHDFYRQDTLLA 156

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHY---------------LSSLCPPPAST 167
           +    L   NP +F  L+  +  +         Y                + L    A T
Sbjct: 157 LRCLLLQKKNPKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQART 216

Query: 168 TIELTAALLAKDRLNAFGLME--PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FD 224
             E+   +  +      G+++       QD   SV    +Y  A    HDCL N    FD
Sbjct: 217 GKEVLQDISKETVHKICGIIDVNALEINQDAEISV----LYPTAYLMEHDCLCNTVHSFD 272

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                 EN   I VR    + +G  I   Y    +    R++ L     F+C C RCK
Sbjct: 273 ----TEENGYKITVRAALPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCK 326


>gi|167342134|gb|ABZ69015.1| SET domain-containing protein C [Pinus radiata]
 gi|167342136|gb|ABZ69016.1| SET domain-containing protein C [Pinus radiata]
          Length = 141

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC +E
Sbjct: 65  RQGR-CENWGFECKCRRCILE 84


>gi|195023574|ref|XP_001985712.1| GH20950 [Drosophila grimshawi]
 gi|193901712|gb|EDW00579.1| GH20950 [Drosophila grimshawi]
          Length = 557

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 39/288 (13%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFIN---SSSF--CHNCFRKTMHSSSSICPSC 71
           GR LV+TQ+++A QI+  + P+++       +   S+S   C  C+      +  +C SC
Sbjct: 56  GRYLVATQTIQAKQIIFVEEPLVVGPKWFLTDDEKSASIVPCVACY-TPCRINKHLCRSC 114

Query: 72  SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSP-----DAANLPLDRQLQARFLIAAY 126
               CS  C        H    C     +L+L SP     DA  L    +  A  ++   
Sbjct: 115 RWPVCSISCE-------HEKLECS----ILSLGSPPSGKSDARGLHDYYRHDALLVLKCL 163

Query: 127 NLAVVNPSQFQILLAFQGTVTDNDTSAAH---------YLSS--LCPPPASTTIELTAAL 175
            L   +P ++  LL  Q    +   +  H         YL    L    A+   E   + 
Sbjct: 164 LLQRQHPERWAALLEMQSHEEERLGTELHQEAEQRIVSYLQQRFLQRIKAAKNRESLLSE 223

Query: 176 LAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
            + + L+   G++E      +    V   G++ +A    H C PN C F + DAA +   
Sbjct: 224 YSAELLHRLCGIIETNYMVIELATGVELSGVFRQACMMEHACQPN-CYFQF-DAATQR-- 279

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I VR   D+ +G  + ++Y  + +    RQ  L     F C C RC+
Sbjct: 280 -IAVRAGCDLKKGDHLKITYTNILWGTQMRQHHLRMTKHFNCRCSRCE 326


>gi|383142265|gb|AFG52495.1| Pinus taeda anonymous locus UMN_1780_01 genomic sequence
          Length = 155

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + +       + +G EI   YF +      
Sbjct: 14  GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 66

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 67  RQGR-CENWGFECKCRRCIVE 86


>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 454

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 18/131 (13%)

Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNA 220
           PP     +  AA+ A+   NA  ++ P ++           GI L  +    NH C PNA
Sbjct: 112 PPTMFDKDFIAAMYARVMSNALTIITPTLD---------PLGIILDPTLCSLNHSCDPNA 162

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
             F  +D  +     + +R +  + + +EI +SY    Y Y  RQ+ L   + F C C +
Sbjct: 163 --FIMMDGPS-----VSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAK 215

Query: 281 CKVEANWSDND 291
           C+ +A   +++
Sbjct: 216 CQEKATLQEDN 226


>gi|167342102|gb|ABZ68999.1| SET domain-containing protein C [Pinus taeda]
 gi|167342124|gb|ABZ69010.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + +       + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
           boliviensis boliviensis]
          Length = 369

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 91/247 (36%), Gaps = 45/247 (18%)

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
           C  C VA +CS KC   A         C+ L        PD+  L L R +         
Sbjct: 3   CSQCRVAKYCSSKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEGSPS 59

Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           ++  L + Y+L                Q+++ FQ  + +    A+        PPA    
Sbjct: 60  ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDIF 113

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E  A ++     N+F +    ++           G+Y   S  NH C PN        + 
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SV 155

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             N   +++R + D+  G E+ + Y         R+K+L   Y F CDC RC+ +   +D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDTLMTSEERRKQLRDQYCFECDCFRCQTQDKDAD 215

Query: 290 NDNDDEN 296
               DE 
Sbjct: 216 MLTGDEQ 222


>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
          Length = 439

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            +Y KAS FNH C PN CR   ++ A E     ++ +  D+  G E+  +Y  ++     
Sbjct: 283 AVYEKASLFNHSCQPNVCR---INRAGEWGALEMISLT-DIAAGTELVYNYIQISLPTED 338

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQ +L  +Y F C C+ C
Sbjct: 339 RQSKLSENYFFECKCNGC 356


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 75/295 (25%), Positives = 115/295 (38%), Gaps = 41/295 (13%)

Query: 17  GRGLVSTQSLK--AGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           GRGL + +      G  +    PI+ Y   P + + + CH C           C  C  A
Sbjct: 18  GRGLFAGEGSHYIPGDSIATYRPIV-YCVTPTM-AKACCHWCLNSDAPKYYQ-CSGCRYA 74

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL--IAAYNLAVV 131
            +CS +C+ AA         CQ + +L    +  A++ PL   L A  L  +   + AV 
Sbjct: 75  VYCSKECAKAAYKLGTHRRECQLIKKL---PAEHASSAPLTTFLAAAKLHWLVQEDEAVR 131

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSS----------LCPPPASTTIELTAALLAKDRL 181
           +        A      + D+ +   L S          +  PP    ++L   L    R 
Sbjct: 132 HSVDEMCRHADSSDTLEADSGSTAILLSRYLDGTQADLIYSPPVPVMLDLLRVL----RY 187

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NA  +        D  + V A G+Y + S  NH C PN            + +++ +R I
Sbjct: 188 NAVTIT------NDSLQDV-ALGLYTEVSAMNHSCAPNVVLIF-------SGSEVTLRTI 233

Query: 242 HDVPQGREICLSYFPVNYD-YSTRQKRLLYDYDFACDCDRCKVEANWSDN-DNDD 294
             V  G E+ +SY  V     + R +RL   Y F C C+RC  E  + D+ DN D
Sbjct: 234 RAVEDGAELFISYVDVCISPKAKRCQRLRDQYKFDCSCERCTREDEYLDDVDNGD 288


>gi|49533764|gb|AAT66763.1| Putative TPR domain containing protein, identical [Solanum
           demissum]
          Length = 438

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           V   G+++ +SF NH C PN  R    D        +++    D+  G+E+  +YF V  
Sbjct: 281 VHGIGLWILSSFINHSCDPNVRRSHVGDH-------VMIHACRDIKAGKELTFAYFDVFT 333

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEA 285
            +  R+++   ++ F C C RC +E 
Sbjct: 334 PFRDREEK-AKNWGFVCKCKRCNLEK 358


>gi|429861968|gb|ELA36630.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 735

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 38/80 (47%), Gaps = 10/80 (12%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY--DY 261
           GI+  AS  NH CL N CR    D        +IVR   D+P G E+   Y  ++   DY
Sbjct: 495 GIWTIASHINHSCLENCCRSFIGDV-------MIVRASRDMPAGTELVFCYQSIDIGDDY 547

Query: 262 STRQKRLLYDYDFACDCDRC 281
              QKR    + F C+C+ C
Sbjct: 548 KATQKR-FSRWGFTCECELC 566


>gi|303278994|ref|XP_003058790.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226459950|gb|EEH57245.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 771

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
           ++R   ++  AS  NH C PN C   YVD     +  ++V     +  G E+ ++Y  V+
Sbjct: 400 AMRGAAVFPLASSLNHSCDPN-CEVAYVD-----DARVLVVARRTLKPGEELTIAYVDVD 453

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEA 285
            D   R+  L   Y F C C+RC  EA
Sbjct: 454 ADVGERRDELREVYGFECVCERCSREA 480


>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
          Length = 436

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 101/285 (35%), Gaps = 51/285 (17%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + +    G+++L   P  + + L      S C  CF +      S C  C  A
Sbjct: 19  GKGRGLRARRRFAVGELLL-SCPAYV-AVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74

Query: 75  F-CSPKCST--------------AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA 119
           F C+ +C                    + H     +   R+L      A       + Q+
Sbjct: 75  FYCNVECQKEDWPMHKLECASMCTFGQNWHPSETVRLTARIL------AKQKTHPERTQS 128

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALL 176
             L+A         S    L   +  +  ND +A H+  S     P  A+  +     L 
Sbjct: 129 EKLLAVKEF----ESHLDKLDNEKRELIQNDIAALHHFYSKHLDYPDNAALVV-----LF 179

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           A+   N F +       +D   S     I+   +  NH C PN     Y    AE     
Sbjct: 180 AQVNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE----- 226

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            VR + ++  G EI  SY  + Y    R  RL   Y F+CDC  C
Sbjct: 227 -VRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCREC 270


>gi|113205454|gb|AAW28574.2| TPR domain containing protein, putative [Solanum demissum]
          Length = 569

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 8/91 (8%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           V   G+++ +SF NH C PN  R    D        + +    D+  G E+  +YF V  
Sbjct: 318 VHGIGLWILSSFINHSCDPNVRRSHIGD-------HVTIHACRDIKAGEELTFAYFDVFT 370

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
            +  R+++   ++ F C C RC +E   S N
Sbjct: 371 PFRDREEK-AKNWGFVCKCKRCNLEKGVSSN 400


>gi|167342092|gb|ABZ68994.1| SET domain-containing protein C [Pinus taeda]
 gi|167342094|gb|ABZ68995.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + +       + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFTHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|325088450|gb|EGC41760.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 439

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 43/87 (49%), Gaps = 5/87 (5%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPV 257
           RA G  L +++ NH CLPNA       A+  N  +   + V     + +G EI + Y  +
Sbjct: 111 RALG--LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESL 168

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
             D + RQ+ LL +Y F C C  C+ E
Sbjct: 169 YMDIAGRQQFLLQEYGFECSCRLCEKE 195


>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 555

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 71/315 (22%), Positives = 120/315 (38%), Gaps = 56/315 (17%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQ-IVLRDSPILLYSALPFINSSSFCHNCFRK 59
           + +V    T     G G GL +++ ++ G+ +VL   P +  + L      S C  C   
Sbjct: 54  IPSVYTKATTGPSPGSGTGLFASKDIQPGETVVLIQKPFV--AVLDTAQLESKCSGCLGA 111

Query: 60  TMHSSSSI----CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRL 100
             +   ++    C  C V  +C+  C        HS              P   +A+ R+
Sbjct: 112 HANRQEAVELKACTGCRVVKYCNKTCQAKDWKLFHSLECRIFQNLKPRVLPNNARAILRM 171

Query: 101 LNLNSPD---AANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL 157
           + L+      A  + L + L++         A  +  Q++ +L          T+ A   
Sbjct: 172 VMLSERGRYPAGEMELFKGLESHMKDVCIRNANGDKDQWERILL---------TAKAVKN 222

Query: 158 SSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHD 215
            S C    S    L A   AK  +N+F     +    D R      GIY+   A+  NH 
Sbjct: 223 YSGCAMDES----LIAEYAAKLDVNSFN----FDNVMDER-----LGIYMHPYAALMNHS 269

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFA 275
           C       DY    A ++  + V+ +H + +G +I ++Y      Y  RQK L   Y F 
Sbjct: 270 C-------DYNATVAFDDDRLHVKALHPIKKGEQIFITYVDTTNPYKIRQKELSDRYYFT 322

Query: 276 CDCDRCKVEANWSDN 290
           C C +C+  AN  ++
Sbjct: 323 CRCSKCQQGANGRED 337


>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
 gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
 gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
          Length = 421

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 77/187 (41%), Gaps = 29/187 (15%)

Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL-CPPPAS 166
           QL+  F+ A Y   +     P  F+ L A      QG  T + +   H   +L  PP   
Sbjct: 218 QLRRLFVDALYEERMSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALELPPRDR 277

Query: 167 TTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
             ++     L KD           IE   G   +    G+YL  S  NH C+PNA     
Sbjct: 278 EKLDALIDQLYKD-----------IEKVTGEFLNCEGSGLYLLQSCCNHSCVPNA----- 321

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
             +  +NN  + +  + D+  G EIC+SY      +    +RQK L  +Y F C C +C 
Sbjct: 322 EASFPDNNFILHLTALEDIQPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKCL 381

Query: 283 VEANWSD 289
            +A+  D
Sbjct: 382 AQADEPD 388


>gi|302686744|ref|XP_003033052.1| hypothetical protein SCHCODRAFT_107538 [Schizophyllum commune H4-8]
 gi|300106746|gb|EFI98149.1| hypothetical protein SCHCODRAFT_107538, partial [Schizophyllum
           commune H4-8]
          Length = 404

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 71/154 (46%), Gaps = 21/154 (13%)

Query: 172 TAALLAKDRLNAFGLMEPY----IEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           +  LL++ R NA+  +  Y    I+ QDG+       I+ + S  NH C PNA    + D
Sbjct: 188 SGPLLSRARTNAWHALGEYKFAGIDAQDGQ----YLAIFDELSRMNHSCRPNA--LYHWD 241

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSY-FPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           ++  + +   +R + D+  G EI +SY   V+  YS R+  LL  Y F CDC  C     
Sbjct: 242 SSTFSGS---LRAVRDIEPGEEITVSYCGEVDRPYSDRRA-LLAPYGFGCDCRAC----- 292

Query: 287 WSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
            ++ D  D   E+++ E  D     S D     G
Sbjct: 293 -AEGDAADIRYEQIIAEYDDLPFPLSSDKRVEQG 325


>gi|303272359|ref|XP_003055541.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463515|gb|EEH60793.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 896

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 15/114 (13%)

Query: 213 NHDCLPNACRFDYVDAAAENNTD--------IIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           NH C+PN C+ +        + D        + +  + DV +G EI  +Y P N  Y  R
Sbjct: 527 NHSCVPN-CQIEARIVPGGGDGDDETCGGLRVTLVALRDVAEGEEITTAYVPSNQRYFRR 585

Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMD----EDQDEQMVASDD 314
           ++ L   Y FACDC RC +EA  S   N    N +V+D     DQ ++    DD
Sbjct: 586 RRELRRRYGFACDCPRCALEA--SKTSNAPPPNLKVIDWLRLADQAQEEARYDD 637



 Score = 37.7 bits (86), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 29/53 (54%)

Query: 5  SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF 57
          S  V    IEGRGRGLV+T+++  G ++  + P+    A+   +S   C +C 
Sbjct: 30 SGRVRCVAIEGRGRGLVATRAVYKGDVIAVEKPLAATQAMGNRSSVLACAHCL 82


>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
 gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
          Length = 495

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 78/311 (25%), Positives = 116/311 (37%), Gaps = 70/311 (22%)

Query: 1   MSAVSE----TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNC 56
           M+AV+E       V + E  GR +V+  ++KAG+ VL + P+LL +     N    C NC
Sbjct: 1   MAAVTEDFAKKCEVKQNETLGRYVVAATNIKAGETVLLEQPVLLLAN----NGDRRCCNC 56

Query: 57  FRKTMHSSSSICPSCSVA-FCS-------PKCSTAA----------ASSSHSPYVCQALT 98
            + T    SS C  C +   C+       P C   A             +H+  VC AL 
Sbjct: 57  HQLT----SSFCGKCRLMPLCADCVEHHGPDCRRLAELQLQEQQVEQLQAHTE-VCSALK 111

Query: 99  RLLNLNSPDAAN-----LPLDRQLQARF---LIAAYNLAVVNPSQFQILLAFQGTVTDND 150
            LL     +  +     L L+  L  R    +  +Y   V+ P Q   LL       + D
Sbjct: 112 YLLLREHAEKRSHYEQLLQLEAHLARRRDTDIWRSYRAEVIEPLQASGLLLQLRNGAEID 171

Query: 151 TSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKAS 210
                              EL   LLA   +N F +  P   G     ++R  G+Y+ A+
Sbjct: 172 E------------------ELLQRLLAIADINGFEIRAPESGG-----AMR--GVYMTAA 206

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLY 270
            F H CLPN      +  A +    I +     +  G  +   Y  +      R++ L  
Sbjct: 207 LFAHSCLPN------LVTAVDEQRRIKLYANRIIAAGEILYNCYTDILLGTDERRRILKT 260

Query: 271 DYDFACDCDRC 281
              F C C RC
Sbjct: 261 GKCFDCQCARC 271


>gi|195056333|ref|XP_001995065.1| GH22837 [Drosophila grimshawi]
 gi|193899271|gb|EDV98137.1| GH22837 [Drosophila grimshawi]
          Length = 495

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 120/304 (39%), Gaps = 54/304 (17%)

Query: 1   MSAVSE----TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNC 56
           M+AVSE       V E E  GR ++++ +++ G+ VL + PILL S     N+   C +C
Sbjct: 1   MAAVSEDFAKKCEVKENETMGRYVIASTNIREGETVLCEQPILLLSN----NTDRRCSSC 56

Query: 57  FRKTMHSSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
           ++ T    +S C  C +    P C        H    C+   RL  L   +     L  Q
Sbjct: 57  YKLT----NSFCGKCRLL---PLCGECV---HHDARDCK---RLAELQLEEEQVEKLQAQ 103

Query: 117 LQARFLIAAYNLAVVNPSQ--FQILLAFQG--TVTDNDTSAAHYLSSLCPPPASTTI--- 169
            +    +    L     +Q  ++ LL  +    +  N      Y   +  P  ST +   
Sbjct: 104 AEVNNALTYLVLREHKETQPQYEQLLQMESHLALRRNTEIWRTYREQVVLPLQSTGLLQQ 163

Query: 170 ---------ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
                    EL   LL    +N F +  P   G     ++R  G+Y++A+ F H C PN 
Sbjct: 164 LRNGAEIDEELLQHLLGTTDVNGFEIRAPESGG-----TLR--GLYIRAALFPHSCTPN- 215

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYD--FACDC 278
                +  A +    I V     +P G EI  + +  N    T ++RL+      F C C
Sbjct: 216 -----LVTAIDEERRIKVYANRYIPAG-EILYNCY-TNILLGTEERRLILKQGKCFDCQC 268

Query: 279 DRCK 282
            RC+
Sbjct: 269 ARCQ 272


>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
          Length = 383

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 80/195 (41%), Gaps = 26/195 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPP 163
             L L R L    L   +      P  F+ L A      QG  T + +   H   +L  P
Sbjct: 184 GQLELLRLLFTEALYDEHLSRWFTPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP 243

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
                      +L ++ L+AF   +   +E + G   +    G+Y+  S  NH C+PNA 
Sbjct: 244 -----------MLQREELDAFIDQLYKDMEKESGEFLNCEGSGLYVLQSCCNHSCIPNA- 291

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDC 278
                 +  ENN  + +  + D+  G EIC+SY        +R  R   L  +Y F C C
Sbjct: 292 ----ETSFPENNFLLHLTALEDIEAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSC 347

Query: 279 DRCKVEANWSDNDND 293
            +C  +A+ +D  +D
Sbjct: 348 PKCLAQADDADVTSD 362


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 117/294 (39%), Gaps = 45/294 (15%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMH 62
           AV + V +     RGR L +T+++ AG ++   SP++L  +L  +  +  C  C R    
Sbjct: 4   AVPDGVVIKPHSTRGRALHATKTVAAGGVLAVFSPLILVPSLSHL--TIVCSFCLRGV-- 59

Query: 63  SSSSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQA-- 119
            +   C  C  A +C  +C  AA SS H+   C ALT  +    P A      R++ A  
Sbjct: 60  -TPRPCSRCRAAYYCDAQCQAAAWSSGHAAE-CAALTGAI----PPARR---RREIPAPV 110

Query: 120 RFLIAAYNLAVVNPSQFQILL-AFQGTV---------TDNDTSAAHYLSSLCPPPASTTI 169
           R L+    L        + LL   +G            D +  A    +      +   +
Sbjct: 111 RALVQLLLLLRSGGGHTRDLLDGLEGHAAQRRRAPGWADMELMAMAGCAFAGMETSEGAV 170

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVD 227
              A +L K + NAF   +  + G  G        I+L+ +    NH CLPNA    +V 
Sbjct: 171 REAAEILCKLQTNAFHRFDADL-GHVG--------IFLEPTLAMANHSCLPNAF-VQFVG 220

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             A      ++R    +  G EI +SY       S RQ   L  Y F C C RC
Sbjct: 221 RTA------VLRAEQRIQSGDEIEISYTDYTSSLSKRQA-ALAPYHFECRCRRC 267


>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
           C  C VA +CS KC   A         C+ L        PD+  L L R +         
Sbjct: 7   CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGTPS 63

Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           ++  L + Y+L                Q+++ FQ  + +    A+        PPA    
Sbjct: 64  ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 117

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E  A ++     N+F +    ++           G+Y   S  NH C PN        + 
Sbjct: 118 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 159

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             N   +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D
Sbjct: 160 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 215

Query: 290 NDNDD-ENNEEVMDEDQD 306
            D D    +E+V  E Q+
Sbjct: 216 KDADMLTGDEQVWKEVQE 233


>gi|19075913|ref|NP_588413.1| histone lysine methyltransferase Set5 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74626858|sp|O74467.1|SET5_SCHPO RecName: Full=SET domain-containing protein 5
 gi|3581881|emb|CAA20779.1| histone lysine methyltransferase Set5 (predicted)
           [Schizosaccharomyces pombe]
          Length = 319

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 39/86 (45%), Gaps = 15/86 (17%)

Query: 204 GIYLKASFFNHDCLPNAC-----RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
           G++L  S  NHDC PN       R D V   A          + D+  G EI  +Y  ++
Sbjct: 95  GMFLLGSRMNHDCSPNVKHTWNPRLDQVTVHA----------VRDIEAGEEILTTYIDLH 144

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVE 284
             ++ RQK LL  + F C C  C VE
Sbjct: 145 KSHTERQKILLEHFGFKCYCSVCSVE 170


>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
           mutus]
          Length = 221

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 11/111 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C PN        +   N   +++R + DV  G E+ + Y  +      
Sbjct: 22  GLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRAVRDVEAGEELTICYLDMLMTSEE 74

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDD 314
           R+K+L   Y F CDC RC+ +    D D D    +E + ++  E +   +D
Sbjct: 75  RRKQLRGQYCFDCDCFRCQTQ----DKDADMLTGDEQVWKEVQESLKKIED 121


>gi|294867794|ref|XP_002765240.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239865235|gb|EEQ97957.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 93

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/85 (38%), Positives = 42/85 (49%), Gaps = 10/85 (11%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--GREICLSYFPVNY 259
           A G Y   +  NH C PNA           NNT   + +I   P   G EI +SY P++ 
Sbjct: 11  ATGFYSTVARMNHSCSPNA-------KVIFNNTTNKMEVISLKPINIGDEIRISYVPIDL 63

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
           D +TR+ R L DY F C+C RC  E
Sbjct: 64  DLNTRRHR-LKDYGFLCNCQRCLTE 87


>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3 [Equus caballus]
          Length = 440

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 67/314 (21%), Positives = 114/314 (36%), Gaps = 67/314 (21%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF---RKTMHSSSSICPSCS 72
           +G GL +   L+ G+++ R  P L Y+     +    C  C     K M      C  C 
Sbjct: 14  KGNGLRALVPLRPGELLFRSDP-LAYTVCKG-SRGVVCDRCLLGKEKLMR-----CSQCR 66

Query: 73  VA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL-----------QAR 120
           VA +CS KC   A         C+ L        PD+  L L R +           +  
Sbjct: 67  VAKYCSAKCQKKAWQDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMEETPSESEKL 123

Query: 121 FLIAAYNLAVVNPSQF------------------QILLAFQGTVTDNDTSAAHYLSSLCP 162
           + +   +++V +                      Q+ + FQ  + +    A+        
Sbjct: 124 YHVLHNDISVDDGPHISWWAHINKLTEDKKEGLRQLAMTFQHFMREEIQDASQL------ 177

Query: 163 PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR 222
           PP+    E  A ++     N+F +    ++           G+Y   S  NH C PN   
Sbjct: 178 PPSFDVFEAFAKVIC----NSFTICNAEMQEV-------GVGLYPSMSLLNHSCDPNC-- 224

Query: 223 FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                +   N   +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+
Sbjct: 225 -----SIVFNGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 279

Query: 283 VEANWSDNDNDDEN 296
            +   +D    DE+
Sbjct: 280 TQDKDADMLTGDEH 293


>gi|398012025|ref|XP_003859207.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322497420|emb|CBZ32495.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 622

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/228 (27%), Positives = 95/228 (41%), Gaps = 30/228 (13%)

Query: 66  SICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ---LQAR 120
           S C +C+ VA+CSP C+ A     H  + C  + RL ++    +A  L +  Q   L   
Sbjct: 259 STCTTCNMVAYCSPACADAH-REQHERFECARMRRLSDMMEGIEARKLDVPEQFFELAYH 317

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPAS--TTIELTAALLAK 178
            +     +    P   ++L     T+T +       L  + P  +      E TA L   
Sbjct: 318 CITTLAGIKAKRPGHERVL-----TLTAHVDEVIQSLHPIGPLMSDLFKGEEETATLYTI 372

Query: 179 DRL---NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
             +   NA  + +P   G    +++ A      ASFFNH C PN C  D V  A      
Sbjct: 373 IGVLCCNALEVTDP--SGLGVAQALHAGNSI--ASFFNHSCTPN-CAIDTVRHAI----- 422

Query: 236 IIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDRCK 282
           I  R IH    G E+ ++Y P + +    R++RL   Y F C C RC+
Sbjct: 423 ITTRTIH---VGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQ 467


>gi|167342140|gb|ABZ69018.1| SET domain-containing protein C [Pinus sylvestris]
          Length = 141

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC +E
Sbjct: 65  RQVR-GENWGFECQCRRCILE 84


>gi|428672307|gb|EKX73221.1| conserved hypothetical protein [Babesia equi]
          Length = 402

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 49/98 (50%), Gaps = 6/98 (6%)

Query: 187 MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ 246
           +E  IE  +    +   GI+   S  NH C PN    DYVD    +NT  I  ++ ++  
Sbjct: 302 IEKCIESMELFPDIIGLGIFDNISKMNHSCRPNV-EIDYVD----DNTARI-NLLCNISA 355

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           G+E  +SY      +  RQK+L  +Y F CDC++C  E
Sbjct: 356 GQEATISYINEEDVFEIRQKKLSTNYGFQCDCNKCLEE 393


>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
 gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
          Length = 766

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 6/78 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++   +  NH+C PNA  +       +N    +VR   D+P+G EI  +Y  + +   T
Sbjct: 546 GLFPLTAIMNHECTPNASHY------FDNGRMAVVRAARDIPKGGEITTTYTKILWSNLT 599

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L    +F CDC RC
Sbjct: 600 RGIFLKMTKNFMCDCPRC 617



 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 6/87 (6%)

Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
           G   +    ++  A   NH C PNA          E+   I+V     +P+G EI ++Y 
Sbjct: 214 GGNEILVRALFPLAGLLNHHCTPNAAHH------FEDGETIVVSATERIPKGTEITMTYA 267

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCK 282
            + +    R+  L     F C C RC+
Sbjct: 268 KLLWSTLARKLFLAMTKHFICQCPRCQ 294


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
          Length = 428

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 102/283 (36%), Gaps = 43/283 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+G GL S + ++ G+++ R  P         +     C  C ++  H     C  C VA
Sbjct: 13  GKGSGLRSRRQVRPGELLYRAEPFAYVVTKEQLGG--VCEQCLQRNEHLHR--CSQCKVA 68

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL------PLDRQ---LQARFLIA 124
            +C   C   A         C+ L  +     PD+  L       L RQ   L  R    
Sbjct: 69  KYCGKSCQKEAWLDHKRE--CKCLQNVKPNFPPDSVRLAGRIVFKLLRQSACLSERL--- 123

Query: 125 AYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCP-----PPASTTIELTAALLAKD 179
            Y+   +  +  Q+    +  +     +   YL +        PPA    ++      K 
Sbjct: 124 -YSFKDLQSNAEQLSEEMKEGLGHLAHTLQLYLRAEIQDASHLPPAIDFFQI----FTKV 178

Query: 180 RLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
             N F +        +G       G+Y   S  NH C PN C   +          +++R
Sbjct: 179 TCNCFTI-------SNGEMQDVGVGLYPSMSLLNHSCDPN-CVIIF------EGYQLLLR 224

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            I ++  G E+ +SY       S RQK+L   Y F CDC  C+
Sbjct: 225 SIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDCCLCQ 267


>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
 gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
 gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
          Length = 369

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
           C  C VA +CS KC   A         C+ L        PD+  L L R +         
Sbjct: 3   CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPS 59

Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           ++  L + Y+L                Q+++ FQ  + +    A+        PPA    
Sbjct: 60  ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 113

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E  A ++     N+F +    ++           G+Y   S  NH C PN        + 
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 155

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             N   +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQ----D 211

Query: 290 NDNDD-ENNEEVMDEDQD 306
            D D    +E+V  E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229


>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
           Shintoku]
          Length = 150

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 8/106 (7%)

Query: 182 NAFGL--MEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
           N  GL  +   I+ ++        G++   S  NH C PN  + +Y     +NN   I  
Sbjct: 46  NQLGLDDINETIQKKELYPETMGLGLFNYLSKMNHSCEPN-LQIEY----TKNNIAHIAP 100

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           ++ DVP+G E  +SY         RQ++L  +Y F CDC++C +E 
Sbjct: 101 LV-DVPRGEEATISYIDEKDSVENRQEKLYKNYGFKCDCNKCILEG 145


>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 415

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 78/185 (42%), Gaps = 20/185 (10%)

Query: 109 ANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPPPAS 166
             L L R+L A  L          P  F+ L A  GT       +S + ++       A 
Sbjct: 215 GQLELLRRLFAEALYEEVLSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH------AC 268

Query: 167 TTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFD 224
             +EL +    +++L+AF   +   IE   G   +    G+++  S  NH C+PNA    
Sbjct: 269 DALELKSQ--DREQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA---- 322

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRC 281
              +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C +C
Sbjct: 323 -ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381

Query: 282 KVEAN 286
             EA+
Sbjct: 382 LAEAD 386



 Score = 38.9 bits (89), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/101 (22%), Positives = 41/101 (40%), Gaps = 27/101 (26%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +    
Sbjct: 32  KGKGLFATQLIRKGETIFVERPLVAAQFLWNALYHYRACDHCLRALEKAEENAQRLTGKP 91

Query: 68  -------------------CPSCSVAFCSPKCSTAAASSSH 89
                              CP C V +CS +C  AAA   H
Sbjct: 92  GQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 132


>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
 gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
          Length = 486

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/292 (22%), Positives = 114/292 (39%), Gaps = 33/292 (11%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKT--MH 62
           E+V V   EG+GRGL + +    G ++  +     Y+A+ F + +   CH CF++   +H
Sbjct: 2   ESVEVFTTEGKGRGLKAQKEFLPGDVIFAEPA---YAAVVFDSLTHVICHTCFKRQERLH 58

Query: 63  SSSSICPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF 121
                C  C  A+ C   C   A  +  +   C A+ +  +  +P   N+ L  ++  R 
Sbjct: 59  R----CGQCKFAYYCDRTCQRDAWLNHKNE--CSAIKK--HGKAP-TENIRLAARILWRI 109

Query: 122 L-----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
                 ++   L  ++  Q  +    +    D       +L           ++  + + 
Sbjct: 110 EREGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPAQSQQFGMQYISHIF 169

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC------RFDYVDAAA 230
                NAF L +     Q G ++V   GI+      NHDC PN          + V +  
Sbjct: 170 GVINCNAFTLSD-----QRGLQAV-GVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMF 223

Query: 231 ENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                I +R +  +  G E+ +SY         R+K+L   Y F C C+ CK
Sbjct: 224 HTQMRIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 275


>gi|363755786|ref|XP_003648109.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356892145|gb|AET41292.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 487

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/308 (20%), Positives = 121/308 (39%), Gaps = 56/308 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRD-------SPILLYSALPFINSSSFC-----HNCFRKTMHS 63
           +G+ L +   +  G+++L +       +P+  Y+ +    +   C      +C    MH+
Sbjct: 121 KGKALCTKVKIAKGELILHEEEPLTFIAPLEKYTLMQLSKACGTCGTSLNQSCHYYIMHN 180

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
               C +C++ +CS KC T   + S   Y+   +++    NS +   L  ++  +     
Sbjct: 181 LD--CDNCTIVWCSKKCKTIDTTHS---YLKHPMSKNKICNSQNW--LKFEQFCKENTWY 233

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL-------- 175
           +AY + ++   Q    L+ QG + D   ++   +S      A+ +  +   L        
Sbjct: 234 SAYAVGMIYARQN---LSDQGAIIDEQFNSLAKISQRIRLEAAESTNIGGTLDSGVDIIT 290

Query: 176 ----LAKDRLNAFGLMEPYIEGQD-------------GRRSVRAYG--IYLKASFFNHDC 216
                A      + L E +   QD             GR ++      IY   S  NH C
Sbjct: 291 TISSSAMWEEGFYLLCEAFPHLQDDNNFDLEAFLCDIGRFNLNQINGQIYPLFSHINHSC 350

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
            PN   F++      +   I      D+  G E+  +Y    +D ++R++ L  ++ F C
Sbjct: 351 EPNT-YFEF------DKHGIKAFARKDIAAGEELLTTYVNPLHDVNSRRRELCVNWGFLC 403

Query: 277 DCDRCKVE 284
           +C RCK E
Sbjct: 404 NCRRCKKE 411


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/273 (23%), Positives = 106/273 (38%), Gaps = 38/273 (13%)

Query: 27  KAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAA 84
           K G ++L++ P   +L+S       +  C  CF+       S C    V +C+  C   A
Sbjct: 7   KKGSLILKEKPFACVLHSRY----RAERCDKCFKPGKVLKCSNC--LYVRYCNRFCQKEA 60

Query: 85  ASSSHSPYVCQALTRLLNLNSPDAANL--PLDRQLQARFLIAAYNLAVVNPSQFQILLAF 142
                    C  L  +     PDAA +   + R+LQ                +F  L+  
Sbjct: 61  WPDHQDE--CGKLKAIGTRTVPDAALMISRIIRKLQKGGDYQKGYYTAKFYRRFHDLMTH 118

Query: 143 QGTVTDNDTSAAHY------LSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
           +  + ++     H+      L SL    A  +      +  K  +N+F +++      D 
Sbjct: 119 EDDIKNDAPRMEHFQSLLVVLRSLVEEAAMPSKAELLQIFGKMCINSFNILD------DE 172

Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGRE-----IC 251
             S+   G+YL  S  +H C PNA        A  + T I +R++ D   G E     I 
Sbjct: 173 MNSI-GTGMYLGISIIDHSCRPNA-------LATFDGTTIHLRLLEDY-HGSEVDFSKIF 223

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +SY  +      R+ RL   Y F C+C+RC+ E
Sbjct: 224 ISYIDLMNPAEERKARLRAQYYFECNCERCRDE 256


>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 697

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 9/89 (10%)

Query: 196 GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF 255
           G+      G++   + FNH C P   R+ ++      NT ++VR I  +P G EI  +Y 
Sbjct: 494 GKSLFIGGGVFPTVALFNHSCNPGVVRY-FI-----GNT-MVVRAIKTIPAGAEISENYG 546

Query: 256 PV--NYDYSTRQKRLLYDYDFACDCDRCK 282
           P+    + + R+++L   Y F CDC+ CK
Sbjct: 547 PIFTEEEENDRKRKLRLQYWFDCDCEACK 575


>gi|403354753|gb|EJY76937.1| TPR repeat-containing protein [Oxytricha trifallax]
          Length = 765

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/280 (20%), Positives = 103/280 (36%), Gaps = 66/280 (23%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           + +  +EG+ RG+++ + +K GQ+VL +               +F  N     M   S+ 
Sbjct: 408 IEIGFVEGKNRGIIAKEDIKKGQLVLVEK--------------AFSTN----EMRQDSNF 449

Query: 68  CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYN 127
            P   + F +   +     ++H             L +P+A  L        ++L    N
Sbjct: 450 DPKQCLVFLTDFTNVPLFRNTHIQL----------LENPNAEKL-------VKYLYNGQN 492

Query: 128 ----LAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKD--RL 181
               + +   +Q Q L   Q ++T    S     +    P     +EL  A   K+  ++
Sbjct: 493 GKLQVDIQELAQKQNLSESQQSLTYQQMSKMLKYNFCGCPSLLRNLELLKAFERKEEEKI 552

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
                ++P +                  SFFNHDC  N  RF   DAA      I+ +  
Sbjct: 553 EYINAIQPIL------------------SFFNHDCYANTSRFSIGDAAF-----IVAK-- 587

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            D+ +G E+   Y  +   +  R++     + F C C+ C
Sbjct: 588 KDIKKGEELTQFYISLALPFDEREQLTQKAWGFECRCNSC 627


>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
           leucogenys]
          Length = 369

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/258 (23%), Positives = 98/258 (37%), Gaps = 50/258 (19%)

Query: 68  CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQL--------- 117
           C  C VA +CS KC   A         C+ L        PD+  L L R +         
Sbjct: 3   CSQCRVAKYCSAKCQKKAWPDHKRE--CKCLKSCKPRYPPDSVRL-LGRVVFKLMDGAPS 59

Query: 118 QARFLIAAYNLAV--------VNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           ++  L + Y+L                Q+++ FQ  + +    A+        PPA    
Sbjct: 60  ESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQL------PPAFDLF 113

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           E  A ++     N+F +    ++           G+Y   S  NH C PN        + 
Sbjct: 114 EAFAKVIC----NSFTICNAEMQEV-------GVGLYPSISLLNHSCDPNC-------SI 155

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             N   +++R + D+  G E+ + Y  +      R+K+L   Y F CDC RC+ +    D
Sbjct: 156 VFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQTQ----D 211

Query: 290 NDNDD-ENNEEVMDEDQD 306
            D D    +E+V  E Q+
Sbjct: 212 KDADMLTGDEQVWKEVQE 229


>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 236

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 46/96 (47%), Gaps = 10/96 (10%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
           G+Y   S  NH C PNA   F +      +N  + V    ++  G EIC+SY     ++ 
Sbjct: 143 GLYELQSTINHSCSPNAEVTFPH------SNYQLAVVATDNINPGDEICISYLDMCSLSR 196

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
              +RQK L  +Y F C C +C+ E N  D  + DE
Sbjct: 197 SRHSRQKILQENYLFTCKCHKCEEEVNLPDETSSDE 232


>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 419

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+Y+  S  NH C+PNA          ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLYVLQSCCNHSCVPNA-----ETTFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ V  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETVFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCAVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|409048091|gb|EKM57569.1| hypothetical protein PHACADRAFT_251260 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 314

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 5/80 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+L AS  N  C PN  +     +  E    ++ R + DV  G E+C +Y  V      
Sbjct: 118 GIFLLASRINSSCTPNVSK-----SWDEIRNVMVFRTLRDVQDGEELCFNYCDVLSAQDE 172

Query: 264 RQKRLLYDYDFACDCDRCKV 283
           R++ L+ ++ F C CD C++
Sbjct: 173 RKRTLMEEFGFDCTCDACRL 192


>gi|170115671|ref|XP_001889029.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164635977|gb|EDR00277.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 692

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 43/94 (45%), Gaps = 12/94 (12%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS- 262
           GI L    FNH C PNA RF +       N    +  + D+P G EIC++Y      Y  
Sbjct: 490 GIGLTGCRFNHSCSPNA-RFSF----NPTNGHFRIFALSDIPVGEEICVAYLSSRKLYGQ 544

Query: 263 TRQKR---LLYDYDFACDCDRC---KVEANWSDN 290
           TRQ+R   L   Y F C C  C   K E   SD+
Sbjct: 545 TRQQRQGLLRSRYHFTCSCSVCSLPKAEMTLSDS 578


>gi|167342088|gb|ABZ68992.1| SET domain-containing protein C [Pinus taeda]
 gi|167342104|gb|ABZ69000.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  SF NH CLPN+ R        E  + + +     V +G EI   YF +      
Sbjct: 12  GLWLLPSFINHSCLPNSSRL-------EMGSAMFIHACKPVKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ     ++ F C C RC VE
Sbjct: 65  RQG-WCENWGFECKCRRCIVE 84


>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
          Length = 418

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 75/184 (40%), Gaps = 29/184 (15%)

Query: 116 QLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPAST 167
            LQ  F  A Y  A+     P  F+ L A      QG  T + +   H   +L   P   
Sbjct: 219 HLQRLFTEALYEEALSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDALELKPQD- 277

Query: 168 TIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDY 225
                     +++L+AF   +   IE   G   +    G+++  S  NH C+PNA     
Sbjct: 278 ----------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA----- 322

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDCDRCK 282
             +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C +C 
Sbjct: 323 ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCL 382

Query: 283 VEAN 286
            EA+
Sbjct: 383 AEAD 386


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 66/292 (22%), Positives = 108/292 (36%), Gaps = 49/292 (16%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI 67
           +TV     +GR LV+ +   AGQ VL   P +  + L   +  + C  CF+++   +   
Sbjct: 16  LTVINAAEKGRCLVADRDFYAGQTVLEQDPYV--AVLDGDSRGNRCDACFKQS--PALRR 71

Query: 68  CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNS----PDAANLPLDRQLQARFL 122
           C +C   F CSP C  +          C+ L RL              L + R+LQA  +
Sbjct: 72  CSACKFVFYCSPTCQRSQWKIHQEE--CKVLVRLSTEQKRPTLLLMLRLLVKRELQATGV 129

Query: 123 IAA-----YNLAVVNPSQF------QILLAFQGTVTDNDTSAAHYLSSLCPPPAST-TIE 170
           +       Y +    P+ F      ++++  Q  V       A Y   +         I 
Sbjct: 130 LPVTALDNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRIS 189

Query: 171 LTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACR-FDYVDAA 229
                +  D L   G+                 G++   S  NH C  N+   FD   A 
Sbjct: 190 CNGHTICDDELRPVGI-----------------GLFPVVSIINHSCSSNSLLLFDGKHA- 231

Query: 230 AENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                  +VR +  + +G E+ +SY  +    ++R++ L   Y F C C RC
Sbjct: 232 -------VVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRC 276


>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
 gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
 gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
          Length = 420

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 73/172 (42%), Gaps = 26/172 (15%)

Query: 132 NPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAF-G 185
            P  F+ L A      QG  T + +   H   +L  P           +L ++ L+AF  
Sbjct: 244 TPEGFRSLFALVGTNGQGIGTSSLSQWVHACDALDLP-----------MLQREELDAFID 292

Query: 186 LMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
            +   IE + G   +    G+Y+  S  NH C+PNA       +  +NN  + +  + D+
Sbjct: 293 QLYKDIEKESGEFLNCEGSGLYMLQSCCNHSCIPNA-----ETSFPDNNFLLYLTALEDI 347

Query: 245 PQGREICLSYFPVNYDYSTRQKR---LLYDYDFACDCDRCKVEANWSDNDND 293
             G EIC+SY        +R  R   L  +Y F C C +C  +A+  D  +D
Sbjct: 348 EAGEEICISYLDCCQRERSRHSRNKILRENYLFTCSCPKCLAQADDPDVTSD 399


>gi|336463860|gb|EGO52100.1| hypothetical protein NEUTE1DRAFT_132845 [Neurospora tetrasperma
           FGSC 2508]
          Length = 722

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 41/100 (41%), Gaps = 8/100 (8%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  +S  NH C+PN  R    D        +I R   DV +G E+   Y PV    
Sbjct: 466 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGDELFQQYVPVKTQV 518

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
             R K     + F C C  C+ E   S  +   +  E V+
Sbjct: 519 DVRNKEFEEGWGFECRCGLCEGERK-SSKERLKKRKEAVL 557


>gi|328871774|gb|EGG20144.1| hypothetical protein DFA_07264 [Dictyostelium fasciculatum]
          Length = 535

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 57/145 (39%), Gaps = 28/145 (19%)

Query: 167 TTIELTA-ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF-- 223
           T I+L    LL K R N FGL   Y     G        +YL  S FNH CLPN      
Sbjct: 339 TRIDLNILKLLGKVRANYFGLWN-YDNDVGGECYWAGAAVYLGLSLFNHSCLPNCSTIIQ 397

Query: 224 --DYVDAAAENNTD----------------------IIVRMIHDVPQGREICLSYFPVNY 259
             D+   ++ NN +                        +  + D+    E+ ++Y P++ 
Sbjct: 398 SEDFDRGSSANNYERMAVVAPDYMIDKELGRLNPLTFYIVALRDIQVDEELLITYIPLDQ 457

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
               R+ +L   + F CDC RCK E
Sbjct: 458 KLKERRNQLKSLWLFDCDCRRCKEE 482


>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 226

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 45/107 (42%), Gaps = 26/107 (24%)

Query: 210 SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLL 269
           S  NH C PN              T +++R + ++P+G E+ +SY  V      R+ +L 
Sbjct: 2   SLLNHSCDPNC-------VIVFEGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQ 54

Query: 270 YDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDA 316
             Y F CDC RC +                    D+DE M+A D +A
Sbjct: 55  RQYCFLCDCQRCLL-------------------RDKDEDMLAGDAEA 82


>gi|326518712|dbj|BAJ92517.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 166

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 54/116 (46%), Gaps = 15/116 (12%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAY---------GIYLKASFFNHDCLPNACRFDY 225
           +LA+ R+NAF +       +D   S  A           +Y+  SF+NHDC PN      
Sbjct: 53  VLARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWL 112

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            +A A+ NT      + D+ +G E+ + Y   + + + RQK L   + F C C RC
Sbjct: 113 ENADAKLNT------LRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRC 162


>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 356

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 7/85 (8%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A GIYL+AS  NH C PN        A   +   + +  + DV +G E  +SY       
Sbjct: 190 AVGIYLRASMLNHSCDPNC-------AWVCDGRKLQIMTVKDVKEGDECTISYVDAMKPA 242

Query: 262 STRQKRLLYDYDFACDCDRCKVEAN 286
             RQ  L   Y F C C +C  E N
Sbjct: 243 KVRQADLKESYHFTCKCVKCIEEIN 267



 Score = 38.9 bits (89), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 7/77 (9%)

Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
           NH C       DY      +   + +R I DV  G E  +SY  V      RQ +L  +Y
Sbjct: 3   NHSC-------DYNCIVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEY 55

Query: 273 DFACDCDRCKVEANWSD 289
            F C C +C  E N  D
Sbjct: 56  HFTCKCVKCVEEINRGD 72


>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
          Length = 251

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 17  GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEAGEELTICYLDMLMTS 69

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
             R+K+L   Y F CDC RC+ +    D D D    +E+V  E Q+
Sbjct: 70  EERRKQLRDQYCFECDCFRCQTQ----DKDADMLTGDEQVWKEVQE 111


>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
          Length = 423

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 113/297 (38%), Gaps = 62/297 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS----------- 65
           GR +V+T++++ G+++ RD+P ++    P + S   C  C R  M +++           
Sbjct: 21  GRYVVATRNIRQGEVIFRDAPAVVG---PKMASVPVCLGCNRDLMATATVRQQGDPNFRF 77

Query: 66  SICPSCSVAFCSPKCSTA---------AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQ 116
             C  C    C  +C  A          A SS+ P +C A        S     +PL   
Sbjct: 78  HQCSRCGWPVCGAECEAADQHRSECSIMAGSSYRPKICPAREEQARRESAYCVIVPL--- 134

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL--SSLCP----------PP 164
              R L+    L   +P  +  L   +  V +   S  + +  S+L P            
Sbjct: 135 ---RVLL----LQSHSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGLQQYS 187

Query: 165 ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFD 224
           A T +E+ A L      N F +  P     + R  VRA  +Y   +  +HDC PN     
Sbjct: 188 AETILEVCAML----DTNCFEIRLP-----ERRTKVRA--LYPLGAMLSHDCRPNT--KH 234

Query: 225 YVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           Y D A      +++    D+ +G  I  SY         R+  L     F C C+RC
Sbjct: 235 YFDDALR----MVLVATVDIERGATISTSYTQPLLGTLHRRLALKQSKHFDCQCERC 287


>gi|85086843|ref|XP_957767.1| hypothetical protein NCU00296 [Neurospora crassa OR74A]
 gi|28918862|gb|EAA28531.1| predicted protein [Neurospora crassa OR74A]
          Length = 510

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 60/152 (39%), Gaps = 17/152 (11%)

Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
           S  NHDC PNA  R+D+           ++  + D+  G EI +SY         RQK L
Sbjct: 284 SRLNHDCRPNADYRWDWNSNKGGPGLVQVITAVKDILPGEEITISYINPLGSRKARQKLL 343

Query: 269 LYDYDFACDCDRCKV------EANWSD----------NDNDDENNEEVMDEDQDEQMVAS 312
              + F C C+ C        EA+W             +  D  +E    E++    + S
Sbjct: 344 STAWGFECSCELCSRSGGRVEEADWRVRLIRKVWRLLKEGQDGGSEVERGEEEGGWKMGS 403

Query: 313 DDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTL 344
            +  E  G+ +   A   M  + + +  WGT+
Sbjct: 404 RERGEGKGEGDGTKAAELMVSLYEMEGLWGTI 435


>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 616

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 7/73 (9%)

Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
           NH C PNA       A   N   +++R +  + +G EI + Y  V+  +S RQ  L   Y
Sbjct: 296 NHSCDPNA-------AVVFNQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQAELKESY 348

Query: 273 DFACDCDRCKVEA 285
            F+C C +CK  A
Sbjct: 349 FFSCRCSKCKKGA 361


>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
           rerio]
          Length = 231

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/221 (22%), Positives = 88/221 (39%), Gaps = 23/221 (10%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPF-INSSSFCHNCFRKTMHSS 64
           E V V +  G+GRGL +T+   AG ++  + P   ++++ F   +SS CH+CFR+     
Sbjct: 2   EFVEVFDSPGKGRGLRATKEAWAGDVLFAEPP---FASVVFDSQASSICHSCFRR--QEK 56

Query: 65  SSICPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
              C  C  A +C   C  A          C A+        P + N+ L  ++  R   
Sbjct: 57  LQRCGQCRFAQYCDKTCQRAGWEE--HKLECAAIK---TYGKPPSENVRLAARILWRMDK 111

Query: 122 ---LIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAK 178
              +++   L  +   +  I    +  + D      ++L          T++  + +L  
Sbjct: 112 QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSVSHILGV 171

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN 219
              N F      +  Q G ++V   G++      NHDC PN
Sbjct: 172 INCNGF-----MVSDQRGLQAV-GVGLFPNLCLVNHDCWPN 206


>gi|384248030|gb|EIE21515.1| hypothetical protein COCSUDRAFT_56730 [Coccomyxa subellipsoidea
           C-169]
          Length = 494

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +Y++AS  NH C PN            + T   VR + D+ +G E+ +SY  +    S R
Sbjct: 426 LYVRASLLNHSCRPNC-------VVVRSMTSGSVRALRDIEEGEELTISYIDLGLPPSAR 478

Query: 265 QKRLLYDYDFACDCDR 280
              L  ++ F C CDR
Sbjct: 479 GDELRKNFFFECTCDR 494


>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 495

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 66/307 (21%), Positives = 108/307 (35%), Gaps = 62/307 (20%)

Query: 2   SAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLR-DSPILLYSALPFINSSSFCHNCFRKT 60
           S ++ TV      G G GL +T  +  G+ VL   +P L     P +  +  C  CF K 
Sbjct: 9   SVITRTVPQPAPNGMGNGLFATAPINPGEDVLHIKTPFLAVLDSPRLEDT--CAGCFGKR 66

Query: 61  MHSSSSICPSCS----VAFCSPKCSTAAASSSHS----------PYVCQALTRLLNLNSP 106
              + +   +C+    V +C   C +     +HS          P V     R L     
Sbjct: 67  QVETGNELKACTGCRVVKYCDRTCQSKDWKFAHSLECPIFKNVKPMVLPNNARALLRVVL 126

Query: 107 DAANLPLDRQLQARFLIAAYNLAVVNPSQFQI---------LLAFQGTVTDNDTSAAHYL 157
             A    D +    F     ++  ++ SQ Q+         +  + GT  D  T A++  
Sbjct: 127 RTARNKYDSEESKVFDGLETHINEISESQGQLDRINLTARAVKNYSGTEMDEGTVASY-- 184

Query: 158 SSLCPPPASTTIELTAALLAKDRLNAFGL-MEPYIEGQDGRRSVRAYGIYLKASFFNHDC 216
                   +  ++L +  L     +  GL M PY                  A   NH C
Sbjct: 185 --------AAKLDLNSFNLTTSMYDRIGLYMHPY------------------AGLINHSC 218

Query: 217 LPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFAC 276
                  DY      +  ++ V+ +  + +G +I +SY      Y  R+  L   Y F C
Sbjct: 219 -------DYNSTVGFDGEELYVKAMRPIKKGEQIFISYIDTTTPYDIRRNELKERYFFDC 271

Query: 277 DCDRCKV 283
            C +CK+
Sbjct: 272 QCTKCKM 278


>gi|157866065|ref|XP_001681739.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68125037|emb|CAJ03122.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 622

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 97/242 (40%), Gaps = 44/242 (18%)

Query: 59  KTMHSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNL-NSPDAANLPLDRQ 116
           K      S C +C+ V +CSP C+ A     H  + C  +TRL ++    +A  L +  Q
Sbjct: 252 KATKEKPSTCTACNMVVYCSPACADAH-REQHERFECARMTRLSDMMQGIEARKLDVPEQ 310

Query: 117 LQARFLIAAYN----LAVVNPSQ--FQILLAFQGTVTDNDTSAAHYLSSLC--------- 161
               F   AY+    LA +   +   + +L     V D    + H +  L          
Sbjct: 311 ----FFELAYHCITTLAGIKAKRAGHERVLTLTAHV-DEVIQSLHPIGPLMCDLFKGEER 365

Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
           P    T I +          NA  L +P   G    +++ A      ASFFNH C PN C
Sbjct: 366 PATLYTIIGVLCC-------NALELADP--SGLGVAQALHAGNSI--ASFFNHSCTPN-C 413

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKRLLYDYDFACDCDR 280
             D V  A      +  R IH    G E+ ++Y P + +    R++RL   Y F C C R
Sbjct: 414 AIDTVRHAI-----VTTRTIH---VGEELSIAYIPQLYWPTRLRRERLSESYYFVCRCQR 465

Query: 281 CK 282
           C+
Sbjct: 466 CE 467


>gi|380471164|emb|CCF47414.1| TPR domain-containing protein [Colletotrichum higginsianum]
          Length = 387

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 37/83 (44%), Gaps = 7/83 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++ +++ NH CLPNA R    D        +IV    D+P G EI   Y P    Y  
Sbjct: 146 GLWVHSAYANHACLPNANRSFIGDM-------LIVTATVDIPAGAEITHIYLPPKAAYLL 198

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R  +    + F C C  C  EA 
Sbjct: 199 RMPQFRSSWGFVCSCALCAGEAK 221


>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
          Length = 434

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 104/285 (36%), Gaps = 43/285 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL  +++   G+++    P   Y  L        C  CF  T     + C  C  A
Sbjct: 17  GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
           F C+  C     +       CQA+        P         +L AR +         +P
Sbjct: 73  FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETV-----RLVARIIARLKAQKERSP 125

Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
           S+  +LL   +  + D D      + AH       Y   L  P     +     L ++  
Sbjct: 126 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 181

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F +       +D   S     I+   +  NH C PN     Y    AE      VR 
Sbjct: 182 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 227

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           + D+  G+EI  SY  + Y  + R +RL   Y F+CDC  C  ++
Sbjct: 228 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 272


>gi|168051835|ref|XP_001778358.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670237|gb|EDQ56809.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1301

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 198  RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
            R V  +G++   SF NH CLP+ C      A       + V    D+  G E+   YF +
Sbjct: 955  RKVEHWGLWWLPSFINHSCLPSCCPILVGKA-------LFVFASRDLRAGDEVTRPYFDI 1007

Query: 258  NYDYSTRQKRLLYDYDFACDCDRCKVE 284
               +  R++     + F C C RCK+E
Sbjct: 1008 FQPFDQRRESSTQGWGFVCYCPRCKLE 1034


>gi|390603364|gb|EIN12756.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 450

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 113/274 (41%), Gaps = 35/274 (12%)

Query: 22  STQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV-AFCSPKC 80
           +T+S   G ++L D    L +A+   ++   C++C R T  S    C  C+   +CS +C
Sbjct: 24  ATRSFAPGDVILVDE--ALTTAILPSDAGRRCNHCLRLT--SDLKRCAGCAAYHYCSTQC 79

Query: 81  STAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL-----IAAYNLAVVNP-- 133
            +   S  H   +C+ ++  +   SP    L    ++ A  L     I+ +    + P  
Sbjct: 80  QSQQWSIHHK-RICKHISSFIA--SPAYQGLHEHEKITAVLLSHLCTISQHGKDSLRPAL 136

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
           S F+ LL    + T            +CPPP ST  +  A        N   ++  ++  
Sbjct: 137 STFEDLLPGPESATP----------PICPPPFSTEEQQKATYYHSKFGNNNFVIHSHL-- 184

Query: 194 QDGRRSVRAYGIYLKASF-FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
                +  A+GI+  +S  FNH C PNA    Y+    +     ++ + H +  G E+ +
Sbjct: 185 -----TSFAHGIFPMSSICFNHSCAPNAAA-RYILTPHQVPRMEVIALTH-IAAGTEVTI 237

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            Y       + RQ+     Y F C C  C ++A+
Sbjct: 238 PYLDPALPLANRQQITQITYGFICGCPLCTLQAS 271


>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/291 (20%), Positives = 108/291 (37%), Gaps = 49/291 (16%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTMHSSSSI----- 67
           E +GRG+ + +  K G ++L+  P   ++ + F + +   CH+CF   +           
Sbjct: 109 EEKGRGIKAARDFKVGNLILKAEP---FAFVIFDHMAEHVCHHCFNMVVRDRQGQPTTQL 165

Query: 68  --CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANL----------PLD 114
             C SC  A +CS +C   A S       C A+ R+    + D   +            +
Sbjct: 166 LRCSSCKFARYCSRECQKKAWSMHKKE--CMAIKRIAPRTASDEVRMVSQILWKQAERGE 223

Query: 115 RQLQARFLIAAYNLA-VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
           ++ ++  L     L   +N   F+ +      + +       Y      P +   I+   
Sbjct: 224 KRAKSEELCRVEELCDHLNDMSFEDV----NKLEEQSKEIGDYFGYENLPDSDEYID--- 276

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAE 231
                   + FG++          R ++  G+ +  +    NHDC PN         A  
Sbjct: 277 --------HLFGIVSCNGMSITDMRGLQYLGVAIHPTLNLINHDCNPNV-------VAVS 321

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
              +I VR I  + +G E+ +SY   +    TR+  L   Y F C C  C+
Sbjct: 322 CGPNIFVRAIKPIKEGDELFISYIDTSATSETRKNILKDQYYFDCTCKMCE 372


>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
 gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
           AFUA_2G10080) [Aspergillus nidulans FGSC A4]
          Length = 497

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 204 GIYLK--ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           GIYL   A+  NH C       DY      + ++I V+ I  +  G +I +SY    Y  
Sbjct: 209 GIYLHPYAALINHSC-------DYNAVVGFDGSEIFVKAIRPIATGEQIFISYIDTTYPT 261

Query: 262 STRQKRLLYDYDFACDCDRC 281
             RQK L   Y F C+C +C
Sbjct: 262 RIRQKELQERYFFTCNCAKC 281


>gi|350290486|gb|EGZ71700.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 488

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
           S  NHDC PNA  RFD+  +        ++  + D+  G EI +SY         RQK L
Sbjct: 226 SRLNHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLL 285

Query: 269 LYDYDFACDCDRC 281
              + F C C+ C
Sbjct: 286 STAWGFECSCELC 298


>gi|261330294|emb|CBH13278.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 713

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 74/211 (35%), Gaps = 56/211 (26%)

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
           P S +      L    + N FGL +P       +       +  +AS+FNH CLPN CR 
Sbjct: 542 PLSVSSSFFDRLCGAVQCNNFGLFDP-------KEDCIGVSVIPEASYFNHSCLPNLCRV 594

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRCK 282
                 A          + ++ +G  + + Y  V     + R++ LL  Y F C C RC 
Sbjct: 595 MCDGGIA------AFYALREIRKGEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC- 647

Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
                        N   V+D+  D       D A++               +C      G
Sbjct: 648 -------------NGSSVVDDKTDA------DAAKLR--------------LCGACAARG 674

Query: 343 TLAPLPPSDAT--------PSTVMECNVCGN 365
            L PLPP+DA              EC VC N
Sbjct: 675 YLRPLPPTDAAILCRWTMDSIATGECTVCHN 705


>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
          Length = 633

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 522 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 576

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 577 RHSRHKILRENYLFICSCPKCLAEAD 602


>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 303

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 99  GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 263 PKCLAEAD 270


>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
 gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 435

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 115/321 (35%), Gaps = 65/321 (20%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + +  K G +V    P   Y  L        C  CF  T     S C  C  A
Sbjct: 16  GKGRGLKAIKHFKVGDLVF-ACPAYAY-VLTVNERGGRCECCF--TRKEGLSKCGKCKQA 71

Query: 75  F-CSPKCSTA---------AASSSHSPYVC-----QALTRLLNLNSPDAANLPLDRQLQA 119
           + C+ +C            +A  ++    C     + + R++          P +R L  
Sbjct: 72  YYCNVECQRGDWPMHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTL 131

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALL 176
           R L A              L   +  + D D +A H+  S     P  A+ T      L+
Sbjct: 132 RELEA----------HLDKLDNEKNEMNDTDIAALHHFYSRHLDFPDNAALT-----ELI 176

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDI 236
           A+   N F +       +D   S     ++   +  NH C PN     Y    AE     
Sbjct: 177 AQVNCNGFTI-------EDEELSHLGSALFPDVALMNHSCSPNVI-VTYKGTVAE----- 223

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
            VR + ++    EI  SY  + Y    R +RL   Y F CDC  C        + + DE 
Sbjct: 224 -VRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKEC-------TSKSKDEA 275

Query: 297 NEEVM-------DEDQDEQMV 310
             E+        +E++ +QMV
Sbjct: 276 KMEIRQKLSIPPEEEEIKQMV 296


>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
           paniscus]
          Length = 304

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 99  GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 263 PKCLAEAD 270


>gi|336469834|gb|EGO57996.1| hypothetical protein NEUTE1DRAFT_110161 [Neurospora tetrasperma
           FGSC 2508]
          Length = 449

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 34/73 (46%), Gaps = 1/73 (1%)

Query: 210 SFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRL 268
           S  NHDC PNA  RFD+  +        ++  + D+  G EI +SY         RQK L
Sbjct: 226 SRLNHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLL 285

Query: 269 LYDYDFACDCDRC 281
              + F C C+ C
Sbjct: 286 STAWGFECSCELC 298


>gi|449544639|gb|EMD35612.1| hypothetical protein CERSUDRAFT_124865 [Ceriporiopsis subvermispora
           B]
          Length = 406

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  S FNH C PN  R    DAAA+    +  R++  V  G E+C +Y  V    + 
Sbjct: 209 GLFLLGSRFNHSCTPNVAR--TWDAAAKA---MAFRVLRAVEPGEELCTNYVDVLGTRAE 263

Query: 264 RQKRLLYDYDFACDCDRC 281
           R   L   Y FAC C  C
Sbjct: 264 RAAELEEKYAFACLCSVC 281


>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
           troglodytes]
          Length = 299

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 99  GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 150

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 151 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 206

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 207 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 262

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 263 PKCLAEAD 270


>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
           boliviensis boliviensis]
          Length = 419

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271

Query: 161 CPPPASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P             +++L+AF   +   IE   G   +    G+++  S  NH C+P
Sbjct: 272 ELKPQD-----------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVP 320

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 376 CSCPKCLAEAD 386


>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 678

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 72/323 (22%), Positives = 115/323 (35%), Gaps = 67/323 (20%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPI--LLYSALPFINSSSFCHNCFRKTMHSSSSICPSCS-V 73
           GR  ++T+ ++ G+I+  + P    L +    IN    C  CF K      ++C +CS V
Sbjct: 253 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLIN----CFYCFTKIFVPVPAVCQTCSCV 308

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARF--LIAAYNLAVV 131
           A+CS  C    A    +   C  L  L    +     L L   +Q  F  L    +L   
Sbjct: 309 AYCSISCRNKDAKIHENE--CPILPTLWLSKTSINCFLALRIIVQQSFDKLYKLKDLKTN 366

Query: 132 NPSQFQI----------------LLAFQGTVTDNDTSAAHYLSSLC----------PPPA 165
           +  +F++                L+  +   T  D     Y++S            P   
Sbjct: 367 SKDKFEVSASEPYRSDDFKIMFRLVTHEDKRTIEDLFHRTYIASWLLRLLKKGPYFPKHV 426

Query: 166 STTIELTAAL--------------LAKDRLNAFGLMEPYIEGQD-----GRRSVRAYGIY 206
            T   + A L              L   + NA  + E  I   D      +      G+Y
Sbjct: 427 KTPDTVEAKLSDGELYIGGLILHNLMTIQFNAHEISELVIPKADNNLANAKSKFIGGGLY 486

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTR 264
              S FNH C P   R+          T ++VR I  +  G EI  +Y  +        R
Sbjct: 487 PTISLFNHSCNPGIIRYFI-------GTTMVVRAIRSISSGEEISENYGQIFTTTPEKER 539

Query: 265 QKRLLYDYDFACDCDRCKVEANW 287
           +++L   Y F C+C+ C+   +W
Sbjct: 540 KRKLRLQYFFDCNCEACR--EHW 560


>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Ailuropoda melanoleuca]
 gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
          Length = 417

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/183 (27%), Positives = 75/183 (40%), Gaps = 29/183 (15%)

Query: 117 LQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSLCPPPASTT 168
           LQ  F  A Y  A+     P  F+ L A      QG  T + +   H   +L   P    
Sbjct: 220 LQRLFTEALYEEALSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDALELKPQD-- 277

Query: 169 IELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYV 226
                    +++L+AF   +   IE   G   +    G+++  S  NH C+PNA      
Sbjct: 278 ---------REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-----E 323

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCKV 283
            +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C +C  
Sbjct: 324 TSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCLA 383

Query: 284 EAN 286
           EA+
Sbjct: 384 EAD 386



 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCAVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|72392607|ref|XP_847104.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175618|gb|AAX69751.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70803134|gb|AAZ13038.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 713

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 74/211 (35%), Gaps = 56/211 (26%)

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRF 223
           P S +      L    + N FGL +P       +       +  +AS+FNH CLPN CR 
Sbjct: 542 PLSVSSSFFDRLCGAVQCNNFGLFDP-------KEDCIGVSVIPEASYFNHSCLPNLCRV 594

Query: 224 DYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN-YDYSTRQKRLLYDYDFACDCDRCK 282
                 A          + ++ +G  + + Y  V     + R++ LL  Y F C C RC 
Sbjct: 595 MCDGGIA------AFYALREIRKGEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC- 647

Query: 283 VEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWG 342
                        N   V+D+  D       D A++               +C      G
Sbjct: 648 -------------NGSSVVDDKTDA------DAAKLR--------------LCGACAARG 674

Query: 343 TLAPLPPSDAT--------PSTVMECNVCGN 365
            L PLPP+DA              EC VC N
Sbjct: 675 YLRPLPPTDAAILCRWTMDSIATGECTVCHN 705


>gi|154337874|ref|XP_001565163.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134062210|emb|CAM36598.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 466

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 7/89 (7%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           ++  G+Y   + FNH C+PN      VD   E    I+++    +  G  + ++Y P++ 
Sbjct: 368 LKGAGLYSLLACFNHSCVPNVA-VSNVDGTHE----IVLKTTRPIQAGEPLTITYIPLSA 422

Query: 260 DYSTR--QKRLLYDYDFACDCDRCKVEAN 286
              +R  ++R L +Y F C C RC  EA 
Sbjct: 423 GAMSRAERQRQLRNYFFTCHCPRCIKEAG 451


>gi|429855146|gb|ELA30117.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 742

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 7/83 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++ +++ NH CLPN C   +V         + VR   D+P+G EI   Y P    +  
Sbjct: 501 GLWVYSAYANHSCLPN-CNRSFVGDM------LFVRATVDIPKGTEITHIYLPPKAAFLL 553

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R  +    + F C CD C  EA 
Sbjct: 554 RVPQFRRTWGFECSCDLCDGEAK 576


>gi|343427685|emb|CBQ71212.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 488

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 114/325 (35%), Gaps = 60/325 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDS---PILLYSALPFINSSSFCHNCFRKTMHSSSSI----C 68
           +G+GL++T++     I+  +    P     AL  I     C +CF     +  ++    C
Sbjct: 114 KGKGLLATRAFARDDIIFTEDAYIPTPPPHALDQITRGELCAHCFLPISSAPVALAVKDC 173

Query: 69  PSCSVAFCSPKCSTAAASSSHSPY-------------------------VCQALTRLLNL 103
             C   FC+  C  A  ++ H+                           V +ALTR+L+ 
Sbjct: 174 QRCKQRFCTTACFRAGMATHHALLCVGHNPGARGLMELVVREKWQSLHSVARALTRVLST 233

Query: 104 NSPDAANLPLDRQLQARFLIAAY-----------NLAVV--------NPSQFQILLAFQG 144
            +  A    L R  +A  L  AY           + A V        NP       +F  
Sbjct: 234 LAYGA----LKRVGEAEELEQAYGDFETVYSRISSFATVSELERRARNPGWSTEKPSFDA 289

Query: 145 TVTDNDTSAAHYLS--SLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRA 202
            +     +    L   S  PPPA +T   T   L K   +   L  P + G+      + 
Sbjct: 290 LLAQAHAALRTALDPYSPTPPPAISTPLDTQKPLIKHLFDPATL--PKLLGRANINMEKF 347

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
            G+Y   SF NH C PN       +     +  +    +  + Q  E+ +SY        
Sbjct: 348 GGLYSLHSFLNHSCHPNVQIRHMPERGILASMKVAAIALRPIAQNDELVISYIDPTTSLG 407

Query: 263 TRQKRLLYDYDFA-CDCDRCKVEAN 286
            RQ  L  DY F  C CD+C  E N
Sbjct: 408 RRQLLLYRDYCFGPCTCDKCSHELN 432


>gi|295656875|ref|XP_002789017.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226285046|gb|EEH40612.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 442

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYV---DAAAENNTDIIVRMIHDVPQGREICLSYF 255
           +VR   + L +++ NH CLPNA    +    +   E    + V     V QG EI + Y 
Sbjct: 110 AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNNDKEERYFLTVYACKKVAQGEEITIPYD 169

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            +  D ++RQ+ LL +Y F C C  C  E
Sbjct: 170 VLYMDRASRQQFLLQEYGFECACKLCGKE 198


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 449

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 115/298 (38%), Gaps = 51/298 (17%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSS-SFCHNCFR-------- 58
           V+V E  G G GL  T  ++ G+ +L     L       +N + S+C +  +        
Sbjct: 5   VSVQEEPGAGLGLGVTADIERGECLLEKRLDLYVLNNELLNEACSYCCSQTKPTKRCAAC 64

Query: 59  KTMHSSSSICPSCSVAFCSPKCSTAAASSSHS--PYVCQALTRLLNLNSPDAANLPLDRQ 116
           K +H  S IC     +    +C     +S++   P VC+ L RL +    D +     + 
Sbjct: 65  KQVHYCSKICQKQDWSMHKLECKALRNASTNGLLPTVCRLLIRLYSQTQKDQSLFADCKD 124

Query: 117 LQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALL 176
            +A      +N A+ + ++               ++A+HY         +  +    +L 
Sbjct: 125 HEAEI---QHNTALWSDAELI------------SSAASHY-------TEARDVNAFLSLF 162

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNT 234
            K  +NA  L+ P  +         A G  + ++    NH C PN C F    A A    
Sbjct: 163 CKLSINAMSLVTPAFD---------AIGTCMDSTLARINHSCQPN-CVFMVEGATAR--- 209

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
              +  +  + +G  + +SY      Y+ R   L   Y F C C +C  EA  SDN++
Sbjct: 210 ---LVALQTLQKGDFVHISYVDTTLPYNVRTNELQQKYFFTCTCQKCMEEARRSDNEH 264


>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
          Length = 420

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       A  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETAFPENNFLLHVTALEDINPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSH 89
                               CP C V +CS +C  AAA   H
Sbjct: 91  PGQVLPHPELCTVHKELHQNCPHCQVMYCSAECRLAAAEQYH 132


>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
          Length = 352

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 65/160 (40%), Gaps = 24/160 (15%)

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDG 196
           Q+++ FQ  + +    A+        PPA    E  A ++     N+F +    ++    
Sbjct: 70  QLVMTFQHFMREEIQDASQL------PPAFDLFEAFAKVIC----NSFTICNAEMQEV-- 117

Query: 197 RRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
                  G+Y   S  NH C PN        +   N   +++R + D+  G E+ + Y  
Sbjct: 118 -----GVGLYPSISLLNHSCDPNC-------SIVFNGPHLLLRAVRDIEVGEELTICYLD 165

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           +      R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 166 MLMTSEERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 205


>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
 gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
          Length = 660

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 124/324 (38%), Gaps = 49/324 (15%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           VS  V +      GR   ++  +K G  +L + P +  S L    + + C NCF +T+  
Sbjct: 237 VSPLVRIDSNRQEGRFARASADVKPGDELLVERPFV--SVLLEKFAKTHCENCFMRTVVP 294

Query: 64  SSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLL----NLNSPDA----ANLPLD 114
            +  CP C+ V +CS +C   A+   H  Y C  +  +     ++N+  A    A+ PLD
Sbjct: 295 VA--CPRCADVLYCSEQCRQEASKKYHK-YECGIVPIIWRSGASINNHIALRIIASKPLD 351

Query: 115 RQLQARFLI----AAYNLAVVNPSQFQILLAFQGTVTDNDTS-------AAHYLSSLC-- 161
             L+ +  I        L  +    F+ +   +    +   S        A +L+     
Sbjct: 352 YFLKLKPTIDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHVLMARFLTHCLRA 411

Query: 162 -------PPPASTTIELTAAL--LAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKAS 210
                  P PA  +I  +  L  L   + N   + E +     GR      G  IY   +
Sbjct: 412 GGYFGAEPKPAEVSIICSLVLRSLQFIQFNTHEVAELHKFSSSGREKSIFIGGAIYPTLA 471

Query: 211 FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV--NYDYSTRQKRL 268
            FNH C P   R+          T I +  +  +  G  I  +Y P+    + + RQ RL
Sbjct: 472 LFNHSCDPGVVRYF-------RGTTIHINSVRPIEAGLPINENYGPMYTQDERAERQARL 524

Query: 269 LYDYDFACDCDRCKVEANWSDNDN 292
              Y F C CD C    NW   D+
Sbjct: 525 KELYWFECSCDACI--DNWPKFDD 546


>gi|302918997|ref|XP_003052770.1| hypothetical protein NECHADRAFT_78092 [Nectria haematococca mpVI
           77-13-4]
 gi|256733710|gb|EEU47057.1| hypothetical protein NECHADRAFT_78092 [Nectria haematococca mpVI
           77-13-4]
          Length = 563

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 35/186 (18%)

Query: 142 FQGTVTDNDTSAAHYLS---SLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
           F+  +TD+    A  L+   SL P   +T   LT+    +    A+G+        +  +
Sbjct: 55  FRSAMTDDQAVKAELLNKVNSLHPAHKTTFFALTSLQKYEPDHPAWGIF-----CSNALK 109

Query: 199 SVRA-YGIYLKASFFNHDCLPNACRFDYVDAAAENNT--DIIVRMIHDVPQGREICLSYF 255
            V+  YG+Y  A+  NH C+PNA         A N T   + +  + D+ +G EI + Y 
Sbjct: 110 EVKGMYGLYPTAARINHSCVPNA-------HYARNTTLGKLTLHAVKDIAEGDEITIFYL 162

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDD 315
            +    + RQK+L     F C C  C +E             E + + DQ  Q+  S D 
Sbjct: 163 DIYNTRAERQKKLC---GFTCTCSLCSLEG------------ERLKESDQRLQL--SFDL 205

Query: 316 AEVHGD 321
            E+ GD
Sbjct: 206 YELLGD 211


>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
          Length = 287

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +      
Sbjct: 55  GLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTSEE 107

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 108 RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 140


>gi|167521854|ref|XP_001745265.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776223|gb|EDQ89843.1| predicted protein [Monosiga brevicollis MX1]
          Length = 630

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 55/125 (44%), Gaps = 16/125 (12%)

Query: 166 STTIELTAALLAKDRLNAFGLMEPYIE--GQDGR----RSVRAYGIYLKASFFNHDCLPN 219
           +T   L   LL   R N  GL +  ++   Q+G       V   G+++  S FNH C+PN
Sbjct: 345 TTVTGLRQHLLRVLRHNQHGLAQVELQPSEQNGEVGVVHQVYGAGLFVHGSLFNHSCVPN 404

Query: 220 ACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD---YSTRQKRLLYDYDFAC 276
                + D       +++      +P   E+ +SY P+      ++ RQ +L   ++FAC
Sbjct: 405 VHLHFHGD-------ELVATASKPIPANSELTISYGPLAVRDAWHAARQTQLRNTFNFAC 457

Query: 277 DCDRC 281
            C  C
Sbjct: 458 QCIAC 462


>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
           aries]
          Length = 369

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 7/93 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C PN        +   N   +++R + DV  G E+ + Y  +      
Sbjct: 137 GLYPSMSLLNHSCDPNC-------SIVFNGPHLLLRTVRDVEAGEELTICYLDMLMTSEE 189

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 190 RRKQLRDQYCFDCDCFRCQTQDKDADMLTGDEQ 222


>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
 gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
          Length = 239

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 12/106 (11%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 5   GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDD-ENNEEVMDEDQD 306
             R+K+L   Y F CDC RC+ +    D D D    +E+V  E Q+
Sbjct: 58  EERRKQLRDQYCFECDCFRCQTQ----DKDADMLTGDEQVWKEVQE 99


>gi|240282313|gb|EER45816.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
          Length = 437

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 207 LKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYFPVNYDYST 263
           L +++ NH CLPNA       A+  N  +   + V     + +G EI + Y  +  D + 
Sbjct: 115 LDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYESLYMDIAG 174

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ+ LL +Y F C C  C+ E
Sbjct: 175 RQQFLLQEYGFECSCRLCEKE 195


>gi|225684199|gb|EEH22483.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 443

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 3/89 (3%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYF 255
           +VR   + L +++ NH CLPNA    +     ++  +   + V     V QG EI + Y 
Sbjct: 110 AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNDDREERYFLTVYACKKVAQGEEITIPYD 169

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            +  D ++RQ+ LL +Y F C C  C  E
Sbjct: 170 VLYMDRASRQQFLLQEYGFECACKLCGKE 198


>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
          Length = 437

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 104/285 (36%), Gaps = 43/285 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL  +++   G+++    P   Y  L        C  CF  T     + C  C  A
Sbjct: 17  GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
           F C+  C     +       CQA+        P         +L AR +         +P
Sbjct: 73  FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETV-----RLVARIIARLKAQKERSP 125

Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
           S+  +LL   +  + D D      + AH       Y   L  P     +     L ++  
Sbjct: 126 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 181

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F +       +D   S     I+   +  NH C PN     Y    AE      VR 
Sbjct: 182 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 227

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           + D+  G+EI  SY  + Y  + R +RL   Y F+CDC  C  ++
Sbjct: 228 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 272


>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 419

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 379 PKCLAEAD 386


>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
           anubis]
          Length = 258

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 24  GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 76

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 77  EERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 111


>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
           abelii]
 gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
           gorilla gorilla]
 gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
 gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
          Length = 258

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + D+  G E+ + Y  +    
Sbjct: 24  GVGLYPSISLLNHSCDPN-CSIVF------NGPHLLLRAVRDIEVGEELTICYLDMLMTS 76

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 77  EERRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 111


>gi|310791379|gb|EFQ26906.1| TPR domain-containing protein [Glomerella graminicola M1.001]
          Length = 749

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 40/93 (43%), Gaps = 7/93 (7%)

Query: 193 GQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICL 252
           G D  +   + G+++ A++ NH CLPN  R    D        +I     D+P G EI  
Sbjct: 497 GWDIWKQGMSRGLWVHAAYANHACLPNTNRSFIGDM-------LIATATVDIPAGTEITH 549

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
            Y P    Y  R  +  + + F C C  C+ E 
Sbjct: 550 IYLPPKAAYLLRMSQFRHSWGFVCSCALCEGEG 582


>gi|116181336|ref|XP_001220517.1| hypothetical protein CHGG_01296 [Chaetomium globosum CBS 148.51]
 gi|88185593|gb|EAQ93061.1| hypothetical protein CHGG_01296 [Chaetomium globosum CBS 148.51]
          Length = 703

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 37/83 (44%), Gaps = 7/83 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++ AS+ NHDC+ N  R    D        ++ R   D+ +G E+   Y PV      
Sbjct: 459 GLWVHASYMNHDCVANTMRSFVGDM-------LVSRATRDIEEGEELFQQYVPVKPLPDV 511

Query: 264 RQKRLLYDYDFACDCDRCKVEAN 286
           R ++    + F C C  C  EA 
Sbjct: 512 RNRQFKEGWGFECGCTLCTAEAR 534


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
           gallopavo]
          Length = 427

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 58/277 (20%), Positives = 101/277 (36%), Gaps = 32/277 (11%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+G GL S + ++ G+++ R  P         +     C  CF++  H     C  C VA
Sbjct: 13  GKGSGLRSRRRVRPGELLYRAEPFAYVVTKEQLGG--VCEQCFQRNEHLHR--CSQCKVA 68

Query: 75  -FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
            +C   C             C     L N+      N P   +L  R +   + L   + 
Sbjct: 69  KYCGKSCQKEGWLDHKRECKC-----LRNVK----PNFPXSVRLAGRIV---FKLLRQSA 116

Query: 134 SQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLN-AFGLMEPYIE 192
              + L +F    ++ +  +      L     +  + L A +     L+ A    + + +
Sbjct: 117 CLSEKLYSFSDLQSNAEQLSEEMKEGLRHLAHTLQLYLRAEIQDASHLSPAIDFFQIFTK 176

Query: 193 G-------QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
                    +G       G+Y   S  NH C PN C   +          +++R + ++ 
Sbjct: 177 VTCNCFTISNGEMQDVGVGLYPSMSLLNHSCDPN-CVIIF------EGYQLLLRSVREIQ 229

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            G E+ +SY       S RQ++L   Y F CDC  C+
Sbjct: 230 IGEELTVSYIESLMPTSERQEQLKRQYCFECDCCLCQ 266


>gi|346976798|gb|EGY20250.1| Mcg1p [Verticillium dahliae VdLs.17]
          Length = 413

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 43/100 (43%), Gaps = 15/100 (15%)

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI--HDVPQGREICLSYFPVNYD 260
           +G++   S  NHDC PNA       AA  + T ++ R     D+  G EI +SY      
Sbjct: 229 FGVFPAISRLNHDCGPNA-------AAWTDKTGLVHRAFASKDIAAGEEISISYVDALAP 281

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
            + R+ R+   + F C C RC      +  D  D    E+
Sbjct: 282 RAERRARMAGSWGFECGCRRC------AAGDESDARVAEI 315


>gi|145502993|ref|XP_001437474.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404624|emb|CAK70077.1| unnamed protein product [Paramecium tetraurelia]
          Length = 367

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 66/281 (23%), Positives = 110/281 (39%), Gaps = 47/281 (16%)

Query: 27  KAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSPKCSTAAAS 86
           K  Q+V+R  P+ +     F   ++ C+ C + +  +  SIC      +CS  C      
Sbjct: 17  KQDQVVMRCDPLAIVLFKEF--RANHCNYCLQGSQTNRCSICKQ--YYYCSVSCQKNDWK 72

Query: 87  SSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTV 146
              +   CQ L+++          +P +  +  R      N+ + N         F G V
Sbjct: 73  QHKNE--CQLLSKI-------TKEMPCNILILIRLF--QNNIDIQN---------FYGDV 112

Query: 147 ---TDNDTSA-----AHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRR 198
               D +T       A Y+  +      T  +L  ++  K  LN F + +  I G     
Sbjct: 113 DKDIDQETYQQVFDCAAYIVKIAQLENETFAKL-LSIQVKIHLNTFTVTD--ING----- 164

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE--NNTDIIVRMIHDVPQG----REICL 252
                 IY  A+F NH C+  +     V   +   N   +I+   +   Q     +E+C+
Sbjct: 165 DTLGIAIYTPANFLNHKCIKTSANQKNVANCSHFFNQRQLIITTNNSFVQNENDPQELCI 224

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
           SY  +  ++  RQK L   Y F CDCDRC  E   S+ ++D
Sbjct: 225 SYGNI-VNFKDRQKFLKAQYGFICDCDRCIQEQTNSEEESD 264


>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
 gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
          Length = 302

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 62/269 (23%), Positives = 104/269 (38%), Gaps = 24/269 (8%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GRG+ + ++++ G+ + R+  +L+        +   C +C+ +    S      CS    
Sbjct: 51  GRGVFAARNIEKGEELFREHTLLVGPTAHRGRNLRTCIHCYHQPPGESEE-SALCSAGCG 109

Query: 77  SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQF 136
            P C   A S  H+   CQ    L     P   ++   R L+   +++      ++  Q 
Sbjct: 110 LPVCPDCAPSERHA-VECQ----LFRKWQPKHLSIIEPRALR---ILSVVRCFFLSEPQR 161

Query: 137 QILLAFQGTVTDNDTSAAHYLSSLCP---PPASTTIELTAALLAKDRLNAFGLMEPYIEG 193
           Q+L A Q   TD         ++ C    P     +E     +     NAF     +I+G
Sbjct: 162 QLLYAMQAN-TDRYYMREVQRAAECFEHFPRDPDMLEYFYRTVCAFNTNAFE-SRCHIDG 219

Query: 194 QDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLS 253
           Q+    V    ++  A   NH C PNA          E+   I+V     +P G EI +S
Sbjct: 220 QE----VVTRALFPLAGLLNHQCTPNAAHH------FEDGETIVVTATERIPAGAEITMS 269

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           Y  + +    R+  L     F C C RC+
Sbjct: 270 YTKLLWSTLARKIFLGMTKHFMCQCPRCQ 298


>gi|167342116|gb|ABZ69006.1| SET domain-containing protein C [Pinus taeda]
          Length = 141

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 37/81 (45%), Gaps = 8/81 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L  S  NH CLPN+ R        E  + + +     + +G EI   YF +      
Sbjct: 12  GLWLLPSIINHSCLPNSSRL-------EMGSAMFIHACKPIKRGEEITFPYFDILLPLPQ 64

Query: 264 RQKRLLYDYDFACDCDRCKVE 284
           RQ R   ++ F C C RC VE
Sbjct: 65  RQGR-CENWGFECKCRRCIVE 84


>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 450

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 39/83 (46%), Gaps = 4/83 (4%)

Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           A+G++ L +  FNH CLPNA     +   A    +I+   + D+  G EIC+ Y      
Sbjct: 193 AHGVFPLASRLFNHSCLPNAAARYILSEDAAPRMEIVA--LQDIGAGEEICVPYLDPAL- 249

Query: 261 YSTRQKRLLYDYDFACDCDRCKV 283
             +RQ+     Y F C C  C V
Sbjct: 250 LQSRQQGFQLTYGFTCTCPSCSV 272


>gi|350295933|gb|EGZ76910.1| hypothetical protein NEUTE2DRAFT_153682 [Neurospora tetrasperma
           FGSC 2509]
          Length = 724

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  +S  NH C+PN  R    D        +I R   DV +G E+   Y PV    
Sbjct: 468 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGEELFQQYVPVKTLL 520

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
             R K     + F C C  C+ E   S+ 
Sbjct: 521 DVRNKEFEEGWGFECRCGLCEGERKSSEE 549


>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
 gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
          Length = 680

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 57/120 (47%), Gaps = 20/120 (16%)

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYV 226
           IEL + +L+    N   ++ P         + R+YG  I+   S+ NH C PNA  F Y 
Sbjct: 385 IELYSIVLS----NGHEMLHPL--------TCRSYGMGIFPTGSYLNHSCSPNA--FWYN 430

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           D        ++ R +  + +G E+  SY  V      R+K L+  Y F C C++C+ ++N
Sbjct: 431 DEQGM----MVFRSLRPLKKGEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQCQYQSN 486


>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
          Length = 419

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271

Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P     +++    L KD           IE   G   +    G+++  S  NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 376 CSCPKCLAEAD 386


>gi|261332900|emb|CBH15895.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 457

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 5/97 (5%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R  GIY      NH C PN      V   A  +  + +  + D+  G E+ +SY      
Sbjct: 365 RGQGIYTIGCLLNHSCEPNL----QVLYTAVGDETLSIEALRDIEPGEELNISYVDETLP 420

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA-NWSDNDNDDEN 296
           Y  RQ  L   Y F C C +C  EA +W    N  E 
Sbjct: 421 YPQRQLILYEHYFFICKCPKCTREAPDWERQVNGSEK 457


>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
 gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
 gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
 gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
          Length = 418

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCTVRQDLHQSCPHCRVTYCSAECRLAAAEQYHQ 133


>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           paniscus]
          Length = 420

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 379 PKCLAEAD 386


>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
          Length = 159

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 10/114 (8%)

Query: 178 KDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           +++L+AF   +   IE   G   +    G+++  S  NH C+PNA       +  ENN  
Sbjct: 20  REQLDAFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA-----ETSFPENNFL 74

Query: 236 IIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           + V  + D+  G EIC+SY           +R K L  +Y F C C +C  EA+
Sbjct: 75  LHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKCLAEAD 128


>gi|336272855|ref|XP_003351183.1| hypothetical protein SMAC_08199 [Sordaria macrospora k-hell]
 gi|380087872|emb|CCC14032.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 320

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 15/109 (13%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
            LA  + N  GL  P I G          G++++AS  NH C PN       ++  E  +
Sbjct: 88  FLAISKTNMLGLGSPPIGG----------GLFIEASRINHACKPNTQ-----NSWNERIS 132

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKV 283
              +  + D+ +G EI +SY      Y  RQ  L   + F C C+ C +
Sbjct: 133 RETIHAVRDIKKGEEITISYMGHFASYDERQAFLKDKFKFDCACEVCSL 181


>gi|71747970|ref|XP_823040.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832708|gb|EAN78212.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 457

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 39/97 (40%), Gaps = 5/97 (5%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R  GIY      NH C PN      V   A  +  + +  + D+  G E+ +SY      
Sbjct: 365 RGQGIYTIGCLLNHSCEPNL----QVLYTAVGDETLSIEALRDIEPGEELNISYVDETLP 420

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEA-NWSDNDNDDEN 296
           Y  RQ  L   Y F C C +C  EA +W    N  E 
Sbjct: 421 YPQRQLILYEHYFFICKCPKCTREAPDWERQVNGSEK 457


>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           terrestris]
          Length = 386

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
           G+Y+  S  NH C+PNA     V+    NN  ++++ I D+    EIC+SY     +   
Sbjct: 289 GLYIFQSSINHSCVPNA----VVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +RQK L   Y F C C+RC  + N
Sbjct: 344 RHSRQKVLSSLYLFHCYCNRCLSQVN 369


>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
          Length = 517

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 113/284 (39%), Gaps = 38/284 (13%)

Query: 15  GR-GRGLVSTQSLKAGQIVLRDSPILLYSALPFI-NSSSFCHNCFRK-TMHSSSSICPSC 71
           GR GR LV+ +++ AG++++R+ PI   +  P        C  C R  +      +C  C
Sbjct: 13  GRLGRYLVAAKNIAAGEVIIREEPI---AVGPMTYRKDRLCFACLRSLSKIGRQYVCSRC 69

Query: 72  SVA-FCSPKCSTAAASSSHSPYVCQALTR---LLNLNSPDAANLPLDRQLQARFLIAAYN 127
           ++A  CS  C T   +  H+   C  L     LL  N  DA  + L  +L   +LI   N
Sbjct: 70  NLAPLCSVACET--RTKHHTSDECLILGENRDLLAKNGIDATGILLALRL---WLIKCRN 124

Query: 128 LAVVNPSQFQ--ILLAFQGT--VTDNDTSAAHYLSSL-----CPPPASTTIELTAALLAK 178
            AV          L    GT    D + +  + +  L       PP++   EL   L   
Sbjct: 125 PAVWEQVDHMEDHLHERMGTPVWKDREVNVVNVIRMLRVPDGIEPPSA---ELMQKLCGI 181

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
             +N F L  P     DG   +   G+Y++A+   HDC  N           +++  + V
Sbjct: 182 LDVNTFELRSP--GALDG---LLLRGLYVEAALMAHDCRGNT------HLTVDDDFRLTV 230

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
                + +   I  +Y       + R++ L     F C+CD CK
Sbjct: 231 YASVAIGENEPILFNYTSSLLGTADRREHLREGKYFECECDLCK 274


>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
           harrisii]
          Length = 609

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 498 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLPVTALEDIKPGEEICISYLDCCQRERS 552

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA 
Sbjct: 553 RYSRHKILRENYLFVCSCPKCLAEAE 578


>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
           troglodytes]
          Length = 415

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 80/188 (42%), Gaps = 26/188 (13%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAFQGTVTDN--DTSAAHYLSSLCPP 163
             L L R+L   F  A Y  AV     P  F+ L A  GT       +S + ++      
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVH----- 266

Query: 164 PASTTIELTAALLAKDRLNAF-GLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLPNAC 221
            A  T+EL      +++L+ F   +   IE   G   +    G+++  S  NH C+PNA 
Sbjct: 267 -ACDTLELKPQ--DREQLDTFIDQLYKDIEAATGEFLNCEGSGLFVLQSCCNHSCVPNA- 322

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFACDC 278
                 +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F C C
Sbjct: 323 ----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFVCSC 378

Query: 279 DRCKVEAN 286
            +C  EA+
Sbjct: 379 PKCLAEAD 386


>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
           mulatta]
 gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
 gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
          Length = 418

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271

Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P     +++    L KD           IE   G   +    G+++  S  NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 376 CSCPKCLAEAD 386


>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
          Length = 418

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCTVRQDLHQSCPHCRVTYCSAECRLAAAEQYHQ 133


>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
          Length = 421

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|85114970|ref|XP_964786.1| hypothetical protein NCU00870 [Neurospora crassa OR74A]
 gi|28926580|gb|EAA35550.1| predicted protein [Neurospora crassa OR74A]
          Length = 724

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  +S  NH C+PN  R    D        +I R   DV +G E+   Y PV    
Sbjct: 468 AKGLWAHSSLMNHSCVPNTMRSFVGDM-------LICRATRDVQEGEELFQQYVPVKTLV 520

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDN 290
             R K     + F C C  C+ E   S+ 
Sbjct: 521 DVRNKEFEEGWGFECRCGLCEGERKSSEE 549


>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Nomascus leucogenys]
          Length = 418

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 70/311 (22%), Positives = 113/311 (36%), Gaps = 50/311 (16%)

Query: 6   ETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSS 65
           +T  V E    GR +++++ LKAG++VLR+ P +    +    +S  C  C ++      
Sbjct: 72  DTFKVTEDSVSGRCVIASRDLKAGELVLREPPFV--KVVRRDCASRQCAYCCQQVTERGK 129

Query: 66  SICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
                    +CS  C   A   +       AL +L  +++    ++ L R L    +  A
Sbjct: 130 IEADVPFAVYCSRACQ--AREDALRAAEASALGKLAGISAARDVDIDLLRMLLRLLITRA 187

Query: 126 YNLAVVNPS---------------------------QFQILLAF----QGTVTDNDT--- 151
             L +  PS                           Q++ L A     +    D  +   
Sbjct: 188 KALGLREPSGDSDSVSRGVDEEGEDGTMGEGLFLRQQWENLYALMHHREAMAPDWISVVR 247

Query: 152 SAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASF 211
            A   L  L P      +E    L  +  +NA GL +       G   V   G++   + 
Sbjct: 248 EAGEDLLQLLPEWVRFDVEEVVQLACRVNVNAHGLRD-----DSGANLVIGVGMFPLTAM 302

Query: 212 FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYD 271
            NH C PN C F Y     E      VR +  V  G E+ + Y  +    + R++ LL  
Sbjct: 303 INHACRPN-CTFVYFGGNLE------VRTLEPVSAGAELSVYYIDLLQSTAARRQELLTS 355

Query: 272 YDFACDCDRCK 282
             F C C RC+
Sbjct: 356 KHFLCKCSRCE 366


>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
 gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 386

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 60/325 (18%), Positives = 125/325 (38%), Gaps = 59/325 (18%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRK--TMHSSSSICPSC 71
           E  GR L++T+ ++ G+ +L+       ++     +S  C NC ++  ++   S  C  C
Sbjct: 14  ENEGRYLIATRDIQIGEDLLKCKSYFAVTSETLKTTS--CFNCIKQLPSVIKLSLKCNQC 71

Query: 72  S-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA----Y 126
           + + +C+ +C     +  H  Y C+   +L +       N  ++   + R ++      Y
Sbjct: 72  NEIWYCNEQCKNENINK-HQHYECKFYKKLKSPKLKVYPNFDIETFTEIRMIVGLLSRYY 130

Query: 127 NLAVVNPSQFQILL-----------AFQGTVTDNDTSAAHYLSSLCPPPASTTIE----- 170
              ++N    +  L               T+ D      + ++    P A   I+     
Sbjct: 131 QDILLNNKFIEQQLNNNNNNNNDNEQLTNTLDDVFDLVENQVTEESNPAAKERIDSIVEF 190

Query: 171 --------LTAALLAKDRLNAFGLMEPYIEGQDGRRSV------RAYGIYLK-------- 208
                   L  +   K  +N    +E   +  +  RS+        +GI+ K        
Sbjct: 191 ISELFNLVLLGSTTTKSIINNDDKIEMIRKINEKSRSIIHKTRCNQFGIWTKNDKCIGVA 250

Query: 209 ----ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
               +S+FNH C+PN            + +++  + ++ + +G ++ +SY  ++     R
Sbjct: 251 VSPSSSYFNHSCIPNC-------TDVRDGSNMTFKSLYPIKKGDQLTISYIELDQPIQDR 303

Query: 265 QKRLLYDYDFACDCDRCKVEANWSD 289
           +  L Y Y F C C RC  ++N  D
Sbjct: 304 KDELKYGYYFDCICPRCNGDSNSID 328


>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
          Length = 407

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 204 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 260

Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P     +++    L KD           IE   G   +    G+++  S  NH C+P
Sbjct: 261 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 309

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 310 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 364

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 365 CSCPKCLAEAD 375


>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
          Length = 418

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 76/191 (39%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271

Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P     +++    L KD           IE   G   +    G+++  S  NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGLFVLQSCCNHSCVP 320

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 376 CSCPKCLAEAD 386


>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
          Length = 419

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386


>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 5 [Callithrix jacchus]
          Length = 431

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 322 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 376

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 377 RHSRHKILRENYLFVCSCPKCLAEAD 402


>gi|440487007|gb|ELQ66819.1| hypothetical protein OOW_P131scaffold00351g3 [Magnaporthe oryzae
           P131]
          Length = 782

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 7/109 (6%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  A+  NH C+ +  R    D         I R + D+  G E+   Y PV  D 
Sbjct: 499 ARGLWPLAAAMNHACVASTVRAFVGDV-------FITRALRDIEPGEELTQQYVPVRADV 551

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
             RQ +    + F C C  C  E         D      +   Q E+MV
Sbjct: 552 GARQGQYGQWWGFECGCVLCAAEKGREAGGGGDMVRRRRLLLKQIEKMV 600


>gi|347976375|ref|XP_003437517.1| unnamed protein product [Podospora anserina S mat+]
 gi|170940375|emb|CAP65602.1| unnamed protein product [Podospora anserina S mat+]
          Length = 715

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 23/135 (17%)

Query: 164 PASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR--------RSVRAYGIYLKASFFNHD 215
           P   +  LT  LL    LN+FG   P   G   R            + G++  AS  NH 
Sbjct: 428 PVIDSFLLTRILL----LNSFGCKFPEKAGALKRPVGIDRLGNLFPSLGLWTFASHMNHS 483

Query: 216 CLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQKRLLYDYDF 274
           CL NA R       + N   +IVR I D+P   EI + Y  P N  +   Q++ + ++ F
Sbjct: 484 CLSNAFR-------SFNGDMMIVRAIADIPVDAEITIQYLNPFNPQH---QQKYIINWAF 533

Query: 275 ACDCDRCKVEANWSD 289
            CDC  C+ + + SD
Sbjct: 534 VCDCAMCEDDDHTSD 548


>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
           caballus]
          Length = 417

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.9 bits (89), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/106 (24%), Positives = 42/106 (39%), Gaps = 33/106 (31%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCF------------- 57
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C                 
Sbjct: 31  AKGKGLFATQLIQKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 58  -------------RKTMHSSSSICPSCSVAFCSPKCSTAAASSSHS 90
                        RK +H S   CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCTVRKDLHQS---CPHCQVMYCSTECRVAAAEQYHQ 133


>gi|389631385|ref|XP_003713345.1| hypothetical protein MGG_10443 [Magnaporthe oryzae 70-15]
 gi|351645678|gb|EHA53538.1| hypothetical protein MGG_10443 [Magnaporthe oryzae 70-15]
          Length = 770

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 7/109 (6%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G++  A+  NH C+ +  R    D         I R + D+  G E+   Y PV  D 
Sbjct: 487 ARGLWPLAAAMNHACVASTVRAFVGDV-------FITRALRDIEPGEELTQQYVPVRADV 539

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
             RQ +    + F C C  C  E         D      +   Q E+MV
Sbjct: 540 GARQGQYGQWWGFECGCVLCAAEKGREAGGGGDMVRRRRLLLKQIEKMV 588


>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
          Length = 513

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 61/296 (20%), Positives = 110/296 (37%), Gaps = 42/296 (14%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT--MH 62
           S T  + + +  GR +V+ + L+AG+ ++ + P ++    P   +   C +C+       
Sbjct: 16  STTYKILQNDKVGRYMVANKELEAGEEIITEMPFIVG---PKAATYPLCLSCYVSWPPTL 72

Query: 63  SSSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL-------LNLNSPDAANLP-LD 114
           +   +C  C    CS +C        H  Y CQ   +          L   +   +P L+
Sbjct: 73  NEKPLCSKCGWPVCSTECEN---QPQHKDYECQIFAQANEKFNVQAALEETNENGVPQLE 129

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
                R L+ +      NP +++  +         D  A + + S      S  I +   
Sbjct: 130 CITPLRLLLESEK----NPERWKNEV--------KDMEAHNKIRSQKKQWKSDQINIVEY 177

Query: 175 LLAKDRLNAF---------GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDY 225
           +  + +L  F         G++E         +   A  IY   +  NH C+ N C    
Sbjct: 178 IRKQLKLERFSEEQIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCH--- 234

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +  +  I +R    +P G E+  SY         R++ LL   +FAC C RC
Sbjct: 235 --SISPTDYKIRLRTTLKIPVGGELYGSYTHSLLPTMLRREHLLEGKNFACACARC 288


>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
          Length = 416

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVEQPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPELCAVRKDLHQHCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
          Length = 417

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 23/103 (22%), Positives = 41/103 (39%), Gaps = 27/103 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVEQPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSHS 90
                               CP C V +CS +C  AAA   H 
Sbjct: 91  PGQVLPHPDLCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
          Length = 391

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/145 (22%), Positives = 52/145 (35%), Gaps = 46/145 (31%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y + S+FNH C PN  + +          ++++  + D+ QG E+  +Y  ++     
Sbjct: 273 GLYDQCSYFNHSCQPNIFKVNQTPGG-----ELVMVALRDIEQGEELFYNYIQISMSGEA 327

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDTN 323
           R K+L   Y F C C  CK                                        N
Sbjct: 328 RIKKLKESYFFNCQCPGCK----------------------------------------N 347

Query: 324 FP-HAYFFMRYMCDRDNCWGTLAPL 347
            P H  F  +Y+C    C G + PL
Sbjct: 348 APSHKQFLDKYLCKVKQCNGVITPL 372



 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 56/108 (51%), Gaps = 12/108 (11%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN--SSSFCHNCF----RKTM 61
           +T+ + + RGR  V++  +KAG +++R+ PI+    + +++   ++FCH C     +   
Sbjct: 2   ITINQYKDRGRCFVASVPIKAGTVIVRELPII--QTVNYMDPLQNAFCHQCLTYHHQHQQ 59

Query: 62  HSSSSI-CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPD 107
           HSS  + C  C  V +C+  C  A+  +      C + +RL + N  D
Sbjct: 60  HSSPLLHCSKCKLVWYCNRDC--ASKDADQHELECPSYSRLHSTNLFD 105


>gi|310799799|gb|EFQ34692.1| hypothetical protein GLRG_09836 [Glomerella graminicola M1.001]
          Length = 430

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 49/121 (40%), Gaps = 17/121 (14%)

Query: 203 YGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           +G Y + S FNHDC PN     Y+D   +++       + D+  G E+ +SY        
Sbjct: 241 FGNYPEVSRFNHDCRPNVAF--YID---DSDLRHYTHAVRDIQAGEELTISYVDSLSARV 295

Query: 263 TRQKRLLYDYDFACDCDRC------------KVEANWSDNDNDDENNEEVMDEDQDEQMV 310
            RQ R   ++ F C C  C            ++   W         N E +DED  E++V
Sbjct: 296 VRQDRARRNWGFGCGCAHCSLPEPFVRASDQRLVRMWEIEQQLSHWNGEAVDEDMIERLV 355

Query: 311 A 311
            
Sbjct: 356 G 356


>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
 gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
          Length = 417

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIEPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.1 bits (87), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFIERPLVASQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSH 89
                               CP C V +CS +C  AAA   H
Sbjct: 91  PGQVLPHPELCSVRKDLHQNCPRCQVTYCSAECRLAAAEQYH 132


>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
          Length = 412

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 300 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 354

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 355 RHSRHKILRENYLFVCSCPKCLAEAD 380


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 70/305 (22%), Positives = 113/305 (37%), Gaps = 41/305 (13%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           +  T+ + E    GRGL    +++ G  +L   P +   + P++  SS C +C       
Sbjct: 30  IPPTLDLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYL--SSHCSSCAAPASPE 87

Query: 64  SSSI--CPSCSVA-FCSPKCSTAAASSSHSPYVCQALTRLL------NLNSPDAANLPLD 114
              +  C  C V  +CS KC  A   + H P  C AL R        +L  P+ A   L 
Sbjct: 88  RPQLKRCAKCHVVHYCSQKCQNADWKA-HKPE-CDALQRWATAAPSPDLAVPEEAVRCLG 145

Query: 115 RQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLS-------SLCPPPAST 167
           R L  +      N + +   +   + + + ++      +  YL+        L  P    
Sbjct: 146 RMLWQK----QRNPSSIWSREIDSMQSHRSSLRPESFESHAYLAHALVRYLGLDAPEKLA 201

Query: 168 TIELTAA-----LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNA 220
           +  L +A     L+++   NA  L  P         S+   G+ +    +  NH C PNA
Sbjct: 202 SFGLRSAGDVVDLISRFTTNAITLAAP---------SLTPLGVSVSPAVALVNHSCAPNA 252

Query: 221 CR-FDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCD 279
              F  V    +    + V  + D+    E+  +Y         RQK L   Y F C C 
Sbjct: 253 VVVFPRVSKTVDQEPVMQVIALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCS 312

Query: 280 RCKVE 284
            C  E
Sbjct: 313 LCAAE 317


>gi|367019312|ref|XP_003658941.1| hypothetical protein MYCTH_62625, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347006208|gb|AEO53696.1| hypothetical protein MYCTH_62625, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 735

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 42/95 (44%), Gaps = 8/95 (8%)

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           +G   RR + A G+++ +S  NH C+PN  R    D        +I R   D+  G E+ 
Sbjct: 478 DGSPARRGM-AKGLWVHSSRVNHSCVPNTMRSFVGDM-------LISRAARDIKAGEELF 529

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
             Y PV      R  +L   + F C C+ C  E+ 
Sbjct: 530 QQYVPVRTVVDVRNAQLRESWGFECACELCARESR 564


>gi|226293820|gb|EEH49240.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 398

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 3/89 (3%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD---IIVRMIHDVPQGREICLSYF 255
           +VR   + L +++ NH CLPNA    +     ++  +   + V     V QG EI + Y 
Sbjct: 66  AVRGRVLGLSSAYLNHACLPNAQHTLFTMKCNDDREERYFLTVYACKKVAQGEEITIPYD 125

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            +  D ++RQ+ LL +Y F C C  C  E
Sbjct: 126 VLYMDRASRQQFLLQEYGFECACKLCGKE 154


>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
 gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
          Length = 546

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 56/276 (20%), Positives = 103/276 (37%), Gaps = 30/276 (10%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
           E  GR +++T+ +KAG +VL++SP L+Y      N    C  C +    +    C  C  
Sbjct: 71  ERYGRYMMATKDIKAGDVVLKESP-LVYGPAQITNP--VCVGCLQGLTENQFLECERCGW 127

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTR--LLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
             C  +C    +  +   +     ++  L +  +P           Q    +    LA  
Sbjct: 128 PVCKRECQDHPSHKAECKFTIARGSKMQLRHFYNPHPV-------YQCLIPLRCLLLAER 180

Query: 132 NPSQFQILLAFQGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGL 186
           +P+++Q L+  +        S         ++ L P       +        + L   G+
Sbjct: 181 DPAKWQALIKLESHEEQRRGSEQWRNDREGVAKLIP----RFFKCENKWSEDEILKIVGI 236

Query: 187 MEPYIEGQD-GRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVP 245
           ++  + G +       +  IY  AS   H C PN      +  +  +  +I+    + + 
Sbjct: 237 LQ--VNGHEVPLTEPPSVAIYNNASMLEHSCRPN------LSKSFTSKKEIVFWAPNPIK 288

Query: 246 QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           QG  + +SY  V +  + RQ  L     F C C RC
Sbjct: 289 QGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRC 324


>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
          Length = 704

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 77/335 (22%), Positives = 126/335 (37%), Gaps = 57/335 (17%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKT 60
           +++ S+ + +   E +GR L +T+ +K G +++ D P    + +  +  +  C +   K 
Sbjct: 271 LTSCSDALALGSDEAKGRKLYATRDIKPGTVLIVDRPFASTTDVAALCRNCLCCHASLKL 330

Query: 61  MHSSSSICPSC-SVAFCSPKCSTAAASSSHSPYVC-----------QALTRLLNLNSPDA 108
             S    C  C +V FCS  C   A +  H  Y C           Q    LL   +  A
Sbjct: 331 PESIRVPCEHCQAVQFCSEDCRRRAWNKFHK-YECRIFDYFYAKQAQKTNCLLAYRAVVA 389

Query: 109 ANLPLDRQ--LQARFLIAAYN---------LAVVNPSQFQILLAFQGTVTDNDTS----- 152
           A L  +++  L   FL    N         LA      ++ + + +    D D +     
Sbjct: 390 AALRNEKEFSLDQEFLKLHENDKDSLDIASLASYESRDYKTVFSLESHCADADPADNLQR 449

Query: 153 AAH---YLSSLCPP-----PASTTIELTAALLAKDRL--------NAFGLMEPYIEGQDG 196
           A H   +  SL        P     E    +LA   L        NA+ ++E   + +  
Sbjct: 450 AIHSVFFAKSLIYSLNYFHPEIDNHERHLHILAVALLHNMQAIKCNAYEIVENVRDDETK 509

Query: 197 RRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREI--CL 252
               R  G  IY   S  NH C PN  R  +       N  ++V  +  +P+G EI  C 
Sbjct: 510 ILEPRNVGGAIYTTVSLTNHSCYPNIVRHSF------PNGTVVVTSLRYIPEGSEILDCY 563

Query: 253 SYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
               +     +R++ L   Y F C C+ C   A+W
Sbjct: 564 GQHFLENKRDSRRRLLAEKYYFDCQCEPCL--ADW 596


>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Acyrthosiphon pisum]
          Length = 387

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 81/179 (45%), Gaps = 20/179 (11%)

Query: 131 VNPSQFQILLAFQGTVTDN-DTSAAHYLSSLCPPPAST-TIELTAALLAKDRLNAFGLME 188
           + P+ F+ LLA  GT      TS  H  ++ C    S   +EL    +     +A+  ME
Sbjct: 222 LTPTGFKSLLALVGTNGQGVGTSVFHQWANNCKKHLSPDQMELFEQFID----SAYDAME 277

Query: 189 PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
             + G D   +     ++   S  NH C PNA       +  + N  + +  I  +  G 
Sbjct: 278 QVV-GVDFLDN-EGSALFQNHSSINHSCFPNAA------SVFDGNHVLRLVAIRMIEPGD 329

Query: 249 EICLSYF-PVNYDYS--TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
           EI +SY  P   D+S  TRQK L  +Y F C C +C+ + N  D  +DD++    M+ED
Sbjct: 330 EINISYIAPCELDHSRHTRQKYLQENYVFTCRCIKCEEQINDPDITSDDDSE---MEED 385


>gi|113205415|gb|AAU90317.2| TPR domain containing protein, putative [Solanum demissum]
          Length = 522

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 20/92 (21%)

Query: 200 VRAYGIYLKASFFNHDCLPNACR---FDYVDAAAENNTDIIVRMIH---DVPQGREICLS 253
           V   G+++ +SF NH C PN  R    DYV             MIH   D+  G E+  +
Sbjct: 304 VHGIGLWILSSFINHSCDPNVRRSHVGDYV-------------MIHACRDIKAGEELTFA 350

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           YF V   +  R+++    + F C C RC +E 
Sbjct: 351 YFDVFSPFRDREEK-TKSWGFVCKCKRCNLEK 381


>gi|71664708|ref|XP_819332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70884628|gb|EAN97481.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 382

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 90/245 (36%), Gaps = 40/245 (16%)

Query: 67  ICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLN---SPDAANLPLDRQLQARFLI 123
           I P+C   +CS  C T A S  HS   C    + L L+      A+    +R   A  L+
Sbjct: 161 INPNCHEEYCSRDCRTMALSLYHSRSCCNEGLQALELDMYTQMKASGTSTERNAAAAQLL 220

Query: 124 AAYNLAV-----VNPSQFQILLAFQGTVTDNDTSAAH-------YLSSLCPPPASTTIEL 171
               LAV     V PS    L    G +  +    AH         +  C    S + E 
Sbjct: 221 MLRVLAVSVQQQVIPSALSELRILSGRLLFSPRVLAHSFLEIYERFTRFCHTATSISYEE 280

Query: 172 TAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE 231
              +LA+   N F   +  +E            + L  S  NH C+ N         AAE
Sbjct: 281 MIGVLARVTANCFH-RDTCVE------------LNLSRSMLNHSCVAN---------AAE 318

Query: 232 NNTDIIVRMIHDVPQGREICLSYFP--VNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
                 ++   DV +G E+ ++Y+P   +  +  R K  L    F C C +C+ E     
Sbjct: 319 ELHSGEIKTTRDVARGEELTINYYPQLKHLSFEERSKE-LERRGFECHCSQCRREKGVER 377

Query: 290 NDNDD 294
            D ++
Sbjct: 378 RDLNE 382


>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
          Length = 687

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 77/334 (23%), Positives = 129/334 (38%), Gaps = 91/334 (27%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
           +GRG+ S   +    ++  + P+    A+  + S   + CH CF +   +   +CPSC++
Sbjct: 121 KGRGMSSPNDISPASLIHAEDPL----AVIIMKSCRDTHCHYCFSEAP-ADVVVCPSCTI 175

Query: 74  -AFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARF 121
             +CS +C   A         C   T L  N N  D A L         P  +Q+ + R 
Sbjct: 176 PIYCSNRCQEKAIGQ----MSCNQNTHLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRH 231

Query: 122 LIAAYNLAVVNPSQF-------------QILLAFQGTVT----------DNDTSAAHYLS 158
                  A V P+               Q+L+  + T++          D D+SA+ + S
Sbjct: 232 ECGGACWAAVLPADIVLAGRIMAQYIEKQLLVGKRSTISGPNLDLVHHYDQDSSASKFES 291

Query: 159 S-------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR-- 201
                   LC           A  ++     L+ + ++N+  ++  +++  DG +++   
Sbjct: 292 HIYATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGVKALTKG 349

Query: 202 -----------------AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                            A  IY+  SFFNH C PN      + A   + T +I+R    +
Sbjct: 350 FSGFSGDVMCSVEQVRVAQAIYMSGSFFNHSCRPN------IHAYFHSRT-LILRSTEYI 402

Query: 245 PQGREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
             G  I LSY P     D   RQK L  +Y F+C
Sbjct: 403 KAGSPIELSYGPQVGEMDLPERQKSLRENYYFSC 436


>gi|410082399|ref|XP_003958778.1| hypothetical protein KAFR_0H02340 [Kazachstania africana CBS 2517]
 gi|372465367|emb|CCF59643.1| hypothetical protein KAFR_0H02340 [Kazachstania africana CBS 2517]
          Length = 371

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 84/198 (42%), Gaps = 32/198 (16%)

Query: 92  YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDT 151
           +VC +L RL N  S D+  +   + LQ+          +   S+F ILL FQ +V +   
Sbjct: 197 FVCSSLYRLSNF-SEDSLTMKSFQHLQSN--------ELEKISKFPILLHFQESVFNT-- 245

Query: 152 SAAHYLSSLCPPPASTTIELTA--ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYL-- 207
                L  L P      +   +   +L  +  N+FGL +      +   S   YG ++  
Sbjct: 246 -----LYILLPNNLKCHLSTDSFRHILGSEYANSFGLWQE----DESLESRECYGYWVLP 296

Query: 208 KASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF-PVNYDYSTRQK 266
           +AS+FNH C PN  +         N   +   +  DV  G E+ ++Y   +N   + R++
Sbjct: 297 RASYFNHSCDPNITK-------TRNGRIMDFILNRDVKVGEELSINYAGTLNLPLTERRE 349

Query: 267 RLLYDYDFACDCDRCKVE 284
            +   + F C C +C  E
Sbjct: 350 FMRNGWFFDCGCQKCLKE 367


>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
           porcellus]
          Length = 416

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDISPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.5 bits (88), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSH 89
                               CP C V +CS +C  AAA   H
Sbjct: 91  PGQVLPHPELCTVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 132


>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
           africana]
          Length = 417

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386


>gi|328853242|gb|EGG02382.1| hypothetical protein MELLADRAFT_110239 [Melampsora larici-populina
           98AG31]
          Length = 561

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 21/122 (17%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPN-ACRFDYVDAAAENNTDIIVRM 240
           NAFG          GR  V  Y +  + S FNH+C PN A  FD      +    +    
Sbjct: 345 NAFG-------SHRGREEVLHYAVVPEPSVFNHECRPNSAFYFD------DKTLRVYAHA 391

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN--DDENNE 298
           + D+  G EI ++Y  +    + RQ  + + Y F+C C  C +    SD ++   D+  E
Sbjct: 392 VRDIALGEEITIAYRAMKASRAGRQAAIAH-YGFSCTCSHCSM----SDEESRASDQRIE 446

Query: 299 EV 300
           E+
Sbjct: 447 EI 448


>gi|239606334|gb|EEQ83321.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3]
          Length = 447

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
           RA G  L +++ NH CLPNA        + D        +IV     + +G EI + Y  
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +  D   RQ+ LL +Y F C C  C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197


>gi|224010509|ref|XP_002294212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970229|gb|EED88567.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 516

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 41/91 (45%), Gaps = 7/91 (7%)

Query: 201 RAYGIYLKASFFNHDCLPNACR-FDYVDAAAENNTDI------IVRMIHDVPQGREICLS 253
           R  G+Y  A+  NH C PNA R F  + A  +N   +      IV     + +G EI  S
Sbjct: 241 RLLGLYPLAAMINHSCCPNAVRVFGTLPAINDNMHRVQGREVMIVHATAKISKGDEITWS 300

Query: 254 YFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           Y P    ++ R+  L   + F C C RC  E
Sbjct: 301 YLPPCSPFNERRAVLSSKFGFTCQCVRCIKE 331


>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
           bisporus H97]
          Length = 674

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 67/304 (22%), Positives = 115/304 (37%), Gaps = 51/304 (16%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
           E RGRG+ S +  + G+I+  +  +   +AL     ++ C  CF +        CP+C +
Sbjct: 95  ETRGRGIWSREKRRRGEIIFTE--VQQSAALSTGLLATHCSACFVEASGVPLKRCPTCRI 152

Query: 74  A-FCSPKCST--------------AAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQ 118
             +C  +C +                A+S  SP    +  +      P  A   L R L 
Sbjct: 153 IHYCDSECQSRDWTLHKRECPALQKWATSVPSPSPETSEDKKQGKVIPSDAIRALGRMLW 212

Query: 119 ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSA---AHYLSSLCP--------PPAST 167
            +       L+     Q   + + +  ++ N+ S+    H + +L            +S 
Sbjct: 213 RK---QKKGLSGDWAKQVDTMQSHRARLSSNEKSSQLHTHMVHALVRYLGLESLDEFSSY 269

Query: 168 TIELTAALL---AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNAC- 221
            I+  A L+   ++   NAF +  P         ++   G+ +    + FNH C PNA  
Sbjct: 270 GIDSIAGLVDLVSRFTTNAFTVASP---------TLTPIGVSISPIIALFNHSCAPNAVP 320

Query: 222 ---RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
              R  +   A E    +I   + D+P   E+ +SY         RQK L   Y F C C
Sbjct: 321 VFPRAPHNAKANEPMASVIT--LRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSC 378

Query: 279 DRCK 282
             C+
Sbjct: 379 SLCE 382


>gi|374105643|gb|AEY94554.1| FAAL097Cp [Ashbya gossypii FDAG1]
          Length = 368

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 89/224 (39%), Gaps = 41/224 (18%)

Query: 77  SPKCSTAAASSSH-------------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
           +PK S    S  H             + +V + L RL +L +        D QLQ+  L 
Sbjct: 165 TPKLSAMKPSKRHLMIPAIDEEVYSCARFVTECLFRLKHLPNESTMYQAFD-QLQSNEL- 222

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--LTAALLAKDRL 181
                     ++F +LL FQ  V         +L  + P      +   L   +L  +  
Sbjct: 223 -------QKITEFPVLLEFQLKVYK-------FLKVVLPSGLRDLLSSALFRKILGSEYG 268

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N+FG+ E   EG   R  +  Y +  +AS+FNH C PN        A      ++   + 
Sbjct: 269 NSFGIWEQN-EGNYSRDYL-GYWVLPEASYFNHSCAPNL-------AKKRVGREMYFVLT 319

Query: 242 HDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
            DV  G ++C+ Y  + +     R+  L  ++ F C C+RC +E
Sbjct: 320 SDVAAGEQLCIDYKGILDLPVVERRNILHSNWFFDCACERCALE 363


>gi|261190070|ref|XP_002621445.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
 gi|239591273|gb|EEQ73854.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081]
          Length = 445

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
           RA G  L +++ NH CLPNA        + D        +IV     + +G EI + Y  
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +  D   RQ+ LL +Y F C C  C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197


>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
 gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
          Length = 515

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 60/293 (20%), Positives = 112/293 (38%), Gaps = 57/293 (19%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           +A+ E  GR LV+T+++K  +IVL+++P++   A         C NC     H S   C 
Sbjct: 45  IAKNEVLGRHLVATRNIKPYEIVLKEAPLMRGPAQISAPVCMGCLNCIEANDHIS---CE 101

Query: 70  SCSVAFCSPKCSTAAASSS--------------------HSPYVCQALTRLLNLNSPDAA 109
            C    C P+C +     +                    H  Y C +  R L LN+ + A
Sbjct: 102 KCGWPLCGPECPSLTEHRAECQLTEERGQKVNVTEFNGPHPLYTCVSTVRCLLLNTTNPA 161

Query: 110 NLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTI 169
           N     QL+      +        +Q++  LA  G                  P    T 
Sbjct: 162 NAEKFNQLE------SLEQTRRGSNQWKADLASIGQFI---------------PKFFRTQ 200

Query: 170 ELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDA 228
           + +     ++ + A G+++  I G +   S   +  ++  ASF  + C+PN  +      
Sbjct: 201 KFS----EEEIMRAVGILQ--INGHEVPISDPPHVAVFYTASFTENSCVPNLAK------ 248

Query: 229 AAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +   +   ++    ++ +   + + Y    +  + RQ+ L+    F C C+RC
Sbjct: 249 SFNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCACERC 301


>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
 gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
          Length = 507

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 63/280 (22%), Positives = 104/280 (37%), Gaps = 36/280 (12%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFC 76
           GR LV+  +     +++ + P+   +  P  N    C  C+    +    +C  C    C
Sbjct: 21  GRYLVAKGATLGHGLLIEELPL---AVGPKCNGPVVCLGCYETDPNPEEELCSECGWPLC 77

Query: 77  SPKCSTAAASSSHSPYVCQAL----TRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVN 132
           + +CS  A  ++H    C+ L     R   L S  +    LD  +  R L+A       N
Sbjct: 78  A-ECSEQA-DNAHFQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE----AN 131

Query: 133 PSQFQILLA-----FQGTVTDNDT------SAAHYLSSLCPPPASTTIELTAALLAKDRL 181
           P ++   +A      +    D D       + A YL   C   +  + EL   ++    +
Sbjct: 132 PERWDSEVAPMEHHEEERKKDADVWHADRVNIAQYLRGPCKLASRFSEELIMQVVGVLEV 191

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAF    P        +      ++       H+C+PN  R  Y          I +R +
Sbjct: 192 NAFEARSP--------KGYPLRCLFPYTGILAHNCVPNTSRSIY----PSEGYKIRLRAM 239

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            D+ +G+ +  SY       + RQK L     F C CDRC
Sbjct: 240 VDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRC 279


>gi|171695572|ref|XP_001912710.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948028|emb|CAP60192.1| unnamed protein product [Podospora anserina S mat+]
          Length = 444

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  S FNHDC PN     +       +T + VR IH    G E+ +SY       ST
Sbjct: 202 GVFVDVSGFNHDCRPNV----HYRITNTTHTTVAVRDIHP---GDELTISYIYGIARRST 254

Query: 264 RQKRLLYDYDFACDCDRC 281
           RQK  L D+ F C C +C
Sbjct: 255 RQKE-LQDWGFECTCSQC 271


>gi|46139505|ref|XP_391443.1| hypothetical protein FG11267.1 [Gibberella zeae PH-1]
          Length = 593

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 5/80 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L+AS  NH C  NA + DY +    +     V  + D+ +G EI ++Y  +  +  T
Sbjct: 104 GVFLQASRINHACDNNAQK-DYNEGIKRHT----VHALRDIEEGEEITITYLGILKNRRT 158

Query: 264 RQKRLLYDYDFACDCDRCKV 283
           RQ+ L   + F C C+ C +
Sbjct: 159 RQQALRTKFMFTCTCNLCSL 178


>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
          Length = 416

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY        +
Sbjct: 305 GLFVLQSCCNHSCVPNA-----ETSFPENNFVLHVTALEDIKPGEEICISYLDCRQRERS 359

Query: 264 RQKR---LLYDYDFACDCDRCKVEAN 286
           R  R   L  +Y F C C +C  EA+
Sbjct: 360 RHSRHKILRENYLFNCSCPKCLAEAD 385



 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI---- 67
           +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +    
Sbjct: 32  KGKGLFATQLIRKGETIFIERPLVAAQFLWNALYQYRACDHCLRALEKAEENAQRLTGKP 91

Query: 68  -------------------CPSCSVAFCSPKCSTAAASSSHS 90
                              CP C V +CS +C  AAA   H 
Sbjct: 92  SQILPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYHQ 133


>gi|327353090|gb|EGE81947.1| hypothetical protein BDDG_04890 [Ajellomyces dermatitidis ATCC
           18188]
          Length = 447

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 201 RAYGIYLKASFFNHDCLPNA----CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP 256
           RA G  L +++ NH CLPNA        + D        +IV     + +G EI + Y  
Sbjct: 113 RALG--LDSAYLNHSCLPNAQHSLAELTF-DGTGNRRNFLIVYACRTIEEGEEITIPYES 169

Query: 257 VNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           +  D   RQ+ LL +Y F C C  C+ E
Sbjct: 170 LYLDRVGRQQFLLQEYGFECACKLCEKE 197


>gi|70947750|ref|XP_743461.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522968|emb|CAH78269.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 353

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 17/171 (9%)

Query: 204 GIYLK-ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           GIY K   +FNH CL N             N  + +R + D+  G E+ +SY  V +D +
Sbjct: 47  GIYKKPIPYFNHSCLSNC-------ITIFKNQRLYIRTLMDIYPGEELTISYLDVAFDRN 99

Query: 263 TRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHGDT 322
            R       Y F C C  CKV  N +   ++  NN+ +  + ++ +   +  +  +  + 
Sbjct: 100 ARLAICADQYFFTCTCKLCKV--NIASECHNIFNNDFICTQSENCKKFINYMEIVLMSEL 157

Query: 323 NFPHAYFFMRYMCDRDNCWGTLAPLPPSDATPSTVMECNVCGNLKNDAIGR 373
               +Y          + + T   L  S     TV +C +C N  ND++ +
Sbjct: 158 ERKISYL-------NKSHFKTFPILKKSTEKNETVWKCMLCKNETNDSVIK 201


>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 435

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 324 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERS 378

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 379 RHSRHKILRENYLFICSCPKCLAEAD 404



 Score = 37.7 bits (86), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 25/107 (23%), Positives = 43/107 (40%), Gaps = 27/107 (25%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSS 65
           VA +   G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++ 
Sbjct: 44  VASLGYIGKGLFATQLIRKGETIFIERPLVASQFLWNALYRYRACDHCLRALEKAEENAQ 103

Query: 66  SI-----------------------CPSCSVAFCSPKCSTAAASSSH 89
            +                       CP C V +CS +C  AAA   H
Sbjct: 104 RLTGKPGQVLPHPELCSVRKDLHQNCPHCQVMYCSAECRLAAAEQYH 150


>gi|290976641|ref|XP_002671048.1| SET domain-containing protein [Naegleria gruberi]
 gi|284084613|gb|EFC38304.1| SET domain-containing protein [Naegleria gruberi]
          Length = 914

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 58/286 (20%), Positives = 117/286 (40%), Gaps = 40/286 (13%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           V   E  GRG+ ++  LK+GQ+VL + P L  +    +++ + C+NC  K +        
Sbjct: 281 VITTEQYGRGMYASSELKSGQVVLSEKPFLCVT----LDTENTCNNCLNK-LKKKIPCKN 335

Query: 70  SCSVA-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNL 128
           +C +  +CS +C + A    H       +  +  +    + N   +     +FL     L
Sbjct: 336 NCGMEYYCSMECRSEAWDKYHRKLCGFDMRNINKMIVSGSTNSAKNHLYAIKFL--GMLL 393

Query: 129 AVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLME 188
            +V P         QG +   +     ++  L    +S    +    +  D +   G + 
Sbjct: 394 QIVYP---------QGELKQQEIRYVQFIEKLEENNSSFKCLIAWPNILNDLI---GKLS 441

Query: 189 PYIE-GQDGRRSVR-----AYGIYLKAS------FFNHDCLPNACRFDYVDAAAENNTDI 236
            ++   + G  S +     + G+ L ++      + NH C+PN      V+   EN    
Sbjct: 442 YHVSLDKIGNNSFKYGDAMSNGVALSSTLPKTIVYTNHSCVPN------VEYRYENGKCS 495

Query: 237 IVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           +V +   + +G ++ +SY   +     R + +L  Y F C+C++C+
Sbjct: 496 LVAL-RKIKKGEQLFISYVNPSLSAQKRAESIL-PYIFVCECEKCE 539


>gi|224006285|ref|XP_002292103.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
 gi|220972622|gb|EED90954.1| hypothetical protein THAPSDRAFT_23831 [Thalassiosira pseudonana
           CCMP1335]
          Length = 575

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 37/78 (47%), Gaps = 2/78 (2%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y  AS  NH C PNA    +    +     I   M   V  G EI +SY  V+     
Sbjct: 278 GVYPAASMINHSCRPNAVPTFWFSTPSPPMLQIT--MCKSVRVGDEIAISYCDVSAPRYV 335

Query: 264 RQKRLLYDYDFACDCDRC 281
           R++ L+ +Y FACDC  C
Sbjct: 336 RREGLIKNYKFACDCSHC 353


>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
          Length = 170

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 42/93 (45%), Gaps = 7/93 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G+Y   S  NH C PN        +   N   +++R + D+  G E+ + Y  +      
Sbjct: 7   GLYPSISLLNHSCDPNC-------SIVFNGPHLLLRAVRDIEVGEELTICYLDMLMTSEE 59

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
           R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 60  RRKQLRDQYCFECDCFRCQTQDKDADMLTGDEQ 92


>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
          Length = 464

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 61/267 (22%), Positives = 95/267 (35%), Gaps = 41/267 (15%)

Query: 33  LRDSPILLYSALPFIN------SSSFCHNCFRK--TMHSSSSICPSC-SVAFCSPKCSTA 83
           LR    ++    PF +        + C  CF +   +      C  C ++ +CS  C  A
Sbjct: 41  LRKGTDVIKGKRPFAHVVEKAKMETVCQTCFVQCEQLRRPLQRCAGCKALRYCSAACQKA 100

Query: 84  AASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQIL-LAF 142
                H P  C AL R+           P+       FL         N  +  +L L  
Sbjct: 101 DWKD-HKPE-CAALKRIS----------PVVPATFVMFLARILRKMERNTGEMDVLQLHM 148

Query: 143 QGTVTDNDTSAA-----HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR 197
            G  +D            +L    P      ++    +L     N+FG     I G +G 
Sbjct: 149 PGEPSDPQQQRGLFAILEHLRHFLPDAEKHLLKSAYPVLRITSANSFG-----ISGVEGN 203

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
                 G+Y   S+ NH C PN C   +    A       VR +HD+P  +E+ ++Y   
Sbjct: 204 NL--GVGLYDTVSYINHSCAPN-CSITFSGVYAR------VRSVHDLPPNQELTIAYIDP 254

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVE 284
               + R+  L   + F C+C RC+ E
Sbjct: 255 CDPRAKRRAHLKSQFMFDCECSRCERE 281


>gi|46122751|ref|XP_385929.1| hypothetical protein FG05753.1 [Gibberella zeae PH-1]
          Length = 508

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 9/106 (8%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           ++L  S  NH C PNA      +   +    I V  + D+ QG+EI ++Y      Y  R
Sbjct: 91  VFLATSRINHSCKPNA-----QNRWNQGLGKITVHAVEDIEQGQEITITYLGNPEVYEER 145

Query: 265 QKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV--MDEDQDEQ 308
           QK+L   + F C C  C +    ++ D DD+  +E+  + ED + +
Sbjct: 146 QKKLKNAFGFDCCCRLCSLSP--AERDLDDKLIKEIDHLQEDLENE 189


>gi|408393969|gb|EKJ73226.1| hypothetical protein FPSE_06650 [Fusarium pseudograminearum CS3096]
          Length = 620

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 11/108 (10%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           NAFGL +   + +D   S  + G++++AS+ NH C+PNA +    D        I+ R  
Sbjct: 489 NAFGLGQ---QTEDEDISNASTGLWVRASYINHSCIPNAKKDLIGDL-------ILFRAT 538

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSD 289
             +  G EI  +Y   +  Y  RQ      ++F C C  C V+   SD
Sbjct: 539 RRIASGEEITHAY-DESTSYEARQAAFCRTWNFECRCPLCLVQMKESD 585


>gi|367053077|ref|XP_003656917.1| hypothetical protein THITE_114734 [Thielavia terrestris NRRL 8126]
 gi|347004182|gb|AEO70581.1| hypothetical protein THITE_114734 [Thielavia terrestris NRRL 8126]
          Length = 699

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 37/83 (44%), Gaps = 7/83 (8%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
           A G+++ AS  NH C+PN  R    D        +I R   D+ +G E+   Y PV    
Sbjct: 453 AKGLWVHASRMNHSCVPNTMRSFVGDM-------LISRATRDIKEGEELFQQYVPVKALP 505

Query: 262 STRQKRLLYDYDFACDCDRCKVE 284
             R ++    + F C+C  C  E
Sbjct: 506 DVRNEQFKKGWGFECECALCASE 528


>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
 gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
          Length = 555

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 58/303 (19%), Positives = 117/303 (38%), Gaps = 53/303 (17%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALP---FINSSSFCHNC--FRKTMHSS 64
           + E   +GR + +T  LK G  +  ++    ++++P   F+    +C NC  ++ +    
Sbjct: 9   LIEESDKGRYVEATDILKPGDSLFSENS---FASIPLENFLYKKDYCFNCCKYQISTEQQ 65

Query: 65  SSICPS------------CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
           +S  P             C++ FCS +C+          ++ + +             L 
Sbjct: 66  TSEQPKSTPGFPFQCKYGCNMWFCSNECADGMVHQLECSFISKFV------------ELS 113

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
           L     A   + A  + + N    ++     G +T+  +       S     +    ++ 
Sbjct: 114 LKHDCDASISLLALRVVMKNKVASEVFEQTVGKLTNQKSIFIQNNKSFLDKYSKLFDQVY 173

Query: 173 AALLAKD-RLNAFGLMEPYIE-------------GQDGRRSVRAYGIYLKASFFNHDCLP 218
             + +++  L++    E Y+E               +  R+  + G + KA+  NH C P
Sbjct: 174 LIVKSENSELSSIFNKEEYLEVVCSIFVNSFAGLSNNFNRTPISNGYFYKAALLNHSCEP 233

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDC 278
           N          +  N+ + +R++ D+ +G +I  SY  +      RQK LL   +F CDC
Sbjct: 234 NT-------FFSIQNSTLEMRVVKDINKGEQIFDSYVDLLLPTIERQKILLKSKNFVCDC 286

Query: 279 DRC 281
            RC
Sbjct: 287 SRC 289


>gi|428171215|gb|EKX40134.1| hypothetical protein GUITHDRAFT_113871 [Guillardia theta CCMP2712]
          Length = 149

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 53/145 (36%), Gaps = 49/145 (33%)

Query: 247 GREICLSYFPVNYDYSTRQKRLLYDYD---------------FACDCDRCKVEANWSDND 291
           G EI +SY  +  D   RQKRL  +Y                F C+C RCK+E     ++
Sbjct: 7   GTEISISYTSIAADLVHRQKRLRSEYGVAIITSPRNVDFQSGFVCNCQRCKIEKEIGVHE 66

Query: 292 NDDENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPL---- 347
             +E +E    E+ D  ++                  +  +Y C    C GTLAP+    
Sbjct: 67  GRNEGSE---SEEADALLL------------------YLAKYSCTDAECGGTLAPIIQLQ 105

Query: 348 ---------PPSDATPSTVMECNVC 363
                      +      VMECN C
Sbjct: 106 NEEQGNKRSSKNRKQDEVVMECNRC 130


>gi|429852375|gb|ELA27514.1| hypothetical protein CGGC5_11625 [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 8/81 (9%)

Query: 199 SVRAYGIYLKASFFNHDCLPNAC-RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           +V  YG++   +  NHDC PNA  R+     A E      V    D+  G E+ +SY P+
Sbjct: 229 NVSHYGLFTNIARINHDCRPNAVTRYSPRTLALE------VVAYRDIQPGEELSISYSPL 282

Query: 258 NYDYSTRQKRLLYDYDFACDC 278
           N  Y+ RQ R L ++ F C C
Sbjct: 283 NMLYADRQ-RTLQEWGFNCTC 302


>gi|401417428|ref|XP_003873207.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489435|emb|CBZ24694.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 622

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 10/75 (13%)

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP-VNYDYSTRQKR 267
           ASFFNH C PN C  D V  A      +  R IH    G E+ ++Y P + +    R++R
Sbjct: 402 ASFFNHSCAPN-CAIDTVRHAI-----VTTRTIH---VGEELSIAYIPQLYWPTRLRRER 452

Query: 268 LLYDYDFACDCDRCK 282
           L   Y F C C RC+
Sbjct: 453 LSEGYYFVCRCQRCQ 467


>gi|400593140|gb|EJP61138.1| SET domain-containing protein 5 [Beauveria bassiana ARSEF 2860]
          Length = 574

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L+AS  NH C  NA      +   EN   + +  + D+ +G+EI +SY     +Y+ 
Sbjct: 300 GLFLEASRINHSCRHNA-----QNTWNENIGRLTIHALCDIEEGQEITISYLSRRMEYTE 354

Query: 264 RQKRLLYDYDFACDCDRC 281
           R++ L   + F C+C+ C
Sbjct: 355 RRRLLQAKFCFDCECELC 372


>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
          Length = 239

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + ++  G E+ + Y  +    
Sbjct: 5   GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 58  EERRKQLRDQYCFECDCIRCQTQDKDADMLTGDEQ 92


>gi|356519848|ref|XP_003528581.1| PREDICTED: uncharacterized protein LOC100812233 [Glycine max]
          Length = 527

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 38/84 (45%), Gaps = 8/84 (9%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G++L  SF NH C+PNA R    D        ++V    D+  G E+  +YF      
Sbjct: 314 GVGLWLLPSFVNHSCVPNARRLHVGDY-------LVVHASKDLKAGEEVTFAYFDPLCGL 366

Query: 262 STRQKRLLYDYDFACDCDRCKVEA 285
           S R K +  ++   C C RC+ E 
Sbjct: 367 SKR-KEMSVNWGIHCKCKRCRFEG 389


>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
 gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
          Length = 239

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + ++  G E+ + Y  +    
Sbjct: 5   GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             R+K+L   Y F CDC RC+ +   +D    DE 
Sbjct: 58  EERRKQLRDQYCFECDCIRCQTQDKDADMLTGDEQ 92


>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
           [Otolemur garnettii]
          Length = 416

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFICSCPKCLAEAD 386


>gi|451845507|gb|EMD58819.1| hypothetical protein COCSADRAFT_348193 [Cochliobolus sativus
           ND90Pr]
          Length = 338

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 6/86 (6%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           +   ++  A+ FNH C PNAC F + +A+    T   + +++DV  G+EI +SY  + +D
Sbjct: 133 KGAAVFPHAARFNHSCNPNAC-FSW-NASITKET---IHVMNDVAAGKEITISYCDMIHD 187

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R   L + Y F CDC  C  + N
Sbjct: 188 KPSRAYELKH-YGFVCDCPACAGDEN 212


>gi|440635523|gb|ELR05442.1| hypothetical protein GMDG_01737 [Geomyces destructans 20631-21]
          Length = 265

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 41/84 (48%), Gaps = 12/84 (14%)

Query: 198 RSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV 257
           R+  + GI+L AS FNH C P   R+ Y +    +N ++I     DV +G E+ + Y P 
Sbjct: 170 RAGSSSGIFLVASRFNHSCNP-VIRYSYDN----SNRELIFATKRDVKEGEELTIMYTP- 223

Query: 258 NYDYSTRQKRLLYDYDFACDCDRC 281
                   + L  DY F CDC  C
Sbjct: 224 ------NSQHLSMDYGFLCDCGVC 241


>gi|169604975|ref|XP_001795908.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
 gi|111065447|gb|EAT86567.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
          Length = 367

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 5/80 (6%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           G++L+A   NH C  NA  F       +N   + +  + D+ QG EI +SY   + +   
Sbjct: 135 GLFLQACRINHACDNNAQNF-----WNDNIDKLTIHAVRDIGQGEEITISYLSSSQNPQA 189

Query: 264 RQKRLLYDYDFACDCDRCKV 283
           RQ+ L +++ F+C C  C +
Sbjct: 190 RQQELQHNFKFSCACRLCSL 209


>gi|347831206|emb|CCD46903.1| similar to MYND domain protein [Botryotinia fuckeliana]
          Length = 545

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 109/285 (38%), Gaps = 36/285 (12%)

Query: 19  GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
           G+V+T+ + A + VL D P L       ++S+  C  C    +    + C  C   +CS 
Sbjct: 272 GVVATRDILAQETVLVD-PTLA----AVVDSADRCPACCGPFLDKIENSC--CKTLYCSS 324

Query: 79  KCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQARFLIAAYNLAVVNPSQ 135
            CS  A  S H+    + L  LL   S   +N     +  +L  R L  +    V +P +
Sbjct: 325 SCSQNALDSYHTIVCGKDLDFLLGTESESLSNSRESSMGSKLFLRVLALSLKEDVASPLK 384

Query: 136 FQILLAFQGTVTDNDTSAA------HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
             ++         N+          H ++     P     EL   + A    + + L   
Sbjct: 385 TSLISRLTPAYNPNNPQLVVLNFKDHIIT-----PIRILRELGIDVFANSAYDTWVLHTI 439

Query: 190 YIEGQDGRRS-----VRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHD 243
           Y   Q+ +       +   G+    S FNH C PN   R D      + N+ + +    D
Sbjct: 440 YCRLQNNKHGQTFDDICGTGVNPLYSMFNHSCDPNIDWRHD------DENSTVTMFAERD 493

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC---KVEA 285
           +  G E+ +SY         R+++L+  +   C C +C   K+EA
Sbjct: 494 IKNGEEMFISYIGKGKGLEERRRKLMPWFGMDCACHKCDEEKLEA 538


>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 181

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 199 SVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVN 258
            V   G+Y   +  NH C PNA      +A+   +  + V+ +  V  G E+  SY   N
Sbjct: 89  EVLGSGLYRGVAVTNHSCSPNA------EASFGGSRCLRVKSLRPVHAGEEVFQSYIDEN 142

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEA 285
                RQ +L   Y FAC C +C+ EA
Sbjct: 143 LPLVERQSKLRQAYGFACRCGKCRTEA 169


>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
           [Megachile rotundata]
          Length = 523

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/291 (20%), Positives = 106/291 (36%), Gaps = 52/291 (17%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           +AE    GR  ++ +++KAG+IVL+D   L+  A P  NS   C  C+     +++  C 
Sbjct: 48  LAESSSLGRYYIAARNIKAGEIVLKDDRPLV--AGPMHNSVPVCLGCYMVLHENTAVPCT 105

Query: 70  SCSVAFC-----------------SPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP 112
            C    C                 S K S       H  Y C  + R L+L   +     
Sbjct: 106 KCGWPLCQNCKEHGTECDFTSSRRSDKVSITKFGYPHPSYQCINVIRALSLKDNN----- 160

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
                     I +YN  +   S    +   +  + +  ++  +++              +
Sbjct: 161 ----------IESYNKLLTLESHSDKIKNKETFIFEESSNLVNFIKRFFK---------S 201

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAE 231
             +  +D     G+++  + G +   +   Y  +Y  AS   H+C  N C   + D    
Sbjct: 202 EDIPEEDIRKIIGVLQ--VNGHEVPLTDPPYVAVYELASLLEHNCKAN-CSKSFTDTGG- 257

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
               +I+     + +G  I + Y    +    R+  LL    F C CDRC+
Sbjct: 258 ----LIIHAATPITKGDHISICYTDPLWGTMNRRHHLLETKYFECTCDRCR 304


>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
           [Otolemur garnettii]
          Length = 300

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 190 GLFVLQSCCNHSCVPNA-----ETSFPENNFFLHVTALEDIKPGEEICISYLDCCQRERS 244

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 245 RHSRHKILRENYLFICSCPKCLAEAD 270


>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
           Y34]
 gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
           P131]
          Length = 542

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 63/278 (22%), Positives = 104/278 (37%), Gaps = 32/278 (11%)

Query: 17  GRGLVSTQSLKAGQIVLR-DSPILLYSALP-FINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           GR L +T+  + G ++   D+P+   +A P    +S+ CH+C  +       +    +V 
Sbjct: 28  GRSLRATRDFQPGDLIAEFDNPL---AAFPDAARASTTCHHCLDQNAKVFGCMGCEKAVK 84

Query: 75  FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV--- 131
           +CS +C  A     HS   C+   ++      D    P+   +Q     A     V    
Sbjct: 85  YCSSECQRANWKLVHSKE-CKVFRKVQTAVGKDWLPTPVRTLVQLLVRWAEVQQLVSQLE 143

Query: 132 -NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPY 190
            N  +F+        +     +  HY            + L+  +L K + N+F   +  
Sbjct: 144 GNEDRFKERKQLWEDMKLQAYAGIHYAGR---KEDDANLFLSLDVLCKIQTNSFDRFDA- 199

Query: 191 IEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
                      A G +L    +  NH C+PNA    +   A        +R    + QG 
Sbjct: 200 --------DTGASGTFLDPVLAMANHSCVPNAVVLFWRRKA-------YLRAEMPIKQGS 244

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
           EI +SY         RQ+ L + Y F C C RCK + N
Sbjct: 245 EISISYIDYTKPVRFRQEDL-WLYHFTCKCPRCKDDLN 281


>gi|45184727|ref|NP_982445.1| AAL097Cp [Ashbya gossypii ATCC 10895]
 gi|44980073|gb|AAS50269.1| AAL097Cp [Ashbya gossypii ATCC 10895]
          Length = 368

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 89/224 (39%), Gaps = 41/224 (18%)

Query: 77  SPKCSTAAASSSH-------------SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
           +PK S    S  H             + +V + L RL +L +        D QLQ+  L 
Sbjct: 165 TPKLSAMKPSKRHLMIPAIDEEVYSCARFVTECLFRLKHLPNESTMYQAFD-QLQSNEL- 222

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIE--LTAALLAKDRL 181
                     ++F +LL FQ  V         +L  + P      +   L   +L  +  
Sbjct: 223 -------QKITEFPVLLEFQLKVYK-------FLKVVLPIGLRDLLSSALFRKILGSEYG 268

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N+FG+ E   EG   R  +  Y +  +AS+FNH C PN        A      ++   + 
Sbjct: 269 NSFGIWEQN-EGNYSRDYL-GYWVLPEASYFNHSCAPNL-------AKKRVGREMYFVLT 319

Query: 242 HDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
            DV  G ++C+ Y  + +     R+  L  ++ F C C+RC +E
Sbjct: 320 SDVAAGEQLCIDYKGILDLPVVERRNILHSNWFFDCACERCALE 363


>gi|116200772|ref|XP_001226198.1| hypothetical protein CHGG_10931 [Chaetomium globosum CBS 148.51]
 gi|88175645|gb|EAQ83113.1| hypothetical protein CHGG_10931 [Chaetomium globosum CBS 148.51]
          Length = 381

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 46/87 (52%), Gaps = 7/87 (8%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTD-IIVRMIHDVPQGREICLSYFPVNYDYST 263
           +YL AS  NH C+PNA      D A++++T  +++R   D+  G EI  SY       +T
Sbjct: 155 LYLNASRLNHSCIPNA------DPASKDDTSYMVMRANRDINAGEEITSSYILRVVPRAT 208

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDN 290
           R ++L   + F C C  C  +  +SD+
Sbjct: 209 RLQQLSKGWGFTCQCPACDPKNPFSDS 235


>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
 gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
          Length = 515

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 58/273 (21%), Positives = 108/273 (39%), Gaps = 17/273 (6%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICP 69
           VA  E  GR LV+T+++K  +IVL+++P++     P   S   C  C      +    C 
Sbjct: 45  VAHNELLGRHLVATRNIKPYEIVLKEAPLM---RGPAQISVPVCMGCLNAIEPNDHITCD 101

Query: 70  SCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLA 129
            C    C P+C       +    + +A  + +N+   +  + PL   +     +      
Sbjct: 102 KCGWPLCGPECQALDEHKAECQ-LTEARGQKVNVQEFNGPH-PLYTCVSTVRCLLLSETN 159

Query: 130 VVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
             N  +FQ L + + T   ++   A   S     P       T     ++ + A G ++ 
Sbjct: 160 PANAEKFQQLESLEQTRRGSNQWKADLASIGQFIPKFFR---TQKFSEEEIMRAVGALQ- 215

Query: 190 YIEGQDGRRSVRAY-GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGR 248
            I G +   S   +  ++  ASF  + C+PN  +      +   N   I+     + +G 
Sbjct: 216 -INGHEVPTSDPPHVAVFYTASFTENSCVPNLAK------SFTKNGHCILWAPKAIKKGA 268

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            + + Y    +  + RQ+ L+    F C C+RC
Sbjct: 269 NLSICYSDAVWGTADRQRHLMQTKLFKCTCERC 301


>gi|365760270|gb|EHN02002.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 524

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           +YL  SF NHDC PNA    Y++   E+  ++ +     + +G +IC++Y    +    R
Sbjct: 356 MYLWISFINHDCEPNA----YIEQTEEHE-ELKLHARKPIKKGEQICITYVNPLHGVRLR 410

Query: 265 QKRLLYDYDFACDCDRCKVE 284
           ++ L  ++ F C CDRC+ E
Sbjct: 411 RRELRVNWGFLCQCDRCQNE 430



 Score = 41.6 bits (96), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 11/81 (13%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLY----SALPFINSSSFCHNCFRK----TMHSSSSI 67
           RGRGL + +    GQI+L+++  ++Y      L FI++   C  C +     T H     
Sbjct: 121 RGRGLFAEKDFAKGQIILKENKPIVYVPPLDKLFFISNGKACARCGKALYDLTQHKIMVH 180

Query: 68  ---CPSCSVAFCSPKCSTAAA 85
              C  C   +CS KC  A A
Sbjct: 181 YLDCEVCKAIWCSEKCKKAHA 201


>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 75/191 (39%), Gaps = 32/191 (16%)

Query: 109 ANLPLDRQLQARFLIAAYNLAV---VNPSQFQILLAF-----QGTVTDNDTSAAHYLSSL 160
             L L R+L   F  A Y  AV     P  F+ L A      QG  T + +   H   +L
Sbjct: 215 GQLELLRRL---FTEALYEEAVSQWFTPDGFRSLFALVGTNGQGIGTSSLSQWVHACDAL 271

Query: 161 -CPPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGR-RSVRAYGIYLKASFFNHDCLP 218
              P     +++    L KD           IE   G   +    G ++  S  NH C+P
Sbjct: 272 ELKPQDREQLDIFIDQLYKD-----------IEAATGEFLNCEGSGFFVLQSCCNHSCVP 320

Query: 219 NACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLLYDYDFA 275
           NA       +  ENN  + V  + D+  G EIC+SY           +R K L  +Y F 
Sbjct: 321 NA-----ETSFPENNFLLHVTALEDIKPGEEICISYLDCCQRERSRHSRHKILRENYLFV 375

Query: 276 CDCDRCKVEAN 286
           C C +C  EA+
Sbjct: 376 CSCPKCLAEAD 386


>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
          Length = 300

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + D+  G EIC+SY         
Sbjct: 189 GLFVLQSCCNHSCVPNA-----ETSFPENNFVLHVTALEDIKPGEEICISYLDCCQRERS 243

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 244 RHSRHKILRENYLFNCSCPKCLAEAD 269


>gi|357474929|ref|XP_003607750.1| SET domain protein [Medicago truncatula]
 gi|355508805|gb|AES89947.1| SET domain protein [Medicago truncatula]
          Length = 540

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 39/90 (43%), Gaps = 16/90 (17%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF----PV 257
             G++L  SF NH C PNA R    D        +IV    D+  G EI  +Y     P+
Sbjct: 321 GVGLWLLPSFVNHSCCPNARRLHVGDY-------LIVHASRDLKAGEEITFAYLDPLSPL 373

Query: 258 NYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
           N     ++K L   +   C C RCK E  +
Sbjct: 374 N-----KRKELSVTWGILCKCKRCKFEGEF 398


>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
 gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 793

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 77/334 (23%), Positives = 130/334 (38%), Gaps = 91/334 (27%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS--SSFCHNCFRKTMHSSSSICPSCSV 73
           +GRG+ S   +    ++  + P+    A+  + S   + CH CF +   +   +CPSC++
Sbjct: 227 KGRGMSSPNDISPASLIHAEDPL----AVIIMKSCRDTHCHYCFSEAP-ADVVVCPSCTI 281

Query: 74  -AFCSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANL---------PLDRQL-QARF 121
             +CS +C   A         C   T L  N N  D A L         P  +Q+ + R 
Sbjct: 282 PIYCSNRCQEKAIGQ----MSCNQNTHLESNNNVVDIAKLSVTSTKSKTPDSKQIAEHRH 337

Query: 122 LIAAYNLAVVNPS-------------QFQILLAFQGTVT----------DNDTSAAHYLS 158
                  A V P+             + Q+L+  + T++          D D+SA+ + S
Sbjct: 338 ECGGACWAAVLPADIVLAGRIMAQYIEKQLLVGKRSTISGPNLDLVHHYDQDSSASKFES 397

Query: 159 S-------LCPPP--------ASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVR-- 201
                   LC           A  ++     L+ + ++N+  ++  +++  DG +++   
Sbjct: 398 HIYATVLFLCLQSYYKSGVSWAEDSLSQLVLLICQIKVNSIAIV--HMKSMDGVKALTKG 455

Query: 202 -----------------AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                            A  IY+  SFFNH C PN      + A   + T +I+R    +
Sbjct: 456 FSGFSGDVMCSVEQVRVAQAIYMSGSFFNHSCRPN------IHAYFHSRT-LILRSTEYI 508

Query: 245 PQGREICLSYFPV--NYDYSTRQKRLLYDYDFAC 276
             G  I LSY P     D   RQK L  +Y F+C
Sbjct: 509 KAGSPIELSYGPQVGEMDLPERQKSLRENYYFSC 542


>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
 gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 413

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/115 (25%), Positives = 54/115 (46%), Gaps = 18/115 (15%)

Query: 169 IELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYV 226
           I+L   L+ K R N FG+           ++ +  G+ +   +S+FNH C+PN       
Sbjct: 239 IKLIRPLIQKVRCNQFGIWT---------KNDKCIGMAVSPSSSYFNHSCIPNC------ 283

Query: 227 DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             +  + +D+  + +  + +G +I +SY  ++     R+  L + Y F C C RC
Sbjct: 284 -ESVRDGSDMTFKSLFPIKKGDQINISYLALDKSTKRRRDYLKFGYYFHCQCPRC 337


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 66/283 (23%), Positives = 98/283 (34%), Gaps = 47/283 (16%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + Q  + G ++    P   Y  L      + C  CF  T     S C  C  A
Sbjct: 16  GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71

Query: 75  F-CSPKCST-------------AAASSSHSPYVCQALT-RLLNLNSPDAANLPLDRQLQA 119
           F C+ +C                    + +P     LT R+L          P ++ L  
Sbjct: 72  FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYL-SSLCPPPASTTIELTAALLAK 178
           +             S    L   +  +  +D +A H+  S     P S ++     L A+
Sbjct: 132 KEF----------ESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDSDSL---VVLFAQ 178

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
              N F +       +D   S     I+   +  NH C PN     Y    AE      V
Sbjct: 179 VNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------V 224

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           R + ++  G E+  SY  + Y    R  RL   Y F C+C  C
Sbjct: 225 RAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267


>gi|170040667|ref|XP_001848113.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864252|gb|EDS27635.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 560

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 74/314 (23%), Positives = 121/314 (38%), Gaps = 56/314 (17%)

Query: 4   VSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS 63
           +++ + + +  G GRGLV+ ++ KAG ++L +  IL   A+ F     +C  C     HS
Sbjct: 175 LAKGIAMKQYSGSGRGLVAERNFKAGDVILNEKAILSVVAVAF--KYLYCSRCGIPNQHS 232

Query: 64  SSSICPSC-SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLP---------- 112
               CP+C    +CS +C    +  +H  Y C    ++ N+ + D +NL           
Sbjct: 233 LIP-CPNCVHCMYCSEECLAEDSRMAHR-YECSFGAQMANI-AYDGSNLATKLFFYGLTL 289

Query: 113 ----LDRQL---QARFLIAA--YNLAVVNPSQFQILLAFQGT---------VTDNDTSAA 154
               LD+ +   +   +I A  + L   N    +    F  T         V+    +AA
Sbjct: 290 FNNDLDQMMKYCEKNSMIGADPFKLDHRNYDPLEEFKIFHETKLHLSPYIEVSFRLCAAA 349

Query: 155 HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP---YIEGQDGRRS---VRAYGIYLK 208
            Y   L  P     +   A    +  LN   L +    Y   +  R+S       G+   
Sbjct: 350 TYYVLLKQPSIKALVTTKAQ--KRSLLNCLYLYQRVAMYFAWE--RKSPPDCTVTGLLSY 405

Query: 209 ASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKR 267
               NH C PNA   +DY          I   +I  V +G++I +S  P  +    +Q  
Sbjct: 406 GKLVNHSCNPNAMILYDY--------GQIKCILIRPVRKGQQITISLGPAWFVIGNQQAS 457

Query: 268 LLYDYDFACDCDRC 281
              D  F C C+ C
Sbjct: 458 ---DLSFTCQCEVC 468


>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
          Length = 239

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 18/121 (14%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + ++  G E+ + Y  +    
Sbjct: 5   GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKV-----------EANWSDNDNDDENNEEVMDEDQDEQMV 310
             R+K+L   Y F CDC RC+            E  W +     +  EE+    + EQ++
Sbjct: 58  EERRKQLRDQYCFECDCIRCQTHDKDADMLTGDEQIWKEVQESLKKIEELKAHWKWEQVL 117

Query: 311 A 311
           A
Sbjct: 118 A 118


>gi|302886135|ref|XP_003041958.1| hypothetical protein NECHADRAFT_122942 [Nectria haematococca mpVI
           77-13-4]
 gi|256722865|gb|EEU36245.1| hypothetical protein NECHADRAFT_122942 [Nectria haematococca mpVI
           77-13-4]
          Length = 647

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 67/153 (43%), Gaps = 19/153 (12%)

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEPYI 191
           NPSQ + +L   G    + +     L  +   P   T ++   +    + NAFG   P  
Sbjct: 439 NPSQIERVLDLFG----DYSGLGKKLVEVDGKPVVDTFQIHDII----QRNAFG---PGQ 487

Query: 192 EGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREIC 251
           + +D   S  + G+++ AS+ NH C+PNA + DY+         +IVR    +  G EI 
Sbjct: 488 QTEDEDISNASTGLWIMASYMNHSCVPNAKK-DYIGDL------MIVRATRRIIAGEEI- 539

Query: 252 LSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           L  +  + D+  R   L   + F C C  C  E
Sbjct: 540 LQRYDESTDHDVRTASLQRTWGFRCKCSLCLAE 572


>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 547

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 66/294 (22%), Positives = 111/294 (37%), Gaps = 56/294 (19%)

Query: 17  GRGLVSTQSLKAG-QIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSS--------SSI 67
           G+G+ +T  +K G  +++ D+P++  + +      + C  C+  +   S           
Sbjct: 46  GQGVFATIDVKKGADVLVVDNPLI--ALVEEAQLQNICSGCYDTSKAGSIDNRRPDLVKA 103

Query: 68  CPSCSVAF-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL---- 122
           C  C V + C   C      + HS   C+    L     P    LP+   L+   L    
Sbjct: 104 CTRCKVVYYCDKNCQRKDWKAGHS-LECKTYAELY----PKILPLPVRAVLRILMLRRAD 158

Query: 123 -----IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL----TA 173
                + + +LA+  P  +  +        ++  +  H L +      S   +L     A
Sbjct: 159 KIIPEVYSESLALTYPKIYDCV--------ESQETKDHLLMAKALREYSNLTDLDNKWVA 210

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKA--SFFNHDCLPNACRFDYVDAAAE 231
           +L  +   N+F L   +     GRR     G+Y     + FNH C PNA    Y  A  +
Sbjct: 211 SLFGRLNANSFCLTSAF-----GRRR----GVYFHPGPARFNHSCDPNA---SYSFAKGK 258

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
                 +R I  + +  +I +SY    Y   TR+  L   Y F C C +C  EA
Sbjct: 259 ----CYIRAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKCLHEA 308


>gi|428162750|gb|EKX31864.1| hypothetical protein GUITHDRAFT_121938 [Guillardia theta CCMP2712]
          Length = 368

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 209 ASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY--STRQK 266
           AS FNH C PNA  +         + D+ VR +  +  G+EI + Y    Y    + R++
Sbjct: 118 ASAFNHSCKPNAAIY-----PCPTSDDVQVRALRQIRTGQEIAICYDSTLYALPKAMRRQ 172

Query: 267 RLLYDYDFACDCDRC 281
            LL+ + F CDC RC
Sbjct: 173 ELLHRWGFECDCPRC 187


>gi|401625526|gb|EJS43529.1| set5p [Saccharomyces arboricola H-6]
          Length = 526

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           IY   SF NHDC PNA    Y++ A E++ ++ +     + +G +IC++Y    +    R
Sbjct: 358 IYHWHSFMNHDCEPNA----YIEQAEEHD-ELKLHARRPIKKGEQICVTYVNPLHGVRLR 412

Query: 265 QKRLLYDYDFACDCDRCKVE 284
           ++ L  ++ F C CDRC+ E
Sbjct: 413 RRELRVNWGFLCQCDRCQNE 432


>gi|303312507|ref|XP_003066265.1| hypothetical protein CPC735_054900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105927|gb|EER24120.1| hypothetical protein CPC735_054900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320033649|gb|EFW15596.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 544

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 50/221 (22%), Positives = 92/221 (41%), Gaps = 21/221 (9%)

Query: 86  SSSHSPYVCQALTRLLNL-NSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQILLAFQG 144
            S+ SP++ +  T L+ L  SP       + + Q   L A   +        + ++  Q 
Sbjct: 107 GSTPSPFLRENATPLIRLLASPGIGWTAAEFKKQGTGLAAMMEMPRCQIITHEPVMVVQD 166

Query: 145 TVTDNDTSAAHYLSSLCPP------PASTTIELTAALLAKDRLNAFGLMEPYIEGQ---- 194
           T+     S  +Y   L         P  T    T    ++++L  FG    Y E      
Sbjct: 167 TLESMGGSIKNYNKMLAEKIKEVNCPNFTHFFFTLKKDSEEKLGKFGA---YFEKNCIPC 223

Query: 195 --DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAE--NNTDIIVRMIHDVPQGREI 250
                + VR  G +  +S  NH C+PNA +   ++A  E   +T + +R   ++ +G EI
Sbjct: 224 TLPSGQKVRVLGHW--SSKINHSCVPNAQQ-TLIEANTEGVKSTFLNIRACREISRGEEI 280

Query: 251 CLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
            ++Y  ++ D + R++ +   + F C C+ C+ +    D D
Sbjct: 281 TVAYDEIHLDTTERKQHMKEHFGFECACEYCESQDRELDAD 321


>gi|353236058|emb|CCA68060.1| hypothetical protein PIIN_01927 [Piriformospora indica DSM 11827]
          Length = 798

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 12/87 (13%)

Query: 195 DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSY 254
           DG+ +V    IYL  S+FNH C  NA R  + DA       I++R +  +  G EI +SY
Sbjct: 522 DGKSTV----IYLLPSYFNHSCDANATRHFFGDA-------IVIRALTQIRAGDEITISY 570

Query: 255 FPVNYDYSTRQKRLLYDYDFACDCDRC 281
              N  Y  R    L  +   CDC  C
Sbjct: 571 IYANEPYHERAN-TLKKWFPTCDCSLC 596


>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
 gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
          Length = 485

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 6/84 (7%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R   I+   S+ NH+C PN      V    +  T + +    D+ +G E+  +Y    + 
Sbjct: 332 REGQIFTLYSYLNHNCEPN------VRYELDGKTGLKLFARKDIKKGEELLTTYVNPLHG 385

Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
            + R++ LL ++ F CDC+RC  E
Sbjct: 386 VTLRRRELLVNWGFLCDCERCNKE 409


>gi|409084007|gb|EKM84364.1| hypothetical protein AGABI1DRAFT_117750 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 429

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 71/347 (20%), Positives = 127/347 (36%), Gaps = 79/347 (22%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILL---YSALPFINSSSFCHNCFRKTMHS-------SS 65
           +G+GLV+T  +K G+++ ++ P +    +  L     S  C  C      S        S
Sbjct: 97  KGKGLVATHEIKEGEVIWKEDPFITSPEWGILDLQMKSVACAFCTTPLTGSPLILSCAGS 156

Query: 66  SICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAA 125
           ++   C+  FC+  C T +A     P +C          + + A++PL + ++    +A 
Sbjct: 157 NMASPCTARFCNRLCFTRSAKLH--PLLCP---------TKNPASIPLIQFVRESRWMAF 205

Query: 126 YNLAVVNPS--------------QFQILLAF---------------QGTVTDNDTSAAH- 155
           Y L +                   ++++ A                 G    +  +  H 
Sbjct: 206 YALVLCTSRLLLESQSEDKSFNDNWEMIHALAELGMEERHKYSYNLSGQYEPDRETWRHA 265

Query: 156 ---YLSSLCPPPASTTIELTAALLAKDRLN-----AFGLMEPYIEGQDGRRS--VRAYG- 204
              ++ +   P ++   +  A +  K   N      F     ++ G  GR S  + A+G 
Sbjct: 266 YELFIQAFKEPSSTADAKKLAKIFRKPHPNNFQADIFDYNTGFLRGL-GRMSLNIEAHGG 324

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTR 264
           IY   S  NH C PN          A +   +I +   D+  G E+ ++Y      Y TR
Sbjct: 325 IYTLHSHLNHSCDPNVSIRHLDQTTALSRITVIAK--RDIKVGEELLITYVDPQLPYQTR 382

Query: 265 QKRLLYDYDFA-CDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMV 310
           Q   L  + F  C C RC  E             + V +ED +++M 
Sbjct: 383 QNE-LKGWGFGRCTCSRCLEEV------------KTVKEEDSEKEMT 416


>gi|308812131|ref|XP_003083373.1| Predicted histone tail methylase containing SET domain (ISS)
           [Ostreococcus tauri]
 gi|116055253|emb|CAL57649.1| Predicted histone tail methylase containing SET domain (ISS)
           [Ostreococcus tauri]
          Length = 190

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 204 GIYL--KASFFNHDCLPNACRFDYV-DAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           G+ L   AS  NH C P +C   ++ DA A       V    D+ +G EI +SY P ++ 
Sbjct: 95  GVVLLGSASNINHSCEP-SCEVAFIHDARAH------VIATRDITKGEEITISYVPGSWP 147

Query: 261 YSTRQKRLLYDYDFACDCDRCK 282
              R+K LL  Y FACDC  C+
Sbjct: 148 LRRRRKELLDRYGFACDCALCE 169


>gi|119192884|ref|XP_001247048.1| hypothetical protein CIMG_00819 [Coccidioides immitis RS]
 gi|392863720|gb|EAS35512.2| hypothetical protein CIMG_00819 [Coccidioides immitis RS]
          Length = 545

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 11/124 (8%)

Query: 176 LAKDRLNAFGLMEPYIEGQ------DGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAA 229
           L KD    +G    Y E           + VR  G +  +S  NH C+PNA +   ++A 
Sbjct: 202 LKKDSEEKWGKFGAYFEKNCIPCTLPSGQKVRVLGHW--SSKINHSCVPNAQQ-TLIEAN 258

Query: 230 AE--NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANW 287
            E   +T + +R   ++ +G EI ++Y  ++ D + R++ +   + F C C+ C+ +   
Sbjct: 259 TEGVKSTFLNIRACREISRGEEITVAYDEIHLDTTERKQHMKEHFGFECACEYCESQDRE 318

Query: 288 SDND 291
            D D
Sbjct: 319 LDAD 322


>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
           impatiens]
          Length = 386

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
           G+Y+  S  NH C+PNA     V+    NN  ++++ I D+    EIC+SY     +   
Sbjct: 289 GLYIFQSSINHSCVPNA----VVEFPYSNNV-LVLKAIRDIHPEEEICISYLDECCLERS 343

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +RQK L   Y F C C++C  + N
Sbjct: 344 RHSRQKALSSLYLFHCYCNKCLSQVN 369


>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
 gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 380

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 57/260 (21%), Positives = 96/260 (36%), Gaps = 51/260 (19%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSV 73
           EG+G GL + + +K G+++    P     A  F+ ++  C +C ++    S S C  C  
Sbjct: 12  EGKGNGLRALREIKPGEVIYSCEPFAFCVARDFLKTA--CQSCLKRG--ESLSRCSQCKT 67

Query: 74  A-FCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDR-QLQARFLIAAYNLAVV 131
           A +C+ +C   A         C+ L  L          +P D  +L AR +    + +  
Sbjct: 68  ARYCNVQCQKQAWPDHKRE--CKCLKHL-------QPRIPTDSVRLVARIIFKLLSQSES 118

Query: 132 NPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA----------------- 174
           +  +   +   Q  + D        L  LC     TT+++  A                 
Sbjct: 119 DQEELYSIAEHQSHLADMSEEKTEGLKHLC-----TTLQVYLAEENCDLSRLPSGLDPVS 173

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           LLA+   N F +        DG       G+Y   S  NHDC PN C   +         
Sbjct: 174 LLARVTCNCFSI-------SDGELQDVGVGLYPSMSLLNHDCQPN-CIMMF------EGK 219

Query: 235 DIIVRMIHDVPQGREICLSY 254
            + +R +  +    E+ +SY
Sbjct: 220 RLTLRAVRVIRSAEELTISY 239


>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 378

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 90/239 (37%), Gaps = 52/239 (21%)

Query: 68  CPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAY 126
           C  C+ V +C   C           Y C  L R+     PDAA      +L +R +I   
Sbjct: 3   CSGCAFVRYCDRTCQKNGWIIH--KYECHNLKRIAPRKLPDAA------RLLSRIIIKLN 54

Query: 127 N--------LAVVNPSQFQILLA---FQGTVTDNDTSAAHYLSSLCPPPASTTIELTAAL 175
                        N  +F+ L++   F+  ++ N     +    L P         TA L
Sbjct: 55  KGGADEKSYYTKSNYRKFKDLMSRKLFKHFMSLNSVLHEYLQDQLIPN--------TAEL 106

Query: 176 LAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTD 235
           L    +N F ++       D        G+YL +S  +H C PNA        A    T 
Sbjct: 107 LM--VINTFNIL-------DSEMQSIGSGLYLASSIIDHSCSPNA-------VAVFKGTT 150

Query: 236 IIVRMIHDVP--QGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDN 292
           I +R + D+P     ++ +SY  +      RQ+ LL  Y F C C +C      +D+DN
Sbjct: 151 IFIRTLEDIPIMDWSKVFISYIELLNLPEIRQQELLSSYYFLCQCSKC------TDSDN 203


>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 480

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 51/124 (41%), Gaps = 8/124 (6%)

Query: 202 AYGIYLKAS-FFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           A+GI+  AS  FNH C PNA    Y+    +    ++V  + ++    EI + Y      
Sbjct: 210 AHGIFPVASRLFNHSCTPNAAA-KYIFGRDKPGVRMVVVALRNISHNEEITIPYLDPALP 268

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
           +  RQ  L  +Y F C C  C   +  + N N         D  ++E ++A  D    H 
Sbjct: 269 FLERQLSLEQNYGFRCTCHVCSFGSQLARNPN------YTGDAPREEDILAIRDTLLNHV 322

Query: 321 DTNF 324
             N 
Sbjct: 323 GLNL 326


>gi|254572041|ref|XP_002493130.1| Zinc-finger protein of unknown function [Komagataella pastoris
           GS115]
 gi|238032928|emb|CAY70951.1| Zinc-finger protein of unknown function [Komagataella pastoris
           GS115]
 gi|328352852|emb|CCA39250.1| Potential protein lysine methyltransferase SET5 [Komagataella
           pastoris CBS 7435]
          Length = 464

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 58/306 (18%), Positives = 117/306 (38%), Gaps = 34/306 (11%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLY---SALPFINSSSFCHNCFRKTMHSSSSI--- 67
           E  G+GL + + +  G+++ ++ P ++      L  +     C  C +  +  ++     
Sbjct: 122 EKSGKGLYAKRDISKGEVLWKEEPFIMVPPIELLRLVKHGKACTYCGKPMVSMTTDRLLK 181

Query: 68  ---CPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLD--------RQ 116
              C  C+  +CS  C         + +    + R   +NS   A L L+          
Sbjct: 182 GLDCNVCNELWCSIGCKRKDLLHPQAKHGVYGI-RTKGINSK--AWLALEDFCIENDWSS 238

Query: 117 LQARFLIAAYNLAVVNPS---QFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTA 173
           + A  +I A  L   + +   QF  +     TV D  +S + + +S+     +   +L  
Sbjct: 239 VYAAAIIYARELLDKSKNVYDQFHSMAKISQTVRDQASSLSEFEASIHEGKLAECHQLFN 298

Query: 174 ALLAKDRLNAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAE 231
               K  ++   LME Y+    G  ++      ++L  +  NH C  N      +D   +
Sbjct: 299 KCFPKKPISYETLME-YV----GTYNINNVDNCLFLVYAHLNHSCDRNVS----IDLQPK 349

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDND 291
               + V  + D+  G ++  +Y         RQ+ L  ++ F C+C RC +E+   +  
Sbjct: 350 RADGLQVTALRDIAAGEQLTTTYVSPTMSLQDRQRELRLNWGFICECSRCVLESKQREKR 409

Query: 292 NDDENN 297
               N+
Sbjct: 410 RSSRNS 415


>gi|154290093|ref|XP_001545647.1| hypothetical protein BC1G_15826 [Botryotinia fuckeliana B05.10]
          Length = 545

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 108/285 (37%), Gaps = 36/285 (12%)

Query: 19  GLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAFCSP 78
           G+V+T+ + A + VL D P L       ++S+  C  C    +    + C  C   +CS 
Sbjct: 272 GVVATRDILAQETVLVD-PTLA----AVVDSADRCPACCGPFLDKIENSC--CKTLYCSS 324

Query: 79  KCSTAAASSSHSPYVCQALTRLLNLNSPDAANL---PLDRQLQARFLIAAYNLAVVNPSQ 135
            CS  A  S H+    + L  LL   S   +N     +  +L  R L  +    V +P +
Sbjct: 325 SCSQNALDSYHTIVCGKDLDFLLGTESESLSNSRESSMGSKLFLRVLALSLKEDVASPLK 384

Query: 136 FQILLAFQGTVTDNDTSAA------HYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
             ++         N           H ++     P     EL   + A    + + L   
Sbjct: 385 TSLISRLTPAYNPNSPQLVVLNFKDHIIT-----PIRILRELGIDVFANSAYDTWVLHTI 439

Query: 190 YIEGQDGRRS-----VRAYGIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHD 243
           Y   Q+ +       +   G+    S FNH C PN   R D      + N+ + +    D
Sbjct: 440 YCRLQNNKHGQTFDDICGTGVNPLYSMFNHSCDPNIDWRHD------DENSTVTMFAERD 493

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC---KVEA 285
           +  G E+ +SY         R+++L+  +   C C +C   K+EA
Sbjct: 494 IKNGEEMFISYIGKGKGLEERRRKLMPWFGMDCACHKCDEEKLEA 538


>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
          Length = 441

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 41/89 (46%), Gaps = 15/89 (16%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQ--------GREICLSYF 255
           GIYL AS  +H C PN              T IIVR + D+P          ++I +SY 
Sbjct: 177 GIYLGASVIDHSCKPNV-------IVVFEGTTIIVRTLTDLPSLDWSQASIDKDIRISYV 229

Query: 256 PVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
            +      R++ L   Y F CDC+RCK E
Sbjct: 230 DLLNSNKDRREELHSSYYFWCDCERCKKE 258


>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
 gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
           118893]
          Length = 498

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 57/289 (19%), Positives = 101/289 (34%), Gaps = 52/289 (17%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSI--------- 67
           G GL +T ++K G+ V    P+   + L        C  CF      ++ +         
Sbjct: 20  GTGLFATSTIKMGKNVF-SIPVTFSTVLSTERLGDACSGCFSNLPFGANVMASVDMKLRA 78

Query: 68  CPSCSVA-FCSPKCSTAAASSSHS--------------PYVCQALTRLLNLNSPDAANLP 112
           C  C V  +CS  C     +  H               P   +A+ R++ L +   A++ 
Sbjct: 79  CSGCKVVKYCSKGCQAENWALIHKHECAIYKKLYPKILPVNSRAVLRIVKLRNSKDADVQ 138

Query: 113 LDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELT 172
            D  +          +A  N  Q++ ++       +   S                +E  
Sbjct: 139 SDMTMFLGLRSHLKEIASSNQEQYERIMLCAKAEKEYSGSG-------------LDLETI 185

Query: 173 AALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAEN 232
           A  LA+  +N+F     +  G      ++ +  Y+     NH C PNA           +
Sbjct: 186 AEYLARIEVNSFTFTTSF--GDPLGLCIQPFACYM-----NHSCDPNA-------VVGFD 231

Query: 233 NTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
              I V+ +  +    ++ +SY    Y +  RQK+L   Y F C C +C
Sbjct: 232 EGLITVKALRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKC 280


>gi|298710004|emb|CBJ31723.1| SET and MYND domain containing 3 [Ectocarpus siliculosus]
          Length = 636

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   + FNH C PN C +         + ++ V  + DVP G ++ +SY  +   +
Sbjct: 229 GVGLYPAGAMFNHSCRPN-CSWR-----TNRSGELCVVAVEDVPAGSQLFISYVDILQPW 282

Query: 262 STRQKRLLYDYDFACDCDRC 281
             RQ  L   + F C C RC
Sbjct: 283 PVRQDLLRCHFFFECACPRC 302


>gi|451998059|gb|EMD90524.1| hypothetical protein COCHEDRAFT_1103990 [Cochliobolus
           heterostrophus C5]
          Length = 338

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 6/86 (6%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           +   ++  A+ FNH C PNAC F +  +  +      + +++DV  G+EI LSY  + +D
Sbjct: 133 KGAAVFPHAARFNHSCNPNAC-FSWNPSINKET----IHVMNDVAAGKEITLSYCDMMHD 187

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R   L + Y F CDC  C  + N
Sbjct: 188 KPSRAYELKH-YGFVCDCPACAGDEN 212


>gi|440801440|gb|ELR22460.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 545

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 17/111 (15%)

Query: 185 GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
           GL  PY          + Y  +   S  NH C PN C+  Y++     +   +V  + D+
Sbjct: 399 GLFPPY----------KGYAFFPLLSTLNHSCEPN-CQVAYLE-----DGQALVFALRDI 442

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDE 295
             G E+ +SY   +   + RQ++L   Y F C C RC  +A   +  +  E
Sbjct: 443 AAGEELSISYIYRHLPLAERQQQL-RSYGFVCACPRCAADAQAQEAPDKAE 492


>gi|395331093|gb|EJF63475.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 445

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 5/85 (5%)

Query: 202 AYGIY-LKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           A+G++ L +   NH C+PNA C+  Y+ A +E    + V  + D+ +G E+ + Y     
Sbjct: 191 AHGVFPLASRLLNHSCVPNAACK--YMLAPSEP-VRMEVVALRDINEGDEVTIPYLDPAL 247

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVE 284
            + TRQ  L  +Y F C C  C  E
Sbjct: 248 PFQTRQDALHVNYGFECGCRLCTFE 272


>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 668

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 52/95 (54%), Gaps = 11/95 (11%)

Query: 1   MSAVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCF--R 58
           + + S+ VT+   E RGR L++T+++KAG +++ ++P    +     N  +   NC    
Sbjct: 214 LKSCSDAVTLQFDEKRGRHLIATKNIKAGSVLIVETPFAFST-----NKEALGRNCLHCH 268

Query: 59  KTMHSSSSI---CPSC-SVAFCSPKCSTAAASSSH 89
            T+ SS+S+   C  C +V+FCS KC + A    H
Sbjct: 269 ITLMSSNSVKIPCYYCQTVSFCSEKCRSKAWQIYH 303



 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 46/110 (41%), Gaps = 12/110 (10%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYG--IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVR 239
           NA+ ++E   + +      R  G  IY   S  NH C PN  R  Y          ++VR
Sbjct: 452 NAYEIVENIYDKKTHIWEPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGI------VVVR 505

Query: 240 MIHDVPQGREICLSYFPVNYDYS--TRQKRLLYDYDFACDCDRCKVEANW 287
            +  V +G EI   Y P  +  +  +R + L   Y F C CD C    NW
Sbjct: 506 TLRFVGKGTEILDCYGPHWFSENKLSRIEYLWKKYRFLCTCDACI--QNW 553


>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
 gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
          Length = 824

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
            +G Y    +FNH C+PN     YVD    +N  +  R    + QG E+ L+Y+ +    
Sbjct: 533 GHGFYPLGCYFNHSCVPNVTW--YVD----DNGMLAFRATRTISQGEELFLTYYDMFSTL 586

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWS 288
             R+  LL  Y F C C +C+ ++  S
Sbjct: 587 DDRRPHLLDQYRFFCQCPQCEFQSKLS 613



 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 59/122 (48%), Gaps = 9/122 (7%)

Query: 8   VTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSA-LPFINSSSFCHNCFRKTMHS--- 63
           V++   EG+GR +++  +++ G +VLR SP   Y+A +  I+   +C +CF K ++    
Sbjct: 242 VSIEWQEGKGRCVIANNNIEKGSLVLRVSP---YTASIEDIHLHKYCGSCFNKIINKEDR 298

Query: 64  SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLI 123
               C  C       +CST   +  +    C  + +  N   PD +N   D +L +R L+
Sbjct: 299 QKYKCKECKKFLFCEECSTDPVAIQYHHEEC-PIYQYFNRYHPD-SNETRDIRLMSRVLL 356

Query: 124 AA 125
            A
Sbjct: 357 NA 358


>gi|189209509|ref|XP_001941087.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187977180|gb|EDU43806.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 659

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 20/121 (16%)

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N FG    +  G++  R+  + G+++ A++ NH CL N  +    D        +I+R  
Sbjct: 493 NGFGPGSQF--GEESARNA-STGLWIHAAYINHSCLANTAKEFAGDL-------LIIRAT 542

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVM 301
            D+  G EI   Y   + DY TRQ  L   + F C C  C+ E         D++ +EV 
Sbjct: 543 RDIKAGEEIFHPY-DASLDYETRQGFLDRTWGFRCFCRLCETE---------DKDGKEVR 592

Query: 302 D 302
           D
Sbjct: 593 D 593


>gi|221484463|gb|EEE22759.1| hypothetical protein TGGT1_035070 [Toxoplasma gondii GT1]
          Length = 859

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
            Y IY   SF +H CLPNA  F       + +   ++R   D+  G E+ L+Y     + 
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRTRADIAAGDEVTLTYLSEDDLM 218

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                R+K L    DF C C+RC    ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 68/283 (24%), Positives = 111/283 (39%), Gaps = 51/283 (18%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINS-SSFCHNCFRKTMHSSSSICPSC-SV 73
           +GR L +T+S   G ++   +P+LL   LP +   +S C +C R        +C  C + 
Sbjct: 15  KGRALHATKSFPPGAVIFSFTPLLL---LPTVACLTSVCSHCLRP---GEPRLCSRCHAA 68

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
           A+C   C  AA  + HS   C+ALT+ +  +      LP       R LI A     +  
Sbjct: 69  AYCDATCQAAAWKAVHS-RECKALTKTIR-DEERRRMLPTP----TRALIQALLWGKIRD 122

Query: 134 SQFQILLAFQGTVTDNDTSAAHY------------LSSLCPPPASTTIELTAALLAKDRL 181
                L   +G V +       +             S L        + + A +L K + 
Sbjct: 123 G----LADLEGHVLEKRAEGDEWRDIEMMAVAGCAFSGL--GRGDQDVRMAAEMLCKIQN 176

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASF--FNHDCLPNACRFDYVDAAAENNTDIIVR 239
           N+F         Q     +   G+YL+ +    NH C+PNA    ++      N  +I  
Sbjct: 177 NSF---------QRFDSELGVAGLYLEPTLAMANHSCIPNAS-VQFIG----RNALLIAE 222

Query: 240 MIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
             + +  G EI ++Y     D   +++  L  Y F C C RC+
Sbjct: 223 --NPIRAGDEIEIAY-TFYTDPLPKRREALAHYKFTCQCLRCR 262


>gi|237837951|ref|XP_002368273.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
 gi|211965937|gb|EEB01133.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
          Length = 859

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
            Y IY   SF +H CLPNA  F       + +   ++R   D+  G E+ L+Y     + 
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRTRADIAAGDEVTLTYLSEDDLM 218

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                R+K L    DF C C+RC    ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248


>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
          Length = 432

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 65/281 (23%), Positives = 99/281 (35%), Gaps = 43/281 (15%)

Query: 16  RGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVAF 75
           +GRGL  +++   G+++   S     S L      S C  CFR+      + C  C  AF
Sbjct: 16  KGRGLRVSRAFSVGELLF--SCPAYSSVLSVTERGSCCDLCFRRK--EGLARCGKCKKAF 71

Query: 76  -CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPS 134
            C+  C  A  +       C A+        P   +     +L AR L A   +      
Sbjct: 72  YCNTSCQKADWAMHRLE--CSAMVAFGENWCPSETS-----RLVARIL-AKKKMQKGRCD 123

Query: 135 QFQILL--AFQGTVTDNDTSA-----------AHYLSSLCPPPASTTIELTAALLAKDRL 181
             +ILL    Q  + D D                + S     P+   +     L ++   
Sbjct: 124 SEKILLIGEMQSHLEDEDNEKKERTELDIAGLGRFFSKHLEVPSHKDL---LTLFSQVAC 180

Query: 182 NAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMI 241
           N F +       +D   S     +Y   +  NH CLP+            N T   VR +
Sbjct: 181 NGFTI-------EDDELSHMGTAVYPDVALINHSCLPSV-------IVTFNGTLAQVRAV 226

Query: 242 HDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
            D+  G E+ +SY  + Y    R  RL   Y F C+C  C+
Sbjct: 227 QDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTCNCLECQ 267


>gi|255080018|ref|XP_002503589.1| predicted protein [Micromonas sp. RCC299]
 gi|226518856|gb|ACO64847.1| predicted protein [Micromonas sp. RCC299]
          Length = 898

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 57/130 (43%), Gaps = 27/130 (20%)

Query: 212 FNHDCLPNACRFDY----VDAAAENNTD-------------------IIVRMIHDVPQGR 248
            NH CLPN C+ +     VD A E + +                   + +  + D+  G 
Sbjct: 517 VNHSCLPN-CQVEITRGTVDDAEEGDDEKTGEKTKKPERGADCGSLRVSLLALRDISPGE 575

Query: 249 EICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQ 308
           E+ + Y PV  D ++R+  L   + F C C RC++E+     +    + + + D+ Q+E 
Sbjct: 576 ELTVGYVPVTQDIASRRADLAARHGFVCSCARCELESGADGANPSVADMKRLADQAQEE- 634

Query: 309 MVASDDDAEV 318
             A  +DAE 
Sbjct: 635 --ARYEDAEA 642


>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
           magnipapillata]
          Length = 1044

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 127/295 (43%), Gaps = 50/295 (16%)

Query: 3   AVSETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFIN-SSSFCHNCFRKTM 61
           + SE + +     +GR  ++++ +KAG++++ + P   +++L      ++ C++C   T 
Sbjct: 225 STSEALKIVSTNSKGRHAIASRDIKAGEVIIIEKP---FASLCLPECYNTHCYHCL--TR 279

Query: 62  HSSSSICPSCS-VAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQAR 120
              +  C  CS V +CS  C   +    H  + C+ L  L+N    D   L         
Sbjct: 280 FKINYPCRLCSTVNYCSISCEKESWEKFHC-FECEYLGVLIN----DDVGL--------- 325

Query: 121 FLIAAYNLAVVNPSQFQILLAFQGTVTDNDT---SAAHYLSSLCPPPASTTIELTAALLA 177
              A     ++      +LL+F+   + +D    S+  Y +S+     +  I + A LL 
Sbjct: 326 ---AHLAFKIITNVGISMLLSFKENNSFDDLKPYSSTDY-NSIFSLIENLKI-VCAHLLK 380

Query: 178 KDRL---NAFGLMEPYIEGQDGRRS-VRAYG--IYLKASFFNHDCLPNACRFDYVDAAAE 231
             ++   NA  + E  ++  + + S ++  G  +Y   S  NH C P+  R  Y D    
Sbjct: 381 HIQMLPCNAHEVSELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSVVRHCYGDTC-- 438

Query: 232 NNTDIIVRMIHDVPQGREICLSYFPVNYDYST-----RQKRLLYDYDFACDCDRC 281
                ++R I  + +G EI  +Y    + Y+      RQ  L+  Y FAC C+ C
Sbjct: 439 -----VLRAIKHIKEGSEIVDNY---GFLYAVESKVIRQSHLMEQYYFACQCEAC 485


>gi|389623721|ref|XP_003709514.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
 gi|351649043|gb|EHA56902.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
          Length = 440

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 7/79 (8%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS 262
           G++   S  NH C PN+  +FD    + +          HD+  G EI +SY      Y 
Sbjct: 252 GLFPTISRINHACGPNSQVKFDPATLSQKAFAS------HDIEAGEEITISYAEFGMTYQ 305

Query: 263 TRQKRLLYDYDFACDCDRC 281
           +RQ  LL+ + F C C  C
Sbjct: 306 SRQSTLLHRWGFKCTCALC 324


>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
          Length = 436

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 69/284 (24%), Positives = 102/284 (35%), Gaps = 43/284 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + +    G+++   S     + L      S C  CF +      S C  C  A
Sbjct: 19  GKGRGLRALRRYAVGELLF--SCPAYTAVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLP---LDRQ------LQARFLI 123
           F C+ +C             C A+     N N  +   L    L +Q       Q+  L+
Sbjct: 75  FYCNVECQKE--DWPMHKLECAAMCAFGQNWNPSETVRLTARILAKQKIHPERTQSEKLL 132

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALLAKDR 180
           A         S    L   +  +  ND +A H+  S     P  A+  +     L A+  
Sbjct: 133 AVKEFE----SHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVV-----LFAQVN 183

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F +       +D   S     I+   +  NH C PN     Y    AE      VR 
Sbjct: 184 CNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRA 229

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           + ++  G E+  SY  + Y    R  RL   Y F CDC  C ++
Sbjct: 230 VKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTMK 273


>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
 gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
          Length = 544

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 43/112 (38%), Gaps = 18/112 (16%)

Query: 177 AKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLK--ASFFNHDCLPNACRFDYVDAAAENNT 234
           AK  LN+F    P  +           G YL   A+  NH C       DY      +  
Sbjct: 234 AKLELNSFNFHSPLAD---------RLGFYLHPYAALINHSC-------DYNSVVGSDGD 277

Query: 235 DIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEAN 286
            + V+ +  + +G EI +SY      Y  RQ  LL  Y F C C +C    N
Sbjct: 278 ALYVKALRPIRKGEEILVSYIDATNPYKIRQTELLERYYFKCRCPKCGKGTN 329


>gi|146087252|ref|XP_001465771.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134069871|emb|CAM68198.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 857

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 53/209 (25%), Positives = 80/209 (38%), Gaps = 44/209 (21%)

Query: 175 LLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNT 234
           L A  + N+FG+        D +     +G+Y +AS+FNH C+PN CR   +        
Sbjct: 657 LCAAAQCNSFGVY-------DTQDHCIGFGMYPEASYFNHSCVPNLCRV--MHHGGRIAA 707

Query: 235 DIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDND 293
              +R I   PQ   + + Y  V   + + R++ LL  Y F C C+RC  +A        
Sbjct: 708 FYALRAI--APQ-EPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKAEGPQIAIA 764

Query: 294 DENNEEVMDEDQDEQMVASDDDAEVHGDTNFPHAYFFMRYMCDRDNCWGTLAPLPP---- 349
              N E +   +   +  S            P A+     +C   +  G L PLPP    
Sbjct: 765 AAGNRESVSGPRSGCVEVS------------PAAFENPLLLCAECSIHGYLRPLPPPFFS 812

Query: 350 ---------SDATPSTVM------ECNVC 363
                      +TPS  M      +C VC
Sbjct: 813 SSPVKSDSAPKSTPSWTMQEVEMRQCTVC 841


>gi|301607796|ref|XP_002933488.1| PREDICTED: SET and MYND domain-containing protein 5-like [Xenopus
           (Silurana) tropicalis]
          Length = 141

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 9/77 (11%)

Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV---NYDYSTRQKRLL 269
           NH C+PNA      +A+  +N  + +  + D+  G EIC+SY      +  + +RQK L 
Sbjct: 38  NHSCVPNA------EASFPDNFILHLTSLEDIQPGEEICISYLDCCQRDRSWHSRQKILR 91

Query: 270 YDYDFACDCDRCKVEAN 286
            +Y F C C +C V+A+
Sbjct: 92  ENYLFMCSCPKCLVQAD 108


>gi|157134709|ref|XP_001656403.1| hypothetical protein AaeL_AAEL000436 [Aedes aegypti]
 gi|108884280|gb|EAT48505.1| AAEL000436-PA [Aedes aegypti]
          Length = 392

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 10/87 (11%)

Query: 204 GIYLKASFFNHDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNY 259
            +Y   S  NH C PN  CRF +      +N  + +    D+  G EIC+SY     +  
Sbjct: 290 ALYSYQSKVNHSCSPNVECRFPH------SNNVLALTATRDIKVGEEICISYLDECALER 343

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEAN 286
              +RQK L  +Y F C C++C  + N
Sbjct: 344 SRHSRQKMLSENYLFQCQCEKCDSQVN 370


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 48/120 (40%), Gaps = 10/120 (8%)

Query: 162 PPPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNAC 221
           P P   +++L   LL K +L   G         D        G+Y   S  NH C PNA 
Sbjct: 121 PTP---SLQLMLRLLIKRKLQNAGFSCNAHTICDSELRPMGTGLYPVISIINHSCFPNA- 176

Query: 222 RFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
                          +VR +  + +G E+ +SY  +    ++R+K L   Y F C C RC
Sbjct: 177 ------VLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKKSLKEQYFFDCKCLRC 230


>gi|259483433|tpe|CBF78818.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 341

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 51/123 (41%), Gaps = 22/123 (17%)

Query: 174 ALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           AL+ K+  N+FGL        P       R    A  +Y +AS  NH C PN        
Sbjct: 209 ALICKEEANSFGLYPRETGVFPPPNPPVSRGEQFAAAVYPRASIANHSCCPNII------ 262

Query: 228 AAAENNTDIIVRMI----HDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRC 281
               +  D + RM+     D+  G E C+SYF +    S   R++ L   + F C C RC
Sbjct: 263 ----HKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRHLQGLFRFKCGCPRC 318

Query: 282 KVE 284
             E
Sbjct: 319 LEE 321


>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
          Length = 436

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 69/284 (24%), Positives = 102/284 (35%), Gaps = 43/284 (15%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + +    G+++   S     + L      S C  CF +      S C  C  A
Sbjct: 19  GKGRGLRALRRYAVGELLF--SCPAYTAVLTVSERGSHCDGCFARK--EGLSKCGRCKQA 74

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRL-LNLNSPDAANLP---LDRQ------LQARFLI 123
           F C+ +C             C A+     N N  +   L    L +Q       Q+  L+
Sbjct: 75  FYCNVECQKE--DWPMHKLECAAMCAFGQNWNPSETVRLTARILAKQKIHPERTQSEKLL 132

Query: 124 AAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSL---CPPPASTTIELTAALLAKDR 180
           A         S    L   +  +  ND +A H+  S     P  A+  +     L A+  
Sbjct: 133 AVKEFE----SHLDKLDNEKRELIQNDIAALHHFYSKHMEYPDNAALVV-----LFAQVN 183

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F +       +D   S     I+   +  NH C PN     Y    AE      VR 
Sbjct: 184 CNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRA 229

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVE 284
           + ++  G E+  SY  + Y    R  RL   Y F CDC  C ++
Sbjct: 230 VKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCRECTMK 273


>gi|396485474|ref|XP_003842180.1| hypothetical protein LEMA_P079400.1 [Leptosphaeria maculans JN3]
 gi|312218756|emb|CBX98701.1| hypothetical protein LEMA_P079400.1 [Leptosphaeria maculans JN3]
          Length = 332

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 201 RAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           R   ++  A+ FNH C PNAC F + +A     T   + +++DV  G EI LSY  + +D
Sbjct: 131 RGAAVFPHAARFNHSCNPNAC-FTW-NAGIGKET---IHIMNDVKAGEEITLSYCDMLHD 185

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDN 290
              R   L + Y F CDC  C  + +  D 
Sbjct: 186 KKLRSWELKH-YGFRCDCRACAEDEDVEDT 214


>gi|398015614|ref|XP_003860996.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322499220|emb|CBZ34291.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 857

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 47/182 (25%), Positives = 70/182 (38%), Gaps = 37/182 (20%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYD 260
            +G+Y +AS+FNH C+PN CR   +           +R I   PQ   + + Y  V   +
Sbjct: 677 GFGMYPEASYFNHSCVPNLCRV--MHHGGRIAAFYALRAI--APQ-EPLTICYTDVEQLN 731

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDEDQDEQMVASDDDAEVHG 320
            + R++ LL  Y F C C+RC  +A           N E +   +   +  S        
Sbjct: 732 SAERRRNLLGTYRFFCMCERCSGKAEGPQIAIAAAGNRESVSGPRSGCVEVS-------- 783

Query: 321 DTNFPHAYFFMRYMCDRDNCWGTLAPLPP-------------SDATPSTVM------ECN 361
               P A+     +C   +  G L PLPP               +TPS  M      +C 
Sbjct: 784 ----PAAFENPLLLCAECSIHGYLRPLPPPFFSSSPVKSDSAPKSTPSWTMQEVEMRQCT 839

Query: 362 VC 363
           VC
Sbjct: 840 VC 841


>gi|294655829|ref|XP_458021.2| DEHA2C07854p [Debaryomyces hansenii CBS767]
 gi|199430638|emb|CAG86084.2| DEHA2C07854p [Debaryomyces hansenii CBS767]
          Length = 350

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 10/118 (8%)

Query: 168 TIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           T EL   +L  +  NAFG+    I   +  +    + IY  ASFFNH C PN  +     
Sbjct: 234 TSELVRTILGTNLTNAFGIWSQPINDAE-EKDYFGFAIYPSASFFNHSCDPNLVK----- 287

Query: 228 AAAENNTDIIVRMIHDVPQGREICLSYFPV-NYDYSTRQKRLLYDYDFACDCDRCKVE 284
              E +T +  R + D+    E+ +SY    N +   RQ++ L ++ F C C +C  E
Sbjct: 288 -TREGDT-LYFRALKDIDPNTELFISYGNYSNENVQIRQEQ-LKEWFFNCLCTKCITE 342


>gi|157869726|ref|XP_001683414.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126479|emb|CAJ04438.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 856

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 41/86 (47%), Gaps = 6/86 (6%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPV-NYD 260
            +GIY +AS+FNH C+PN CR  +       +       +  +     + + Y  V   +
Sbjct: 676 GFGIYPEASYFNHSCVPNLCRVMH-----HGSRIAAFYALRAIAAQEPLTICYTDVEQLN 730

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
            + R++ LL  Y F C C+RC  +A 
Sbjct: 731 SAERRRNLLSTYRFFCMCERCSGKAE 756


>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
 gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
          Length = 391

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VNYD 260
            +Y + S  NH C PNA          ++N  + +R   D+  G EIC+SY     +   
Sbjct: 290 ALYARQSKINHSCAPNA-----ETVFPKSNHMLALRATRDIQPGEEICISYLDECNLQRS 344

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +RQK L   Y F C C++C+ + N
Sbjct: 345 RHSRQKTLKDYYLFICQCEKCESQIN 370


>gi|67900474|ref|XP_680493.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
 gi|40742081|gb|EAA61271.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
          Length = 678

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 51/123 (41%), Gaps = 22/123 (17%)

Query: 174 ALLAKDRLNAFGLME------PYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVD 227
           AL+ K+  N+FGL        P       R    A  +Y +AS  NH C PN        
Sbjct: 546 ALICKEEANSFGLYPRETGVFPPPNPPVSRGEQFAAAVYPRASIANHSCCPNII------ 599

Query: 228 AAAENNTDIIVRMI----HDVPQGREICLSYFPVNYDYST--RQKRLLYDYDFACDCDRC 281
               +  D + RM+     D+  G E C+SYF +    S   R++ L   + F C C RC
Sbjct: 600 ----HKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRHLQGLFRFKCGCPRC 655

Query: 282 KVE 284
             E
Sbjct: 656 LEE 658


>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
           norvegicus]
          Length = 239

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 7/95 (7%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDY 261
             G+Y   S  NH C PN C   +      N   +++R + ++  G E+ + Y  +    
Sbjct: 5   GVGLYPSMSLLNHSCDPN-CSIVF------NGPHLLLRAVREIEAGEELTICYLDMLMTS 57

Query: 262 STRQKRLLYDYDFACDCDRCKVEANWSDNDNDDEN 296
             R+K+L   Y F CDC RC  +   +D    DE 
Sbjct: 58  EERRKQLRDQYCFECDCIRCPTQDKDADMLTGDEQ 92


>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
 gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
          Length = 423

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 67/285 (23%), Positives = 101/285 (35%), Gaps = 54/285 (18%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL  +++   G+++    P   Y  L        C  CF  T     + C  C  A
Sbjct: 17  GKGRGLRVSRAYGVGELLF-SCPAYSY-VLSVGERGLICEQCF--TRKKGLAKCGKCKKA 72

Query: 75  F-CSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
           F C+  C     +       CQA+        P       +R                +P
Sbjct: 73  FYCNANCQKK--NWPMHKLECQAMCAFGENWRPSETKAQKER----------------SP 114

Query: 134 SQFQILLA-FQGTVTDND-----TSAAH-------YLSSLCPPPASTTIELTAALLAKDR 180
           S+  +LL   +  + D D      + AH       Y   L  P     +     L ++  
Sbjct: 115 SEILLLLGEMEAHLEDMDNEKREMTEAHIAGLHQFYSKHLDFPDHQALL----TLFSQVH 170

Query: 181 LNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRM 240
            N F +       +D   S     I+   +  NH C PN     Y    AE      VR 
Sbjct: 171 CNGFTV-------EDEELSNLGLAIFPDIALLNHSCSPNVI-VTYRGINAE------VRA 216

Query: 241 IHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEA 285
           + D+  G+EI  SY  + Y  + R +RL   Y F+CDC  C  ++
Sbjct: 217 VKDISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKECTTKS 261


>gi|393215651|gb|EJD01142.1| MAS20-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 555

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 72/343 (20%), Positives = 131/343 (38%), Gaps = 86/343 (25%)

Query: 14  EGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPS--- 70
           + R   LV+++S KAG ++ +++PI+       + S   C +C R+ + S ++I P    
Sbjct: 190 QARSHLLVASKSFKAGDVIYKETPIVASLDPDLVASGRHCGHCLRE-VESDAAIKPESDR 248

Query: 71  CSVAFCSPKCSTAAASSSH-------SPYVCQALTRLLNLNSPDAANLPLDRQLQARF-- 121
               +CS KC T + + S        SP   +      +L     A  P  +   A F  
Sbjct: 249 LGSIYCSEKCQTQSGTQSQDMLFSDKSPLPAEVAA---DLPQQSVAERPKTQDALAEFCR 305

Query: 122 -------LIAAYNLAVVNPSQF---------------QILLAFQGTVTDNDTSAAHYLSS 159
                  L+ A  +A+   ++                +  ++ Q TV D+      Y+  
Sbjct: 306 QKGTSGLLLVARLVAIQIVAELAKALPQAAHMKEQLPEFSISEQYTVYDH-IERLRYIDV 364

Query: 160 LCPPPASTTI-ELTAALL----------------AKDRLNAFGLMEPYIEGQDGR----- 197
             P     T+ EL  A L                 K   NA G+   + EG+D +     
Sbjct: 365 TLPDDEYKTLRELLLATLPFLEDVHSDERHAMFRGKVAYNAIGIY--FGEGRDDKPTTSE 422

Query: 198 --RSVRAYGI--------YLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
             R+  A G+        YL +S+  H C P+          +E  +++ +    D+ +G
Sbjct: 423 LERTRIAAGVSKQIGSGLYLVSSYIGHSCAPSV-----RPVFSEGTSELHLLAERDIEEG 477

Query: 248 REICLSYFPVNYD--------YSTRQKRLLYDYDFACDCDRCK 282
            E+ ++Y  V+          ++ R+  L   + FAC C +C+
Sbjct: 478 EELTMAYVDVSQRSEETPVEAFTRRRSELTQGWKFACACVKCE 520


>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
          Length = 415

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYD 260
           G+++  S  NH C+PNA       +  ENN  + V  + ++  G EIC+SY         
Sbjct: 306 GLFVLQSCCNHSCVPNA-----ETSFPENNFLLHVTALEEIKPGEEICISYLDCCQRERS 360

Query: 261 YSTRQKRLLYDYDFACDCDRCKVEAN 286
             +R K L  +Y F C C +C  EA+
Sbjct: 361 RHSRHKILRENYLFVCSCPKCLAEAD 386



 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 27/102 (26%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPIL----LYSALPFINSSSFCHNCFRKTMHSSSSI--- 67
            +G+GL +TQ ++ G+ +  + P++    L++AL    +   C     K   ++  +   
Sbjct: 31  AKGKGLFATQLIRKGETIFVERPLVAAQFLWNALYRYRACDHCLRALEKAEENAQRLTGK 90

Query: 68  --------------------CPSCSVAFCSPKCSTAAASSSH 89
                               CP C V +CS +C  AAA   H
Sbjct: 91  PGQVLPHPELCTVRKDLHQNCPHCQVVYCSAECRLAAAEQYH 132


>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 514

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 68/296 (22%), Positives = 114/296 (38%), Gaps = 42/296 (14%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS- 63
           +E   + + +  GR +V+ + L+ G+ ++ + P ++    P   +   C +C+     S 
Sbjct: 17  TEKYKILKNDKVGRFMVANKDLQPGEEIVTEMPFVVG---PKAFTYPLCLSCYAPWPPSL 73

Query: 64  -SSSICPSCSVAFCSPKCSTAAASSSHSPYVCQALTRL---LNLNSP-DAANLPLDRQLQ 118
               +C  C    CS +C        H  Y CQ   +     N+ +  + AN     QL+
Sbjct: 74  TEKPLCSRCGWPVCSTECEN---QPQHKDYECQIFVQANEKFNVQAALEEANENGVPQLE 130

Query: 119 ----ARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIELTAA 174
                R L+ +      NP ++   +          +  AH+ S          +E    
Sbjct: 131 CITPLRLLLESEK----NPERWNNEVKNMEAHNKIRSQKAHWKSD-----QVNVVEYLRN 181

Query: 175 LLAKDRLN------AFGLMEPYIEGQDGRRSVRAYG---IYLKASFFNHDCLPNACRFDY 225
            L  DR +      A G++E  I   + R S R YG   +Y   +  NH C+ N C    
Sbjct: 182 QLKLDRFSEELIQTACGILE--INTFEVRTS-RGYGARALYPTVALMNHSCVSNTCH--- 235

Query: 226 VDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
             + +  +  I +R    VP   E+  SY    Y    R++ LL    FAC C RC
Sbjct: 236 --SISPTDYRIRLRTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARC 289


>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 478

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 59/283 (20%), Positives = 105/283 (37%), Gaps = 31/283 (10%)

Query: 7   TVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILLY-SALPFINSSSF--CHNCFRKTMHS 63
           T++       GRG+V+T+    G ++  D P+++   A+    + S+  C  C+   M  
Sbjct: 28  TISTGSATVAGRGVVATRDYSVGDVIFVDVPLIVSPRAMGMDGTGSWPVCAVCYAVMMTP 87

Query: 64  SSSICPS-CSVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFL 122
           +   CP  C +  C  +C+          YV Q   +  +     +  +          +
Sbjct: 88  TG--CPGGCRLPVCGRQCADRPEHRDECRYVRQLRPKTKSDGQTWSVGI-------YNAI 138

Query: 123 IAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSSLCPPPASTTIEL----TAALLAK 178
            A   L++ +      L   Q   TD  T     L        + ++EL       ++  
Sbjct: 139 TAVRGLSIRDGKYRYFLDVLQKKSTDKLTFEVEELKR------NISMELDKEDEQIMVTT 192

Query: 179 DRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIV 238
            ++      E  +   +   S++  G+Y  A+F NH C+PN                +I 
Sbjct: 193 CKIMDANCFETIVNRSNKLTSLK--GLYPVAAFMNHCCVPNTMH------NFNEKLQMIA 244

Query: 239 RMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           +    + +G+EI  SY    +  S RQ  LL    F C C RC
Sbjct: 245 KASLPIYKGQEITTSYTYSIWPTSLRQNHLLTSKQFICTCSRC 287


>gi|221505568|gb|EEE31213.1| hypothetical protein TGVEG_082100 [Toxoplasma gondii VEG]
          Length = 858

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 41/90 (45%), Gaps = 9/90 (10%)

Query: 202 AYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFP---VN 258
            Y IY   SF +H CLPNA  F       + +   ++R   D+  G E+ L+Y     + 
Sbjct: 165 GYVIYFMPSFSSHSCLPNALWF------TDEDHTFVLRSRADIAAGDEVTLTYLSEDDLM 218

Query: 259 YDYSTRQKRLLYDYDFACDCDRCKVEANWS 288
                R+K L    DF C C+RC    ++S
Sbjct: 219 RPTLQRRKVLSETKDFICTCERCSAPVDFS 248


>gi|451993775|gb|EMD86247.1| hypothetical protein COCHEDRAFT_1198196 [Cochliobolus
           heterostrophus C5]
          Length = 739

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 36/79 (45%), Gaps = 7/79 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI++KAS+ NH CL N  R    D        +IVR   D+  G EI   Y     DY+ 
Sbjct: 502 GIWIKASYINHSCLGNVRRSFIGDM-------MIVRACKDLEAGTEIVFPYTAPRGDYTL 554

Query: 264 RQKRLLYDYDFACDCDRCK 282
             K+    + F C C  C+
Sbjct: 555 GIKQKFKSWGFICSCALCE 573


>gi|294887209|ref|XP_002772010.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
 gi|239875918|gb|EER03826.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
          Length = 74

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 6/73 (8%)

Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
           NH C PNA      +A+   +  + V+ +  V  G E+  SY   N     RQ +L   Y
Sbjct: 2   NHSCSPNA------EASFRGSRCLRVKSLKPVHSGEEVFQSYIDENLPLVERQSKLRQAY 55

Query: 273 DFACDCDRCKVEA 285
            FAC C RC+ EA
Sbjct: 56  GFACRCGRCRTEA 68


>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
 gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
 gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
          Length = 448

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 42/278 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHS--SSSICPS-CSV 73
           GRG+ +T+ + AG+ + R+  +L+        +   C  C+R    S  S+++CP+ C +
Sbjct: 53  GRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSAALCPAGCGL 112

Query: 74  AFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNP 133
               P CS    S+ H    C+   +   L S          + +A  +++      ++ 
Sbjct: 113 ----PVCSECRDSTRHD-LECKLFRKWKPLESQRI-------EPRALRILSVVRCFFLDE 160

Query: 134 SQFQILLAFQGTV----TDNDTSAAHYLSSLCPPPASTTIELTAALLAKDRLNAFGLMEP 189
           +  ++L A Q  +          AA        P     ++     +     NAF     
Sbjct: 161 ASRKLLYAMQANMDRYYMQEVQRAADCFEHF--PREQDMLDYFYRTICAFNTNAF----- 213

Query: 190 YIEGQDGRRSVRAYGIYLKASF-----FNHDCLPNACRFDYVDAAAENNTDIIVRMIHDV 244
                + R +V  + + ++A F      NH C PNA          EN   I+V     +
Sbjct: 214 -----ESRSNVDGHEVLVRALFPLAGLLNHQCTPNAAHH------FENGETIVVCATERI 262

Query: 245 PQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCK 282
           P G EI +SY  + +    R+  L     F C C RC+
Sbjct: 263 PAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQ 300


>gi|449686608|ref|XP_002163562.2| PREDICTED: SET and MYND domain-containing protein 5-like [Hydra
           magnipapillata]
          Length = 395

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 86/395 (21%), Positives = 143/395 (36%), Gaps = 107/395 (27%)

Query: 5   SETVTVAEIEGRGRGLVSTQSLKAGQIVLRDSPILL----------YSALPF-INSSSFC 53
           S+ V V E   +G+GL +  ++K G  +L + P++L          Y A  + + S    
Sbjct: 9   SDNVYVKETLSKGKGLFAASAIKKGDTILTEKPLVLCQFSWNRQYNYVACDYCMRSLETA 68

Query: 54  HNCFRKT---------MHSSSSI----------CPSCSVAFCSPKCSTAAASSSH----- 89
            N  R+           H   S+          CP+C + FCS +C + A    H     
Sbjct: 69  QNMARRLAADYTLELPYHEQCSLSVQRVNSIYKCPNCCIPFCSKECYSEAYEKYHKSLCL 128

Query: 90  -------SP-YVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVVNPSQFQ---- 137
                  SP Y  +   + L+   P+ A++ L  ++ A  +IA      +  S+FQ    
Sbjct: 129 HPDEMSESPVYRIEEAWKQLHY-PPETASVMLIVRILA--MIAQSQCPEMVMSEFQQFFS 185

Query: 138 --------ILLAFQGTVTDNDTSAAHYLSSLCPPPAST----TIELTAALLAKDRLNAFG 185
                   I     G          H L     P        TI    +L A   +N  G
Sbjct: 186 KTKNEEHKITHKLLGNKFKEQIILMHSLLQEIVPTGELSKLLTISGVQSLFALIGMNGQG 245

Query: 186 L----MEPYIEGQDG-------RRSVRAY---------------------GIYLKASFFN 213
           +    +  Y+   D        R  + A+                     G++   S  N
Sbjct: 246 IGTSSLSEYVHNIDAKVMSDNEREQIDAFIDQLYLHMEKESGSFLNCEGSGLFKMQSRCN 305

Query: 214 HDCLPNA-CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYF---PVNYDYSTRQKRLL 269
           H C PNA   F Y      NN+ +++    D+ +  EIC+ Y      +    +R+K L 
Sbjct: 306 HSCYPNAEATFPY------NNSTLVLVATEDITKDEEICVCYLDECQRSRSRHSRRKLLR 359

Query: 270 YDYDFACDCDRCKVEANWSDNDNDDENNEEVMDED 304
            +Y F C C  C+ E    D++++  +++  M ED
Sbjct: 360 ENYLFECTCSLCESEV---DHESETSSDDSEMSED 391


>gi|350403683|ref|XP_003486875.1| PREDICTED: hypothetical protein LOC100747756 [Bombus impatiens]
          Length = 814

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 120/301 (39%), Gaps = 63/301 (20%)

Query: 10  VAEIEGRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSS--FCHNCFR---KTMHSS 64
           VA  E  GR L + ++L AG+++LR+ PI    A+  I SS    C  C R   K     
Sbjct: 17  VAYSEKLGRYLQAAKNLTAGEVILREEPI----AVGPITSSKDPACFACLRLLPKIKKEL 72

Query: 65  SSICPSCSVAFCSPKCSTA--AASSSHSPYVCQALTRLLNLNSPDAANL----------- 111
             +C  C++A   P CSTA    S  H+P  C+   R  +L   +  N+           
Sbjct: 73  QYVCSKCNIA---PLCSTACEERSKHHTPDECEIFKRNKDLLINNTENIIGVLLPLRLWL 129

Query: 112 ------PLDRQLQARFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--LCP- 162
                  L +Q+++   + A+     N S ++          D + +  + + S  L P 
Sbjct: 130 LRQRDAELWKQVES---MEAHTDKRRNTSVWK----------DRELNVVNVIKSLHLVPD 176

Query: 163 --PPASTTIELTAALLAKDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNA 220
             P  S  ++L   +L    +N F L  P      G   +   G+Y++AS   HDC  N 
Sbjct: 177 GDPSVSELLQLLCGILD---VNCFELRSP-----GGLDGLLLRGLYVEASLMAHDCRGNT 228

Query: 221 CRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDR 280
                    A++N  + V     + +G EI  +Y         R++ L     F C+C  
Sbjct: 229 ------HLTADDNFQLTVYASLPIKEGDEIYFNYTSSLLGTLGRREHLRGGKYFECECPL 282

Query: 281 C 281
           C
Sbjct: 283 C 283


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 66/274 (24%), Positives = 98/274 (35%), Gaps = 29/274 (10%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + Q  + G ++    P   Y  L      + C  CF  T     S C  C  A
Sbjct: 16  GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71

Query: 75  F-CSPKCSTAAASSSH---SPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAV 130
           F C+ +C            SP V        +      A +   +++      +   LAV
Sbjct: 72  FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131

Query: 131 VN-PSQFQILLAFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLAKDRLNAFGLM 187
               S    L   +  +  +D +A H+  S  L  P   + +     L A+   N F + 
Sbjct: 132 KEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDNDSLV----VLFAQVNCNGFTI- 186

Query: 188 EPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQG 247
                 +D   S     I+   +  NH C PN     Y    AE      VR + ++  G
Sbjct: 187 ------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------VRAVQEIKPG 233

Query: 248 REICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
            E+  SY  + Y    R  RL   Y F C+C  C
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267


>gi|302817792|ref|XP_002990571.1| hypothetical protein SELMODRAFT_428981 [Selaginella moellendorffii]
 gi|300141739|gb|EFJ08448.1| hypothetical protein SELMODRAFT_428981 [Selaginella moellendorffii]
          Length = 574

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 8/124 (6%)

Query: 184 FGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHD 243
           +G     + G + RR   + G+++  +F NH C P+A +   V +A+      +VR   D
Sbjct: 419 YGYKFERVIGLERRREFLS-GLWILTAFANHSCCPSATQ-KAVGSAS------LVRAARD 470

Query: 244 VPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEVMDE 303
           +  G E+ L+Y      +  R ++    + F C C+RC  E+     + D  N+   M++
Sbjct: 471 LKAGDEVTLAYLDPFMPWDVRSRQTKSRWGFECACERCHFESKLHSENRDLVNSGIKMEK 530

Query: 304 DQDE 307
             +E
Sbjct: 531 MLEE 534


>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 570

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 17/101 (16%)

Query: 213 NHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDY 272
           NH C PNA           N+  I +R +  + +  +I +SY     D  +RQ +L   Y
Sbjct: 221 NHSCKPNAI-------IVSNSRSISLRALRPIKKNEQIFISYIDPTEDLPSRQSKLKERY 273

Query: 273 DFACDCDRCKVEANWSDNDNDDEN----NEEVMDEDQDEQM 309
            F C CD C+       N+N  E      EE+ +E  D++M
Sbjct: 274 FFTCKCDSCE------KNENPYEAFLTYQEEIGEETADKRM 308


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 98/284 (34%), Gaps = 49/284 (17%)

Query: 15  GRGRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSFCHNCFRKTMHSSSSICPSCSVA 74
           G+GRGL + Q  + G ++    P   Y  L      + C  CF  T     S C  C  A
Sbjct: 16  GKGRGLRALQPFQVGDLLF-SCPAYAY-VLTVNERGNHCEYCF--TRKEGLSKCGRCKQA 71

Query: 75  F-CSPKCST-------------AAASSSHSPYVCQALT-RLLNLNSPDAANLPLDRQLQA 119
           F C+ +C                    + +P     LT R+L          P ++ L  
Sbjct: 72  FYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPSEKLLAV 131

Query: 120 RFLIAAYNLAVVNPSQFQILLAFQGTVTDNDTSAAHYLSS--LCPPPASTTIELTAALLA 177
           +             S    L   +  +  +D +A H+  S  L  P   + +     L A
Sbjct: 132 KEF----------ESHLDKLDNEKKDLIQSDIAALHHFYSKHLGFPDNDSLV----VLFA 177

Query: 178 KDRLNAFGLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDII 237
           +   N F +       +D   S     I+   +  NH C PN     Y    AE      
Sbjct: 178 QVNCNGFTI-------EDEELSHLGSAIFPDVALMNHSCCPNVI-VTYKGTLAE------ 223

Query: 238 VRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           VR + ++  G E+  SY  + Y    R  RL   Y F C+C  C
Sbjct: 224 VRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267


>gi|401406610|ref|XP_003882754.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325117170|emb|CBZ52722.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 535

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 2/101 (1%)

Query: 200 VRAYGIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNY 259
           +R  G+Y   +  NH C PN C          ++T + V  +  V  G E+ +SY     
Sbjct: 437 IRGGGVYTLHACVNHSCEPN-CAVSSWGPEGGDST-LTVTTVKAVEAGEELTISYVDEAL 494

Query: 260 DYSTRQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
               R++ L   + FAC C RC  E+       +D    E 
Sbjct: 495 PVRRRRQLLETTFGFACTCPRCMRESASDPTTGEDAREPET 535


>gi|409040050|gb|EKM49538.1| hypothetical protein PHACADRAFT_214100 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 461

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 40/84 (47%), Gaps = 3/84 (3%)

Query: 202 AYGIY-LKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYD 260
           A+G++ L +  FNH C+PN C   YV  + E     IV  + D+  G E+ + Y      
Sbjct: 201 AHGVFPLASRLFNHSCVPN-CASKYVITSTEMMGMEIV-ALRDIEFGDELTIPYLDPALP 258

Query: 261 YSTRQKRLLYDYDFACDCDRCKVE 284
           +  RQ  L   Y F C+C  C  +
Sbjct: 259 FDIRQNTLQESYGFTCNCSLCNFQ 282


>gi|320032992|gb|EFW14942.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 358

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+ +A+  NH C  NA      +    N + + +    D+ +G EI +SY      ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
           RQ  L   + F C C+ C + A   +N   D   EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224


>gi|195353810|ref|XP_002043396.1| GM16483 [Drosophila sechellia]
 gi|194127519|gb|EDW49562.1| GM16483 [Drosophila sechellia]
          Length = 486

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 107/289 (37%), Gaps = 45/289 (15%)

Query: 17  GRGLVSTQSLKAGQIVLRDSPILLYSALPFINSSSF-----CHNCFRKTMHSSSSICPSC 71
           GR L  TQ + AGQIV  + P+++       ++        C  C+  +       C  C
Sbjct: 33  GRYLKVTQDIAAGQIVFIEEPLVVGPKWYLSDADKEASIVPCVGCYAPSRLGKHQ-CRRC 91

Query: 72  SVAFCSPKCSTAAASSSHSPYVCQALTRLLNLNSPDAANLPLDRQLQARFLIAAYNLAVV 131
               CS  C        H P  C  L+  L   SP  A+    R L   F   A  +   
Sbjct: 92  RWPVCSAGCE-------HEPMECSVLS--LGSGSPTRAD---TRSLNDYFRGDALLVLKC 139

Query: 132 ------NPSQFQILLAFQGTVTDNDTSAAH---------YLSS--LCPPPASTTIELTAA 174
                 +P+++  LL  Q    +   +  H         YL    LC    +    LT  
Sbjct: 140 LLLQRQSPTKWSALLEMQSHEEERKGTELHEEAEKRVVSYLQKRFLCRLKQTNPSLLTDC 199

Query: 175 LLAKDRLNAF-GLMEPYIEGQDGRRSVRAYGIYLKASFFNHDCLPNACRFDYVDAAAENN 233
               + L+   G++E      +    V   G++ +A    H C PN C F +     +N 
Sbjct: 200 --GPEMLHRLCGIIETNFMVIELPSGVELSGLFRQACMMEHACQPN-CDFQF-----DNK 251

Query: 234 TD-IIVRMIHDVPQGREICLSYFPVNYDYSTRQKRLLYDYDFACDCDRC 281
           T  + VR   D+ +G  + ++Y  + +    RQ  L     F+C C+RC
Sbjct: 252 TQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLRLTKHFSCHCNRC 300


>gi|46117750|ref|XP_384827.1| hypothetical protein FG04651.1 [Gibberella zeae PH-1]
          Length = 812

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 12/82 (14%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDI-IVRMIHDVPQGREICLSYFPVNYDYS 262
           G++L A+  NH C+ N CR  ++        DI IVR   D+P G E+   Y P   D  
Sbjct: 567 GVWLLAARINHSCVGN-CRRSFIG-------DIQIVRAARDIPAGTELTFPYCPTG-DSE 617

Query: 263 TRQ--KRLLYDYDFACDCDRCK 282
           T Q  +  L  + F CDC+ CK
Sbjct: 618 TYQDVQNKLAKWGFTCDCELCK 639


>gi|303322581|ref|XP_003071282.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110984|gb|EER29137.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 367

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+ +A+  NH C  NA      +    N + + +    D+ +G EI +SY      ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
           RQ  L   + F C C+ C + A   +N   D   EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224


>gi|119190077|ref|XP_001245645.1| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
 gi|392868546|gb|EAS34345.2| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
          Length = 367

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 7/97 (7%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
           GI+ +A+  NH C  NA      +    N + + +    D+ +G EI +SY      ++T
Sbjct: 135 GIFPRAARINHSCKQNA-----QNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNT 189

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSDNDNDDENNEEV 300
           RQ  L   + F C C+ C + A   +N   D   EE+
Sbjct: 190 RQLCLEEAFGFVCQCELCSLSA--EENKKRDNRLEEM 224


>gi|403366787|gb|EJY83200.1| TPR repeat-containing protein [Oxytricha trifallax]
          Length = 760

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 7/86 (8%)

Query: 204 GIYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYST 263
            +++ +SFFNHDC  N  R+   D        + V    D+ +G EI   Y P+   Y  
Sbjct: 556 ALWVISSFFNHDCYGNCSRYSIGDV-------LFVVANRDIQEGEEITQQYMPLMCTYEE 608

Query: 264 RQKRLLYDYDFACDCDRCKVEANWSD 289
           R K     ++F C C  C+   + S+
Sbjct: 609 RVKTTELAWEFRCQCSSCQTYKSLSE 634


>gi|403158231|ref|XP_003307547.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163733|gb|EFP74541.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 552

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 39/82 (47%), Gaps = 8/82 (9%)

Query: 205 IYLKASFFNHDCLPNACRFDYVDAAAENNTDIIVRMIHDVPQGREICLSYFPVNYDYS-- 262
           IYL AS FNH C PN   F   +        + +R    +  G+E+ +SY     D +  
Sbjct: 382 IYLNASLFNHACRPNLFGFLNYETQL-----LYLRAFEPISMGQELTISYLSQEMDATDL 436

Query: 263 -TRQKRLLYDYDFACDCDRCKV 283
            +R+++L   Y F C C  C++
Sbjct: 437 ESRRQKLQGSYGFLCTCSHCQM 458


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.133    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,981,338,084
Number of Sequences: 23463169
Number of extensions: 249785594
Number of successful extensions: 1247467
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 1665
Number of HSP's that attempted gapping in prelim test: 1243080
Number of HSP's gapped (non-prelim): 3942
length of query: 381
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 237
effective length of database: 8,980,499,031
effective search space: 2128378270347
effective search space used: 2128378270347
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)